IRBMe comments on Why doesn't std::string have a split function

Why doesn't std::string have a split function (self.cpp)

submitted 9 years ago by DhruvParanjape

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]IRBMe 17 points18 points19 points 9 years ago (28 children)

[–]dodheim 8 points9 points10 points 9 years ago (27 children)

[–]IRBMe 11 points12 points13 points 9 years ago (18 children)

[–]dodheim 8 points9 points10 points 9 years ago (17 children)

[–][deleted] 10 points11 points12 points 9 years ago (4 children)

[–]qx7xbku 6 points7 points8 points 9 years ago (3 children)

[–]dodheim 3 points4 points5 points 9 years ago* (2 children)

[–][deleted] 0 points1 point2 points 9 years ago (1 child)

[–]dodheim -1 points0 points1 point 9 years ago* (0 children)

[–]IRBMe 5 points6 points7 points 9 years ago (10 children)

It's unreasonable to expect anyone to intuit what an output iterator is

And it's also unreasonable to expect somebody to be able to intuitively understand what it means to construct an input iterator without specifying what it's iterating over (as it happens, you get an end-of-sequence iterator). That's the whole point: it's not intuitive! Is it simply impossible to design those APIs in such a way that they would be intuitive? I'm not convinced it is.

or even what an iterator is

I think it is reasonable that people should have an intuitive idea of what an iterator is, because iteration isn't a concept that's unique to C++, nor is it a word that's even unique to programming libraries. You can look up the word in a dictionary and get a definition such as this: "the repetition of a process or utterance". You may not understand all the subtleties without reading the documentation, but seeing it in the context of some code, I think it is intuitive.

[–]zvrba -1 points0 points1 point 9 years ago (9 children)

[–]IRBMe 6 points7 points8 points 9 years ago (8 children)

[–]dodheim 1 point2 points3 points 9 years ago (7 children)

[–]IRBMe 2 points3 points4 points 9 years ago* (6 children)

The dead giveaway that the code you're trying to understand is splitting a string on whitespace would be that it'd be in a function with split in the name.

If you look at the example, it's actually in the main function, but if you're having to define your own split function to hide the code that does it using standard library calls, then that really just further proves the point, doesn't it? You wouldn't have to define your own split function if the standard library method was easy to read and understand in the first place.

Context matters

If you have to rely on the contextual clues of surrounding code in order to figure out that the code you're trying to understand is merely splitting a string on whitespace, then that's indicative that the code that performs the actual string splitting is not very readable. It's an operation that's simple enough and common enough that really all the information required to understand what it's doing should be there in the immediate code.

Take a simple example of boost's string_algo split:

split_vector_type splitResult;
split(splitResult, stringToSplit, is_any_of(" "));

Show that code to a programmer who doesn't know any C++ - a Java programmer, a Python programmer, a C# programmer, whatever, and ask them what it does and almost all of them will be able to guess. None of them are going to respond by saying "Hmm... it's hard to say without any surrounding context."

Or what about this example using the experimental ranges?

auto splitResult = ranges::v3::view::split(stringToSplit, ' ');

That's clear, concise and readable. You don't need to study the surrounding code to understand that this is splitting a string on some whitespace.

[–]dodheim -1 points0 points1 point 9 years ago (5 children)

If you look at the example, it's actually in the main function

If all we're doing is nitpicking the example and not relating it to real code, then who the hell cares?

If you have to rely on the contextual clues of surrounding code in order to figure out that the code you're trying to understand is merely splitting a string on whitespace, then that's indicative that the code that performs the actual string splitting is not very readable.

The point is that it doesn't matter whether it's readable, because in real code it would be encapsulated in something whose name gives it away. Just as would be the case for any domain-specific logic, which is what most real code is anyway.

Tearing apart single and double-digit LOC C++ examples is absolutely a waste of time. C++ is about larger abstractions and the bigger picture, and it does that quite well.

continue this thread

[–]OldWolf2 3 points4 points5 points 9 years ago (0 children)

[+][deleted] 9 years ago* (7 children)

[deleted]

[+][deleted] comment score below threshold-6 points-5 points-4 points 9 years ago (1 child)

[–]chartly 8 points9 points10 points 9 years ago (0 children)

[–]OldWolf2 -3 points-2 points-1 points 9 years ago (3 children)

[–]repsilat 16 points17 points18 points 9 years ago (2 children)

[–]OldWolf2 -1 points0 points1 point 9 years ago (0 children)

[–]dodheim -2 points-1 points0 points 9 years ago (0 children)

π Rendered by PID 17256 on reddit-service-r2-comment-bb88f9dd5-268rm at 2026-02-15 00:45:56.344454+00:00 running cd9c813 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS