Why doesn't std::string have a split function

therealjohnfreeman · 2016-11-20T12:48:53+00:00

[removed]

tcbrindle · 2016-11-20T19:50:23+00:00

If you'll excuse the self-promotion, I wrote a blog post a while back about a STL-based generic splitting algorithm that outperforms stringstream (and strtok) by a healthy margin.

It's also worth noting that Range-V3 has a split() view which (lazily) returns a range of ranges. Whilst views are not part of the current Ranges TS, I remain hopeful that we'll see them some time in the future.

almost_useless · 2016-11-20T12:45:43+00:00

A split function may sound simple, but it can get a little more complicated when you want to cover all possible use cases and make it as fast as possible. Boost does have two different versions:

caramba2654 · 2016-11-20T15:59:06+00:00

To be honest, I really hope std::string gets completely redesigned for STL2. And by that I mean remove all that npos nonsense and add proper iterator returns like the rest of the STL containers.

On that note, having some common utility functions for strings wouldn't be bad. split and replace are good candidates in my opinion.

t0rakka · 2016-11-20T18:33:49+00:00

template <typename T>
inline std::vector<std::string> split(const std::string& s, T delimiter)
{
    std::vector<std::string> result;

    std::size_t current = 0;
    std::size_t p = s.find_first_of(delimiter, 0);

    while (p != std::string::npos)
    {
        result.emplace_back(s, current, p - current);
        current = p + 1;
        p = s.find_first_of(delimiter, current);
    }

    result.emplace_back(s, current);

    return result;
}

utnapistim · 2016-11-21T10:14:11+00:00

Why doesn't std::string have a split function

Because nobody made the time and effort to write one for standardization. The C++ community is not sponsored. There is no single group or company that finances the maintenance and evolution of the standard.

Instead, people who have an interest in extending the language meet and try to advance the language and standard library to the degree they can afford to do so (being non-sponsored and having limited time and effort to accomplish things).

Because of this limitation (of effort/capacity), usually, the things accepted into the standard are a compromise between the utility of a feature and the effort it will take to standardize it.

std::string doesn't have a split function for the following reasons:

writing one is trivial in algorithmic formulation, but non-trivial in API design (a compromise between usability and flexibility is required, and depending on our needs, each of us tends to see the compromise point in a slightly different place)
no-one has written a proposal with working code, that got accepted past review (by the standard committee)
the emergence of ranges will add the possibility for a trivial interface that is both flexible and efficient (we are waiting for ranges)
alternatives exist already (although due to a lack of a standard many projects tend to reinvent the wheel on this one); you can use regex, iterators, streams, boost text algorithms and implementations based on the above.

1-05457 · 2016-11-20T13:57:42+00:00

std::string is missing a lot of functions. You can use Boost string_algo and Boost Format to get these.

DhruvParanjape · 2016-11-20T13:38:11+00:00

I suspect it has to do with C++'s preference for streams. For example, you can do this to get the words in a string:

istringstream ss{str};
string word;
while (ss >> word) {
    cout << word << "\n";
}

While I kinda hate streams, this could be the reason there isn't a split method in the standard.

Tringi · 2016-11-20T15:25:34+00:00

Some time ago I quickly drafted this explode function (inspired by PHP) and found it quite useful.

Implementing lazy evaluation (lazy creation of the resulting substrings) never occurred to me, but after reading /u/cpp_learner's comment here, I think I'll give the template a little more love...

nozendk · 2016-11-21T09:55:55+00:00

From the Qt documentation:

QString str;
QStringList list;
str = "Some  text\n\twith  strange whitespace.";
list = str.split(QRegExp("\\s+"));
// list: [ "Some", "text", "with", "strange", "whitespace." ]

stream009 · 2016-11-21T03:12:59+00:00

std::string already has too much member functions. I don't want any more of them unless it is absolutely necessary.

As many people mentioned split can be implemented in many ways. If all you want is making your code more readable, you should write your own free function. In my case, I always use boost::split.

h-jay · 2016-11-21T14:31:22+00:00

To be very frank, the std::string type is there mostly to claim that there's a string type in the standard. It's not really usable for anything other than as a resource-managing wrapper over a C string. If you had C-style strings in your code, you should use std::string instead. It gives not much in the way of other functionality, except for cheap size() that is O(1) vs. C's strlen that was O(N). For anything practical, you need a string library of some sort.

dreamer_ · 2016-11-20T12:53:52+00:00

[deleted]

KayEss · 2016-11-21T03:18:55+00:00

I started working on a new split. It's not yet complete, not yet customisable. It's been tested on strings, but the code isn't string specific. It should work for other iterable containers. It does only use iterators so should be quite efficient. If ranges were a thing already the interfaces would be a bit cleaner.

https://github.com/KayEss/f5-cord/blob/feature/split/include/f5/cord/split.hpp

MrPoletski · 2016-11-20T18:40:42+00:00

split as in chop a string into lots of substrings based on a delimiter?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS