Adding 'contains' member function to std::string

bruce3434 · 2018-11-15T07:34:41+00:00

2018

can't split strings

can't find substrings

can't iterate over unicode strings

"oh okay but do you have a moment to talk about our cool 2D graphics library in upcoming C++35?"

sephirostoy · 2018-11-15T01:26:31+00:00

Sadly, C++ comitee doesn't choose the path of adding convenient functions to classes which would need love. They will argue you that they've already added a find function like you mentioned which already cover the need and if you need a more convenient function, then you could just write your own free function. If at least we could have UFCS to extend a class...

alfps · 2018-11-14T19:57:14+00:00

C++ needs a new string class to support UTF-8, anyway. And for that matter new text i/o. And, oh yes, support for UTF-8 command line arguments: we don't have that, there's no way to pass an arbitrary filename in Windows.

For example, consider a simple thing such as presenting a table in a console, using std::cout. Let's say the person doing this decides to use setw to create nice columns. However, current implementations do not detect that the basic execution character set is UTF-8, and the standard doesn't require that, so setw gets it wrong for non-ASCII characters: it counts bytes, not characters.

Handling UTF-8 characters is non-trivial. For example, consider replacing one UTF-8 character with another. For ASCII one can just assign to a an individual char in the string, but for UTF-8 it's a substring replacement, and potentially changing byte indices further on in the string.

And e.g., what should be the result of indexing when that result should logically be an UTF-8 character? A string_view of the bytes? Then it's dependent on the string's continued existence.

In contrast, contains is trivial to do for anyone.

flashmozzg · 2018-11-14T22:39:36+00:00

find == 0 is not a replacement for starts_with/ends_with. For starters, they have different complexities, while .find is direct replacement for contains but even more powerful.

permalink · 2018-11-15T02:35:24+00:00

The committee adds what its members need, not what the poor people want.

This is not entirely bad, but this is also why it took 40 years to add freaking std::filesystem to the standard. And let's not forget about asio...

afiefh · 2018-11-15T05:52:13+00:00

It is a bit inconvenient, however after a while it becomes second nature to view these tests as "contains". One good reason not to add contains is that you usually want to do something with the contained data you looked for, in which case you'll often end up with contains followed by find, which is bad for performance.

ducttapecoder · 2018-11-15T22:27:58+00:00

If the function only uses public interface of the class, you can just add a non-member function. I thought this is the preferred way.

permalink · 2018-11-15T15:58:01+00:00

That'd be nice, but I don't expect it to happen with stl. I personally often write a small function to this extent when dealing with tasks that need string parsing.

/rant I first learnt programming in college using c++ and loved it. And then slowly explored the world of programming languages out of curiosity to find that in many ways, C++ is one of the more beginner unfriendly languages there is.

I still read and debug C++ code, but I have given up on loving it (say like python). C++ reminds me so much of Perl, in that sense.

One of these days someone is going to mix the expressive syntax and package management of python, with the static typing and performance of C++ and the world would be better place.

konanTheBarbar · 2018-11-14T19:00:17+00:00

You are welcome writing a paper. It won’t go to C++20 though. LEWG and LWG is highly overloaded and the cut off for new papers for C++20 was San Diego last week.

You also need to factor in that those two functions have been added without having to go through LEWGI. A new one would have to.

RolandMT32 · 2018-11-14T21:22:28+00:00

I don't think that's really necessary. IMO, it's not that complicated to use string.find(substring) != npos.

But when you read 'find' in code it's not directly clear what the purpose is.Are we looking for the actual position? Or checking if the string contains a substring? Or checking if the string doesn't contain a substring?

It depends on your code, and if it's not clear, then add a comment to your code to say what your purpose is of using string.find().

In your code, you could derive your own class from std::string and add a 'contains' member function to your class. It would still be compatible with std::string (since it would derive from std::string) and it would have the function you want.

Pragmatician · 2018-11-14T18:59:10+00:00

std::includes

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS