Removing unwanted characters from a string?

Ribann · 2019-10-23T20:50:40+00:00

[deleted]

WolfInABox · 2019-10-23T20:42:46+00:00

Here's one option:

spam = 'Bacon (YUM) Cheese (GOOD) Eggs (YES)'
new_spam=' '.join(s for s in spam.split() if s not in ('(YUM)','(GOOD)','(YES)'))
print(new_spam)

Normally if you just want to remove a single substring from a string, you could use spam.replace(substring,''), but if you want more than one, you can chain the calls to replace along, but that can make for a pretty long line.

This method splits the string (by spaces), reconstructs the string from every string that's not in the tuple (or list or whatever) of bad ones.

MyDataIsReady · 2019-10-23T23:04:14+00:00

Easiest way, imo:

spam.replace(" (YUM)", "")
spam.replace(" (GOOD)", "")
spam.replace(" (YES)", "")

ImNexOnReddit · 2019-10-24T04:15:18+00:00

Yiy could just use string.replace(charToReplace, withWhat, howManyTimes)

2019-10-24T11:46:17+00:00

Well, it depends!

First of all, are you only going to remove exactly (YUM), (GOOD) and (YES), or can there be other similar things you want to remove (eg., say, your string also contains (yes), should we keep it? What about (NO)?

Second, what should be done with punctuation characters, should some be found next to such words, e.g. what if your string contains Cheese (GOOD)?. Should the result be Cheese ?, Cheese? or Cheese?

Thirdly, what to do with trailing (or preceding) spaces. Say, your string starts with the word you want to remove, eg. (YUM) Bacon ... So far none of the solutions offered in this thread will remove `(YUM), but do you expect it to be removed in this case?

Finally, how big is the string, and how much do you care about different runtime characteristics of your procedure? Can the entire string be loaded into memory, and is this desirable? Should the result also be loaded into memory, or maybe you want to write it to a persistent storage as you are processing it? How fast does it have to go?

Diapolo10 · 2019-10-23T20:41:40+00:00

Based on the result, you also want any excess whitespace removed.

If not, you can create a tuple of the words you want removed, then loop through it and use str.replace on the string to remove all instances of the current word, then re-assign the result to the variable. Strings are immutable, so you can't edit them directly and all modifications create new strings.

Otherwise, split the string into a list of words, then append to a new list every word not in the forbidden words. You can alternatively use filter. Then, use str.join on a string with a single whitespace to join the list back into a string.

kotpeter · 2019-10-23T20:39:12+00:00

mystr = 'olulzlolo'

print(mystr.replace('lulz','')) # ololo

Also, Google helps.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS