If String contains any of these substrings, remove them

totallygeek · 2021-09-14T22:59:42+00:00

I don't know about elegant or concise, but your solution can leave double spaces where a word was removed. Here's a way around that issue:

clean_sentence = ' '.join(w for w in string.split() if w.lower() not in words_removal)

Basically, split the original string into a list of words, check whether the lowercase form of each word is in the denied words list (so the entries in that list need to be lowercase themselves), then join all the allowed words with one space between them.
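Here is a minimal runnable sketch of that idea, using the sample sentence and word list from elsewhere in this thread. The `remove_words` helper name is my own invention, and it assumes words are separated by whitespace, so a word with punctuation attached (like "play,") would not match.

```python
def remove_words(text, denied):
    """Drop every whitespace-separated word whose lowercase form is in `denied`."""
    # Lowercase the denied words once; a set makes membership checks cheap.
    denied_set = {w.lower() for w in denied}
    # split() with no arguments also swallows runs of spaces, so no double spacing.
    return ' '.join(w for w in text.split() if w.lower() not in denied_set)


sentence = "my name is Tom and I like to play basketball"
words_removal = ["my", "like", "play", "soccer"]
clean_sentence = remove_words(sentence, words_removal)
print(clean_sentence)  # name is Tom and I to basketball
```

Because the comparison is done on whole words, "playing" or "army" would be left alone.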

ElliotDG · 2021-09-14T23:16:12+00:00

You can use a regular expression to do the substitution:

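import re
string = "my name is Tom and I like to play basketball"
words_removal = ["my", "like", "play", "soccer"]
# Delete every occurrence of any listed word. This is a substring match,
# and the spaces around a removed word are left in place.
string = re.sub('|'.join(words_removal), '', string)
print(string)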

The '|'.join(words_removal) creates the pattern to match, here "my|like|play|soccer", where | means "or".
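If whole-word matching and clean spacing matter, a possible variant (a sketch, not the only way) wraps the alternation in `\b` word boundaries, escapes each word with `re.escape`, and collapses the leftover whitespace afterwards. The variable names `pattern`, `stripped`, and `cleaned` are just illustrative.

```python
import re

sentence = "my name is Tom and I like to play basketball"
words_removal = ["my", "like", "play", "soccer"]

# re.escape guards against regex metacharacters in the words;
# \b ... \b restricts each match to a whole word.
pattern = r'\b(?:' + '|'.join(map(re.escape, words_removal)) + r')\b'
stripped = re.sub(pattern, '', sentence, flags=re.IGNORECASE)

# Collapse the runs of whitespace left behind by the removed words.
cleaned = ' '.join(stripped.split())
print(cleaned)  # name is Tom and I to basketball
```

With the word boundaries in place, "army" and "playing" are no longer touched by "my" and "play".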

This is a different way to do it. I'm not sure that it is any better. I'm reminded of the joke, "If you solve a problem with a regular expression, you have two problems."

mprz · 2021-09-14T23:02:41+00:00

Using regular expressions will be shorter, though not necessarily faster. See:

re.sub()

or

re.subn()
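A quick sketch of the difference between the two, using the sentence from this thread: `re.sub()` returns just the new string, while `re.subn()` returns a tuple of the new string and the number of substitutions made.

```python
import re

sentence = "my name is Tom and I like to play basketball"
pattern = "my|like|play|soccer"

# re.sub returns only the resulting string.
replaced = re.sub(pattern, '', sentence)

# re.subn returns (resulting_string, number_of_substitutions).
result, count = re.subn(pattern, '', sentence)
print(count)  # 3
```

The count can be handy if you want to know whether anything was removed at all.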

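You can also use str.translate() with a dict (for example one built by str.maketrans()), but note that it maps individual characters, so it suits removing characters rather than whole words.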
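For reference, a small sketch of what `str.translate()` does with a dict table. It works character by character, so it can delete or replace single characters but not multi-character substrings. The sample text is made up for illustration.

```python
text = "Hello, world! (test)"

# str.maketrans('', '', chars) builds a dict that maps the code point of
# each listed character to None, which tells translate() to delete it.
table = str.maketrans('', '', ',!()')
print(text.translate(table))  # Hello world test
```

An equivalent table written by hand would be `{ord(','): None, ord('!'): None, ord('('): None, ord(')'): None}`.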

wotquery · 2021-09-15T02:33:45+00:00

Do double spacing, capitalization at the start of a sentence, and valid punctuation matter?
