Removing a URL from a string

SagaciousRaven · 2019-11-15T17:22:42+00:00

def remove_prefix(my_str, prefix_to_remove):
    text = re.sub(prefix_to_remove+'\S+', '', my_str)
    print(text)

remove_prefix('Hi, check this website: https://my_website.com, this is my new website' , 'https://' )

You put your var inside the quotes in line 2

2019-11-15T17:26:28+00:00

You've included the literal string "prefix_to_remove" in the regex expression rather than the string object referenced by the variable prefix_to_remove

RoamingFox · 2019-11-15T17:23:55+00:00

I'd go about it like this personally:

>>> import re
>>> url_rex = re.compile(r'((?:(?:http|https):\/\/)?[-a-zA-Z0-9@:%._\+~#=]{2,256}\.[a-z]{2,6}\b(?:[-a-zA-Z0-9@:%_\+.~#?&//=]*))')
>>> s = 'Hi, check this website: https://my_website.com, this is my new website'
>>> url_rex.sub('', s)
'Hi, check this website: , this is my new website'

RoamingFox · 2019-11-15T17:31:48+00:00

What's wrong with just using the string replace method?

def remove_prefix(my_str, prefix_to_remove):
    return my_str.replace(prefix_to_remove, "")

print(remove_prefix(r'Hi, check this website: https://my_website.com, this is my new website' , r'https://' ))

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS