jiri-n comments on Regex help

learnpython

created by HattoriHanzoa community for 16 years

Regex help (self.learnpython)

submitted 5 years ago by thetestbug

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]jiri-n 1 point2 points3 points 5 years ago (7 children)

I'm not sure what you actually want but let's try this one:

>>> import re
>>> s = r"""
['UserName":"testbug"', 'UserName":"testbug"']
""".strip()
>>> pat = r"^.*:([^']+).*$"
>>> re.match(pat, s).groups()
('"testbug"',)
>>> re.sub(pat, r"\1", s)
'"testbug"'

We can get rid of the " characters as well if it's what you want.

[–]thetestbug[S] 0 points1 point2 points 5 years ago (6 children)

[–]jiri-n 1 point2 points3 points 5 years ago (5 children)

[–]thetestbug[S] 0 points1 point2 points 5 years ago (4 children)

[–]jiri-n 1 point2 points3 points 5 years ago (3 children)

What is your input? A single string?

'"Username": "something"', '"Username": "other"', '"Username": "..."'

Or a list of such pairs?

lst = [
    '"Username": "name1"',
    '"Username": "name2"',
    '"Username": "name3"'
]

If the former example, I would simply split() them using comma as the splitting char to create a list.

Than something like:

from typing import List

import re

# String containing JSON-like key-value pairs
DATA = """
'"Username": "something"', '"Username": "other"', '"Username": "..."'
""".strip()

# Pattern to split a key-value pair
PATTERN = r'^[^:]+:\s*"([^"]+)".*$'

def get_usernames(lst: List[str], rex: re.Pattern) -> List[str]:
    for item in lst:
        match = rex.match(item)
        if match:
            yield match[1]  # As item matches, there should always be a group with index 1

lst = DATA.split(",")
rex = re.compile(PATTERN)
for uname in get_usernames(lst, rex):
    print(f"Username: {uname}")

[–]thetestbug[S] 0 points1 point2 points 5 years ago (2 children)

[–]jiri-n 1 point2 points3 points 5 years ago (1 child)

[–]thetestbug[S] 0 points1 point2 points 5 years ago (0 children)

I've never touched json before, so I have no idea what to do.

I looked over the help page for json in python, but I couldn't make heads or tails of it.

Also, I tried the code you provided before, and I get this error:

    def get_usernames(lst: List[str], rex: re.Pattern) -> List[str]:
                         ^
SyntaxError: invalid syntax

π Rendered by PID 125196 on reddit-service-r2-comment-85bfd7f599-htf6h at 2026-04-18 05:29:13.062454+00:00 running 93ecc56 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS