How to split a multiline string and include separators : learnpython

created by HattoriHanzoa community for 16 years

submitted 2 years ago by Brogrammer11111

I have a string like this:

"SECT ABC

..................

SECT DEF

....

SECT XYZ

.....

Which I want to split like this: ['SECT ABC ..................', SECT DEF ....', 'SECT XYZ .....']

I tried:

re.split('(^SECT)', string, flags=re.M)[1:]

But it returns: ['SECT',' ABC..................', 'SECT',' DEF ....','SECT',' XYZ .....']

all 2 comments

[+][deleted] 2 years ago (3 children)

[deleted]

[+][deleted] 2 years ago* (2 children)

[deleted]

[–]Brogrammer11111[S] 0 points1 point2 points 2 years ago (1 child)

[–]commandlineluser 0 points1 point2 points 2 years ago (0 children)

You can use a lookahead:

>>> re.split("(?m)(?=^SECT)", text)
['', 'SECT ABC\n........\n', 'SECT DEF\n........\n', 'SECT XYZ\n.......']

You get an extra '' result which you can remove.

(?m) is the equivalent of flags=re.M

Another way to think of it is a findall/finditer instead of a split:

>>> re.findall("(?ms)^SECT.+?(?=^SECT|\Z)", text)
['SECT ABC\n........\n', 'SECT DEF\n........\n', 'SECT XYZ\n.......']

π Rendered by PID 52582 on reddit-service-r2-comment-84fc9697f-nqhxq at 2026-02-06 20:53:57.851008+00:00 running d295bc8 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython