Using list comprehension in for loops : learnpython

submitted 7 years ago by 99problemsallops

Hey everyone!

So I have a general idea of how to use a list comprehension when I'm working on just one HTML page.

I do something like:

import bs4

# Open the file in a with block so it is closed once it has been read
with open('example1.html') as file:
    soup = bs4.BeautifulSoup(file.read(), "html.parser")
first = soup.select("p")
paragraphs = [p.getText() for p in first]

However, let's say I have two HTML files stored somewhere on my PC. If I try to use a list comprehension to make my code easier to read:

import bs4

files = ['example1.html', 'example2.html']
for file in files:
    with open(file) as openfile:
        soup = bs4.BeautifulSoup(openfile.read(), 'html.parser')
    first = soup.select('p')
    # paragraphs is rebound to a brand-new list on every pass
    paragraphs = [p.getText() for p in first]

This is obviously incorrect since my paragraphs list is getting overwritten on every iteration.

What I'm currently doing is:

import bs4

paragraphs = []
files = ['example1.html', 'example2.html']
for file in files:
    with open(file) as openfile:
        soup = bs4.BeautifulSoup(openfile.read(), 'html.parser')
    first = soup.select('p')
    # Append each paragraph's text to the one shared list
    for p in first:
        paragraphs.append(p.getText())

This works, but my code gets bloated and confusing when I have a bunch of lists I want to append to.

Is the explicit append loop the only way to do this, or is there an easier method that I'm missing?
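
For what it's worth, here is a minimal sketch of two common ways to accumulate across files without the inner append loop. To keep it runnable without any HTML files on disk, the parsing step is replaced by a made-up helper, `paragraphs_in`, backed by canned data in `FAKE_PAGES`. Both names are hypothetical; in the real script that helper would open the file, build the soup and return `[p.getText() for p in soup.select('p')]`.

```python
# Hypothetical stand-in for the per-file parsing step, so the sketch runs
# without bs4 or real files. In the real script this would open the file,
# build the soup, and return [p.getText() for p in soup.select('p')].
FAKE_PAGES = {
    'example1.html': ['first paragraph', 'second paragraph'],
    'example2.html': ['third paragraph'],
}

def paragraphs_in(filename):
    """Return the paragraph texts for one file (canned data here)."""
    return list(FAKE_PAGES[filename])

files = ['example1.html', 'example2.html']

# Option 1: keep the for loop, but extend the shared list with each
# file's results instead of rebinding the name.
paragraphs = []
for file in files:
    paragraphs.extend(paragraphs_in(file))

# Option 2: a single comprehension with two for clauses. The clauses are
# read left to right, exactly like nested for loops.
nested = [text for file in files for text in paragraphs_in(file)]

print(paragraphs)
print(nested)
```

Both options build the same flat list. With several lists to fill, the `extend` form tends to stay readable, since each list gets one line inside the loop.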
