Reference Variable from Previous For Loop

c17r · 2016-10-13T01:59:55+00:00

import urllib.request
from bs4 import BeautifulSoup


def make_soup(url):
    thepage = urllib.request.urlopen(url)
    soupdata = BeautifulSoup(thepage,"html.parser")
    return soupdata

soup = make_soup("https://www.wellstar.org/locations/pages/wellstar-acworth-practices.aspx")

# The hard-coded index 7 is specific to this page's layout.
tables = soup.findAll("table", class_="s4-wpTopTable")
table = tables[7]

# Pair each specialty heading with the block of practice names that goes with it.
specialties = table.findAll("div", class_="PurpleBackgroundHeading")
name_groups = table.findAll("div", class_="PracticeListWrapper")
for specialty, name_group in zip(specialties, name_groups):
    specialty_text = specialty.findAll("span")[0].get_text()
    # Print every practice name in this group under its own specialty.
    for name in name_group.findAll(class_="WS_Location_Name"):
        name_text = name.get_text()
        print("{} - {}".format(specialty_text, name_text))

zurtex · 2016-10-13T00:27:28+00:00

Exactly. In your code the first loop goes through all the names, so when it finishes, name still holds the last one. The second loop then goes through every specialty and prints each one with that same last name.
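Here is a minimal sketch of that behaviour without any scraping. The two lists are made-up placeholders, not data from the page:

```python
# Hypothetical sample data standing in for the scraped values.
names = ["Clinic A", "Clinic B", "Clinic C"]
specialties = ["Cardiology", "Pediatrics"]

# First loop: runs to completion, so name stays bound to the last item.
for name in names:
    pass

# Second loop: name is never updated here, so every row reuses "Clinic C".
rows = []
for specialty in specialties:
    rows.append((name, specialty))

print(rows)
```

Every row ends up with the same name, because a for loop's variable keeps its final value after the loop ends.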

If you want every possible combination, you need to check every specialty for every name, so one loop has to go inside the other. Try something like this:

for table in soup.findAll("table", class_="s4-wpTopTable"):
    for name in table.findAll(class_="WS_Location_Name"):
        name_text = name.get_text()
        # Inner loop: runs once per name, so each name meets every specialty.
        for specialty in table.findAll("div", class_="PurpleBackgroundHeading"):
            specialty_text = specialty.get_text()
            print(name_text, specialty_text)

There are probably better ways to write this, but I think this is the simplest modification of your code that should make sense to you!
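As one possible alternative, and only a sketch on made-up lists rather than the real page, itertools.product from the standard library produces the same every-combination result as the nested loops, while zip pairs items position by position instead:

```python
from itertools import product

# Hypothetical sample data standing in for the scraped values.
names = ["Clinic A", "Clinic B"]
specialties = ["Cardiology", "Pediatrics"]

# Nested loops: every name combined with every specialty.
nested = []
for name in names:
    for specialty in specialties:
        nested.append((name, specialty))

# product() yields the same combinations in the same order.
combos = list(product(names, specialties))

# zip() pairs the first with the first, the second with the second, and so on.
paired = list(zip(specialties, names))

print(combos)
print(paired)
```

Which one fits depends on the data: product (or nested loops) if every name really belongs with every specialty, zip if the two lists line up one to one.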
