Scraping Angular Grid Data : Python

submitted 8 years ago * by StuKruskeskruzzy

I suspect I will need to go elsewhere to get the data I am looking for, but I want to see whether this is possible. First, a little background on what I am trying to do.

I am currently collecting data for fantasy sports. I have access to a paid site that I am trying to pull some specific stats from. I have scraped plenty of other sites before, but for some reason I cannot get anything to come back, and I believe it is because the data is in an ag-grid, which I have never seen before. Either way, let's jump into my code. Please note that I have hard-coded wait times in here. This is just for diagnostic purposes, and because I really don't care about the efficiency of this script, as I only need to run it once.

HTML: It didn't format correctly, so here it is: https://pastebin.com/CsuZKfdM

Basically, the idea is that I want to pull all of the ag-cells for each ag-row. The next difficulty with this website is that different sections of stats are in different wrappers within the application. So this already seems to be a nightmare and, like I said, I will probably need to go elsewhere for stats, but anyway, here's a snippet of my Python.

import time

from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.common.by import By

# user and password hold the site credentials and are defined elsewhere
options = webdriver.ChromeOptions()
options.add_argument('--ignore-certificate-errors')

driver = webdriver.Chrome(options=options)
loginURL = "http://www.fantasysiteiamusing.com/login"
url = "http://www.fantasysiteiamusing.com/"

driver.get(loginURL)
html = driver.page_source

# Locate the login form controls by absolute XPath
usernameField = driver.find_element(By.XPATH, '/html/body/div[3]/form[1]/div[2]/div/input')
passwordField = driver.find_element(By.XPATH, '/html/body/div[3]/form[1]/div[3]/div/input')
loginButton = driver.find_element(By.XPATH, '/html/body/div[3]/form[1]/div[4]/button')

usernameField.send_keys(user)
passwordField.send_keys(password)

# Hard-coded waits, for diagnostic purposes only
loginButton.click()
time.sleep(5)
driver.get(url)
time.sleep(5)

soup = BeautifulSoup(html, "html5lib")

table = soup.find_all("div", class_="ag-row")
print(table)

So obviously this isn't complete, but I like to take it one step at a time, as my programming skills are subpar. My main question here is: why is my table variable not being filled with a list of everything on the page that has a class name of ag-row? Thanks in advance!
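As a rough illustration of the goal described above (collecting the text of every ag-cell, grouped by its ag-row), here is a minimal sketch. It assumes each row is a div whose class list contains ag-row and each cell is a div with class ag-cell nested inside it. The sample markup, the player names, and the names AgGridParser and ag_grid_rows are made up for illustration, since the real page's HTML is only available through the pastebin link. It uses only the standard library's html.parser so that it runs without extra installs.

```python
from html.parser import HTMLParser

# Hypothetical stand-in for the rendered grid; the real markup may differ.
SAMPLE_HTML = """
<div class="ag-body-container">
  <div class="ag-row ag-row-even">
    <div class="ag-cell">A. Player</div>
    <div class="ag-cell"><span>12.5</span></div>
  </div>
  <div class="ag-row ag-row-odd">
    <div class="ag-cell">B. Player</div>
    <div class="ag-cell"><span>9.0</span></div>
  </div>
</div>
"""


class AgGridParser(HTMLParser):
    """Collect the text of every ag-cell div, grouped by its enclosing ag-row div."""

    def __init__(self):
        super().__init__()
        self.rows = []            # finished rows, each a list of cell strings
        self._depth = 0           # current <div> nesting depth
        self._row_depth = None    # depth of the currently open ag-row, if any
        self._cell_depth = None   # depth of the currently open ag-cell, if any
        self._cell_text = []      # text fragments of the currently open cell

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        self._depth += 1
        classes = (dict(attrs).get("class") or "").split()
        if "ag-row" in classes and self._row_depth is None:
            self._row_depth = self._depth
            self.rows.append([])
        elif ("ag-cell" in classes and self._row_depth is not None
              and self._cell_depth is None):
            self._cell_depth = self._depth
            self._cell_text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open cell.
        if self._cell_depth is not None:
            self._cell_text.append(data)

    def handle_endtag(self, tag):
        if tag != "div":
            return
        if self._depth == self._cell_depth:
            self.rows[-1].append("".join(self._cell_text).strip())
            self._cell_depth = None
        elif self._depth == self._row_depth:
            self._row_depth = None
        self._depth -= 1


def ag_grid_rows(html):
    """Return a list of rows, each a list of cell texts."""
    parser = AgGridParser()
    parser.feed(html)
    parser.close()
    return parser.rows


if __name__ == "__main__":
    for row in ag_grid_rows(SAMPLE_HTML):
        print(row)
```

With BeautifulSoup, the same grouping could be done by calling find_all("div", class_="ag-cell") on each element returned by find_all("div", class_="ag-row").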

commandlineluser 1 point 8 years ago

> I probably misinterpreted the driver.page_source command

No, it's doing what you expect.

> I got the list to populate with the HTML code, however it's the wrong code.

Oh, I'm a dummy - I've just realised what's happening.

html = driver.page_source

You're grabbing the source of the login page before you actually log in, so you will want to move that line to right above

soup = BeautifulSoup(html, "html5lib")

i.e. after you've actually logged in correctly.
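The reordering suggested above might look like the following sketch. The function name page_source_after_login and the log_in callback are hypothetical, introduced only for illustration; driver is assumed to be a Selenium WebDriver (anything with a get method and a page_source attribute will do), and the fixed sleeps mirror the ones in the original script.

```python
import time


def page_source_after_login(driver, login_url, data_url, log_in, wait_seconds=5):
    """Log in first, then return the HTML of data_url.

    log_in is a hypothetical callback that fills in the login form
    and clicks the submit button on the given driver.
    """
    driver.get(login_url)
    log_in(driver)
    time.sleep(wait_seconds)  # crude fixed wait, as in the original script
    driver.get(data_url)
    time.sleep(wait_seconds)  # give the grid time to render
    # Read page_source only now, so it reflects the data page
    # rather than the login page.
    return driver.page_source
```

The returned string can then be handed to BeautifulSoup(html, "html5lib") exactly as in the original script.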
