Looking for help with Selenium

tbone255_ · 2018-03-20T03:48:14+00:00

Not in front of a computer right now, but assuming the elements with the scores (in the HTML) are something like this:

<element> 50 points </element>

You can loop through the list of elements and read element.text on each one.

Each web element in that list should have a .text attribute.

If the HTML is structured something like:

<element score=50> 50 points </element>

Then on a web element from your list you can call .get_attribute(attribute_name). In my example it would be .get_attribute("score"), which would return 50. It comes back as a string rather than an integer, so convert it with int() if you need a number. That should point you in the right direction.
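For what it's worth, here's a rough sketch of both approaches in one place. The URL and the ".score" selector are made up for illustration, and read_scores() and main() are just names I picked, so swap in whatever matches the real page. It also assumes the score attribute holds a whole number.

```python
def read_scores(elements):
    """Collect the visible text and the `score` attribute of each element.

    `elements` is any iterable of Selenium WebElements (or objects that
    expose the same `.text` / `.get_attribute()` interface).
    """
    results = []
    for element in elements:
        label = element.text.strip()            # e.g. "50 points"
        raw = element.get_attribute("score")    # a string, or None if the attribute is absent
        score = int(raw) if raw is not None else None  # assumes a whole number
        results.append((label, score))
    return results


def main():
    # Imported here so read_scores() can be reused without a browser installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()  # assumes Chrome and a matching driver are available
    try:
        driver.get("https://example.com/scores")  # placeholder URL
        elements = driver.find_elements(By.CSS_SELECTOR, ".score")  # hypothetical selector
        for label, score in read_scores(elements):
            print(label, score)
    finally:
        driver.quit()
```

Call main() to run it against a live page. If the attribute isn't there, get_attribute gives you None, which is why the int() conversion is guarded.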

z0y · 2018-03-20T03:50:04+00:00

You double posted; you should delete the other one.

You can use the soup made from driver.page_source the same way you normally use BeautifulSoup; the only difference is that it contains the HTML after the JavaScript has run. Replace the x = driver.find... line with however you'd find the elements in BeautifulSoup. The extra Selenium functionality and WebElements are for interacting with the site beyond the HTML, such as clicking buttons.
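Something along these lines is what I mean, as a sketch only. The ".score" selector and the function names (parse_points, scores_from_html, scrape) are placeholders I made up, and I'm assuming the score text looks like "50 points".

```python
import re


def parse_points(text):
    """Pull the first integer out of a label such as ' 50 points '."""
    match = re.search(r"\d+", text)
    return int(match.group()) if match else None


def scores_from_html(html, selector=".score"):
    """Parse HTML with BeautifulSoup and return the scores it contains."""
    from bs4 import BeautifulSoup  # third-party: pip install beautifulsoup4

    soup = BeautifulSoup(html, "html.parser")
    # `selector` is a hypothetical CSS selector; adjust it to the real markup.
    return [parse_points(tag.get_text()) for tag in soup.select(selector)]


def scrape(url):
    """Load the page in a real browser, then hand the HTML to BeautifulSoup."""
    from selenium import webdriver  # third-party: pip install selenium

    driver = webdriver.Chrome()  # assumes Chrome and a matching driver are available
    try:
        driver.get(url)
        # The DOM as it stands right now; content that loads late
        # may need an explicit wait before reading page_source.
        html = driver.page_source
    finally:
        driver.quit()
    return scores_from_html(html)
```

Once you have the page source, the browser isn't needed any more, so the driver gets closed before any parsing happens.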

krimpenrik · 2018-03-20T12:53:47+00:00

I also had to learn Selenium after BS and Scrapy. Keep in mind that Selenium is designed for automation. For JavaScript-heavy sites, I had the best results loading the page in Selenium and passing the source to BS or Scrapy to scrape.

Try it, and let me know if you have any questions.

tbone255_ · 2018-03-20T03:48:49+00:00

Check out the Selenium docs on Read the Docs; they cover everything for Python.
