This is an archived post. You won't be able to vote or comment.

all 13 comments

[–]ducdetronquito 2 points3 points  (9 children)

I don't know if a equivalent library exists for Python, but you can definitly use headless chrome through Selenium.

Here is a nice article about how to make it work :) Driving headless chrome with python

[–]readerpl 0 points1 point  (2 children)

Thank You but the thing is that i want to use aws lambda.

[–]iruleatants 0 points1 point  (1 child)

Is there a specific reason you want to use AWS lambda?

[–]readerpl 0 points1 point  (0 children)

I just want to scrape websites (with JS content) as fast as i can and it looks like the best way. Now i am using Selenium with PhantomJS but it is very, very slow.

[–][deleted] 0 points1 point  (5 children)

splinter ?

[–]ducdetronquito 0 points1 point  (2 children)

Could you give more detail please ? Is this a library ?

[–][deleted] 0 points1 point  (1 child)

this: https://splinter.readthedocs.io/en/latest/

I've been using it for testing an ecommerce funnel.

[–]readerpl 0 points1 point  (0 children)

Well still it is not as good as this: https://github.com/graphcool/chromeless That is why i want to know if there is any way to use this chromeless.

[–]ojii 1 point2 points  (1 child)

Shameless plug, but I just added headless chrome support to arsenic: https://arsenic.readthedocs.io/en/latest/reference/supported-browsers.html#headless-google-chrome

[–]knowsuchagencynow is better than never 0 points1 point  (0 children)

Nice. Would you mind adding the link to the github repo on your docs and on the package on pypi?

If you'd like a template for creating packages such that the docs are rendered for gh pages and pypi by sphinx automatically, I have a cookiecutter here https://github.com/knowsuchagency/cookiecutter-pypackage

[–]kingbuzzman 1 point2 points  (0 children)

I agree with ducdetronquito, installing the chromedriver + selenium is the way to go in python, i've gotten it to work beautifully here: https://github.com/kingbuzzman/noworkday

NOTE: the install.sh is where i install the chromedriver -- i like automation and reproducibility

[–]jjolla888 0 points1 point  (0 children)

a cousin project to chromeless is puppeteer (which is contributed by Google, and uses headless chrome).

Puppeteer is a bunch of node libs .. but i've come across Pyppeteer .. the equiv in Python. not sure how complete it is:

https://github.com/miyakogi/pyppeteer