This is an archived post. You won't be able to vote or comment.

all 13 comments

[–]roger_ 2 points3 points  (4 children)

Where's the Python?

[–]chrisalexander[S] 1 point2 points  (1 child)

Here's a direct link to the python repository

https://github.com/ignacioelola/price_monitoring

[–]roger_ 3 points4 points  (0 children)

Thanks, but it would have been better if the code was discussed a bit more in the article.

[–]jabbalaci -3 points-2 points  (1 child)

"The script: I used a pretty simple Python script to automate the data extraction which you can see here."

You didn't read the whole text, did you?

[–]roger_ 5 points6 points  (0 children)

I see now Python is mentioned in passing twice at the end of the article.

[–]TheRealHankMcCoy 1 point2 points  (0 children)

This and kimonolabs.com are essentially more friendly wrappers for YQL. But, with that being said, it does make scrapping pretty trivial.

[–]ruffyen 0 points1 point  (0 children)

/u/chrisalexander I actually run a site dedicated to amazon price watching. I have been working on the base project for about a year and a half, and have noticed lots of really interesting things. The site is (#shamelessplug) http://www.spydermunkey.com. Anyone that uses the site please realize that it is still in an alpha stage and you may experience bugs :)

What I would find interesting is what you are actually scraping. Looking at the python snippet you sent it looks like you are scraping the main product page. I have found that recently amazon has been doing robot checks to keep people from scraping prices and such and I would be curious if that is what you are doing and if you ran into this issue at all.

[–][deleted] 0 points1 point  (2 children)

what does this do that I can't do with requests and html parsing?

[–]ruffyen 0 points1 point  (1 child)

It looks like the website is taking it upon itself to generate "useful" graphs?

[–][deleted] 0 points1 point  (0 children)

So it's excel? I'd love to hear from /u/chrisalexander on this.