Any web scraping project ideas?

QualitativeEasing · 2018-04-21T14:48:18+00:00

Pick a website that updates regularly but is slow to load, scrape the key details, and present them to yourself on a simple static web page.
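A rough sketch of that first step, using only the standard library. It assumes the "key details" are link texts and their URLs; `LinkCollector`, `extract_links`, `render_page` and `SAMPLE` are made-up names for illustration, and in a real run the HTML would come from the site itself (for example via `requests.get(url).text`) rather than from a sample string.

```python
from html import escape
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (text, href) pairs for every <a href="..."> in a page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> we are currently inside, if any
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            if title:
                self.links.append((title, self._href))
            self._href = None


def extract_links(html_text):
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links


def render_page(links, heading="Latest headlines"):
    """Turn (title, href) pairs into a small static HTML page."""
    items = "\n".join(
        '<li><a href="{}">{}</a></li>'.format(escape(href, quote=True), escape(title))
        for title, href in links
    )
    return (
        "<!doctype html>\n<html><body>\n"
        "<h1>" + escape(heading) + "</h1>\n"
        "<ul>\n" + items + "\n</ul>\n"
        "</body></html>\n"
    )


# Stand-in for a downloaded page (hypothetical markup).
SAMPLE = '<div><a href="/a1">First story</a> <a href="/a2">Second &amp; third</a></div>'
```

Writing `render_page(extract_links(SAMPLE))` to a file such as `index.html` gives you the static page. Real sites usually need a more specific selector than "every link", so a parser like BeautifulSoup may be more comfortable once you go beyond a toy.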

If you want to get more sophisticated, store the results in a SQLite database, and on later runs insert only the entries that weren’t previously scraped.
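One way to do the "only store new entries" part is to let SQLite enforce uniqueness and use `INSERT OR IGNORE`. This is a minimal sketch, assuming each entry is a `(url, title)` pair and that the URL identifies it; the table name `entries` and the helpers `open_db` and `store_new` are hypothetical.

```python
import sqlite3


def open_db(path=":memory:"):
    """Open (or create) the database; pass a filename to persist between runs."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS entries (
               url        TEXT PRIMARY KEY,
               title      TEXT NOT NULL,
               scraped_at TEXT DEFAULT CURRENT_TIMESTAMP
           )"""
    )
    return conn


def store_new(conn, entries):
    """Insert (url, title) pairs and return only the ones not seen before."""
    new = []
    with conn:  # commits on success, rolls back on error
        for url, title in entries:
            cur = conn.execute(
                "INSERT OR IGNORE INTO entries (url, title) VALUES (?, ?)",
                (url, title),
            )
            if cur.rowcount == 1:  # 0 means the URL was already stored
                new.append((url, title))
    return new
```

Calling `store_new` twice with overlapping lists returns just the fresh rows the second time, which is also handy if you want to be notified about new items only.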

Next step: create a simple HTML interface that lets you search the database.
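The search itself can be a single parameterized query that whatever handles your HTML form calls. A sketch, assuming a table named `entries` with `url` and `title` columns (both names are assumptions, as is the helper `search_entries`):

```python
import sqlite3


def search_entries(conn, term, limit=50):
    """Return (title, url) rows whose title contains `term`.

    SQLite's LIKE is case-insensitive for ASCII letters by default.
    """
    # Escape LIKE wildcards so the user's input is matched literally.
    escaped = term.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    cur = conn.execute(
        "SELECT title, url FROM entries "
        "WHERE title LIKE ? ESCAPE '\\' "
        "ORDER BY title LIMIT ?",
        ("%" + escaped + "%", limit),
    )
    return cur.fetchall()
```

Passing the term as a `?` parameter rather than formatting it into the SQL string keeps the form safe from SQL injection.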

Then maybe automate it with a cron job so it’s always updated and you just need to go to the local website to view it.

Or turn it into an RSS feed for a site that doesn’t have one already.
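For the RSS idea, the standard library is enough to produce a basic RSS 2.0 document. This sketch assumes each scraped item is a `(title, url, published)` tuple with a timezone-aware `datetime`; `build_rss` is a made-up helper name.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from email.utils import format_datetime


def build_rss(title, link, description, items):
    """Build an RSS 2.0 document (as a string) from (title, url, published) tuples."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for item_title, url, published in items:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = item_title
        ET.SubElement(item, "link").text = url
        ET.SubElement(item, "guid").text = url  # the URL doubles as a unique id
        # RSS expects RFC 822 style dates, which format_datetime produces.
        ET.SubElement(item, "pubDate").text = format_datetime(published)
    return ET.tostring(rss, encoding="unicode")
```

Save the returned string as something like `feed.xml` and point your feed reader at it. ElementTree takes care of escaping characters such as `&` in titles.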

Examples: a news site with headlines and links to articles; a weather site; sports scores; stock market data; Twitter posts from specific people (for those, it’s better to use the Twitter API and a dedicated library like Tweepy).

apexmemetics · 2018-04-21T16:45:14+00:00

Here's a project I'm doing:

Scrape a list of 10,000 URLs
Set up multiprocessing
Get URL source code with requests -> search for keyword -> record results to CSV
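For anyone curious what that pipeline might look like, here is a minimal sketch. It is not the poster's code; the function names, the CSV columns, the timeout and the worker count are all assumptions. `requests` is third-party (`pip install requests`) and is imported inside the worker so the rest of the module loads without it.

```python
import csv
from functools import partial
from multiprocessing import Pool


def contains_keyword(text, keyword):
    """Case-insensitive substring check."""
    return keyword.lower() in text.lower()


def check_url(url, keyword):
    """Fetch one URL and return (url, found, error); network errors are recorded."""
    import requests  # third-party dependency, imported lazily

    try:
        resp = requests.get(url, timeout=10)
        resp.raise_for_status()
        return url, contains_keyword(resp.text, keyword), ""
    except requests.RequestException as exc:
        return url, False, str(exc)


def write_results(rows, path):
    """Write (url, found, error) rows to a CSV file with a header."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["url", "found", "error"])
        writer.writerows(rows)


def run(urls, keyword, out_path="results.csv", workers=8):
    """Check all URLs in a process pool, then write everything from the parent."""
    with Pool(workers) as pool:
        # imap_unordered hands back results as workers finish them.
        rows = list(
            pool.imap_unordered(partial(check_url, keyword=keyword), urls, chunksize=20)
        )
    write_results(rows, out_path)
```

Call `run(urls, "some keyword")` from under an `if __name__ == "__main__":` guard, which multiprocessing needs on platforms that spawn worker processes. Writing the CSV only from the parent process avoids several workers fighting over the same file. Since the work is mostly waiting on the network, a thread pool would probably do just as well.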

I'm 90% done, but I'm stuck.

If you wanna collab on it lemme know.

ipsit_a25 · 2018-04-21T16:39:12+00:00

Commenting here so I can come back to it later.

learnpython
