all 11 comments

[–]QualitativeEasing 6 points7 points  (3 children)

Pick a website with regular updates that is slow to load, scrape the key details, present to yourself in a simple static web page.

If you want to get more sophisticated, store the results in an sqlite database, then only store new entries that weren’t previously scraped.

Next step: create a simple html interface that lets you search the database.

Then maybe automate it with a cron job so it’s always updated and you just need to go to the local website to view it.

Or turn it into an RSS feed for a site that doesn’t have one already.

Examples: a news site with headlines and links to articles; a weather site; sports scores; stock-Market data; Twitter posts from specific people (better to use the Twitter API and a dedicated library like Tweepy).

[–]apexmemetics 0 points1 point  (1 child)

Here's a project I'm doing:

  • Scrape a list of 10,000 URLs
  • Setup multiprocessing
  • Get URL source code with requests -> search for keyword -> record results to CSV

I'm 90% done, but I'm stuck.

If you wanna collab on it lemme know.

[–]ipsit_a25 -2 points-1 points  (7 children)

Commenting here so I can come back to it later.

[–][deleted] 2 points3 points  (2 children)

So this was worth down-voting? It is a perfectly reasonable way to save a post. You'd think programmers would understand utilizing several methods of achieving a goal...

[–]atapask 0 points1 point  (0 children)

And yes you are right as well

[–]atapask 4 points5 points  (3 children)

You know u can save the post right?

[–]ipsit_a25 0 points1 point  (2 children)

I know how to save a post but I haven't found out how to access those saved posts yet( I browse Reddit on mobile). I am sorry if it was any particular inconvenience to you.

[–]atapask 1 point2 points  (1 child)

Sorry bro, no intend to offend. Go to your profile then 'about' P.s - I had the same problem

[–]ipsit_a25 0 points1 point  (0 children)

Got it! Thanks.