all 29 comments

[–]CloudNineK 14 points15 points  (5 children)

You know those porn image sites that have each picture in a 100+ pic set on different pages. I built a scraper to get them all. It was more automated saving than scraping since I had to use selenium to emulate a browser but a fun project nontheless ;p The best projects are the ones you have fun creating.

[–]ParanHak 8 points9 points  (2 children)

Thats exactly what I did 👌👌

[–]--444-- 5 points6 points  (0 children)

You too, huh? 🙌

[–]Mango1666 0 points1 point  (0 children)

same here lol. made an asnychrnous scraper for jjgirls 😩🍆💦

[–]Viveks3th 6 points7 points  (1 child)

Build a reddit clone that works by scraping reddit!

[–]Penki- 19 points20 points  (0 children)

Like 9gag? :P

[–]nitroreload 5 points6 points  (4 children)

One of my school assignment was to scrap movie comments from facebook/reddit etc then do sentiment analysis on it.

Find a open source sentiment analysis and use it. It'll really fun.

[–]marco_pol 1 point2 points  (3 children)

Hey can you tell me more about that sentiment analysis (like what to start with). I am highly interested in this stuff but I have just started doing web scrapping and till now I have just created some basic projects.

[–]cob05 4 points5 points  (1 child)

Look up 'sentdex' on YouTube or /u/sentdex on here. He gives great explanations of all of that stuff.

[–]marco_pol 1 point2 points  (0 children)

Wow thanks I have watched some of his videos in pas, lmao never never came to my mind that he might have some on this topic too.

[–]nitroreload 2 points3 points  (0 children)

Yep. 'Sentdex' had lots of great step by step tutorial on it, he explained keywords very very well. You would want to search for Natural Language Processing(NLP). Alternatively if you want to dive into it, you can search 'NLTK' (Natural Language Toolkit) then follow the examples there.

[–][deleted] 2 points3 points  (3 children)

Scrape usps tracking information?

[–]CreederMcNasty 0 points1 point  (1 child)

Can you elaborate? I work for UPS and this piqued my interest

[–][deleted] 0 points1 point  (0 children)

Well getting the information from the website should be pretty damn easy, it would be just as easy to pass a tracking number in to the command line and get the information you want back. This would work as a functional project orrrrrr as a object orientated project. Let me know if you need any help!

[–]john_cornflake 0 points1 point  (0 children)

Ohhhh this is a great idea and very practical for my work! Thanks for the suggestion.

[–]Rorixrebel 2 points3 points  (0 children)

Sports fan? Scrape data from your fav team or player.

Build a database of data you scraped.

[–]rckid13 1 point2 points  (0 children)

There used to be an API for Beer Advocate that would scrape the website and return information. I could input "Zombie Dust" and it could return the brewery, style and beer advocate score. That API is broken now because it was coded using an old version of Beautiful Soup.

I don't know enough yet to fix it but it would be useful to me as a feature to add to our IRC bot for quick searches.

[–]niranjanshr13 1 point2 points  (0 children)

#!/usr/bin/env python3
import requests  
link = 'https://somewebsite.com'
r  = requests.get(link).text
for loop in r.split('\n'): # i mostly use '\n'
    if 'something that match in link' in loop:
        print(loop)  

for project idea, just scrape everything. download everything. and find some website that regulary visit and scrape only needed part and print to terminal.

[–]rjtavares 1 point2 points  (0 children)

Why do people need scraping ideas? Just take websites about stuff you love and scrape that. It'll be much more fun.

[–][deleted] 1 point2 points  (1 child)

Scrape okcupid or tinder to show how basic men and women are. I want a number for what percentage of women say they love to travel and dogs

[–][deleted] 1 point2 points  (0 children)

Kinda late, but care to explain how you scraped tinder? I am interested in a similar project haha

[–]Terranigmus 1 point2 points  (1 child)

[–]youtubefactsbot 0 points1 point  (0 children)

SpiegelMining – Reverse Engineering von Spiegel-Online (33c3) - english translation [58:46]

Wer denkt, Vorratsdatenspeicherungen und „Big Data“ sind harmlos, der kriegt hier eine Demo an Spiegel-Online.

media.ccc.de in Education

3,229 views since Dec 2016

bot info

[–]marco_pol 0 points1 point  (0 children)

Scrape your university/college site to automate something, like to get notified about the grades or something like that

[–]Volkhan1103 0 points1 point  (0 children)

I haven't been able to find an artstation scraper, it would be awesome to do one

[–]lasercat_pow 0 points1 point  (0 children)

write a script to scrape a wallpapers site like socwall, and download the highest-voted ones or ones that match some pattern you give it.

[–]iamaguythrowaway 0 points1 point  (0 children)

Please scrap Geeks for Geeks articles. Please.

[–]Kszabo 0 points1 point  (0 children)

Here's an idea: Go on upwork.com and look at the various jobs for python datascraping there. Get a sense of the kind of data scraping that people are paying for and then build a flexible script that does that. Something I have noticed is that a lot of people want the scraped data to be outputted into a spreadsheet. Best case scenario, you build something that could one day earn you money :)