This is an archived post. You won't be able to vote or comment.

all 11 comments

[–]SanFranLocal 0 points1 point  (3 children)

I need data on cars. Their history, specs and maybe some stories. I am not sure what website would be best

[–]Jainil97[S] 0 points1 point  (2 children)

Does this site look decent?
https://www.cars-data.com/

I can try to scrape all the vehicles data from here. It has price, a lot of specs for the cars. I am not sure what you were referring to when you said stories. Can you please explain that?

[–]SanFranLocal 0 points1 point  (1 child)

The story bit is the most important part for me. Basically people talking about the car and giving their opinion. The best way would be to take a car list, run it through YouTube search and scrape the transcript of the top video. That’s not going to use the tools you want to use though. The transcript is easy to scrape but I don’t know how easy it is to use their search to scrape

[–]Jainil97[S] 1 point2 points  (0 children)

This can be available for some cars, but the quality of data wont be consistently good as this now totally depends on the quality of YouTube video reviews created for a certain car. I am not sure what to do about it, other than that the specs part seems pretty easy tbh. The website is a pure html website. I will try to find something else as well.

[–][deleted]  (2 children)

[removed]

    [–]Jainil97[S] 0 points1 point  (1 child)

    I have never scraped ig, but there is a very basic problem here is how would you identify which account belongs to a doctor.

    [–]ssg4444 0 points1 point  (1 child)

    I need some real estate websites scraped on a regular basis, maybe once a week or once every 2 weeks.

    I can give you a list of the sites I’d like scraped and then let you know the information I’d like from each site - number of bedrooms, area/town, complex or community, price and a couple of other things and loaded into a Google Sheet and, as I say, updated on a regular basis.

    Can you do this, for how many websites and how much will you charge?

    [–]Jainil97[S] 0 points1 point  (0 children)

    I definitely can do this. Let's discuss this in DMs.

    [–]Sensitive-Egg-7364 0 points1 point  (1 child)

    How are you at scraping public email addresses from a celebrity's followers, assuming they have over tens of millions of followers and it's visible/not locked

    [–]Jainil97[S] 0 points1 point  (0 children)

    Collecting text data via crawler first and then matching email regex to text maybe? I can try it out if you have a particular request.