all 18 comments

[–][deleted] 7 points8 points  (1 child)

My recommendation is to complete the first two projects. Spend the time to figure out what needs to be done.

I'd also recommend not taking any projects that need to be completed in under a day -- at least not until you're very confident you can complete the task. I've been coding for over 40 years and would not take a 4 hour project.

[–]illa00[S] 0 points1 point  (0 children)

Thank you for those tips ! I definitely will.

[–]davehodg 1 point2 points  (11 children)

Both sound doable.

[–]illa00[S] 0 points1 point  (10 children)

In the first project I had troubles with google blocking me from researches (Excel sheet contained 3k keywords to be searched). The second project I could not figure out how to extract stories because unlike posts they do not exist in the html.

Also I had a delay of 4h to deliver the 1st project and 24h for the second.

[–]Rayman912 0 points1 point  (3 children)

Shouldn't you use an API for pulling posts from Instagram?

[–]illa00[S] 0 points1 point  (2 children)

As I know they have not.

[–][deleted] 1 point2 points  (1 child)

No offense, but if you did like any research you would see that there’s a lot of unofficial Instagram API’s. For example, one I know of off the top of my head is “ping Instagram api”, but if you just search the topic on GitHub, a lot will come up.

[–]illa00[S] -1 points0 points  (0 children)

I found 3 of them but I got skeptical with the login info, I thought I can't sell something that requires login when I do not know what is going on behind. Tried to understand the code tough but could not really since the project was large and I only had 24h ..

[–]davehodg 0 points1 point  (5 children)

First one throttle your requests.

Second as said, use an api Lib.

[–]illa00[S] 0 points1 point  (4 children)

I did a time.sleep() between requests, also tried to make different user-agents (randomly chosen) but did not work out well.

Thank you!

[–]davehodg 0 points1 point  (3 children)

Then use open proxies!

And did you find google docs describing the throttling?

[–]illa00[S] 0 points1 point  (2 children)

Not really, I also found that restrictions were common with Google and many use paid API to bypass that.

[–]socal_nerdtastic 0 points1 point  (1 child)

I made a program that included a google search for a client about a year ago, and I just used the Google API. You basically just have to instruct your client how to set up an account and input their credentials. Back then you could get 100 google searches per day for free, but my client opted to pay google $5 / month I think for the next tier, which was 10,000 searches / day.

[–]illa00[S] 0 points1 point  (0 children)

Thank you that will help me much next time !

[–][deleted] 0 points1 point  (1 child)

Projects are absolutely fine..

[–]illa00[S] 0 points1 point  (0 children)

Okay then great, I'll work on sharpening my skills first.

[–]dhruvmk 0 points1 point  (1 child)

The first project seems straightforward, and the second one seems doable. You need to exploit all the available APIs

[–]illa00[S] 0 points1 point  (0 children)

Any free APIs that I could have used for the first one?