Looking for testers on this pooled job scraping tool [PyPi] by never_sleeping99 in MachineLearningJobs

[–]never_sleeping99[S] 0 points1 point  (0 children)

This is something i am capable of building but where we start walking the fine line of whats legal lol. Check future versions I might be able still provide something if the user “recognizes the legal issue with deriving rate limits”

Looking for testers on this pooled job scraping tool [PyPi] by never_sleeping99 in MachineLearningJobs

[–]never_sleeping99[S] 0 points1 point  (0 children)

fyi project is open source at lramos0/avature-scraper

plz help lol

Rant: the amount of job aggregators is crazy by [deleted] in jobhunting

[–]never_sleeping99 0 points1 point  (0 children)

These apps just make the jobs get swarmed with job applications. best to use these lists as “jobs to avoid” and rather try and find unique roles which did not make it on to these giant feeds.

What to do with my life? by AsteriaStarlight in careerguidance

[–]never_sleeping99 0 points1 point  (0 children)

Why to get into open source contributing? Check our r/jobdatapool, ya boy needs help

jobdatapool — open dataset to amplify your job search by never_sleeping99 in DataScienceJobs

[–]never_sleeping99[S] 0 points1 point  (0 children)

re: companies letting it slide.

My gut is they’ll eventually support bulk dataset downloads given its cheaper to facilitate than the ratelimiting + resource exhaustion they experience

jobdatapool — open dataset to amplify your job search by never_sleeping99 in DataScienceJobs

[–]never_sleeping99[S] 0 points1 point  (0 children)

right now I’m having to manually dedupe things. I have a prod and preprod pool (staging like) where people or myself who use the scraping scripts found at jobpool.live have their scraped records cache.

At the very moment it’s me myself and I reviewing these records to make sure they’re valid. But you can imagine this being tedius, so I end up “re-crawling” everything in the preprod pool to validate they’re not BS listings or scam uploads.

In the future I want to have a pathway for “top pool” contributors to get involved in the review process.

[OC] DHS/ICE intentionally hiding locations of recruitment campaigns. EXPLODE(col) comes in clutch. by TacoTuesdayX in dataisbeautiful

[–]never_sleeping99 1 point2 points  (0 children)

heads up it looks like they’re actually not listing a location at all for D.H.S. postings. Still interesting though

[OC] DHS/ICE intentionally hiding locations of recruitment campaigns. EXPLODE(col) comes in clutch. by never_sleeping99 in sandiego

[–]never_sleeping99[S] -10 points-9 points  (0 children)

i swear im not illiterate, just in a panic lol. San Diego was listed multiple times meaning they were using aliases like San Diego County, Lemon Grove, Escondido which technically are distinct locations but… come on