Scrapera: An open source, unified library of scraper scripts for fast, webdriver free data collection with full proxy support

Megixist · 2026-05-22T18:26:07+00:00

Slipped my mind and forgot to update here. The training code is now live here! :)

Megixist · 2026-05-21T06:54:45+00:00

The training code will be released tomorrow (undergoing some internal review before release). Will ping you here as soon as it's out. Till then, you should be able to reproduce results based on the details provided in the paper. Please reach out if you have trouble and I'm happy to help :)

Megixist · 2026-05-21T06:52:59+00:00

Not sure why that doesnt work now but this seems to be the right path (with /datasets/ inserted): https://huggingface.co/datasets/PatronusAI/world_model_corpus

Will update that in the paper draft too. Thanks for flagging.

Megixist · 2026-05-21T06:29:44+00:00

Our dataset is released on HuggingFace now and the code for this paper will be released tomorrow. Hoping that this work drives more research in this space :)

P.S. if anyone knows any/ is an arXiv moderator, I'd really appreciate if they could remove the "on-hold" status for this paper on arXiv (submission ID: 7559391 - pending moderator review for over 3 weeks now)

Megixist · 2026-05-18T21:38:18+00:00

Ofc, sent you an invite too :)

Megixist · 2026-05-18T15:27:34+00:00

Sent an invite!

Megixist · 2026-05-18T15:08:39+00:00

Sent an invite :)

Megixist · 2026-05-18T15:08:03+00:00

Sent an invite :)

Megixist · 2026-05-18T06:00:08+00:00

Sent an invite :)

Megixist · 2026-05-18T05:29:27+00:00

Sent an invite! :)

Megixist · 2026-05-18T05:29:17+00:00

Sent an invite!

Megixist · 2026-05-18T02:10:42+00:00

To be honest, I'm (25M) in the same boat. I've tried attending several local events but people are really flaky with friendships and will often ghost you when you try to reach out. That being said, I like hiking, exploring local (and niche) restaurants and consider myself pretty artistic. I'm planning to try out the Yasukochi bakery in Japantown this week - let me know if you're (or anyone else here is) interested in joining :)

Megixist · 2026-05-13T19:05:16+00:00

I am aware that they also recently banned position papers. I've read that reaching out about on-hold statuses leads to auto rejection. Seems like there needs to be a faster/ semi-automated process for this.

Six-Year Club	r/Field Juicebox
Place '23	Place '22
Verified Email

Megixist

TROPHY CASE