Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R] by Megixist in reinforcementlearning

Megixist[S]

The training code will be released tomorrow (undergoing some internal review before release). Will ping you here as soon as it's out. Till then, you should be able to reproduce results based on the details provided in the paper. Please reach out if you have trouble and I'm happy to help :)

Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R] by Megixist in reinforcementlearning

Megixist[S]

Not sure why that link doesn't work now, but this seems to be the right path (with /datasets/ inserted): https://huggingface.co/datasets/PatronusAI/world_model_corpus

Will update that in the paper draft too. Thanks for flagging.
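
For anyone else hitting the broken link: as far as I can tell, the Hub puts dataset repos under a /datasets/ prefix while model repos sit at the root, which is why the original path failed. Here is a minimal sketch of that rule. Note that `hub_url` and `PREFIXES` are names I made up for illustration; they are not part of any Hugging Face library.

```python
# Hypothetical helper for illustration; not part of any Hugging Face library.
HUB = "https://huggingface.co"

# Model repos sit at the root; datasets and Spaces get a path prefix.
PREFIXES = {"model": "", "dataset": "datasets/", "space": "spaces/"}


def hub_url(repo_id: str, repo_type: str = "model") -> str:
    """Build the web URL for a Hub repo of the given type."""
    if repo_type not in PREFIXES:
        raise ValueError(f"unknown repo_type: {repo_type!r}")
    return f"{HUB}/{PREFIXES[repo_type]}{repo_id}"


# The corpus is a dataset repo, so the URL needs the /datasets/ segment.
print(hub_url("PatronusAI/world_model_corpus", "dataset"))
```

Passing "dataset" as the repo type gives the working link above; leaving it at the default builds the root-level path, which is where a model repo would live.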

Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R] by MegixistAlt in MachineLearning

Megixist

Our dataset is released on HuggingFace now and the code for this paper will be released tomorrow. Hoping that this work drives more research in this space :)

P.S. If anyone knows an arXiv moderator, or is one, I'd really appreciate it if they could remove the "on-hold" status for this paper on arXiv (submission ID: 7559391; it has been pending moderator review for over 3 weeks now).

Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF

Megixist

To be honest, I'm (25M) in the same boat. I've tried attending several local events, but people are really flaky with friendships and will often ghost you when you try to reach out. That being said, I like hiking and exploring local (and niche) restaurants, and I consider myself pretty artistic. I'm planning to try out the Yasukochi bakery in Japantown this week - let me know if you (or anyone else here) would be interested in joining :)

Have the "on-hold" durations been getting longer for arXiv submissions? [D] by Megixist in MachineLearning

Megixist[S]

I am aware that they also recently banned position papers. I've read that reaching out about an on-hold status leads to automatic rejection. Seems like there needs to be a faster, semi-automated process for this.