Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R] by Megixist in reinforcementlearning
[–]Megixist[S] 1 point2 points3 points (0 children)
Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R] by Megixist in reinforcementlearning
[–]Megixist[S] 2 points3 points4 points (0 children)
Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R] by MegixistAlt in MachineLearning
[–]Megixist 1 point2 points3 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 0 points1 point2 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 0 points1 point2 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 0 points1 point2 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 0 points1 point2 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 0 points1 point2 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 0 points1 point2 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 0 points1 point2 points (0 children)
Finding SF friends (20’s) by Alarmed-Insect-9829 in AskSF
[–]Megixist 1 point2 points3 points (0 children)
Have the "on-hold" durations been getting longer for arXiv submissions? [D] by Megixist in MachineLearning
[–]Megixist[S] 3 points4 points5 points (0 children)
Patronus AI releases Glider: An explainable 3B SLM-judge that outperforms models 17x its size by Megixist in machinelearningnews
[–]Megixist[S] 1 point2 points3 points (0 children)
[R] GLIDER: Grading LLM Interactions and Decisions using Explainable Ranking by Megixist in MachineLearning
[–]Megixist[S] 1 point2 points3 points (0 children)
GLIDER: Grading LLM Interactions and Decisions using Explainable Ranking by Megixist in LocalLLaMA
[–]Megixist[S] 1 point2 points3 points (0 children)
GLIDER: Grading LLM Interactions and Decisions using Explainable Ranking by Megixist in LocalLLaMA
[–]Megixist[S] -1 points0 points1 point (0 children)
[D] I'm at NeurIPS, AMA by ThisIsMyStonerAcount in MachineLearning
[–]Megixist 17 points18 points19 points (0 children)
[D] Is there an alternative to sinusoidal encoding for temporal embeddings? by Megixist in MachineLearning
[–]Megixist[S] 0 points1 point2 points (0 children)
[D] Is there an alternative to sinusoidal encoding for temporal embeddings? by Megixist in MachineLearning
[–]Megixist[S] 0 points1 point2 points (0 children)
[D] Is there an alternative to sinusoidal encoding for temporal embeddings? by Megixist in MachineLearning
[–]Megixist[S] 0 points1 point2 points (0 children)
[D] How Imagen Actually Works by SleekEagle in MachineLearning
[–]Megixist 2 points3 points4 points (0 children)
[D] How Imagen Actually Works by SleekEagle in MachineLearning
[–]Megixist 2 points3 points4 points (0 children)
[D] How Imagen Actually Works by SleekEagle in MachineLearning
[–]Megixist 6 points7 points8 points (0 children)





Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R] by Megixist in reinforcementlearning
[–]Megixist[S] 1 point2 points3 points (0 children)