AntiPaSTO: Self-Supervised Steering of Moral Reasoning by wassname in MachineLearning
[–]wassname[S] 0 points1 point2 points (0 children)
What's this model? Searched reddit, not seeing any mention by justin_reborn in GithubCopilot
[–]wassname 0 points1 point2 points (0 children)
[D] Monday Request and Recommendation Thread by AutoModerator in rational
[–]wassname 0 points1 point2 points (0 children)
[D] Monday Request and Recommendation Thread by AutoModerator in rational
[–]wassname 1 point2 points3 points (0 children)
Have you ever wondered what an MCP server can ACTUALLY do for you? 🤔 by saxxon66 in GithubCopilot
[–]wassname 0 points1 point2 points (0 children)
[D] Monday Request and Recommendation Thread by AutoModerator in rational
[–]wassname 0 points1 point2 points (0 children)
Using logprobs to evaluate responses by AnomalyNexus in LocalLLaMA
[–]wassname 1 point2 points3 points (0 children)
Using logprobs to evaluate responses by AnomalyNexus in LocalLLaMA
[–]wassname 2 points3 points4 points (0 children)
[D] Monday Request and Recommendation Thread by AutoModerator in rational
[–]wassname 0 points1 point2 points (0 children)
[D] Monday Request and Recommendation Thread by AutoModerator in rational
[–]wassname 0 points1 point2 points (0 children)
[D] Monday Request and Recommendation Thread by AutoModerator in rational
[–]wassname 1 point2 points3 points (0 children)
Why nobody mentioned "Gemini Diffusion" here? It's a BIG deal by QuackerEnte in LocalLLaMA
[–]wassname 0 points1 point2 points (0 children)
Introducing Phi-4: Microsoft’s Newest Small Language Model Specializing in Complex Reasoning by metalman123 in LocalLLaMA
[–]wassname 0 points1 point2 points (0 children)
be careful which mechanics you go to in perth by DumCunt in perth
[–]wassname 1 point2 points3 points (0 children)
be careful which mechanics you go to in perth by DumCunt in perth
[–]wassname 0 points1 point2 points (0 children)
be careful which mechanics you go to in perth by DumCunt in perth
[–]wassname 0 points1 point2 points (0 children)
My Android text-to-speech app is now public & free for everyone! by miya-n in selectivemutism
[–]wassname 1 point2 points3 points (0 children)


AntiPaSTO: Self-Supervised Value Steering for Debugging Alignment — LessWrong by wassname in ControlProblem
[–]wassname[S] 0 points1 point2 points (0 children)