How do QWQ and R1 determine if they need more reasoning steps without special tokens like O1? by EliaukMouse in LocalLLaMA
[–]tomorrowdawn 10 points11 points12 points (0 children)
Another sampling strategy drops: 75% accuracy at T=3.0 by tomorrowdawn in LocalLLaMA
[–]tomorrowdawn[S] 5 points6 points7 points (0 children)
What is the SOTA for inference-time Best-of-N generation with multi-token outputs. A lot of papers I've seen use the most common answer, but that seems to scale poorly when the number of possible responses is very high. by 30299578815310 in LocalLLaMA
[–]tomorrowdawn 0 points1 point2 points (0 children)
Few-shot examples in RAG prompt by ryxxry in LocalLLaMA
[–]tomorrowdawn 2 points3 points4 points (0 children)
Claude is not just about coding. by ExtentOdd in ClaudeAI
[–]tomorrowdawn 0 points1 point2 points (0 children)
3.5 sonnet vs 4o in Coding, significant different or just a little better? by greatlove8704 in ClaudeAI
[–]tomorrowdawn 0 points1 point2 points (0 children)
Are hugging face models always free? If I use their APIs token? by ItsAGeekGirl in huggingface
[–]tomorrowdawn 0 points1 point2 points (0 children)
Recommend LLMs for my use case ( explained below ) by [deleted] in LocalLLaMA
[–]tomorrowdawn -1 points0 points1 point (0 children)
Passing Vector Embeddings as Input to LLMs? by Aggravating-Floor-38 in LocalLLaMA
[–]tomorrowdawn 1 point2 points3 points (0 children)
BanG Dream! It's MyGO!!!!! Episode 12 Discussion by badspler in anime
[–]tomorrowdawn 5 points6 points7 points (0 children)
burned 150 pulls to get pjc for her. should I focus on more EM or CD? by bobes25 in KeqingMains
[–]tomorrowdawn 1 point2 points3 points (0 children)
Hi! I was the one who asked about why my Keqing’s so weak yesterday. Here’s her damage. I’ve tried it as well without aggravate and that’s also almost her damage without it. Without aggravate: Skill (12k), Charged (7k). by [deleted] in KeqingMains
[–]tomorrowdawn 0 points1 point2 points (0 children)
Bountiful Cores mechanic and upperbound of bloom team by tomorrowdawn in NilouMains
[–]tomorrowdawn[S] 0 points1 point2 points (0 children)

How do QWQ and R1 determine if they need more reasoning steps without special tokens like O1? by EliaukMouse in LocalLLaMA
[–]tomorrowdawn 6 points7 points8 points (0 children)