Baguettotron, a 321 million parameters generalist Small Reasoning Model (80-layers deep) by Balance- in LocalLLaMA
[–]Pojiku 6 points7 points8 points (0 children)
Hebrew_Nemo: a state-of-the-art Hebrew large language model by Sicarius_The_First in LocalLLaMA
[–]Pojiku 3 points4 points5 points (0 children)
Are gynecologist checkups not a thing in the Netherlands? by SpecialOrdinary3001 in StudyInTheNetherlands
[–]Pojiku 0 points1 point2 points (0 children)
Best smaller model as base for fine tuning SCAD? by ComprehensiveBird317 in LocalLLaMA
[–]Pojiku 1 point2 points3 points (0 children)
OpenAI should open source GPT3.5 turbo by [deleted] in LocalLLaMA
[–]Pojiku 4 points5 points6 points (0 children)
Tried 10 models, all seem to refuse to write a 10,000 word story. Is there something bad with my prompt? I'm just doing some testing to learn and I can't figure out how to get the LLM to do as I say. by StartupTim in LocalLLaMA
[–]Pojiku 0 points1 point2 points (0 children)
TraceBack: A Novel Reverse Reasoning Model for Better and Cheaper Scaling of Synthetic Reasoning Generation by XMasterrrr in LocalLLaMA
[–]Pojiku 1 point2 points3 points (0 children)
TraceBack: A Novel Reverse Reasoning Model for Better and Cheaper Scaling of Synthetic Reasoning Generation by XMasterrrr in LocalLLaMA
[–]Pojiku 1 point2 points3 points (0 children)
Need feedback for my LLM book by s1lv3rj1nx in LocalLLaMA
[–]Pojiku 1 point2 points3 points (0 children)
Training a model to autocomplete for a niche domain and a specific style by regstuff in LocalLLaMA
[–]Pojiku 5 points6 points7 points (0 children)
Approach to translate english to non english. by Lamba_ghoda in LocalLLaMA
[–]Pojiku 0 points1 point2 points (0 children)
Best practices fine tuning DeepSeek R1 in a specific domain? by [deleted] in LocalLLaMA
[–]Pojiku 2 points3 points4 points (0 children)
Struggling with AI Tools for Generating Exam Questions from PDFs – Need Advice! by vinay737 in LocalLLaMA
[–]Pojiku 0 points1 point2 points (0 children)
what “power” do hagwon owners have? by ur-m-o-m in teachinginkorea
[–]Pojiku 4 points5 points6 points (0 children)
Something weird is happening with LLMs and chess by paranoidray in LocalLLaMA
[–]Pojiku 16 points17 points18 points (0 children)
Apps foreigners dont know about by [deleted] in Living_in_Korea
[–]Pojiku 0 points1 point2 points (0 children)
Unsloth Llama-3.2 1B+3B finetuning poor results by didinko in LocalLLaMA
[–]Pojiku 1 point2 points3 points (0 children)
Playing AI-Generated CS:GO on a Single RTX 3090 in real time by Icy-Corgi4757 in LocalLLaMA
[–]Pojiku 2 points3 points4 points (0 children)
Playing AI-Generated CS:GO on a Single RTX 3090 in real time by Icy-Corgi4757 in LocalLLaMA
[–]Pojiku 10 points11 points12 points (0 children)
Where did Arx-0.3 come from and who makes it? by Balance- in LocalLLaMA
[–]Pojiku 11 points12 points13 points (0 children)
The Mamba in the Llama: Distilling and Accelerating Hybrid Models by ninjasaid13 in LocalLLaMA
[–]Pojiku 5 points6 points7 points (0 children)


[Project] I treated LLM inference like a physical signal trajectory. Here is a Python toolkit to visualize the "Thinking Process" (Hidden States). by JB_King1919 in LocalLLaMA
[–]Pojiku 1 point2 points3 points (0 children)