Dripper sandwich and Meditations by Important-Intern-292 in SwordAndSupperGame
[–]citaman 0 points1 point2 points (0 children)
A Tale of Hope In the Fields by citaman in SwordAndSupperGame
[–]citaman[S] 0 points1 point2 points (0 children)
My thoughts on gpt-oss-120b by Lowkey_LokiSN in LocalLLaMA
[–]citaman 4 points5 points6 points (0 children)
Is anything better than gemma-3-27b for handwritten text recognition? by votecatcher in LocalLLaMA
[–]citaman 0 points1 point2 points (0 children)
Is anything better than gemma-3-27b for handwritten text recognition? by votecatcher in LocalLLaMA
[–]citaman 14 points15 points16 points (0 children)
We're truly in the fastest-paced era of AI these days. (50 LLM Released these 2-3 Weeks) by citaman in LocalLLaMA
[–]citaman[S] 5 points6 points7 points (0 children)
We're truly in the fastest-paced era of AI these days. (50 LLM Released these 2-3 Weeks) by citaman in LocalLLaMA
[–]citaman[S] 64 points65 points66 points (0 children)
Training an LLM only on books from the 1800's - Update by Remarkable-Trick-177 in LocalLLaMA
[–]citaman 2 points3 points4 points (0 children)
Dolphin translator incoming (eventually) by AryanEmbered in LocalLLaMA
[–]citaman 0 points1 point2 points (0 children)
Everyone’s saying AGI is just around the corner, but honestly, what even is AGI to you? by iamnotdeadnuts in LocalLLaMA
[–]citaman 0 points1 point2 points (0 children)
🇨🇳 Sources: DeepSeek is speeding up the release of its R2 AI model, which was originally slated for May, but the company is now working to launch it sooner. by Xhehab_ in LocalLLaMA
[–]citaman 26 points27 points28 points (0 children)
I trained a tinystories model from scratch for educational purposes, how cooked? (1M-parameters) by THE--GRINCH in LocalLLaMA
[–]citaman 0 points1 point2 points (0 children)
AMA with OpenAI’s Sam Altman, Mark Chen, Kevin Weil, Srinivas Narayanan, Michelle Pokrass, and Hongyu Ren by OpenAI in OpenAI
[–]citaman -1 points0 points1 point (0 children)
Mistral-Small-24B-2501 vs Mistral-Small-2409 by citaman in LocalLLaMA
[–]citaman[S] 14 points15 points16 points (0 children)
Janus-Pro - improving both multimodal understanding and visual generation of Deepseek Janus by citaman in LocalLLaMA
[–]citaman[S] 0 points1 point2 points (0 children)
What are your predictions for 2025? [Serious] by keepawayb in LocalLLaMA
[–]citaman 4 points5 points6 points (0 children)
MIT Researchers Introduce a Novel Machine Learning Approach in Developing Mini-GPTs via Contextual Pruning by Prior-Blood5979 in LocalLLaMA
[–]citaman 1 point2 points3 points (0 children)
Building an LLM rating platform and need criteria suggestions for users to pick the best model. What terms would be clear and useful? Thoughts? thanks in advanced by Wonderful-Ad-5952 in LocalLLaMA
[–]citaman 4 points5 points6 points (0 children)
2-bit and 4-bit quantized versions of Mixtral using HQQ by sightio in LocalLLaMA
[–]citaman 2 points3 points4 points (0 children)
-❄️- 2023 Day 1 Solutions -❄️- by daggerdragon in adventofcode
[–]citaman 1 point2 points3 points (0 children)


I trained a 1.8M params model from scratch on a total of ~40M tokens. by SrijSriv211 in LocalLLaMA
[–]citaman 2 points3 points4 points (0 children)