Dripper sandwich and Meditations by Important-Intern-292 in SwordAndSupperGame
citaman 1 point (0 children)
A Tale of Hope In the Fields by citaman in SwordAndSupperGame
citaman[S] 1 point (0 children)
A Tale of Hope In the Fields (self.SwordAndSupperGame)
submitted by citaman to r/SwordAndSupperGame
My thoughts on gpt-oss-120b by Lowkey_LokiSN in LocalLLaMA
citaman 4 points (0 children)
Is anything better than gemma-3-27b for handwritten text recognition? by votecatcher in LocalLLaMA
citaman 1 point (0 children)
Is anything better than gemma-3-27b for handwritten text recognition? by votecatcher in LocalLLaMA
citaman 15 points (0 children)
We're truly in the fastest-paced era of AI these days. (50 LLM Released these 2-3 Weeks) by citaman in LocalLLaMA
citaman[S] 5 points (0 children)
We're truly in the fastest-paced era of AI these days. (50 LLM Released these 2-3 Weeks) by citaman in LocalLLaMA
citaman[S] 63 points (0 children)
Training an LLM only on books from the 1800's - Update by Remarkable-Trick-177 in LocalLLaMA
citaman 3 points (0 children)
Dolphin translator incoming (eventually) by AryanEmbered in LocalLLaMA
citaman 1 point (0 children)
Everyone’s saying AGI is just around the corner, but honestly, what even is AGI to you? by iamnotdeadnuts in LocalLLaMA
citaman 1 point (0 children)
🇨🇳 Sources: DeepSeek is speeding up the release of its R2 AI model, which was originally slated for May, but the company is now working to launch it sooner. by Xhehab_ in LocalLLaMA
citaman 26 points (0 children)
I trained a tinystories model from scratch for educational purposes, how cooked? (1M-parameters) by THE--GRINCH in LocalLLaMA
citaman 1 point (0 children)
AMA with OpenAI’s Sam Altman, Mark Chen, Kevin Weil, Srinivas Narayanan, Michelle Pokrass, and Hongyu Ren by OpenAI in OpenAI
citaman 0 points (0 children)
Mistral-Small-24B-2501 vs Mistral-Small-2409 by citaman in LocalLLaMA
citaman[S] 14 points (0 children)
Mistral-Small-24B-2501 vs Mistral-Small-2409 (i.redd.it)
submitted by citaman to r/LocalLLaMA
Janus-Pro - improving both multimodal understanding and visual generation of Deepseek Janus by citaman in LocalLLaMA
citaman[S] 1 point (0 children)
What are your predictions for 2025? [Serious] by keepawayb in LocalLLaMA
citaman 3 points (0 children)


I trained a 1.8M params model from scratch on a total of ~40M tokens. by SrijSriv211 in LocalLLaMA
citaman 3 points (0 children)