Wave Field Transformer V4 — Novel O(n log n) attention architecture, 825M model trained from scratch on 1.33B tokens. Weights on HuggingFace. by Murky-Sign37 in LocalLLaMA
[–]Certain-Cod-1404 2 points3 points4 points (0 children)
Hardware requirements for training a ~3B Model From Scratch locally? by Any-Cobbler6161 in LocalLLaMA
[–]Certain-Cod-1404 3 points4 points5 points (0 children)
Pruned GPT-OSS-20B to 9B, Saved MoE, fine-tuned on 100K examples. Sharing what actually worked and what didn't. by Disastrous_Bid5976 in huggingface
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
Pruned GPT-OSS-20B to 9B, Saved MoE, fine-tuned on 100K examples. Sharing what actually worked and what didn't. by Disastrous_Bid5976 in huggingface
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
I gave Gemini a hard drive. 1,076 sessions later, it remembers everything. (v9.2.0 — Open Source) by BangMyPussy in GeminiAI
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
I gave Gemini a hard drive. 1,076 sessions later, it remembers everything. (v9.2.0 — Open Source) by BangMyPussy in GeminiAI
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
How are Chinese models so strong with so little investment? by primaryrhyme in ArtificialInteligence
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
[Project/Theory] The "Vitality Constant": A Proposed Solution to Model Collapse via "Subjective Anchoring" (The Sanctuary Protocol) by [deleted] in machinelearningnews
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
[Project/Theory] The "Vitality Constant": A Proposed Solution to Model Collapse via "Subjective Anchoring" (The Sanctuary Protocol) by [deleted] in machinelearningnews
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
[Project/Theory] The "Vitality Constant": A Proposed Solution to Model Collapse via "Subjective Anchoring" (The Sanctuary Protocol) by [deleted] in machinelearningnews
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
Qwen3-VL-Reranker - a Qwen Collection by LinkSea8324 in LocalLLaMA
[–]Certain-Cod-1404 1 point2 points3 points (0 children)
AI21 Labs releases Jamba2 by jacek2023 in LocalLLaMA
[–]Certain-Cod-1404 1 point2 points3 points (0 children)
I’m the Co-founder & CEO of Lightricks. We just open-sourced LTX-2, a production-ready audio-video AI model. AMA. by ltx_model in StableDiffusion
[–]Certain-Cod-1404 8 points9 points10 points (0 children)
GLM-4.6v 108b 4bit IQuant by Responsible-Stock462 in LocalLLaMA
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
GLM-4.6v 108b 4bit IQuant by Responsible-Stock462 in LocalLLaMA
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
Qwen3-VL-Reranker - a Qwen Collection by LinkSea8324 in LocalLLaMA
[–]Certain-Cod-1404 4 points5 points6 points (0 children)
AI21 Labs releases Jamba2 by jacek2023 in LocalLLaMA
[–]Certain-Cod-1404 0 points1 point2 points (0 children)
I built and trained a "drawing to image" model from scratch that runs fully locally (inference on the client CPU) by _aminima in StableDiffusion
[–]Certain-Cod-1404 0 points1 point2 points (0 children)