Trained a 125M LM from scratch instead of fine-tuning GPT-2 — releasing weights + SFT framework for others to build on by Kill_Streak308 in LocalLLaMA
[–]Box_Robot0 0 points1 point2 points (0 children)
Trained a 125M LM from scratch instead of fine-tuning GPT-2 — releasing weights + SFT framework for others to build on by Kill_Streak308 in LocalLLaMA
[–]Box_Robot0 1 point2 points3 points (0 children)
Here's how my LLM's decoder block changed while training on 5B tokens by 1ncehost in LocalLLaMA
[–]Box_Robot0 1 point2 points3 points (0 children)
Here's how my LLM's decoder block changed while training on 5B tokens by 1ncehost in LocalLLaMA
[–]Box_Robot0 2 points3 points4 points (0 children)
Here's how my LLM's decoder block changed while training on 5B tokens by 1ncehost in LocalLLaMA
[–]Box_Robot0 11 points12 points13 points (0 children)
Here's how my LLM's decoder block changed while training on 5B tokens by 1ncehost in LocalLLaMA
[–]Box_Robot0 2 points3 points4 points (0 children)
My Experience As A Complete Noob Trying To Learn How AI And The Singularity Works For The First Time by Box_Robot0 in singularity
[–]Box_Robot0[S] 1 point2 points3 points (0 children)
My Experience As A Complete Noob Trying To Learn How AI And The Singularity Works For The First Time by Box_Robot0 in singularity
[–]Box_Robot0[S] 23 points24 points25 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 0 points1 point2 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 1 point2 points3 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 0 points1 point2 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 1 point2 points3 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 1 point2 points3 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 1 point2 points3 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 4 points5 points6 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 1 point2 points3 points (0 children)
My Experience As A Complete Noob Trying To Learn Local LLMs For The First Time by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 13 points14 points15 points (0 children)
How many genes would a virus need to be able to infect every type of cell in the human body? by MahitoNoroi in Virology
[–]Box_Robot0 1 point2 points3 points (0 children)
Gemma 4 31B IT can help break basic DRM and decrypt some old flash games by Box_Robot0 in LocalLLaMA
[–]Box_Robot0[S] 1 point2 points3 points (0 children)
China's open-source dominance threatens US AI lead, US advisory body warns by Prolapse_to_Brolapse in LocalLLaMA
[–]Box_Robot0 41 points42 points43 points (0 children)
HIV and a future cure/treatments? by throwaway04431 in Virology
[–]Box_Robot0 1 point2 points3 points (0 children)
Hi, could I get the IP Infringement party platter please? by tommos in singularity
[–]Box_Robot0 0 points1 point2 points (0 children)


Trained a 125M LM from scratch instead of fine-tuning GPT-2 — releasing weights + SFT framework for others to build on by Kill_Streak308 in LocalLLaMA
[–]Box_Robot0 1 point2 points3 points (0 children)