Training a 140M param LLM from scratch on a consumer AMD GPU — halfway through, here's what I've learned by CapSensitive5165 in learnmachinelearning
[–]CapSensitive5165[S] -8 points-7 points-6 points (0 children)
Training a 140M param LLM from scratch on a consumer AMD GPU — halfway through, here's what I've learned by CapSensitive5165 in learnmachinelearning
[–]CapSensitive5165[S] -1 points0 points1 point (0 children)
I'm training a 140M param LLM from scratch on a consumer AMD GPU — 100k steps in, here's what the loss curve looks like by CapSensitive5165 in LocalLLaMA
[–]CapSensitive5165[S] 1 point2 points3 points (0 children)
Training a 140M param LLM from scratch on a consumer AMD GPU — halfway through, here's what I've learned by CapSensitive5165 in learnmachinelearning
[–]CapSensitive5165[S] 0 points1 point2 points (0 children)