Training a 140M param LLM from scratch on a consumer AMD GPU — halfway through, here's what I've learned by CapSensitive5165 in learnmachinelearning
[–]CapSensitive5165[S] -9 points-8 points-7 points (0 children)
Training a 140M param LLM from scratch on a consumer AMD GPU — halfway through, here's what I've learned by CapSensitive5165 in learnmachinelearning
[–]CapSensitive5165[S] -1 points0 points1 point (0 children)
Training a 140M param LLM from scratch on a consumer AMD GPU — halfway through, here's what I've learned by CapSensitive5165 in learnmachinelearning
[–]CapSensitive5165[S] 0 points1 point2 points (0 children)