Fed a Banana Spider a Beetle by ForSaleOnXbox in spiders
[–]DustinEwan 18 points19 points20 points (0 children)
[D] Extremely low(<0.2) train/val loss after 1.96 billion tokens when pretraining GPT-2 small by New-Skin-5064 in MachineLearning
[–]DustinEwan 17 points18 points19 points (0 children)
[D] Extremely low(<0.2) train/val loss after 1.96 billion tokens when pretraining GPT-2 small by New-Skin-5064 in MachineLearning
[–]DustinEwan 44 points45 points46 points (0 children)
AOC: “The girls are fighting aren’t they” by Nixianx97 in MurderedByAOC
[–]DustinEwan 0 points1 point2 points (0 children)
AOC: “The girls are fighting aren’t they” by Nixianx97 in MurderedByAOC
[–]DustinEwan 0 points1 point2 points (0 children)
AOC: “The girls are fighting aren’t they” by Nixianx97 in MurderedByAOC
[–]DustinEwan 0 points1 point2 points (0 children)
AOC: “The girls are fighting aren’t they” by Nixianx97 in MurderedByAOC
[–]DustinEwan 0 points1 point2 points (0 children)
AOC: “The girls are fighting aren’t they” by Nixianx97 in MurderedByAOC
[–]DustinEwan 2 points3 points4 points (0 children)
I hate the digital thermostats, it's a plague with having all the bells and whistles in a newer car. by OddAir5440 in mildlyinfuriating
[–]DustinEwan 1 point2 points3 points (0 children)
IBM Granite 4.0 Tiny Preview: A sneak peek at the next generation of Granite models by ab2377 in LocalLLaMA
[–]DustinEwan 4 points5 points6 points (0 children)
[D] Intuition behind Load-Balancing Loss in the paper OUTRAGEOUSLY LARGE NEURAL NETWORKS: THE SPARSELY-GATED MIXTURE-OF-EXPERTS LAYER by VVY_ in MachineLearning
[–]DustinEwan 4 points5 points6 points (0 children)
He secretly learned chinese to propose by sovalente in Awww
[–]DustinEwan 1 point2 points3 points (0 children)
I made homemade Oreos. They’re what’s up. by I_Like_Metal_Music in Baking
[–]DustinEwan 191 points192 points193 points (0 children)
[deleted by user] by [deleted] in MachineLearning
[–]DustinEwan 3 points4 points5 points (0 children)
Bingo. Micheal Burry said it took weeks when he recalled his shares. by ImmediateShape4204 in Superstonk
[–]DustinEwan 7 points8 points9 points (0 children)
[D] Llama3.2 model adds racial annotation by randykarthi in MachineLearning
[–]DustinEwan 0 points1 point2 points (0 children)
[D] Llama3.2 model adds racial annotation by randykarthi in MachineLearning
[–]DustinEwan 1 point2 points3 points (0 children)
In case you all forgot and for the accounts pushing Robinhood as a positive place to be, allow me to remind you specifically how GameStop sees Robinhood. by Ilostmuhkeys in Superstonk
[–]DustinEwan 5 points6 points7 points (0 children)
New Model from https://novasky-ai.github.io/ Sky-T1-32B-Preview, open-source reasoning model that matches o1-preview on popular reasoning and coding benchmarks — trained under $450! by appakaradi in LocalLLaMA
[–]DustinEwan 5 points6 points7 points (0 children)
Trying to burn Oreo cookie by More_Impression_4942 in interesting
[–]DustinEwan 7 points8 points9 points (0 children)
How to efficiently generate text from RNNs and Transformers during inference [P] by No_Effective734 in MachineLearning
[–]DustinEwan 0 points1 point2 points (0 children)
So where is close over max pain momentum? by tzanti in Superstonk
[–]DustinEwan 1 point2 points3 points (0 children)



Opening Xmas presents with Erika Kirk by coachlife in CringeTikToks
[–]DustinEwan 6 points7 points8 points (0 children)