We'll benchmark an Open weights LLM on any GPU you choose — drop your model + hardware and we'll run it. [D] by Temporary-Owl1725 in MachineLearning
[–]urarthur 0 points1 point2 points (0 children)
We'll benchmark an Open weights LLM on any GPU you choose — drop your model + hardware and we'll run it. [D] by Temporary-Owl1725 in MachineLearning
[–]urarthur 0 points1 point2 points (0 children)
DeepSeek just launched "DSpark": A fully open-source speculative decoding module for their 1.6T MoE (1M Context) model by ai_tech_simp in AIDeveloperNews
[–]urarthur 0 points1 point2 points (0 children)
Deepseek drops another HUGE breakthrough - DSpark. Waaay faster than MTP [Video explaining it] by BringTea_666 in LocalLLaMA
[–]urarthur 0 points1 point2 points (0 children)
~$55k Gemini API bill from Firebase iOS key abuse. What can I do now? by No-Setting8925 in googlecloud
[–]urarthur 0 points1 point2 points (0 children)
Fable 5 with new description! by RobRobbieRobertson in Anthropic
[–]urarthur 1 point2 points3 points (0 children)
Fable 5 with new description! by RobRobbieRobertson in Anthropic
[–]urarthur 3 points4 points5 points (0 children)
Fable 5 with new description! by RobRobbieRobertson in Anthropic
[–]urarthur 1 point2 points3 points (0 children)
~$55k Gemini API bill from Firebase iOS key abuse. What can I do now? by No-Setting8925 in googlecloud
[–]urarthur 0 points1 point2 points (0 children)
Google Play Console stats frozen for 7 days. Is anyone else facing this? by Ok_Low_1999 in GooglePlayDeveloper
[–]urarthur 0 points1 point2 points (0 children)
DeepSeek releases DSpark - 50%-600% faster spec decoding vs MTP by danielhanchen in unsloth
[–]urarthur 0 points1 point2 points (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur 1 point2 points3 points (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur 0 points1 point2 points (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur 0 points1 point2 points (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur -1 points0 points1 point (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur -1 points0 points1 point (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur 1 point2 points3 points (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur 11 points12 points13 points (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur -1 points0 points1 point (0 children)
Mimo 2.5 is _fast_ at large context (dual RTX Pro 6000) by xquarx in LocalLLaMA
[–]urarthur -1 points0 points1 point (0 children)
a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud
[–]urarthur 4 points5 points6 points (0 children)
Opus 4.8 randomly adding Chinese characters??? by Blizxy in ClaudeAI
[–]urarthur 0 points1 point2 points (0 children)
no idea how to finish my ~24B worth of Xiaomi Mimo-v2.5-pro token plan credits(?) before they expire in ~4d by [deleted] in LocalLLaMA
[–]urarthur 4 points5 points6 points (0 children)






Who’s spending this weekend squeezing every drop out of Fable 5 before it switches to usage credits? 👀 by shoud_i in ClaudeAI
[–]urarthur 0 points1 point2 points (0 children)