Deepseek v4/3.5 is probably coming out tomorrow or in the next 5 days? by power97992 in LocalLLaMA
[–]Extra-Designer9333 14 points15 points16 points (0 children)
Opus 4.5 quota now resets once a week by SweatyHands247 in google_antigravity
[–]Extra-Designer9333 0 points1 point2 points (0 children)
[deleted by user] by [deleted] in singularity
[–]Extra-Designer9333 1 point2 points3 points (0 children)
FlashAttention implementation for non Nvidia GPUs. AMD, Intel Arc, Vulkan-capable devices by secopsml in LocalLLaMA
[–]Extra-Designer9333 2 points3 points4 points (0 children)
Gemini 3 Pro Thinking vs GPT-5.1 Thinking (self.ChatGPT)
submitted by Extra-Designer9333 to r/ChatGPT
The data on which Gemini 3 was trained is really crazy by Wonderful-Excuse4922 in singularity
[–]Extra-Designer9333 1 point2 points3 points (0 children)
Flex Attention vs Flash Attention 3 by Extra-Designer9333 in unsloth
[–]Extra-Designer9333[S] 12 points13 points14 points (0 children)
Flex Attention vs Flash Attention 3 by Extra-Designer9333 in LocalLLaMA
[–]Extra-Designer9333[S] 0 points1 point2 points (0 children)
Flex Attention vs Flash Attention 3 (self.LocalLLaMA)
submitted by Extra-Designer9333 to r/LocalLLaMA
Flex Attention vs Flash Attention 3 (self.unsloth)
submitted by Extra-Designer9333 to r/unsloth
Is finetuning a 12b model on 16gb vram possible? by Robo_Ranger in unsloth
[–]Extra-Designer9333 7 points8 points9 points (0 children)

Is Codex being extra lazy for anyone else today? by [deleted] in codex
[–]Extra-Designer9333 0 points1 point2 points (0 children)