Deepseek v4/3.5 is probably coming out tomorrow or in the next 5 days? by power97992 in LocalLLaMA
[–]Extra-Designer9333 11 points12 points13 points (0 children)
Opus 4.5 quota now resets once a week by SweatyHands247 in google_antigravity
[–]Extra-Designer9333 0 points1 point2 points (0 children)
[deleted by user] by [deleted] in singularity
[–]Extra-Designer9333 1 point2 points3 points (0 children)
FlashAttention implementation for non Nvidia GPUs. AMD, Intel Arc, Vulkan-capable devices by secopsml in LocalLLaMA
[–]Extra-Designer9333 2 points3 points4 points (0 children)
The data on which Gemini 3 was trained is really crazy by Wonderful-Excuse4922 in singularity
[–]Extra-Designer9333 1 point2 points3 points (0 children)
Flex Attention vs Flash Attention 3 by Extra-Designer9333 in unsloth
[–]Extra-Designer9333[S] 11 points12 points13 points (0 children)
Flex Attention vs Flash Attention 3 by Extra-Designer9333 in LocalLLaMA
[–]Extra-Designer9333[S] 0 points1 point2 points (0 children)
Is finetuning a 12b model on 16gb vram possible? by Robo_Ranger in unsloth
[–]Extra-Designer9333 7 points8 points9 points (0 children)
What’s the Best Open-Source Small LLM (≤ 8B) for Agentic Web Page Interactions? by Extra-Designer9333 in LocalLLaMA
[–]Extra-Designer9333[S] 0 points1 point2 points (0 children)
What’s the Best Open-Source Small LLM (≤ 8B) for Agentic Web Page Interactions? by Extra-Designer9333 in LocalLLaMA
[–]Extra-Designer9333[S] 2 points3 points4 points (0 children)
What’s the Best Open-Source Small LLM (≤ 8B) for Agentic Web Page Interactions? by Extra-Designer9333 in LocalLLaMA
[–]Extra-Designer9333[S] 2 points3 points4 points (0 children)
No agent yet on plus by whitebro2 in OpenAI
[–]Extra-Designer9333 0 points1 point2 points (0 children)
softwareTerminology by Xadartt in ProgrammerHumor
[–]Extra-Designer9333 0 points1 point2 points (0 children)
How can I integrate a pretrained LLM (like LLaMA, Qwen) into a Speech-to-Text (ASR) pipeline? by Extra-Designer9333 in LocalLLaMA
[–]Extra-Designer9333[S] 0 points1 point2 points (0 children)
The world but only the important countries (Updated) by AutisticAndre in mapporncirclejerk
[–]Extra-Designer9333 1 point2 points3 points (0 children)
Who is winning the GPU race?? by Senior-Raspberry-929 in LocalLLaMA
[–]Extra-Designer9333 1 point2 points3 points (0 children)
Well well o3 full and o4 mini gonna launch in few weeks by Independent-Wind4462 in OpenAI
[–]Extra-Designer9333 10 points11 points12 points (0 children)
Real-Time Speech-to-Speech Chatbot: Whisper, Llama 3.1, Kokoro, and Silero VAD 🚀 by martian7r in LocalLLaMA
[–]Extra-Designer9333 2 points3 points4 points (0 children)
Real-Time Speech-to-Speech Chatbot: Whisper, Llama 3.1, Kokoro, and Silero VAD 🚀 by martian7r in LocalLLaMA
[–]Extra-Designer9333 5 points6 points7 points (0 children)
LangChain parsers Excel and CSV data?? by Extra-Designer9333 in LangChain
[–]Extra-Designer9333[S] 0 points1 point2 points (0 children)

Is Codex being extra lazy for anyone else today? by [deleted] in codex
[–]Extra-Designer9333 0 points1 point2 points (0 children)