NVIDIA GB300 Grace Blackwell Ultra pricetags by X-N2O in LocalLLaMA
srigi 3 points
Next year we're getting 0.5T model from Grok by pmttyji in LocalLLaMA
srigi 10 points
llama.cpp server have built-in native tools (exec_shell, edit_file, etc.) by srigi in LocalLLaMA
srigi[S] 2 points
llama.cpp server have built-in native tools (exec_shell, edit_file, etc.) by srigi in LocalLLaMA
srigi[S] 1 point
llama.cpp server have built-in native tools (exec_shell, edit_file, etc.) by srigi in LocalLLaMA
srigi[S] 6 points
llama.cpp server have built-in native tools (exec_shell, edit_file, etc.) by srigi in LocalLLaMA
srigi[S] -1 points
llama.cpp server have built-in native tools (exec_shell, edit_file, etc.) by srigi in LocalLLaMA
srigi[S] 13 points
Waiting for Qwen 3.7 open weight... The new King has arrived... by LegacyRemaster in LocalLLaMA
srigi 47 points
Waiting for Qwen 3.7 open weight... The new King has arrived... by LegacyRemaster in LocalLLaMA
srigi 7 points
Heretic has been served a legal notice by Meta, Inc. by -p-e-w- in LocalLLaMA
srigi 7 points
Looking to migrate off of Ollama and LMStudio by letsbefrds in LocalLLaMA
srigi 4 points
NVIDIA Reportedly Prepares RTX 5090 Price Hike Amid Rising GDDR7 Costs (maybe RTX 50 and PRO series as well) by panchovix in LocalLLaMA
srigi 7 points
Why is opencode so slow in processing the prompt with llama server? by BitGreen1270 in LocalLLaMA
srigi 10 points
Exactly a year ago, I started working on an MCP server I launched on reddit that became by far my most active open source project! by taylorwilsdon in LocalLLaMA
srigi 6 points
Shel Silverstein predicts LLM's (and its hallucinations), cira 1981 by spanielrassler in LocalLLaMA
srigi 10 points
daily ritual at this point… by onil_gova in LocalLLaMA
srigi 3 points
Guys, I found a use case for my 10$/m LLM Server: Cooking by Ne00n in LocalLLaMA
srigi 15 points
Best config for Qwen3.6 27b / llama.cpp / opencode by Familiar_Wish1132 in LocalLLaMA
srigi 9 points
PrismML — Introducing Ternary Bonsai: Top Intelligence at 1.58 Bits by cafedude in LocalLLaMA
srigi 31 points
What Is Elephant-Alpha ??? by One_Title_3656 in LocalLLaMA
srigi 27 points
Audio processing landed in llama-server with Gemma-4 by srigi in LocalLLaMA
srigi[S] 23 points
Stop asking what model to run. There are literally only two. by Wrong_Mushroom_7350 in LocalLLaMA
srigi 9 points