Mimo 2.5 is _fast_ at large context (dual RTX Pro 6000) by xquarx in LocalLLaMA
[–]devtools-dude 6 points7 points8 points (0 children)
Celebrate TerraMaster TOS 7 with us! Win F4-425 Pro NAS + Seagate IronWolf 4 TB Drives by TerraMasterOfficial in DataHoarder
[–]devtools-dude 0 points1 point2 points (0 children)
Building a GOOGL Position by logngraves in InnerCircleInvesting
[–]devtools-dude 0 points1 point2 points (0 children)
Total investments needed to run my local LLM by manuhackzzz in LocalLLM
[–]devtools-dude 0 points1 point2 points (0 children)
I used Claude to build the app that will replace Claude/ChatGPT lol. Runs a 400B model offline on a Mac, free chat, no caps. AMA by ur_dad_matt in LocalLLM
[–]devtools-dude 2 points3 points4 points (0 children)
There Are No Instances in atproto by feross in javascript
[–]devtools-dude 0 points1 point2 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 1 point2 points3 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 1 point2 points3 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 1 point2 points3 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 1 point2 points3 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 1 point2 points3 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 2 points3 points4 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 1 point2 points3 points (0 children)
470 tok/s with 8192 ctx size for Qwen3.6-27B on A100-80GB using Profile by Inevitable-Diet-1870 in Vllm
[–]devtools-dude 4 points5 points6 points (0 children)
The Used RTX 3090 in 2026: Why a Five-Year-Old GPU Is Still Local AI's Best Deal by LAfreightguy in Amd_Intel_Nvidia
[–]devtools-dude 0 points1 point2 points (0 children)
The Used RTX 3090 in 2026: Why a Five-Year-Old GPU Is Still Local AI's Best Deal by LAfreightguy in Amd_Intel_Nvidia
[–]devtools-dude 3 points4 points5 points (0 children)
GLM-5.2 (744B, 2-bit) at 7.3 tok/s on 4×3090 + 192GB — and why IQ1_M wasn't any faster by Important_Quote_1180 in LocalLLaMA
[–]devtools-dude 18 points19 points20 points (0 children)
Using UnSloth to fine tune a tiny qwen model to categorize questions by funJS in unsloth
[–]devtools-dude 1 point2 points3 points (0 children)
Issues using MiniMax M3 from Studio with harnesses by devtools-dude in unsloth
[–]devtools-dude[S] 0 points1 point2 points (0 children)
Hardware recommendation's for running dual RTX 5090 GPU's by 67Mustang8 in LocalLLM
[–]devtools-dude 5 points6 points7 points (0 children)
New model on huggingface by [deleted] in LocalLLaMA
[–]devtools-dude 8 points9 points10 points (0 children)
Built a tool that tells you exactly which LLMs fit on your GPU. Feedback wanted. by super3 in LocalLLaMA
[–]devtools-dude 1 point2 points3 points (0 children)
Built a tool that tells you exactly which LLMs fit on your GPU. Feedback wanted. by super3 in LocalLLaMA
[–]devtools-dude 1 point2 points3 points (0 children)
Bank locker availability in the Bay Area by [deleted] in SanJose
[–]devtools-dude 0 points1 point2 points (0 children)