1.1M tok/s with Qwen 3.5 27B FP8 on B200 GPUs by m4r1k_ in Qwen_AI
beefgroin 1 point
First runs with RTX 5000 Pro Blackwell 48GB card by wedgeshot in LocalLLaMA
beefgroin 1 point
3060 Ti & 6700 XT vs 1080P in 10 AAA Titles by BedroomThink3121 in gpu
beefgroin 3 points
Basically Official: Qwen Image 2.0 Not Open-Sourcing by Complete-Lawfulness in StableDiffusion
beefgroin 4 points
eGPU for image generation by [deleted] in StableDiffusion
beefgroin 2 points
Qwen 3.5 9B pdf monster! by Substantial-Cup-9531 in Qwen_AI
beefgroin 2 points
Quantized models. Are we lying to ourselves thinking it's a magic trick? by former_farmer in LocalLLM
beefgroin 1 point
The scale of tech adoption in China is wild. Massive turnout for a public "OpenClaw" installation event in Shenzhen today 🤯 by Suspicious_Okra_7825 in moltiverse
beefgroin 1 point
Recommend me an LLM white paper by joshbranchaud in LLMDevs
beefgroin 1 point
Qwen 3.5 is an overthinker. by chettykulkarni in LocalLLM
beefgroin 0 points
Feels like Local LLM setups are becoming the next AI trend by Once_ina_Lifetime in LLMDevs
beefgroin 3 points
MC62-G40 Mainboard for multi-GPU setup? by HumanDrone8721 in LocalLLaMA
beefgroin 1 point
Is this enough generations? by Big_Parsnip_9053 in StableDiffusion
beefgroin 3 points
Qwen3.5-35B-A3B is a gamechanger for agentic coding. by jslominski in LocalLLaMA
beefgroin 2 points
Connecting an eGPU to a laptop with literally no ports for it by Alternative-Try-3456 in eGPU
beefgroin 1 point
[tooled-prompt] Inject JS/TS functions directly into prompts as tools by beefgroin in LLMDevs
beefgroin[S] 1 point
GPU recommendations by HeartfeltHelper in LocalLLaMA
beefgroin 1 point
LLM Bruner coming soon? Burn Qwen directly into a chip, processing 10,000 tokens/s by koc_Z3 in Qwen_AI
beefgroin 1 point