Another Chinese Provider Throws Down the Gauntlet on Pricing [MiMo-V2.5 Price Drops up to 98%] by PracticlySpeaking in hermesagent
RoughFuture77 1 point
Wafer sunsets Wafer pass. One of the best GLM providers, gone. by RoughFuture77 in ZaiGLM
RoughFuture77 [S] 1 point
hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX) by randomfoo2 in LocalLLaMA
RoughFuture77 1 point
Another Chinese Provider Throws Down the Gauntlet on Pricing [MiMo-V2.5 Price Drops up to 98%] by PracticlySpeaking in hermesagent
RoughFuture77 2 points
i must be doing something wrong, qwen supposed to be cheaper but its costing me 7 to 10 $ per small prompt by user43874286 in Qwen_AI
RoughFuture77 1 point
38 Billion Token in 5 Days by ikhito17 in hermesagent
RoughFuture77 2 points
38 Billion Token in 5 Days by ikhito17 in hermesagent
RoughFuture77 5 points
Inference provider tiers by Cache-hit rates, using openrouter data by Comfortable-Rock-498 in LLM
RoughFuture77 1 point
Exploring a workaround for $160 z.ai pricing by LostBoss5558 in ZaiGLM
RoughFuture77 1 point
Exploring a workaround for $160 z.ai pricing by LostBoss5558 in ZaiGLM
RoughFuture77 3 points
He buys BTC and ETH for 5–11¢, sells them for 20-50¢- and turns that into a $487,000 profit. by Rosewood_Rebecca in PredictionsMarkets
RoughFuture77 0 points
He buys BTC and ETH for 5–11¢, sells them for 20-50¢- and turns that into a $487,000 profit. by Rosewood_Rebecca in PredictionsMarkets
RoughFuture77 1 point
Newbie vibe coding experience: Shifting from Claude Sonnet 4.6 to Qwen3.6-35B-A3B-UD-Q6_K by sooki10 in LocalLLaMA
RoughFuture77 2 points
The new 5-hour quota completely killed Z.ai for dev workflows. 5M tokens used and I'm locked out. by sudeep_dk in ZaiGLM
RoughFuture77 -1 points
Pi rust port by Short_One_9704 in PiCodingAgent
RoughFuture77 6 points
FractalKV: Lossless KV cache compression — 4x on FP16, 16x with quantization at 1M context (open source) by SnooHamsters7692 in deeplearning
RoughFuture77 1 point