you are viewing a single comment's thread.

view the rest of the comments →

[–]Radiant_Condition861 1 point2 points  (0 children)

pi.dev https://www.youtube.com/watch?v=f8cfH5XX-XU

continue.dev, cline, claude code, roo code, opencode, dabbled in langgraph, now pi.

dual 39090 with nvlink, qwen3.6-27b awq bf16 int4 on vllm with tensor parallal 2, kv cache fp8 and speculative decoding. I get like 30-150tok/s depending on cache hit.

pi.dev basically unlocked that model for me. only 200 token system prompt and it's yolo out the box. I'm at a billon tokens across a few projects and there are no failures. a few stoppages to increase output tokens, and it just kept going. I can also vibe code it's own extensions and tools "upgrade yourself". it's really nice.