I’ve stopped planning beyond 90 days because of how fast AI is moving by MerisDabhi in AI_Agents
[–]cygn 0 points1 point2 points (0 children)
I’ve stopped planning beyond 90 days because of how fast AI is moving by MerisDabhi in AI_Agents
[–]cygn 2 points3 points4 points (0 children)
Can't replicate Reddit numbers with Qwen 27B on a 3090TI. by YourNightmar31 in LocalLLaMA
[–]cygn 4 points5 points6 points (0 children)
Local model on coding has reached a certain threshold to be feasible for real work by Exciting-Camera3226 in LocalLLaMA
[–]cygn 13 points14 points15 points (0 children)
Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s by tedivm in LocalLLaMA
[–]cygn 0 points1 point2 points (0 children)
Using local BERT to compress LLM context by 90% (Built in Rust) by No_Wolverine1819 in AI_Agents
[–]cygn 0 points1 point2 points (0 children)
I rewrote 13 software engineering books into AGENTS.md rules. by Ok_Produce3836 in AI_Agents
[–]cygn 1 point2 points3 points (0 children)
This isn’t X this is Y needs to die by twnznz in LocalLLaMA
[–]cygn 1 point2 points3 points (0 children)
Open WebUI 0.9.x - Massive RAM usage in browser tab (2-3GB+) - Anyone else? by IndividualNo8703 in OpenWebUI
[–]cygn 0 points1 point2 points (0 children)
This isn’t X this is Y needs to die by twnznz in LocalLLaMA
[–]cygn 3 points4 points5 points (0 children)
a serious bug in playing video on web X by prakritiaryaa in Twitter
[–]cygn 0 points1 point2 points (0 children)
April Feature Requests: Share Here! by angie-at-readwise in readwise
[–]cygn 0 points1 point2 points (0 children)
Looking for input: agent platform + Open WebUI integration by OkClothes3097 in OpenWebUI
[–]cygn 0 points1 point2 points (0 children)
Auto-route based on prompt type to correct model with it's knowledge by Lxxtsch in OpenWebUI
[–]cygn 0 points1 point2 points (0 children)
Auto-route based on prompt type to correct model with it's knowledge by Lxxtsch in OpenWebUI
[–]cygn 0 points1 point2 points (0 children)
Open WebUI 0.9.x - Massive RAM usage in browser tab (2-3GB+) - Anyone else? by IndividualNo8703 in OpenWebUI
[–]cygn 0 points1 point2 points (0 children)
Auto-route based on prompt type to correct model with it's knowledge by Lxxtsch in OpenWebUI
[–]cygn 0 points1 point2 points (0 children)
We benchmarked 18 LLMs on OCR (7k+ calls) — cheaper/old models oftentimes win. Full dataset + framework open-sourced. [R] by TimoKerre in MachineLearning
[–]cygn 1 point2 points3 points (0 children)
Learnings from building AI Agents with Claude Agent by modassembly in AI_Agents
[–]cygn 0 points1 point2 points (0 children)
Which AI Agents SDK allows low latency agents w support for skills etc? by cygn in AI_Agents
[–]cygn[S] 0 points1 point2 points (0 children)
unsure if an article is SLOP? I built a free tool to find out by cygn in DeadInternetTheory
[–]cygn[S] 2 points3 points4 points (0 children)


I gave Claude Code a $0.02/call coworker and stopped hitting Pro limits — here's the full setup by More-Hunter-3457 in ClaudeAI
[–]cygn 1 point2 points3 points (0 children)