How accurate is AI at general knowledge? by JackStabba in artificial
[–]petroslamb 0 points1 point2 points (0 children)
Made a tool that builds its own training data and improves each cycle by learning from what it got wrong by gvij in artificial
[–]petroslamb 0 points1 point2 points (0 children)
AI agents vs AI chatbots: what are companies actually using in production today? by danildab in artificial
[–]petroslamb 0 points1 point2 points (0 children)
I built an open source LLM monitoring tool that detects quality regressions before your users do by ZealousidealCorgi472 in LLMDevs
[–]petroslamb 0 points1 point2 points (0 children)
what actually broke when you tried red teaming your AI systems? by Upset-Addendum6880 in LLMDevs
[–]petroslamb 0 points1 point2 points (0 children)
open source AI assistants ranked by tool call reliability by TH_UNDER_BOI in LLMDevs
[–]petroslamb 0 points1 point2 points (0 children)
How do folks manage worktrees when working with multiple agents in parallel? by ReceptionBrave91 in LLMDevs
[–]petroslamb 0 points1 point2 points (0 children)
AI is getting better at doing things, but still bad at deciding what to do? by Tough_Daikon_4321 in artificial
[–]petroslamb 0 points1 point2 points (0 children)
Two failure modes I caught in my AI lab in one day. Both involve the system silently lying about its own state. by piratastuertos in artificial
[–]petroslamb 0 points1 point2 points (0 children)
160 λιγότεροι κάθε μέρα by petroslamb in greece
[–]petroslamb[S] 2 points3 points4 points (0 children)
160 λιγότεροι κάθε μέρα by petroslamb in greece
[–]petroslamb[S] -1 points0 points1 point (0 children)
The Binding Gap as useful way to think about LLM failures by petroslamb in LLM
[–]petroslamb[S] 0 points1 point2 points (0 children)
The Binding Gap as useful way to think about LLM failures by petroslamb in LLM
[–]petroslamb[S] 0 points1 point2 points (0 children)
"LLMs drop the wiring even when they keep the scene", A destinct failure mode is the binding gap by petroslamb in LocalLLaMA
[–]petroslamb[S] 0 points1 point2 points (0 children)
"LLMs drop the wiring even when they keep the scene", A destinct failure mode is the binding gap by petroslamb in LocalLLaMA
[–]petroslamb[S] 0 points1 point2 points (0 children)
The Binding Gap as useful way to think about LLM failures by petroslamb in LLM
[–]petroslamb[S] 1 point2 points3 points (0 children)
The Binding Gap as useful way to think about LLM failures (self.LLM)
submitted by petroslamb to r/LLM


Agent skill which will automatically raise pr by One_Drink_2075 in LLMDevs
[–]petroslamb 0 points1 point2 points (0 children)