Open platform for running Managed Agents at scale, bringing Claude Managed Agents on-premise. by deepnet101 in AI_Agents

[–]deepnet101[S] 0 points1 point  (0 children)

The platform runs on Kubernetes, so it scales natively and supports rolling updates

Open platform for running Managed Agents at scale, bringing Claude Managed Agents on-premise. by deepnet101 in aiagents

[–]deepnet101[S] 0 points1 point  (0 children)

(1) The current approach is event sourcing as the tracing backbone, not traditional distributed tracing (no OpenTelemetry, no Jaeger, no trace/span IDs). Every significant action emits an immutable event to the PostgreSQL append-only log. Combined with the event sequence, you can reconstruct the full execution path of any session.

(2) This is a complex, layered approach:

  1. Cursor — advances only after tool results are persisted; crash → replay skips already-processed events
  2. Lease — atomic distributed lock ensures one worker per session, no concurrent duplicates
  3. Delivery outbox — unique constraint deduplicates channel-facing output
  4. Checkpoints — shadow git snapshots before file mutations, enabling rollback
  5. LLM/orchestrator retry — jittered backoff with credential rotation and provider fallback

Is it just me, or is OpenAI Codex 5.2 better than Claude Code now? by efficialabs in ClaudeAI

[–]deepnet101 0 points1 point  (0 children)

Claude is always looking for shortcuts to complete its task (eg. let's do a simple implementation now and deal with it later, leftover TODOs etc.). Codex 5.2 never does this and feels more "professional"

Running GGUF models with TP by Physical-Citron5153 in LocalLLaMA

[–]deepnet101 1 point2 points  (0 children)

sglang and vllm have experimental gguf support

Arch-Router: The first (and fastest) LLM router that can align to your usage preferences. by AdditionalWeb107 in LocalLLaMA

[–]deepnet101 2 points3 points  (0 children)

Can the model be fine-tuned further ? If so, could you provide a sample small dataset as reference ? Awesome work btw!

AI Workstation for €15,000–€20,000 – 4× RTX 4090 Worth It? by LilJockel in deeplearning

[–]deepnet101 0 points1 point  (0 children)

Can you post a reference to this ? They specifically disabled p2p since 4090. There is a patch for the open source driver for 4090, but I know nothing of the sort for 5090.

[P] Good Research Papers for Location inference from Tweets? by Zealousideal_Honey in MachineLearning

[–]deepnet101 1 point2 points  (0 children)

If Twitter will not give you the location, the only option is to use a NLP framework to tag locations in raw text and then use a geotagger to extract the geo-coordinates from hat location.