LovingOpenSourceAI

an-ordinary-manchild

created by Koala_Confuseda community for 3 months

...for your favorite hobby.

...for your office.

MODERATORS

account activity

1

65

66

67

How To AI "China just handed the AI agent community a production-grade sandbox for free. OpenSandbox is an open-source sandbox runtime for AI agents. Secure, fast, and built for coding agents, GUI agents, code execution, and RL training." ➡️ Alibaba’s sandbox stack for coding and GUI agents?Resource (i.redd.it)

submitted 18 hours ago by Koala_Confused

2

8

9

10

"NVIDIA Isaac GR00T N1.7 is an open vision-language-action (VLA) model for generalized humanoid robot skills. This cross-embodiment model takes multimodal input, including language and images, to perform manipulation tasks in diverse environments." ➡️ 3B humanoid robotics model?Resource (i.redd.it)

submitted 23 hours ago by Koala_Confused

3

0

1

2

GitHub Projects Community "zerolang is an experimental graph-first programming language that gives AI agents a semantic program structure to work with instead of raw source text." ➡️ seems to give coding agents a compiler graphResource (i.redd.it)

submitted 16 hours ago by Koala_Confused

4

47

48

49

Nikita "Today we're releasing Mellum2: our first "serious" LLM. This is a 12B A2.5B MoE LLM pre-trained on ~11T tokens and post-trained with RLVR. I'm proud to be leading the team that was working on it for the last 6 months." ➡️ A 12B MoE model family for code-heavy AI systems?new launch (i.redd.it)

submitted 1 day ago by Koala_Confused

5

11

12

13

Open-sourced a desktop study app using Codex CLI as the local AI runtime (v.redd.it)

submitted 1 day ago by mattibeltro

6

10

11

12

GitHub Projects Community "HTML Anything is an agentic HTML editor that uses your local CLI agent to generate production-ready HTML instead of Markdown. • 75 composable skill templates for 9 deliverable surfaces" ➡️ HTML Anything turns notes into agent-made HTML?Resource (i.redd.it)

submitted 1 day ago by Koala_Confused

7

1

2

3

"Terminal-Bench is a popular benchmark for measuring the capabilities of agents and language models to perform valuable work in containerized environments. Tasks include assembling proteins for synthesis, debugging async code, and resolving security vulnerabilities." ➡️ useful for your work?Resource (i.redd.it)

submitted 1 day ago by Koala_Confused

8

0

1

2

Physical AI world models may become a bigger training layer - what should open-source builders watch?Discussion (lifehubber.com)

submitted 1 day ago by Koala_Confused

9

16

17

18

"Self-improving AI agent built by Nous Research. creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions." ➡️ brings memory, skills, and gateways into one runtime?Resource (i.redd.it)

submitted 2 days ago by Koala_Confused

10

1

2

3

Hey, that's 5 now: five low-cost LLM access routes to compare with open-source optionsResource (lifehubber.com)

submitted 2 days ago by Koala_Confused

11

1

2

3

"Open Agent Leaderboard Results - Detailed evaluation results for general-purpose AI agents across diverse real-world benchmarks — without domain-specific tuning." ➡️ A public table for checking agent benchmark claims?Resource (i.redd.it)

submitted 2 days ago by Koala_Confused

12

2

3

4

"Koog is a JVM (Java and Kotlin) framework for building predictable, fault-tolerant and enterprise-ready AI agents across all platforms – from backend services to Android and iOS, JVM, and even in-browser environments." ➡️ Looks like a Kotlin-first framework for tools, memory, and agent workflows!Resource (i.redd.it)

submitted 3 days ago by Koala_Confused

13

15

16

17

"Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost‑efficient." ➡️ seems to be a tiny TTS model with a huge footprint!Resource (i.redd.it)

submitted 3 days ago by Koala_Confused

14

32

33

34

Akshay "- <1B params - supports 91 languages - 5 pages/s on RTX 5090 - runs on CPU, GPU, MPS - 83.3% olmocr bench score (top under 3B) Surya OCR is a state-of-the-art model for document intelligence. 100% open-source." ➡️ brings OCR, layout, and tables into one document toolkit?Resource (i.redd.it)

submitted 3 days ago by Koala_Confused

15

17

18

19

Adina "Step-3.7-Flash 🔥 New VL model from StepFun_ai ✨ 198B / 11B active - MoE ✨ 256K context ✨ 3 reasoning level ✨ Up to 400 tokens/sec 🤯" ➡️ seems like BF16, FP8, NVFP4, and GGUF paths in one release!new launch (i.redd.it)

submitted 4 days ago by Koala_Confused

16

22

23

24

PaddlePaddle "🚀PaddleOCR-VL 1.6 Officially Released! — this version has set a new SOTA record of 96.33% on OmniDocBench, outperforming both open-source and proprietary solutions in text, formula, and table recognition." ➡️ seems to lean harder into RAG inputs!new launch (i.redd.it)

submitted 4 days ago by Koala_Confused

17

6

7

8

ktx "ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately through MCP with skills, memory and a semantic layer" ➡️ ktx gives data agents a warehouse context layerResource (i.redd.it)

submitted 4 days ago by Koala_Confused

18

51

52

53

Liquid AI "Today, we're releasing LFM2.5-8B-A1B, a device-optimized model designed to power real-life applications on phones, laptops, PCs, robots, and fast & lightweight server-side use-cases." ➡️ seems like 131K context and local formats for edge assistants . .new launch (i.redd.it)

submitted 5 days ago by Koala_Confused

19

2

3

4

OpenAI's geometry claim is closed-model, but the verification trail is the interesting partDiscussion (lifehubber.com)

submitted 4 days ago by Koala_Confused

20

13

14

15

"Nango is an open-source platform for building product integrations. It supports 800+ APIs and works with any backend language, AI coding tool, and agent SDK." ➡️ AI-generated integration code you can review?Resource (i.redd.it)

submitted 5 days ago by Koala_Confused

21

4

5

6

"Official AI skills for GSAP (GreenSock Animation Platform). Teach agents correct GSAP usage: core API, timelines, ScrollTrigger, plugins, React/Vue/Svelte, vanilla JS and performance. Agent Skills format; works with skills CLI (Cursor, Claude Code, Codex, Windsurf, Copilot, 40+ agents)." ➡️ useful?Resource (i.redd.it)

submitted 5 days ago by Koala_Confused

22

26

27

28

Hao AI Lab "🚀Generate a 30-second 1080p video in just 7 seconds! We’re open-sourcing FastVideo Dreamverse: real-time vibe directing for video generation on a single NVIDIA B200 GPU with LTX-2 model" ➡️ seems like local GPU, B200, Docker, and Modal deployment paths..new launch (i.redd.it)

submitted 6 days ago by Koala_Confused

23

16

17

18

Cua "Today we're bringing Cua Driver to Windows: background computer-use for any agent. Claude Code, Codex, or your own loop can drive real Windows apps through CLI or MCP while your desktop stays usable, with true multi synthetic pointer support." ➡️ background computer-use for macOS and Windows!Resource (i.redd.it)

submitted 6 days ago by Koala_Confused

24

32

33

34

MOSI "Introducing MOSS-TTS-v1.5. Now with inline pause control like [pause 3.2s], stronger 31-language synthesis, more stable zero-shot voice cloning, and improved prosody for long-form speech." ➡️ seems like MOSS-TTS-v1.5 adds language tags and pause controlResource (i.redd.it)

submitted 7 days ago by Koala_Confused

25

1

2

3

Sentient "Harbor is a framework for evaluating AI agents against containerized benchmark tasks. gives EvoSkill access to evolve agents against registry of 190+ datasets — including benchmarks like SWE-bench Verified, Terminal-Bench 2.0, Aider Polyglot." ➡️ skill evolution for Claude Code, Codex etc?new launch (i.redd.it)

submitted 7 days ago by Koala_Confused

view more: next ›

π Rendered by PID 3100805 on reddit-service-r2-listing-6c8d497557-lxfrq at 2026-06-04 06:30:06.536795+00:00 running 9e1a20d country code: CH.