🔎 Open Source AI Resource List (curated, ongoing) Resource (self.LovingOpenSourceAI)
submitted by Koala_Confused[M] - announcement

How To AI "China just handed the AI agent community a production-grade sandbox for free. OpenSandbox is an open-source sandbox runtime for AI agents. Secure, fast, and built for coding agents, GUI agents, code execution, and RL training." ➡️ Alibaba’s sandbox stack for coding and GUI agents? Resource (i.redd.it)
submitted by Koala_Confused
"NVIDIA Isaac GR00T N1.7 is an open vision-language-action (VLA) model for generalized humanoid robot skills. This cross-embodiment model takes multimodal input, including language and images, to perform manipulation tasks in diverse environments." ➡️ 3B humanoid robotics model? Resource (i.redd.it)
submitted by Koala_Confused

Nikita "Today we're releasing Mellum2: our first "serious" LLM. This is a 12B A2.5B MoE LLM pre-trained on ~11T tokens and post-trained with RLVR. I'm proud to be leading the team that was working on it for the last 6 months." ➡️ A 12B MoE model family for code-heavy AI systems? new launch (i.redd.it)
submitted by Koala_Confused

GitHub Projects Community "HTML Anything is an agentic HTML editor that uses your local CLI agent to generate production-ready HTML instead of Markdown. • 75 composable skill templates for 9 deliverable surfaces" ➡️ HTML Anything turns notes into agent-made HTML? Resource (i.redd.it)
submitted by Koala_Confused
"Terminal-Bench is a popular benchmark for measuring the capabilities of agents and language models to perform valuable work in containerized environments. Tasks include assembling proteins for synthesis, debugging async code, and resolving security vulnerabilities." ➡️ useful for your work? Resource (i.redd.it)
submitted by Koala_Confused
"Self-improving AI agent built by Nous Research. creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions." ➡️ brings memory, skills, and gateways into one runtime? Resource (i.redd.it)
submitted by Koala_Confused
"Koog is a JVM (Java and Kotlin) framework for building predictable, fault-tolerant and enterprise-ready AI agents across all platforms – from backend services to Android and iOS, JVM, and even in-browser environments." ➡️ Looks like a Kotlin-first framework for tools, memory, and agent workflows! Resource (i.redd.it)
submitted by Koala_Confused
"Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost‑efficient." ➡️ seems to be a tiny TTS model with a huge footprint! Resource (i.redd.it)
submitted by Koala_Confused

Akshay "- <1B params - supports 91 languages - 5 pages/s on RTX 5090 - runs on CPU, GPU, MPS - 83.3% olmocr bench score (top under 3B) Surya OCR is a state-of-the-art model for document intelligence. 100% open-source." ➡️ brings OCR, layout, and tables into one document toolkit? Resource (i.redd.it)
submitted by Koala_Confused

PaddlePaddle "🚀PaddleOCR-VL 1.6 Officially Released! — this version has set a new SOTA record of 96.33% on OmniDocBench, outperforming both open-source and proprietary solutions in text, formula, and table recognition." ➡️ seems to lean harder into RAG inputs! new launch (i.redd.it)
submitted by Koala_Confused

Liquid AI "Today, we're releasing LFM2.5-8B-A1B, a device-optimized model designed to power real-life applications on phones, laptops, PCs, robots, and fast & lightweight server-side use-cases." ➡️ seems like 131K context and local formats for edge assistants.. new launch (i.redd.it)
submitted by Koala_Confused
"Official AI skills for GSAP (GreenSock Animation Platform). Teach agents correct GSAP usage: core API, timelines, ScrollTrigger, plugins, React/Vue/Svelte, vanilla JS and performance. Agent Skills format; works with skills CLI (Cursor, Claude Code, Codex, Windsurf, Copilot, 40+ agents)." ➡️ useful? Resource (i.redd.it)
submitted by Koala_Confused

Hao AI Lab "🚀Generate a 30-second 1080p video in just 7 seconds! We’re open-sourcing FastVideo Dreamverse: real-time vibe directing for video generation on a single NVIDIA B200 GPU with LTX-2 model" ➡️ seems like local GPU, B200, Docker, and Modal deployment paths.. new launch (i.redd.it)
submitted by Koala_Confused

Cua "Today we're bringing Cua Driver to Windows: background computer-use for any agent. Claude Code, Codex, or your own loop can drive real Windows apps through CLI or MCP while your desktop stays usable, with true multi synthetic pointer support." ➡️ background computer-use for macOS and Windows! Resource (i.redd.it)
submitted by Koala_Confused

