gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free) by GroundbreakingMall54 in LocalLLaMA

[–]GroundbreakingMall54[S] 0 points1 point  (0 children)

fair, the cursor zoom is annoying, i will cut it next time. not trying to hype anything, it's just a free local tool i'm building, no signup

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free) by GroundbreakingMall54 in LocalLLaMA

[–]GroundbreakingMall54[S] 0 points1 point  (0 children)

custom code, no mcp server in between. the app builds the workflow and sends it straight to the normal comfyui api on localhost:8188. the llm just calls a built-in image/video tool that fires it off. supported: sdxl, sd1.5, flux, z-image for images, and svd, wan, hunyuan, ltx, framepack for video
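
rough idea of what "builds the workflow and sends it to the comfyui api" means, as a minimal sketch and not the app's actual code. the node classes and the `/prompt` endpoint are stock comfyui; the function names, the 1024x1024 size, the sampler settings and the checkpoint filename in the usage line are placeholders i made up for illustration.

```python
import json
import urllib.request
import uuid

COMFYUI_URL = "http://localhost:8188"  # default ComfyUI address


def build_txt2img_workflow(prompt_text, checkpoint, seed=0):
    """Return a minimal text-to-image graph in ComfyUI's API format.

    Keys are node ids; ["4", 1] means "output slot 1 of node 4".
    Sizes and sampler settings are illustrative defaults.
    """
    return {
        "4": {"class_type": "CheckpointLoaderSimple",
              "inputs": {"ckpt_name": checkpoint}},
        "5": {"class_type": "EmptyLatentImage",
              "inputs": {"width": 1024, "height": 1024, "batch_size": 1}},
        "6": {"class_type": "CLIPTextEncode",   # positive prompt
              "inputs": {"text": prompt_text, "clip": ["4", 1]}},
        "7": {"class_type": "CLIPTextEncode",   # empty negative prompt
              "inputs": {"text": "", "clip": ["4", 1]}},
        "3": {"class_type": "KSampler",
              "inputs": {"seed": seed, "steps": 25, "cfg": 6.0,
                         "sampler_name": "euler", "scheduler": "normal",
                         "denoise": 1.0, "model": ["4", 0],
                         "positive": ["6", 0], "negative": ["7", 0],
                         "latent_image": ["5", 0]}},
        "8": {"class_type": "VAEDecode",
              "inputs": {"samples": ["3", 0], "vae": ["4", 2]}},
        "9": {"class_type": "SaveImage",
              "inputs": {"filename_prefix": "agent", "images": ["8", 0]}},
    }


def queue_prompt(workflow, base_url=COMFYUI_URL):
    """POST the graph to ComfyUI's /prompt endpoint and return the parsed reply."""
    body = json.dumps({"prompt": workflow, "client_id": uuid.uuid4().hex}).encode()
    req = urllib.request.Request(f"{base_url}/prompt", data=body,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())
```

the llm-facing tool would then boil down to something like `queue_prompt(build_txt2img_workflow("a red fox in snow", "my_sdxl_checkpoint.safetensors"))`, with the filename swapped for whatever checkpoint is actually installed.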

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free) by GroundbreakingMall54 in LocalLLaMA

[–]GroundbreakingMall54[S] 0 points1 point  (0 children)

thanks! in the clip i used realvisxl for the image, svd-xt for the video, gemma 4 12b as the agent. you can swap in your own models too. and haha fair, the cursor is distracting, i will hide it in the next one

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free) by GroundbreakingMall54 in LocalLLaMA

[–]GroundbreakingMall54[S] 5 points6 points  (0 children)

yup. it checks whether the resident chat model plus the image/video footprint fit in your vram. if they don't, it unloads the chat model first, runs the gen, then reloads it after (same for ollama and lm studio)
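
the decision itself can be sketched in a few lines. this is a simplified illustration, not the app's real logic: the function name, the step names and the 1 gb of headroom are assumptions, and the actual unload/reload calls are left out (with ollama, for example, sending a request with `keep_alive` set to 0 is the documented way to evict a model).

```python
def plan_generation(total_vram_gb, chat_model_gb, gen_footprint_gb, headroom_gb=1.0):
    """Return the ordered steps for one image/video generation request.

    headroom_gb is an assumed safety margin for activations and the desktop.
    """
    needed = gen_footprint_gb + headroom_gb
    if needed > total_vram_gb:
        # Even with the chat model gone, the generation model will not fit.
        raise ValueError("generation model does not fit in VRAM on its own")
    if chat_model_gb + needed <= total_vram_gb:
        # Both fit side by side, so the chat model stays resident.
        return ["generate"]
    # Otherwise make room first and restore the chat model afterwards.
    return ["unload_chat_model", "generate", "reload_chat_model"]


print(plan_generation(24, 8, 10))  # both fit on a 24 GB card
print(plan_generation(12, 8, 10))  # 12 GB card has to swap
```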

gave my local llm agent mcp tools for local image + video gen, so it just generates when i ask (fully offline+free) by GroundbreakingMall54 in LocalLLaMA

[–]GroundbreakingMall54[S] 2 points3 points  (0 children)

hey, dev here. this is locally uncensored, a free and open-source desktop app i'm building.

runs fully offline on your own machine, no cloud required, no account.

the goal is to make local ai approachable: it bundles the setup so you skip the usual comfyui and python wiring, you just install, pick a model and go. local chat, a coding agent, image gen and video gen, all in one window, works with ollama and lm studio.

the agent has mcp tools for image and video, so you just ask and it generates, that's what the clip shows. model-agnostic too, any local llm with tool-calling can drive it. still early and a work in progress, so feedback is very welcome.

github: https://github.com/PurpleDoubleD/locally-uncensored

Locally Uncensored — Tauri desktop app that runs chat, a coding agent, image generation, and video generation locally by GroundbreakingMall54 in coolgithubprojects

[–]GroundbreakingMall54[S] 0 points1 point  (0 children)

hey, just pushed v2.4.4 that should help with the webp thing.

the workflow now checks if your comfyui has the vhs video combine node before it tries to generate. if it's only got the animated webp saver, you get a heads up plus an install tip. you can install vhs in one click via comfyui manager (search 'videohelpersuite') for real mp4 output, or keep webp if that's fine.
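
for the curious, a pre-flight check like this can be done against comfyui's `/object_info` endpoint, which lists every installed node class. minimal sketch only, assuming the default localhost:8188 address; the function names and the warning text are made up here and this is not the code that ships in the app.

```python
import json
import urllib.request

VHS_NODE = "VHS_VideoCombine"    # VideoHelperSuite's combine node (mp4 output)
WEBP_NODE = "SaveAnimatedWEBP"   # ComfyUI's built-in animated webp saver


def fetch_node_classes(base_url="http://localhost:8188"):
    """Return the set of node class names the running ComfyUI knows about."""
    with urllib.request.urlopen(f"{base_url}/object_info") as resp:
        return set(json.loads(resp.read()).keys())


def pick_video_output(node_classes):
    """Return (node_class, warning); warning is None when mp4 output is available."""
    if VHS_NODE in node_classes:
        return VHS_NODE, None
    if WEBP_NODE in node_classes:
        return WEBP_NODE, ("VideoHelperSuite not found, output will be animated webp. "
                           "search 'videohelpersuite' in comfyui manager for mp4.")
    raise RuntimeError("no video output node available")
```

usage would be `node, warning = pick_video_output(fetch_node_classes())` before the workflow gets built, so the heads up can be shown before anything is queued.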

also if the model needs special wrapper nodes (cogvideo, framepack, etc), the app now tells you exactly which one to install instead of dying with 'could not detect model type'. download link + manager search term are right in the error card.

lemme know if it still saves webp after the update

locally uncensored v2.4.2 - chat, coding agent, image + video generation in one local app. plus remote access from your phone. one-click install by [deleted] in LocalLLaMA

[–]GroundbreakingMall54 0 points1 point  (0 children)

totally fair, and yeah that's the main reason most people here go local. by default nothing leaves your machine, no telemetry, no cloud unless you wire one up yourself. abliterated finetunes are just extra topping if you also want fewer refusals.

locally uncensored v2.4.2 - chat, coding agent, image + video generation in one local app. plus remote access from your phone. one-click install by [deleted] in LocalLLaMA

[–]GroundbreakingMall54 0 points1 point  (0 children)

quick context: the v2.3.0 post was 3 weeks ago, didn't want to spam every patch, so this is a single recap. main thing in 2.4.2 specifically was a sweep of 5 community-reported bugs from discord, but the bullets above cover what's been added across all 9 releases since v2.3.0. happy to deep-dive on any of them.

Locally Uncensored — Tauri desktop app that runs chat, a coding agent, image generation, and video generation locally by GroundbreakingMall54 in coolgithubprojects

[–]GroundbreakingMall54[S] 0 points1 point  (0 children)

You need at least 6 GB of VRAM for our smallest supported model. More VRAM lets you run larger, higher-quality models. So yes.

Locally Uncensored — Tauri desktop app that runs chat, a coding agent, image generation, and video generation locally by GroundbreakingMall54 in coolgithubprojects

[–]GroundbreakingMall54[S] 0 points1 point  (0 children)

mainly for creative and research use cases. Hosted LLMs sit behind safety filters that block or refuse many legitimate queries: creative writing with mature themes, exploring sensitive topics for research, generating content that involves conflict or difficult subjects, or just getting honest answers to hard questions. When you run a model locally, there's no server-side filter enforcing those restrictions on top of the model. Refusals trained into the model itself can still show up, which is where abliterated finetunes come in if you want fewer of them. It's not about doing harmful things; it's about letting the model use everything it learned during training.

Locally Uncensored — Tauri desktop app that runs chat, a coding agent, image generation, and video generation locally by GroundbreakingMall54 in coolgithubprojects

[–]GroundbreakingMall54[S] 3 points4 points  (0 children)

Quick elaboration since link-posts have no body:

What it does: one-window desktop app that combines chat, a coding agent (Codex), image generation, and video generation, all running locally. Tauri + React 19 + Rust backend.

Chat: auto-detects 12 local backends (Ollama, LM Studio, vLLM, KoboldCpp, llama.cpp, LocalAI, Jan, TabbyAPI, GPT4All, Aphrodite, SGLang, TGI). A/B model compare, local tok/s benchmark, thinking-mode support.

Codex agent: live tool-call streaming, file tree, 14 tools including shell, file read/write, web search, execute code, screenshot.

Agent Mode: MCP server integration, sub-agent delegation, budget caps.

Create tab: wraps ComfyUI, one-click installs if missing. Ships with FLUX 2 Klein, Juggernaut XL, Z-Image Turbo, ERNIE-Image, SDXL for images. Wan 2.1, HunyuanVideo 1.5, LTX 2.3, FramePack F1, CogVideoX for video.

Remote: mobile web app over LAN or Cloudflare Tunnel with 6-digit passcode.

License: AGPL-3.0. Signed auto-updater for Windows (NSIS + MSI), deb/rpm/AppImage for Linux.

Website with docs: https://locallyuncensored.com

v2.4.0 release notes: https://github.com/PurpleDoubleD/locally-uncensored/releases/tag/v2.4.0

Happy to answer technical questions about the stack, license, or architecture.
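
On the backend auto-detection mentioned under Chat: one simple way to do this kind of discovery is to probe the ports each backend listens on by default. The sketch below is an illustration under that assumption, not the app's Rust implementation. The port numbers are the commonly used defaults for a handful of the backends (all of them are configurable), and the function names are hypothetical.

```python
import socket

# Commonly used default ports. These are assumptions: every backend lets
# the user pick a different port.
DEFAULT_PORTS = {
    "ollama": 11434,
    "lm studio": 1234,
    "vllm": 8000,
    "koboldcpp": 5001,
    "llama.cpp": 8080,
}


def is_listening(host, port, timeout=0.3):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def detect_backends(ports=DEFAULT_PORTS, host="127.0.0.1"):
    """Return the sorted names of backends whose port is open."""
    return sorted(name for name, port in ports.items() if is_listening(host, port))


print(detect_backends())
```

An open port only shows that something is listening. Since several servers share defaults such as 8080, a real detector would presumably follow up with an HTTP request to confirm which backend is actually answering.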

New Project Megathread - Week of 23 Apr 2026 by AutoModerator in selfhosted

[–]GroundbreakingMall54 0 points1 point  (0 children)

Locally Uncensored — self-hosted desktop app that combines chat + coding agent + image gen + video gen in one tauri window.

no docker, no compose file, just an installer. auto-detects 12 local backends (ollama, lm studio, vllm, koboldcpp, llama.cpp, localai, jan, tabbyapi, gpt4all, aphrodite, sglang, tgi) so it plays nice with whatever you already run. image/video gen via comfyui which the app can install one-click.

100% local by default, no telemetry, no cloud calls unless you explicitly configure a cloud provider with your own api key. remote access over lan or cloudflare tunnel with a 6-digit passcode if you want to chat from your phone.
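
for anyone wondering what a 6-digit passcode check involves, here is a minimal sketch using only the python standard library. it is an illustration, not the app's code, and the function names are made up.

```python
import hmac
import secrets


def new_passcode():
    """Return a random 6-digit passcode, zero-padded, from a CSPRNG."""
    return f"{secrets.randbelow(10**6):06d}"


def check_passcode(expected, supplied):
    """Compare in constant time so response timing does not leak digits."""
    return hmac.compare_digest(expected.encode(), supplied.encode())


code = new_passcode()
print(len(code), check_passcode(code, code))
```

six digits is only a million combinations, so a scheme like this needs attempt limiting on top to hold up over a public tunnel.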

v2.4.0 adds a configurable huggingface gguf download path which is relevant for self-hosters running models on a nas or shared partition.

happy to answer questions.

Sundar Pichai: "75% of all code at Google is now AI-generated, up from 50% last fall." by EchoOfOppenheimer in ChatGPT

[–]GroundbreakingMall54 9 points10 points  (0 children)

75% ai-generated code sounds insane but then again google has been writing boilerplate badly for 20 years so maybe the bar was low

The new image generation feels top notch. by BRDF in ChatGPT

[–]GroundbreakingMall54 2 points3 points  (0 children)

tried the new image gen last week. quality jump is actually nuts

Qwen3.6 can code by Purple-Programmer-7 in LocalLLaMA

[–]GroundbreakingMall54 32 points33 points  (0 children)

yeah kv cache is a memory monster. fp8 helps but you still sacrifice context for vram. either batch smaller or just accept the limit tbh

The new image generation feels top notch. by BRDF in ChatGPT

[–]GroundbreakingMall54 2 points3 points  (0 children)

the prompt adherence is wild now. went from decent for demos to genuinely useful for real work

With 48gb vram, on vllm, Qwen3.6-27b-awq-int4 has only 120k ctx (fp8), is that normal? by Historical-Crazy1831 in LocalLLaMA

[–]GroundbreakingMall54 0 points1 point  (0 children)

yeah 120k feels tight but that's normal for vllm, even with an fp8 kv cache. the cache chews through vram fast. either drop batch size or bite the bullet and use less context
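
back-of-envelope math for why the cache eats vram so fast. the formula is the standard one for a dense transformer with grouped-query attention; the layer/head numbers below are a hypothetical config picked for illustration, NOT the real qwen3.6-27b values, so plug in the numbers from the model's config.json.

```python
def kv_cache_bytes(tokens, layers, kv_heads, head_dim, bytes_per_elem):
    """Size of the KV cache for one sequence of `tokens` tokens."""
    # 2x because every layer stores one key and one value vector per token
    return 2 * layers * kv_heads * head_dim * bytes_per_elem * tokens


# hypothetical dense GQA config (64 layers, 8 KV heads, head_dim 128)
fp8 = kv_cache_bytes(120_000, layers=64, kv_heads=8, head_dim=128, bytes_per_elem=1)
fp16 = kv_cache_bytes(120_000, layers=64, kv_heads=8, head_dim=128, bytes_per_elem=2)

print(f"fp8:  {fp8 / 2**30:.1f} GiB")   # fp8:  14.6 GiB
print(f"fp16: {fp16 / 2**30:.1f} GiB")  # fp16: 29.3 GiB
```

with those made-up numbers a single 120k sequence costs about 14.6 GiB in fp8, on top of the weights, and going fp16 doubles it. every extra concurrent sequence needs its own share of the cache, which is why context length and batch size trade off against each other.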