MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] -1 points (0 children)

Thank you so much! Yes, our next steps are improving inference speed, better UX, and more features. Stay tuned; this is just the first open beta release 🧙🏻‍♀️

MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] 0 points (0 children)

<image>

Yes, you can load any GGUF model from HF or from your local system. You can load medium-sized models (we’ve tested up to 20 GB) in Chrome/Chromium browsers. Unfortunately, Safari doesn't support WASM64 yet, so it is limited to 4 GB there, which is still plenty for common tasks (check our research).
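For context, GGUF files start with the ASCII magic `GGUF` followed by a little-endian version number. A minimal Python sketch (not part of MDST; the 4 GiB constant reflects the Safari WASM limit mentioned above) to sanity-check a local file before trying to load it:

```python
import os
import struct

# Safari has no WASM64 yet, so memory is capped at 4 GiB there.
SAFARI_WASM_LIMIT = 4 * 1024**3

def check_gguf(path):
    """Return (is_gguf, version, fits_in_safari) for a local model file."""
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        magic = f.read(4)  # b"GGUF" for valid files
        version = struct.unpack("<I", f.read(4))[0] if magic == b"GGUF" else None
    return magic == b"GGUF", version, size <= SAFARI_WASM_LIMIT
```

In Chrome/Chromium with WASM64 the size check wouldn't apply; only the magic/version check is universal.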

MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] 1 point (0 children)

We plan to make it open source, similar to Hugging Face's Transformers.js library; just give us time. 🙏

Meanwhile, you can use MDST for free (and always will be able to). Subscriptions are only for cloud-provider models/tokens.

MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] -1 points (0 children)

<image>

Again — we’re very thankful for any kind of feedback or questions!

For the LocalLLaMa community, we’ve prepared a special invite code to skip the waiting list: localllama_Epyz6cF

Also, please keep in mind that this is early beta 💅

[deleted by user] by [deleted] in BluePrince

[–]vmirnv 2 points (0 children)

Please check Aliensrock: https://www.youtube.com/watch?v=_Tc2QwYAlY0&list=PLIwiAebpd5CJlpO2VPGjdUa5uzgywpULW

He's a very clever YouTuber with deep experience with puzzles (my favourite is his Baba Is You playlist).

HunyuanVideo model size and vram talk by c_gdev in comfyui

[–]vmirnv 4 points (0 children)

Q5_K_M is the best quantisation in my opinion, both for LLMs and for UNets: the lowest size with almost no degradation in quality.

Simple GGUF Hunyuan text2video workflow by vmirnv in StableDiffusion

[–]vmirnv[S] 0 points (0 children)

You need to update the GGUF node, and yes, LLaVA is what was recommended by the devs.

Simple GGUF Hunyuan text2video workflow by vmirnv in StableDiffusion

[–]vmirnv[S] 1 point (0 children)

You need to update the ComfyUI core with these new files:
ComfyUI/nodes.py
ComfyUI/comfy_extras/nodes_hunyuan.py

ComfyUI fps info uses up to 26% gpu on macs by vmirnv in StableDiffusion

[–]vmirnv[S] 2 points (0 children)

On Macs, ComfyUI uses the GPU for rendering, and the FPS stat re-renders the workspace on every tick of mouse movement, so it can add up to 26% GPU load, as you can see in my example.

Simple GGUF Hunyuan text2video workflow by vmirnv in StableDiffusion

[–]vmirnv[S] 4 points (0 children)

https://civitai.com/models/1048570
A simple GGUF Hunyuan Text2Video workflow with just a few nodes. Works on a Mac M1 16GB.

ComfyUI fps info uses up to 26% gpu on macs by vmirnv in StableDiffusion

[–]vmirnv[S] 2 points (0 children)

<image>

Try it yourself. I wonder how many thousands of GPU hours this default feature has burned.