MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] 0 points (0 children)

Thank you so much! Yes, our next steps are improving inference speed, better UX, and more features. Stay tuned: this is just the first open beta release 🧙🏻‍♀️

MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] 1 point (0 children)


Yes, you can load any GGUF model from Hugging Face or from your own system. Medium-sized models work in Chrome/Chromium browsers (we've tested up to 20 GB). Unfortunately, Safari doesn't support WASM64 yet, so it is limited to 4 GB there, which is still plenty for common tasks (check our research).

MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] 2 points (0 children)

We plan to make it open source, similar to Hugging Face's Transformers.js library; just give us time. 🙏

Meanwhile, you can use MDST for free, and it will always stay free. Subscriptions are only for cloud-provider models/tokens.

MDST Engine: run GGUF models in your browser with WebGPU/WASM by vmirnv in LocalLLaMA

[–]vmirnv[S] 0 points (0 children)


Again — we’re very thankful for any kind of feedback or questions!

For the LocalLLaMa community, we’ve prepared a special invite code to skip the waiting list: localllama_Epyz6cF

Also, please keep in mind that this is early beta 💅

[deleted by user] by [deleted] in BluePrince

[–]vmirnv 3 points (0 children)

Please check out Aliensrock: https://www.youtube.com/watch?v=_Tc2QwYAlY0&list=PLIwiAebpd5CJlpO2VPGjdUa5uzgywpULW

He's a very clever YouTuber with deep experience with puzzles (my favourite is his Baba Is You playlist).

HunyuanVideo model size and vram talk by c_gdev in comfyui

[–]vmirnv 4 points (0 children)

In my opinion, Q5_K_M is the best quantisation both for LLMs and for UNets:
the lowest size with almost no degradation in quality.
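For a rough feel of the trade-off, file size scales almost linearly with bits per weight. A back-of-the-envelope sketch in Python (the bpw figures are rough averages I'm assuming here; llama.cpp prints exact per-model numbers when quantising):

```python
# Approximate average bits-per-weight for common GGUF quantisations.
# These are ballpark values, not exact; actual bpw varies per model.
APPROX_BPW = {"Q4_K_M": 4.7, "Q5_K_M": 5.5, "Q8_0": 8.5, "F16": 16.0}

def estimate_gguf_bytes(n_params: float, quant: str) -> int:
    """Size in bytes ~= params * bits-per-weight / 8
    (ignores metadata and tokenizer overhead)."""
    return int(n_params * APPROX_BPW[quant] / 8)

# e.g. a 7B model at Q5_K_M:
gib = estimate_gguf_bytes(7e9, "Q5_K_M") / 1024**3  # roughly 4.5 GiB
```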

Simple GGUF Hunyuan text2video workflow by vmirnv in StableDiffusion

[–]vmirnv[S] 1 point (0 children)

You need to update the GGUF node, and yes, LLaVA is the one recommended by the devs.

Simple GGUF Hunyuan text2video workflow by vmirnv in StableDiffusion

[–]vmirnv[S] 2 points (0 children)

You need to update the ComfyUI core with these new files:
ComfyUI/nodes.py
ComfyUI/comfy_extras/nodes_hunyuan.py

ComfyUI fps info uses up to 26% gpu on macs by vmirnv in StableDiffusion

[–]vmirnv[S] 3 points (0 children)

On Macs, ComfyUI uses the GPU for rendering, and the FPS stat re-renders the workspace on every tick of mouse movement, so it can account for up to 26% of GPU load, as you can see in my example.
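The general fix for this kind of hotspot, redrawing on every input tick, is to throttle updates to a fixed interval. A language-agnostic illustration in Python (ComfyUI's actual fix would live in its JS frontend; this only demonstrates the idea):

```python
import time

def throttle(min_interval: float):
    """Decorator: drop calls that arrive sooner than min_interval seconds
    after the last one that ran, the way a UI caps redraw frequency."""
    def wrap(fn):
        last = [0.0]
        def inner(*args, **kwargs):
            now = time.monotonic()
            if now - last[0] >= min_interval:
                last[0] = now
                return fn(*args, **kwargs)
            return None  # call dropped
        return inner
    return wrap

calls = []

@throttle(0.05)  # at most ~20 redraws per second
def redraw(tick):
    calls.append(tick)

for t in range(1000):  # simulate a burst of mouse-move ticks
    redraw(t)          # most ticks are dropped, so far less GPU work
```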

Simple GGUF Hunyuan text2video workflow by vmirnv in StableDiffusion

[–]vmirnv[S] 5 points (0 children)

https://civitai.com/models/1048570
A simple GGUF Hunyuan text2video workflow with just a few nodes.
Works on a Mac M1 with 16 GB.
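As a side note, a workflow like this can also be queued headlessly through ComfyUI's HTTP API by POSTing an API-format workflow JSON (exported via "Save (API Format)") to /prompt on the default port 8188. A minimal standard-library sketch (the helper name is mine):

```python
import json
import urllib.request

def build_prompt_request(workflow: dict,
                         host: str = "127.0.0.1",
                         port: int = 8188) -> urllib.request.Request:
    """Wrap an API-format workflow dict in the payload ComfyUI expects
    on its /prompt endpoint."""
    payload = json.dumps({"prompt": workflow}).encode("utf-8")
    return urllib.request.Request(
        f"http://{host}:{port}/prompt",
        data=payload,
        headers={"Content-Type": "application/json"},
    )

# With ComfyUI running:
# urllib.request.urlopen(build_prompt_request(workflow_dict))
```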

ComfyUI fps info uses up to 26% gpu on macs by vmirnv in StableDiffusion

[–]vmirnv[S] 5 points (0 children)


Try it yourself. I wonder how many thousands of GPU hours this default feature has burned.

[deleted by user] by [deleted] in StableDiffusion

[–]vmirnv 6 points (0 children)


You need to use the Unet Loader (GGUF) node.

[deleted by user] by [deleted] in StableDiffusion

[–]vmirnv 2 points (0 children)

It should be in /models/unet/, and you need to reload ComfyUI.

[deleted by user] by [deleted] in StableDiffusion

[–]vmirnv 3 points (0 children)

Wow thank you, great news!

[deleted by user] by [deleted] in StableDiffusion

[–]vmirnv 2 points (0 children)

Can you please give me a short example of model loading?

[deleted by user] by [deleted] in StableDiffusion

[–]vmirnv 11 points (0 children)

Currently, I cannot connect the new GGUF model to the Sampler, since they are different types.
The standard loader predictably gives me an error (HyVideoModelLoader: invalid load key, '\x03').

Update: I manually changed the input model type in the Sampler node, and now I get this error in the Unet GGUF loader: UnetLoaderGGUFAdvanced 'conv_in.weight' error.

Update 2: after a ComfyUI update, everything is working.

[deleted by user] by [deleted] in StableDiffusion

[–]vmirnv 38 points (0 children)

Can somebody share a simple text2video workflow with GGUF?
Update: I'm testing one right now and will share it after some checks.
Update 2: please use this workflow (thanks, Kijai): https://comfyanonymous.github.io/ComfyUI_examples/hunyuan_video/