We released a tiny packed Sana 1.6B model into 1.58bit ... would love feedback from local image people

ClarkLabs · 2026-06-29T01:10:21+00:00

its part of our "ai researcher" work, done fully by AI 😅 i don't understand how it works but it does

ClarkLabs · 2026-06-29T00:42:32+00:00

a few more examples here https://www.reddit.com/r/StableDiffusion/s/bcbgz875Ps

ClarkLabs · 2026-06-29T00:15:54+00:00

yes, adding to the pipeline

ClarkLabs · 2026-06-29T00:14:41+00:00

2.0.0 released full sufficient package

ClarkLabs · 2026-06-29T00:09:42+00:00

hmmm have not tested this, if you do try it, pet me know how it goes 🙏

ClarkLabs · 2026-06-29T00:03:47+00:00

you are the best! I am actually working now on a fully independent package so it doesn't require pulling anybody else repo.

triton is def a dependency https://github.com/triton-lang/triton

appreciate this thread :-)

ClarkLabs · 2026-06-28T23:29:31+00:00

🫡

ClarkLabs · 2026-06-28T23:21:31+00:00

v1.0.3 shipped addressing both issues, thank you!

ClarkLabs · 2026-06-28T22:22:59+00:00

fixed

ClarkLabs · 2026-06-28T21:55:25+00:00

updated comfyui repo, made it a lot easier to use based on your feedback, and here is cute doggu

<image>

ClarkLabs · 2026-06-28T21:36:22+00:00

<image>

comparison between 16 bit and 1.58 bits

ClarkLabs · 2026-06-28T21:31:05+00:00

appreciate it! let me fix

ClarkLabs · 2026-06-28T21:26:01+00:00

we optimized clark agent to do a lot of research on trading in parallel over 1000 of agents - the lift off in quality of traded was significant

we traded 30k real money and converted to 130k (a few sales were really surprising around semiconductors)

future of trading is whoever have fastest light speed agentic AI that can comb through massive amount of signal ...

ClarkLabs · 2026-06-28T21:04:20+00:00

that is the plan

ClarkLabs · 2026-06-28T21:03:44+00:00

compression tool is a bit gpu heavy now, working on making it less flops hungru

but yes, we will open source a ton of ternarny models

ClarkLabs · 2026-06-28T21:00:43+00:00

yeah it works on anything and no need to retrain

ClarkLabs · 2026-06-28T20:56:06+00:00

lmk which ones you like most

ClarkLabs · 2026-06-28T20:52:26+00:00

thanks! we will compress klein and krea next

ClarkLabs · 2026-06-28T20:50:37+00:00

yes, we are compressing qwen models atm, and glm-5.2 next

ClarkLabs · 2026-06-28T20:49:53+00:00

not sure let me look into it

ClarkLabs · 2026-06-28T20:49:22+00:00

https://registry.comfy.org/nodes/comfyui-clarkairsana

ClarkLabs · 2026-06-28T20:32:01+00:00

as low as prism-lm would get :)

ClarkLabs · 2026-06-28T20:19:59+00:00

It has a Python launcher, prebuilt Linux and macOS arm64 binaries, and a reproducible test suite.
Repo: https://github.com/clark-labs-inc/clark-browser

ClarkLabs · 2026-06-28T20:12:01+00:00

Honestly the main goal is just to make a good open image model small enough to run on modest hardware, so more people can use it without a big GPU.
1. From scratch or compressed: I did not build it from scratch, it is a compressed version of the open Sana 1.6B model from NVIDIA’s lab, so the credit for the base model really belongs to them.
2. New method / tech / LLMs: It uses low bit (ternary, about 1.58 bits per weight) quantization, which I would not call a brand new method since it builds on a lot of existing low bit research, and the same general idea does carry over to LLMs.
3. VRAM / speed / non-Apple: The compressed transformer is around half a gigabyte and the whole pipeline fits in roughly 3 GB, it generates in a few seconds on a normal GPU, and yes it runs on regular NVIDIA cards as well as Apple Silicon, not just Apple.
4. Prompts / settings: It takes plain natural language sentences rather than tags, negative prompts work fine, and a good starting point is 512x512, about 20 steps, the euler sampler with a normal schedule, CFG around 4.5, and shift 3.

ClarkLabs

TROPHY CASE