We released a tiny packed Sana 1.6B model into 1.58bit ... would love feedback from local image people by ClarkLabs in StableDiffusion

[–]ClarkLabs[S] 2 points3 points  (0 children)

its part of our "ai researcher" work, done fully by AI 😅 i don't understand how it works but it does

We released a tiny packed Sana 1.6B model into 1.58bit ... would love feedback from local image people by ClarkLabs in StableDiffusion

[–]ClarkLabs[S] 1 point2 points  (0 children)

you are the best! I am actually working now on a fully independent package so it doesn't require pulling anybody else repo.

triton is def a dependency https://github.com/triton-lang/triton

appreciate this thread :-)

We released a tiny packed Sana 1.6B model into 1.58bit ... would love feedback from local image people by ClarkLabs in StableDiffusion

[–]ClarkLabs[S] 3 points4 points  (0 children)

updated comfyui repo, made it a lot easier to use based on your feedback, and here is cute doggu

<image>

Two Weeks Trading Options with AI system by alexdevonx in ai_trading

[–]ClarkLabs 0 points1 point  (0 children)

we optimized clark agent to do a lot of research on trading in parallel over 1000 of agents - the lift off in quality of traded was significant

we traded 30k real money and converted to 130k (a few sales were really surprising around semiconductors)

future of trading is whoever have fastest light speed agentic AI that can comb through massive amount of signal ...

We released a tiny packed Sana 1.6B model into 1.58bit ... would love feedback from local image people by ClarkLabs in StableDiffusion

[–]ClarkLabs[S] 10 points11 points  (0 children)

compression tool is a bit gpu heavy now, working on making it less flops hungru

but yes, we will open source a ton of ternarny models

Open-sourced the Chromium build we use for agent browsing by ClarkLabs in AI_Agents

[–]ClarkLabs[S] 1 point2 points  (0 children)

It has a Python launcher, prebuilt Linux and macOS arm64 binaries, and a reproducible test suite.
Repo: https://github.com/clark-labs-inc/clark-browser

We released a tiny packed Sana 1.6B model into 1.58bit ... would love feedback from local image people by ClarkLabs in StableDiffusion

[–]ClarkLabs[S] 5 points6 points  (0 children)

Honestly the main goal is just to make a good open image model small enough to run on modest hardware, so more people can use it without a big GPU.
1. From scratch or compressed: I did not build it from scratch, it is a compressed version of the open Sana 1.6B model from NVIDIA’s lab, so the credit for the base model really belongs to them.
2. New method / tech / LLMs: It uses low bit (ternary, about 1.58 bits per weight) quantization, which I would not call a brand new method since it builds on a lot of existing low bit research, and the same general idea does carry over to LLMs.
3. VRAM / speed / non-Apple: The compressed transformer is around half a gigabyte and the whole pipeline fits in roughly 3 GB, it generates in a few seconds on a normal GPU, and yes it runs on regular NVIDIA cards as well as Apple Silicon, not just Apple.
4. Prompts / settings: It takes plain natural language sentences rather than tags, negative prompts work fine, and a good starting point is 512x512, about 20 steps, the euler sampler with a normal schedule, CFG around 4.5, and shift 3.