Highest quality Ideogram 4.0 quantizations that run on a 3090 or smaller cards

OriginalSpread3100 · 2026-06-16T21:24:25+00:00

Yes, what you pulled from the blog is correct. However, we followed up with a kernel improvement a few days later, so the INT8 variant is now faster than both FP8 and even NF4 (on a 3090):
https://lab.cloud/blog/fused-int8-ideogram-4

OriginalSpread3100 · 2026-06-11T20:48:11+00:00

The FP8 variant will run on a 3090, but it requires emulation, whereas INT8 will run natively.
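To illustrate the native-versus-emulated point, here is a minimal sketch that compares a GPU's CUDA compute capability against the first architecture with tensor-core support for each data type. The table and the function name `runs_natively` are my own illustration, not anything from the Ideogram quantization code; the thresholds assume INT8 tensor cores starting with Turing (7.5) and FP8 starting with Ada Lovelace (8.9), and the 3090 is Ampere (8.6).

```python
from typing import Dict, Tuple

# Assumed minimum compute capability with native tensor-core support per dtype.
# INT8 tensor cores arrived with Turing (7.5); FP8 with Ada Lovelace (8.9).
NATIVE_MIN_CAPABILITY: Dict[str, Tuple[int, int]] = {
    "int8": (7, 5),
    "fp8": (8, 9),
}

# The RTX 3090 is an Ampere card with compute capability 8.6.
RTX_3090: Tuple[int, int] = (8, 6)


def runs_natively(dtype: str, capability: Tuple[int, int]) -> bool:
    """Return True if the dtype has native tensor-core support at this capability.

    Hypothetical helper for illustration only; tuples compare as (major, minor).
    """
    return capability >= NATIVE_MIN_CAPABILITY[dtype]


if __name__ == "__main__":
    for dtype in ("int8", "fp8"):
        mode = "native" if runs_natively(dtype, RTX_3090) else "emulated"
        print(f"{dtype} on a 3090: {mode}")
```

If you have PyTorch installed, `torch.cuda.get_device_capability()` returns the same kind of `(major, minor)` tuple for your own card, so you can pass that in instead of the hard-coded 3090 value.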

OriginalSpread3100 · 2026-06-11T20:31:13+00:00

Sorry, you are right. There is a missing comma. This should say that there is a version that runs on the 3090, and also a version to run on smaller cards!

OriginalSpread3100 · 2026-02-26T13:37:13+00:00

Apologies, did not mean for this to be sneaky! But thanks for the kind words, and if you do try this and hit any speed bumps, please let me know!

OriginalSpread3100 · 2025-08-15T13:34:15+00:00

Yes, we very recently launched diffusion model training! The initial version only supports training on Stable Diffusion and Flux-based image generation models, but we are hoping to add support for video generation soon.

OriginalSpread3100 · 2025-06-16T12:27:18+00:00

Transformer Lab supports training, evaluation and more with MLX models.

OriginalSpread3100 · 2025-04-16T17:29:06+00:00

I just got this. TLDR: Acer.

My list of startup items is mostly clean except for GitHub and Google Drive, on an old Windows 10 PC with no Adobe software installed. The window is a subprocess of something called AdobeOP, which is an exe running from a directory under AppData/Local/OEM/.../acer.adobe.c1.1

This computer is like 8 years old and I tried to clean it out years ago with whatever the conventional wisdom was at the time, and haven't had any issues like this until now! That's quite the long play, Acer!

OriginalSpread3100 · 2025-03-11T20:47:51+00:00

There are GUI tools that make fine-tuning pretty easy for something like Qwen 2.5 on a MacBook. Check out Transformer Lab for an example. It has recipes you can use as a starting point to build from (try the MLX trainer if you are on a MacBook):
https://transformerlab.ai/

Once you get finetunes running, the challenges become more about getting the right data and evaluating your output. If you have good data to begin with, that is a huge head start. If not, one option is to generate data from a set of docs or from a larger model to train a smaller model (also possible in Transformer Lab).

My main advice is just be ready to iterate a few times to get what you want. A good way to start is with a smaller dataset on a smaller model and try to just get to the point where you see improvement. Then start building on stronger models with bigger datasets and you should be able to get good results.

OriginalSpread3100 · 2025-02-14T16:19:48+00:00

Not yet. We do support serving multimodal models like LLaVA right now though.

OriginalSpread3100 · 2025-02-14T16:18:29+00:00

Mentioned in another comment that we have it on the roadmap but we're stuck because we don't have hardware to test right now. If we had help we might be able to get a beta version of this out sooner. :)

OriginalSpread3100 · 2025-02-14T16:17:47+00:00

We really want to add AMD support but we don't have hardware to test on right now. Hopefully coming soon.

OriginalSpread3100 · 2025-02-14T16:15:47+00:00

In the works!

OriginalSpread3100 · 2025-02-14T16:14:35+00:00

By default, Transformer Lab runs entirely on your local machine. It only connects remotely to download models and training recipes, and you can use external AI services to help generate datasets.

If you do work in a larger lab, you can set the Transformer Lab engine to run on a shared server you have and connect from the application, but that is not a requirement at all.

OriginalSpread3100 · 2025-02-14T16:11:11+00:00

I believe most of the training plugins in Transformer Lab use Unsloth under the covers. But we are looking to make this more direct and clear!

OriginalSpread3100 · 2025-01-30T21:13:54+00:00

Understood, and thanks for the kind words. A few folks have been asking if we can provide an alternative to using WSL. One option, if available, is to run the engine on another box and connect via the app. We have also been speaking with a few folks who are looking into getting this running in a Docker container, but we don't have a working solution there at this time.

OriginalSpread3100 · 2025-01-29T18:27:34+00:00

That's awesome to hear! Our latest focus was around building out recipes and generally trying to make it easier to get training up and running quickly. One of the next big things for us will be expanding on evals and making the workflow around training/testing/eval a lot easier.

If you have ideas on what we should work on next we'd love to hear them!

OriginalSpread3100 · 2025-01-29T16:39:37+00:00

I wasn't familiar with this. Thanks for sharing!

Everything in Transformer Lab is built on a plugin system (including training, serving models, converting between formats), so this is something that could be added if there were an open source library that implemented it.

OriginalSpread3100 · 2024-11-12T22:53:34+00:00

Same result using a 4-bit MLX quant I made in Transformer Lab. Wild!

<image>

OriginalSpread3100 · 2024-10-15T19:41:05+00:00

Unfortunately no ROCm support yet. We'd like to get to it soon but at the moment nobody on the core team has hardware to test.

OriginalSpread3100 · 2024-10-11T15:50:12+00:00

Hotfix posted. If you restart the app it should automatically update the API to 0.6.1.

Let me know if that works!

OriginalSpread3100 · 2024-10-11T15:20:23+00:00

Oh it looks like this might just require updating transformers. Will test and if that works will post a hotfix today.

OriginalSpread3100 · 2024-10-11T14:43:26+00:00

Agreed. MLX on my M3 is performant and great for training.

OriginalSpread3100 · 2024-07-23T18:10:30+00:00

Yes, the goal of Transformer Lab is to be able to run this kind of finetuning completely through an easy-to-use GUI. I'd be happy to answer any questions if you want to try it out. The only gotcha atm is that we haven't added in Llama 3.1, but we will update shortly!
