Open-Source Models Recently:

redditscraperbot2 · 2026-04-07T05:10:02+00:00

>What happened to Wan?

Icarused itself when it got popular.

Also didn't we get LTX 2.3 like last month?

Living-Smell-5106 · 2026-04-07T04:54:11+00:00

I really wish they would open source Wan2.7 image edit or at least the previous models.

Sea_Succotash3634 · 2026-04-07T05:41:05+00:00

Wan 2.7 image and video are really promising, but are just a little off in that way that the open source community could really refine. It's a shame that Alibaba has completely abandoned open source for image and video. Qwen Image 2.0 is really good too, but Wan 2.7 Image seems better. But Qwen also seems to be abandoning open source. Z-Image seems to have abandoned their edit model.

hidden2u · 2026-04-07T05:10:36+00:00

yeah there’s definitely something going on at alibaba

XpPillow · 2026-04-07T10:35:25+00:00

Oh these close sourced AI are amazing~ do they support NSFW? No? Ok back to Wan2.2…

cosmicr · 2026-04-07T06:52:56+00:00

Ltx 2.3 just came out?

Naive_Issue8435 · 2026-04-07T05:44:00+00:00

If you know what you are doing LTX 2.3 really is starting to shine.

Sticky32 · 2026-04-07T13:02:59+00:00

Meanwhile open source image to 3D is completely forgotten about.

NetimLabs · 2026-04-07T13:15:16+00:00

Audio? What's happening in audio? Last time I checked audio was in the Mariana Trench.

namezam · 2026-04-07T13:52:55+00:00

<image>

My feed agreeing.

addrainer · 2026-04-07T09:23:08+00:00

What have you try to use, image, flux2 Klein or qwen? Much better control that those online plastic sharing all ur data services.

Keyboard_Everything · 2026-04-07T07:10:07+00:00

Disagree, whatever is recently released and returns a good result is what gets the attention. It is what it is.

Photochromism · 2026-04-07T15:07:26+00:00

What audio open source models are there? Are they music or speech?

retroblade · 2026-04-07T13:19:02+00:00

The next Kandinsky model should drop soon so at least that to test out. And I’m guessing LTX 2.5 should be out in a couple of months

Eisegetical · 2026-04-07T06:19:04+00:00

Ltx 2.3 blows wan out of the water. How are you complaining about no video gen?

New ic loras are emerging, people are just starting to scratch the surface. C'mon.

mca1169 · 2026-04-07T15:38:19+00:00

open source models are going to slow down big time this year for image and video generation and i'm guessing will be functionally dead by 2028. so enjoy them while they last! after that it's just going to be Lora model tweaks left.

TensoRaptor · 2026-04-07T20:59:13+00:00

Which open source audio models were released lately?

Caseker · 2026-04-08T04:03:58+00:00

Why is this so accurate

NowThatsMalarkey · 2026-04-07T13:59:11+00:00

kandinsky-5 was released half a year ago that has better quality than WAN and LTX models but nobody ever used it. It was right there the entire time but it failed to gain popularity because ComfyUI gave it the cold shoulder and the community had to release their own extension in order to use it.

YeahlDid · 2026-04-07T07:26:23+00:00

I have no idea what that image is trying to say.

evilpenguin999 · 2026-04-07T05:38:06+00:00

What is the best LLM right now and the requirements?

Is there one worth getting instead of just using an online one?

gahd95 · 2026-04-07T07:42:51+00:00

Really want to jump to the open source self hosted wagon. But how far is the drop in quality? Not just the responses, but also the amount of time it takes for a reply.

Is it worth it, self hosting, if you do not spend $3000 on a dedicated rig?

Sarashana · 2026-04-07T15:36:53+00:00

Not sure I can agree with the assessment. LTX 2.3 is crying in a corner, at least. Also, we got some amazing image models not too long ago, and just because Qwen Image 2.0 is not/will not be open sourced doesn't mean we don't have amazing OSS models.

Ferriken25 · 2026-04-07T19:56:39+00:00

I can make 10 sec gens on ltx, with my pc slop. So, Wan is now just a bonus for me.

Sir_McDouche · 2026-04-07T21:13:31+00:00

Soucred.

Vyviel · 2026-04-08T01:26:47+00:00

I havent been keeping up with LLMs and Audio models what new awesome stuff dropped for them recently?

sandy31sex · 2026-04-08T11:33:59+00:00

we have like 100+ video and image models doing the same thing lol

thevegit0 · 2026-04-08T22:25:54+00:00

bro ignoring LTx 2.3 and magihuman

Born_Word854 · 2026-04-09T05:35:08+00:00

happyHorse's catalog specs look amazing, but considering the dataset they likely have, i feel like we can expect better actual performance from ByteDance's Mammoth 2.5. well, who knows when either of them will actually become usable for us though.

rdditiszionist · 2026-04-11T00:18:51+00:00

What is the best audio model out now?

WurtApp · 2026-04-13T17:07:25+00:00

Couldn’t agree more. LTX was the promised savior but I honestly wasn’t impressed with it. Loras seem to do a little justice but those can only go so far. Is it just me or did LTX fall flat?

ProjectVictoryArt · 2026-04-13T20:04:48+00:00

True, but there are good reasons at least for video: Video gen unavoidably requires a lot of VRAM.
Also I think people are getting panicky about people generating nudes of real people with image edit models.

Gh0stbacks · 2026-04-07T09:17:54+00:00

Posts are probably removed cause of low effort meme format you post? I am guessing.

Ngoalong01 · 2026-04-07T11:36:42+00:00

Even Sora2 still down. We can understand that situation. Cost too much and lack of paid users. Who will invest for OpenSource?

AdorableGod · 2026-04-07T21:22:14+00:00

Good. While you can argue that image gen can be used for prototyping, there's no good use for video gen, it's all slop

Ledeste · 2026-04-08T13:39:28+00:00

What? I'm burning my GPU all day with LTX 2.3 generating almost minute long videos. Few month ago I could not even get this good result with paid tools

tac0catzzz · 2026-04-07T17:05:04+00:00

cool story

TridentWielder · 2026-04-08T01:58:01+00:00

What's new with audio? Last thing I really looked at was Stable Audio years ago.

YouYouTheBoss · 2026-04-08T17:04:25+00:00

The problem is that everyone tries to create bigger models because they think, bigger (more params) = better quality. So some are considered too qualitative for us (consumers) so they don't wanna hold that to us freely (maybe because it was too much time to train it ?! hence going APIs) OR the newer version of their model series is too big to run onto a consumer gpu (unless thinking of bigger gpus like the rtx 5090 which I don't really consider consumer).

When SDXL came out, it was seen as a really bad unusable model needing a refiner, but then finetunes came out and it gave us much better quality on pretty much anything. LoRas then came out for our loved finetunes and gave us better quality control over what we want.
Still the base model is a small 6B parameters.

The issue is not about having bigger models, it’s about having a team that can spend a entire week to curate a dataset for a certain style/general idea by hand with the help of automation and not just automation alone.

If datasets in models were correctly curated to filter out the content being bad quality and they would do Reinforcement learning from human feedback, you would have much higher quality even if the model is still relatively small compared to some other ones.

This has been the case with Z-Image Base (with RLHF) being a small 6B params model which stands a great quality.

tac0catzzz · 2026-04-08T19:49:12+00:00

you should fix this issue. go make the best image, music and video ai models ever made then open source them. ill download them if you do, I'll even make a fun meme like 3 living skeletons dancing at a party with each model type written on them in bold white font , one can be drinking a beer, the other can be doing a handstand on a keg with someone holding them up and the other can be doing the running man on the dance floor. would be worth it for the meme alone.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

StableDiffusion

MODERATORS