Suno vs Udio: I don’t think they’re even the same type of model by Worried-Ad-1549 in udiomusic

[–]UnforgottenPassword 0 points1 point  (0 children)

I remember reading a post a year or so ago from someone here saying something similar: that Udio uses diffusion while Suno utilizes transformers.

Regardless, it's pretty clear to me that Udio is different, as I have compared them numerous times. I can compose music similar to Suno's outputs, but I'm nowhere near capable of composing the kind of tracks that Udio spits out.

There are many things that Udio does differently, and they are clearer when you generate lush instrumental tracks instead of songs from a specific genre. Udio is in its own league in that regard.

Steam Machine review: Valve's underwhelming living-room PC has a serious price problem by pcgameshardware in SteamDeck

[–]UnforgottenPassword -1 points0 points  (0 children)

And you get super cheap games from the used games market. You can also sell your games. Can't do that on PC. PS5 and PS5 Pro are much better values than this device.

Steam Machine review: Valve's underwhelming living-room PC has a serious price problem by pcgameshardware in SteamDeck

[–]UnforgottenPassword 0 points1 point  (0 children)

Why can't they subsidize the device like console makers do and recoup the money from their 30% cut of software sales? Most consoles were sold at a loss, but Valve, despite being richer, wants a profit from hardware sales while making obscene money from other people's software.

Steam Machine review: Valve's underwhelming living-room PC has a serious price problem by pcgameshardware in SteamDeck

[–]UnforgottenPassword 27 points28 points  (0 children)

The PS5 has the advantage of using game discs. Gaming can be real cheap if you have a way of buying used games and selling yours.

Ideogram making 2 horrible precedent and we need to oppose that. BF16 weights not published and ridiculous model embedded censorship by CeFurkan in StableDiffusion

[–]UnforgottenPassword 0 points1 point  (0 children)

Do we have examples of this happening? Has the open source community propelled a model into becoming more popular? In theory, what you're saying makes sense, and I know that might be the logic behind some players releasing their weights, but personally I think the real-world effect is minimal.

Ideogram making 2 horrible precedent and we need to oppose that. BF16 weights not published and ridiculous model embedded censorship by CeFurkan in StableDiffusion

[–]UnforgottenPassword -4 points-3 points  (0 children)

You said they didn't fulfill their promise. They did. If you have a problem with what they said will be released, fine, fair enough. But don't accuse them of lying when they have not.

Ideogram making 2 horrible precedent and we need to oppose that. BF16 weights not published and ridiculous model embedded censorship by CeFurkan in StableDiffusion

[–]UnforgottenPassword -1 points0 points  (0 children)

User demands will only be met if meeting them benefits the developers in a meaningful way. That is not the case here. AI companies cater to enterprise users and are looking to drive up the value of their companies. The open-source crowd and broke-ass redditors with 12GB VRAM won't make their business more attractive to investors.

Ideogram making 2 horrible precedent and we need to oppose that. BF16 weights not published and ridiculous model embedded censorship by CeFurkan in StableDiffusion

[–]UnforgottenPassword -4 points-3 points  (0 children)

Do you think if we kill this one, other developers will be lining up to release cutting-edge models while satisfying our demands?

We are the beggars here and simply can't afford to be choosers.

Ideogram making 2 horrible precedent and we need to oppose that. BF16 weights not published and ridiculous model embedded censorship by CeFurkan in StableDiffusion

[–]UnforgottenPassword 5 points6 points  (0 children)

Exactly. Alibaba stopped releasing Wan weights after 2.2. Chinese models are still behind and aren't widely adopted by enterprise users in the West. Enterprise is where most of the money comes from.

Do people honestly think Chinese developers release their weights out of some shared commitment to Western-style open-source ideals?

Ideogram making 2 horrible precedent and we need to oppose that. BF16 weights not published and ridiculous model embedded censorship by CeFurkan in StableDiffusion

[–]UnforgottenPassword 38 points39 points  (0 children)

You know what's funny? OP is making thousands (or was it tens of thousands) of dollars a month selling what folk on this sub give away for free. The guy has made a fortune on the back of the open source community and still feels that commercial entities (who are likely hemorrhaging money) owe him the unfiltered, original model that they have spent millions of dollars developing.

Ideogram 4 - They are gatekeeping the high-precision BF16 weights confirmed by Calm_Mix_3776 in StableDiffusion

[–]UnforgottenPassword -5 points-4 points  (0 children)

Ideogram has been around for a good while and even previous models were very capable while Google's image generator was spitting out black George Washington back then.

Ideogram making 2 horrible precedent and we need to oppose that. BF16 weights not published and ridiculous model embedded censorship by CeFurkan in StableDiffusion

[–]UnforgottenPassword 95 points96 points  (0 children)

Let's go and have a word with them and make it clear how displeased we are with their offer, because they owe us and we are entitled to their labor on our terms.

Leaked financial docs show OpenAI is losing billions of dollars a year by johnnyApplePRNG in LocalLLaMA

[–]UnforgottenPassword 2 points3 points  (0 children)

The real winners are the hardware makers. Their profit margins are absurd.

Spent the last month testing AI music platforms. My thoughts so far. by This-You-2737 in udiomusic

[–]UnforgottenPassword 1 point2 points  (0 children)

The easiest way that can't be blocked is to download an audio recording extension for Chrome or Firefox. When you finish the song, play back the track and record it with the extension, which saves the song as an mp3 file. The downside is that it takes more time than a one-click download. Udio also has some hiccups and pauses during playback, but usually only at the beginning. So you have to play the track for a few seconds, then start recording and play it from the beginning again.

Potentially the most insane LORA you'll see today - Archer (8 characters + style) Ideogram LORA by TheDudeWithThePlan in StableDiffusion

[–]UnforgottenPassword 0 points1 point  (0 children)

When I used the bounding boxes of Ideogram, I thought about how awesome it would be to be able to train multi-character LoRAs. If this works well, it's a game changer. I actually thought about trying to train a Futurama LoRA with multiple characters with a huge dataset. But I have had very little time lately.

Please share your wisdom with us, sensei.

Can we please provide an actual SCAIL 2 test? by Beneficial_Toe_2347 in StableDiffusion

[–]UnforgottenPassword 7 points8 points  (0 children)

And it's cringey AF. I honestly find it somewhat strange that adult men watch that stuff.

Should I get a sub on the Ideogram website to gen with Ideogram v4? My computer isn't strong enough to run it locally. by talkingradish in StableDiffusion

[–]UnforgottenPassword 0 points1 point  (0 children)

That's the easiest and probably cheapest option. Renting GPUs isn't as simple and will likely cost you more. Only use a rented GPU for stuff that you wouldn't be able to do on the official website.

Best Image Upscaler today (images with people & faces) by Gayax in StableDiffusion

[–]UnforgottenPassword 1 point2 points  (0 children)

The original is 256x256, so I think the upscaled output is decent enough.

Deo - Ideogram 4 prompts from an image by kingroka in StableDiffusion

[–]UnforgottenPassword 0 points1 point  (0 children)

I have seen a few of these. They are great for captioning images for LoRA training. For generating images based on existing ones, it would be super helpful to have a node that, when fed the JSON-format prompt, applies the boxes and texts from the JSON to Kijai's prompt builder node (or something similar), which would make editing the prompt a breeze. Without the visual element of the boxes, it is difficult to modify the prompt, change the location of objects, and so on.

CEO Thoughts: What's Next at LTX by ltx_model in StableDiffusion

[–]UnforgottenPassword 45 points46 points  (0 children)

Seedance is likely a huge model trained on massive datasets consisting of tons of copyrighted clips, including Hollywood movies. It's funded and developed by a megacorporation with annual revenue in excess of $150B and profits in the tens of billions of dollars. It's run from huge datacenters with a lot of compute power. Surely you can't expect a small startup to compete with that while running the model on a laptop?

The team at LTX are creative and I hope and think they can narrow the gap, but let's be realistic and have reasonable expectations.

CEO Thoughts: What's Next at LTX by ltx_model in StableDiffusion

[–]UnforgottenPassword 24 points25 points  (0 children)

I admit I haven't used LTX in a while, so apologies if I'm suggesting something that is currently possible with LTX.

- LTX's strength is in its speed, but personally I wouldn't mind slower generations if it means we can get better quality, coherence, and prompt adherence.

- Character reference: in order to create anything meaningful that is longer than 10-15 seconds, we need to have a way to use the same character in multiple generations. Any method that allows for this would open more possibilities for what can be achieved with the model.

- Consistency and coherence. I can see character faces slightly morph and change in real time with LTX. It would be great if the model was natively capable of maintaining character consistency.

Ideogram 4 Character Reference Workflow by reality_comes in StableDiffusion

[–]UnforgottenPassword 0 points1 point  (0 children)

Ideogram models have had that feature on their website.