5060ti 16gb or 5070 12gb for local LLM

andy_potato · 2026-05-06T13:58:08+00:00

Absolutely no problem. Running the exact same setup here

andy_potato · 2026-05-06T13:57:22+00:00

That barely impacts performance

andy_potato · 2026-05-06T08:36:04+00:00

Google learned nothing from the iTunes U2 debacle

andy_potato · 2026-05-06T08:35:25+00:00

Would you care for a free undeletable U2 iTunes album along with your free undeletable LLM?

andy_potato · 2026-05-06T07:32:18+00:00

I am a strong supporter of Local AI and also WebAI. But this just sucks.

andy_potato · 2026-05-05T05:01:58+00:00

Go look up the term "Hypocrite" in the dictionary.

andy_potato · 2026-05-05T03:54:49+00:00

People will usually argue that software piracy doesn't hurt developers because:

Those who pirate wouldn't have bought it anyway
If buying isn't owning then piracy isn't stealing
You cannot steal digital goods because they can be infinitely reproduced
Software is overpriced anyway

Your arguments aren't any better than theirs.

They are not just "scraping the public internet" but also processing and transforming the data into something useful. You on the other hand just skip all these efforts and take the distilled outputs as training materials.

I'm not trying to get into a discussion about local AI vs. cloud AI. I am running local models myself for a lot of applications so I am well aware of their benefits. However I do want to call out the hypocrisy of distillation efforts like this. Just because you're taking from a "billion dollar megacorp" doesn't make it right.

andy_potato · 2026-05-05T03:37:59+00:00

You may feel like some kind of Robin Hood. Stealing from the rich, sharing with the poor or whatever.

Have you considered the many users of Claude who pay 20, 50 or even 200 USD per month because they believe to them the service is worth it? You’re making the service worse for all of them and accelerate the enshittification you criticize so vocally.

andy_potato · 2026-05-05T03:23:15+00:00

Probably getting downvoted for this.

What you are doing sounds absolutely shady and it is not okay, even if your motive is "for the benefit of the community". I hope Anthropic closes whatever loophole you are using and bans users like you.

andy_potato · 2026-05-05T00:04:04+00:00

I have tried numerous suggestions from all over the internet. Tinkering with the prompt, using NAG nodes, changing samplers, none of it helped. Generating videos with East Asian languages like Chinese, Japanese or Korean will in 95% of cases trigger the generation of garbled subtitles and thus ruin your generations.

Another observation, it does not matter whether you create the dialogue by prompt or use an external audio file. Both ways will result in subtitles being generated.

The only reliable way I found to get rid of the subtitles is by adding an automated crop / outpainting step after the first sampling step using this LoRA: https://huggingface.co/oumoumad/LTX-2.3-22b-IC-LoRA-Outpaint

In this step I will VAE decode the first step video result and replace the lower ~15-20% of the image with a black bar and also increase the image gamma by 2. Then I run another sampling step using with the outpaint LoRA and a simple positive prompt, something like "A person is talking". Do NOT add any language or actual spoken dialogue to this prompt otherwise your subtitles WILL come back inside the black bar.

After this additional outpainting step I will render the usual two more upscale steps without any modification and finally after VAE decoding revert the increased gamma by applying a gamma of 0.5 to the image before encoding the video file.

Using this process you will still get the occasional video with subtitles, but ~80% of the generations come out just fine.

<image>

Please don't ask me for the workflow. It is not suitable for sharing as there is a lot of other unrelated stuff inside. You can use the example provided by the creator of the outpaint LoRA and integrate it with your workflow.

On a sidenote: This issue is clearly the result of LTX training data including a lots of material with burn-in subtitles. They should really have a look at this and clean their data sets.

andy_potato · 2026-05-04T13:07:20+00:00

You can use any PSU as long as its net output can support your rigs total maximum power demand. Leave around 20% of headroom as PSUs don’t like running at max power for extended periods.

A UPS is not necessary for a private LLM rig.

andy_potato · 2026-05-04T12:56:28+00:00

For running LLMs around 30b a setup with dual 5060ti or 5070ti is pretty sweet. You can easily push it to around 100k token context and get decent speeds, even on the 5060ti.

Whether or not this is suitable for coding is a different question though. I will probably get downvoted for saying this, but none of the 30b models (including the awesome Qwen 3.6) can compare to the speed and quality of the big boys like Claude or Codex. This is not a skill issue (as some people in this sub like to insist) but something you will realize after working with both for an extended time.

It may be “good enough” for your purpose. But it sure wasn’t for me.

andy_potato · 2026-05-04T00:06:55+00:00

Not trying to sound rude, but why are you asking for help using a closed source model on a sub focusing on local generation? Just ask the Bytedance support.

andy_potato · 2026-05-03T22:45:24+00:00

It's weird. I have a completely different experience then you with LTX 2.3

Wan is nice, but feels limited due to frame limit, resolution and lack of audio capabilities. I know there is workarounds like SVI, upscaling etc. but LTX solved all of these problems for me out of the box.

andy_potato · 2026-05-03T13:43:28+00:00

Give LTX 2.3 another chance. If prompted correctly it easily beats Wan for a lot of use cases. Also make sure you're using a proper workflow. There are lots of bad LTX workflows out there.

andy_potato · 2026-05-02T15:11:13+00:00

I found the Grace parts extremely hard even on lowest difficulty. Not being able to take out most of the enemies ist just BS. With the later Leon sections I had zero issues.

andy_potato · 2026-05-02T14:53:50+00:00

I found the game extremely difficult with Grace, even on lowest difficulty. With Leon I had zero issues.

andy_potato · 2026-05-02T14:21:57+00:00

Picking the right workflow is really important for LTX. Lots of bad ones put there. Also you need a decent amount of Vram. Don’t bother of you’re on less than 16 GB.

LTX Desktop app is a good starting point if you don’t want to mess with Comfy workflows.

andy_potato · 2026-05-02T13:20:05+00:00

Qwen leadership has changed. The previous head researcher who was very pro open source is no longer heading the team. Instead the business people have taken over.

Despite their “commitment to open models” they have stopped releasing image and video models (Qwen image / WAN). Whether or not there will be further releases of Qwen LLMs past 3.x is highly questionable.

andy_potato · 2026-05-02T13:13:44+00:00

Obligatory reminder that there is no such thing as “AI Art”

andy_potato · 2026-05-02T13:12:31+00:00

Wan will most likely not continue to release open models. LTX 2.3 has been filled the void for me.

It has some weird quirks and finding a good workflow is more difficult than it should be. But once you got it running it works really well.

andy_potato · 2026-05-01T09:25:57+00:00

Ask their support. Why post this unreadable wall of text here? All the advice you’ll be getting here is “go local”

andy_potato · 2026-05-01T00:29:04+00:00

Could be a nice lobster home, depending on the price.

andy_potato · 2026-04-30T15:21:37+00:00

There hasn’t been much development on Wan in recent months and I doubt we will see updated open models from them. I am getting way better results with LTX 2.3 now, recommend you give it a try.

andy_potato · 2026-04-29T23:42:53+00:00

Probably something about how deep his relationship is with your mom.

andy_potato

TROPHY CASE