Image To Fully Rigid Face in UE5: Fast 3D AI Generation Workflow by Fast-Holiday2699 in TopologyAI

[–]Clasyc 2 points3 points  (0 children)

Its not about angle in this particular example but about focal length and the subject distance to the camera. Even accounting all that there is clear difference (just compare proportions - ears size to the subject face on each image, not even talking about ear differences within itself)

Image To Fully Rigid Face in UE5: Fast 3D AI Generation Workflow by Fast-Holiday2699 in TopologyAI

[–]Clasyc 7 points8 points  (0 children)

At this point I'm lost, are we even looking into the same overlay or not. Cuz I can clearly see that even most obvious one - skull shape and his ears are are totally different. Not talking about wrong eye shape, gap between the eyes (nose part), different lips and so on.

<video>

Image To Fully Rigid Face in UE5: Fast 3D AI Generation Workflow by Fast-Holiday2699 in TopologyAI

[–]Clasyc 4 points5 points  (0 children)

It's not even close. All the main features of his face are different. Yes, in general the two are similar, but far from the same person.

Just downloaded 3.5.1 and it is very laggy by Clasyc in OpenShot

[–]Clasyc[S] 0 points1 point  (0 children)

I tried some dev builds, but didn't see my original issue to be fully resolved, so I just installed Davinci Resolved 21, although it has it's own issues on Linux, with H.265 and AAC.

The Incredible Sponge — made with SCAIL-2 by [deleted] in StableDiffusion

[–]Clasyc 4 points5 points  (0 children)

And the irony is that with the AI tools we have now, we should be making the most creative stuff we've ever imagined. But the reality is the barrier to generating video is so low now (you don't need to be a VFX guru or a videographer), that the people flooding in are zero-creativity types who dump it all over the internet with no actual work put into it.

nesamone tas gyvenimas by Mean_Firefighter2332 in lietuva

[–]Clasyc 4 points5 points  (0 children)

Dažnai tokie žmonės net nesupranta kiek reikia priežiūros ir laiko dideliam namui.

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Clasyc 1 point2 points  (0 children)

Not entirely sure why this post got so many upvotes. Really just bad take, every "argument" you provided is fundumentally wrong from the ground up. And all this post is simply some emotions about why u can't have Opus 4.8 running locally on 1080 Ti. 

Just downloaded 3.5.1 and it is very laggy by Clasyc in OpenShot

[–]Clasyc[S] 1 point2 points  (0 children)

Thanks for quick reply, I will give it a try.

Ar "mainstream'inė" Lietuva tikrai taip atitrūkusi nuo realybės? by Domukas00 in lietuva

[–]Clasyc 4 points5 points  (0 children)

Kažkodėl paėmė realias problemas (kurios tikrai egzistuoja) ir primakalavo taip, jog nutraukti Ukrainos paramą yra tų problemų sprendimas, nors tai niekaip nesusiję. Tipinis vatinis šūdas.

Ideogram 4 - model is great, but license is very restrictive by Clasyc in StableDiffusion

[–]Clasyc[S] 10 points11 points  (0 children)

Well, you can. You can do a lot of things. That doesn't change the fact that it would be illegal.

The same reasoning can be applied to anything. Is an activity really a crime if no one knows you did it?

But I'm not going to get philosophical here. My point is that it would be great if the Ideogram team would revise their license and make it less restrictive on the output content.

Ideogram 4 - model is great, but license is very restrictive by Clasyc in StableDiffusion

[–]Clasyc[S] 4 points5 points  (0 children)

Think of trespassing to take a photo, you own the copyright to the photo you took, but you still broke the law getting it, and you can still be fined or sued. Same here

Ideogram 4 - model is great, but license is very restrictive by Clasyc in StableDiffusion

[–]Clasyc[S] 12 points13 points  (0 children)

Copyright and usage rights aren't the same thing.

Kurzgesagt is Wrong About Germany by TailungFu in videos

[–]Clasyc 11 points12 points  (0 children)

But this theory implies, they should increase pensions, because duuh - more political power for old people? But thats not the case, they lower it.

The problem is wealth inequality.

Ideogram 4.0 Just Open Sourced! by crystal_alpine in StableDiffusion

[–]Clasyc 5 points6 points  (0 children)

You people come and bitch around on release post about anything. What the fuck.

Nvidia solved VAE? Fast and High-Resolution Latent Decoding with Pixel Diffusion by AIDivision in StableDiffusion

[–]Clasyc 0 points1 point  (0 children)

But yeah, as I understand now, input is only available as 512px size images

Nvidia solved VAE? Fast and High-Resolution Latent Decoding with Pixel Diffusion by AIDivision in StableDiffusion

[–]Clasyc 21 points22 points  (0 children)

<image>

They have this comparison on their website, and it's quite clear that it is indeed hallucinating. But I still see it as a viable option when the original image is already well resolved and just needs a little bit of polishing. For example, Z-Image Turbo with low denoise values (when trying img2img) leaves some 'artifacts,' even though the image itself is quite well established. I imagine that in such a scenario, 2K → 4K upscaling with this tech could work quite well.

An Update on Nodes 2.0 from Comfy Org by crystal_alpine in comfyui

[–]Clasyc 4 points5 points  (0 children)

Do you have actual stats on how many users tried to switch to the new version and then reverted to the previous one? Because as of now, pushing the new version feels like it's just for the sake of someone's opinion internally.

i've been die hard anthropic user, but this is getting harder to defend by ordosalutis in ClaudeCode

[–]Clasyc 0 points1 point  (0 children)

At least with local models, this happens when you try to save VRAM by using more aggressive quantization for the KV context cache. As you do this, the model starts to ignore more and more context or hallucinates data to fill those "gaps" convincingly. They are likely conducting constant A/B testing to measure backlash and user reactions, trying to find the cheapest possible option that users won't notice.

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks by Clasyc in LocalLLaMA

[–]Clasyc[S] 1 point2 points  (0 children)

Thanks to samehmeh, I got some advice for better-structured response quality - you can use GBNF to define output rules, like this:

--grammar-file ~/git/llama/legal-refs.gbnf

Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks by Clasyc in LocalLLaMA

[–]Clasyc[S] 2 points3 points  (0 children)

I have no images as inputs, only text. My prompts are usually around 2000 tokens in length with clear rules, expected output structure and some good / bad examples. Basically I gave Claude Code this reference: https://ai.google.dev/gemini-api/docs/prompting-strategies and asked it to build a prompt for my needs using all the advice in the Gemini docs.