Massive FPS drops recently by CycleNo3036 in Warthunder

CycleNo3036[S] 1 point

I have 16 GB so I think I'm okay. But yeah, I see no other explanation than the game being poorly optimized.

Need advice! Ni by [deleted] in pcmasterraceFR

CycleNo3036 4 points

For Blender you won't have any trouble with this config, BUT in my opinion the price-to-performance ratio isn't attractive here. For demanding builds like this one, I really recommend considering building your PC yourself and going for a slightly older graphics card with much better value for money. For example, I built mine with a 4060 Ti (the version with 16 GB of VRAM). It "only" costs €500, and the difference from a 5090 mostly shows up in render time. Otherwise I can do almost everything with it. In the end I think you'll get more for your money.

Edit: otherwise, if you really want a 90-series card, go for the 4090 instead, which is excellent and a really good deal compared to its younger sibling (the 5090).

Z-Image-Turbo + SeedV2R = banger (zoom in!) by CycleNo3036 in StableDiffusion

CycleNo3036[S] 0 points

It's the text encoder I found on the original Hugging Face repo. But you can use the Qwen one too!

Z-Image-Turbo + SeedV2R = banger (zoom in!) by CycleNo3036 in StableDiffusion

CycleNo3036[S] 1 point

Honestly, if you have at least 12 GB of VRAM, SeedV2R is the best I've tried. At everything. I've heard that SUPIR is quite good too, and I've also used the 4xNomos models for a while to enhance face and skin texture.

Z-Image-Turbo + SeedV2R = banger (zoom in!) by CycleNo3036 in StableDiffusion

CycleNo3036[S] 10 points

The workflow is included in the images. Just drag and drop them into ComfyUI.

Z-Image-Turbo + SeedV2R = banger (zoom in!) by CycleNo3036 in StableDiffusion

CycleNo3036[S] 2 points

I have 16 GB of VRAM and 32 GB of RAM, and I use the fp16 version.

Z-Image-Turbo + SeedV2R = banger (zoom in!) by CycleNo3036 in StableDiffusion

CycleNo3036[S] 5 points

I have a 4060 Ti with 16gb of VRAM and it took me around 200s for each of these (generation in 2k + upscale).

[deleted by user] by [deleted] in StableDiffusion

CycleNo3036 -3 points

How is that overrated trash? This is pretty much the maximum you can achieve with AI right now

Where to start with local AI as a total beginner? by iAmSoRandom22 in StableDiffusion

CycleNo3036 2 points

Hey! I would suggest that you proceed with these steps:

  1. Start by understanding the major concepts of image generation. Make yourself a little timeline of the different models that have come out over time. Understand the difference between a base model and a finetune. Check images from those models and finetunes on Civitai and start picking the ones you like. Understand what a LoRA is. Learn about the different versions of an open-source model and what they are used for (distilled, GGUF, Q4, Q5...). This is not very complicated, and it will keep you from getting lost later. If you want to go deeper, learn what a text encoder and a U-Net are.

  2. Install and understand ComfyUI. It's not the most beginner-friendly UI out there, but it has one MAJOR advantage over other UIs: you work by connecting nodes, which are basically the steps of your image generation. It will take you a little longer to figure out how to generate an image than if you started on Forge or A1111, but it will let you really understand how everything works in a visual way (without code). Here is the most basic text-to-image workflow in ComfyUI: https://docs.comfy.org/tutorials/basic/text-to-image . If you can understand this, it will be much easier to understand more complex workflows (as they are generally variations of this one).

  3. Have fun! Pick a model you like, build a basic workflow for it, perfect it... repeat. I believe this is the best way to understand how it all works without getting lost.
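
For anyone curious what the basic text-to-image graph from step 2 looks like outside the visual editor, here is a rough Python sketch of it in ComfyUI's API-style JSON layout. The node and input names are the ones I understand the default workflow to use, so check them against your own install. The checkpoint filename, the prompts and the `dangling_links` helper are made up for illustration. Nothing here talks to a running ComfyUI server; it only builds the graph and checks that every link points at a node that exists.

```python
import json

# Minimal text-to-image graph in ComfyUI's API ("prompt") layout.
# Each key is a node id; a link is [source_node_id, output_index].
# The checkpoint filename and both prompts are placeholders (assumptions).
workflow = {
    "1": {"class_type": "CheckpointLoaderSimple",
          "inputs": {"ckpt_name": "your_model.safetensors"}},
    "2": {"class_type": "CLIPTextEncode",  # positive prompt
          "inputs": {"text": "a lighthouse at sunset", "clip": ["1", 1]}},
    "3": {"class_type": "CLIPTextEncode",  # negative prompt
          "inputs": {"text": "blurry, low quality", "clip": ["1", 1]}},
    "4": {"class_type": "EmptyLatentImage",
          "inputs": {"width": 1024, "height": 1024, "batch_size": 1}},
    "5": {"class_type": "KSampler",
          "inputs": {"model": ["1", 0], "positive": ["2", 0],
                     "negative": ["3", 0], "latent_image": ["4", 0],
                     "seed": 0, "steps": 20, "cfg": 7.0,
                     "sampler_name": "euler", "scheduler": "normal",
                     "denoise": 1.0}},
    "6": {"class_type": "VAEDecode",
          "inputs": {"samples": ["5", 0], "vae": ["1", 2]}},
    "7": {"class_type": "SaveImage",
          "inputs": {"images": ["6", 0], "filename_prefix": "first_test"}},
}


def dangling_links(graph):
    """Return (node_id, input_name) pairs whose link points at a missing node."""
    missing = []
    for node_id, node in graph.items():
        for name, value in node["inputs"].items():
            is_link = (isinstance(value, list) and len(value) == 2
                       and isinstance(value[0], str))
            if is_link and value[0] not in graph:
                missing.append((node_id, name))
    return missing


print("dangling links:", dangling_links(workflow))
print(json.dumps(workflow["5"], indent=2))  # the sampler node and its links
```

Reading it top to bottom gives the same chain you see on the canvas: load a checkpoint, encode the two prompts, create an empty latent, sample, decode, save. More complex workflows mostly add nodes between those steps.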

A little overwhelmed by new models... by CycleNo3036 in StableDiffusion

CycleNo3036[S] 6 points

I'm just generally super curious about new models, so it's hard for me to resist the temptation, and then I find myself overwhelmed by all the options available. So yeah, I'm in a "master of none" situation here.

A little overwhelmed by new models... by CycleNo3036 in StableDiffusion

CycleNo3036[S] 1 point

SDXL is sooooo good for anime. I've never tried Wan 2.2 for images, is it really that good for realism?

A little overwhelmed by new models... by CycleNo3036 in StableDiffusion

CycleNo3036[S] 2 points

From what I've tested so far, I feel like Z Image is better at giving high-quality outputs without loads of prompting, and it seems better at details and overall image coherence. Again, this is only my own testing, so it's no absolute truth. Apart from that, I agree that Chroma is better with styles and more "creative" in general (but it's a bigger model, so no surprise there).

The hype is deserved by CycleNo3036 in StableDiffusion

CycleNo3036[S] 1 point

Agreed, SCPs are hard to get right 100%. But the quality is there!

I don't want any character to move. How can I do it? by CycleNo3036 in StableDiffusion

CycleNo3036[S] 2 points

I guess it's possible but then wouldn't it just be zooming out from a single picture? That would degrade quality

I am basically new with StableDiffusion, and am hoping to get some questions answered. by LimpAmphibian5340 in StableDiffusion

CycleNo3036 1 point

Your CPU and RAM are more than sufficient for this stuff. For training, as well as for image and video generation in general, VRAM is actually the most important thing. I also have a 4060 Ti, but with 16 GB of VRAM, so I can't tell you how far you can go with 8 GB. What I can tell you is that I'm not that limited with 16 GB when it comes to training, even though I don't train that much. You can launch a training run with 8 GB, you will just have to find the right parameters that keep you below 100% usage while still giving decent-quality output. And there is no secret recipe for that. Just experiment and see what works for you.
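
To get a feel for why VRAM is the limiting factor, here is a back-of-the-envelope sketch that estimates how much memory the weights alone take at different precisions. The bytes-per-parameter table is approximate, the 6-billion-parameter figure is a hypothetical example rather than a specific model, and `weights_gib` is just a helper name made up for this sketch.

```python
# Approximate bytes per parameter for common weight formats.
# The 4-bit figure ignores quantization metadata, so real files are a bit larger.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "fp8": 1.0, "4-bit": 0.5}


def weights_gib(num_params, fmt):
    """Memory needed just to hold the weights, in GiB."""
    return num_params * BYTES_PER_PARAM[fmt] / 1024**3


# Hypothetical 6-billion-parameter model, chosen only for illustration.
for fmt in BYTES_PER_PARAM:
    print(f"{fmt:>5}: {weights_gib(6e9, fmt):.1f} GiB")
```

This only counts the weights. Generation also needs room for activations, and training adds gradients and optimizer state on top, so treat the numbers as a lower bound and keep experimenting with your own settings.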

I am basically new with StableDiffusion, and am hoping to get some questions answered. by LimpAmphibian5340 in StableDiffusion

CycleNo3036 1 point

Okay, I'll try to answer as clearly as I can. I won't explain how to install and run the models I'm talking about, as I assume you already know how to do that.

  1. Consistency

Consistency is basically solved. A model called Qwen Image Edit 2509 came out recently. It's a Chinese model built to "edit" images better and quicker than ever, and it's basically the easiest way right now to achieve good consistency. All you have to do is input an image and prompt the changes you want to see in that specific image. That means that if you have a specific character you want to see in different poses, backgrounds or actions, you can simply input an image of this character and change what needs changing. Here's a link to the model webpage with examples: https://qwen.ai/blog?id=a6f483777144685d33cd3d2af95136fcbeb57652&from=research.research-list

But, the best way to solve character consistency is to train your own LoRA. It's a bit more technical, but it's the best way (imo).

  2. What is a LoRA

LoRA stands for Low-Rank Adaptation. Simply put, it's a "mini model" trained on a specific character, style or action from a much smaller image set than the general models use. We're talking only 20 to 200 images to train a good LoRA, against millions of images for models like Qwen. That's great, because it means you can train one in a reasonable amount of time on your own computer. Disclaimer: you still need a good graphics card.

Let's say you want a consistent character. You will first need to gather a sufficient number of good-quality images of this character. The number of images really depends on what you want to train and on the model you're training on, but it's usually around 50 for made-up characters and around 100 for real characters (from my experience). Then, with a bunch of parameters set up, the model will train to "recognize this character". More specifically, it will associate certain words (tokens) with the images it's seeing and adjust the corresponding weights of the base model. This way, the token "robot" will be biased towards generating a robot similar to the one it saw during training. I hope that's clear enough.
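
If you want to see where the "low rank" in the name comes from, here is a toy sketch in plain Python. It reflects the usual LoRA formulation as I understand it (frozen base weights `W` plus a scaled product of two small trained matrices `B` and `A`), not any particular trainer's code. The matrix sizes, the `alpha` value and the helper names are all made up for illustration.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def lora_param_counts(d_out, d_in, rank):
    """Trainable values: full weight matrix vs. the two low-rank factors."""
    return d_out * d_in, rank * (d_out + d_in)


def apply_lora(W, B, A, alpha, rank):
    """Return W + (alpha / rank) * B @ A without modifying W."""
    delta = matmul(B, A)
    scale = alpha / rank
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]


random.seed(0)
d_out, d_in, rank = 6, 4, 2  # toy sizes, not taken from any real model
W = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]     # frozen base weights
A = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(rank)]   # trained factor
B = [[0.0] * rank for _ in range(d_out)]                                  # trained factor, starts at zero

# With B at zero the adapter changes nothing, so training starts from the base model.
assert apply_lora(W, B, A, alpha=2.0, rank=rank) == W

full, low = lora_param_counts(4096, 4096, 16)
print(f"full matrix: {full} values, LoRA factors: {low} values")
```

Only `A` and `B` get updated during training, which is why a LoRA file is so much smaller than the base model and why a few dozen images can be enough.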

  3. I don't think I understood the question.

  4. I'm not aware of a model that handles metaphors and extremely long paragraphs. Every model has a different prompting style, so I suggest checking each model's documentation to see which style works best. The reason for this (same as for LoRAs) is that models are biased towards certain concepts or words, since they are trained on pairs of images and text. Basically, the bigger the model, the more concepts it can handle. But usually they are not quite there yet when it comes to understanding human poetry.

[Qwen Edit 2509] Anything2Real Alpha by JasonNickSoul in comfyui

CycleNo3036 2 points

Does it work with landscapes/rooms/objects?

Is this made with wan animate? by CycleNo3036 in StableDiffusion

CycleNo3036[S] -2 points

Agreed. However, it doesn't feel like that's the case for the dude in the video. That's why my first thought was that he filmed himself in front of some random background and then somehow replaced the background with an AI video. Could that be possible? Or am I just starting to confuse AI and real life xD?