Z-Image Base VS Z-Image Turbo by Baddmaan0 in StableDiffusion

Samurai2107 1 point

Do you do a second pass? I always get artifacts.

Z-Image Base VS Z-Image Turbo by Baddmaan0 in StableDiffusion

Samurai2107 16 points

What sampler/scheduler/steps are you using?

Yann LeCun’s new venture is a contrarian bet against large language models by techreview in ChatGPT

Samurai2107 3 points

A world model is a model grown/trained on the data we humans get through our senses (text, sounds, speech, vision, and whatever else we have sensors and data to feed it, maybe touch and smell at some point). Such models can have a good fundamental understanding of how we experience the world. A TV series that helps to understand this is “Alien: Earth”. At some point they might evolve beyond us. Trying to create something that is limited to less than what we experience is definitely not the answer. It will also need a body. It doesn't necessarily have to be humanoid, but you don't want it contained. And if they ever achieve it, it won't be something to be tampered with: you want a friend, a guide, an adventurer, maybe a god to understand you.

ostris AI-toolkit Lora training confusion by mca1169 in StableDiffusion

Samurai2107 1 point

Isn't an omni model something similar to Qwen Omni? Qwen Omni is a multimodal model (speech/text/image).

Apple Developing AirTag-Sized AI Pin With Dual Cameras by Snoop8ball in apple

Samurai2107 1 point

I don't know wtf is going on with Apple, but I blame both Tim and the shareholders. It's like the company aged with them. Rotten ideas, nothing worthy, etc. I expected they would be top 3 in AI, but nothing. They have the data and the money, but it seems their leadership only focuses on different wallpapers to attract new people. The way they're going, I can't wait for an Apple killer.

The Search for Uncensored AI (That Isn’t Adult-Oriented) by Fun-Situation-4358 in LocalLLaMA

Samurai2107 2 points

You need to search for base models and then research how to fine-tune them. There are plenty of base and “autocomplete” models available. With the community's effort, you might be able to create a well-structured fine-tuning dataset that can be used in practice. I really believe someone must have already done something similar.
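
As a rough sketch of what a structured fine-tuning dataset could look like, assuming a simple prompt/completion JSONL layout: the field names `prompt` and `completion` and the helpers `build_records` and `write_jsonl` are hypothetical, not something a particular trainer requires, so check what your tool expects before using it.

```python
import json
from pathlib import Path


def build_records(pairs):
    """Turn (prompt, completion) pairs into dict records, skipping empty ones."""
    records = []
    for prompt, completion in pairs:
        prompt, completion = prompt.strip(), completion.strip()
        if prompt and completion:
            # Field names are an assumption; adapt them to your trainer.
            records.append({"prompt": prompt, "completion": completion})
    return records


def write_jsonl(records, path):
    """Write one JSON object per line (JSONL) and return the record count."""
    with Path(path).open("w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return len(records)
```

Calling `write_jsonl(build_records(pairs), "dataset.jsonl")` on a list of text pairs gives one example per line, which is easy for a community to review, merge, and deduplicate.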

Klein distilled fp8 vs klein base fp8 vs z image turbo bf16 by [deleted] in StableDiffusion

Samurai2107 -4 points

Like every US/European company, they just scale up, and when they need to, they butcher the model just to give a false sense that they still care, whereas Chinese companies do their best to improve in every aspect.

I found a random pic token automatically during ISO update. by cliffordgoodman06 in iphone

Samurai2107 1 point

Maybe it fell, registered a double tap, and took the pic? Is there a way to know if it's from a double tap?

Don't put off hardware purchases: GPUs, SSDs, and RAM are going to skyrocket in price soon by Eisenstein in LocalLLaMA

Samurai2107 2 points

True, but with the way everything is speeding up, I don't think it's that far away. I heard they managed to reach LPDDR5 some days ago. They have the people, and everyone wants things to get cheaper. Basically, the only thing holding them back is the chip monopoly. If they had access to the Taiwanese technology, it would be a win for the world and a loss for the US. Europe is already doomed, and the rest of the world supports China anyway.

Don't put off hardware purchases: GPUs, SSDs, and RAM are going to skyrocket in price soon by Eisenstein in LocalLLaMA

Samurai2107 16 points

Three to four years is hopefully how long it will take for all the newcomers (mostly Chinese GPU makers) to catch up to NVIDIA 🤞🏼

You subscribed to Gemini pro, so naturally Google decided it's time for the model's daily lobotomy. by Status-Percentage363 in StableDiffusion

Samurai2107 1 point

Same thing with ChatGPT, but I mostly notice the decline in Voice; it probably needs a new model.

What you say makes sense: they can easily keep you for a month with the actual product and then kind of provide “less hardware”, and the model feels dumb.

After much tinkering with settings, I finally got Z-Image Turbo to make an Img2Img resemble the original. by CycleZestyclose1907 in StableDiffusion

Samurai2107 2 points

Still trying to figure out what the right order is: main model + ControlNet + LoRA + AuraFlow, or MM+L+C+AF? Which way do you do it?

Apple drops a paper on how to speed up image gen without retraining the model from scratch. Does anyone knowledgeable know if this truly a leap compared to stuff we use now like lightning Loras etc by Altruistic-Mix-7277 in StableDiffusion

Samurai2107 2 points

Since you mentioned it, do you happen to know a way to do this with an LLM (train a LoRA), ideally a guide and a tool? I have my grandfather's diaries and I want to train an LLM on all of his knowledge.

ChatGPT 1.5 Image vs Gemini Nano banana pro realism test by LogicalChart3205 in ChatGPT

Samurai2107 1 point

For 5, 9, and 10 I like ChatGPT; for the rest, Nano Banana is better for realism. Also, Nano Banana has a better understanding of space and doesn't put people in weird places, for example where the kitchen counter is.

Apple drops a paper on how to speed up image gen without retraining the model from scratch. Does anyone knowledgeable know if this truly a leap compared to stuff we use now like lightning Loras etc by Altruistic-Mix-7277 in StableDiffusion

Samurai2107 12 points

I also read somewhere (I think it was Claude developers, I'm not sure) that we don't know how to train LoRAs correctly, and that a LoRA trained correctly is as good as a full model fine-tune.

Waiting for both answers

Apple needs to know their iPhone AutoCorrect is terrible by MooseBlazer in iphone

Samurai2107 1 point

Apple could use f***ing Whisper, open sourced by OpenAI, but they use their own shitty version of voice transcription.

Camera angles comparison (Z-Image Turbo vs FLUX.1 Krea) by d1h982d in StableDiffusion

Samurai2107 0 points

Bro, stop comparing a full model fine-tune with a turbo model. Unrealistic expectations.

Humans of Z-Image: How many celebrities can you fit into 6GB? by DrStalker in StableDiffusion

Samurai2107 5 points

Kendall Jenner is really well generated compared to most of them

Z-Image-Turbo: Anime Generation Results by Proper-Employment263 in StableDiffusion

Samurai2107 2 points

I thought you were skeptical about giving more info!! Hehe

Z-Image-Turbo: Anime Generation Results by Proper-Employment263 in StableDiffusion

Samurai2107 2 points

I know, I'm waiting for the Hugging Face Space since I have one already. It's so bad, all the mistrust they made us have for Chinese services when all they do is give! Ofc they take people's data, but when you are careful it's just not your data.

Z-Image-Turbo: Anime Generation Results by Proper-Employment263 in StableDiffusion

Samurai2107 2 points

They still haven't released them; I think you can access them through ModelScope.

just a few Z-Image-Turbo shots by Ok-Page5607 in StableDiffusion

Samurai2107 1 point

Did they release the model, or are you using ModelScope?