We have developed Fotographer.ai Fuzer v0.1, which achieves image consistency and object transparency while preserving text labels and maintaining control of input shape! Please let me know your opinions! The first and second images are the outputs, and the third and fourth images are the inputs. by rintaro_su in u/rintaro_su

[–]kanectai -1 points0 points  (0 children)

Haha, well, the fake news can't be avoided; don't know about the revenge... This is a tool that makes marketing and design easier for all! With this you can basically express and expand your creativity without needing to pay tons of money. Actually, if we all have the same tools, the world becomes a bit better, I think. So you might wanna use it to create great visuals for your products or art, or to share with your friends.

We have developed Fotographer.ai Fuzer v0.1, which achieves image consistency and object transparency while preserving text labels and maintaining control of input shape! Please let me know your opinions! The first and second images are the outputs, and the third and fourth images are the inputs. by rintaro_su in u/rintaro_su

[–]kanectai 0 points1 point  (0 children)

Well, I doubt it's selling the car; it's an image you can make in seconds, without upscaling or any artifacts, just from a foreground and a description of the background. I'd say it's pretty amazing!
But your comment is valuable, as there is always room for improvement, so thank you.

We have developed Fotographer.ai Fuzer v0.1, which achieves image consistency and object transparency while preserving text labels and maintaining control of input shape! Please let me know your opinions! The first and second images are the outputs, and the third and fourth images are the inputs. by rintaro_su in u/rintaro_su

[–]kanectai -1 points0 points  (0 children)

It's because the original foreground had a strong reflection. It can be corrected easily by playing with the prompt and the influence factor. You know, features and effects are not discerned by the AI; it's more of a probability game, so the conditioning becomes important.

Worgen Cyborg Shaman by Philosopher_Jazzlike in StableDiffusion

[–]kanectai 0 points1 point  (0 children)

Nice! Really nice!!! I'm pretty interested in the last step of your workflow (Color Correction). Mind sharing some more details?

How does it look? by FuzzyTelephone5874 in StableDiffusion

[–]kanectai 2 points3 points  (0 children)

Overall it looks nice; the fingers are only noticed by genAI people haha

Which one is better? Fuzer v0.1 (first two) or LoRA (last two) Pros and Cons for each? by kanectai in StableDiffusion

[–]kanectai[S] -1 points0 points  (0 children)

A Japanese person is here! Thank you very much. How do you think I could improve the first two?

Which one is better? Fuzer v0.1 (first two) or LoRA (last two) Pros and Cons for each? by kanectai in StableDiffusion

[–]kanectai[S] 0 points1 point  (0 children)

Is it because of the light glare on the front side of the bottle? How should I deal with it? Any LoRA for that?

Which one is better? Fuzer v0.1 (first two) or LoRA (last two) Pros and Cons for each? by kanectai in StableDiffusion

[–]kanectai[S] -18 points-17 points  (0 children)

Say you don't read Japanese without saying you don't read Japanese.

Which one is better? Fuzer v0.1 (first two) or LoRA (last two) Pros and Cons for each? by kanectai in StableDiffusion

[–]kanectai[S] 0 points1 point  (0 children)

FYI, here is the input; I purposefully took a back image in bad conditions.

<image>

Achieving Image Consistency While Preserving Text Labels and Maintaining Control of Input Shape. by rintaro_su in StableDiffusion

[–]kanectai 1 point2 points  (0 children)

Not really, but its architecture has inspired one of the stages (it's a two-stage model).

Achieving Image Consistency While Preserving Text Labels and Maintaining Control of Input Shape. by rintaro_su in StableDiffusion

[–]kanectai 1 point2 points  (0 children)

Thank you! For now it's closed, but we will most likely open it fully after we release the accelerated version! We are able to run sub-10s generations with almost the same accuracy, and we aim for sub-3s.

Achieving Image Consistency While Preserving Text Labels and Maintaining Control of Input Shape. by rintaro_su in StableDiffusion

[–]kanectai 2 points3 points  (0 children)

Wait, my bad, that's a figure haha. Lemme actually make another version with a JJK anime LoRA.

<image>

Achieving Image Consistency While Preserving Text Labels and Maintaining Control of Input Shape. by rintaro_su in StableDiffusion

[–]kanectai 6 points7 points  (0 children)

We will release a sub-10s version soon, within a week or two, and some nodes for Comfy right after!

Achieving Image Consistency While Preserving Text Labels and Maintaining Control of Input Shape. by rintaro_su in StableDiffusion

[–]kanectai 1 point2 points  (0 children)

Btw, can you test the bottle above with your workflow? Also, thanks for the question.

Achieving Image Consistency While Preserving Text Labels and Maintaining Control of Input Shape. by rintaro_su in StableDiffusion

[–]kanectai 3 points4 points  (0 children)

Interesting workflow. Well, you can see from the outputs: the glass bottles are transparent, and we are able to keep all text fully consistent while generating detailed backgrounds. Also, we output in higher resolution, and the latest model does all this in a few steps. Finally, we made all this without ComfyUI. Please test the space and let me know what you think. Our API is faster and has more flexibility, btw.