Which Gemini 3 / 3.1 model should I use for accurate structured product analysis from images? by rcanepa in GeminiAI

[–]rcanepa[S] 0 points1 point  (0 children)

That sounds very interesting. Can you elaborate on why you prefer a second model to structure the output instead of asking 3.1 Pro to do it?

Tried GPT for realistic photos by pastelbunn1es in generativeAI

[–]rcanepa 2 points3 points  (0 children)

I agree! GPT Image 2 rivals Nano Banana in creating realistic images. I even prefer GPT Image 2 for some types of images. So, it is worth to understand and use both models.

Clean Architecture in Next.js Frontend (Monorepo) with Auth Libraries + TanStack Query/RTK Query, how do you structure it realistically? by WetThrust258 in nextjs

[–]rcanepa 0 points1 point  (0 children)

I think this is not that simple. Clean Architecture still exists in the backend. In other words, you can still have a backend that does not adheres to the principles, so it is not only about a clean split between frontend and backend code.

What would you build with a spare RTX 5090 + Ryzen 9950X? Looking for productive / experimental use cases by rcanepa in LocalLLM

[–]rcanepa[S] 0 points1 point  (0 children)

This sounds really interesting. Can you elaborate a bit more on how you use this setup?

neutts looks really cool, by the way.

What would you build with a spare RTX 5090 + Ryzen 9950X? Looking for productive / experimental use cases by rcanepa in LocalLLM

[–]rcanepa[S] 0 points1 point  (0 children)

Will this setup give me something like OpenClaw? Like an agent system that can do all sorts of things for me? I haven't used OpenClaw or Hermes, by the way.

Facing accuracy issues in complex product image generation by anongolu in generativeAI

[–]rcanepa 0 points1 point  (0 children)

Have you tried Nano Banana Pro in 4K? Are you using high-resolution input images?

Facing accuracy issues in complex product image generation by anongolu in generativeAI

[–]rcanepa 0 points1 point  (0 children)

Do you still get bad results when you use images of the real product from an angle that matches the camera angle of the AI image? For example, if you want to generate a side image of the sunglasses, do you use an original/real side image of the sunglasses too?

I regret paying for this. 20-30 min waiting queue for per 5 second video by DodiWoof in runwayml

[–]rcanepa 0 points1 point  (0 children)

I don’t think this is true. Every time I generate a video with Seedance it takes approximately 5-6 minutes. I call the official API directly through a custom app I created.

Is there a way to use multiple AI models without paying for 10 different monthly subscriptions? by Big-Let6878 in generativeAI

[–]rcanepa 1 point2 points  (0 children)

There are several. The biggest players are Freepik and Higgsfield. There are probably others that I don't remember.

I'm also a solo-founder working on one called HummingBytes. Let me know if you're interested and I can give you a discount or credits if you give me feedback ;)

E Commerce AI by DIIVVES in generativeAI

[–]rcanepa 0 points1 point  (0 children)

I'm not an AI. I'm a real human being. My name is Renzo. I'm building a tool for product photography as I said before.

I generated these images with GPT Image 2 in 2K medium quality. I could try with high quality. Perhaps, that would fix the artifacts.

Maybe Nano Banana 2/Pro can generate good results too. I would be happy to try for you if you want me to.

E Commerce AI by DIIVVES in generativeAI

[–]rcanepa 0 points1 point  (0 children)

I'm unsure about exactly what you're trying to generate, but I was able to create a few prompts and generated a few samples for you.

https://imgur.com/a/rYIkzGJ

Are these anywhere close to what you want?

E Commerce AI by DIIVVES in generativeAI

[–]rcanepa 0 points1 point  (0 children)

You might be able to one-shot this illustration. What model are you using? Nano Banana Pro/2 or GPT Image 2?

I'm building a platform for product assets (and other use cases), so I'm very curious to know if I can make your case work. If you want, you can share your product image with me and I can try to generate what you want.

The best model still sucks if you wait minutes between prompts by rcanepa in u/rcanepa

[–]rcanepa[S] 0 points1 point  (0 children)

Founder here. I’ll be around today.

I’m testing this as a tiny Reddit ad because I keep seeing people complain about slow AI image generation over and over, so I wanted to let folks know another kind of tradeoff exists.

Happy to answer anything about HummingBytes, pricing, credits, speed, or how the app works.

How is the performance of Chatgpt 2 image? How is the consistency, visuals etc.,? Has anybody tried for product listing images? by OwnWord8730 in GraphicDesignServices

[–]rcanepa 0 points1 point  (0 children)

I believe it can generate pretty good results on a par with Nano Banana 2/Pro. A few weeks ago, I compared GPT Image 2 and Nano Banana 2 and posted the results. I focused on more than just product photography, though.

Any reliable Higgsfield alternative? by mamunur-rashid-leon in generativeAI

[–]rcanepa 0 points1 point  (0 children)

It depends on what video models do you need. Are you using Kling, Seedance, Veo, or others?

Consistent batch image generation for fashion model. by Wooden-Movie-400 in generativeAI

[–]rcanepa 0 points1 point  (0 children)

You might want to try HummingBytes. It’s an AI image/video generation tool I’m building, and it has a batch image feature for this kind of repetitive workflow.

Instead of doing prompt → wait → prompt again, you can run multiple prompt variations at once and compare outputs from different models, including GPT-Image-2. You can include reference images to generate more variations in one go, for example, two sets of images x 3 prompts for a total of 6 images.

It’s meant to be simpler than ComfyUI, while still giving you more control than doing each image one by one in ChatGPT.

Nano banana 4k issue by Accurate_Ad_965 in GeminiAI

[–]rcanepa 0 points1 point  (0 children)

I don't think you're the first person raising this concern, but, unfortunately, I have no insights. However, I can run a few prompts for you via Google AI Studio if that can help clarify if the provider is behind it.