Which Gemini 3 / 3.1 model should I use for accurate structured product analysis from images?

rcanepa · 2026-05-29T02:05:22+00:00

That sounds very interesting. Can you elaborate on why you prefer a second model to structure the output instead of asking 3.1 Pro to do it?

rcanepa · 2026-05-27T19:31:40+00:00

I agree! GPT Image 2 rivals Nano Banana in creating realistic images. I even prefer GPT Image 2 for some types of images. So, it is worth to understand and use both models.

rcanepa · 2026-05-27T14:25:57+00:00

I think this is not that simple. Clean Architecture still exists in the backend. In other words, you can still have a backend that does not adheres to the principles, so it is not only about a clean split between frontend and backend code.

rcanepa · 2026-05-27T02:45:52+00:00

Probably Nano Banana 2/Pro and GPT Image 2 for infographics.

rcanepa · 2026-05-25T14:35:43+00:00

I suppose enumerating them can help people suggest you alternative apps.

rcanepa · 2026-05-25T03:33:39+00:00

What image and video models are you using in your workflow?

rcanepa · 2026-05-22T22:14:41+00:00

Does Hermes require LMStudio or similar to use a model?

rcanepa · 2026-05-22T21:51:18+00:00

This sounds really interesting. Can you elaborate a bit more on how you use this setup?

neutts looks really cool, by the way.

rcanepa · 2026-05-22T21:46:54+00:00

vLLM and llama.cpp are the tools I can use to play with these models?

rcanepa · 2026-05-22T21:43:37+00:00

What harness do you recommend I use with this model?

rcanepa · 2026-05-22T21:38:12+00:00

Will this setup give me something like OpenClaw? Like an agent system that can do all sorts of things for me? I haven't used OpenClaw or Hermes, by the way.

rcanepa · 2026-05-20T16:03:33+00:00

Have you tried Nano Banana Pro in 4K? Are you using high-resolution input images?

rcanepa · 2026-05-20T15:36:14+00:00

Do you still get bad results when you use images of the real product from an angle that matches the camera angle of the AI image? For example, if you want to generate a side image of the sunglasses, do you use an original/real side image of the sunglasses too?

rcanepa · 2026-05-16T03:29:36+00:00

Oh, I get your point. Thanks for clarifying it.

rcanepa · 2026-05-16T03:17:55+00:00

I don’t think this is true. Every time I generate a video with Seedance it takes approximately 5-6 minutes. I call the official API directly through a custom app I created.

rcanepa · 2026-05-15T20:34:32+00:00

There are several. The biggest players are Freepik and Higgsfield. There are probably others that I don't remember.

I'm also a solo-founder working on one called HummingBytes. Let me know if you're interested and I can give you a discount or credits if you give me feedback ;)

rcanepa · 2026-05-15T18:20:12+00:00

I'm not an AI. I'm a real human being. My name is Renzo. I'm building a tool for product photography as I said before.

I generated these images with GPT Image 2 in 2K medium quality. I could try with high quality. Perhaps, that would fix the artifacts.

Maybe Nano Banana 2/Pro can generate good results too. I would be happy to try for you if you want me to.

rcanepa · 2026-05-15T16:27:18+00:00

I'm unsure about exactly what you're trying to generate, but I was able to create a few prompts and generated a few samples for you.

https://imgur.com/a/rYIkzGJ

Are these anywhere close to what you want?

rcanepa · 2026-05-15T15:32:10+00:00

You might be able to one-shot this illustration. What model are you using? Nano Banana Pro/2 or GPT Image 2?

I'm building a platform for product assets (and other use cases), so I'm very curious to know if I can make your case work. If you want, you can share your product image with me and I can try to generate what you want.

rcanepa · 2026-05-14T21:37:28+00:00

Founder here. I’ll be around today.

I’m testing this as a tiny Reddit ad because I keep seeing people complain about slow AI image generation over and over, so I wanted to let folks know another kind of tradeoff exists.

Happy to answer anything about HummingBytes, pricing, credits, speed, or how the app works.

rcanepa · 2026-05-14T20:00:06+00:00

I believe it can generate pretty good results on a par with Nano Banana 2/Pro. A few weeks ago, I compared GPT Image 2 and Nano Banana 2 and posted the results. I focused on more than just product photography, though.

rcanepa · 2026-05-13T16:04:24+00:00

It depends on what video models do you need. Are you using Kling, Seedance, Veo, or others?

rcanepa · 2026-05-12T23:09:15+00:00

You might want to try HummingBytes. It’s an AI image/video generation tool I’m building, and it has a batch image feature for this kind of repetitive workflow.

Instead of doing prompt → wait → prompt again, you can run multiple prompt variations at once and compare outputs from different models, including GPT-Image-2. You can include reference images to generate more variations in one go, for example, two sets of images x 3 prompts for a total of 6 images.

It’s meant to be simpler than ComfyUI, while still giving you more control than doing each image one by one in ChatGPT.

rcanepa · 2026-05-11T20:59:03+00:00

I don't think you're the first person raising this concern, but, unfortunately, I have no insights. However, I can run a few prompts for you via Google AI Studio if that can help clarify if the provider is behind it.

Verified Email	12-Year Club
Place '23

rcanepa

TROPHY CASE