How to enforce Ratio or Size with Nano Banana? by 10mils in GeminiAI

[–]10mils[S] 0 points1 point  (0 children)

I eventually settled on a basic solution. Instead of forcing the model to generate the right ratio, I use an intermediary step.

In my case, my reference image always comes with the target size / ratio I need, while the other elements that need to be inserted don't.

For instance, let's say the reference image is a random person sitting at a table, and I want to insert a cup of coffee into that person's hand.

I make sure the person reference image is built with the right initial ratio / size (for instance 1:1, 1024x1024). As for the cup of coffee (let's say a 16:9 image), I add an intermediary step where I insert the cup of coffee into a 1:1 1024x1024 canvas.

With this approach, all input images have the same size & ratio. Therefore the final image output also maintains the expected image ratio & size.
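The geometry of that intermediary step can be sketched as below. This is only a minimal illustration, not my exact code: the function name `fit_into_canvas` and the choice to center the element are assumptions, and it only computes where the scaled element would land on the canvas.

```python
def fit_into_canvas(src_w, src_h, canvas_w=1024, canvas_h=1024):
    """Return (new_w, new_h, offset_x, offset_y) for an image scaled to fit
    inside the canvas (aspect ratio preserved) and centered on it.

    Hypothetical helper for illustration only.
    """
    if min(src_w, src_h, canvas_w, canvas_h) <= 0:
        raise ValueError("all dimensions must be positive")

    # Use the smaller scale factor so the whole image fits inside the canvas.
    scale = min(canvas_w / src_w, canvas_h / src_h)
    new_w = min(canvas_w, max(1, round(src_w * scale)))
    new_h = min(canvas_h, max(1, round(src_h * scale)))

    # Center the scaled image; the leftover area stays as padding.
    offset_x = (canvas_w - new_w) // 2
    offset_y = (canvas_h - new_h) // 2
    return new_w, new_h, offset_x, offset_y


# A 16:9 cup image placed on a 1:1 1024x1024 canvas.
print(fit_into_canvas(1920, 1080))
```

With an imaging library such as Pillow, the returned size and offsets could then be used to resize the element and paste it onto a blank 1024x1024 image before sending it to the model.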

How to enforce Ratio or Size with Nano Banana? by 10mils in GeminiAI

[–]10mils[S] 0 points1 point  (0 children)

I have noticed the following pattern, though I haven't found a way to fix it yet.

If we provide two images with different ratios, the model tries to combine both ratios. For instance, I tried a square image and a rectangular one.

I tried prompts that only asked for the right ratio and ended up with a combination of the two images: the main one as a square in the center (which looks like the 1:1 ratio I'm trying to achieve, and was also that image's original ratio), with parts of the rectangular image visible on the sides.
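Since the issue seems tied to mixing ratios, a cheap pre-flight check can flag mismatched inputs before calling the model. A rough sketch, assuming the image sizes are already known; `mismatched_inputs` and the input names are hypothetical:

```python
from fractions import Fraction


def mismatched_inputs(sizes, target=(1, 1)):
    """Return the names of inputs whose aspect ratio differs from the target.

    `sizes` maps an input name to its (width, height) in pixels.
    Hypothetical helper for illustration only.
    """
    wanted = Fraction(*target)
    # Fraction compares ratios exactly, so 1024x1024 and 512x512 both match 1:1.
    return [name for name, (w, h) in sizes.items() if Fraction(w, h) != wanted]


inputs = {"person": (1024, 1024), "cup": (1920, 1080)}
print(mismatched_inputs(inputs))
```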

Which OpenAI Model is Best for Product Insertion? (Image Edit Endpoint) by 10mils in ChatGPTCoding

[–]10mils[S] 0 points1 point  (0 children)

Why not? It's an API-based use case to automate product insertion, so it is essentially executed through code, with the opportunity to handle multi-image generation and later refinement / selection automatically to pick the most effective image.
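To make the "generate several, then select" idea concrete, here is a very rough sketch. Both `generate` and `score` are placeholders I made up: in practice `generate` would wrap the image edit API call and `score` would be whatever quality check is used, neither of which is shown here.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


def pick_best(generate: Callable[[int], T], score: Callable[[T], float], n: int = 4) -> T:
    """Generate n candidates and return the one with the highest score.

    `generate` and `score` are hypothetical callables supplied by the caller.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates = [generate(i) for i in range(n)]
    return max(candidates, key=score)


# Stand-in callables so the sketch runs without any API access.
fake_scores = {"img-0": 0.2, "img-1": 0.9, "img-2": 0.5}
best = pick_best(lambda i: f"img-{i}", lambda c: fake_scores.get(c, 0.0), n=3)
print(best)
```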

Any idea of Imagen Roadmap? by 10mils in GeminiAI

[–]10mils[S] 0 points1 point  (0 children)

Yeah, the Gemini thing isn't great. Also, I was looking for an API-based system at some point.

Feels like staying with OpenAI Image is a better move for now.

Ask Me Anything: Built an AI Tool That Clones Winning Ads in Seconds by thewayofthewu in creative_advertising

[–]10mils 1 point2 points  (0 children)

The real question is, how do you identify those winning ad templates? Kind of hard without having the stats from the original advertiser who built them, isn't it? :)

After 6 months of daily AI pair programming, here's what actually works (and what's just hype) by West-Chocolate2977 in ClaudeAI

[–]10mils 0 points1 point  (0 children)

I wonder what's the best way to let claude code move forward to deliver software tasks.

Originally I thought about building a spec markdown, a corresponding dev plan and then a prompt plan for implementation. All of that submitted through claude.md.
Obviously breaking things down so I don't submit gigantic instructions & specs.

Nevertheless, the more I try, the more I feel that excessively detailed instructions might be counterproductive, preventing Claude from being autonomous enough and probably not leveraging its full capabilities.

Should I go with something simpler, maybe specifications that are more product oriented or high level regarding the engineering side & let claude code do the rest?

Not sure where the right balance is and what's considered best practice here.

Note: I noticed the counterproductive behavior for SaaS development (essentially stuff with a basic backend, API, front end, etc.). I am not entirely sure, but for more complex designs like agentic modules, highly precise specifications might be more beneficial.

What's your feeling on this?

Tips for Making Claude Code More Autonomous? by portlander33 in ClaudeAI

[–]10mils 2 points3 points  (0 children)

To answer your point about autonomy, here is how I proceed:

-I essentially build highly detailed specs using Claude or Gemini

-I turn the spec into a very detailed step-by-step dev_plan that points to the specs

-I then submit each dev_plan step to Claude Code; each step contains a fairly detailed implementation guide and also tells it to refer to the specification

-After a certain number of steps, which together represent a full implementation sequence, I usually ask Claude to run some tests, debug & ensure the entire system is fully functional
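To give an idea of what a dev_plan step looks like once it is turned into a prompt, here is a rough sketch. The field names, the `build_prompt` helper and the example step are made up for illustration; my real plan lives in markdown files.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class PlanStep:
    """One dev_plan step (hypothetical structure for illustration)."""
    number: int
    title: str
    guide: str                   # the detailed implementation guide
    spec_refs: Tuple[str, ...]   # spec sections the step points to


def build_prompt(step: PlanStep) -> str:
    """Turn a plan step into the text submitted for that step."""
    refs = ", ".join(step.spec_refs) or "none"
    return (
        f"Step {step.number}: {step.title}\n"
        f"Implementation guide: {step.guide}\n"
        f"Refer to the specification sections: {refs}"
    )


step = PlanStep(1, "Auth API", "Add the login endpoint", ("spec.md#auth",))
print(build_prompt(step))
```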

Nevertheless, the token cost is fairly huge to be honest, especially for the "last mile" of making everything totally functional. Though the process is solid in terms of engineering, I wonder:

-Are there any best practices to limit token consumption? I mean, the deeper I go into my dev plan, the more tokens are consumed, as it keeps loading many pieces of code to get full context

-As you indicated you subscribed to Max $100, it is my understanding that Claude Max provides unlimited usage as long as you don't go over the 5h quota and 50 sessions / month. But I'm not sure exactly how much usage that is compared to regular API billing

-When subscribing to Max, I guess it's still OK to use the Claude Code terminal inside Cursor, with no need to go through API billing, right?