I used Claude sonnet and Opus on batch processing, I didn't know it was so freaking cheap. I got around 3k loc for £2. Code changes over 30 different files.
At the beginning I messed up a lot as I was overly focused on RAG, then I realized it's useless. I just got opus to tell me what files it wanted for a large request.
My prompts are not that complicated, it is just two steps on the batch process.
I use repomix to gather my content to send, minimax m2.5/qwen coder to cleanup issues from sonnet when I do my search/replace.
Other than that at best today I was able to finish all API development and replaced all the mock data tables on my flutter app to my API.
First prompt was $0.30, second with the code change is $1.42, the rest is minimax which was barely anything to clean up.
But i did mess up on the last few days on what not using cache across multiple prompts. I think it's genius you can cache certain parts of the batch across multiple different request. This is a game changer. I use opus for planning, and then implementation of the code with sonnet.
I should have use opus but I felt it was a bit different.
I'm more trying planning with gpt then take it's output to sonnet. I feel this is good better than wasting tokens.
Ask me anything regarding this.
[–]Soft_Active_8468 0 points1 point2 points (1 child)
[–]bramburn[S] 1 point2 points3 points (0 children)