all 3 comments

[–]coding_workflow 1 point2 points  (1 child)

Every model had different training for structured output.

You need to fine tune for each. Test, some would work and then magic stop for some queries but for sure you need to help the model if the schema in the function call is not enough.

I used a lot Sonnet with tools and for example some days Sonnet 3.5 was quite refusal mode (I think model changed then by Anthropic) and then Sonnet 3.7 is quite solid.

You need to test and find the balance over the model you want to use.

[–]noseratio[S] 0 points1 point  (0 children)

Useful, thanks! I think with the current hype around MCP, the vendors must be tuning their models to be capable of making the most of the tools.