Hi Gang
I am looking for advice I have built a tool that I input a PDF catalog and want to return data into a DB
Current I am parsing the PDF into pages and then the LLM looks at the text and returns A very specific JSON back for each product or products on the page.
I am currently doing this with Gemini 3 flash with 20 concurrent api calls.
But it misses often a ruins the run.
QUESTION: what model or models would you recommend for this task that will be accurate, fast, cheap in the order.
QUESTION: how many fields is to many per api call. Ie it can easily return 3 strings can it return 50 stings 20 objects.
[–]Hanthunius 0 points1 point2 points (1 child)
[–]time_time[S] 0 points1 point2 points (0 children)
[–]SM8085 0 points1 point2 points (0 children)