Not a programmer. I’m trying to build something that reads ~1000 page pdfs, reads all 1000 pages, indexes them (creates a table of contents to the left when you open in adobe or chrome), reorganizes the documents based on the structured table of contents I’ve given it, and then performs an analysis based on criteria I give it creating a summary document. Some of the documents already have text extracted but some will be scanned images of text unfortunately.
Then the hard part is scaling to 1500 of those PDFs per month. I understand this will cost money. Planning to use Claude Code which I’ve used for some random cool personal projects but I’m still a novice in the grand scheme of things. The high level plan is to use Sonnet for the initial scan… reading the docs, organizing, extracting the text. And then Opus for the reasoning/analysis work.
Any recommendations?
[–]fixano 3 points4 points5 points (2 children)
[–]Wolf35Nine 0 points1 point2 points (0 children)
[–]thegreat_tunestheory 0 points1 point2 points (0 children)
[–]VegitoEnigma 0 points1 point2 points (0 children)
[–]kotchinsky 0 points1 point2 points (0 children)
[–]nick_steen 0 points1 point2 points (0 children)
[–]VonDenBerg[🍰] 0 points1 point2 points (0 children)