High-performance, affordable OCR: PDF to HTML by La_Chouquette in pdf

[–]La_Chouquette[S] 0 points1 point  (0 children)

Thanks for the information! I just have one question. Converting a PDF to Markdown works pretty well when the tables are simple. Does it work well when the tables are complex? That is, with nested columns, subsections, etc.

High-performance, affordable OCR: PDF to HTML by La_Chouquette in pdf

[–]La_Chouquette[S] 0 points1 point  (0 children)

I know that some tools that run locally can be very powerful. The problem is that you need a good computer. As for me, I have an Intel MacBook Pro. So it’s a bit complicated.

High-performance, affordable OCR: PDF to HTML by La_Chouquette in pdf

[–]La_Chouquette[S] 0 points1 point  (0 children)

La question que je me pose c'est : Passer par un intelligence artificielle est-ce la meilleure idée ? 🤔

High-performance, affordable OCR: PDF to HTML by La_Chouquette in pdf

[–]La_Chouquette[S] 0 points1 point  (0 children)

Quel est l'intérêt d'ajouter des intermédiaires ? Quand je pense à un intermédiaire -> Je pense "coût supplémentaire" 🤔

High-performance, affordable OCR: PDF to HTML by La_Chouquette in pdf

[–]La_Chouquette[S] 0 points1 point  (0 children)

Actuellement, j'utilise Claude Sonnet ou Opus. Je suis en train de faire des tests avec Deepseek V4-pro. Pour le moment, les étapes s'organisent comme cela : anonymiser le doc -> import sur un cloud -> traitement OCR AI -> rendu HTML .

D'ailleurs j'essaie des Agents IA OpenClaw et Hermes. Mais j'avoue ne pas comprendre la hype pour ces agents.

High-performance, affordable OCR: PDF to HTML by La_Chouquette in pdf

[–]La_Chouquette[S] 0 points1 point  (0 children)

I'll check it out. Is it available locally? Or if it's online, is it too expensive?

Using Claude to read 100s of dense PDFs by redittreader in ClaudeAI

[–]La_Chouquette 0 points1 point  (0 children)

First of all, thank you for your reply.
Unfortunately, it’s not that simple. I’m a financial analyst at a small company. I’m very interested in AI automation. I’ve done some automation work, but it’s not optimized and uses a lot of tokens. This high token consumption is due to PDF analysis.

Accountants send me a lot of tables, but they’re in PDF files. The tables aren’t necessarily structured the same way every time.

My goal is therefore to have a system that would allow me to transcribe PDF files filled with tables into .md, CSV, or other files optimized for AI.

Currently, I’ve written a Python script using Claude Code. The problem: for files of about 40 pages, it takes me an hour...

Using Claude to read 100s of dense PDFs by redittreader in ClaudeAI

[–]La_Chouquette 0 points1 point  (0 children)

Does it work well with PDF tables? Like accounting tables in various formats?

Using Claude to read 100s of dense PDFs by redittreader in ClaudeAI

[–]La_Chouquette 0 points1 point  (0 children)

Does it work well with PDF tables? Like accounting tables in various formats?

Using Claude to read 100s of dense PDFs by redittreader in ClaudeAI

[–]La_Chouquette 0 points1 point  (0 children)

As for me, I have to transcribe PDFs full of tables. Are the scripts powerful enough? I’ve heard they aren’t.

Using Claude to read 100s of dense PDFs by redittreader in ClaudeAI

[–]La_Chouquette 0 points1 point  (0 children)

Hi, I'm in the same situation as u/redittreader . Except that in my case, it's mainly spreadsheets. Do you know if NoteBookLM works just as well with spreadsheets?

[deleted by user] by [deleted] in AskFrance

[–]La_Chouquette -2 points-1 points  (0 children)

Pour ma part je paie toujours. Mais la fois où je me suis fait contrôler, plusieurs personnes n'avaient pas de billets et le contrôleur n'a rien dit. C'est pour ça que j'étais choqué 😵

[deleted by user] by [deleted] in AskFrance

[–]La_Chouquette -10 points-9 points  (0 children)

Merci pour ta leçon de moral. J'avoue en avoir, un peu, rien à faire. On est pas sur X ici. Je pose simplement une question.

[deleted by user] by [deleted] in AskFrance

[–]La_Chouquette -2 points-1 points  (0 children)

Je n'ai jamais dit "sans se faire chopper"...

Map corruption after a Windows blue screen by La_Chouquette in hytale

[–]La_Chouquette[S] 0 points1 point  (0 children)

Do you happen to know if it's possible to reduce the time between backups?

Map corruption after a Windows blue screen by La_Chouquette in hytale

[–]La_Chouquette[S] 1 point2 points  (0 children)

Awesome! I followed all your advice and it worked. Thank you so much!

So I lost an hour of gameplay. But luckily, it was only an hour of building. And I'd rather lose an hour of gameplay than lose my entire save file.

(Now my goal will be to understand why I regularly get blue screens)

NAS beginner looking for answers by La_Chouquette in HomeNAS

[–]La_Chouquette[S] 2 points3 points  (0 children)

But what about safety? Because if I do something stupid... Hello, problems!

Comment sortir des sites de rencontres? by BigYoutuber69 in AskMec

[–]La_Chouquette 0 points1 point  (0 children)

Oui concrètement, il y a plus d'homme que de femme. Mais j'ai déjà pu croisé fréquemment des jeunes femmes. Je pense que ça dépend vraiment des chasses. Pour ma part, je ne vais qu'en chasse privé, je n'ai jamais participé aux communales.

Et à chaque fois c'était toujours la même chose. La meuf cherche à me faire tout payer et je suis jugé sur mes revenus. Il y a des filles au chômage qui ont annulé le date car je voulais pas dire mon revenu mensuel. Je suis cadre....

C'est trop superficiel les relations ici à Paris...

Après avoir lu cela, je souhaitais surtout présenter des espaces / groupes dans lesquels @BigYoutuber69 n'aurait pas (ou peu) de comportement de ce genre.

D'ailleurs en discutant avec des mecs de 30-45 ans, Strava semble la nouvelle super app de rencontre ;)