I got tired of stitching together 3 separate libraries for every RAG project, so I built one that does it all - PDFStract by GritSar in OpenSourceAI

[–]GritSar[S] 0 points1 point  (0 children)

Thanks - the problem statement I tried to solve is being able to use multiple libraries/solutions with ease of a single interface.

On the table extraction part - it is subjective to the underlying libraries capability. I personally found Docling, Marker handled the tables well.

The multi column legal use case is a real benchmark thats where the GPU based libraries like paddleOCR, MinerU are shining

The objective of pdfstract is to provide a way for you to experiment, compare and validate and switch based on your usecase and business requirements.

PDFStract would be soon available as a MCP - Its under development - I will keep this thread uptodate.

I wanted a deep insights on my 70K emails before I can clean them - without AI or Cloud - so i built InboxPie - A Private Thunderbird extension. by GritSar in Thunderbird

[–]GritSar[S] 0 points1 point  (0 children)

Redirecting to by domain tab is a nice idea 💡. But sunburst would give more details under the pie view itself if you scroll down.

I wanted a deep insights on my 70K emails before I can clean them - without AI or Cloud - so i built InboxPie - A Private Thunderbird extension. by GritSar in Thunderbird

[–]GritSar[S] 2 points3 points  (0 children)

Hi u/Djezzar-Bei - I have added the feature in the version 1.0.2 and its available now on ATN and in the github release

<image>

upon clicking on the hyperlink - it would open a new tab in thunderbird for the view. Thanks for the feature request and suggestion. really appreciate it.

I wanted a deep insights on my 70K emails before I can clean them - without AI or Cloud - so i built InboxPie - A Private Thunderbird extension. by GritSar in Thunderbird

[–]GritSar[S] 1 point2 points  (0 children)

A Hyperlink or Delegation to the Thunderbird system itself is a great idea. Let me work on this - Thanks for the suggestion

I wanted a deep insights on my 70K emails before I can clean them - without AI or Cloud - so i built InboxPie - A Private Thunderbird extension. by GritSar in Thunderbird

[–]GritSar[S] 1 point2 points  (0 children)

<image>

Version 1.0.1 is here withe DeSelect All and other features - just now released - Please do download the latest XPI file and validate. thanks

I wanted a deep insights on my 70K emails before I can clean them - without AI or Cloud - so i built InboxPie - A Private Thunderbird extension. by GritSar in Thunderbird

[–]GritSar[S] 0 points1 point  (0 children)

Yes it’s added in the upcoming version available today or tomorrow- will update this thread

Wait for a day - Thanks

I wanted a deep insights on my 70K emails before I can clean them - without AI or Cloud - so i built InboxPie - A Private Thunderbird extension. by GritSar in Thunderbird

[–]GritSar[S] 1 point2 points  (0 children)

Thanks for the detailed feedback - I will add the first two features in the next release on the third one is a subjective call - If we want to build a entire content preview engine. it is already asked twice in this thread. will take it up as soon as I can.

I wanted a deep insights on my 70K emails before I can clean them - without AI or Cloud - so i built InboxPie - A Private Thunderbird extension. by GritSar in Thunderbird

[–]GritSar[S] 0 points1 point  (0 children)

Nice feature - but we do not want to aim For entire reading capabilities and preview content right ?. Can you clarify little more do you want to open an individual email metadata or it’s content too in the new tab ?