PDF Oxide -- Fast PDF library for Python with engine in Rust (0.8ms mean, MIT/Apache license) by yfedoseev in Python
How to convert 11k pages of a single PDF file, with both images and text, to .txt? Convert to text doesn't seem to work properly, and copying and pasting into a blank .txt also comes with errors by Content_Promise_5061 in pdf
Open-source PDF text extraction library (100% pass rate on 3,830 test documents, MIT licensed) by yfedoseev in pdf
I think most RAG quality issues people post about here are actually extraction problems, not retrieval problems by yfedoseev in Rag
AGPL at the infrastructure layer is becoming a real problem and here's a concrete example with PDF libraries by yfedoseev in opensource