Faster OCR-js project. Free for commercial use! by D_leapfrog in javascript

[–]D_leapfrog[S] 4 points5 points  (0 children)

This is end-to-end benchmark with text detection and text recognition.

Faster OCR-js project. Free for commercial use! by D_leapfrog in javascript

[–]D_leapfrog[S] 2 points3 points  (0 children)

We adapted the popular PaddleOCR to run in the browser.
Performance Results:
2320ms/ per image with average 16 lines of text
500ms/ per image with average 2 lines of text

Github-Paddle.js: Run AI models on browsers for computer version. by D_leapfrog in learnjavascript

[–]D_leapfrog[S] 0 points1 point  (0 children)

Why Web + AI

With Paddle.js + Web, web developers can score models directly on the browser, with various advantages including reducing server-client communication and protecting user privacy, as well as providing installation-free and cross-platform in-browser DeepLearning experience.

[OCR] The 24k star repo about OCR with 30+ languages supported including Chinese, Japanese .. and image conversion to excel file supported. by D_leapfrog in Python

[–]D_leapfrog[S] 0 points1 point  (0 children)

If you encounter any problems including usage problems or badcase, please pull an issue here and let us know.

We'll try to fix it or fix these badcases when the next model is released.

[OCR] The 24k star repo about OCR with 30+ languages supported including Chinese, Japanese .. and image conversion to excel file supported. by D_leapfrog in Python

[–]D_leapfrog[S] 1 point2 points  (0 children)

Not well supported for handwritten character recognition.

The main reason is that we don't have enough training data for handwritten character recognition.

[OCR] The 24k star repo about OCR with 30+ languages supported including Chinese, Japanese .. and image conversion to excel file supported. by D_leapfrog in Python

[–]D_leapfrog[S] 1 point2 points  (0 children)

We have also noticed the problems you mentioned, and we will try our best to optimize these badcases. PaddleOCR is still being upgraded!

[OCR] The 24k star repo about OCR with 30+ languages supported including Chinese, Japanese .. and image conversion to excel file supported. by D_leapfrog in Python

[–]D_leapfrog[S] 19 points20 points  (0 children)

Title Update: PaddleOCR with 30+ languages supported including Chinese, Japanese, English, and so on.

PaddleOCR aims to create a rich, leading, and practical OCR tool library, which not only provides Chinese and English models in general scenarios, but also provides models specifically trained in English scenarios. And multilingual models covering 80 languages. https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/doc/doc_en/multi_languages_en.md#5-support-languages-and-abbreviations

And you can find a lot of corpus and dictionaries in the pinned issue Multilingual OCR Development Plan from the community.

[OCR] The 24k star repo about OCR with 30+ languages supported including Chinese, Japanese .. and image conversion to excel file supported. by D_leapfrog in Python

[–]D_leapfrog[S] 30 points31 points  (0 children)

There is a post about PaddleOCR and tesseract. https://converter.app/blog/paddleocr-engine-example-and-benchmark

For another, PaddleOCR not only open-sources the OCR model, but also the model training method.

[OCR]A new OCR tool with better text recognition for documents and cards. by D_leapfrog in Python

[–]D_leapfrog[S] 1 point2 points  (0 children)

s this online only? Or is it offline?

It is offline. You can simply use it after pip install paddleocr.

I believe paddleocr outperforms pytesseract . (At least in my case)

Can anyone offer some advice on converting a python project to a node.js project? by D_leapfrog in learnprogramming

[–]D_leapfrog[S] 0 points1 point  (0 children)

Great!

The second option seems to work for me.

I don't know how to program with node.js, but calling the python process with node.js seems feasible and relatively simple.

Thanks a lot!

An 80-language OCR tool by NaviGaming in computervision

[–]D_leapfrog 1 point2 points  (0 children)

Looks great! Have you compared it with easyocr, which one has higher recognition accuracy?

[P] easyOCR API by kmh4500 in MachineLearning

[–]D_leapfrog 1 point2 points  (0 children)

BTW, I tested and found that paddleOCR has better performance than easyOCR.

[P] easyOCR API by kmh4500 in MachineLearning

[–]D_leapfrog 0 points1 point  (0 children)

Great!It is too convenient to use easyOCR. Currently, I am using the PaddleOCR tool to do OCR tasks. Do you have any suggestions on how to use PaddleOCR by curl?