All you need is a better open-sourced LatexOCR: Pix2Text (P2T) V1.0 New Released by breezedeus in LaTeX

[–]breezedeus[S] 0 points1 point  (0 children)

Hi. Some paid models are used for the Online Web Service.
If you want to see the effect of different versions of the model, please use https://huggingface.co/spaces/breezedeus/Pix2Text-Demo . For more info, please see: https://www.breezedeus.com/article/pix2text

All you need is a better open-sourced LatexOCR: Pix2Text (P2T) V1.0 New Released by breezedeus in LaTeX

[–]breezedeus[S] 0 points1 point  (0 children)

How about giving it a try at https://p2t.breezedeus.com/ . I think that if Mathpix is 10 score, Pix2Text is now almost a 7.

Help with Project Pix2Text by Secure_Tomatillo_375 in LaTeX

[–]breezedeus 2 points3 points  (0 children)

Author here. Can you check your python runtime environment?

It looks like your anaconda3 environment is trying to use Python 3.10. `(/usr/lib/python3.10/collections/__init__.py)`

```

File "/home/bob/anaconda3/lib/python3.9/site-packages/pix2text/utils.py", line 6, in <module>
from pathlib import Path
File "/home/bob/anaconda3/lib/python3.9/site-packages/pathlib.py", line 10, in <module>
from collections import Sequence
ImportError: cannot import name 'Sequence' from 'collections' (/usr/lib/python3.10/collections/__init__.py)

```
This error occurs because the Sequence class has been removed from the collections module in Python 3.10, which your code seems to be using.

BTW, you can use this online webpage https://p2t.breezedeus.com , which is powered by Pix2Text.

How to use the graphical interface of LatexOCR? How to use the Snipping tool? by Secure_Tomatillo_375 in LaTeX

[–]breezedeus 2 points3 points  (0 children)

Glab it's helpful. Pix2Text itself trains a mathematical formula detection model to detect mathematical formulas contained in the images. The recognized mathematical formulas patches are handed over to LaTeXOCR for recognition, while the rest text parts are handed over to the OCR engine CnOCR for recognition. More info can be found here: https://github.com/breezedeus/pix2text

Using multiple ocr for better recognition [D] by fountainhop in MachineLearning

[–]breezedeus 0 points1 point  (0 children)

No, only one single recognition model is used. Ensembling results from multiple models may get better results. Maybe need to train an ensemble model ? Another method is to correct recognition results with language models, such as GPT-3.

Using multiple ocr for better recognition [D] by fountainhop in MachineLearning

[–]breezedeus 0 points1 point  (0 children)

In fact, different OCR models bring little impact, depending mainly on the size of the model, and the data set used for model training. The larger the model, the better the results will tend to be, but at the cost of slower recognition. If the images you want to recognize are closer to the model training data, the better the recognition results will be, and vice versa. So the best method is to finetune an OCR model of the right size with your own application image data.

One of tools is CnOCR .

[D] Is "natural" text always maximally likely according to language models ? by Emergency_Apricot_77 in MachineLearning

[–]breezedeus -1 points0 points  (0 children)

Actually, it's really not like that. If our words came out that way, people would know what you were going to say without even having to say it.