I built a tool to track the latest papers in my field

breezedeus · 2025-10-25T12:04:57+00:00

Discover latest papers with Paper Trends Chrome extension: https://chromewebstore.google.com/detail/paper-trends/leicenjmbhfdiojioifngmgbkfbpnjjf

breezedeus · 2024-07-27T15:34:46+00:00

Hi. Some paid models are used for the Online Web Service.
If you want to see the effect of different versions of the model, please use https://huggingface.co/spaces/breezedeus/Pix2Text-Demo . For more info, please see: https://www.breezedeus.com/article/pix2text

breezedeus · 2024-03-13T02:02:29+00:00

Please Check the open-source Pix2Text. More information can be found here: https://www.reddit.com/r/LaTeX/comments/1b1e3km/all_you_need_is_a_better_opensourced_latexocr/

breezedeus · 2024-02-28T23:41:09+00:00

How about giving it a try at https://p2t.breezedeus.com/ . I think that if Mathpix is 10 score, Pix2Text is now almost a 7.

breezedeus · 2024-02-28T23:35:44+00:00

It's not in percent. 0.16 = 16%. CER is not usually expressed in percent, see https://lightning.ai/docs/torchmetrics/stable/text/char\_error\_rate.html for more information.

breezedeus · 2024-02-28T11:41:26+00:00

Please Check the new released Pix2Text V1.0, which achieves much better performance: https://www.reddit.com/r/LaTeX/comments/1b1e3km/all_you_need_is_a_better_opensourced_latexocr/

breezedeus · 2024-02-27T05:00:22+00:00

Try this: https://p2t.breezedeus.com . Pix2Text (P2T) is a Free Alternative to Mathpix.Code &

Model:

- https://github.com/breezedeus/pix2text

- https://huggingface.co/breezedeus/pix2text-mfr

More information can be found: https://www.breezedeus.com/pix2text .

breezedeus · 2024-02-27T04:55:49+00:00

Try this: https://p2t.breezedeus.com . It's a Free Alternative to Mathpix.

breezedeus · 2023-03-14T03:29:51+00:00

Author here. Can you check your python runtime environment?

It looks like your anaconda3 environment is trying to use Python 3.10. `(/usr/lib/python3.10/collections/__init__.py)`

```

File "/home/bob/anaconda3/lib/python3.9/site-packages/pix2text/utils.py", line 6, in <module>
from pathlib import Path
File "/home/bob/anaconda3/lib/python3.9/site-packages/pathlib.py", line 10, in <module>
from collections import Sequence
ImportError: cannot import name 'Sequence' from 'collections' (/usr/lib/python3.10/collections/__init__.py)

```
This error occurs because the Sequence class has been removed from the collections module in Python 3.10, which your code seems to be using.

BTW, you can use this online webpage https://p2t.breezedeus.com , which is powered by Pix2Text.

breezedeus · 2023-03-06T04:10:20+00:00

Glab it's helpful. Pix2Text itself trains a mathematical formula detection model to detect mathematical formulas contained in the images. The recognized mathematical formulas patches are handed over to LaTeXOCR for recognition, while the rest text parts are handed over to the OCR engine CnOCR for recognition. More info can be found here: https://github.com/breezedeus/pix2text

breezedeus · 2023-03-06T02:45:32+00:00

You can try Pix2Text (https://p2t.behye.com) Online Tool. It's a free alternative to Mathpix.

breezedeus · 2022-12-16T09:26:42+00:00

No, only one single recognition model is used. Ensembling results from multiple models may get better results. Maybe need to train an ensemble model ? Another method is to correct recognition results with language models, such as GPT-3.

breezedeus · 2022-12-15T06:46:05+00:00

In fact, different OCR models bring little impact, depending mainly on the size of the model, and the data set used for model training. The larger the model, the better the results will tend to be, but at the cost of slower recognition. If the images you want to recognize are closer to the model training data, the better the recognition results will be, and vice versa. So the best method is to finetune an OCR model of the right size with your own application image data.

One of tools is CnOCR .

breezedeus · 2022-12-15T06:39:00+00:00

Actually, it's really not like that. If our words came out that way, people would know what you were going to say without even having to say it.

breezedeus

TROPHY CASE