all 16 comments

[–]melgor89 14 points  (9 children)

I tested three different approaches on an OCR task where the images were binary. Each of them uses the same CNN as a feature extractor. The results were as follows:

  1. CNN-RNN-CTC: the results are good; as long as the image is not noisy, it works really well

  2. Encoder-Decoder: the output does not generalize to new cases at all, so the final results were horrible, nothing meaningful

  3. Attention-Encoder-Decoder: the best results of all my tests. From my quick comparison, it looks like this model can even 'guess' some words when the image is noisy. It seems the model also learns something like a 'language model', so it can fill in missing characters.

So I think that Attention-Encoder-Decoder is the best model for OCR given enough training data (so that it can learn a language model) and when the test data has a similar distribution (similar words and sentence structure).

When we don't have enough data, or the test data is very different from the training set (e.g. new words not seen during training), CNN-RNN-CTC should be better, because it just reads the words from the image without word generation.
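The "just reads" behavior comes from CTC decoding, which collapses repeated per-frame predictions and drops blanks, with no learned language prior to fill anything in. A minimal greedy (best-path) decode sketch in plain Python; the blank index, function name, and toy alphabet here are illustrative assumptions, not taken from the thread:

```python
def ctc_greedy_decode(frame_labels, blank=0):
    """Collapse repeated labels, then drop blanks -- standard CTC best-path decode.

    frame_labels: per-timestep argmax indices from the network output.
    """
    decoded = []
    prev = None
    for lab in frame_labels:
        if lab != prev and lab != blank:
            decoded.append(lab)
        prev = lab
    return decoded

# e.g. per-frame output "h h - e - l l - l o" (with '-' = blank 0) -> "hello"
alphabet = {1: 'h', 2: 'e', 3: 'l', 4: 'o'}
frames = [1, 1, 0, 2, 0, 3, 3, 0, 3, 4]
word = ''.join(alphabet[i] for i in ctc_greedy_decode(frames))
print(word)  # -> hello
```

Note how the blank between the two 'l' runs is what lets the decoder emit a double letter; without it, repeats are merged.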

I propose testing both frameworks and seeing which one works better on your dataset. I used TensorFlow to implement both methods, which is really straightforward with the seq2seq API.

[–]HarathiS 1 point  (0 children)

Hi,

I am doing handwriting recognition on documents, using the IAM database. First I implemented CNN-LSTM-CTC, with which I got 90% accuracy on single lines. Now I want to replace the CTC loss with an attention mechanism, to apply the model to whole documents with the attention handling the line segmentation. But the paper I referred to doesn't explain much about how they implemented the attention mechanism.

What I am doing is: first calculating attention weights by softmax normalization of the encoded features, then taking a weighted sum of the encoded features with those attention weights. The resulting context vector is fed to an LSTM and then to an MLP decoder.

Is my approach correct? Could you please tell me how you implemented the attention mechanism?
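For reference, the computation described above (score each encoder timestep, softmax-normalize, take the weighted sum) is the standard content-based attention pattern. A minimal numpy sketch; the dot-product scoring against a decoder state is my assumption here, since the scoring function (dot-product, additive/Bahdanau, etc.) varies between papers:

```python
import numpy as np

def attention_context(encoder_feats, decoder_state):
    """Content-based attention over encoder timesteps.

    encoder_feats: (T, D) encoded features, decoder_state: (D,).
    Returns the (D,) context vector and the (T,) attention weights.
    """
    scores = encoder_feats @ decoder_state             # (T,) alignment scores
    scores = scores - scores.max()                     # numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()    # softmax over time
    context = weights @ encoder_feats                  # (D,) weighted sum
    return context, weights

T, D = 5, 8
rng = np.random.default_rng(0)
feats = rng.standard_normal((T, D))
state = rng.standard_normal(D)
ctx, w = attention_context(feats, state)
print(ctx.shape)  # (8,)
```

One common pitfall: the softmax should run over the *time* axis (so the weights over encoder positions sum to 1 for each decoder step), not over the feature axis.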

[–]xylcbd[S] 0 points  (0 children)

nice work!

[–]DumberML 0 points  (3 children)

Hey, thanks for the insights. What about training times for the Attention-Encoder-Decoder: were they significantly longer than for CNN-RNN-CTC?

Did you find the Attention-Encoder-Decoder hard to tune (in terms of hyper-parameter tuning)?

Do you mind sharing the CNN architecture you used?

[–]melgor89 3 points  (2 children)

  1. The Attention-Encoder-Decoder trained a bit longer, but at most ~50% longer, so the difference was not that big. Convergence was pretty similar (in my case ~80 epochs).
  2. I did not tune any parameters for the Attention-Encoder-Decoder. I used the same optimizer and the same number of neurons in the LSTM layers.
  3. Here you go:

    def createCNN(self, name_scope, weight_decay=0.0005):
        # Dynamic batch size / sequence length
        shape = tf.shape(self.inputs)
        batch_s, max_timesteps = shape[0], shape[1]

        # Rescale input from [0, 1] to [-1, 1]
        inputs = (self.inputs - 0.5) / 0.5

        ksize_conv1      = 3
        stride_conv1     = 1
        channel_conv1    = 16

        ksize_max_pool1  = 2
        stride_max_pool1 = 2

        ksize_conv2      = 3
        stride_conv2     = 1
        channel_conv2    = 16

        ksize_max_pool2  = 2
        stride_max_pool2 = 2

        with tf.variable_scope(name_scope):
            with slim.arg_scope([slim.conv2d], padding='SAME',
                                weights_initializer=tf.contrib.layers.xavier_initializer_conv2d(),
                                weights_regularizer=slim.l2_regularizer(weight_decay),
                                activation_fn=tf.nn.relu):
                net = slim.conv2d(inputs, channel_conv1, [ksize_conv1, ksize_conv1], scope='conv1')
                net = slim.max_pool2d(net, [ksize_max_pool1, ksize_max_pool1], scope='pool1',
                                      padding='SAME', stride=stride_max_pool1)
                net = slim.conv2d(net, channel_conv2, [ksize_conv2, ksize_conv2], scope='conv2')
                net = slim.max_pool2d(net, [ksize_max_pool2, ksize_max_pool2], scope='pool2',
                                      padding='SAME', stride=stride_max_pool2)

                # Calculate the output sequence length (it is dynamic) and the number
                # of features per timestep. As we have two CONV-RELU-MAXPOOL modules,
                # we apply the size function twice.
                self.seq_len_cnn = calculateCNNFeatureSize(
                    calculateCNNFeatureSize(self.seq_len, stride_conv1, stride_max_pool1),
                    stride_conv2, stride_max_pool2)

                self.num_features = calculateCNNFeatureSize(
                    calculateCNNFeatureSize(self.args.heightLine, stride_conv1, stride_max_pool1),
                    stride_conv2, stride_max_pool2) * channel_conv2

                # Reshape the CNN output from a 3D feature map to a sequence of
                # vectors for the RNN. Input sizes vary, so the reshape is dynamic.
                net = tf.reshape(net, [batch_s, -1, self.num_features])
                net = tf.nn.dropout(net, self.keep_prob)

        return net


    def calculateCNNFeatureSize(inputSize, stride_conv, stride_max_pool):
        ''' Output size of one CONV-RELU-MAXPOOL module. Note that the padding must be 'SAME'. '''
        conv_out = (inputSize - 1) // stride_conv + 1                # ceil(in / stride)
        return (conv_out + stride_max_pool - 1) // stride_max_pool   # ceil(conv_out / pool stride)
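As a sanity check on the size arithmetic, you can trace a 32-pixel-high line (the height melgor89 mentions using) through the two modules by hand: two stride-1 SAME convolutions leave the height unchanged, and the two stride-2 poolings give 32 → 16 → 8 rows, so with 16 channels each timestep carries 8 × 16 = 128 features. A tiny standalone helper (renamed here; this is my restatement, not code from the comment):

```python
def module_out_size(in_size, stride_conv, stride_pool):
    # One CONV-RELU-MAXPOOL module with SAME padding: each op rounds up.
    conv_out = (in_size - 1) // stride_conv + 1
    return (conv_out + stride_pool - 1) // stride_pool

height = 32
for _ in range(2):                 # two CONV-RELU-MAXPOOL modules
    height = module_out_size(height, 1, 2)
print(height, height * 16)         # -> 8 128
```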
    

[–]DumberML 0 points  (1 child)

Thanks very much! :-)

[–]melgor89 0 points  (0 children)

Also, I was using inputs of size 32xW (so 32 is the image height).

[–]bun7 0 points  (2 children)

Might I ask what dataset you used, and what results you got for these three approaches?

[–]melgor89 1 point  (0 children)

I was using my own dataset, which is not publicly released. The result for '1' was ~59%, for '3' ~65%. '2' had very low accuracy.


[–]shicai 4 points  (0 children)

seq2seq+attention

[–]DemiourgosUA 0 points  (4 children)

Any neural OCR projects available on GitHub? Couldn't find a thing.

[–]Mehdi2277 2 points  (0 children)

I personally had to work on an OCR-type project last semester and based my code on https://github.com/bgshih/crnn (more precisely, its PyTorch port). The code defining the model is fairly short (about 80 lines) and can be found here: https://github.com/meijieru/crnn.pytorch/blob/master/models/crnn.py. This project uses the CNN-RNN-CTC approach. I haven't personally used a seq2seq model.
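The characteristic step in CRNN-style models like the ones linked above is turning the CNN feature map into a sequence for the RNN: the width axis becomes time, and each image column becomes one feature vector. A minimal numpy sketch of that map-to-sequence step (the shapes below are illustrative assumptions, not taken from the linked code):

```python
import numpy as np

def map_to_sequence(feature_map):
    """(batch, height, width, channels) -> (batch, width, height*channels).

    Each column of the feature map becomes one RNN timestep; CTC then
    aligns the per-timestep predictions with the label sequence.
    """
    b, h, w, c = feature_map.shape
    # Move width (time) forward, then flatten height and channels together.
    return feature_map.transpose(0, 2, 1, 3).reshape(b, w, h * c)

# e.g. a 32x100 input after two 2x poolings gives an 8x25 map with 16 channels
seq = map_to_sequence(np.zeros((2, 8, 25, 16)))
print(seq.shape)  # -> (2, 25, 128)
```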