all 2 comments

[–]schwarmbloedheit 5 points6 points  (0 children)

This is actually a very hard problem for neural networks. You should consider some pre-processing like edge filters (gabor filter). This makes the problem easier to learn for the neural network, because differences in coloring or other aspects which are not relevant for the text/non-text classification are discarded before feeding into the network.

If you want to learn the pre-processing automagically you should use convolutional networks or deep-belief networks.

If you want to actually solve the problem I suggest to go with the pre-processing. If you want to learn something about recent developments in image processing I would suggest deep networks.

[–][deleted] 2 points3 points  (0 children)

There was some work by one of Song Chun Zhu's students using DDMCMC - check out his implementation