OCR

Oct 9, 2015


Papers

Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

End-to-End Text Recognition with Convolutional Neural Networks

Word Spotting and Recognition with Embedded Attributes

Reading Text in the Wild with Convolutional Neural Networks

Deep structured output learning for unconstrained text recognition

  • intro: “propose an architecture consisting of a character sequence CNN and an N-gram encoding CNN which act on an input image in parallel and whose outputs are utilized along with a CRF model to recognize the text content present within the image.”
  • arxiv: http://arxiv.org/abs/1412.5903

Deep Features for Text Spotting

Reading Scene Text in Deep Convolutional Sequences

DeepFont: Identify Your Font from An Image

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Recursive Recurrent Nets with Attention Modeling for OCR in the Wild

Writer-independent Feature Learning for Offline Signature Verification using Deep Convolutional Neural Networks

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

End-to-End Interpretation of the French Street Name Signs Dataset

End-to-End Subtitle Detection and Recognition for Videos in East Asian Languages via CNN Ensemble with Near-Human-Level Performance

Smart Library: Identifying Books in a Library using Richly Supervised Deep Scene Text Reading

Improving Text Proposals for Scene Images with Fully Convolutional Networks

  • intro: Universitat Autonoma de Barcelona (UAB) & University of Florence
  • intro: International Conference on Pattern Recognition (ICPR) - DLPR (Deep Learning for Pattern Recognition) workshop
  • arxiv: https://arxiv.org/abs/1702.05089

Scene Text Eraser

https://arxiv.org/abs/1705.02772

Attention-based Extraction of Structured Information from Street View Imagery

STN-OCR: A single Neural Network for Text Detection and Text Recognition

Text Detection

Object Proposals for Text Extraction in the Wild

Text-Attentional Convolutional Neural Networks for Scene Text Detection

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

Synthetic Data for Text Localisation in Natural Images

Scene Text Detection via Holistic, Multi-Channel Prediction

Detecting Text in Natural Image with Connectionist Text Proposal Network

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection

Detecting Oriented Text in Natural Images by Linking Segments

Deep Direct Regression for Multi-Oriented Scene Text Detection

Cascaded Segmentation-Detection Networks for Word-Level Text Spotting

https://arxiv.org/abs/1704.00834

Text-Detection-using-py-faster-rcnn-framework

WordFence: Text Detection in Natural Images with Border Awareness

SSD-text detection: Text Detector

R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

R-PHOC: Segmentation-Free Word Spotting using CNN

Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks

https://arxiv.org/abs/1707.03985

EAST: An Efficient and Accurate Scene Text Detector

Text Recognition

Sequence to sequence learning for unconstrained scene text recognition

Drawing and Recognizing Chinese Characters with Recurrent Neural Network

Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition

Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition

Visual attention models for scene text recognition

https://arxiv.org/abs/1706.01487

Breaking Captcha

Using deep learning to break a Captcha system

Breaking reddit captcha with 96% accuracy

I’m not a human: Breaking the Google reCAPTCHA

Neural Net CAPTCHA Cracker

Recurrent neural networks for decoding CAPTCHAS

Reading irctc captchas with 95% accuracy using deep learning

端到端的OCR:基于CNN的实现

I Am Robot: (Deep) Learning to Break Semantic Image CAPTCHAs

SimGAN-Captcha

Handwritten Recognition

High Performance Offline Handwritten Chinese Character Recognition Using GoogLeNet and Directional Feature Maps

Recognize your handwritten numbers

https://medium.com/@o.kroeger/recognize-your-handwritten-numbers-3f007cbe46ff#.jllz62xgu

Handwritten Digit Recognition using Convolutional Neural Networks in Python with Keras

MNIST Handwritten Digit Classifier

如何用卷积神经网络CNN识别手写数字集?

LeNet – Convolutional Neural Network in Python

Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

MLPaint: the Real-Time Handwritten Digit Recognizer

Training a Computer to Recognize Your Handwriting

https://medium.com/@annalyzin/training-a-computer-to-recognize-your-handwriting-24b808fb584#.gd4pb9jk2

Using TensorFlow to create your own handwriting recognition engine

Building a Deep Handwritten Digits Classifier using Microsoft Cognitive Toolkit

Hand Writing Recognition Using Convolutional Neural Networks

Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling

Plate Recognition

Reading Car License Plates Using Deep Convolutional Neural Networks and LSTMs

Number plate recognition with Tensorflow

end-to-end-for-plate-recognition

Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN

  • intro: International Workshop on Advanced Image Technology, January, 8-10, 2017. Penang, Malaysia. Proceeding IWAIT2017
  • arxiv: https://arxiv.org/abs/1701.06439

License Plate Detection and Recognition Using Deeply Learned Convolutional Neural Networks

Adversarial Generation of Training Examples for Vehicle License Plate Recognition

https://arxiv.org/abs/1707.03124

Blogs

Applying OCR Technology for Receipt Recognition

Hacking MNIST in 30 lines of Python

Optical Character Recognition Using One-Shot Learning, RNN, and TensorFlow

https://blog.altoros.com/optical-character-recognition-using-one-shot-learning-rnn-and-tensorflow.html

Creating a Modern OCR Pipeline Using Computer Vision and Deep Learning

https://blogs.dropbox.com/tech/2017/04/creating-a-modern-ocr-pipeline-using-computer-vision-and-deep-learning/

Projects

ocropy: Python-based tools for document analysis and OCR

Extracting text from an image using Ocropus

CLSTM : A small C++ implementation of LSTM networks, focused on OCR

OCR text recognition using tensorflow with attention

Digit Recognition via CNN: digital meter numbers detection

Attention-OCR: Visual Attention based OCR

umaru: An OCR-system based on torch using the technique of LSTM/GRU-RNN, CTC and referred to the works of rnnlib and clstm

Tesseract.js: Pure Javascript OCR for 62 Languages

DeepHCCR: Offline Handwritten Chinese Character Recognition based on GoogLeNet and AlexNet (With CaffeModel)

deep ocr: make a better chinese character recognition OCR than tesseract

https://github.com/JinpengLI/deep_ocr

Practical Deep OCR for scene text using CTPN + CRNN

https://github.com/AKSHAYUBHAT/DeepVideoAnalytics/blob/master/notebooks/OCR/readme.md

Videos

LSTMs for OCR

Resources

Deep Learning for OCR

https://github.com/hs105/Deep-Learning-for-OCR

Scene Text Localization & Recognition Resources

Scene Text Localization & Recognition Resources

awesome-ocr: A curated list of promising OCR resources

https://github.com/wanghaisheng/awesome-ocr