Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Face_recognition | 47,583 | 45 | 16 days ago | 21 | February 20, 2020 | 707 | mit | Python | ||
The world's simplest facial recognition api for Python and the command line | ||||||||||
Easyocr | 17,453 | 37 | 2 days ago | 30 | June 02, 2022 | 196 | apache-2.0 | Python | ||
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. | ||||||||||
Flair | 12,611 | 24 | 52 | 14 hours ago | 27 | May 20, 2022 | 76 | other | Python | |
A very simple framework for state-of-the-art Natural Language Processing (NLP) | ||||||||||
Computervision Recipes | 8,817 | 2 months ago | 65 | mit | Jupyter Notebook | |||||
Best Practices, code samples, and documentation for Computer Vision. | ||||||||||
Jetson Inference | 6,243 | 2 days ago | 938 | mit | C++ | |||||
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson. | ||||||||||
Deepface | 5,875 | 3 | 6 days ago | 74 | May 10, 2022 | 6 | mit | Python | ||
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python | ||||||||||
Lstm Human Activity Recognition | 3,074 | 5 months ago | 19 | mit | Jupyter Notebook | |||||
Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier | ||||||||||
Eagleeye | 3,045 | a year ago | 18 | wtfpl | Python | |||||
Stalk your Friends. Find their Instagram, FB and Twitter Profiles using Image Recognition and Reverse Image Search. | ||||||||||
Crnn | 1,670 | 4 years ago | 85 | mit | Lua | |||||
Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition. | ||||||||||
Anago | 1,428 | 5 | a year ago | 14 | July 17, 2018 | 43 | mit | Python | ||
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on. |
This software implements the Convolutional Recurrent Neural Network (CRNN), a combination of CNN, RNN and CTC loss for image-based sequence recognition tasks, such as scene text recognition and OCR. For details, please refer to our paper http://arxiv.org/abs/1507.05717.
UPDATE Mar 14, 2017 A Docker file has been added to the project. Thanks to @varun-suresh.
UPDATE May 1, 2017 A PyTorch port has been made by @meijieru.
UPDATE Jun 19, 2017 For an end-to-end text detector+recognizer, check out the CTPN+CRNN implementation by @AKSHAYUBHAT.
The software has only been tested on Ubuntu 14.04 (x64). CUDA-enabled GPUs are required. To build the project, first install the latest versions of Torch7, fblualib and LMDB. Please follow their installation instructions respectively. On Ubuntu, lmdb can be installed by apt-get install liblmdb-dev
.
To build the project, go to src/
and execute sh build_cpp.sh
to build the C++ code. If successful, a file named libcrnn.so
should be produced in the src/
directory.
A demo program can be found in src/demo.lua
. Before running the demo, download a pretrained model from here. Put the downloaded model file crnn_demo_model.t7
into directory model/crnn_demo/
. Then launch the demo by:
th demo.lua
The demo reads an example image and recognizes its text content.
Example image:
Expected output:
Loading model...
Model loaded from ../model/crnn_demo/model.t7
Recognized text: available (raw: a-----v--a-i-l-a-bb-l-e---)
Another example:
Recognized text: shakeshack (raw: ss-h-a--k-e-ssh--aa-c--k--)
The pretrained model can be used for lexicon-free and lexicon-based recognition tasks. Refer to the functions recognizeImageLexiconFree
and recognizeImageWithLexicion
in file utilities.lua
for details.
Follow the following steps to train a new model on your own dataset.
tool/create_dataset.py
. Refer to the function createDataset
for details (need to pip install lmdb
first).model/
. For example, model/foo_model
. Then create
configuraton file config.lua
under the model directory. You can copy model/crnn_demo/config.lua
and do modifications.src/
and execute th main_train.lua ../models/foo_model/
. Model snapshots and logging file will be saved into the model directory.docker build -t crnn_docker .
nvidia-docker run -it crnn_docker
.Please cite the following paper if you are using the code/model in your research paper.
@article{ShiBY17,
author = {Baoguang Shi and
Xiang Bai and
Cong Yao},
title = {An End-to-End Trainable Neural Network for Image-Based Sequence Recognition
and Its Application to Scene Text Recognition},
journal = {{IEEE} Trans. Pattern Anal. Mach. Intell.},
volume = {39},
number = {11},
pages = {2298--2304},
year = {2017}
}
The authors would like to thank the developers of Torch7, TH++, lmdb-lua-ffi and char-rnn.
Please let me know if you encounter any issues.