Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Text Detection Ctpn | 3,310 | 4 months ago | 299 | mit | Python | |||||
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network | ||||||||||
East | 2,831 | a year ago | 256 | gpl-3.0 | C++ | |||||
A tensorflow implementation of EAST text detector | ||||||||||
Tensorflow_psenet | 401 | 3 years ago | 8 | mit | C++ | |||||
This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog: | ||||||||||
East_icpr | 150 | 5 years ago | 1 | gpl-3.0 | Python | |||||
Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE | ||||||||||
Text Detection Using Yolo Algorithm In Keras Tensorflow | 122 | 2 months ago | 6 | mit | Jupyter Notebook | |||||
Implemented the YOLO algorithm for scene text detection in keras-tensorflow (No object detection API used) The code can be tweaked to train for a different object detection task using YOLO. | ||||||||||
Shintb | 89 | 6 years ago | 16 | Python | ||||||
Textboxes : Image Text Detection Model : python package (tensorflow) | ||||||||||
Differentiablebinarization | 49 | 3 years ago | 4 | Python | ||||||
DB (Real-time Scene Text Detection with Differentiable Binarization) implementation in Keras and Tensorflow | ||||||||||
Psenet Tensorflow | 41 | 3 years ago | 7 | Python | ||||||
TensorFlow implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Networkt) | ||||||||||
Videotext | 25 | 5 years ago | 4 | |||||||
Text detection and recognition in natural videos | ||||||||||
Crnn_ctpn | 18 | 3 years ago | Python | |||||||
Unified text detection and recognition |
This is a tensorflow re-implementation of EAST: An Efficient and Accurate Scene Text Detector. The features are summarized blow:
Thanks for the author's (@zxytim) help! Please cite his paper if you find this useful.
If you want to train the model, you should provide the dataset path, in the dataset path, a separate gt text file should be provided for each image and run
python multigpu_train.py --gpu_list=0 --input_size=512 --batch_size_per_gpu=14 --checkpoint_path=/tmp/east_icdar2015_resnet_v1_50_rbox/ \
--text_scale=512 --training_data_path=/data/ocr/icdar2015/ --geometry=RBOX --learning_rate=0.0001 --num_readers=24 \
--pretrained_model_path=/tmp/resnet_v1_50.ckpt
If you have more than one gpu, you can pass gpu ids to gpu_list(like --gpu_list=0,1,2,3)
Note: you should change the gt text file of icdar2015's filename to img_*.txt instead of gt_img_*.txt(or you can change the code in icdar.py), and some extra characters should be removed from the file. See the examples in training_samples/
If you've downloaded the pre-trained model, you can setup a demo server by
python3 run_demo_server.py --checkpoint-path /tmp/east_icdar2015_resnet_v1_50_rbox/
Then open http://localhost:8769 for the web demo. Notice that the URL will change after you submitted an image.
Something like ?r=49647854-7ac2-11e7-8bb7-80000210fe80
appends and that makes the URL persistent.
As long as you are not deleting data in static/results
, you can share your results to your friends using
the same URL.
URL for example below: http://east.zxytim.com/?r=48e5020a-7b7f-11e7-b776-f23c91e0703e
run
python eval.py --test_data_path=/tmp/images/ --gpu_list=0 --checkpoint_path=/tmp/east_icdar2015_resnet_v1_50_rbox/ \
--output_dir=/tmp/
a text file will be then written to the output path.
Here are some test examples on icdar2015, enjoy the beautiful text boxes!
Please let me know if you encounter any issues(my email [email protected] dot com).