Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Zh Ner Tf | 1,761 | 3 years ago | 67 | Python | ||||||
A very simple BiLSTM-CRF model for Chinese Named Entity Recognition 中文命名实体识别 (TensorFlow) | ||||||||||
Deep_ocr | 1,452 | 5 years ago | 28 | Python | ||||||
make a better chinese character recognition OCR than tesseract | ||||||||||
Scenereco | 908 | 5 years ago | 32 | Python | ||||||
ctpn+crnn Scene character recognition | ||||||||||
Evil | 623 | 5 years ago | 2 | October 22, 2018 | 3 | mit | Swift | |||
Optical Character Recognition in Swift for iOS&macOS. 银行卡、身份证、门牌号光学识别 | ||||||||||
Javaverify | 204 | 8 years ago | 3 | Java | ||||||
A Java CAPTCHA recognition library for sticky characters | ||||||||||
Chinese Character Recognition | 182 | 6 years ago | Python | |||||||
This project shows how to use CNN to perform Chinese character recognition, a much more complicated task compared to MNIST digit recognition. | ||||||||||
Ssocr | 172 | 4 months ago | gpl-3.0 | C | ||||||
Seven Segment Optical Character Recognition | ||||||||||
Ocr Ios Example | 157 | 10 years ago | 4 | Objective-C | ||||||
A simple example of how to do optical character recognition (OCR) on iOS. | ||||||||||
Simpsonrecognition | 128 | 6 years ago | 5 | Jupyter Notebook | ||||||
Detect and recognize The Simpsons characters using Keras and Faster R-CNN | ||||||||||
Gocarina | 124 | 5 years ago | 1 | August 27, 2018 | 1 | Go | ||||
simple Optical Character Recognition in Go |
This project aims to extract tables from scanned image PDFs using Optical Character Recognition.
Tesseract OCR
sudo apt-get install tesseract-ocr
Imagemagick
sudo apt-get install imagemagick
PDF Utilities
sudo apt-get install poppler-utils
Python packages
sudo pip install -r requirements.txt
Clear the pdf/ folder and copy all your pdf files to be scanned in it.
Run the OCR:
python3 shellocr.py
The scanned text files shall be available in the txt/ folder once the process completes.
If the above doesn't work for you, try the alternate method.
Save your file as input.pdf in the root directory.
Run
python3 pdf_miner.py