Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Books | 131 | 3 years ago | ||||||||
Digital some old math books and build as ebook | ||||||||||
Ocr2text | 67 | 2 years ago | 5 | mit | Python | |||||
Convert a PDF via OCR to a TXT file in UTF-8 encoding | ||||||||||
Dedoc | 49 | 3 months ago | 10 | November 24, 2023 | 1 | apache-2.0 | Python | |||
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Document logical extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser) | ||||||||||
Saram | 48 | 5 years ago | 9 | March 12, 2018 | 4 | mit | Python | |||
Get OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with auto rotation for wrong orientation. PYPI: | ||||||||||
Sar_tf | 36 | 4 years ago | 7 | Python | ||||||
This is an implementation of Show, Attend and Read with tensorflow | ||||||||||
Ocr Pipeline | 31 | 7 years ago | 4 | other | Python | |||||
Convert a corpus of PDF to clean text files on a distributed architecture | ||||||||||
Pdftotxt | 29 | 4 years ago | 3 | mit | Python | |||||
Python code to read text from a PDF file (OCR). | ||||||||||
Pdf2txt | 16 | 8 years ago | 2 | lgpl-3.0 | Visual Basic | |||||
Batch convert PDF files to text under Windows, using several text extraction methods or OCR | ||||||||||
Dango Ocr | 15 | 3 years ago | Python | |||||||
DangoOCR: screenshot OCR recognize 文字识别,支持多种语言,识别后翻译,播放声音 | ||||||||||
Genete_ocr_data | 7 | 4 years ago | 1 | Python | ||||||
ocr data ,detect data ,recognize data |