Pdf2txt

Batch convert PDF files to text under Windows, using several text extraction methods or OCR
Alternatives To Pdf2txt
Books
Stars: 131; most recent commit: 3 years ago
Digitize some old math books and build them as ebooks

Ocr2text
Stars: 67; most recent commit: 2 years ago; open issues: 5; license: MIT; language: Python
Convert a PDF via OCR to a TXT file in UTF-8 encoding

Dedoc
Stars: 49; most recent commit: 3 months ago; total releases: 10; latest release: November 24, 2023; open issues: 1; license: Apache-2.0; language: Python
Dedoc is a library (service) for automated document parsing and conversion to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Document logical extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser)

Saram
Stars: 48; most recent commit: 5 years ago; total releases: 9; latest release: March 12, 2018; open issues: 4; license: MIT; language: Python
Get OCR output in TXT form from an image or PDF, with support for multiple files from a directory, using pytesseract with automatic rotation to correct wrong orientation. PYPI:

Sar_tf
Stars: 36; most recent commit: 4 years ago; open issues: 7; language: Python
An implementation of Show, Attend and Read with TensorFlow

Ocr Pipeline
Stars: 31; most recent commit: 7 years ago; open issues: 4; license: other; language: Python
Convert a corpus of PDFs to clean text files on a distributed architecture

Pdftotxt
Stars: 29; most recent commit: 4 years ago; open issues: 3; license: MIT; language: Python
Python code to read text from a PDF file (OCR).

Pdf2txt
Stars: 16; most recent commit: 8 years ago; open issues: 2; license: LGPL-3.0; language: Visual Basic
Batch convert PDF files to text under Windows, using several text extraction methods or OCR

Dango Ocr
Stars: 15; most recent commit: 3 years ago; language: Python
DangoOCR: screenshot OCR text recognition; supports multiple languages, translates the recognized text, and plays audio

Genete_ocr_data
Stars: 7; most recent commit: 4 years ago; open issues: 1; language: Python
OCR data, detection data, recognition data