Awesome Open Source
Search results for ocr pdf to text
5 search results found
Unstructured (⭐ 4,404)
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Pd3f (⭐ 131)
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
Adobe Pdf Library Samples (⭐ 75)
Sample code for the Datalogics C++, Java, and .NET interfaces of the Adobe PDF Library
Iron Ocr Image To Text In Csharp (⭐ 49)
Image to Text Tutorial in C# - See https://ironsoftware.com/csharp/ocr/tutorials/how-
Pdf Text Data Extractor (⭐ 32)
PDF text data extraction web app with OCR for scanned documents
Ocr Python (⭐ 13)
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
Aiopytesseract (⭐ 13)
A Python asyncio wrapper for Tesseract-OCR.
Pdf2dataset (⭐ 8)
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features
Pdftotext (⭐ 6)
A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.