Awesome Open Source
Search results for python pdf parser
9 search results found
Pypdf
⭐
7,377
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Scipdf_parser
⭐
236
Python PDF parser for scientific publications: content and figures
Sypht Python Client
⭐
163
A Python client for the Sypht API
Casparser
⭐
105
Parser for Consolidated Account Statements (CAS) generated from CAMS/Karvy/Kfintech
Dedoc
⭐
49
Dedoc is a library (service) for automating document parsing and converting documents to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Document parsing; document content extraction; document logical structure extraction; PDF parser; scanned document parser; DOCX parser; HTML parser)
Pyxpdf
⭐
23
Fast and memory-efficient Python PDF Parser based on xpdf sources
Pypdfcrack
⭐
15
An investigation into PDF encryption
Scanipy
⭐
11
Scanipy stands for "scan it with Python"—it's your smart Python library for scanning and parsing complex PDF files like books, reports, articles, and academic papers. Utilizing cutting-edge Deep Learning algorithms, Scanipy transforms your PDFs into a treasure trove of extractable information: tables, images, equations, and text.
Camelot Sharp
⭐
10
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).
Lyrapdf
⭐
7
LyraPDF: convert a PDF to JSON or Markdown
Auto Law Review
⭐
7
Automate the case review on legal case documents.
Copyright 2018-2024 Awesome Open Source. All rights reserved.