Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python pdf files
pdf-files
x
python
x
97 search results found
Pdftabextract
⭐
1,994
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Pdfrw
⭐
1,719
pdfrw is a pure Python library that reads and writes PDFs
Peepdf
⭐
764
Powerful Python tool to analyze PDF documents
Pdf.tocgen
⭐
444
A CLI toolset to generate table of contents for PDF files automatically.
Pdfcropmargins
⭐
273
pdfCropMargins -- a program to crop the margins of PDF files
Mkdocs Pdf Export Plugin
⭐
264
An MkDocs plugin to export content pages as PDF files
Kubernetes Doc Pdf
⭐
238
Kubernetes PDF Documentation
Telegram Pdf Bot
⭐
169
A Telegram bot that can do a lot of things related to PDF files
Pdfsuite
⭐
153
Python scripts, Automator Services and Quartz Filters for MacOS (OS X) that create, manipulate, and query PDF files
Alfred Pdf Tools
⭐
139
Optimize, encrypt and manipulate PDF files.
Analyzepdf
⭐
136
Tool to help analyze PDF files
Krop
⭐
107
A simple graphical tool to crop the pages of PDF files, written in Python/Qt
Pdfdiff
⭐
107
Command-line tool to inspect the difference between (the text in) two PDF files
Zowie
⭐
81
Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)
Pdfextract_text
⭐
80
This is the beta version of PDF Extract, it only extracts text out of user-selected PDF files.
Pdfreader
⭐
79
Python API for PDF documents
Erpnext_ocr
⭐
75
🐍 ⚗️ Optical Character Recognition using tesseract within Frappe.
Pdf Bot
⭐
68
A bot for PDF for doing Many Things....
Minipdf
⭐
66
A python library for making PDF files in a very low level way.
Pdftables
⭐
62
A library for extracting tables from PDF files
Django Renderpdf
⭐
54
📄 Django app to render django templates as PDF files.
Pdfcompare
⭐
47
compare two PDF files, write a resulting PDF with highlighted changes
Cellular Automata Posters
⭐
46
Simple Python script that generates cellular automata posters as PDF files.
Qubes App Linux Pdf Converter
⭐
45
Qubes component: app-linux-pdf-converter
Linkedin Pdf Parsing
⭐
44
Parsing resumes in a PDF format from linkedIn
Django Afip
⭐
43
⚖️ AFIP invoice integration for django.
Rmbyext
⭐
42
Recursively removes all files with given extension(s)
Convert Document
⭐
39
A docker container for LibreOffice and unoconv, used to generate PDF files from office-type documents.
Python Django Exporting Files
⭐
39
Financial Data Collection From Web
⭐
38
A python scripe that collecting financial data from ju-chao web, and can download pdf files from it , more important is it can parase data you want from pdf files using pdfplumber .
Remove Pdf Watermark
⭐
38
Short script for removing watermarks from PDF files. Requires pdftk.
Pdf Split Merge
⭐
32
simple pdf file split and merge tool
Pdf Quench
⭐
31
A visual tool for cropping pdf files
Menextract2pdf
⭐
29
Extract Mendely annotations to PDF FIles
Ocr4wikisource
⭐
29
OCR for WikiSource using Google Drive OCR
Pdfmef
⭐
28
Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)
Snbopen
⭐
27
convert samsung S-note files (.snb) to pdf or open them.
Pdf_hide
⭐
25
A steganographic tool in Python for hiding data inside PDF files
Pdfdownloader
⭐
24
An Innvoative Web Scrapping Solution to Download PDF Files
Bytescout Sdk Sourcecode
⭐
24
ALL source code samples for ByteScout SDKs and Web API API products.
Pdf2bib
⭐
24
A python library/command-line tool to quickly and automatically generate BibTeX data starting from the pdf file of a scientific publication.
Pdfviewer
⭐
23
PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.
Mendeley Filesync
⭐
23
A script to synchronise PDF files in Mendeley across multiple machines
Pdfcrack Opencl
⭐
21
OpenCL pdfcracker implemented in python
Molminer
⭐
18
Python library and command-line tool for extracting compounds from scientific literature. Written in Python.
Pypdfform
⭐
18
🔥 The Python library for PDF forms.
Pdf_downloader
⭐
17
A Scrapy Spider for downloading PDF files from a webpage.
Irads
⭐
17
Internet Research Agency Facebook ads as structured data
Edapy
⭐
16
Exploratory Data Analysis with Python
Nautilus Pdf Tools
⭐
16
Tools to work with PDF files from Nautilus
Pypdflite
⭐
16
A lightweight utility for creating PDF files, written in Python
Pdf Rest Api Samples
⭐
16
pdfRest API Toolkit is a REST API service for processing PDF documents, made by developers, for developers. Rapidly integrate PDF workflows with your existing projects and applications, simply and seamlessly. Get started for free in seconds.
Imprenta
⭐
15
An AWS lambda in python 3 that generates PDF files from HTML using jinja, pdfkit and wkhtmltopdf.
Pdf Corpus
⭐
15
Python script to quickly create hand-crafted PDF files
Phoneypdf
⭐
15
A virtual PDF analysis framework
Pdf_merge
⭐
15
I hated using online tools for merging my PDF files so I wrote a Python 3.6 script to merge all PDF files in a folder to a new PDF file.
Pdfxplr
⭐
14
Extract hidden data from pdf files
Prsannots
⭐
14
Get annotated PDFs from your Sony PRS-T1 ereader
Merge Pdf
⭐
13
My first PyPi Package. Merge Image and PDF files using customizations within a folder using the Command line.
Pdftables
⭐
13
forked from the scraperwiki pdftables (0.0.4) project which was removed Github
Krop
⭐
13
This is a python/Qt app which can be used to crop pdf documents. Small, simple and useful. Copy of the original software Krop by Armin Straub.
Pypergrabber
⭐
13
Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Safedocs
⭐
13
Artifacts from the DARPA-funded SafeDocs research program
Impositioner
⭐
12
Basic imposition of PDF files
Google Patents Scraper
⭐
11
Automatically download all PDF files of searching results & their patent families found on Google Patents.
Djangolatex
⭐
11
Serve PDF files using Django templates and LaTeX
Rb2py
⭐
11
Experimental Ruby to Python 3 translator
Tweets2pdf
⭐
11
Backup your tweets into FANTANSTIC PDF files
Academicbib
⭐
11
Dercuano
⭐
11
a quick system I hacked together to bundle a few thousand pages of notes I mostly haven’t published before up into an archive of pregenerated HTML
Pdf2text
⭐
10
Project to convert PDF files to Text files using google OCR
Sciscraper
⭐
10
A bulk academic PDF extractor program, designed specifically for papers about behavioral science and design.
Multi Pdf Finder
⭐
10
Are you looking for a word in many pdf files? Do it one time. ⚡
Greek_laws_consolidation_code
⭐
10
Code for implementation of a semi-automatic system for the consolidation of Greek legislative texts
Pdf Page Counter
⭐
10
Sum up the pages of all pdf files in a directory 📖
Nesa
⭐
9
Extracts data from the Network Rail (NR) National Electronic Sectional Appendix data
Marisol
⭐
9
Library for bates numbering, text-stamping, and redacting PDF files.
Pdf_form_ocr
⭐
8
Table Recognition and Content Extraction in PDF Files
Financial Documents Ocr Deep Learning
⭐
8
Pdf2images
⭐
8
Convert pdf to pages of images
Slidegrubber
⭐
7
Slidegrubber is a python package that can download SlideShare presentations as PDF files.
Pdf2emb_nlp
⭐
7
NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query
Pdftex
⭐
7
An easy way to generate PDF files which could be imported into overleaf with python/matplotlib
Transpdf
⭐
7
Extracting text from pdf files and translate the text into designated language by calling google api
Professional It Certifications
⭐
7
Metathief
⭐
7
PoC for extracting office files into PDF file metadata
Lip2sql
⭐
6
Parser that turns downloaded LinkedIn PDF resumes into an SQLite database
Jersey City Budget Pdf Liberation
⭐
6
This project will liberate data from pdf files found on http://www.cityofjerseycity.com/pub-info.aspx?id=2 and will create .csv and .json files to be uploaded on https://data.openjerseycity.org/dataset/jersey-cit
Pdfcracker
⭐
6
Crack PDF password is easy
Collective.sendaspdf
⭐
6
A Plone product that allows downloading the current page as a PDF and also sending it by e-mail
Evoker Lite
⭐
6
Evoker Lite is a python tool to generate cluster plots (PNG/PDF files) given a set of PLINK + intensity files. Compatible with UK Biobank v2 data.
Airflow Pdf2embeddings
⭐
6
NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query.
Haptipediaextractor
⭐
5
An API for extracting metadata, text, section titles, figures, and references from a PDF file
Pycpdf
⭐
5
A Python extension to extract content and metadata from PDF files efficiently
Itcr Courses
⭐
5
Repository of my notes in the career of Computer Engineering.
Pdfcomparator
⭐
5
Compares two PDF files by appearance, not by content. It can be used in the command line, in order to use it inside bigger scripts.
Pdf_merge_and_edit
⭐
5
Python script to merge and edit sensitive PDF files you don't want to upload to random sites you find on Google
Kicad_picknplace_assistant
⭐
5
KiCad PCB pick and place assistant
Pdf Image Binarization
⭐
5
Binarize all images in a scanned PDF file. 扫描版 PDF 黑白化/二值化。
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Flask (17,643)
Python Script (17,004)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Command Line (13,351)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
1-97 of 97 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.