Awesome Open Source
Search results for python docx
103 search results found
Pdf2docx
⭐
1,935
Open source Python library converting pdf to docx.
Python Docx Template
⭐
1,566
Use a docx as a jinja2 template
Ebook Gpt Translator
⭐
1,528
Enjoy reading with your favorite style.
Python Docx
⭐
1,030
Reads, queries and modifies Microsoft Word 2007/2008 docx files.
Chatweb
⭐
573
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Msoffcrypto Tool
⭐
496
Python tool and library for decrypting and encrypting MS Office files using passwords or other keys
Python Docx2txt
⭐
450
A pure python based utility to extract text and images from docx files.
Pandoc Fignos
⭐
279
A pandoc filter for numbering figures and figure references.
Docx Mailmerge
⭐
268
Mail merge for Office Open XML (docx) files without the need for Microsoft Office Word.
Docem
⭐
268
Utility to embed XXE and XSS payloads in docx, odt, pptx, etc. (OXML_XEE on steroids)
Ramile
⭐
192
China software copyright extraction tool: automatically extracts source code for Chinese software copyright registration
Docgpt Langchain
⭐
178
🔐Free GPT-3.5 chat with your docs (PDF, WORD, CSV, TXT)
Ilovepdf
⭐
176
Telegram bot that converts images to PDF, PDF to images, and 45+ file formats to PDF; more features coming soon.
Pandoc Eqnos
⭐
175
A pandoc filter for numbering equations and equation references.
Keras English Resume Parser And Analyzer
⭐
158
Keras project that parses and analyzes English resumes
Pydocx
⭐
146
An extendable docx file format parser and converter
Goreport
⭐
130
A Python script to collect campaign data from Gophish and generate a report
Plagiarism Checker
⭐
121
A utility to check if a document's contents are plagiarised
Docx2python
⭐
113
Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.
Foliant
⭐
104
Comprehensive markdown-based documentation toolkit
Openvasreporting
⭐
101
OpenVAS Reporting: Convert OpenVAS XML report files to reports
Simplify Docx
⭐
95
Simplify DOCX files to JSON
Pandoc Tablenos
⭐
87
A pandoc filter for numbering tables and table references.
Excelcy
⭐
81
Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.
Baiduwenku
⭐
62
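Baidu Wenku! Come on in, this is the Baidu Wenku crawler you are looking for, guaranteed to run! Word files are saved as editable Word documents, PPT and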
Pandocacademic
⭐
61
A Sublime Text 2/3 plugin that handles Pandoc documents (syntax), adds functionality (bibliography, etc.), and converts them to other formats.
Ddetect
⭐
56
Simple DDE object detector
Html2docx
⭐
55
Convert html to docx
Docx2csv
⭐
54
Extracts tables from .docx files and saves them as .csv or .xls files
Python Code
⭐
50
A variety of miscellaneous Python code
Dedoc
⭐
49
Dedoc is a library (service) for automated document parsing and conversion to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Document logical extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser)
Fengspider
⭐
45
A summary of my own experience learning web crawling, plus some good crawler examples found online
Word2html
⭐
45
A quick and dirty script to convert a Word (docx) document to HTML.
Docximport Sigil Plugin
⭐
43
Import DOCX documents into Sigil as epubs
Pdfconverter
⭐
42
Best PDF Converter! PDF to any format, pdf2word/excel/xml/html/txt...
Jupyter Docx Bundler
⭐
38
Jupyter bundler extension to export notebook as a docx file
Docx2html
⭐
37
Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx
Sarna
⭐
37
Security Assessment Report geNerated Automatically
Cbeta
⭐
35
Online reader for the Taishō Tripiṭaka (大正藏)
Sublimepandoc
⭐
34
Sublime Text 2 Plugin for the Pandoc Markdown Renderer
Md2docx
⭐
30
Markdown to Word docx file conversion
Allure Docx
⭐
27
docx report generation based on allure-generated json files
Rstdoc
⭐
21
rst documentation support using Pandoc and Sphinx to produce DOCX, PDF and HTML; reformat rst tables for vim
Kylo Docs
⭐
18
Usfm Tools
⭐
18
Tools for parsing and rendering USFM files
Docassemble Alweaver
⭐
18
A tool to help quickly generate draft interviews from an existing document (pdf or DOCX) for the docassemble platform.
Coloradorla
⭐
17
Software to facilitate risk-limiting audits at the state level, developed for the state of Colorado.
Plagiarism_checker
⭐
17
Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.
Anonymize
⭐
16
A script to change the authorship of comments, redlines, and whatnot in ODT and DOCX files.
Fbl
⭐
16
FBL is a tool to find broken links in articles and files
Npttf2utf
⭐
16
Python module/script to map Nepali ASCII font faces like Preeti, Sagarmatha, and more to Devanagari Unicode, with direct mapping support for docx and txt files.
Python Openxml
⭐
15
OpenXML library for Python: create .docx & .pptx files
Baiduwenkuspider
⭐
15
Downloads Baidu Wenku documents as PDFs in their original formatting
Dango Ocr
⭐
15
DangoOCR: screenshot OCR text recognition; supports multiple languages, translation after recognition, and audio playback
Mistune Docx
⭐
15
Markdown to Docx converter
Mlsa Certificate Automate
⭐
15
Automate your Microsoft Learn Student Ambassadors event certificate with Python
Persian Tts
⭐
13
Df Extract
⭐
12
DF Extract Lib
Tencent Weibo Exporter
⭐
12
Tencent Weibo export tool; no login required.
Deepl Tr Pyppeteer
⭐
12
machine translate docx/txt via deepl and pyppeteer
Semantic Ai
⭐
11
An open source framework for a Retrieval-Augmented System (RAG) that uses semantic search to retrieve the expected results and generates human-readable conversational responses with the help of an LLM (Large Language Model).
Pubmed2doc
⭐
11
Write PubMed search results with two display options (citation or listview) to PDF or Word
Unogenerator
⭐
11
Programmatic LibreOffice file generator using Python and LibreOffice server instances
Sven
⭐
10
sven django project
Docxsphinx
⭐
10
Generate word documents from sphinx documentation
Tpyo_revealo
⭐
10
🙈 assistant for hunting down tpyos
Pandoc Secnos
⭐
9
A pandoc filter for numbering section references.
Deedee
⭐
9
Stealthy DDE Exploit Payload generator and injector for DOCX files
Docxlatex
⭐
8
A python library for extracting text from .docx files with support for inserted mathematical equations
Ehscripter
⭐
8
Ethical Hacking Scripter
Python Flask Cms App
⭐
8
A Content Management System (CMS) app built with Python Flask
Resume Parser In Python
⭐
8
A parser to extract information from resumes in PDF and DOCX formats written in Python
Polyglot Database
⭐
8
Database of polyglot files. By polyglot, we mean files readable in multiple formats
Expose Text
⭐
7
This is a prototype of a Python module for simple modification of document files.
Django Template Engines
⭐
7
Additional template engines for Django (ODT & PDF)
Cybertechwatch
⭐
7
Multi-thread monitoring tool for cyber security threats; generates nicely formatted report in docx
Docxbuilder
⭐
7
Sphinx docx builder extension
Polyglot
⭐
7
Polyglot is a CLI tool that automates translation tasks. Using the DeepL API, Polyglot generates a translated file from a given source.
Metadatos
⭐
7
Example script that recovers metadata from PDF and DOCX files
Libreofficetopdf
⭐
7
Utility to open a docx document, update indexes and save as pdf.
Hoshi
⭐
7
[Star] Converts scanned PDFs to docx
Python Docx Reading Paragraphs Tables And Images In Document Order
⭐
7
The Python docx package cannot read paragraphs, tables and images in document order. It can only render all the paragraphs at once or all tables at once or all images at once. Here, I provide a way in which paragraphs, tables and images present in a docx file can be read in document order into a dataframe in python.
Python Ssp
⭐
6
FedRAMP SSP API in Python
Kwaliteitsaanpak
⭐
6
ICTU's Kwaliteitsaanpak: a set of guidelines for software development projects
Py Snippet
⭐
6
Code snippets written while learning Python
Tranalator
⭐
6
Document translation based on Baidu Translate and Google Translate
Gitlaw Au
⭐
5
Ever wondered what it would look like if Commonwealth Legislation was on Github?
Docx Content Modify
⭐
5
A small Python script that batch-generates mailing slips for legal agencies: it extracts content from court judgments so the slips do not have to be filled in by hand
Bookify
⭐
5
Transform pdf files into booklets for double-sided printing
Python Redlines
⭐
5
Docx tracked change redlines for the Python ecosystem.
Py Smartisan
⭐
5
A kindergarten-level python implementation of b*s functions in smartisan TNT
Dropkindle
⭐
5
DropKindle pushes documents from Dropbox, Jianguoyun (Nutstore), and OneDrive to Kindle
Openxml
⭐
5
openxml is a Python library to create and manipulate .docx and .pptx files
Docxtpl Cli
⭐
5
docxtpl-cli
Robot Ooxml
⭐
5
Provides .docx and .xlsx file parsers for Robot Framework
2018se_group12
⭐
5
Software Engineering, Fall 2018, Group 12
Changeoffice
⭐
5
Most Python MS Office processing libraries handle only the newer 2007 file formats (i.e. xlsx, pptx, docx). This package converts doc to docx, xls to xlsx, and ppt to pptx.
Resume Parser
⭐
5
Takes a folder of resumes (or outlook messages containing resumes), and creates a spreadsheet of results
Sphinxpapyrus Docxbuilder
⭐
5
A Sphinx extension for Word (.docx) file output
Gcp Docx2pdf
⭐
5
Example of (micro)services to do conversion from Microsoft Word Docx files to PDF using products on Google Cloud Platform
1-100 of 103 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.