Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python html parser
html-parser
x
python
x
19 search results found
Goose3
⭐
744
A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html
Html5 Parser
⭐
656
Fast C based HTML 5 parsing for python
Justext
⭐
509
Heuristic based boilerplate removal tool
Pywebcopy
⭐
455
Locally saves webpages to your hard disk with images, css, js & links as is.
Breadability
⭐
191
Reworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)
Harser
⭐
138
Easy way for HTML parsing and building XPath
Htmldate
⭐
101
Fast and robust date extraction from web pages, with Python or on the command-line
Advancedhtmlparser
⭐
94
Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modification, and formatting. Also XPath.
Nba Search
⭐
56
flask application designed to explore NBA data 🏀
Dedoc
⭐
49
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Document logical extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser)
Sec Parser
⭐
43
Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual structure of the document.
Procyclingstats
⭐
38
procyclingstats scraper
Beautifulscraper
⭐
37
Python web-scraping library that wraps urllib2 and BeautifulSoup
Leaf
⭐
32
Simple Python library for HTML parsing
Ehp
⭐
30
Easy Html Parser is an AST generator for html/xml documents. You can easily delete/insert/extract tags in html/xml documents as well as look for patterns.
Crawler.py
⭐
23
async web crawler
Imslp
⭐
22
🎼 The clean and modern way of accessing IMSLP data and scores programmatically. 🎶
Aws Saml Login
⭐
14
AWS SAML login helper library for Python
Html5
⭐
12
A Python library for HTML5 web apps in Pyodide.
Pirateproxy
⭐
12
Python-based HTTP/HTTPS proxy, comparable with CGIproxy but standalone.
Pgreaper
⭐
11
A Python library for loading data from various formats into PostgreSQL databases.
Bbscraper
⭐
10
Simple phpBB forum thread web scraper written in Python
Plugin.video.zdf_de_2016
⭐
9
Flask Youtube
⭐
9
YouTube Downloader Flask version.
Tipi
⭐
8
Typographic replacements in HTML
Parker
⭐
8
Parker is a Python-based web spider for collecting specific data across a set of configured sites.
Python Gratisdns
⭐
8
A project which aims to combine the ease of python, with the power of GratisDNS.
Bookmarks2evernote
⭐
8
Slurp
⭐
7
BeautifulSoup4 packaged into a command line tool
Reestr
⭐
6
Сбор данных из реестра российского ПО с сайта https://reestr.minsvyaz.ru
Django Janitor
⭐
6
django-janitor allows you to use bleach to clean HTML stored in a Model's field.
Apifier
⭐
6
Apifier is a very simple HTML parser written in Python based on CSS selectors
Developer Tools
⭐
6
Developer Tools - HTML Parser, CSS Converter | AppSeed
Humble_catalog
⭐
5
A script to parse the saved Humble Bundle library HTML
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Flask (17,643)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Command Line (13,351)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Network (11,495)
1-19 of 19 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.