Awesome Open Source
Search results for lxml
358 search results found
Opencorpora Tools
⭐
42
Python interface to http://opencorpora.org/
Vim Xpath
⭐
41
XPath search plugin for Vim
Label Annotation Voc Pascal
⭐
41
A tool for annotation using VOC Pascal format
Xml4h
⭐
40
XML for Humans in Python
Get_fshare
⭐
38
Python library to get Fshare link with your account.
Scrapydemo
⭐
38
ScrapyDemo : Redis MySQLdb logging IngoreHttpRequestMiddleware UserAgentMiddleware HttpProxyMiddleware rules
Crunchy Xml Decoder
⭐
38
Gdml
⭐
37
FreeCAD GDML Workbench - AddonManager Installable
Pylinkchecker
⭐
36
standalone and pure python link checker and crawler that traverses a web site and reports errors
Django Xml
⭐
35
A python module which provides an abstraction to lxml's XPath and XSLT functionality in a manner resembling django database models.
Lxml Stubs
⭐
35
Type stubs for the lxml package
Simulix
⭐
34
A third-party Simulink tool to generate FMUs from models using the C-API
Webpager
⭐
34
Paginating the web
Instagram_stalker_scraper
⭐
34
(UNMAINTAINED) Fetch data of any public Instagram profile, without using api
Analyze Spec Benchmarks
⭐
33
Ezlog
⭐
33
Easy blog system powered by django
Python Sitemap
⭐
33
Python library for parsing & generating sitemaps
Weibo_daily_hotkey
⭐
33
Weibo's daily TOP5 hotkey. Automatically crawls and filters Sina Weibo's daily top 5 trending search terms. https://github.com/TauWu/weibo_daily_hotkey/b
Leaf
⭐
32
Simple Python library for HTML parsing
Soup Strainer
⭐
32
A reimplementation of the Readability/Decruft algorithm using BeautifulSoup and html5lib
Pomgen
⭐
31
Utility that turns Bazel-built jars into Maven compatible artifacts
Mycvt
⭐
31
Checkpoint Firewall Ruleset Auditor ( For the HTML exports when you do not have the object files )
Endomondo Export
⭐
30
Export the most recent Endomondo workouts as TCX files.
Chakert
⭐
29
Python typography enhancer tool for lxml-based HTML and raw text
Google Image Downloader
⭐
28
A script to download images from images.google.com
Civics_cdf_validator
⭐
28
Mexican Jobs 2018
⭐
28
Reddit bots, web scraper and utility scripts used to perform EDA on thousands of job listings from the official Mexican job board.
Spider Course 5
⭐
28
Codeforcesimporter
⭐
27
Just a small script for importing user statistics and past submissions on codeforces
Steam
⭐
27
Scrapes game information from the Steam store
Aws Lambda Py3
⭐
27
Pre-compiled Python3(3.6+) packages for AWS Lambda layers
Webcomix
⭐
26
Webcomic downloader
Terrain_generator
⭐
26
A wizard that generates terrains for Gazebo using height maps.
Frogress
⭐
26
frogress - a progress tool for shell
Kissanime_dl
⭐
26
web-crawler / package manager / kissanime downloader
Gpipe43
⭐
26
A full-text RSS generator which can be hosted on Google App Engine
Bookcreator
⭐
26
A scraper that takes an online book from O'Reilly and turns it into an EPUB book, because I want to read 'em on my Nook, away from my computer, and since I know Python I figured it could not be that hard...
Kernelconfig
⭐
26
Generate custom Linux kernel configurations from curated sources
Xtdiff
⭐
25
⚠️ THIS REPO IS DEPRECATED ⚠️ Python library to compare two XML trees and generate a set of actions that transform one into the other
Clixpath
⭐
25
Command-line tool to easily extract data from HTML or XML documents. Produces machine readable output.
Pdfdownloader
⭐
24
An Innovative Web Scraping Solution to Download PDF Files
Sciencebeam Gym
⭐
23
ScienceBeam Gym
Hodor
⭐
23
🕷Configuration based html scraper
Types Lxml
⭐
23
Complete lxml external type annotation
Vocabularyanalyzer
⭐
23
English vocabulary analyzer that can be used to extract advanced vocabulary from text
Ades Master
⭐
22
ADES implementation based on a master XML file
Habr Observer
⭐
22
An automatically updated feed with summaries of the best Habr.com articles generated by the YandexGPT neural network.
Dweixin
⭐
22
Wechat development based on Django
Extract News Summary
⭐
22
Pure python script that takes user query and summarizes news related to it.
Chopper
⭐
22
Chopper is a tool to extract elements from HTML by preserving ancestors and CSS rules
Solr For Datascience
⭐
22
Ihealth_crawler
⭐
21
Content crawler for the iHealth project (a medical consultation crawler based on Python and MongoDB)
Lbduodian
⭐
21
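Crawling all products of the Duodian (多点) store. Disclaimer: if this infringes any company's rights, please let me know promptly and I will immediately delete the crawled site-wide product information. Crawls product information for the entire Duodian store: 1. analysis of the store's characteristics; 2. the crawling approach used; 3. parsing of the crawled data (the main focus).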
Yzucoursebot
⭐
21
Yuan Ze University (元智) course selection bot
Sinacrawlerv
⭐
21
Back up posts and comments of a specified user on Sina
Codechef Rank Comparator
⭐
21
Web application hosted on Heroku cloud platform based on web scraping in python using lxml library (XML Path Language).
Naver Blog Crawler
⭐
21
Naver blog crawler
Enlivepy
⭐
21
Python port of clojure enlive library for html transformation
Vra Rdf Project
⭐
21
VRA-RDF-Project
Openvenues
⭐
21
Python Odml
⭐
21
odML libraries
Beets Rymgenre
⭐
21
A beets plugin to fetch genre information from rateyourmusic.com
Ncdiff
⭐
20
NETCONF Diff Engine
Justdail Scrapper
⭐
20
A 100% working Justdial scraper. Just enter the URL and it'll extract business info from it
Mexican Jobs 2020
⭐
20
Data ETL & Analysis on thousands of job listings from the official Mexican job board (2020 edition).
Api Oreilly Free Books
⭐
19
Web scraping to download books from the "programming" section of O'Reilly free books
Arxiv_leaks
⭐
19
Whisper of the arxiv: read comments in tex of papers
Allen Bradley Toolkit
⭐
19
This project aims to wrap the native Python lxml library for the purpose of building a strong L5X editing API.
Av_data_capture
⭐
19
AV metadata scraping tool, for use with local media management tools such as Kodi
Crawl Me
⭐
19
Crawl-me is a lightweight, fast, plugin-based web picture crawler
Gctools
⭐
19
Geocaching Tools and Scripts
X Path Walker
⭐
18
Aliexpressorders
⭐
18
Track your Aliexpress orders in Google Sheets
Pyqchem
⭐
18
Python interface for Q-Chem
Amazon Mobile Sentiment Analysis
⭐
18
Opinion mining of Mobile reviews on Amazon platform
Pymods
⭐
17
process MODS records from Python
Raft
⭐
17
Response Analysis and Further Testing (RAFT) is a testing tool for identifying vulnerabilities in web applications. RAFT is a suite of tools that use common shared elements to make testing and analysis easier. The tool provides visibility into areas that other tools do not, such as various client-side storage. RAFT uses markup to create templates for fuzz testing.
Structominer
⭐
17
Data scraping for a more civilized age
Parsel Cli
⭐
17
cli for evaluating css and xpath selectors
Python3 Mal
⭐
17
Python interface to MyAnimeList
Ipproxys
⭐
17
Amazon_reviews_allpages
⭐
16
Script to scrape all reviews on all Amazon pages
Myfitnesspal Python Dashboard
⭐
16
Visualize your meal tracking from MyFitnessPal in a Dashboard connected to your Raspberry Pi
Chia Anime Downloader
⭐
16
Anime batch downloader script for https://chia-anime.tv
Dominic
⭐
16
jquery-based python-pure implementation of CSS Selectors, good for using with google app engine
Jd Simulator
⭐
15
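Simulates JD.com (京东商城) services: two kinds of login requests, querying product information, modifying the shopping cart and order page information, submitting orders, obtaining coupons, and more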
Lianjiaspider
⭐
15
Pynzb
⭐
15
pynzb is a unified API for parsing NZB files, with several concrete implementations included
Phoneypdf
⭐
15
A virtual PDF analysis framework
Douban Movie
⭐
15
Get movie info from douban(豆瓣) and display in your terminal
Senscritiquescraper
⭐
15
Python API to extract data from senscritique.com.
Kafnafparserpy
⭐
15
Parser for KAF NAF files written in Python
Ansible Compile Python
⭐
15
Ansible role that lets you compile any given version of Python
Scse_asistant_server
⭐
14
Huaruan Campus Assistant (华软校园助手): the complete front-end code of the mini program plus part of the Python back-end code (mysise)
Genrss
⭐
14
RSS generator for python
Ssh Attack Visualisation
⭐
14
Solr
⭐
14
Allegro Common Lisp interface to Solr
Amazon Seller List
⭐
14
Dnevnikru
⭐
14
dnevnik.ru parser
Spiderqq
⭐
13
101-200 of 358 search results