Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for html scraper
html
x
scraper
x
455 search results found
Cheerio
⭐
26,901
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
Easyspider
⭐
16,589
A visual no-code/code-free web crawler/spider易采集:一个可视化爬虫软件,可以无代码图形化的设计和执行爬虫任务
Requests Html
⭐
13,100
Pythonic HTML Parsing for Humans™
Jsoup
⭐
10,317
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Nyt 2020 Election Scraper
⭐
1,788
Aos Avp
⭐
1,735
NOVA opeN sOurce Video plAyer: main repository to build them all
Scrapely
⭐
1,668
A pure-python HTML screen-scraping library
Upton
⭐
1,615
A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)
Scraper
⭐
1,505
HTML parsing and querying with CSS selectors
How To Prevent Scraping
⭐
1,127
The ultimate guide on preventing Website Scraping
Parsel
⭐
937
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Mlscraper
⭐
935
🤖 Scrape data from HTML websites automatically by just providing examples
Website Downloader
⭐
895
💡 Download the complete source code of any website (including all assets). [ Javascripts, Stylesheets, Images ] using Node.js
Scala Scraper
⭐
704
A Scala library for scraping content from HTML pages
Skrape.it
⭐
667
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Xidel
⭐
592
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Gazpacho
⭐
543
🥫 The simple, fast, and modern web scraping library
Se Scraper
⭐
477
Javascript scraping module based on puppeteer for many different search engines...
Ultimate Web Scraper
⭐
400
A PHP library/toolkit designed to handle all of your web scraping needs under a MIT or LGPL license. Also has web server and WebSocket server classes for building custom servers.
Pywebcopy
⭐
373
Locally saves webpages to your hard disk with images, css, js & links as is.
Lambdasoup
⭐
362
Functional HTML scraping and rewriting with CSS in OCaml
Scrape Linkedin Selenium
⭐
353
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Hquery.php
⭐
342
An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Basketball_reference_web_scraper
⭐
326
NBA Stats API via Basketball Reference
Scrapysharp
⭐
307
reborn of https://bitbucket.org/rflechner/scrapysharp
Elixir Scrape
⭐
300
Scrape any website, article or RSS/Atom Feed with ease!
Juriscraper
⭐
291
An API to scrape American court websites for metadata.
Youtube Projects
⭐
272
This repository contains all the code I use in my YouTube tutorials.
Leetcode
⭐
266
Leetcode Questions - Sorted by likes, likes-dislikes ratio and much more
Torrent Search Api
⭐
258
Yet another node torrent scraper (supports iptorrents, torrentleech, torrent9, torrentz2, 1337x, thepiratebay, Yggtorrent, TorrentProject, Eztv, Yts, LimeTorrents)
Tagsoup
⭐
225
Haskell library for parsing and extracting information from (possibly malformed) HTML/XML documents
Requests Html
⭐
207
Pythonic HTML Parsing for Humans™
Cheers
⭐
194
Scrape a website efficiently, block by block, page by page. Based on cheerio and curl.
Daath Ai Parser
⭐
184
Daath AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.
Spider
⭐
173
Scheduler of spiders for scraping and parsing HTML and JSON pages
Unhtml.rs
⭐
173
A magic html parser
Xquery
⭐
155
Extract data or evaluate value from HTML/XML documents using XPath
Scrapi
⭐
154
LOOKING FOR A MAINTAINER
Tokio
⭐
144
Web scraping made simple.
Nibbler
⭐
142
A cute HTML scraper / data extraction tool in under 70 lines of code
Htmlsql
⭐
121
htmlSQL is a experimental PHP library which allows you to access HTML values by an SQL like syntax.
Go Latest
⭐
120
Simple way to check version is latest or not from various sources in Golang
Html Metadata
⭐
115
MetaData html scraper and parser for Node.js (supports Promises and callback style)
Nimquery
⭐
111
Nim library for querying HTML using CSS-selectors (like JavaScripts document.querySelector)
Python Web Scraping Cookbook
⭐
107
Python Web Scraping Cookbook, published by Packt
Html2rss
⭐
100
📰 Build RSS 2.0 feeds from websites (and JSON APIs) with a few CSS selectors.
Web Scraper
⭐
97
Perl web scraping toolkit
Daily Scraper
⭐
92
Fetches information about every webpage 🤖
Animeez
⭐
88
AnimeEZ - An Anime Streaming website without any ads for free (Demo - https://animeez.live) BTW ITS MADE IN HTML
Openscraper
⭐
80
An open source webapp for scraping: towards a public service for webscraping
Google Covid19 Mobility Reports
⭐
79
Data extraction of Google's COVID-19 Mobility Reports
Tatooine
⭐
78
A powerful scraper for JavaScript Developers.
Webdext
⭐
74
Intelligent Web Data Extractor
Meteor Scrape
⭐
73
Scrape any Website or RSS/Atom-Feed with ease.
Laravel Intelligent Scraper
⭐
72
Service to scrape a web page easily without knowing their HTML structure.
Autoscrape Py
⭐
70
An automated, programming-free web scraper for interactive sites
Thinkdiff
⭐
68
My open source project links, programming and software development related code and tutorials are in this repo. Content types: Python, JavaScript, Dart | Django, React, Flutter, React-Native etc.
Top Github Scraper
⭐
67
Scape top GitHub repositories and users based on keywords
Venom
⭐
66
Your preferred open source focused crawler for the deep web.
Names
⭐
66
Analysis of most poisoned names in US
Nipper
⭐
65
A Rust crate for manipulating HTML with CSS selectors
Newspaperjs
⭐
63
News extraction and scraping. Article Parsing
Stack Scraper
⭐
62
Comicsrss.com
⭐
62
RSS feeds for comics
Mechaml
⭐
61
OCaml functional web scraping library
Nscrape
⭐
57
A web scraping framework for .NET
Dolarpy
⭐
56
Checks USD/PYG exchange rate from several sites, with a calculator, RESTful API and a twitter bot
Selectorlib
⭐
55
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Euro2016_terminalapp
⭐
55
⚽️ Instantly find 🏆EURO 2016 live-streams & highlights, now a Web App!
Scraping Helper Chrome Extension
⭐
53
Scraping Helper will help you to find out the best html/css selector for certain elements
Html Table Extractor
⭐
51
extract data from html table
Newscoverageonwuhan
⭐
51
Chinese News coverage on Wuhan during the epidemics outbreak
Chegg Scraper
⭐
50
Download Chegg homework-help questions to self-sufficient HTML files
Crawler
⭐
49
Web Scraping Framework
Trex
⭐
47
youtube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling
Hext
⭐
46
Domain-specific language for extracting structured data from HTML documents
Termin
⭐
45
Simple PHP script for notifying for a free appointments on the Berlin services website.
Scraper Fourone Jobs
⭐
43
This is a anti-scraping cracker for extracting apply information of one of Taiwan jobs recruiting website.
Yellowpages Scraper
⭐
43
Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.
Unfurl
⭐
42
Extract rich metadata from URLs
Rscraping Jsm 2016
⭐
42
Repository for one-day course "A Primer to Web Scraping with R"
Ronin Web
⭐
41
ronin-web is a collection of useful web helper methods and commands.
Searchscraperapi
⭐
41
Aiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of results.
Uber_data
⭐
40
Uber web interface crawler / scraper - Convert the trips table into a CSV file
Formless
⭐
40
Completely transparent, unobtrusive form populator for web applications and content scrapers.
Tvseries
⭐
36
TV Series is a tool that scrapes Episode Synopsis' of popular TV Series' from websites like Wikipedia / IMDb and show in one place with a user-friendly navigation UI.
Linkebot
⭐
36
🔎 um bot de Web Scraping para mostrar vagas do LinkedIn
Readability Cli
⭐
35
A CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!
Docparser
⭐
34
A ruby web/screen scraping tool / gem.
Pyitau
⭐
32
Unofficial client to access your Itaú bank data
Scrape Metadata
⭐
32
📜 HTML metadata scraper
Wwwclient
⭐
31
Advanced web browsing, scraping and automation
Interactive Facebook Reactions
⭐
30
Jupyter notebook + Code for processing Facebook Reactions data and making Interactive Charts
Xpath Selector
⭐
28
Library implementing easy XPath queries. Very useful for HTML and XML web scraping.
Tiktokr
⭐
28
An R Scraper for Tiktok
Wandering Labs Availability 2
⭐
27
Screen scrapes popular campground reservation website looking for availabilities.
Geoip Scraper
⭐
27
Scrapes specified files, generating a pretty google powered map with geoip results
Skrape
⭐
27
Kotlin DSL to scrape HTML and convert it to JSON
Webgrude
⭐
27
A java annotation library for Web scraping.
Genscrape
⭐
25
JavaScript library that aids in scraping person data off of genealogy websites
Related Searches
Javascript Html (50,508)
Html Css (20,889)
Python Html (6,892)
Html Bootstrap (5,651)
Php Html (5,615)
Html Theme (5,550)
Html Jquery (5,474)
Html Jekyll (5,387)
Html Markdown (5,022)
Html Reactjs (4,782)
1-100 of 455 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2023 Awesome Open Source. All rights reserved.