Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for html scraper
html
x
scraper
x
109 search results found
Easyspider
⭐
36,416
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化
Requests Html
⭐
13,100
Pythonic HTML Parsing for Humans™
Scrapely
⭐
1,668
A pure-python HTML screen-scraping library
Scraper
⭐
1,639
HTML parsing and querying with CSS selectors
Upton
⭐
1,615
A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)
How To Prevent Scraping
⭐
1,417
The ultimate guide on preventing Website Scraping
Parsel
⭐
1,010
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Mlscraper
⭐
935
🤖 Scrape data from HTML websites automatically by just providing examples
Website Downloader
⭐
895
💡 Download the complete source code of any website (including all assets). [ Javascripts, Stylesheets, Images ] using Node.js
Gazpacho
⭐
716
🥫 The simple, fast, and modern web scraping library
Skrape.it
⭐
714
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Se Scraper
⭐
477
Javascript scraping module based on puppeteer for many different search engines...
Pywebcopy
⭐
455
Locally saves webpages to your hard disk with images, css, js & links as is.
Opensanctions
⭐
427
An open database of international sanctions data, persons of interest and politically exposed persons
Ultimate Web Scraper
⭐
400
A PHP library/toolkit designed to handle all of your web scraping needs under a MIT or LGPL license. Also has web server and WebSocket server classes for building custom servers.
Lambdasoup
⭐
394
Functional HTML scraping and rewriting with CSS in OCaml
Basketball_reference_web_scraper
⭐
382
NBA Stats API via Basketball Reference
Hquery.php
⭐
345
An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
Juriscraper
⭐
314
An API to scrape American court websites for metadata.
Elixir Scrape
⭐
300
Scrape any website, article or RSS/Atom Feed with ease!
Youtube Projects
⭐
272
This repository contains all the code I use in my YouTube tutorials.
Torrent Search Api
⭐
258
Yet another node torrent scraper (supports iptorrents, torrentleech, torrent9, torrentz2, 1337x, thepiratebay, Yggtorrent, TorrentProject, Eztv, Yts, LimeTorrents)
Requests Html
⭐
207
Pythonic HTML Parsing for Humans™
Cheers
⭐
194
Scrape a website efficiently, block by block, page by page. Based on cheerio and curl.
Unhtml.rs
⭐
173
A magic html parser
Spider
⭐
173
Scheduler of spiders for scraping and parsing HTML and JSON pages
Xquery
⭐
155
Extract data or evaluate value from HTML/XML documents using XPath
Scrapi
⭐
154
LOOKING FOR A MAINTAINER
Tokio
⭐
144
Web scraping made simple.
Nibbler
⭐
142
A cute HTML scraper / data extraction tool in under 70 lines of code
Htmlsql
⭐
121
htmlSQL is a experimental PHP library which allows you to access HTML values by an SQL like syntax.
Nimquery
⭐
111
Nim library for querying HTML using CSS-selectors (like JavaScripts document.querySelector)
Html2rss
⭐
106
📰 Build RSS 2.0 feeds from websites (and JSON APIs) with a few CSS selectors.
Animeez
⭐
88
AnimeEZ - An Anime Streaming website without any ads for free (Demo - https://animeez.live) BTW ITS MADE IN HTML
Openscraper
⭐
80
An open source webapp for scraping: towards a public service for webscraping
Google Covid19 Mobility Reports
⭐
79
Data extraction of Google's COVID-19 Mobility Reports
Tatooine
⭐
78
A powerful scraper for JavaScript Developers.
Webdext
⭐
74
Intelligent Web Data Extractor
Venom
⭐
72
Your preferred open source focused crawler for the deep web.
Laravel Intelligent Scraper
⭐
72
Service to scrape a web page easily without knowing their HTML structure.
Autoscrape Py
⭐
70
An automated, programming-free web scraper for interactive sites
Thinkdiff
⭐
68
My open source project links, programming and software development related code and tutorials are in this repo. Content types: Python, JavaScript, Dart | Django, React, Flutter, React-Native etc.
Newspaper4k
⭐
66
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
Nipper
⭐
65
A Rust crate for manipulating HTML with CSS selectors
Newspaperjs
⭐
63
News extraction and scraping. Article Parsing
Dolarpy
⭐
58
Checks USD/PYG exchange rate from several sites, with a calculator, RESTful API and a twitter bot
Selectorlib
⭐
55
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Scraping Helper Chrome Extension
⭐
53
Scraping Helper will help you to find out the best html/css selector for certain elements
Html Table Extractor
⭐
51
extract data from html table
Chegg Scraper
⭐
50
Download Chegg homework-help questions to self-sufficient HTML files
Crawler
⭐
49
Web Scraping Framework
Hext
⭐
48
Domain-specific language for extracting structured data from HTML documents
Termin
⭐
48
Simple PHP script for notifying for a free appointments on the Berlin services website.
Yellowpages Scraper
⭐
43
Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.
Rscraping Jsm 2016
⭐
42
Repository for one-day course "A Primer to Web Scraping with R"
Searchscraperapi
⭐
41
Aiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of results.
Ronin Web
⭐
40
ronin-web is a collection of useful web helper methods and commands.
Uber_data
⭐
40
Uber web interface crawler / scraper - Convert the trips table into a CSV file
Linkebot
⭐
36
🔎 um bot de Web Scraping para mostrar vagas do LinkedIn
Scrape Metadata
⭐
32
📜 HTML metadata scraper
Pyitau
⭐
32
Unofficial client to access your Itaú bank data
Wwwclient
⭐
31
Advanced web browsing, scraping and automation
Interactive Facebook Reactions
⭐
30
Jupyter notebook + Code for processing Facebook Reactions data and making Interactive Charts
Xpath Selector
⭐
28
Library implementing easy XPath queries. Very useful for HTML and XML web scraping.
Tiktokr
⭐
28
An R Scraper for Tiktok
Webgrude
⭐
27
A java annotation library for Web scraping.
Wandering Labs Availability 2
⭐
27
Screen scrapes popular campground reservation website looking for availabilities.
Skrape
⭐
27
Kotlin DSL to scrape HTML and convert it to JSON
Skim
⭐
25
Scrape websites simply in Node.js. Streaming HTML scraper using CSS selectors.
Genscrape
⭐
25
JavaScript library that aids in scraping person data off of genealogy websites
Med7369 Specialist Investigative Journalism
⭐
24
Module on both the MA Data Journalism and MA Multiplatform and Mobile Journalism at Birmingham City University
Cookcountyjail2
⭐
24
A new version of the cook county jail scraper, inspired by the Supreme Chi-Town Coding Crew
Hodor
⭐
23
🕷Configuration based html scraper
Igscraperkit
⭐
23
Create dynamic web scraper in Objective-C or Ruby!
Api Flight.com
⭐
22
Main API Flight Git Repository
Oge
⭐
21
Page metadata as a service
Scraper
⭐
20
For scraping content out of pages and/or feeds.
Web Scraping
⭐
20
short tutorial on how to scrape the internets
Skyscraper
⭐
19
Rust HTML Scraping with XPath Expressions
Manolo_scraper
⭐
18
Scraper de registro de visitas online. Usa Scrapy.
Istanbul Transportation Network
⭐
18
Istanbul Transportation Network Scraping & Analysis
Pirula Time
⭐
18
Measures the mean duration for Pirula's videos
Web Scraping With R Extended Edition
⭐
18
Repository for one-day course "Web Scraping with R, extended edition"
Rjdl
⭐
17
Radio Javan downloader and info scraper for Node.js
Scrapy_facebooker
⭐
17
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Angleparse
⭐
17
HTML parsing and processing tool for PowerShell.
Html Table To Json
⭐
17
Generate JSON representations of HTML tables
Sejmrp
⭐
17
Qddate
⭐
17
Quick and dirty date parsing Python library to parse HTML dates really fast
Struktur
⭐
16
Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.
Pycon_2017
⭐
16
Fantastic Data and Where To Find Them: An introduction to APIs, RSS, and Scraping
Penjabarberita
⭐
16
Extract the article list from its raw news HTML
Webhere
⭐
16
HTML scraping for Objective-C.
Django Scraper
⭐
16
Django application which crawls and downloads online content following instructions
Bets
⭐
16
Betting odds scraper for multiple bookies
Shup
⭐
15
A POSIX shell script to parse HTML
Scraper
⭐
15
All In One API to easily scrape data from any website, without worrying about captchas and bot detection mecanisms.
Image Scraper
⭐
15
Python script to scrap images from a website
Docker Scraper
⭐
15
Dockerised python web scraper with NHS Choices website spider.
Scrapinode
⭐
14
content driven and route based scraper
Related Searches
Javascript Html (52,781)
Html Css (19,526)
Python Html (6,892)
Html Jquery (5,656)
Html Bootstrap (5,651)
Php Html (5,615)
Html Theme (5,550)
Html Jekyll (5,387)
Typescript Html (5,136)
Html Markdown (5,082)
1-100 of 109 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2025 Awesome Open Source. All rights reserved.