Awesome Open Source
Search results for dataset scraper
36 search results found
Complete Life Cycle Of A Data Science Project
⭐
499
Complete-Life-Cycle-of-a-Data-Science-Project
Cryptocmd
⭐
472
Cryptocurrency historical price data library in Python. Data from https://coinmarketcap.com.
Idt
⭐
206
Image Dataset Tool (idt) is a CLI tool designed to turn the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.
Trump Lies
⭐
175
Tutorial: Web scraping in Python with Beautiful Soup
Place 2022
⭐
155
Analysing and storing raw data from r/Place 2022 event
Jimutmap
⭐
112
API to quickly download large numbers of high-resolution satellite images from satellites.pro through multi-threading. Create your own map dataset. Bringing data to Humans.
Rust Repos
⭐
87
Dataset of Rust source code repositories
Amazon Reviews Scraper
⭐
82
Yet another multi-language scraper for Amazon, targeting reviews.
Google Covid19 Mobility Reports
⭐
79
Data extraction of Google's COVID-19 Mobility Reports
Deuce
⭐
78
R package for web scraping of tennis data
Curation Corpus
⭐
77
Code for obtaining the Curation Corpus abstractive text summarisation dataset
Laravel Intelligent Scraper
⭐
72
Service to scrape a web page easily without knowing its HTML structure.
Web Scraping Reddit
⭐
59
Scraping Reddit without using the Reddit API, building a dataset, and using that dataset for a machine learning project.
The Weather Scraper
⭐
58
A Lightweight Weather Scraper
Leetcode
⭐
52
Currently contains scraped data from around 1500 problems on the site. More to follow.
Datadoubleconfirm
⭐
50
Simple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: http://projectosyo.wix.com/datadoubleconfirm.
Mtnt
⭐
48
Code for the collection and analysis of the MTNT dataset
Data Inventories
⭐
48
A simple script to look for and process all the federal data.json data inventories.
Trscraper
⭐
47
TRScraper is an application developed for use in natural language processing work that enables text mining on large platforms with Turkish-language content.
Awesome Georgian Datasets
⭐
46
Useful datasets, specific to Georgia
Raplyrics Scraper
⭐
43
Data sourcing and pre-processing for raplyrics.eu - A rap music lyrics generation project
Fifa Fut Data
⭐
39
Web-scraping script that writes the data of all players from FutHead and FutBin to a CSV file or a DB
Shopify App Store Scraper
⭐
38
Crawler behind the Shopify App Marketplace dataset
Thar
⭐
37
Mining all surnames used in Nepal.
Myanimelist Data Set Creator
⭐
35
A collection of simple Python scripts to create an anime and user dataset from https://myanimelist.net/.
Scrapecars
⭐
34
Building a car image dataset from scraping.
Open Australian Legal Corpus Creator
⭐
34
The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents.
Web Scraping Using Python
⭐
33
This project scrapes Wikipedia articles using BeautifulSoup to create a dataset and then analyzes the collected data.
Iclr2023 Openreviewdata
⭐
30
Crawl & Visualize ICLR 2023 Data from OpenReview
Dbrd
⭐
29
110k Dutch Book Reviews Dataset for Sentiment Analysis
Imagenetscraper
⭐
24
👁 Bulk-download all thumbnails from an ImageNet synset, with optional rescaling
Steam Games Scraper
⭐
23
Extract information about all games published on Steam using its Web API, and store it in JSON format.
Sp Subway Scraper
⭐
23
🚆This web scraper builds a dataset for São Paulo subway operation status
Board Game Scraper
⭐
21
Board game data scraper
Berlin_corona_cases
⭐
20
Scraper for the official dashboard with current Corona case numbers, traffic light indicators ("Corona-Ampel") and vaccination situation for Berlin.
Python_for_datascience
⭐
20
Python for Data Science
Tradetheevent
⭐
20
Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021
Hepsiburada Review Scraper
⭐
20
Hepsiburada review/comment and rating scraper. Turkish text dataset creator for data science and NLP projects. 📜
Data
⭐
19
Scraping world-wide data about COVID-19
Nft Dataset
⭐
19
Includes data about over 250 NFT Collections
Iranian Politicians Twitter Dataset Persian
⭐
18
Iranian politicians Twitter dataset (Persian): a complete dataset of Iranian politicians' tweets on Twitter for text-processing tasks
Text Classification Python
⭐
17
An example of retail product classification using scikit and nltk
News_summary
⭐
17
Dataset and scripts for scraping the news articles from popular sources along with the summary of the article.
Covid 19
⭐
16
Current and historical coronavirus covid-19 confirmed, recovered, deaths and active case counts segmented by country and region. Includes csv, json and sqlite data along with an interactive website explorer.
Data Projects
⭐
15
Personal Data Projects: Datasets created for stories published on Medium, open to the public.
Imdb Scraper
⭐
14
Scrapy project for scraping data from IMDB with Movie Dataset including 58,623 movies' data.
Statscraper
⭐
13
A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.
Readabilityinscience
⭐
12
Internet Affordability
⭐
12
🌍 Dataset that shows the Internet affordability by country (a shocking reality!)
Dataset Indian Companies
⭐
12
Web Scraping "List of companies in India" from AmbitionBox Website using Python and Beautiful Soup
Math Genealogy Scraper
⭐
10
Code for scraping (and a mirror of) the Math Genealogy Database
Dex
⭐
9
A Collection of Pokedex data and tools
Private Building Web Scraper
⭐
9
Scraper of https://bmis1.buildingmgt.gov.hk/bd_hadbiex/conten
India Trade Data
⭐
9
A web scraper written in Python to gather trade data for India across commodities and countries
Gpo_tools
⭐
8
Scraping and parsing tools for the GPO's congressional hearings dataset.
Southparkr
⭐
8
R package that scrapes South Park transcripts from wikia.com.
Coursera Web Scraper
⭐
7
A multiprocessing webscraper for Coursera.org to build a dataset for all courses with details like ratings, difficulty, etc.
Image Scraper
⭐
7
Image scraper for DuckDuckGo and Google for creating DL datasets
Digiklothes
⭐
7
A dataset of more than 55,000 clothing items on the digikala website and their current information, such as name, item URL, and image URL (plus current price, rating, and discount).
Laliga Dataset
⭐
6
LaLiga 2018-2019 Season - Advanced Player Statistics Dataset
Sverige_postnummer
⭐
6
Sweden's post-codes, street names, and box numbers.
Vulntoolkit
⭐
6
R package with NOAA and PSMSL web scrapers and analytical tools for analysis of coastal and estuarine datasets.
Pascalsentencedataset
⭐
5
Scraping Program for Pascal Sentence Dataset
Gutenberg
⭐
5
Project Gutenberg scraper, parser and LDA analysis with Mallet
Short Text Corpus With Focus On Humor Detection
⭐
5
Indexda
⭐
5
Natural Language Processing of academic papers for dataset indexing
Copyright 2018-2024 Awesome Open Source. All rights reserved.