Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Trafilatura | 2,447 | 66 | 3 months ago | 39 | November 29, 2023 | 66 | gpl-3.0 | Python | ||
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments | ||||||||||
Spider | 907 | 6 years ago | 3 | gpl-3.0 | Java | |||||
A configurable web spider with a easy-to-use web console | ||||||||||
Extractnet | 118 | 4 months ago | 9 | November 06, 2022 | 3 | mit | HTML | |||
A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one package | ||||||||||
Lisc | 81 | 4 months ago | 5 | October 15, 2023 | 1 | apache-2.0 | Python | |||
Literature Scanner: Automated collection & analyses of the scientific literature. | ||||||||||
Trscraper | 47 | 3 years ago | 1 | mit | Python | |||||
TRScraper, doğal dil işleme uygulamalarında kullanılmak amacıyla geliştirilmiş, Türkçe içerik girilen büyük platformlarda metin madenciliği yapma imkanı sunan bir uygulamadır. | ||||||||||
Text Analysis | 32 | 7 years ago | Jupyter Notebook | |||||||
Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling. | ||||||||||
Newshound | 25 | a year ago | 1 | October 06, 2021 | 1 | mit | ||||
This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages. | ||||||||||
Scrapeadvisor | 22 | a year ago | n,ull | Python | ||||||
A user-friendly python-based GUI which provides sentiment analysis of users' reviews toward a specific TripAdvisor facility | ||||||||||
Hepsiburada Review Scraper | 20 | 5 years ago | gpl-3.0 | Python | ||||||
Hepsiburada review/comment and rating scraper. Turkish text dataset creator for data science and NLP projects. 📜 | ||||||||||
Restaurant Finder Featurereviews | 19 | 4 years ago | mit | Python | ||||||
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews). |