Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Data Science | 3,898 | | | | 4 months ago | | | 5 | | Jupyter Notebook |
Collection of useful data science topics along with articles, videos, and code | ||||||||||
Trafilatura | 2,447 | 66 | 4 months ago | 39 | November 29, 2023 | 66 | gpl-3.0 | Python | ||
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments | ||||||||||
Clean Text | 810 | 1 | 25 | a year ago | 7 | February 02, 2022 | 17 | other | Python | |
🧹 Python package for text cleaning | ||||||||||
Bookcorpus | 698 | | | | 10 months ago | | | 5 | mit | Python |
Crawl BookCorpus | ||||||||||
Complete Life Cycle Of A Data Science Project | 499 | | | | 4 months ago | | | 4 | mit | |
Complete-Life-Cycle-of-a-Data-Science-Project | ||||||||||
Weibo_terminator_workflow | 259 | | | | 7 years ago | | | 3 | | Python |
Updated version of weibo_terminator: a workflow version that aims to get the job done! | ||||||||||
Summarizer | 236 | | | | 2 years ago | | | | mit | Python |
A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom-built algorithm to rank words and sentences. | ||||||||||
Web Database Analytics | 144 | | | | 4 years ago | | | | mit | Jupyter Notebook |
Web scraping and related analytics using Python tools | ||||||||||
Knowledge Gpt | 91 | | | | a year ago | | | 9 | mit | Python |
Extract knowledge from all information sources using GPT and other language models. Index information sources and run Q&A sessions over them. | ||||||||||
Twitterscraper | 86 | | | | a year ago | | | 4 | mit | Python |
Scrape a User's Twitter data! Bypass the 3,200 tweet API limit for a User! |