Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Trafilatura | 2,447 | 66 | 5 months ago | 39 | November 29, 2023 | 66 | gpl-3.0 | Python | ||
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments | ||||||||||
Textdescriptives | 256 | 1 | 5 months ago | 32 | October 31, 2023 | 2 | apache-2.0 | Python | ||
A Python library for calculating a large variety of metrics from text | ||||||||||
Cadmium | 155 | 4 years ago | 9 | mit | Crystal | |||||
Natural Language Processing (NLP) library for Crystal | ||||||||||
A Smattering Of Nlp In Python | 150 | 6 years ago | 1 | apache-2.0 | ||||||
A very brief introduction to Natural Language Processing programming in Python | ||||||||||
Nlpserver | 74 | 2 years ago | 8 | mit | Python | |||||
NLP Web Service | ||||||||||
Lorca | 67 | 6 | 6 years ago | 13 | February 03, 2018 | 2 | mit | JavaScript | ||
Natural Language Processing for Spanish in Node.js. Stemmer, sentiment analysis, readability, tf-idf with batteries, concordance and more! | ||||||||||
Spacy_readability | 31 | 2 | 5 years ago | 5 | January 29, 2019 | 6 | mit | Python | ||
spaCy pipeline component for adding text readability meta data to Doc objects. | ||||||||||
Trf | 26 | 5 years ago | 6 | mit | Python | |||||
This is the repository for TRF (text readability features) publication. | ||||||||||
Neural Scam Artist | 15 | 3 years ago | mit | Python | ||||||
Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset. | ||||||||||
Dnlp | 12 | a year ago | mit | Python | ||||||
📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа |