Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Gensim Data | 492 | 6 years ago | 14 | lgpl-2.1 | Python | |||||
Data repository for pretrained NLP models and NLP corpora. | ||||||||||
Narrativeqa | 362 | 4 years ago | apache-2.0 | Shell | ||||||
This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers. | ||||||||||
Chakin | 313 | 1 | 5 years ago | 7 | March 27, 2019 | 8 | mit | Python | ||
Simple downloader for pre-trained word vectors | ||||||||||
Wikitables | 279 | 3 | 1 | 3 years ago | 14 | August 26, 2021 | 4 | mit | Python | |
Import tables from any Wikipedia article as a dataset in Python | ||||||||||
Datasets | 192 | 5 months ago | 1 | CSS | ||||||
Interesting datasets you could use with Algolia | ||||||||||
Qb | 160 | 2 years ago | 7 | mit | Python | |||||
QANTA Quiz Bowl AI | ||||||||||
Legislator | 90 | 3 months ago | 1 | April 24, 2020 | 1 | R | ||||
Interface to the Comparative Legislators Database | ||||||||||
Ambigqa | 86 | 2 years ago | Python | |||||||
An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions" | ||||||||||
Awesome Wikipedia | 76 | 8 months ago | 2 | cc0-1.0 | ||||||
A curated list of awesome Wikipedia-related frameworks, libraries, software, datasets and references. | ||||||||||
Text Segmentation | 73 | 5 years ago | 3 | Python | ||||||
Implementation of the paper: Text Segmentation as a Supervised Learning Task |