Awesome Open Source
Search results for data science corpus
9 search results found
Sadedegel
⭐
81
A general-purpose NLP library for Turkish.
Parallel Corpora Tools
⭐
39
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Shabby Pages
⭐
34
ShabbyPages is a state-of-the-art corpus of born-digital document images, with both ground-truth and distorted versions, suitable for training models to reverse distortions and recover the original, denoised documents.
Hiphopathy
⭐
28
HipHopathy: an introductory data science unit using rap lyrics. The goal of this unit is to connect cultural relevance to computing by introducing elementary natural language processing techniques with a corpus of hip-hop data.
Annie
⭐
12
An NLP chatbot trained on a corpus of Reddit data.
Langdist
⭐
10
Multilingual Language Modeling Toolkit
Textdirectory
⭐
7
TextDirectory allows you to filter, transform, and combine multiple text files into one aggregated file.
Le Traducteur
⭐
6
A Neural Machine Translation framework built with PyTorch and AllenNLP.
Svevo Letters Analysis
⭐
5
Topic modeling and sentiment analysis on the Italo Svevo epistolary corpus.