Baleen

An automated ingestion service for blogs to construct a corpus for NLP research.
Alternatives To Baleen
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Mimic Recording Studio425
a year ago33apache-2.0JavaScript
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
Seqmatchseq258
7 years ago3Lua
Toiro110
9 months ago8July 31, 20231apache-2.0Python
A comparison tool of Japanese tokenizers
Baleen82
6 years ago6April 18, 201623mitPython
An automated ingestion service for blogs to construct a corpus for NLP research.
Pythia77
8 years agootherJupyter Notebook
Supervised learning for novelty detection in text
Open Discourse64
a year ago14mitPython
Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
Aspen29
8 months ago1mitJavaScript
🔎 📖 ✨ Custom, private search engine for text documents built with NextJS/React/ES6/ES7
Poetrycorpus29
6 years ago11apache-2.0Python
Поэтический корпус русского языка
Re Verb21
3 years ago8mitPython
speaker diarization system using an LSTM
Newsfeed Corpus18
3 years agomitJavaScript
A Dockerized RSS feed fetcher for NLP work, using asyncio
Alternatives To Baleen
Select To Compare


Alternative Project Comparisons
Popular Corpus Projects
Popular Docker Projects
Popular Data Processing Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Docker
Natural Language Processing
Corpus
Mongo
Flask Application