| MycroftAI/mimic-recording-studio | 425 | | 0 | 0 | about 3 years ago | 0 | | 33 | apache-2.0 | JavaScript | Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2 |
| medialab/hyphe | 308 | | 0 | 0 | over 2 years ago | 0 | | 56 | agpl-3.0 | JavaScript | Website crawler with built-in exploration and control web interface |
| shuohangwang/SeqMatchSeq | 258 | | 0 | 0 | almost 9 years ago | 0 | | 3 | | Lua | |
| taishi-i/toiro | 110 | | 0 | 0 | almost 3 years ago | 10 | November 02, 2025 | 1 | apache-2.0 | Python | A comparison tool for Japanese tokenizers |
| DistrictDataLabs/baleen | 82 | | 0 | 0 | almost 8 years ago | 6 | April 18, 2016 | 23 | mit | Python | An automated ingestion service for blogs to construct a corpus for NLP research. |
| Lab41/pythia | 77 | | 0 | 0 | over 9 years ago | 0 | | 0 | other | Jupyter Notebook | Supervised learning for novelty detection in text |
| open-discourse/open-discourse | 64 | | 0 | 0 | over 3 years ago | 0 | | 14 | mit | Python | Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag). |
| IlyaGusev/PoetryCorpus | 29 | | 0 | 0 | over 8 years ago | 0 | | 11 | apache-2.0 | Python | Poetic corpus of the Russian language |
| statico/aspen | 29 | | 0 | 0 | almost 3 years ago | 0 | | 1 | mit | JavaScript | 🔎 📖 ✨ Custom, private search engine for text documents built with NextJS/React/ES6/ES7 |
| team-re-verb/RE-VERB | 21 | | 0 | 0 | over 5 years ago | 0 | | 8 | mit | Python | Speaker diarization system using an LSTM |