Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for corpus japanese
corpus
x
japanese
x
38 search results found
Chinese Names Corpus
⭐
3,719
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词
Khcoder
⭐
295
KH Coder: for Quantitative Content Analysis or Text Mining
Fasttextjapanesetutorial
⭐
174
Tutorial to train fastText with Japanese corpus
Kanji Frequency
⭐
116
Kanji usage frequency data collected from various sources
Toiro
⭐
110
A comparison tool of Japanese tokenizers
Chive
⭐
105
Japanese word embedding with Sudachi and NWJC 🌿
Jlm
⭐
99
A fast LSTM Language Model for large vocabulary language like Japanese and Chinese
Ja.text8
⭐
74
Japanese text8 corpus for word embedding.
Jrte Corpus
⭐
73
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
Kwdlc
⭐
71
Kyoto University Web Document Leads Corpus
Laboro Bert Japanese
⭐
68
Laboro BERT Japanese: Japanese BERT Pre-Trained With Web-Corpus
Small_parallel_enja
⭐
61
50k English-Japanese Parallel Corpus for Machine Translation Benchmark.
Japanese Words To Vectors
⭐
57
Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.
Jparacrawl Finetune
⭐
57
An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.
Kyotocorpus
⭐
47
Kyoto University Text Corpus
Kantan Ej Dictionary
⭐
44
English-Japanese dictionary
Bsd
⭐
40
The Business Scene Dialogue corpus
Open2ch Dialogue Corpus
⭐
34
おーぷん2ちゃんねるをクロールして作成した対話コーパス
Ud_japanese Gsd
⭐
34
Japanese data from the Google UDT 2.0.
Make Meidai Dialogue
⭐
33
Get Japanese dialogue corpus
Extractive_rc_by_runtime_mt
⭐
30
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
Jsut Label
⭐
26
context labels and pronunciation data for JSUT corpus
Workshop Ijta
⭐
24
Rによる日本語テキスト分析入門
Asdc
⭐
22
Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)
Ud_japanese Bccwj
⭐
18
Maxixe
⭐
16
A small statistical segmenter for any language.
Annotatedfkccorpus
⭐
16
Annotated Fuman Kaitori Center Corpus
Pretrained_doc2vec_ja
⭐
15
Sentencegator
⭐
15
Find japanese sentences by WaniKani level
Technological Book Corpus Ja
⭐
14
日本語で書かれた技術書を収集した生コーパス/ツール
Mslt Corpus
⭐
13
Microsoft Speech Language Translation (MSLT) Corpus
Japanese
⭐
11
This repo contains a list of the 44,998 most common Japanese words in order of frequency, as determined by the University of Leeds Corpus.
Bilingualcorpus
⭐
11
Jmrd
⭐
7
Japanese Movie Recommendation Dialogue dataset
Keyakitreebank
⭐
7
Keyaki Treebank Parsed Corpus
Aozora Corpus Generator
⭐
6
Generates plain or tokenized text files from the Aozora Bunko
Haruniwa2
⭐
5
pipeline for parsing Japanese trained on data of the NPCMJ
Jpstats
⭐
5
tools for japanese corpus linguistics (as a hobbyist) in rust
Feliscatuszero
⭐
5
This system answers world history essay questions in Japanese and evaluate the answers.
Ami Meeting Parallel Corpus
⭐
5
AMI Meeting Parallel Corpus
Jp Ner
⭐
5
[abandoned] Work on generating an NER dataset for Japanese
Related Searches
Python Corpus (2,447)
Python Japanese (717)
Javascript Japanese (572)
Natural Language Processing Corpus (510)
Ruby Japanese (348)
Dataset Corpus (342)
Java Corpus (308)
Language Corpus (261)
1-38 of 38 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.