NLPgithub
Name | Description | |
---|---|---|
wainshine/Chinese-Names-Corpus | ||
Chinese-Word-Vectors | github repo | |
, PTT, , , ,, | link | |
json | github | |
2dva | ||
3GHTMLJSONnameaccountIDtitlecontent | github | |
github | ||
LeaderboardState-of-the-art | github | |
/(ASR) | github | |
LitBankNLP | 100 | github |
ULMFiT | github | |
github | ||
github | ||
github | ||
5809385800 | github | |
851620135M | github | |
github repo |
||
nlp17GB+9MB2-3 Gbit/s | github | |
700,000 couplets, 70 | github | |
github | ||
42GBJD(CSDD) | github | |
70 | link | |
github | ||
4 | Homepage | |
github | ||
fake news corpus | github | |
/ | github | |
github | ||
github | ||
-()--baseline-- | github | |
github | ||
CLUEDatasetSearch | NLPNLPNLP | github |
github | ||
139M + | paper and code | |
/ | github | |
NLP | github | |
// | github | |
&&& | github | |
OpenCLaP | github | |
BERT | DRCDSQuAD CMRC 2018:SQuAD |
github |
Dakshina | / | github |
OPUS-100 | (100) | github |
github | ||
github | ||
() | github | |
NLP/ | github | |
70 | github | |
- | github | |
COLDataset | paper |
Name | Description | |
---|---|---|
textfilter | observerss/textfilter | |
-> | cocoNLP | |
: ; : ;: /n /n /vn | github | |
() () () | kfcd/chaizi | |
:0.400704566541 : 0.37006739587 |
rainarch/SentiBridge | |
dongxiexidian/Chinese | ||
python-pinyin | mozillazg/python-pinyin | |
zhtools | skydark/nstools | |
say wo i ni # | tinyfool/ChineseWithEnglish | |
chinese_dictionary | guotong1988/chinese_dictionary | |
wordninja | wordninja | |
data | ||
THU | IT | link |
856, 280,20W13 | github | |
+ | - pea6 | |
Bi-LSTM + CRF+ | keras | link |
Universal Transformer + CRF | link | |
java version | ||
chinese-xinhua | api | github |
SpaCy | Parser, NER, packagespacyspacy | github |
github | ||
Synonyms | github | |
HarvestText | -- | github |
word2word | -62/3,564 | github |
github | ||
github | ||
103976 | sqlcsvExcel | github |
github | ||
github | ||
186 | github | |
github | ||
(featurizer) | github | |
char_featurizer - | github | |
mecabPython | github | |
g2pC | github | |
ssc, Sound Shape Code | - |
version 1 version 2 blog/introduction |
/ | github | |
Tokenizer | github | |
Tokenizers | github | |
github | ||
token2indexPyTorch/Tensorflow | github | |
github | ||
NLP | github | |
68916 | github |
Name | Description | |
---|---|---|
BMList | github | |
bert | link | |
bertslides | link | |
github | ||
bert tutorial | github | |
bert pytorch | github | |
bert pytorch | github | |
BERTBERT | github | |
bertELMO | github | |
BERT Pre-trained models and downstream applications | github | |
/BERT & ERNIE | github | |
Kashgarigpt-2 | github | |
Facebook LAMA | Transformer-XL/BERT/ELMo/GPT | github |
GPT2 | github | |
XLMFacebook | github | |
ALBERT | github | |
Transformers 2.0 | TensorFlow 2.0 and PyTorch (BERT, GPT-2, RoBERTa, XLM, DistilBert, XLNet): 8 architectures, 33 pretrained models, 102 languages | github |
8BERT | github | |
RoBERTa | 138GBRoBERTa | link |
ELECTRA | Pretrained Chinese model | github |
albert-chinese-ner | ALBERTNER | github |
github | ||
ELECTRA | github | |
Transformers(BERT, XLNet, Bart, Electra, Roberta, XLM-Roberta)() | github | |
TensorFlow Hub | 40+() | link |
UER | BERTGPTELMO | github |
github | ||
github | ||
Language Model as a Service (LMaaS) | github | |
GPT-NeoX-20B | 20 billion parameters | github |
CSL | 396,209 CSL NLP | github |
github |
Name | Description | |
---|---|---|
python package cocoNLP |
java version python version |
|
pytorch | github | |
bert pytorch | github | |
(Keyphrase) pke | github | |
BLINK | github | |
BERT/CRF | github | |
LatticeLSTM | github | |
python | github | |
Entity and Relation Extraction Based on TensorFlow and BERT | TensorFlow and BERT solution for the 2019 Schema based Knowledge Extraction task (SKE 2019) | github |
NeuroNER vs BertNER | github | |
BERT | github | |
github | ||
bert | tensorflow | github |
bert-Kashgari | keras Kashgari | github |
cocoNLP | rake | github |
Microsoft// | github | |
github | ||
NER | github | |
github | ||
github | ||
chinese_keyphrase_extractor (CKPE) | A tool for Chinese keyphrase extraction | github |
github | ||
BERT-NER-PytorchBERTNER | github |
Name | Description | |
---|---|---|
XLORE | link | |
github | ||
github repogithub |
||
github | ||
AmpliGraph (Python) | github | |
github | ||
github | ||
Zincbase | github | |
github | ||
github | ||
() | github | |
github | ||
github | ||
132 | link | |
(COKG-19) | link | |
github | ||
50 | github | |
14 | github | |
Jiagu | BiLSTM | github |
medical_NER - | github | |
// | github | |
LibKGE | github | |
mongodb | 81005800jiebademo | github |
github | ||
github | ||
github | ||
BLINK | github | |
/ | github | |
dstlr | github | |
BERT | github | |
COVID-19 |
github github |
|
DGL-KE | github | |
method data | ||
link |
Name | Description | |
---|---|---|
Texar | Toolkit for Text Generation and Beyond | github |
Ehud Reiter | link NLG | |
github | ||
link | ||
github | ||
github | ||
BLEURT | link | |
link 70 |
||
TransformerHacker News | github | |
SQL | github | |
github | ||
github | ||
GPT2/ | github | |
github | ||
TextFooler/ | github | |
SimBERT | UniLMBERT | github |
GPT-2 | github | |
github | ||
github | ||
Name | Description | |
---|---|---|
/ | github | |
github | ||
TextTeaser | github | |
BERT | github | |
Python | link | |
(Colab)( | github |
Name | Description | |
---|---|---|
github | ||
robot qingyun | qingyun | github |
github | ||
qa | Amodel-for-Retrivalchatbot - Chinese retrieval chatbot | git |
ConvLab | github | |
rasa | github | |
-() | github | |
github | ||
MiningZhiDaoQACorpus | 580580 | github |
GPT2GPT2-chitchat | github | |
(LeaderboardsDatasetsPapers) | github | |
github | ||
chatbot-list | github | |
Chinese medical dialogue data | github | |
110400 | github | |
CrossWOZ | paper & data | |
github | ||
2020(DSTC9 2020) | github | |
QuoraT5(Paraphrase) | github | |
GoogleTaskmaster-2 | github | |
Haystack(QA) | github | |
github | ||
Amazon- | github | |
webqadureaderAlbert Large QA | github | |
CommonsenseQAQA | link | |
MedQuAD() | github | |
AlbertElectra | github | |
14W | github |
Name | Description | |
---|---|---|
github | ||
github | ||
python | github | |
GitHub Typo CorpusGitHub/ | github | |
BertPuncBERT | github | |
github | ||
Chinese Spell Checking (CSC) and Grammatical Error Correction (GEC) | github | |
link |
Name | Description | |
---|---|---|
1 | github | |
Chinese-CLIP | CLIP & | github |
Name | Description | |
---|---|---|
ASR + | github | |
THCHS30 | data_thchs30.tgz (OpenSLR), test-noise.tgz (OpenSLR), resource.tgz (OpenSLR); Free ST Chinese Mandarin Corpus; AIShell-1 (OpenSLR); Primewords Chinese Corpus Set 1 (OpenSLR) | |
github | ||
Common Voice | 42,000 contributors, 1,400 hours of speech; github | link |
speech-aligner | github | |
ASR/ | github | |
github | ||
masr | github | |
github | ||
(MOSNet, BSSEval, STOI, PESQ, SRMR) | github | |
/ | github | |
CoVoSTFacebook- | 11() | github |
ParakeetPaddlePaddle- | github | |
(Java) | github | |
TensorFlow 2 | github | |
Python | github | |
ViSQOL | github | |
zhrtvc | github | |
aukit | github | |
phkit | github | |
zhvoice | 832009001300 | github |
audio | github | |
github | ||
Python | github | |
Audioset | github | |
github |
Name | Description | |
---|---|---|
LayoutLM-v3 | github | |
PyLaia | github | |
github | ||
DocSearch | github | |
fdfgen | link | |
pdfx | link | |
invoice2data | invoice2data | |
github | ||
PDFMiner | Extracts information from PDF documents and can convert a PDF to other text formats (such as HTML) | link |
PyPDF2 | Python PDF library for splitting, merging, cropping, and transforming the pages of PDF files | link |
ReportLab | Creates PDF documents; open source and written in Python; part of standard Linux distributions; used for Wikipedia's print/export feature | link |
SIMPdfPythonPDF | github | |
pdf-diff | PDFdiff pdf | github |
Name | Description | |
---|---|---|
unet | github | |
pdftabextract | OCR | link |
tabula-py | Reads tables in a PDF into a pandas DataFrame; Python wrapper around the Java tabula library | |
camelot | link | |
pdfplumber | ||
PubLayNet | link | |
github | ||
BERT | github | |
|
||
GAN | github | |
carefree-learn(PyTorch) | (AutoML) | github |
github | ||
github | ||
TaBERT | paper | |
Awesome-Table-Recognition | github |
Name | Description | |
---|---|---|
QAMatchZoo | github | |
github | ||
similarity | java, | github |
Hownet | github | |
Python | github | |
Siamese bilstm, | 10 | github |
Name | Description | |
---|---|---|
NLPEDA | github | |
NLP | github | |
github | ||
nlp | link | |
NLP | github |
Name | Description | |
---|---|---|
python package cocoNLP | ||
phone_number | python package cocoNLP | |
ID card number | `IDCards_pattern = r'^([1-9]\d{5}[12]\d{3}(0[1-9]\|1[012])(0[1-9]\|[12][0-9]\|3[01])\d{3}[0-9xX])$'`; `IDs = re.findall(IDCards_pattern, text, flags=0)` | |
IP | `(25[0-5]\|2[0-4]\d\|[0-1]\d{2}\|[1-9]?\d)\.(25[0-5]\|2[0-4]\d\|[0-1]\d{2}\|[1-9]?\d)\.(25[0-5]\|2[0-4]\d\|[0-1]\d{2}\|[1-9]?\d)\.(25[0-5]\|2[0-4]\d\|[0-1]\d{2}\|[1-9]?\d)` | |
`[1-9]([0-9]{5,11})` | ||
`[0-9-()]{7,18}` | ||
`[A-Za-z0-9_\-\u4e00-\u9fa5]+` | ||
+ | github | |
github |
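
The patterns in the table above can be tried out with Python's standard `re` module. The following is a minimal sketch, not the code of any package listed here. The dictionary keys (`numeric_id`, `phone_like`, `username`), the helper names `is_valid` and `find_id_cards`, and all sample values are assumptions made for illustration. The labels for the last three patterns are guesses from the pattern shapes. The IPv4 pattern is written without spaces and with escaped dots. The username class escapes the hyphen so that it is a literal character, not a range.

```python
import re

# One IPv4 octet (0-255), taken from the table and written without spaces.
OCTET = r"(25[0-5]|2[0-4]\d|[0-1]\d{2}|[1-9]?\d)"

# Body of the ID card pattern from the table, without the ^ and $ anchors.
ID_BODY = (
    r"[1-9]\d{5}[12]\d{3}(0[1-9]|1[012])"
    r"(0[1-9]|[12][0-9]|3[01])\d{3}[0-9xX]"
)

# The keys are hypothetical labels chosen for this sketch.
PATTERNS = {
    "id_card": re.compile(ID_BODY),
    "ipv4": re.compile(r"\.".join([OCTET] * 4)),  # four octets joined by literal dots
    "numeric_id": re.compile(r"[1-9]([0-9]{5,11})"),
    "phone_like": re.compile(r"[0-9\-()]{7,18}"),
    "username": re.compile(r"[A-Za-z0-9_\-\u4e00-\u9fa5]+"),
}


def is_valid(kind, candidate):
    """Return True when the whole candidate string matches the named pattern."""
    return PATTERNS[kind].fullmatch(candidate) is not None


# Lookarounds stop a match from starting or ending inside a longer digit run.
ID_IN_TEXT = re.compile(r"(?<!\d)" + ID_BODY + r"(?![0-9xX])")


def find_id_cards(text):
    """Return every ID-card-shaped substring found in free text."""
    return [m.group(0) for m in ID_IN_TEXT.finditer(text)]
```

`is_valid` uses `fullmatch` because an unanchored search would accept `56.1.1.1` inside `256.1.1.1`. `find_id_cards` drops the `^...$` anchors in favour of lookarounds, since an anchored pattern passed to `re.findall` only matches when the text consists of nothing but the ID. Neither helper verifies the ID checksum digit.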
Name | Description | |
---|---|---|
github | ||
/BERT/ | link | |
Deepmatch | github | |
wwsearch | github | |
aili - the fastest in-memory index in the East | github | |
RapidFuzz | A fast string matching library for Python and C++ that uses the string similarity calculations from FuzzyWuzzy | github |
Name | Description | |
---|---|---|
github | ||
/BERT/ | link | |
Deepmatch | github | |
allennlp | github |
Name | Description | |
---|---|---|
github | ||
awesome-nlp-sentiment-analysis | github | |
github |
Name | Description | |
---|---|---|
github | ||
NLP | github | |
PyTorchBERT(ACE 2005 corpus) | github | |
github |
Name | Description | |
---|---|---|
github | ||
NLLB | Translation model covering 200+ languages | link |
Easy-Translate | Built on the Facebook/Meta AI M2M100 and NLLB200 models; 200+ languages | github |
Name | Description | |
---|---|---|
()- | github | |
github | ||
github |
Name | Description | |
---|---|---|
github; baidu link (code: a0qq) |
Name | Description | |
---|---|---|
TextCluster | Short text clustering | github |
Name | Description | |
---|---|---|
NeuralNLP-NeuralClassifier | github |
Name | Description | |
---|---|---|
GraphbrainAI | github | |
() |
Name | Description | |
---|---|---|
github |
Name | Description | |
---|---|---|
TextAttack | github | |
OpenBackdoor | Textual backdoor attack and defense toolkit built with Python and PyTorch | github |
Name | Description | |
---|---|---|
Scattertext (python) | github | |
whatlies | spacy | |
PySS3AISS3 | github | |
3D | github | |
attnvisGPT2BERTtransformer | github | |
Texthero | github |
Name | Description | |
---|---|---|
NLP | github | |
brat rapid annotation tool | link | |
Poplar | github | |
LIDA | github | |
doccano | github | |
Datasaurai | link |
Name | Description | |
---|---|---|
langid | 97 languages | https://github.com/saffsd/langid.py |
langdetect | https://code.google.com/archive/p/language-detection/ |
Name | Description | |
---|---|---|
jieba | jieba | |
hanlp | hanlp | |
nlp4han | (//////NER/N/HMM/// | github |
link | ||
PytorchBert | github | |
github | ||
BERT | github | |
jieba_fast jieba | github | |
StanfordNLP | Python | link |
Python() | github | |
PreNLP | github | |
nlp | Word embedding, named entity recognition (NER), text classification, text generation, text similarity; NLP algorithms built on keras and tensorflow | github |
Python/NLP | github | |
Fortepipeline | github | |
stanzaNLP | github | |
Fancy-NLP | github | |
NLP | github | |
DSSMpipeline | github | |
Texthero | github | |
nlpgnn | github | |
Macadam | Tensorflow(Keras)bert4keras | github |
LineFlowNLP | github | |
ArabicaPython | github | |
Python SMSBoom | github |
Name | Description | |
---|---|---|
phunterlau/wangfeng-rnn | ||
github | ||
NLP | github | |
github link | ||
github | ||
CoupletAI - | CNN+Bi-LSTM+Attention | github |
github | ||
14W | github | |
COPE - | github | |
Paper2GUI | AIAPP18+AIOCR | github |
github paper | ||
Python | homepage gitee |
Name | Description | |
---|---|---|
link | ||
link | ||
link | ||
link | ||
link | ||
link | ||
link | ||
link | ||
3D | link | |
link | ||
link | ||
cs224n | link pytorch link | |
github | ||
Natural Language Processingby Jacob Eisenstein | github | |
ML-NLP | (Machine Learning)NLP | github |
NLP | github | |
2019NLP | download | |
nlp-recipes-- | github | |
github | ||
Transfer Learning in Natural Language Processing (NLP) | youtube | |
link github |
Name | Description | |
---|---|---|
NLPTOP | github | |
2019(7) | github |
Name | Description | |
---|---|---|
BDCI2019 | github | |
github | ||
github | ||
-() | github | |
github |
Name | Description | |
---|---|---|
NLP | github | |
spaCy | github | |
python | github | |
github repogithub | ||
Chinese medical dialogue data | github | |
110400 | github | |
COVID-19 |
github github |
Name | Description | |
---|---|---|
Blackstone | spaCy pipeline and NLP model for unstructured legal text | github |
github | ||
-() | github | |
856, 280,20W13 | github |
Name | Description | |
---|---|---|
Dalle-mini | DALLE | github |
Name | Description | |
---|---|---|
phone | ls0f/phone | |
phone | AfterShip/phone | |
ngender | observerss/ngender | |
NLP | link | |
PDF PPT | github | |
comparxiv arXiv | pypi | |
CHAMELEON | github | |
github | ||
Python | github |