Awesome Open Source
Search results for natural language processing chinese
47 search results found
D2l Zh
⭐
56,684
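"Dive into Deep Learning" (《动手学深度学习》): aimed at Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.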
Qwen
⭐
11,085
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Chinese Bert Wwm
⭐
8,600
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
Nlp_chinese_corpus
⭐
8,344
Large Scale Chinese Corpus for NLP
Text_classification
⭐
7,628
all kinds of text classification models and more with deep learning
Gpt2 Chinese
⭐
7,249
Chinese version of GPT2 training code, using BERT tokenizer.
Ansj_seg
⭐
6,442
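Ansj word segmentation: a true Java implementation of ict. Its segmentation quality and speed both exceed the open-source version of ict. Chinese word segmentation, person name recognition, part-of-speech tagging, user-defined dictionaries.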
Baichuan 7b
⭐
5,493
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Awesome Chinese Llm
⭐
5,477
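A curated collection of open-source Chinese large language models, focusing on models that are smaller in scale, privately deployable, and cheaper to train, including base models, vertical-domain…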
Huatuo Llama Med Chinese
⭐
3,776
Repo for BenTsao (本草) [original name: HuaTuo (华驼)]: instruction-tuning large language models with Chinese medical knowledge.
Awesome Pretrained Chinese Nlp Models
⭐
3,738
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models.
Text2vec
⭐
3,610
text2vec, text to vector. A text vector representation tool that converts text into vector matrices; implements Word2Vec, RankBM25, Sentence-
Baichuan2
⭐
3,527
A series of large language models developed by Baichuan Intelligent Technology
Deepke
⭐
3,035
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Linly
⭐
2,964
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pretraining and instruction-tuning datasets.
Gpt2 Chitchat
⭐
2,870
GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)
Chinese Clip
⭐
2,816
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Uer Py
⭐
2,802
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Cluedatasetsearch
⭐
2,778
Search all Chinese NLP datasets, with commonly used English NLP datasets included.
Jionlp
⭐
2,724
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
Cv
⭐
2,637
✔ (Completed) The most comprehensive deep learning notes [Tudui (土堆): PyTorch] [Mu Li: Dive into Deep Learning] [Andrew Ng: Deep Learning]
Baichuan 13b
⭐
2,579
A 13B large language model developed by Baichuan Intelligent Technology
Mnbvc
⭐
2,533
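MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset not only includes mainstream…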
Gse
⭐
2,352
Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others.
Tigerbot
⭐
2,096
TigerBot: A multi-language multi-task LLM
Information Extraction Chinese
⭐
2,086
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT
Kcws
⭐
2,044
Deep Learning Chinese Word Segmentation
Awesome_chinese_medical_nlp
⭐
1,847
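A compilation of public Chinese medical NLP resources: terminology sets / corpora / word embeddings / pretrained models / knowledge graphs / named entity recognition / QA / information…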
Gpt2 Ml
⭐
1,674
GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter Chinese pretrained model.
Bert Ner Pytorch
⭐
1,666
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
Chinese Annotator
⭐
1,431
Annotator for Chinese Text Corpus (UNDER DEVELOPMENT)
Chinesenlp
⭐
1,329
Datasets and SOTA results for every field of Chinese NLP
Chinese Electra
⭐
1,253
Pre-trained Chinese ELECTRA
Jieba Php
⭐
1,193
"Jieba" (結巴, Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
Nlp_xiaojiang
⭐
1,129
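Natural language processing (NLP): Xiaojiang robot (retrieval-based chitchat chatbot), BERT sentence embeddings and similarity (sentence similarity), XLNet sentence embeddings and similarity (text XLNet embedding), text classification, entity extraction (NER, BERT+BiLSTM+CRF), data augmentation (text augment, data enhance), synonymous sentence and synonym generation, sentence trunk extraction (mainpart), Chinese short-text similarity, text features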
Bpemb
⭐
1,068
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Data Juicer
⭐
994
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Tencentpretrain
⭐
951
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Lightnlp
⭐
725
A deep learning framework for natural language processing, built on PyTorch and torchtext.
Thuocl
⭐
653
THUOCL (THU Open Chinese Lexicon): a Chinese lexicon
Algorithm_interview_notes Chinese
⭐
603
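2018/2019 / campus recruiting / spring recruiting / autumn recruiting / natural language processing (NLP) / deep learning / machine learning / C / C++ / Python / interview notes. It also includes all the machine learning/deep learning… that the creator has come across. Besides the DL/ML-related material, other computer science knowledge relevant to algorithm roles is also recorded. It does not cover questions for roles such as front-end, testing, Java, or Android.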
Jieba Rs
⭐
585
The Jieba Chinese Word Segmentation Implemented in Rust
Gpt2 Newstitle
⭐
528
Chinese news title generation project using GPT2, with extremely detailed comments.
Cluecorpus2020
⭐
517
Large-scale Pre-training Corpus for Chinese: a 100G Chinese pre-training corpus
Chinese_models_for_spacy
⭐
498
Models for spaCy that support Chinese
Macbert
⭐
489
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
Med Chatglm
⭐
462
Repo for Chinese Medical ChatGLM: instruction tuning of ChatGLM with Chinese medical knowledge
Ckip Transformers
⭐
439
CKIP Transformers
Clause
⭐
390
🏇 Chatbot, natural language understanding, semantic understanding
Glyce
⭐
387
Code for NeurIPS 2019 - Glyce: Glyph-vectors for Chinese Character Representations
Chinese Nlp Corpus
⭐
378
Collections of Chinese NLP corpus
Cope
⭐
377
A modern IDE for writing classical Chinese poetry (a regulated-verse editor)
Opencc4j
⭐
366
🇨🇳 Open Chinese Convert is an open-source project for conversion between Traditional Chinese and Simplified Chinese. (Java implementation)
Chinesenlpcorpus
⭐
362
A collection of Chinese NLP corpora, including basic Chinese syntactic word sets, semantic word sets, domain synchronic and diachronic (historical) corpora, and evaluation corpora.
Cmrc2018
⭐
313
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
Chinese2digits
⭐
286
The best tool for converting Chinese numerals to Arabic digits. Handles many Chinese expressions such as "点二八" (point two eight) and "负百分之四十" (negative forty percent).
Jiebar
⭐
277
Chinese text segmentation with R. (Documentation updated 🎉: https://qinwenfeng.com/jiebaR/ )
Ahanlp
⭐
275
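The Aha (啊哈) natural language processing package, providing services including word segmentation, dependency parsing, semantic role labeling, automatic summarization, semantic similarity computation, LDA topic prediction, and word clouds.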
Multi Criteria Cws
⭐
260
Simple Solution for Multi-Criteria Chinese Word Segmentation
Macropodus
⭐
256
Macropodus, a natural language processing tool based on the ALBERT+BiLSTM+CRF deep learning network architecture. Chinese… of NLP: CWS (Chinese word segmentation), POS (part-of-speech tagging), NER (named entity recognition), Find (new word discovery), Keyword (keyword extraction), Summarize (text summarization), Sim (text similarity), Calculate (scientific calculator), Chi2num (Chinese number to Arabic number)
Cnn Text Classification Tf Chinese
⭐
235
CNN for Chinese Text Classification in Tensorflow
Jiayan
⭐
232
Jiayan (甲言), the first NLP toolkit designed for Classical Chinese; supports lexicon construction, tokenizing, POS tagging, sentence segmentation, and punctuation.
Chineseembedding
⭐
224
Chinese embedding collection including character (token), POS tag, pinyin, dependency, and word embeddings.
Monpa
⭐
222
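MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition.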
Awesome Caffe
⭐
220
Awesome Caffe
Chineseblue
⭐
212
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Segmentit
⭐
208
A Chinese word segmentation package usable in any JS environment, forked from leizongmin/node-segment
Chinese Poetry Generation
⭐
195
An RNN-based Chinese Poem Generator
Doc Han Att
⭐
193
Hierarchical Attention Networks for Chinese Sentiment Classification
Cornucopia Llama Fin Chinese
⭐
178
Cornucopia (聚宝盆): LLaMA models fine-tuned on Chinese financial knowledge; covers SFT, RLHF, GPU training and deployment, and more.
Varbook
⭐
177
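A variable-naming assistant for Chinese-speaking programmers: NLP plus translation, standardized variable naming, and customizable naming rules.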
Nlp Public Dataset
⭐
172
Chinese and English NER datasets, English-Chinese machine translation datasets, and Chinese word segmentation datasets.
Thuctc
⭐
167
An Efficient Chinese Text Classifier
Mgpt
⭐
162
Multilingual Generative Pretrained Model
All4nlp
⭐
157
All For NLP, especially Chinese.
Word Checker
⭐
156
🇨🇳🇬🇧 Chinese and English word spelling corrector. (Detects commonly confused Chinese characters, detects and corrects Chinese spelling, and checks English word spelling.)
Deeplearning_nlp
⭐
149
A deep learning-based natural language processing library
Guyu
⭐
148
pre-training and fine-tuning framework for text generation
Radish
⭐
146
C++ model training and inference framework
Awesome Text Classification
⭐
144
Awesome-Text-Classification: projects, papers, tutorials.
Pre Modern_chinese_corpus_dataset
⭐
132
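Pre-modern Chinese corpus dataset. Natural language processing, corpus, ancient Chinese, Classical Chinese, digital humanities, computational language…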
Elmo Chinese
⭐
131
Deep contextualized word representations for Chinese
Id Cnn Cws
⭐
130
Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"
Visual Chinese Llama Alpaca
⭐
129
Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)
Nlpcc Wordseg Weibo
⭐
121
NLPCC 2016 Weibo word segmentation evaluation project
Dataclue
⭐
117
DataCLUE: a data-centric NLP benchmark and toolkit
Chinese_nlu_by_using_rasa_nlu
⭐
106
Use RASA NLU to build a Chinese Natural Language Understanding System (NLU)
Treebankpreprocessing
⭐
106
Python scripts preprocessing Penn Treebank and Chinese Treebank
Chinese Hip Pop Generation
⭐
104
Generate Chinese hip-pop lyrics using GAN
Segment
⭐
98
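The jieba-analysis tool for Java. (A more flexible, elegant, easy-to-use, high-performance Java word segmentation implementation built on the Jieba dictionary. Supports part-of-speech tagging.)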
Nlp Hanzi Similar
⭐
94
The hanzi similarity tool. (Computes similarity between Chinese characters; an algorithm for visually similar characters.)
Cmrc2019
⭐
90
A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)
Rasa_milktea_chatbot
⭐
88
Chinese chatbot using a BERT Chinese model for intent analysis, based on the Rasa framework
Tf Idf Python
⭐
86
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.
Zhopenie
⭐
86
Chinese Open Information Extraction (Tree-based Triple Relation Extraction Module)
Nlp Resources
⭐
85
A useful list of NLP(Natural Language Processing) resources
Unihandecode
⭐
71
unihandecode is a transliteration library that converts all Unicode characters/words into the ASCII alphabet, aware of language preference priorities
Bert Chinese Text Classification Pytorch
⭐
68
This repo contains a PyTorch implementation of a pretrained BERT model for text classification.
Nlpdataaugmentation
⭐
67
Chinese NLP Data Augmentation, BERT Contextual Augmentation
Awesome Nlp Chinese Corpus
⭐
59
A curated list of Chinese corpus resources for NLP (Natural Language Processing)
Related Searches
Python Natural Language Processing (7,915)
Jupyter Notebook Natural Language Processing (4,405)
Machine Learning Natural Language Processing (3,939)
Deep Learning Natural Language Processing (2,414)
Python Chinese (1,892)
Pytorch Natural Language Processing (1,212)
Artificial Intelligence Natural Language Processing (1,010)
Dataset Natural Language Processing (1,010)
Tensorflow Natural Language Processing (909)
Javascript Natural Language Processing (843)
Copyright 2018-2024 Awesome Open Source. All rights reserved.