Awesome Open Source
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 85 Nlp Machine Learning Open Source Projects
Deeppavlov
⭐
5,145
An open source library for deep learning end-to-end dialog systems and chatbots.
Nemo
⭐
2,582
NeMo: a toolkit for conversational AI
Chatbot
⭐
2,175
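A Chinese chatbot that you can train yourself: train the chatbot you want on your own corpus, for use in scenarios such as intelligent customer service, online Q&A, and conversational chat. Currently includes seq2seq, seqGAN, TF 2.0, and PyTorch versions.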
Codesearchnet
⭐
1,391
Datasets, tools, and benchmarks for representation learning of code.
Text_classification
⭐
1,310
Text Classification Algorithms: A Survey
Tika Python
⭐
1,022
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Rasa Ui
⭐
803
Rasa UI is a frontend for the Rasa Framework
Tapas
⭐
605
End-to-end neural table-text understanding models.
Chinese_models_for_spacy
⭐
546
Models for SpaCy that support Chinese
Nlp_base
⭐
525
Basic natural language models
Babyai
⭐
503
BabyAI platform. A testbed for training agents to understand and execute language commands.
Awesome Sentiment Analysis
⭐
460
Repository with everything necessary for sentiment analysis and related areas
Hands On Nltk Tutorial
⭐
426
The hands-on NLTK tutorial for NLP in Python
Text_mining_resources
⭐
365
Resources for learning about Text Mining and Natural Language Processing
Nlp Conference Compendium
⭐
357
Compendium of the resources available from top NLP conferences.
Lingua
⭐
351
👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Contextualized Topic Models
⭐
346
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Dab
⭐
294
Data Augmentation by Backtranslation (DAB) ヽ( •_-)ᕗ
Ner
⭐
290
Named Entity Recognition
Dstc8 Schema Guided Dialogue
⭐
282
The Schema-Guided Dialogue Dataset
Data Science Hacks
⭐
275
Data Science Hacks consists of tips and tricks to help you become a better data scientist. The hacks are for everyone, from beginner to advanced, and cover Python, Jupyter Notebook, pandas, and so on.
Customer_satisfaction_analysis
⭐
267
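An opinion-mining project based on user-generated content (UGC) from online homestay listings, covering data mining and NLP processing: data collection, topic extraction, and sentiment analysis. The goal is to overcome inconsistencies between user ratings and reviews and to evaluate satisfaction with online homestays in real time; it includes online review collection and sentiment visualization. It provides a Baidu Maps POI query entry point for automated batch queries of POI information, and builds an LDA automatic topic clustering model on an online homestay corpus, using topic center words to find the corresponding topic attribute dictionary. User ratings serve as labels, the character-level TextCNN that ships with litNlp performs sentiment analysis, and the sentiment classification probability distribution is taken as the sentiment trend. Finally, homestay satisfaction across regions is displayed as a POI heat map. See the link for software versions.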
Hierarchical Attention Networks Pytorch
⭐
244
Hierarchical Attention Networks for document classification
Python Ai Assistant
⭐
237
Python AI assistant 🧠
Machine Learning Resources
⭐
234
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Melusine
⭐
226
Melusine is a high-level library for email classification and feature extraction, dedicated to French-language emails ("dédiée aux courriels français").
Natural Language Processing With Tensorflow
⭐
224
Natural Language Processing with TensorFlow, published by Packt
Deepehr
⭐
222
Chronic Disease Prediction Using Medical Notes
Chinese Poetry Generation
⭐
211
An RNN-based Chinese Poem Generator
Character Based Cnn
⭐
207
Implementation of character based convolutional neural network
Sarah
⭐
201
Terminal Assistant For SemiCode OS
Datastories Semeval2017 Task4
⭐
187
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Hntitlenator
⭐
185
Test your HN title against a neural network
Nlp_profiler
⭐
185
A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Ktext
⭐
182
Utilities for preprocessing text for deep learning with Keras
Natural Language Processing Specialization
⭐
178
This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera
Pytorch Sentiment Neuron
⭐
177
Pytorch Question Answering
⭐
161
Important paper implementations for Question Answering using PyTorch
Java Deep Learning Cookbook
⭐
159
Code for Java Deep Learning Cookbook
Financial News Dataset
⭐
154
Reuters and Bloomberg
Awesome Nlp Polish
⭐
153
A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets.
Onnxt5
⭐
147
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
Hands On Natural Language Processing With Python
⭐
146
This repository is for my Udemy students. You can find all the lecture code here, along with the files mentioned for reading. Feel free to clone it, and if you have any problem, just raise a question.
Zzz Retired__openstt
⭐
145
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Lazy
⭐
142
Lazy, AI chatbot service.
Seq2seq_tutorial
⭐
133
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Dan Jurafsky Chris Manning Nlp
⭐
126
My solutions to the Natural Language Processing course taught by Dan Jurafsky and Chris Manning in Winter 2012.
Nlp Pretrained Model
⭐
122
A collection of Natural language processing pre-trained models.
Elastic_transformers
⭐
121
Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers
Dl Text
⭐
119
Text pre-processing library for deep learning (Keras, TensorFlow).
G Reader
⭐
117
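Model from the 2018 Machine Reading Comprehension Technology Competition; among more than 1,000 teams from China and abroad, it ranked 6th on BLEU-4 and 14th on ROUGE-L. (No ensembling, no pre-trained word embeddings, no dropout.)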
Lingo
⭐
113
package lingo provides the data structures and algorithms required for natural language processing
Bertqa Attention On Steroids
⭐
112
BertQA - Attention on Steroids
Mrc_book
⭐
110
Code for the book "Machine Reading Comprehension: Algorithms and Practice"
Atnre
⭐
109
Adversarial Training for Neural Relation Extraction
Lemminflect
⭐
109
A python module for English lemmatization and inflection.
Textaugmentation Gpt2
⭐
109
Fine-tuned pre-trained GPT2 for custom topic-specific text generation. Such a system can be used for text augmentation.
Repo 2016
⭐
104
R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Question Generation
⭐
101
Given a sentence, automatically generate reading-comprehension-style factual questions from that sentence, such that the sentence contains the answers to those questions.
Writeup Frontend
⭐
97
Beat Writer's Block with AI
Wiki Split
⭐
96
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Monkeylearn
⭐
95
⛔️ ARCHIVED ⛔️ 🐒 R package for text analysis with Monkeylearn 🐒
Datascience
⭐
92
Examples and assignments discussed in the data science course taught at Algorithmica.
Doc2vec
⭐
92
📓 Long(er) text representation and classification using Doc2Vec embeddings
Lda Topic Modeling
⭐
91
A PureScript, browser-based implementation of LDA topic modeling.
Nlp Paper
⭐
91
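Covers the dialogue and speech areas of natural language processing: collected papers (with reading notes), model reproductions, data processing, and more (code in both TensorFlow and PyTorch versions).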
Summarus
⭐
88
Models for automatic abstractive summarization
Ml_things
⭐
79
This is where I put things I find useful that speed up my work with Machine Learning. Ever looked in your old projects to reuse those cool functions you created before? Well, this repo is designed to be a Python library of functions I created in my previous projects that can be reused. I also share some notebook tutorials and Python code snippets.
Russian_news_corpus
⭐
76
Russian mass media stemmed texts corpus / Corpus of lemmatized (morphologically normalized) texts from Russian mass media
Cracking The Da Vinci Code With Google Interview Problems And Nlp In Python
⭐
76
A guide on how to crack combinatorics puzzles shown in The Da Vinci Code movie using CS fundamentals and NLP
Intent_classifier
⭐
67
Aiops_platform
⭐
64
An Artificial Intelligence Platform for IT Operations.
How To Mine Newsfeed Data And Extract Interactive Insights In Python
⭐
61
A practical guide to topic mining and interactive visualizations
Argument Reasoning Comprehension Task
⭐
57
The Argument Reasoning Comprehension Task: Source codes & Datasets
Wongnai Corpus
⭐
57
Collection of Wongnai's datasets
Text Classification Keras
⭐
53
📚 Text classification library with Keras
Gec Pseudodata
⭐
50
Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Predicting Myers Briggs Type Indicator With Recurrent Neural Networks
⭐
46
Coursera Natural Language Processing Specialization
⭐
45
Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.
News_push_project
⭐
44
Real Time News Scraping and Recommendation System - React | Tensorflow | NLP | News Scrapers
Mitie_chinese_wikipedia_corpus
⭐
44
Pre-trained Wikipedia corpus by MITIE
Talismane
⭐
40
NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Letslearnai.github.io
⭐
33
Lets Learn AI
Sdtm_mapper
⭐
27
AI SDTM mapping (R for ML, Python, TensorFlow for DL)
Click2analyze Androiddevchallenge
⭐
20
An app that analyzes text and fixes anomalies in messages that deviate from what is standard, normal, or expected. #AndroidDevChallenge
1-85 of 85 projects