Awesome Open Source
Search results for nlp library
158 search results found
Transformers
⭐
124,049
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Spacy
⭐
28,628
💫 Industrial-strength Natural Language Processing (NLP) in Python
Awesome Pytorch List
⭐
14,715
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
Openprompt
⭐
4,006
An Open-Source Framework for Prompt-Learning.
Fastnlp
⭐
2,940
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Fnlp
⭐
2,634
Toolkit for Chinese natural language processing
Farm
⭐
1,706
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Spacy Models
⭐
1,429
💫 Models for the spaCy Natural Language Processing (NLP) library
Awesome Pytorch List Cnversion
⭐
1,426
Awesome-pytorch-list: translation work in progress...
Tika Python
⭐
1,316
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Underthesea
⭐
1,288
Underthesea - Vietnamese NLP Toolkit
Contextualized Topic Models
⭐
1,141
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Pythainlp
⭐
902
Thai Natural Language Processing in Python.
Skweak
⭐
898
skweak: A software toolkit for weak supervision applied to NLP tasks
Opendelta
⭐
810
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Janome
⭐
776
Japanese morphological analysis engine written in pure Python
Kagome
⭐
769
Self-contained Japanese Morphological Analyzer written in pure Go
Kuromoji
⭐
688
Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search
Sudachi
⭐
684
A Japanese Tokenizer for Business
Octis
⭐
647
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
Lingua
⭐
622
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Cn2an
⭐
589
📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)
Ekphrasis
⭐
583
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (English Wikipedia, Twitter - 330 million English tweets).
Treasure Of Transformers
⭐
541
💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️
Awesome Japanese Nlp Resources
⭐
522
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
Indic_nlp_library
⭐
511
Resources and tools for Indian language Natural Language Processing
Pynlpl
⭐
466
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build a simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP spec
Giveme5w1h
⭐
451
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
Medspacy
⭐
448
Library for clinical NLP with spaCy.
Deep_qa
⭐
410
A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)
Pyarabic
⭐
407
pyarabic
Nagisa
⭐
365
A Japanese tokenizer based on recurrent neural networks
Camel_tools
⭐
351
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Chatbot_ner
⭐
320
chatbot_ner: Named Entity Recognition for chatbots.
Sudachipy
⭐
318
Python version of Sudachi, a Japanese tokenizer.
Nlp Natural Language Processing
⭐
289
Projects and useful articles / links
Transfer Nlp
⭐
281
NLP library designed for reproducible experimentation management
Zshot
⭐
278
Zero and Few shot named entity & relationships recognition
Quick Nlp
⭐
275
Pytorch NLP library based on FastAI
Urduhack
⭐
274
An NLP library for the Urdu language. It comes with a lot of batteries-included features to help you process Urdu data in the easiest way possible.
Multi Task Nlp
⭐
269
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
Nlp_profiler
⭐
227
A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Spaczz
⭐
217
Fuzzy matching and more functionality for spaCy.
Bllip Parser
⭐
207
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser). See http://pypi.python.org/pypi/bllipparser/ for the Python module.
Wefe
⭐
169
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Cadmium
⭐
155
Natural Language Processing (NLP) library for Crystal
Mindnlp
⭐
145
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.
Danlp
⭐
141
DaNLP is a repository for Natural Language Processing resources for the Danish Language.
Spacy Udpipe
⭐
129
spaCy + UDPipe
Mutate
⭐
111
A library to synthesize text datasets using Large Language Models (LLM)
Toiro
⭐
110
A comparison tool of Japanese tokenizers
Turkish Deasciifier
⭐
103
Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs
Lingo
⭐
102
package lingo provides the data structures and algorithms required for natural language processing
Nlp_toolkit
⭐
101
Library of state-of-the-art models (PyTorch) for NLP tasks
Lima
⭐
92
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Punkt Segmenter
⭐
86
Ruby port of the NLTK Punkt sentence segmentation algorithm
Rwordnet
⭐
84
A pure Ruby interface to the WordNet database
Spacy Cpp
⭐
83
C++ wrapper library for the NLP library spaCy
Minie
⭐
82
An open information extraction system that provides compact extractions
Classy
⭐
82
classy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Tf Transformers
⭐
76
State-of-the-art faster Transformer with TensorFlow 2.0 (NLP, Computer Vision, Audio).
Mlconjug
⭐
69
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Uralicnlp
⭐
65
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English
Topmost
⭐
62
Towards the TopMost: A Topic Modeling System Toolkit
Nlp Guide
⭐
61
Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.
Natas
⭐
61
Python 3 library for processing historical English
Simstring
⭐
60
A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.
Mecab Ko For Google Colab
⭐
58
Use Mecab Library(NLP Library) in Google Colab
Mlconjug3
⭐
57
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Rakun2
⭐
56
RaKUn 2.0 - A fast keyword detection algorithm
Khmer Nltk
⭐
56
Khmer language processing toolkit
Node Opennlp
⭐
54
Apache OpenNLP wrapper for Nodejs
Kermit
⭐
52
🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings
Grammarengine
⭐
51
Grammatical Dictionary of the Russian Language (+ English, Japanese, etc.)
Extra Model
⭐
50
Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on English texts.
Indic_nlp_resources
⭐
45
Resources to go with the Indic NLP Library
Textfeatureselection
⭐
45
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Botok
⭐
43
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
Simplenetnlp
⭐
41
.NET NLP library
Dsr16_nlp
⭐
39
NLP tutorial for the Berlin Data Science Retreat
Easy Bert
⭐
38
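easy-bert is a Chinese NLP tool that provides methods for calling and tuning many BERT variants and is very quick to get started with; clear design and …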
Template Based Generator Template
⭐
36
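A template for template-based text generators: templates beget templates, phoenixes beget phoenixes, and a mouse's son knows how to dig holes. Run locally: npm i && npm run dev:demo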
Sentiment Analyser
⭐
35
ML that can extract German and English sentiment
Py Lingualytics
⭐
32
A text analytics library with support for codemixed data
React Nlp Annotate
⭐
31
Interface for making NLP annotations.
Engerek
⭐
30
Turkish natural language processing library for Python
Python Ucto
⭐
29
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is a regular-expression-based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
Nlp Tools
⭐
28
Useful python NLP tools (evaluation, GUI interface, tokenization)
Simple_ner
⭐
26
simple rule based named entity recognition
Luanlp
⭐
25
Natural Language Processing Library for Lua
Starcc Py
⭐
25
Simplified-Traditional Chinese conversion: Python implementation of StarCC, the next generation of Simplified-Traditional Chinese conversion framework
Paribhasha
⭐
24
A complete NLP application used to perform almost all sorts of Natural Language Processing operations for any user. It has a wide range of applications with a set of trustworthy results
Nalapa
⭐
23
NodeJS NLP Library for Bahasa Indonesia.
Lachesis
⭐
23
lachesis automates the segmentation of a transcript into closed captions
Laonlp
⭐
22
Lao language NLP
Italian Nlp Library
⭐
21
A library to run NLP tasks on Italian language
Most Powerful Nlp Library
⭐
21
Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it can solve almost any NLP task you want to tackle.
Murre
⭐
20
The amazing 🐕 will normalize non-standard Finnish/Swedish and dialectalize standard Finnish!
Nuts
⭐
20
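An experimental testbed of solutions for common natural language processing tasks (mainly text classification, sequence labeling, automatic question answering, etc.)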
Taxonomy4good
⭐
20
Taxonomy4Good: a sustainability lexicon that provides the freedom to create custom taxonomies in addition to listed ESG and Sustainability Standards taxonomies.
1-100 of 158 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.