Awesome Open Source
Search results for python nlp library
99 search results found
Transformers
⭐
124,049
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Spacy
⭐
28,628
💫 Industrial-strength Natural Language Processing (NLP) in Python
Awesome Pytorch List
⭐
14,715
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
Openprompt
⭐
4,006
An Open-Source Framework for Prompt-Learning.
Fastnlp
⭐
2,940
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Farm
⭐
1,706
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Spacy Models
⭐
1,429
💫 Models for the spaCy Natural Language Processing (NLP) library
Awesome Pytorch List Cnversion
⭐
1,426
Awesome-pytorch-list translation work in progress...
Tika Python
⭐
1,316
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Contextualized Topic Models
⭐
1,141
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Pythainlp
⭐
902
Thai Natural Language Processing in Python.
Skweak
⭐
898
skweak: A software toolkit for weak supervision applied to NLP tasks
Opendelta
⭐
810
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Janome
⭐
776
Japanese morphological analysis engine written in pure Python
Octis
⭐
647
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
Cn2an
⭐
589
📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, etc.)
Ekphrasis
⭐
583
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (English Wikipedia, Twitter - 330mil English tweets).
Treasure Of Transformers
⭐
541
💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️
Indic_nlp_library
⭐
511
Resources and tools for Indian language Natural Language Processing
Pynlpl
⭐
466
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP spec
Deep_qa
⭐
410
A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)
Pyarabic
⭐
407
pyarabic
Nagisa
⭐
365
A Japanese tokenizer based on recurrent neural networks
Camel_tools
⭐
351
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Chatbot_ner
⭐
320
chatbot_ner: Named Entity Recognition for chatbots.
Sudachipy
⭐
318
Python version of Sudachi, a Japanese tokenizer.
Transfer Nlp
⭐
281
NLP library designed for reproducible experimentation management
Zshot
⭐
278
Zero and Few shot named entity & relationships recognition
Quick Nlp
⭐
275
PyTorch NLP library based on FastAI
Urduhack
⭐
274
An NLP library for the Urdu language. It comes with many batteries-included features to help you process Urdu data in the easiest way possible.
Multi Task Nlp
⭐
269
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
Nlp_profiler
⭐
227
A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Spaczz
⭐
217
Fuzzy matching and more functionality for spaCy.
Wefe
⭐
169
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Mindnlp
⭐
145
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.
Danlp
⭐
141
DaNLP is a repository for Natural Language Processing resources for the Danish Language.
Spacy Udpipe
⭐
129
spaCy + UDPipe
Mutate
⭐
111
A library to synthesize text datasets using Large Language Models (LLM)
Toiro
⭐
110
A comparison tool of Japanese tokenizers
Turkish Deasciifier
⭐
103
Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs
Nlp_toolkit
⭐
101
Library of state-of-the-art models (PyTorch) for NLP tasks
Lima
⭐
92
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Classy
⭐
82
classy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Mlconjug
⭐
69
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Uralicnlp
⭐
65
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English
Nlp Guide
⭐
61
Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.
Natas
⭐
61
Python 3 library for processing historical English
Simstring
⭐
60
A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.
Mlconjug3
⭐
57
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Rakun2
⭐
56
RaKUn 2.0 - A fast keyword detection algorithm
Khmer Nltk
⭐
56
Khmer language processing toolkit
Extra Model
⭐
50
Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on English texts.
Textfeatureselection
⭐
45
Python library for feature selection for text features. It provides a filter method, a genetic algorithm, and TextFeatureSelectionEnsemble for improving text classification models.
Botok
⭐
43
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
Dsr16_nlp
⭐
39
NLP tutorial for the Berlin Data Science Retreat
Py Lingualytics
⭐
32
A text analytics library with support for codemixed data
Engerek
⭐
30
Turkish natural language processing library for Python
Python Ucto
⭐
29
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is a regular-expression-based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
Nlp Tools
⭐
28
Useful python NLP tools (evaluation, GUI interface, tokenization)
Simple_ner
⭐
26
simple rule based named entity recognition
Starcc Py
⭐
25
Simplified-Traditional Chinese conversion. Python implementation of StarCC, the next-generation Simplified-Traditional Chinese conversion framework
Paribhasha
⭐
24
A complete NLP application used to perform almost all sorts of Natural Language Processing operations for any user. It has a wide range of applications with a set of trustworthy results
Lachesis
⭐
23
lachesis automates the segmentation of a transcript into closed captions
Laonlp
⭐
22
Lao language NLP
Most Powerful Nlp Library
⭐
21
Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it can solve almost any NLP task you want to tackle.
Nuts
⭐
20
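A testing ground for solutions to common natural language processing tasks (mainly text classification, sequence labeling, automatic question answering, etc.)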
Taxonomy4good
⭐
20
Taxonomy4Good: a sustainability lexicon that provides the freedom to create custom taxonomies in addition to listed ESG and Sustainability Standards taxonomies.
Schrutepy
⭐
20
The Entire Transcript from the Office in Tidy Format
Murre
⭐
20
The amazing 🐕 will normalize non-standard Finnish/Swedish and dialectalize standard Finnish!
Unitxt
⭐
19
🦄 Unitxt: a python library for getting data fired up and set for training and evaluation
Giveme5w
⭐
15
Extraction of the five journalistic W-questions (5W) from news articles
Nlpiper
⭐
15
NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document.
Empythy
⭐
15
Automated NLP sentiment predictions- batteries included, or use your own data
Little_questions
⭐
13
parse and classify questions
Rulemma
⭐
13
Lemmatizer for Russian-language texts
Ner Annotator
⭐
13
GUI useful to manually annotate text for Named Entity Recognition purposes
Honest
⭐
12
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
Sngramextractor
⭐
12
Python package code repo for Implementation of syntactic n-grams (sn-gram) extraction
Ppdb
⭐
12
Interface for reading the Paraphrase Database (PPDB)
Ndetcstemmer
⭐
11
A library that implements a context-based nondeterministic stemming method to resolve morphologically ambiguous words (words with more than one meaning) when stemming words in Indonesian.
Pyfeel
⭐
11
Python package for emotion analysis in French
Spanlp
⭐
10
spanlp: nlp applied for spanish vulgarity. A fast, robust Python library to check for profanity or offensive language in Spanish strings. It contains all the rude words of Spanish-speaking countries.
Reliability Checklist
⭐
10
NLP tool for wide-range model reliability evaluations
Maleo
⭐
10
Wrapper library for text cleansing, preprocessing in NLP
Tearobot
⭐
10
a toy telegram bot using python
Spacy Th
⭐
9
Thai in spaCy
Arabic Nlp
⭐
9
The Arabic NLP Python Library
Smart Banking Chatbot
⭐
8
Smart Banking Chat Bot- This is an AI based project which uses several ML algorithms for Natural Language Understanding which identifies intent and entities from user issues and generates dialogue.
Jange
⭐
8
Easy NLP in Python
Semi Automated Youtube Channel
⭐
7
Semi automated youtube channel that has a lot of cool features for someone to use in their content generating project
Autonlp
⭐
7
NLP library for tasks like generation, inference, etc.
Taibun
⭐
6
Taiwanese Hokkien Transliterator and Tokeniser
Multiel
⭐
6
Multilingual entity linking using the BELA model
Rutokenizer
⭐
6
Russian text segmenter and tokenizer
Spacy Pythainlp
⭐
6
PyThaiNLP For spaCy
Keyword Extract
⭐
5
This is a simple library for extracting keywords from data with/without using a corpus.
Tl Dr
⭐
5
An end-to-end event extraction and summarization system.
Breame
⭐
5
Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English
Deepnlp Models
⭐
5
A repository of practical deep learning models for NLP tasks; all of the models will be implemented with TensorFlow and tested on Chinese industrial datasets.
Pyseg
⭐
5
Python Chinese word segmentation and part-of-speech tagging library
Copyright 2018-2024 Awesome Open Source. All rights reserved.