Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for lemmatizer
lemmatizer
x
98 search results found
Spark Nlp
⭐
3,578
State of the Art Natural Language Processing
Nlu
⭐
775
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
Parsimmon
⭐
714
Parsimmon is a wee linguistics toolkit for iOS written in Swift.
Wordless
⭐
649
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Word_forms
⭐
483
Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.
Cogcomp Nlp
⭐
448
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
Urduhack
⭐
274
An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way possible.
Pymystem3
⭐
231
A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.
Lemminflect
⭐
226
A python module for English lemmatization and inflection.
Dadmatools
⭐
142
DadmaTools is a Persian NLP tools developed by Dadmatech Co.
Pynlp
⭐
105
A pythonic wrapper for Stanford CoreNLP.
Stanfordnlp
⭐
101
[Deprecated] This library has been renamed to "Stanza". Latest development at: https://github.com/stanfordnlp/stanza
Lemmatizer
⭐
100
Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy
Simplemma
⭐
100
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
Jargon
⭐
98
Tokenizers and lemmatizers for Go
Elasticsearch Analysis Lemmagen
⭐
98
Elasticsearch lemmatizer for 15 languages
Rakun
⭐
91
Rank-based Unsupervised Keyword Extraction via Metavertex Aggregation
Spacy Experimental
⭐
87
🧪 Cutting-edge experimental spaCy components and features
Aot
⭐
80
Seman is a set of linguistic tools to analyze Russian or German texts, it contains lexicons and grammars. The project is interesting as a base line for many research projects in computer linguistics area.
Elasticsearch Analysis Morfologik
⭐
80
Morfologik Polish Lemmatizer plugin for Elasticsearch
Germalemma
⭐
77
A lemmatizer for German language text
Uralicnlp
⭐
65
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English
Spacy Lefff
⭐
60
Custom French POS and lemmatizer based on Lefff for spacy
Wink Lemmatizer
⭐
57
English lemmatizer
Fastcampus_textml_blogs
⭐
55
패스트캠퍼스, 자연어처리를 위한 머신러닝, 수업관련 포스트 입니다.
Golem
⭐
52
A lemmatizer implemented in Go
Grammarengine
⭐
51
Грамматический Словарь Русского Языка (+ английский, японский, etc)
Lucene Stanford Lemmatizer
⭐
48
A library that adds some NLP capabilities to the Lucene search engine
Lemma
⭐
45
A Morphological Parser (Analyser) / Lemmatizer written in Elixir.
Nlpstack
⭐
42
NLP toolkit (tokenizer, POS-tagger, parser, etc.)
Elasticsearch Ukrainian Lemmatizer
⭐
42
Ukrainian lemmatizer plugin for ElasticSearch
Collatinus
⭐
40
Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion
Lemmy
⭐
40
🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪
Javascript Lemmatizer
⭐
40
JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word.
Zeyrek
⭐
40
Python morphological analyzer for Turkish language. Partial port of ZemberekNLP.
Jhazm
⭐
37
A Java version of Hazm (Python library for digesting Persian text)
Spacy Fi
⭐
35
Experimental Finnish language model for SpaCy
Tamil Stemmer
⭐
34
A rule-based iterative affix stripping stemmer for Tamil
Turkish Lemmatizer
⭐
32
Lemmatization for Turkish Language
Cstlemma
⭐
32
Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.
Nhazm
⭐
31
A C# version of Hazm (Python library for digesting Persian text)
Spacy Spanish Lemmatizer
⭐
31
Spanish rule-based lemmatization for spaCy
Nlp Js Tools French
⭐
29
POS Tagger, lemmatizer and stemmer for french language in javascript
Spacy Pl
⭐
29
Combo
⭐
29
COMBO is jointly trained tagger, lemmatizer and dependency parser.
Node Phpmorphy
⭐
28
Полнофункциональный порт phpMorphy на Node.JS
Unidic2ud
⭐
27
Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese
Node Lemmer
⭐
27
English and Russian lemmatizer for Node.js
Lemport
⭐
26
A Lemmatizer for Portuguese
Lemmagenerator
⭐
26
Generator of rule-based lemmatizers (based on examples) for serveral European languages.
Lemlat3
⭐
24
Morphological analyzer and lemmatizer for Latin.
Lemmatag
⭐
23
A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, Arabic, etc.)
Jsastrawi
⭐
23
Natural Language Processing (NLP) Tools for Bahasa Indonesia
Spacy Iwnlp
⭐
23
German lemmatization with IWNLP as extension for spaCy
Mystem Scala
⭐
21
Morphological analyzer `mystem` wrapper for JVM languages
Knetlayers.jl
⭐
20
Useful Layers for Knet
Lexical Graph
⭐
18
🕸️WordNet visualization
Php Lemmatizer
⭐
18
Ixa Pipe Pos
⭐
17
IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes)
Jpdict
⭐
16
A Japanese dictionary for beginner
Iwnlp
⭐
16
IWNLP: A parser for the German edition of Wiktionary
Texttk
⭐
15
Text Preprocessing in Python
Glem
⭐
15
GLEM is a lemmatizer for Ancient Greek.
Lara Hungarian Nlp
⭐
14
NLP class for rapid ChatBot development in Hungarian language
Rulemma
⭐
13
Лемматизатор для русскоязычных текстов
Pyvabamorf
⭐
13
Python interface for the Vabamorf Estonian lemmatizer and morphological analyzer
Solr Lemmatizer
⭐
13
A TokenFilter that applies lemmatization to lemmatize English words.
Cl Lemma
⭐
13
English lemmatizer in Common Lisp
Lemmatizer
⭐
11
A rule-based lemmatizer for Bengali / Bangla based written in Python. Under active development.
Polem
⭐
10
Tool for lemmatization of multi-word phrases and named entities for Polish.
Nefnir
⭐
10
A lemmatizer for Icelandic text
Korean_lemmatizer
⭐
10
한국어 용언 분석기 (원형 복원, 용언 형태소 분석)
Jlemmagen
⭐
10
Java implmentation of LemmaGen project
Morphit Lemmatizer
⭐
10
Lexical lemmatizer of italian text
Awesome Bot
⭐
10
A curated list of awesome bot and AI packages and resources.
Pydic
⭐
9
Python toolkit for managing simple inflectional dictionaries
Frenchleffflemmatizer
⭐
9
A French Lemmatizer in Python based on the LEFFF
Eulexis_off_line
⭐
9
Ancient Greek lemmatisation tool
Corenlp Scala Examples
⭐
9
Stanford CoreNLP examples in Scala
Jp_tokenizer
⭐
8
A tokenizer and lemmatizer for Japanese text
Finnlem
⭐
8
Neural network based lemmatizer for Finnish language
Slovenianlemmatizer
⭐
8
LemmaGen Slovenian lemmatization library with bindings for C, Java and Python
Lemmatizer Pl
⭐
8
Python lemmatizer for Polish.
Nlproc_sdk_sample_code
⭐
7
Lemmatizer and Sentiment Analysis SDK sample code
Aksara
⭐
7
An Indonesian NLP tool that conforms to Universal Dependencies v2 annotation guidelines
Php Lemmatizer
⭐
7
Lemmatizer text with php and the TreeTagger library
Ninja Post
⭐
7
PHP + MySql implementation of Eric Brill's rule based part of speech tagger.
Lemmagen Python Extension
⭐
7
Python extension code to expose lemmatization functions of Lemmagen (a lemmatizer)
Word Quiz Generator
⭐
7
CLI tool to generate a vocabulary quiz
Lemmagen Lexicons
⭐
7
Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-le plugin
Solr_example
⭐
6
Example configuration for Solr with Slovenian language support
Norlem Norwegian Lemmatizer
⭐
6
A lemmatizer for Norwegian that uses lexical and contextual information from the Norwegian Dependency Treebank (NDT) and lexical information from Ordbanken.
Topic Modelling Using Lda
⭐
6
The Enron database is analysed using Latent Dirichlet allocation.
Ner_tsd2016
⭐
5
Software and data accompanying paper Neural Networks for Featureless Named Entity Recognition in Czech
Lemmatizer
⭐
5
Modest script for identifying dictionary forms of morphological forms in a text by the use of a full form lemma list.
Morfologik
⭐
5
Ruby MRI bindings for morfologik-stemming library.
Turglem Client
⭐
5
A simple client to the turglem lemmatizer
Stanfordnlp Util
⭐
5
java utilities for stanford nlp
1-98 of 98 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.