Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for java tokenizer
java
x
tokenizer
x
56 search results found
Jflex
⭐
523
The fast scanner generator for Java™ with full Unicode support
Elasticsearch Analysis Vietnamese
⭐
470
Vietnamese Analysis Plugin for Elasticsearch
Cogcomp Nlp
⭐
448
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
Mmseg4j Solr
⭐
403
mmseg4j for lucene or solr analyzer
Spyglass
⭐
378
A library for mentions on Android
Smoothnlp
⭐
320
专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Elasticsearch Analysis Jieba
⭐
296
The plugin includes the `jieba` analyzer, `jieba` tokenizer, and `jieba` token filter, and have two mode you can choose. one is `index` which means it will be used when you want to index a document. another is `search` mode which used when you want to search something.
Elasticsearch Analysis Hao
⭐
201
一个非常hao用的elasticsearch(es)中文分词器插件
Elasticsearch Analysis Openkoreantext
⭐
112
Korean analysis plugin that integrates open-korean-text module into elasticsearch.
Elasticsearch Analysis Bosonnlp
⭐
105
BosonNLP Analysis for ElasticSearch
Open Nlp
⭐
88
Ruby bindings to the OpenNLP Java toolkit.
Elasticsearch Analysis Korean
⭐
81
Korean Analysis Plugin for Elasticsearch
Elasticsearch Analysis Url
⭐
56
A URL tokenizer and token filter plugin for Elasticsearch
Talismane
⭐
45
NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Elasticsearch Extended Analyze
⭐
44
Extend Analyze API Plugin for Elasticsearch
Sonar Ps Plugin
⭐
33
Powershell language plugin for SonarQube
React Native Japanese Tokenizer
⭐
31
Async Japanese Tokenizer Native Plugin for React Native for iOS and Android
Lfuzzer
⭐
28
Fuzzing Parsers with Tokens
Elasticsearch Analysis Japanese
⭐
28
Japanese analyzer uses kuromoji japanese tokenizer for ElasticSearch
Sentencepiece Jni
⭐
27
Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.
Elasticsearch Plugins
⭐
25
Some native scoring script plugins for elasticsearch
Deeplearningsmells
⭐
23
Smelling smells using Deep Learning
Mte
⭐
23
MiTextExplorer - interactive browser of text and document covariates.
Jfiveparse
⭐
22
A java html 5 compliant parser
Elasticsearch Analysis Paoding
⭐
22
Paoding Analysis Plugin for ElasticSearch
Mystem Scala
⭐
21
Morphological analyzer `mystem` wrapper for JVM languages
Php Stanford Corenlp Adapter
⭐
20
PHP adapter for Stanford CoreNLP
Compiler
⭐
19
Compiler of YF programming language (YFlang), with GLR parser generator and tokenizer generator.
Javacoffee
⭐
16
☕ Coffeescript-like syntax for writing Java code
Xultimate Searching
⭐
13
使用扩展的通过数据库维护的IKAnalyzer和分布式搜索搜索服务SolrCloud及SolrJ的S
Vespa Kuromoji Linguistics
⭐
12
Tri
⭐
12
Temporal Random Indexing
Lucene Bo
⭐
10
Lucene analyzer for Tibetan
Opennlp
⭐
10
mirror of opennlp.sourceforge.net
Tokenizer Id
⭐
10
Tokenizer untuk Bahasa Indonesia
Tokenreplacer
⭐
10
Token Replacer is a simple and small Java Library that helps replacing tokens in strings. You can replace the tokens with static values or create values "on-the-fly" by calling a generator. You can even pass arguments to the generator which makes it pretty powerful.
Senti Storm
⭐
10
SentiStorm - Real-time Twitter Sentiment Classification based on Apache Storm
Hunlp
⭐
10
Hungarian NLP tools API
Android Multiautocomplete
⭐
9
A lightweight and powerful abstraction over MultiAutoCompleteTextView and Tokenizer
Sctokenizer
⭐
9
A Source Code Tokenizer
Tkt Elasticsearch
⭐
9
elasticsearch plugin of twitter-korean-text for korean analyzer
Molecularlucene
⭐
8
Lucene tokenizer for chemical structures indexing/searching
Elasticsearch Analysis Hashsplitter
⭐
8
A N-Gram tokenizer generating non-overlapping positioned token chunks for partial token search
Dictionary Annotator
⭐
8
Fast and configurable UIMA dictionary annotator.
Jtokenizer
⭐
7
A Java library for splitting text into constituent words. This can be tricky for non-trivial examples, therefore the jTokenizer package was designed to combine a set of tokenizers that range from basic whitespace tokenizers to more complex ones that deal intuitively with natural language.
Elasticsearch Analysis Korean
⭐
7
Elasticsearch Analysis Vi
⭐
6
Elasticsearch Vietnamese Analysis Plugin
Elasticsearch Analysis Ik 5.2.0 Mysql
⭐
6
Mmseg
⭐
6
A java implementation of MMSEG. http://technology.chtsai.org/mmseg/
Actojat
⭐
6
A Cobol to Java Transpiler
Bnfparserjava
⭐
5
Elasticsearch Analysis Extension
⭐
5
Elasticsearch Plugin for Analysis Library
Embulk Filter Kuromoji
⭐
5
Morphological analysis plugin for Embulk.
Jtok
⭐
5
A Java-based configurable tokenizer and sentence splitter
Solr Ik
⭐
5
solr-ik
Map4d Tokenizer Elasticsearch
⭐
5
Map4D tokenizer elasticsearch plugin for Vietnamese language
Related Searches
Java Spring (21,350)
Java Spring Boot (11,982)
Java Video Game (8,093)
Java Gradle (8,072)
Java Docker (6,180)
Java Database (6,015)
Java Mysql (5,954)
Java Sdk (5,864)
Javascript Java (5,468)
Java Rest (4,956)
1-56 of 56 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.