Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for ngrams
ngrams
x
84 search results found
Language Detection
⭐
776
A language detection library for PHP. Detects the language from a given text string.
Ngram2vec
⭐
638
Four word embedding models implemented in Python. Supporting arbitrary context features
Jamspell
⭐
572
Modern spell checking library - accurate, fast, multi-language
Neuspell
⭐
541
NeuSpell: A Neural Spelling Correction Toolkit
Stringmetric
⭐
475
🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
Covid19_twitter
⭐
443
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
Albert_pytorch
⭐
423
A Lite Bert For Self-Supervised Learning Language Representations
Strutil
⭐
245
Golang metrics for calculating string similarity and other string utility functions
Ngram Type
⭐
140
Touch typing trainer using N-grams as data source, with options to customize the auto-generated lessons and specify the minimum typing performance needed. There are sound/color effects as well.
Colibri Core
⭐
122
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Tongrams
⭐
120
A C++ library providing fast language model queries in compressed space.
Refinr
⭐
100
Cluster and merge similar char values: an R implementation of Open Refine clustering algorithms
Language Detector
⭐
96
A fast and reliable PHP library for detecting languages
Ml Classify Text Js
⭐
91
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Daguan_2019_rank9
⭐
85
达观信息提取比赛第九名代码
Wink Nlp Utils
⭐
81
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.
Ngram
⭐
70
Fast n-Gram Tokenization
N Gram
⭐
67
Get n-grams from text
N Grammer Pytorch
⭐
60
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
Stringdistance
⭐
57
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Poesy
⭐
55
Poetry generation via natural language markov models
Turkish Tweets Sentiment Analysis
⭐
52
This sentiment analysis project determines whether the tweets posted in the Turkish language on Twitter are positive or negative.
Suggest
⭐
51
Top-k Approximate String Matching.
Tinyld
⭐
35
Simple and Performant Language detection library for NodeJS
Vietnamese Accent Prediction
⭐
31
A simple/fast/accurate accent prediction for non-accented Vietnamese text
2017 Summer Workshop
⭐
29
Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Tacl
⭐
27
Tool for performing basic text analysis on the CBETA corpus
Typing Assistant
⭐
27
Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Gramify
⭐
26
Create n-grams of wordlists based on words, characters, or charsets to use in offline password attacks and data analysis
Company Ngram
⭐
25
Unsupervised_extract_detect_words
⭐
23
multiprocess unsupervised chinese_detect_words ngram_combination
Ngram Py
⭐
20
Ngrams with Basic Smoothings
Srilm
⭐
20
Mirror of SRILM
Pybo
⭐
19
🦜 NLP for Tibetan, in Python.
Word Prediction Ngram
⭐
18
Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
Trigrams
⭐
18
Trigram files for 400+ languages
Chr
⭐
17
🔤 Lightweight R package for manipulating [string] characters
Calm Textgame
⭐
16
[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games
Tongrams_estimation
⭐
14
A C++ library implementing fast language models estimation using the 1-Sort algorithm.
Tongrams Rs
⭐
14
Rust library providing fast language model queries in compressed space
N Gram
⭐
13
A project of N-gram model comparing FMM/BMM
Evaluation Of Machine Translation By Nlp
⭐
13
To evaluate machine translation, they use several methods, some of which we fully implemented
Language Modeling
⭐
12
Pipeline for training Language Models using PyTorch.
N Gram Language Model
⭐
12
Programming for NLP Project - Implement a basic n-gram language model and generate sentence using beam search
Nlp_fuzzy_match_algorithms
⭐
12
Volcago
⭐
11
Model Generator for Firestore
Google Ngrams And R
⭐
11
An R-based guide to sampling Google n-gram data, building historical term-feature matrices & investigating lexical semantic change historically.
Icegrams
⭐
11
A fast, compact trigram library for Icelandic
Nanosearch
⭐
10
A tiny search engine.
Aind Recognizer
⭐
10
Term 1 Project 3 Design a Sign Language Recognition System by Luke Schoen for Udacity Artificial Intelligence Nanodegree (AIND)
Malware Detection In Pe Files Using Machine Learning
⭐
10
Detecting Malware in PE files
Saotd
⭐
9
Sentiment Analysis of Twitter Data (saotd)
Nlp Course
⭐
8
NLP Course stuff and algorithm implementations
Ngrams.java
⭐
8
🍰 A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.
Seqr
⭐
8
fast and comprehensive k-mer counting package
Nbspell
⭐
8
New spell(1) implementation for NetBSD
Ngrams_graphs
⭐
8
ngram graphs library
Predict4all
⭐
8
Accurate, fast, lightweight, multilingual, free and open-source next word prediction library
Efranc
⭐
7
Detect the language of text
Hawaiian Corpus
⭐
7
Data from a corpus of written Hawaiian
Ngram Word Generator
⭐
7
Word generation based on n-gram models, and a cli utility to generate said models.
Phpngrams
⭐
7
Get N-Grams !
Text Decomtrans
⭐
7
JS/Python3 Lib for text-decomposition, text-transformation, string-transformation, string-decomposition, ngram, other-gram and neighborhood-representation
Google Books Ngram Frequency
⭐
7
Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code
Inflearn New Year Event 2020
⭐
6
기획자와 마케터를 위한 이벤트 댓글 분석 - feat. 인프런 새해 다짐 이벤트
Nltk_model
⭐
6
The NLTK Model Submodule.
Metacurate Lexicon
⭐
6
A web service that exposes semantic similarity search via a web GUI and a RESTful API.
Datemymusic
⭐
6
Predict the composition year of a given MIDI piece - Classical Music Hack Day 2013 @ Vienna. Live at:
Ngram Cpp
⭐
6
Ngrams with Basic Smoothings
Smp Etst 2018
⭐
6
SMP_ETST 2018 christmas
Spider
⭐
6
URL Spider - web crawler and wordlist / ngram generator
Greek Dialect Classifier
⭐
6
Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek
Wordrepresentations
⭐
5
Сравнение нескольких способов представления слов для построения языковых моделей
Chinese Tokenization
⭐
5
利用传统方法(N-gram,HMM等)、神经网络方法(CNN,LSTM等)和预训练方法(Bert等) word segmentation task is realized by using traditional methods (n-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.) and pre training methods (Bert, etc.)】
Storywrangling
⭐
5
Python API for the Storywrangler project
Nlp
⭐
5
Code written as a part of assignments for CSE556 Natural Language Processing taught by Dr. Tanmoy Chakraborty at IIIT Delhi in Monsoon 2018
Language Detector
⭐
5
Package to detect the language of a given text (focusing on short "sms" type text used on tweets, facebook, WhatsApp, etc)
Word_prediction_system_based_on_n Gram
⭐
5
Implementation of language model for parallel n-gram extraction from large text corpora
Bytesteady
⭐
5
A fast classification and tagging tool using byte-level n-gram embeddings
Umlauter
⭐
5
Corrects common German transcriptions using ML
Townsandvillages
⭐
5
🏰 Mapping British place names and other analysis
Ngram Syllables
⭐
5
Syllable counting and detection using an n-gram language model.
Neural Ngram
⭐
5
Neural ngram language model in PyTorch.
General
⭐
5
NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and issues.
1-84 of 84 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.