Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for c plus plus natural language processing
c-plus-plus
x
natural-language-processing
x
50 search results found
Ciphey
⭐
16,681
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Sentencepiece
⭐
8,851
Unsupervised text tokenizer for Neural Network-based text generation.
It_book
⭐
8,543
本项目收藏这些年来看过或者听过的一些不错的常用的上千本书籍,没准你想找的书就在这里呢,包含了互联网行
Openvino
⭐
5,316
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Ai Job Notes
⭐
4,419
AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)
Nlp_paper_study
⭐
3,373
该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记
Mitie
⭐
2,778
MITIE: library and tools for information extraction
Familia
⭐
2,432
A Toolkit for Industrial Topic Modeling
Chatglm.cpp
⭐
2,215
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
Kcws
⭐
2,044
Deep Learning Chinese Word Segment
Hunspell
⭐
1,912
The most popular spellchecking library.
Sling
⭐
1,873
SLING - A natural language frame semantics parser
Nlper Interview
⭐
1,746
该仓库主要记录 NLP 算法工程师相关的面试题
Turbotransformers
⭐
1,322
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
Bolt
⭐
823
Bolt is a deep learning library with high performance and heterogeneous flexibility.
Meta
⭐
660
A Modern C++ Data Sciences Toolkit
Ifopt
⭐
634
An Eigen-based, light-weight C++ Interface to Nonlinear Programming Solvers (Ipopt, Snopt)
Algorithm_interview_notes Chinese
⭐
603
2018/2019/校招/春招/秋招/自然语言处理(NLP)/深度学习(Deep Learning)/机器学习(Machine Learning)/C/C++/Python/面试笔记,此外,还包括创建者看到的所有机器学习/深度学 除了其中 DL/ML 相关的,其他与算法岗相关的计算机知识也会记录。 但是不会包括如前端/测试/JAVA/Android等岗位中有关的问题。
Jamspell
⭐
572
Modern spell checking library - accurate, fast, multi-language
Tomotopy
⭐
519
Python package of Tomoto, the Topic Modeling Tool
Matterport3dsimulator
⭐
414
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
Clause
⭐
390
🏇 聊天机器人,自然语言理解,语义理解
Fugashi
⭐
339
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Jumanpp
⭐
334
Juman++ (a Morphological Analyzer Toolkit)
Kiwi
⭐
330
Kiwi(지능형 한국어 형태소 분석기)
Jiebar
⭐
277
Chinese text segmentation with R. R语言中文分词 (文档已更新 🎉 :https://qinwenfeng.com/jiebaR/ )
Caiss
⭐
261
跨平台/多语言的 相似向量/相似词/相似句 高性能检索引擎。欢迎star & fork。Build together! Power another !
Tokenizer
⭐
224
Fast and customizable text tokenization library with BPE and SentencePiece support
Node Postal
⭐
211
NodeJS bindings to libpostal for fast international address parsing/normalization
Hiop
⭐
202
HPC solver for nonlinear optimization problems
Udpipe
⭐
198
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Nuspell
⭐
193
🖋️ Fast and safe spellchecking C++ library
Radish
⭐
146
C++ model train&inference framework
Sling
⭐
143
SLING - A natural language frame semantics parser
Colibri Core
⭐
122
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Finbert
⭐
101
BERT for Finance : UC Berkeley MIDS w266 Final Project
Fastrtext
⭐
97
R wrapper for fastText
Ruimtehol
⭐
95
R package to Embed All the Things! using StarSpace
Lima
⭐
92
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Tgcontest
⭐
87
Telegram Data Clustering contest solution by Mindful Squirrel
Btm
⭐
83
Biterm Topic Modelling for Short Text with R
Spacy Cpp
⭐
83
C++ wrapper library for the NLP library spaCy
Word2vec
⭐
81
word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from the scratch
Frog
⭐
73
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
N3lp
⭐
69
C++ implementation for Neural Network-based NLP, such as LSTM machine translation!
Ucto
⭐
60
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-s
Word2vec
⭐
58
Distributed Representations of Words using word2vec
Joint Lstm Parser
⭐
57
Transition-based joint syntactic dependency parser and semantic role labeler using a stack LSTM RNN architecture.
Grammarengine
⭐
51
Грамматический Словарь Русского Языка (+ английский, японский, etc)
Vnla
⭐
51
Code accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155
Iknow
⭐
48
Community development repository for iKnow
Oleanderstemminglibrary
⭐
43
Porter stemming library (C++)
Pyeunjeon
⭐
39
은전한닢 프로젝트와 mecab 기반의 한국어 형태소 분석기의 독립형 python 인터페이스
Voice Assistant
⭐
39
重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我 AI 打工人。
Books
⭐
39
整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、Scala 、TensorFlow 、大数据 、推荐系统、数据库、数据挖掘 、机器学习 、深度学习 、算法等。
Generalised Brown
⭐
39
C++ implementation of Generalised Brown clustering and python scripts for feature generation
Algorithm_interview_notes Chinese Master
⭐
36
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
Cistem
⭐
33
Stemmer for German
Tinygpt
⭐
30
Tiny C++11 GPT-2 inference implementation from scratch
Metaltranslate
⭐
27
Customizable machine translation in C++
Sentencepiece Jni
⭐
27
Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.
Newstudents
⭐
26
For the new students who just join a NLP group
Doc2vec
⭐
26
Distributed Representations of Sentences and Documents
Generate
⭐
25
Generate networks from syntax (e.g. natural language, math proofs, action plans, biome/reactome nets)
Hacktober Fest 2021
⭐
24
📜This repository is created to welcome all the open source enthusiasts to get introduced to beginner friendly projects they could work with in the festive season of HacktoberFest 2021🎇🙌.
Parser
⭐
24
Semantic parser induction using a generative model of grammar.
Shifted Label Distribution
⭐
23
Source code for paper "Looking Beyond Label Noise: Shifted Label Distribution Matters in Distantly Supervised Relation Extraction" (EMNLP 2019)
Rcppmecab
⭐
22
RcppMeCab: Rcpp Interface of CJK Morpheme Analyzer MeCab
Sentencepiece
⭐
21
R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece
Algo Trading
⭐
21
This is my github repository where I post trading strategies, tutorials and research on quantitative finance with R, C++ and Python. Some of the topics explored include: machine learning, high frequency trading, NLP, technical analysis and more. Hope you enjoy it!
Unitex Core
⭐
21
Unitex/GramLab C++ Core
Easynlp
⭐
20
Do NLP without coding! Simple NLP framework.
Cg3
⭐
19
Tools for the 3rd edition of the Constraint Grammar formalism.
Pullword
⭐
19
An R package for pullword.com
Nlu
⭐
18
This repo contains every ML/NLU related code written by Botpress in the NodeJS environment. This includes the Botpress Standalone NLU Server.
Tscan
⭐
18
T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf
Statnlp Framework
⭐
17
C++ based implementation of StatNLP framework
Clab Autodiff Examples
⭐
16
Examples of using the adept autodifferentiation library for standard NLP learning problems
Unsupervised Pos Tagging
⭐
15
教師なし品詞タグ推定
Natlang
⭐
15
NatLang is an English parser with an extensible grammar
Grammar
⭐
14
Implementation of generative semantic grammar.
Libfolia
⭐
14
FoLiA library for C++
Alegre
⭐
14
A text and media analysis service for Meedan Check, a collaborative media annotation platform
Esapp
⭐
12
An unsupervised Chinese word segmentation tool.
Shorttext Fasttext
⭐
12
ShortText classification
Flexfringe
⭐
11
The FlexFringe tool for flexible learning of state machines (deterministic automata) from traces. See the paper at https://arxiv.org/abs/2203.16331
Chronogram
⭐
10
Diachronic Word Embedding Model based on Word2vec Skip-gram with Chebyshev approximation
Elvex
⭐
10
A Natural Language Generation System
Parse English
⭐
10
parse-english is a minimum viable English parser implemented in LexYacc
Mbt
⭐
10
MBT: Memory-based tagger generation and tagging MBT is a memory-based tagger-generator and tagger in one.
Flare
⭐
10
A Fast, Header-Only C++ Neural Network Library
Fast Text
⭐
10
Prediction and nearest neighbour tools from Facebook Fast Text wrapped into Node.js packages.
Interview
⭐
10
面试常见知识点整理
Python Npycrf
⭐
10
条件付確率場とベイズ階層言語モデルの統合による半教師あり形態素解析
Parser
⭐
10
한국어 문장 분석 시스템 BCD-KL-Parser
Nlp Engine
⭐
9
NLP engine code
Ue Chatgpt
⭐
9
基于OPENAI辅助UE开发,接入openai-api,使用DALL.E自动生成模型,Chat-GP
Boosting Tree Tokenizer
⭐
9
Gradient Boosting Dicision Tree(LightGBM)を用い、教師ありで自然言語の分かちと形態素の推定を学習&予想します。名称
Udon2
⭐
8
A package for manipulating Universal Dependencies trees
Vgram
⭐
7
Feature extraction from sequential data
Related Searches
C Plus Plus Cmake (8,712)
C Plus Plus Qt (8,557)
C Plus Plus Video Game (8,255)
Python C Plus Plus (6,819)
C Plus Plus Algorithms (6,194)
C Plus Plus Opengl (4,396)
C Plus Plus 3d Graphics (3,196)
C Plus Plus Testing (2,735)
Java C Plus Plus (2,629)
C Plus Plus Command Line (2,304)
1-50 of 50 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.