Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for computational linguistics
computational-linguistics
x
182 search results found
Pke
⭐
1,431
Python Keyphrase Extraction module
Arguman.org
⭐
1,349
Argument mapping and analysis platform
Nlp With Ruby
⭐
1,002
Curated List: Practical Natural Language Processing done in Ruby
Pywsd
⭐
704
Python Implementations of Word Sense Disambiguation (WSD) Technologies.
Pynlpl
⭐
466
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP spec
Kefir
⭐
445
🥛turkic morphology project
Nlp Papers With Arxiv
⭐
363
Statistics and accepted paper list of NLP conferences with arXiv link
German Nlp
⭐
360
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
Rulm
⭐
341
Language modeling and instruction tuning for Russian
Awesome Linguistics
⭐
335
A curated list of anything remotely related to linguistics
Kartaslov
⭐
309
Открытые лингвистические датасеты: тональный словарь русского языка, датасет по семантике, ассоциативный граф и датасет по орфографическим ошибкам и опечаткам.
Acl Anthology
⭐
304
Data and software for building the ACL Anthology.
Pycantonese
⭐
290
Cantonese Linguistics and NLP
Nlp Conference Compendium
⭐
285
Compendium of the resources available from top NLP conferences.
Wikipron
⭐
256
Massively multilingual pronunciation mining
Scisumm Corpus
⭐
208
Scientific Document Summarization Corpus and Annotations from the WING NUS group.
Bllip Parser
⭐
207
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/bllipparser/ for Python module.
Aclpub
⭐
199
The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).
Awesome Hungarian Nlp
⭐
192
A curated list of NLP resources for Hungarian
Tgen
⭐
190
Statistical NLG for spoken dialogue systems
Acl Papers
⭐
178
paper summary of Association for Computational Linguistics
Datastories Semeval2017 Task4
⭐
171
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Thuctc
⭐
167
An Efficient Chinese Text Classifier
Awesome Computational Neuroscience
⭐
166
A list of schools and researchers in computational neuroscience
Compling_nlp_hse_course
⭐
157
Материалы курса по компьютерной лингвистике Школы Лингвистики НИУ ВШЭ
Elpis
⭐
137
🙊 software for creating speech recognition models.
Openwordnet Pt
⭐
128
OpenWordnet-PT: an open access wordnet for Portuguese
Colibri Core
⭐
122
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Flat
⭐
105
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
Jfleg
⭐
105
JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation
Amr Tutorial
⭐
100
Abstract Meaning Representation (AMR) tutorial slides
Robotreviewer
⭐
97
Automatic synthesis of RCTs
Python Tutorial Notebooks
⭐
97
Python tutorials as Jupyter Notebooks for NLP, ML, AI
Datalinguist
⭐
87
Stanford CoreNLP in idiomatic Clojure.
Ruts
⭐
85
Библиотека для извлечения статистик из текстов на русском языке.
Camr
⭐
85
Transition-based tree-to-graph AMR Parser
Frog
⭐
73
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Lamachine
⭐
66
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
Vecto
⭐
60
Doing things with embeddings
Folia
⭐
60
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl, this contains higher-level tools that use the library as well as the full documentation, validation schemas,
Ucto
⭐
60
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-s
Natural Language Processing And Computational Linguistics
⭐
55
Natural Language Processing and Computational Linguistics, published by Packt
Emnlp 2023 Papers
⭐
54
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. ⭐ support NLP!
Sylbreak
⭐
52
Syllable segmentation tool for Myanmar language (Burmese) by Ye.
Python_nlp_tutorial
⭐
47
This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Segmentation.evaluation
⭐
47
SegEval Segmentation Evaluation Package
Sentiment Analysis Of Tweets In Russian
⭐
46
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.
Botok
⭐
43
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
Phonemes
⭐
42
Jason Riggle's chart of phonological features in JSON format + extras
Scalabha
⭐
42
Scala utilities for teaching computational linguistics and prototyping algorithms.
Piccl
⭐
40
A set of workflows for corpus building through OCR, post-correction and normalisation
Takahe
⭐
40
takahe is a multi-sentence compression module
Negation Detection
⭐
38
Negation detection NLP tool. If you use the code, please cite George Gkotsis, Sumithra Velupillai, Anika Oellrich, Harry Dean, Maria Liakata and Rina Dutta. Don't Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records, Computational Linguistics and Clinical Psychology 2016
Yap
⭐
37
Yet Another (natural language) Parser
Pylangacq
⭐
36
Language Acquisition Research Tools
Word2vec Tsne
⭐
35
Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
Amr Bibliography
⭐
34
Organized inventory of research using the Abstract Meaning Representation
Java Probabilistic Earley Parser
⭐
33
🎲 Efficient Java implementation of the probabilistic Earley algorithm to parse Stochastic Context Free Grammars (SCFGs)
Cistem
⭐
33
Stemmer for German
Gec Reading List
⭐
32
A grammatical error correction reading list maintained by BLCU ICALL Research Group
Python Arpa
⭐
32
🐍 Python library for n-gram models in ARPA format
Mingpipe
⭐
30
A Chinese name matcher written in Python. Describe in: Nanyun Peng, Mo Yu, Mark Dredze. An Empirical Study of Chinese Name Matching and Applications. Association for Computational Linguistics (ACL) (short paper), 2015.
C2xg
⭐
29
A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars
Python Ucto
⭐
29
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
Chinese_ner_with_attention
⭐
27
Calamancy
⭐
26
NLP pipelines for Tagalog using spaCy
Docs
⭐
26
DELPH-IN Documentation
Embedding_evaluation
⭐
25
Evaluate your word embeddings
Elixir Nlp
⭐
25
A (hopefully helpful) collection of resources for Elixir NLP devs
Sentiment Analysis Workshop
⭐
24
Deep Learning for Language Workshop prepared for the AI For Social Good Summer Lab, 2018
Sentiment Analysis Imdb
⭐
23
Example project of sentiment analysis using LSTM NN on IMDB reviews database
Edas
⭐
22
Emotional Dialogue Acts corpus contains dialogue act labels for the multimodal conversational emotion datasets IEMOCAP and MELD. https://www.aclweb.org/anthology/2020.lrec-1.78/
Lxa5
⭐
22
Linguistica 5: Unsupervised Learning of Linguistic Structure
Charscnn Theano
⭐
22
implementation of CharSCNN and SCNN.
Linguistics_problems
⭐
22
Natural language processing in examples and games
Sentimentanalysis
⭐
22
Sentiment Analysis: Deep Bi-LSTM+attention model
Praaline
⭐
22
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
Pytorch Rnng
⭐
21
Mystem Scala
⭐
21
Morphological analyzer `mystem` wrapper for JVM languages
Streaming_lsh
⭐
21
A project for clustering text streams using locality-sensitive hashing (LSH) in Python
Go Trie
⭐
21
Trie implementation based on a minimal automaton for Go
Neural Abstract Anaphora
⭐
20
A Mention-Ranking Model for Abstract Anaphora Resolution
Hades
⭐
20
Repository for the CLiPS HAte speech DEtection System [HADES].
Punkt
⭐
19
Unsupervised multilingual sentence segmentation.
Cg3
⭐
19
Tools for the 3rd edition of the Constraint Grammar formalism.
Angel
⭐
19
An Ancient Greek Morphology Tagger
Wlapi
⭐
19
Ruby based API for the project Wortschatz Leipzig.
Acl22 Identifying The Human Values Behind Arguments
⭐
19
Machine Learning scripts for the identification of human values behind arguments.
Pybo
⭐
19
🦜 NLP for Tibetan, in Python.
Datastories Semeval2017 Task6
⭐
19
Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Citation Function
⭐
18
Measuring the Evolution of a Scientific Field through Citation Frames
Blabla
⭐
18
Novoic's linguistic feature extraction library
Textexpansion
⭐
17
Nytwit
⭐
16
New York Times Word Innovation Types dataset
Gec Reading List
⭐
15
A grammatical error correction reading list maintained by Beijing Language and Culture University Natural Language Processing Group
Babyberta
⭐
15
Source code for CoNLL 2021 paper by Huebner et al. 2021
Abuseeval
⭐
15
Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"
Arabicprocessingcog
⭐
15
A Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Uncertainty
⭐
14
A Python implementation of the uncertainty classifier, based on the work of Veronika Vincze.
Foliapy
⭐
14
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.
1-100 of 182 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.