Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for corpus text analysis
corpus
x
text-analysis
x
23 search results found
Quanteda
⭐
824
An R package for the Quantitative Analysis of Textual Data
Atap
⭐
367
Code for Applied Text Analysis with Python
Pykospacing
⭐
348
Automatic Korean word spacing with Python
Quanteda.dictionaries
⭐
64
Dictionaries for text analysis
Corporaexplorer
⭐
62
An R package for dynamic exploration of text collections
Kospacing
⭐
61
Automatic Korean word spacing with R
Lsx
⭐
52
Semi-supervised algorithm for document scaling
Rainette
⭐
48
R implementation of the Reinert text clustering method
Corpus Db
⭐
38
A textual corpus database for the digital humanities.
Align Linguistic Alignment
⭐
33
Python library for extracting quantitative, reproducible metrics of multi-level alignment between two speakers in naturalistic language corpora.
Tacl
⭐
27
Tool for performing basic text analysis on the CBETA corpus
Corpustools
⭐
26
An R corpus class for tokenized texts
Workshop Ijta
⭐
24
Rによる日本語テキスト分析入門
Cmdist
⭐
22
DEPRECATED - The Concept Mover's Distance Method is now available in the text2map package. Concept Mover's Distance measure a document's conceptual engagement using word embeddings.
Rectr
⭐
19
💒 Reproducible Extraction of Cross-lingual Topics using R
Code Corpora
⭐
18
Source code corpora for text analysis
Quanteda.corpora
⭐
15
A collection of corpora for quanteda
Legalst 190
⭐
11
Data, Prediction, and Law
Jgtextrank
⭐
10
jgtextrank: Yet another Python implementation of TextRank
Twittr
⭐
10
R Shiny app for tweet analysis
Russian_subtitles_dataset
⭐
9
Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neural network training, chat bot training or in any other NLP task.
Fromzeroton
⭐
7
Materials for From Zero To N (Grams): A Very Basic Bootcamp for Text Analysis in Python
Late Style Pca
⭐
7
An attempt to experimentally test Edward Said's claims about late style using computational text analysis and principal component analysis.
Edu Text Analysis Experiments
⭐
5
Statistical text analysis and semantic networks with Python
Dsaraby
⭐
5
We've created a library named "DSAraby" that aims to transliterate text which write a word using the closest corresponding letters of a different alphabet or language. The algorithm gives the possible words in Arabic based on a given word in Latin by mapping Latin letters to Arabic ones, then takes the most frequent word existing in a corpus.
Thaitoxicitytweetcorpus
⭐
5
Related Searches
Python Corpus (2,447)
Natural Language Processing Corpus (510)
Jupyter Notebook Corpus (471)
Dataset Corpus (342)
Java Corpus (308)
Language Corpus (261)
1-23 of 23 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.