Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for text processing
text-processing
x
341 search results found
Syn
⭐
51
syn - the thesaurus
Konfuzio Sdk
⭐
48
OCR, extract and classify documents. In addition, annotate documents and build your own NLP and Computer Vision models using Python by downloading the data. Find examples in our Colab Notebooks, e. g. how to fine-tune Flair.
Vss
⭐
48
High level string and text processing library
Deduce
⭐
47
Deduce: de-identification method for Dutch medical text
Compare Userjs
⭐
47
PowerShell script for comparing user.js (or prefs.js) files.
Hr
⭐
46
Easy Access to Uppercase H
Awesome Legal Data
⭐
45
Collection of Datasets for Legal Text Processing
Kas Text
⭐
45
Rich text processing
Dragonmapper
⭐
43
Identification and conversion functions for Chinese text processing
Sciteco
⭐
43
Advanced TECO dialect and interactive screen editor based on Scintilla
Aho Corasick
⭐
42
efficient string matching in Golang via the aho-corasick algorithm.
Edit Distance
⭐
40
Levenshtein edit distance in Rust
Suffixtree
⭐
39
Optimized implementation of suffix tree in python using Ukkonen's algorithm.
Sova Tts Tps
⭐
38
NLP-preprocessor for the SOVA-TTS project
Rew
⭐
38
A text processing CLI tool that rewrites FS paths according to a pattern.
Contexto
⭐
38
Librería en Python para minería de texto y NLP
Bertify
⭐
37
An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.
Texttinyr
⭐
37
Text Processing for Small or Big Data Files in R
Ruby_scripting
⭐
37
examples based tutorial for Ruby scripting
Neuthink
⭐
36
Neural network library for text processing in F#
Pwsh Prelude
⭐
35
PowerShell “standard” library for supercharging your productivity. Provides a powerful cross-platform scripting environment enabling efficient analysis and sustainable science in myriad contexts.
Speech2affective_gestures
⭐
35
This is the official implementation of the paper "Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning".
Tif
⭐
35
Text Interchange Formats
Oneai Python
⭐
35
Python SDK for One AI APIs. One AI is an NLP-as-a-service platform. Our APIs enables language comprehension in context, transforming texts from any source into structured data to use in code.
Applied Text Mining In Python
⭐
34
Repo for Applied Text Mining in Python (coursera) by University of Michigan
Text
⭐
34
Qiniu Text Processing Libraries for Go
Hackerrank The Linux Shell Challenges Solutions
⭐
33
Complete Solutions and related tutorials for the Linux Shell - Bash, text processing, Arrays in Bash, Grep Sed Awk Challenges on HackerRank
Pyline
⭐
33
Pyline is a grep-like, sed-like, awk-like command-line tool for line-based text processing in Python. https://pypi.python.org/pypi/pyline
Text Analysis
⭐
32
Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
S3 Utils
⭐
32
Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI
Natural Language Processing Nlp Roadmap
⭐
32
A simple RoadMap to Natural Language Processing(NLP)
Text Summarization And Visualization Using Watson Studio
⭐
31
Can we quickly summarize & visualize text to get the details about the unstructured data? Yes we can! Please review this code pattern for all the steps involved to quickly summarize & visualize the data.
Text Classification Lstms Pytorch
⭐
31
The aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Learning Awk Programming
⭐
31
Learning AWK Programming, published by Packt
S3 Concat
⭐
29
Concatenate Amazon S3 files remotely using flexible patterns
Textstat
⭐
29
Ruby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.
Python Ucto
⭐
29
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
Fxt
⭐
28
A large scale feature extraction tool for text-based machine learning
G
⭐
28
g: A portable general purpose programmable text editor with calculator and macro facility.
Normalizer
⭐
28
This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.
Wactor
⭐
28
Word Factor Vectors
Gnu Linux Shell Scripting
⭐
28
A foundation for GNU/Linux shell scripting
Nlp Tools
⭐
28
Useful python NLP tools (evaluation, GUI interface, tokenization)
Cinje
⭐
27
A Pythonic and ultra fast template engine DSL.
Markover
⭐
27
Natural Language Generation with Markov
Nlp Stuff
⭐
27
A bit of everything about text and nlp [IN PROGRESS]
Python Mecab
⭐
27
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Gomtch
⭐
26
Find text even if it doesn't want to be found
Voice_chatbot
⭐
26
Chatbot in russian with speech recognition using PocketSphinx and speech synthesis using RHVoice. The AttentionSeq2Seq model is used. Imlemented using Python3+TensorFlow+Keras.
Hashedindex
⭐
25
Python package providing an Inverted Index implementation using dictionaries
Stringx
⭐
25
Drop-in replacements for base R string functions powered by stringi
Find_job_titles
⭐
25
find any kind of occupation or job title in a text or file
Pnlp
⭐
25
NLP预/后处理工具。
Frej
⭐
24
Fuzzy Regular Expressions for Java
Phonics In R
⭐
23
Phonetic Spelling Algorithms in R
Twitter Text Python
⭐
23
Twitter Text Libraries for Python
Typ3r.js
⭐
22
🍟 [Library] dA aNn0Y1Ng t3Xt g3NeRa7or
Cassandre
⭐
22
Diary for qualitative analysis
Wikiwho
⭐
22
An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia.
Chatbot Indonesia
⭐
21
Kumpulan data yang akan digunakan untuk keperluan chatbot bahasa Indonesia dengan kode chatbot sederhana menggunakan Typescript
Nlpo3
⭐
21
Thai Natural Language Processing library in Rust, with Python and Node bindings.
Atarashi
⭐
21
Atarashi scans for license statements in open source software, focusing on text statistics. Designed to work stand-alone and with FOSSology.
Fasttextr
⭐
21
Efficient learning of word representations
Mytwitterbot
⭐
20
A Twitter bot powered by a Recurrent Neural Network (RNN)
Andaluh Js
⭐
20
Transliterate español (spanish) spelling to andaluz proposals using javascript
Nlcli
⭐
20
Natural language interface for the command line.
Data Science From Scratch
⭐
20
Code Companion to Joel Grus' book
Huggingface Datasets Text Quality Analysis
⭐
19
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
Predictionary
⭐
19
A learning JavaScript dictionary-based word prediction / autocomplete / suggestion library.
Refiner
⭐
19
Refiner improves your writing by correcting grammar and style, adjusting tone, and offering formatting options. It is useful for non-native speakers and professionals who communicate with text.
Kts_linguistics
⭐
18
Spellcheck, phonetics, text processing and more
Fxt
⭐
18
A large scale feature extraction tool for text-based machine learning
Blabla
⭐
18
Novoic's linguistic feature extraction library
Knime Textprocessing
⭐
18
KNIME - Text Processing Extension (Labs)
Textdatasetcleaner
⭐
18
Настраиваемый пайплайн для очистки текстовых датасетов от мусора
Corpusexplorer2.0
⭐
17
Korpuslinguistik war noch nie so einfach...
Dif
⭐
17
'dif' is a Linux preprocessing front end to gvimdiff/meld/kompare
Chr
⭐
17
🔤 Lightweight R package for manipulating [string] characters
Rake Rs
⭐
17
Multilingual implementation of RAKE algorithm for Rust
Concise Ipython Notebooks For Deep Learning
⭐
17
Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Hama Py
⭐
17
🦛 파이썬 한글 처리 라이브러리. Python Korean Morphological Analyzer
Gohn
⭐
16
Hatena Notation (はてな記法) Parser written in Go
Advanced Text Mining
⭐
16
TEANAPS 라이브러리를 활용한 자연어 처리와 텍스트 분석 방법론에 대해 다룹니다.
Linux_shell
⭐
15
Unix-like Operating Systems. Linux. Bash & Z shell. C. Synchronization Problems & Theory.
Odin Ai
⭐
15
Orgainzed Digital Intelligent Network (O.D.I.N)
Andaluh Py
⭐
15
Transliterate español (spanish) spelling to andaluz proposals using python
Text2video
⭐
15
Text to Video Generation Problem
Stringanalysis.jl
⭐
15
Hard-Forked from JuliaText/TextAnalysis.jl
Text Preprocess Python
⭐
15
Text preprocessing tools in python.
Greek Normalisation
⭐
15
utilities for validating and normalising Ancient Greek text
Nlpiper
⭐
15
NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document.
Arabicprocessingcog
⭐
15
A Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Text Mining For Beginner
⭐
14
파이썬 기초문법 부터 간단한 텍스트 분석을 수행하는 방법에 대해 다룹니다.
Untanglr
⭐
14
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
Emotion Recognition From Tweets
⭐
14
A comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.
Lara Hungarian Nlp
⭐
14
NLP class for rapid ChatBot development in Hungarian language
Trunajod2.0
⭐
14
An easy-to-use library to extract indices from texts.
Pascoale
⭐
14
Minor utilities for text processing Brazilian Portuguese.
Textstelle
⭐
14
Textstelle is a collection of corpora for the creation of bots and other things that generate text 🤖
Markdown For Mantisbt
⭐
14
It's help convert some Markdown to html-style.
101-200 of 341 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.