Awesome Open Source
Search results for python text processing
85 search results found
Fastnlp
⭐
2,940
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Text_classification
⭐
1,621
Text Classification Algorithms: A Survey
Hazm
⭐
1,187
Persian NLP Toolkit
Python_basics
⭐
496
🐍 Syntax, working with Shell commands, Files, Text Processing, and more...
Pynlpl
⭐
466
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP spec
Bsed
⭐
416
Simple SQL-like syntax on top of Perl text processing.
Pykospacing
⭐
348
Automatic Korean word spacing with Python
Wetextprocessing
⭐
338
Text Normalization & Inverse Text Normalization
Artificial Adversary
⭐
317
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Jaconv
⭐
254
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
Stanza Old
⭐
141
Stanford NLP group's shared Python tools.
Nlpre
⭐
135
Python library for Natural Language Preprocessing (NLPre)
Padatious
⭐
132
A neural network intent parser
Support Tickets Classification
⭐
128
This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Colibri Core
⭐
122
Colibri Core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate and query pattern models.
Cogcomp Nlpy
⭐
108
CogComp's light-weight Python NLP annotators
Prenlp
⭐
105
Preprocessing Library for Natural Language Processing
Teanaps
⭐
92
An open-source Python library for natural language processing and text analysis.
Nostril
⭐
91
Nostril: Nonsense String Evaluator
Nlp
⭐
77
Free hands-on course with the implementation (in Python) and description of several Natural Language Processing (NLP) algorithms and techniques, on several modern platforms and libraries.
Soyspacing
⭐
76
A library for correcting Korean word-spacing errors. It corrects spacing with an intuitive approach rather than machine learning algorithms such as CRF.
Cso Classifier
⭐
74
Python library that classifies content from scientific papers with the topics of the Computer Science Ontology (CSO).
Talkwithyourfiles
⭐
70
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
Hands On Python Natural Language Processing
⭐
65
Textcluster
⭐
60
Short-text clustering preprocessing module
Wiki Table Scrape
⭐
59
Scrape tables from Wikipedia articles into CSVs
Konfuzio Sdk
⭐
48
OCR, extract and classify documents. In addition, annotate documents and build your own NLP and Computer Vision models using Python by downloading the data. Find examples in our Colab Notebooks, e.g. how to fine-tune Flair.
Oneai Python
⭐
35
Python SDK for One AI APIs. One AI is an NLP-as-a-service platform. Our APIs enable language comprehension in context, transforming texts from any source into structured data to use in code.
Text Analysis
⭐
32
Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Text Classification Lstms Pytorch
⭐
31
The aim of this repository is to show a baseline model for text classification by implementing an LSTM-based model in PyTorch. To provide a better understanding of the model, a Tweets dataset provided by Kaggle is used.
Normalizer
⭐
28
This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended for normalizing and cleaning Bengali and English text.
Nlp Stuff
⭐
27
A bit of everything about text and nlp [IN PROGRESS]
Python Mecab
⭐
27
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Pnlp
⭐
25
NLP pre-/post-processing tools.
Nlpo3
⭐
21
Thai Natural Language Processing library in Rust, with Python and Node bindings.
Mytwitterbot
⭐
20
A Twitter bot powered by a Recurrent Neural Network (RNN)
Huggingface Datasets Text Quality Analysis
⭐
19
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
Textdatasetcleaner
⭐
18
A configurable pipeline for cleaning junk out of text datasets
Blabla
⭐
18
Novoic's linguistic feature extraction library
Hama Py
⭐
17
🦛 Python library for processing Korean (Hangul). Python Korean Morphological Analyzer
Advanced Text Mining
⭐
16
Covers natural language processing and text analysis methodology using the TEANAPS library.
Nlpiper
⭐
15
NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document.
Text Preprocess Python
⭐
15
Text preprocessing tools in Python.
Odin Ai
⭐
15
Organized Digital Intelligent Network (O.D.I.N)
Emotion Recognition From Tweets
⭐
14
A comprehensive approach to recognizing emotion (sentiment) in a given tweet, using supervised machine learning.
Humanreadable
⭐
14
humanreadable is a Python library to convert human-readable values to other units.
Text Mining For Beginner
⭐
14
Covers everything from basic Python syntax to performing simple text analysis.
Lara Hungarian Nlp
⭐
14
NLP class for rapid ChatBot development in the Hungarian language
Embeddings
⭐
12
zero-vocab or low-vocab embeddings
Text Mining For Practice
⭐
12
Covers how to perform text analysis using Python libraries.
Dnlp
⭐
12
📚 A collection of useful Natural Language Processing utilities: detecting the language of a text, splitting text into sentences, and extracting the main content from an HTML document
Text Analysis
⭐
12
Weaving analytical stories from text data
Auto Corpus
⭐
12
Auto-CORPus pipeline developed by a University of Leicester and Imperial College London collaboration to standardize text and table data extracted from full text publications. See Open Access publication at: https://doi.org/10.3389/fdgth.2022.788124.
Pawpaw
⭐
11
Text Processing & Segmentation Framework
Html5lib Truncation
⭐
11
Truncating HTML with html5lib filter
Rutranscript
⭐
10
Russian phonetic transcription
Decepticonlp
⭐
10
Python Library for Robustness Monitoring and Adversarial Debugging of NLP models
Dl_text_classification
⭐
10
Collection of Deep Learning Text Classification Models in Keras; Includes a GPU tutorial.
Simplechinese
⭐
9
This package integrates many basic Chinese NLP functions, making Python-based Chinese word processing and information extraction simple and convenient.
Identification Of Imminent Suicide Risk Among Young Adults Using Text Messages
⭐
9
Wordcloud In Plotly
⭐
9
Interactive wordcloud
Russian_subtitles_dataset
⭐
9
Preprocessing of a dataset of 347 TV series subtitles (thanks to Taiga Corpus) for building a word2vec model or a JamSpell model, training neural networks or chatbots, or any other NLP task.
Supersummarizeai
⭐
8
Unleash the power of AI with SuperSummarizeAI! Effortlessly extract, condense, and clip content from webpages and YouTube videos using ChatGPT, turning endless streams of content into digestible summaries.
Flashtext2
⭐
8
FlashText rewritten from scratch (much better)
Pyutil
⭐
8
Some useful Python modules/scripts.
Betterletter
⭐
7
Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, ß).
Predict Blog Author Features
⭐
7
Predicts gender, age, label, and zodiac sign of the writer from the given text.
Rmalt
⭐
7
the malt language implemented by rbnf. https://github.com/malt-project/cmalt
Corpus_text_processor
⭐
6
A desktop application for preparing files for use in a corpus
Fda Ars
⭐
6
Tools for text processing of structured product labels
Textcl
⭐
6
Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/
Diagnosisextraction_ml
⭐
6
Pipeline for building Machine Learning Classifiers for the diagnosis of EHR text-data. We used this pipeline for our study, published here: https://doi.org/10.2196/23930.
Benchmark_minhash_lsh
⭐
6
Insight Data Engineering fellow project
Expert
⭐
5
Assessing the validity of expert opinion based on the analysis of video content
Pyregex
⭐
5
Backtracking regular expression engine written in Python
Linesieve
⭐
5
An unholy blend of grep, sed, awk, and Python.
Text Proc
⭐
5
Scripts for Text Processing
Simple Nlp Projects
⭐
5
Collection of all ML/NLP programs. Created Date: 21 Aug 2017
Predict_bitcoin_using_reddit_sentiment
⭐
5
Sentiment analysis of Reddit comments to predict bitcoin price movement