Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for natural language processing preprocessing
natural-language-processing
x
preprocessing
x
25 search results found
Unstructured
⭐
4,404
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Texthero
⭐
2,773
Text preprocessing, representation and visualization from zero to hero.
Jionlp
⭐
2,724
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
Contextualized Topic Models
⭐
1,141
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Nlp In Practice
⭐
861
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Contextualspellcheck
⭐
365
✔️Contextual word checker for better suggestions
Neologdn
⭐
259
Japanese text normalizer for mecab-neologd
Dilated Cnn Ner
⭐
214
Dilated CNNs for NER in TensorFlow
Character Based Cnn
⭐
209
Implementation of character based convolutional neural network
Headliner
⭐
197
🏖 Easy training and deployment of seq2seq models.
Tmtoolkit
⭐
191
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
Ktext
⭐
180
Utilities for preprocessing text for deep learning with Keras
Nlpre
⭐
135
Python library for Natural Language Preprocessing (NLPre)
Question Answering
⭐
128
TensorFlow implementation of Match-LSTM and Answer pointer for the popular SQuAD dataset.
Chariot
⭐
121
Deliver the ready-to-train data to your NLP model.
Treebankpreprocessing
⭐
106
Python scripts preprocessing Penn Treebank and Chinese Treebank
Neattext
⭐
63
NeatText a simple NLP package for cleaning textual data and text preprocessing
Copycat Abstractive Opinion Summarizer
⭐
62
ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generation
Podium
⭐
50
Podium: a framework agnostic Python NLP library for data loading and preprocessing
Itu Turkish Nlp Pipeline Caller
⭐
44
A Python3 wrapper tool to help using ITU Turkish NLP Pipeline API -- UNMAINTAINED --
Nlp Stock Prediction
⭐
34
Textgo
⭐
33
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
Gensimr
⭐
31
📝 Topic Modeling for Humans
Bogotobogo Machine Learning
⭐
30
Code repository - Jupyter notebook
Text Preprocessing
⭐
30
A python package for text preprocessing task in natural language processing.
Nlp Flask Website
⭐
30
A simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 Jan 2019
Indic Num2words
⭐
29
Python library for converting numbers to words for all Indian Languages.
Smashed
⭐
26
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.
Pnlp
⭐
25
NLP预/后处理工具。
Codeprep
⭐
24
A toolkit for pre-processing large source code corpora
Most Powerful Nlp Library
⭐
21
Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it can solve almost any NLP task you want to tackle.
Figet Hyperbolic Space
⭐
20
Code for the paper "Fine-Grained Entity Typing in Hyperbolic Space"
Rouge
⭐
20
An implementation of ROUGE family metrics for automatic summarization.
Portuguese Nlp
⭐
18
Nlp work on Brazil Portuguese newswire text
Textdatasetcleaner
⭐
18
Настраиваемый пайплайн для очистки текстовых датасетов от мусора
Nlstruct
⭐
17
Natural language structuring library
Gec Pseudodata
⭐
15
Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Text Preprocess Python
⭐
15
Text preprocessing tools in python.
Texttk
⭐
15
Text Preprocessing in Python
Nlpiper
⭐
15
NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document.
Preprocess Conll05
⭐
14
Scripts for preprocessing the CoNLL-2005 SRL dataset.
Twitter Sentiment Analysis Nlp Hackathon
⭐
12
Problem Statement: Given the tweets from customers about various tech firms who manufacture and sell mobiles, computers, laptops, etc, the task is to identify if the tweets have a negative sentiment towards such companies or products.
Text Normalizer
⭐
12
Normalize text string
Basic Text Preprocessing
⭐
11
Basic text preprocessing for Bahasa with Python.
Nlp Preprocess Tools
⭐
11
This is a curated list of samples on NLP preprocessing. You are welcome to make a pull request to contribute!
Ipa
⭐
10
NLP Preprocessing Pipeline Wrappers
Forecasting Us Elections
⭐
10
Extraction of tweets and Perform sentiment analysis on the presidential candidature of Donald Trump, Joe Biden and Kanye West in the upcoming elections in US in November, 2020.
Tolkein_text
⭐
10
Neural Network Language Model that generates text based off Lord of the Rings. Built with Pytorch.
Msaic_2018
⭐
10
My solution to Microsoft AI Challenge 2018
Entity_knowledge_in_bert
⭐
10
This repository contains the code for the CONLL 2019 paper "Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking". The code is provided as a documentation for the paper and also for follow-up research.
Parasite
⭐
9
🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 Biomedical Translation Task.
Named Entity Recognition
⭐
8
Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.
Opiec Pipeline
⭐
7
Ea Associate Ds
⭐
6
Electronic Arts (EA) NLP Assignment for: Associate Data Scientist
Deeplearning
⭐
6
Techvalleymachinelearning
⭐
6
Repo for projects associated with TechValleyMachineLearning
Tchatbot Api
⭐
6
A Flask REST API to serve trained ChatBots using Tensorflow Serving and Docker Containers
Introduction To Data Analyst And Data Science
⭐
6
Introduction to Data Analyst and Data Science for Beginners
Nlcodec
⭐
5
Natural Language EnCoder-Decoder: word, char, bpe etc
Toxine
⭐
5
Tiny preprocessor for Russian text
Docutron
⭐
5
Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.
Chinese Nlp Preprocessing
⭐
5
Simple preprocessing scripts for Chinese NLP.
Social Media Data Analysis
⭐
5
Tweets Filtering; Tweets Preprocessing; Tweet Representation; Sentiment Analysis; Geographical Analysis
Related Searches
Python Natural Language Processing (7,915)
Jupyter Notebook Natural Language Processing (4,405)
Machine Learning Natural Language Processing (3,939)
Deep Learning Natural Language Processing (2,345)
Pytorch Natural Language Processing (1,212)
Python Preprocessing (1,083)
Artificial Intelligence Natural Language Processing (1,010)
Dataset Natural Language Processing (1,010)
Tensorflow Natural Language Processing (909)
Javascript Natural Language Processing (843)
1-25 of 25 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.