Twembeddings

Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora
Alternatives To Twembeddings
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Ekphrasis583
72 years ago54May 17, 202218mitPython
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Chat_corpus216
7 years ago4
chat corpus collection from various open sources
Tweet Secret181
11 years agomitClojure
This is a text steganography application optimized for use on Twitter, written in Clojure.
Bitcoin Value Predictor90
5 years ago1Jupyter Notebook
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Textgrounder60
8 years ago1apache-2.0Scala
A system for connecting language to space and time.
Broad_twitter_corpus52
2 years ago9otherJupyter Notebook
The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors
Twitter Corpus46
6 years ago5otherPython
Collects all tweets from the sample Public stream using Twitter's streaming API, and saves them to a file for later use as a corpus.
Extract_covid19_events_from_twitter45
2 years ago1gpl-3.0Python
Annotated corpus and code for "Extracting COVID-19 Events from Twitter".
News Media Reliability32
4 years ago1Python
Twitter_scraper29
8 years ago2Python
Scrap real time posts from twitter through the streaming api
Alternatives To Twembeddings
Select To Compare


Alternative Project Comparisons
Popular Corpus Projects
Popular Twitter Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Dataset
Twitter
Corpus
Tweets
Embeddings
Tf Idf