Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for corpus word2vec
corpus
x
word2vec
x
77 search results found
Nlp_chinese_corpus
⭐
8,344
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Lexvec
⭐
700
This is an implementation of the LexVec word embedding model (similar to word2vec and GloVe) that achieves state of the art results in multiple NLP tasks
Ngram2vec
⭐
638
Four word embedding models implemented in Python. Supporting arbitrary context features
Wiki2vec
⭐
587
Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby
Magpie
⭐
574
Deep neural network framework for multi-label text classification
Ner Lstm
⭐
528
Named Entity Recognition using multilayered bidirectional LSTM
Word Embedding Dimensionality Selection
⭐
320
On the Dimensionality of Word Embedding
Chinese Word2vec
⭐
319
word2vec/glove/swivel binary file on chinese corpus
Abcnn
⭐
252
Implementation of ABCNN(Attention-Based Convolutional Neural Network) on Tensorflow
Movietaster Open
⭐
241
A practical movie recommend project based on Item2vec.
Germanwordembeddings
⭐
224
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Sensegram
⭐
211
Making sense embedding out of word embeddings using graph-based word sense induction
Fasttextjapanesetutorial
⭐
174
Tutorial to train fastText with Japanese corpus
Word2vec Spam Filter
⭐
147
Using word vectors to classify spam messages
Word2vec Lucene
⭐
127
This tool extracts word vectors from Lucene index.
Word_embeddings
⭐
112
Code for the blog post "Making Sense of Word2vec"
Ner Crf
⭐
107
CRF to detect named entities (primarily names of people)
Chive
⭐
105
Japanese word embedding with Sudachi and NWJC 🌿
Dict2vec
⭐
88
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Word2vec
⭐
81
word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from the scratch
Russian_news_corpus
⭐
76
Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Ja.text8
⭐
74
Japanese text8 corpus for word embedding.
Wiki Word2vec
⭐
66
Train a gensim word2vec model on Wikipedia.
Word2vec_torch
⭐
60
Word2Vec implementation in Torch
Japanese Words To Vectors
⭐
57
Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.
Ner Pt
⭐
54
Portuguese Named Entity Recognition
Word2vec Scala
⭐
46
Scala port of the word2vec toolkit.
Autoencode
⭐
40
AutoenCODE is a Deep Learning infrastructure that allows to encode source code fragments into vector representations, which can be used to learn similarities.
Word2vec On Wikipedia
⭐
39
A pipeline for training word embeddings using word2vec on wikipedia corpus.
Word2vec
⭐
38
訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Geiger
⭐
36
get a sense of comments from a safe distance
Topic Labeling
⭐
35
The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics.
Word2vec Chinese
⭐
34
a tutorial for training Chinese-word2vec using Wiki corpus
Text Classification Cn
⭐
33
中文文本分类实践,基于搜狗新闻语料库,采用传统机器学习方法以及预训练模型等方法
Align Linguistic Alignment
⭐
33
Python library for extracting quantitative, reproducible metrics of multi-level alignment between two speakers in naturalistic language corpora.
Cade
⭐
30
Compass-aligned Distributional Embeddings. Align embeddings from different corpora
Word_embedding
⭐
26
Sample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding.
Word2veclite
⭐
24
Python implementation of Word2Vec
Hs Word2vec
⭐
23
A port of Google's word2vec to Haskell
Darks Learning
⭐
18
Darks learning is the machine learning algorithm library. It contains Word2vec,DBN, RBM, MLP, LSA, PLSA, SDA, Maxent, regression, etc.
Embeddings
⭐
17
spark job, sangria server, and react front-end for Word2Vec models
Word2vecfz
⭐
17
Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.
Char2vec
⭐
16
Implementation of char2vec model from http://www.aclweb.org/anthology/W/W16/W16-1603.pdf
Searchbetter
⭐
16
SearchBetter: query rewriting for search engines on small corpuses (Harvard research project)
Bengali Word Embedding
⭐
15
Bengali Word Embedding
Wordfish Python
⭐
15
extract relationships from standardized terms from corpus of interest with deep learning 🐟
Edit Unsup Ts
⭐
15
This repo contains the code for our paper "Iterative Edit-Based Unsupervised Sentence Simplification" accepted at ACL 2020.
W2v_ol
⭐
15
Using word embeddings (word2vec) for ontology learning
Word2vec
⭐
15
Word2Vec - Google's word2vec in Scala using UMASS factorie library for better hacking and research.
Word2vec Embeddings For Nepali Language
⭐
13
Word Embeddings (Word2Vec) for Nepali Language
Elang
⭐
13
Word Embedding utilities for Language Models (English & Indonesian)
Cnn_chinese_text_classification
⭐
11
运用cnn + highway network网络结构中文文本分类
Nlp Augment
⭐
10
A collection of utilities used in exploring data augmentation of low-resource parallel corpuses.
Word2vec.net Csharp
⭐
10
Word2Vec.Net-CSharp
Neulearn Ai_agent
⭐
10
대화형 에이전트 프로젝트 - Neulearn
Case2vec
⭐
10
A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated Council of Law Reporting for England & Wales (https://www.iclr.co.uk).
Romanian Word Embeddings
⭐
10
Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gensim library). The .vec and .model files are available for download (all in one archive).
Char2vec
⭐
10
Training from scratch a character embedding following Word2Vec, using tensorflow.
Word2vec4kor
⭐
9
Russian_subtitles_dataset
⭐
9
Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neural network training, chat bot training or in any other NLP task.
Scholar
⭐
8
Simple interface for Word2Vec in Python
Word2vecincrementallearning
⭐
8
word2vec variations
Nlp Course
⭐
8
NLP Course stuff and algorithm implementations
Multi Embedding Cws
⭐
8
Multiple Character Embeddings for Chinese Word Segmentation, ACL 2019
Vecshare
⭐
8
This library provides functionality for rapidly sharing and retrieving word embeddings over the internet. (EMNLP 2017).
Word2vector
⭐
8
用百科数据和搜狗新闻数据训练word2vec模型
Named Entity Recognition
⭐
7
This is a project in python to extract named entities from the given text corpus. You can use this project directly on your text corpus (changing path in config file) to train the model and score it on new corpus.
Wiki_zh_vec
⭐
7
a python autotool for train Chinese wiki corpus to word embeddings using word2vec ,glove and lexvec.
Dependency Based W2v
⭐
7
The code for improved dependency-based word2vec
Word2vec On Farsi Literature
⭐
6
Ngram Word2vec
⭐
6
News Review Pickup
⭐
6
新闻人物言论自动提取---->得到说话的人和说话的内容
Cluster Preprocessing
⭐
5
preprocessing of large corpora to induce various cluster types
Text Categorization Using Neural Word Embeddings
⭐
5
This is a practical implementation implementing neural networks on top of fasttext as well as word2vec word embeddings.
Relation_extraction
⭐
5
use cnn to extraction relation in chinese.
Benchmark Word2vec Frameworks
⭐
5
Benchmark popular Neural Networks frameworks for word2vec on different hardware platforms
Npm Recommender
⭐
5
uses word2vec to recommend NPM packages similar to those you already like
Related Searches
Python Corpus (2,447)
Python Word2vec (1,108)
Jupyter Notebook Word2vec (564)
Natural Language Processing Word2vec (515)
Natural Language Processing Corpus (510)
Embeddings Word2vec (354)
Dataset Corpus (342)
Java Corpus (308)
Language Corpus (261)
1-77 of 77 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.