Awesome Open Source
Search results for python embeddings
1,181 search results found
Gradio
⭐
25,823
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Langchain Chatchat
⭐
21,633
Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-base question-answering app built on Langchain and language models such as ChatGLM
Sentence Transformers
⭐
12,951
Multilingual Sentence & Image Embeddings with BERT
Chinese Word Vectors
⭐
11,230
100+ pretrained Chinese word vectors
Paddlenlp
⭐
10,908
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Chroma
⭐
10,682
the AI-native open-source embedding database
H2ogpt
⭐
9,542
Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
Txtai
⭐
6,143
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Postgresml
⭐
5,135
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
Mygptreader
⭐
4,267
A community-driven way to read and chat with AI bots - powered by chatGPT.
Text2vec
⭐
3,610
text2vec, text to vector. A text vector representation tool that converts text into vector matrices; implements Word2Vec, RankBM25, Sentence-
Hub
⭐
3,420
A library for transfer learning by reusing parts of TensorFlow models.
Pytorch Biggraph
⭐
3,326
Generate embeddings from large-scale graph-structured data.
Keybert
⭐
3,047
Minimal keyword extraction with BERT
Lance
⭐
3,003
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and Pyarrow, with more integrations coming.
Towhee
⭐
2,903
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Siamese Triplet
⭐
2,853
Siamese and triplet networks with online pair/triplet mining in PyTorch
Muse
⭐
2,844
A library for Multilingual Unsupervised or Supervised word Embeddings
Flagembedding
⭐
2,797
Dense Retrieval and Retrieval-augmented LLMs
Lightly
⭐
2,665
A python library for self-supervised learning on images.
Esm
⭐
2,577
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
Deepwalk
⭐
2,561
DeepWalk - Deep Learning for Graphs
Resemblyzer
⭐
2,415
A python package to analyze and compare voices with deep learning
Awesome Community Detection
⭐
2,232
A curated list of community detection research papers with implementations.
Awesome Network Embedding
⭐
2,218
A curated list of network embedding techniques.
Pytorch Nlp
⭐
2,180
Basic Utilities for PyTorch Natural Language Processing (NLP)
Prompttools
⭐
2,112
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
Ampligraph
⭐
1,963
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
Llmware
⭐
1,859
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
Awesome Sentence Embedding
⭐
1,831
A curated list of pretrained sentence and word embedding models
Sequence_tagging
⭐
1,725
Named Entity Recognition (LSTM + CRF) - Tensorflow
Gptdiscord
⭐
1,720
A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!
Featureform
⭐
1,716
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Keras Textclassification
⭐
1,641
Chinese text classification with Keras NLP: long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity. Base classes for building character-, word-, and sentence-level embedding layers (embeddings) and network layers (graph). FastText, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Senteval
⭐
1,627
A python tool for evaluating the quality of sentence embeddings.
Bilm Tf
⭐
1,607
Tensorflow implementation of contextualized word representations from bi-directional language models
Poincare Embeddings
⭐
1,607
PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"
Stackgan
⭐
1,602
Nlp Journey
⭐
1,563
Documents, papers, and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
Magnitude
⭐
1,542
A fast, efficient universal vector embedding utility package.
Unsupervisedmt
⭐
1,437
Phrase-Based & Neural Unsupervised Machine Translation
Eda_nlp
⭐
1,405
Data augmentation for NLP, presented at EMNLP 2019
Stock Rnn
⭐
1,339
Predict stock market prices using RNN model with multilayer LSTM cells + optional multi-stock embeddings.
Opentsne
⭐
1,295
Extensible, parallel implementations of t-SNE
Deep Siamese Text Similarity
⭐
1,216
Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character/word embeddings
Capsgnn
⭐
1,180
A PyTorch implementation of "Capsule Graph Neural Network" (ICLR 2019).
Hmtl
⭐
1,176
🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP
Glove Python
⭐
1,171
Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/
Contextualized Topic Models
⭐
1,141
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Node2vec
⭐
1,141
Implementation of the node2vec algorithm.
Hazm
⭐
1,126
Persian NLP Toolkit
Conceptnet Numberbatch
⭐
1,114
Bert Extractive Summarizer
⭐
1,096
Easy to use extractive text summarization with BERT
Dgl Ke
⭐
1,091
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
Natasha
⭐
1,085
Solves basic Russian NLP tasks, API for lower level Natasha projects
Bpemb
⭐
1,068
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Paperai
⭐
1,023
📄 🤖 Semantic search and workflows for medical/scientific papers
Latticelstm
⭐
1,018
Chinese NER using Lattice LSTM. Code for ACL 2018 paper.
Gpt Vup
⭐
988
GPT-vup: BiliBili | Douyin | AI | virtual streamer
Wikipedia2vec
⭐
899
A tool for learning vector representations of words and entities from Wikipedia
Seagoat
⭐
881
local-first semantic code search engine
Awesome 2vec
⭐
858
Curated list of 2vec-type embedding models
Inltk
⭐
808
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.
Flat Lattice Transformer
⭐
806
code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
Zero Shot Gcn
⭐
789
Zero-Shot Learning with GCN (CVPR 2018)
Nlu
⭐
775
One line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
Tensorflow Triplet Loss
⭐
764
Implementation of triplet loss in TensorFlow
Nlp
⭐
734
📝 This repository recorded my NLP journey.
Frogbase
⭐
704
Transform audio-visual content into navigable knowledge.
Neumai
⭐
693
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Swiss_army_llama
⭐
688
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
Polyfuzz
⭐
671
Fuzzy string matching, grouping, and evaluation.
Word2vec Graph
⭐
650
Exploring word2vec embeddings as a graph of nearest neighbors
Triplet Reid
⭐
642
Code for reproducing the results of our "In Defense of the Triplet Loss for Person Re-Identification" paper.
Ngram2vec
⭐
638
Four word embedding models implemented in Python. Supporting arbitrary context features
Doc2vec
⭐
619
Python scripts for training/testing paragraph vectors
Mat2vec
⭐
581
Supplementary Materials for Tshitoyan et al. "Unsupervised word embeddings capture latent knowledge from materials science literature", Nature (2019).
Conve
⭐
574
Convolutional 2D Knowledge Graph Embeddings resources
Chatweb
⭐
573
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Vectorflow
⭐
566
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
Speedtorch
⭐
559
Library for faster pinned CPU <-> GPU transfer in Pytorch
Multi Class Text Classification Cnn Rnn
⭐
554
Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.
Generative Ai Python
⭐
553
The Google AI Python SDK enables developers to use Google's state-of-the-art generative AI models (like Gemini and PaLM) to build AI-powered features and applications.
Vectorhub
⭐
546
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)
Ner Lstm
⭐
528
Named Entity Recognition using multilayered bidirectional LSTM
Compgcn
⭐
519
ICLR 2020: Composition-Based Multi-Relational Graph Convolutional Networks
Nlp_pytorch_project
⭐
517
Embedding, NMT, Text_Classification, Text_Generation, NER etc.
Etm
⭐
507
Topic Modeling in Embedding Spaces
Context
⭐
496
A CLI tool & API over the top 1221 Python libraries.
Pymde
⭐
480
Minimum-distortion embedding with PyTorch
Openea
⭐
470
A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs, VLDB 2020
Vecmap
⭐
466
A framework to learn cross-lingual word embedding mappings
Deepnl
⭐
463
Deep Learning for Natural Language Processing
Codequestion
⭐
457
🔎 Semantic search for developers
Stackgan Pytorch
⭐
457
Chitchatassistant
⭐
440
Rasa Chinese chatbot
Treelstm.pytorch
⭐
434
Tree LSTM implementation in PyTorch
Img2vec
⭐
426
🔥 Use pre-trained models in PyTorch to extract vector embeddings for any image
Undreamt
⭐
421
Unsupervised Neural Machine Translation
Structured Self Attention
⭐
412
A Structured Self-attentive Sentence Embedding
Related Searches
Python Machine Learning (20,195)
Python Dataset (14,792)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Network (11,495)
Python Algorithms (10,033)
Python Natural Language Processing (9,064)
Python Artificial Intelligence (8,580)
Python Pytorch (7,877)
Python Neural (7,444)
1-100 of 1,181 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.