Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for embeddings
embeddings
x
2,004 search results found
Supabase
⭐
62,208
The open source Firebase alternative.
Gradio
⭐
25,823
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Langchain Chatchat
⭐
21,633
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain
Sentence Transformers
⭐
12,951
Multilingual Sentence & Image Embeddings with BERT
Chinese Word Vectors
⭐
11,230
100+ Chinese Word Vectors 上百种预训练中文词向量
Paddlenlp
⭐
10,908
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Chroma
⭐
10,682
the AI-native open-source embedding database
H2ogpt
⭐
9,542
Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
Nlp_course
⭐
9,139
YSDA course in Natural Language Processing
Flutter Desktop Embedding
⭐
7,088
Experimental plugins for Flutter for Desktop
Txtai
⭐
6,143
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Pytorch Metric Learning
⭐
5,734
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Postgresml
⭐
5,135
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
Generative Ai
⭐
4,453
Sample code and notebooks for Generative AI on Google Cloud
Mygptreader
⭐
4,267
A community-driven way to read and chat with AI bots - powered by chatGPT.
Pytorch Sentiment Analysis
⭐
4,133
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Text2vec
⭐
3,610
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-
Spark Nlp
⭐
3,578
State of the Art Natural Language Processing
Laser
⭐
3,460
Language-Agnostic SEntence Representations
Hub
⭐
3,420
A library for transfer learning by reusing parts of TensorFlow models.
Pytorch Biggraph
⭐
3,326
Generate embeddings from large-scale graph-structured data.
Keybert
⭐
3,047
Minimal keyword extraction with BERT
Lance
⭐
3,003
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
Towhee
⭐
2,903
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Siamese Triplet
⭐
2,853
Siamese and triplet networks with online pair/triplet mining in PyTorch
Muse
⭐
2,844
A library for Multilingual Unsupervised or Supervised word Embeddings
Flagembedding
⭐
2,797
Dense Retrieval and Retrieval-augmented LLMs
Lightly
⭐
2,665
A python library for self-supervised learning on images.
Embedai
⭐
2,609
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
Esm
⭐
2,577
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
Deepwalk
⭐
2,561
DeepWalk - Deep Learning for Graphs
Ml Surveys
⭐
2,538
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
Resemblyzer
⭐
2,415
A python package to analyze and compare voices with deep learning
Awesome Community Detection
⭐
2,232
A curated list of community detection research papers with implementations.
Awesome Network Embedding
⭐
2,218
A curated list of network embedding techniques.
Pytorch Nlp
⭐
2,180
Basic Utilities for PyTorch Natural Language Processing (NLP)
Infersent
⭐
2,160
InferSent sentence embeddings
Awesome Knowledge Graph
⭐
2,122
整理知识图谱相关学习资料
Prompttools
⭐
2,112
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
Ampligraph
⭐
1,963
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
Llmware
⭐
1,859
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
Vearch
⭐
1,839
A distributed vector database for embedding-based vector retrieval
Awesome Sentence Embedding
⭐
1,831
A curated list of pretrained sentence and word embedding models
Sequence_tagging
⭐
1,725
Named Entity Recognition (LSTM + CRF) - Tensorflow
Gptdiscord
⭐
1,720
A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!
Featureform
⭐
1,716
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Awesome Generative Ai
⭐
1,704
A curated list of Generative AI tools, works, models, and references
Keras Textclassification
⭐
1,641
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastTex RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Instructor Embedding
⭐
1,630
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Senteval
⭐
1,627
A python tool for evaluating the quality of sentence embeddings.
Bilm Tf
⭐
1,607
Tensorflow implementation of contextualized word representations from bi-directional language models
Poincare Embeddings
⭐
1,607
PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"
Stackgan
⭐
1,602
Text Embeddings Inference
⭐
1,566
A blazing fast inference solution for text embeddings models
Nlp Journey
⭐
1,563
Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Langchain4j
⭐
1,550
Java version of LangChain
Magnitude
⭐
1,542
A fast, efficient universal vector embedding utility package.
Obsidian Smart Connections
⭐
1,498
Chat with your notes in Obsidian! Plus, see what's most relevant in real-time! Interact and stay organized. Powered by OpenAI ChatGPT, GPT-4 & Embeddings.
Unsupervisedmt
⭐
1,437
Phrase-Based & Neural Unsupervised Machine Translation
Eda_nlp
⭐
1,405
Data augmentation for NLP, presented at EMNLP 2019
Stock Rnn
⭐
1,339
Predict stock market prices using RNN model with multilayer LSTM cells + optional multi-stock embeddings.
Awesome Embedding Models
⭐
1,329
A curated list of awesome embedding models tutorials, projects and communities.
Opentsne
⭐
1,295
Extensible, parallel implementations of t-SNE
Deep Siamese Text Similarity
⭐
1,216
Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character/word embeddings
Capsgnn
⭐
1,180
A PyTorch implementation of "Capsule Graph Neural Network" (ICLR 2019).
Hmtl
⭐
1,176
🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP
Glove Python
⭐
1,171
Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/
Hazm
⭐
1,142
Persian NLP Toolkit
Contextualized Topic Models
⭐
1,141
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Node2vec
⭐
1,141
Implementation of the node2vec algorithm.
Conceptnet Numberbatch
⭐
1,114
Sent2vec
⭐
1,107
General purpose unsupervised sentence representations
Bert Extractive Summarizer
⭐
1,096
Easy to use extractive text summarization with BERT
Dgl Ke
⭐
1,091
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
Natasha
⭐
1,085
Solves basic Russian NLP tasks, API for lower level Natasha projects
Bpemb
⭐
1,068
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Paperai
⭐
1,023
📄 🤖 Semantic search and workflows for medical/scientific papers
Latticelstm
⭐
1,018
Chinese NER using Lattice LSTM. Code for ACL 2018 paper.
Sif
⭐
990
sentence embedding by Smooth Inverse Frequency weighting scheme
Gpt Vup
⭐
988
GPT-vup BIliBili | 抖音 | AI | 虚拟主播
Trieve
⭐
976
All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.
Nlp_overview
⭐
958
Overview of Modern Deep Learning Techniques Applied to Natural Language Processing
Infinity
⭐
936
The AI-native database built for LLM applications, providing incredibly fast vector and full-text search
Wikipedia2vec
⭐
899
A tool for learning vector representations of words and entities from Wikipedia
Seagoat
⭐
881
local-first semantic code search engine
Generative Ai Docs
⭐
871
Documentation for Google's Generative AI developer site
Nlp In Practice
⭐
861
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Awesome 2vec
⭐
858
Curated list of 2vec-type embedding models
Line
⭐
853
LINE: Large-scale information network embedding
Llama Node
⭐
846
Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
Vectordb
⭐
829
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Text2vec
⭐
829
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Recsys
⭐
815
计算广告/推荐系统/机器学习(Machine Learning)/点击率(CTR)/转化率(CVR)预估/点击率预估
Inltk
⭐
808
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Flat Lattice Transformer
⭐
806
code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
Zero Shot Gcn
⭐
789
Zero-Shot Learning with GCN (CVPR 2018)
Nlu
⭐
775
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
Tensorflow Triplet Loss
⭐
764
Implementation of triplet loss in TensorFlow
Nlp
⭐
734
📝 This repository recorded my NLP journey.
Modelfusion
⭐
712
The TypeScript library for building AI applications.
Related Searches
Python Embeddings (2,141)
Jupyter Notebook Embeddings (708)
1-100 of 2,004 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.