Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for machine learning embeddings
embeddings
x
machine-learning
x
173 search results found
Gradio
⭐
25,823
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Txtai
⭐
6,143
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Pytorch Metric Learning
⭐
5,734
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Postgresml
⭐
5,135
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
Spark Nlp
⭐
3,578
State of the Art Natural Language Processing
Hub
⭐
3,420
A library for transfer learning by reusing parts of TensorFlow models.
Lance
⭐
3,003
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
Towhee
⭐
2,903
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Siamese Triplet
⭐
2,853
Siamese and triplet networks with online pair/triplet mining in PyTorch
Lightly
⭐
2,665
A python library for self-supervised learning on images.
Ml Surveys
⭐
2,538
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
Awesome Community Detection
⭐
2,232
A curated list of community detection research papers with implementations.
Pytorch Nlp
⭐
2,180
Basic Utilities for PyTorch Natural Language Processing (NLP)
Prompttools
⭐
2,112
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
Ampligraph
⭐
1,963
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
Llmware
⭐
1,859
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
Vearch
⭐
1,839
A distributed vector database for embedding-based vector retrieval
Featureform
⭐
1,670
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Text Embeddings Inference
⭐
1,566
A blazing fast inference solution for text embeddings models
Magnitude
⭐
1,542
A fast, efficient universal vector embedding utility package.
Awesome Embedding Models
⭐
1,329
A curated list of awesome embedding models tutorials, projects and communities.
Opentsne
⭐
1,295
Extensible, parallel implementations of t-SNE
Capsgnn
⭐
1,180
A PyTorch implementation of "Capsule Graph Neural Network" (ICLR 2019).
Dgl Ke
⭐
1,091
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
Paperai
⭐
1,023
📄 🤖 Semantic search and workflows for medical/scientific papers
Generative Ai Docs
⭐
871
Documentation for Google's Generative AI developer site
Nlp In Practice
⭐
861
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Vectordb
⭐
829
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Recsys
⭐
815
计算广告/推荐系统/机器学习(Machine Learning)/点击率(CTR)/转化率(CVR)预估/点击率预估
Nlp
⭐
734
📝 This repository recorded my NLP journey.
Catalyst
⭐
635
🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
Vectorflow
⭐
566
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
Speedtorch
⭐
559
Library for faster pinned CPU <-> GPU transfer in Pytorch
Generative Ai Python
⭐
553
The Google AI Python SDK enables developers to use Google's state-of-the-art generative AI models (like Gemini and PaLM) to build AI-powered features and applications.
What_are_embeddings
⭐
552
A deep dive into embeddings starting from fundamentals
Vectorhub
⭐
546
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)
Lantern
⭐
530
PostgreSQL vector database extension for building AI applications
Ml4se
⭐
511
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Pymde
⭐
480
Minimum-distortion embedding with PyTorch
Codequestion
⭐
457
🔎 Semantic search for developers
Cleora
⭐
434
Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Embedbase
⭐
434
A dead-simple API to build LLM-powered apps
Treelstm.pytorch
⭐
434
Tree LSTM implementation in PyTorch
Neuronlp2
⭐
358
Deep neural models for core NLP tasks (Pytorch version)
Bio_embeddings
⭐
357
Get protein embeddings from protein sequences
Awesome Feature Engineering
⭐
316
A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Tinyvector
⭐
314
A tiny embedding database in pure Rust.
Jodie
⭐
314
A PyTorch implementation of ACM SIGKDD 2019 paper "Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks"
Attentionwalk
⭐
299
A PyTorch Implementation of "Watch Your Step: Learning Node Embeddings via Graph Attention" (NeurIPS 2018).
Vectorai
⭐
299
Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.
Nlp Natural Language Processing
⭐
289
Projects and useful articles / links
Examples
⭐
289
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
Personality Detection
⭐
273
Implementation of a hierarchical CNN based model to detect Big Five personality traits
Polish Nlp Resources
⭐
267
Pre-trained models and language resources for Natural Language Processing in Polish
Vectordb Recipes
⭐
267
High quality resources & applications for LLMs, multi-modal models and VectorDBs
Wizmap
⭐
246
Explore and interpret large embeddings in your browser with interactive visualization! 📍
Gemsec
⭐
234
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Pyrdf2vec
⭐
232
🐍 Python Implementation and Extension of RDF2Vec
Tsne Umap Embedding Visualisation
⭐
217
A Simple and easy to use way to Visualise Embeddings!
Coursera Natural Language Processing Specialization
⭐
217
Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.
Dilated Cnn Ner
⭐
214
Dilated CNNs for NER in TensorFlow
Poincare Embedding
⭐
212
Poincaré Embedding (unofficial)
Facerecognition_with_facenet_android
⭐
211
Face Recognition using the FaceNet model and MLKit on Android.
Food2vec
⭐
207
🍔
Semantic Embeddings
⭐
206
Hierarchy-based Image Embeddings for Semantic Image Retrieval
Embedditor
⭐
192
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
Danmf
⭐
187
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Bootleg
⭐
179
Self-Supervision for Named Entity Disambiguation at the Tail
Graphwavemachine
⭐
171
A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".
Clickbaits_revisited
⭐
169
Deep learning models to identify clickbaits taking content into consideration
Safe
⭐
154
SAFE: Self-Attentive Function Embeddings for binary similarity
Ncc
⭐
132
Neural Code Comprehension: A Learnable Representation of Code Semantics
Kitabe
⭐
127
Book Recommendation System built for Book Lovers📖. Simply Rate ⭐ some books and get immediate recommendations🤩
Coreference Resolution
⭐
118
Efficient and clean PyTorch reimplementation of "End-to-end Neural Coreference Resolution" (Lee et al., EMNLP 2017).
Pytsetlinmachine
⭐
118
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, and clause indexing
Diff2vec
⭐
116
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
Text
⭐
112
Using Transformers from HuggingFace in R
Parametricumap_paper
⭐
112
Parametric UMAP embeddings for representation and semisupervised learning. From the paper "Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning" (Sainburg, McInnes, Gentner, 2020).
Chatgpt Your Files
⭐
111
Production-ready MVP for securely chatting with your documents using pgvector
Hyte
⭐
103
EMNLP 2018: HyTE: Hyperplane-based Temporally aware Knowledge Graph Embedding
Dna2vec
⭐
103
dna2vec: Consistent vector representations of variable-length k-mers
Emdash
⭐
99
📚🧙♂️ Wisdom indexer — use AI to organize text snippets so you can actually remember & learn from what you read
Nlp Cheat Sheet Python
⭐
98
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Fastrtext
⭐
97
R wrapper for fastText
Walklets
⭐
96
A lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Verse
⭐
94
Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures
Tigerlily
⭐
90
TigerLily: Finding drug interactions in silico with the Graph.
Enso
⭐
89
Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods
100 Days Of Nlp
⭐
83
Sytora
⭐
83
A sophisticated smart symptom search engine
Sadedegel
⭐
81
A General Purpose NLP library for Turkish
Ice
⭐
81
ICE: Item Concept Embedding
Graph Pattern Learner
⭐
71
Evolutionary Graph Pattern Learner that learns SPARQL queries for a given set of source-target-pairs from an endpoint.
Easy Bert
⭐
71
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Basic Machine Learning
⭐
71
This is a repo of basic Machine Learning what I learn. More to go...
Contiguous Succotash
⭐
70
Recurrent Variational Autoencoder with Dilated Convolutions that generates sequential data implemented in pytorch
Dreml
⭐
66
PyTorch implementation of Deep Randomized Ensembles for Metric Learning(ECCV2018)
Graphml Tutorials
⭐
63
Tutorials for Machine Learning on Graphs
Codesnippetsearch
⭐
63
Neural bag of words code search implementation using PyTorch and data from the CodeSearchNet project.
Objects That Sound
⭐
62
Implementation of Google Deepmind's paper `Objects that Sound`
Related Searches
Python Machine Learning (14,099)
Jupyter Notebook Machine Learning (12,247)
Machine Learning Neural Network (4,397)
Machine Learning Tensorflow (4,050)
Machine Learning Natural Language Processing (3,891)
Machine Learning Artificial Intelligence (3,877)
Machine Learning Data Science (3,802)
Machine Learning Pytorch (2,910)
Machine Learning Classification (2,529)
Machine Learning Dataset (2,298)
1-100 of 173 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.