Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for machine learning information retrieval
information-retrieval
x
machine-learning
x
64 search results found
Easyocr
⭐
20,438
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Gensim
⭐
15,180
Topic Modelling for Humans
Haystack
⭐
12,474
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Txtai
⭐
6,143
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Unstructured
⭐
4,404
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Marqo
⭐
3,893
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Catalyst
⭐
3,151
Accelerated deep learning R&D
Ranking
⭐
2,666
Learning to Rank in TensorFlow
Llmware
⭐
1,859
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
Awesome Fl
⭐
1,103
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Allrank
⭐
722
allRank is a framework for training learning-to-rank neural models based on PyTorch.
Talisman
⭐
666
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Resin
⭐
557
Vector space search engine. Available as a HTTP service or as an embedded library.
Raft
⭐
530
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
Rmdl
⭐
409
RMDL: Random Multimodel Deep Learning for Classification
Awesome Generative Information Retrieval
⭐
387
Automated Fact Checking Resources
⭐
303
Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).
Cherche
⭐
295
📑 Neural Search
Forte
⭐
215
Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
Mindflow
⭐
213
🧠 AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow 🌊
Information Retrieval
⭐
139
Neural information retrieval / semantic-search / Bi-Encoders
Chatgpt Retrievalqa
⭐
130
A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on real human responses.
Summary
⭐
126
summaries of all the papers I read
Foundry
⭐
125
The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning
Tika Similarity
⭐
100
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Ml4ir
⭐
83
Machine Learning for Information Retrieval
Sycamore
⭐
82
🍁 Sycamore is an LLM-powered semantic data preparation system for building search applications.
Machinelearningwithpython
⭐
79
Get started with Machine Learning with Python - An introduction with Python programming examples
Perke
⭐
67
A keyphrase extractor for Persian
Freediscovery
⭐
60
Web Service for E-Discovery Analytics
Cunvsm
⭐
49
Neural Vector Space Models
Horus Ner
⭐
47
HORUS: A framework to boost NLP tasks
Lamp
⭐
47
LaMP: When Large Language Models Meet Personalization
Aspire
⭐
46
Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.
Liblevenshtein Java
⭐
40
Various utilities regarding Levenshtein transducers. (Java)
Senet For Weakly Supervised Relation Extraction
⭐
35
Rankymcrankface
⭐
34
Hardened Fork of Ranklib learning to rank library
Bm25transformer
⭐
30
(Python) transform a document-term matrix to an Okapi/BM25 representation
Sigir19 Neural Ir
⭐
30
Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19
Fxt
⭐
28
A large scale feature extraction tool for text-based machine learning
Text Clf Baselines
⭐
24
WideMLP for Text Classification
Ai_booklet_ce Aut
⭐
20
Booklet and exam of Artificial Intelligence Master Degree at Amirkabir University of technology.
Fxt
⭐
18
A large scale feature extraction tool for text-based machine learning
Neural Search Pills
⭐
18
Knowledge pills on Neural Search
Information Retrieval
⭐
15
Elasticsearch, MongoDB, Tornado Server, RESTful API, Python, Information Retrieval, Machine Learning, Web Crawler
Ml Nlp Services
⭐
14
机器学习、深度学习、自然语言处理
Information_retrieval_system
⭐
13
The goal of this project is to implement a basic information retrieval system using Python, NLTK and GenSIM.
Paperlist_nlp_ir_rec_ai_conference
⭐
13
2016-至今nlp/ir/recsys/ai相关顶会的论文清单paperlist列表含目录,方便直
Hical
⭐
12
HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface.
Irel Reading Group
⭐
12
This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.
Dlkp
⭐
12
A deep learning library for identifying keyphrases from text
Swim Ir
⭐
11
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting.
Ecir2019 Qac
⭐
10
Supplemental material (data and models) for the paper "Identifying Unclear Questions in Community Question Answering Websites" in ECIR'19.
Es3
⭐
10
a shared space for humans and machines to work together to update systematic reviews.
Fortehealth
⭐
9
The project is in the incubation stage and still under development. ForteHealth is a flexible and powerful ML workflow builder for biomedical and clinical scenarios. This is part of the CASL project: http://casl-project.ai/
Docsgpt
⭐
8
This app allows users to easily query a PDF document using OpenAI's GPT-3 language model in Google Colab, utilizing Google Drive for storage.
Corpus2question
⭐
8
Using questions to summarize large amounts of textual data.
Recommendation Engine
⭐
8
Recommendation engine and it's algorithms in python , R .
R3s
⭐
8
Real-time Relevant Recommendation Suggestion
Latent Aspect Detection
⭐
7
Code and models for the paper "Latent Aspect Detection from Online Unsolicited Customer Reviews"
Findkit
⭐
7
A Python library for content-based information retrieval
Rank_text_cnn
⭐
7
Implementation of Learning to Rank Short Text Pairs with CNNs SIGIR'15 in Keras.
Icdclassifier
⭐
5
A Weka-based classifier/evaluator of text extracts (e.g. pathology reports) into ICD codes
Qnatables An Intelligent Question Answering System
⭐
5
Question Answering System to answer question over tables in a document
Machine_learning_focused_crawler
⭐
5
A focused web crawler that uses Machine Learning to fetch better relevant results.
Hierarchical Language Modeling
⭐
5
We address the task of learning contextualized word, sentence and document representations with a hierarchical language model by stacking Transformer-based encoders on a sentence level and subsequently on a document level and performing masked token prediction.
Related Searches
Python Machine Learning (14,099)
Jupyter Notebook Machine Learning (12,247)
Machine Learning Neural Network (4,397)
Machine Learning Tensorflow (4,050)
Machine Learning Natural Language Processing (3,891)
Machine Learning Artificial Intelligence (3,877)
Machine Learning Data Science (3,802)
Machine Learning Pytorch (2,910)
Machine Learning Dataset (2,298)
Machine Learning Computer Vision (1,966)
1-64 of 64 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.