Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
The Top 10 Information Retrieval Open Source Projects
Open source projects categorized as Information Retrieval
Categories
>
Data Processing
>
Information Retrieval
Edit Category
JaidedAI/EasyOCR
⭐
20,438
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
arc53/DocsGPT
⭐
17,677
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
dependent packages
0
total releases
0
most recent commit
4 months ago
weaviate/weaviate
⭐
15,498
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
dependent packages
0
total releases
0
most recent commit
4 months ago
piskvorky/gensim
⭐
14,915
Topic Modelling for Humans
dependent packages
0
total releases
0
most recent commit
over 2 years ago
deepset-ai/haystack
⭐
12,474
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
danswer-ai/danswer
⭐
6,435
Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
neuml/txtai
⭐
6,143
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
dependent packages
0
total releases
0
most recent commit
over 2 years ago
Unstructured-IO/unstructured
⭐
4,404
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
apache/lucene-solr
⭐
4,363
Apache Lucene and Solr open-source search software
dependent packages
0
total releases
0
most recent commit
over 2 years ago
marqo-ai/marqo
⭐
3,893
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
dependent packages
0
total releases
0
most recent commit
over 2 years ago
Get A Weekly Email With Trending Information Retrieval Projects
No Spam. Unsubscribe easily at any time.
Information Retrieval
Subscribe
Javascript must be enabled to subscribe.
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2026 Awesome Open Source. All rights reserved.