Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for unstructured data
unstructured-data
x
29 search results found
Towhee
⭐
2,903
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Vdp
⭐
1,556
💧 Instill VDP (Versatile Data Pipeline) is an open-source tool to seamlessly integrate AI to process unstructured data in the modern data stack
Bootcamp
⭐
1,521
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Awesome Document Understanding
⭐
783
A curated list of resources for Document Understanding (DU) topic
Spotlight
⭐
766
Interactively explore unstructured datasets from your dataframe.
Lilac
⭐
575
Curate better data for LLMs
Nucliadb
⭐
532
NucliaDB, The AI Search database for unstructured data
Dingo
⭐
276
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
Pygrok
⭐
201
python implementation of jordansissel's grok regular expression library
Trex
⭐
182
Intelligently transform unstructured to structured data
Dkm
⭐
94
Dynamic Kernel Matching (DKM) for Classifying Data with Non-conforming Features
Relevanceai
⭐
84
Home of the AI workforce - Multi-agent system, AI agents & tools
Bracmat
⭐
46
Programming language for symbolic computation with unusual combination of pattern matching features: Tree patterns, associative patterns and expressions embedded in patterns.
Pixiedust Facebook Analysis
⭐
42
A Jupyter notebook that uses the Watson Visual Recognition and Natural Language Understanding services to enrich Facebook Analytics and uses Cognos Dashboard Embedded to explore and visualize the results in Watson Studio
Base
⭐
24
Adansons Base is a data programming tool for error-analysis of training results. It organizes metadata of unstructured data and creates and organizes datasets. It makes dataset creation more effective and helps to find low-quality data by using the training results and improves AI performance.
Console
⭐
20
⛅ Versatile Data Pipeline (VDP) console website
Model
⭐
19
⚗️ Instill Model contains components for AI model orchestration
Cli
⭐
17
📺 Instill AI's official command line tool
Model Backend
⭐
14
⇋ A REST/gRPC server for Instill Model API service
Core
⭐
12
🔮 Instill Core contains components for supporting Instill VDP and Instill Model
Pipeline Backend
⭐
12
⇋ A REST/gRPC server for Instill VDP API service
Generate Insights From Data Formats With Watson
⭐
11
How do we process data in different formats like docx, pdf etc and generate insights to be linked with structured data in database?This pattern helps in establishing relations between structured & unstructured data to generate recommendations using Watson NLU & Watson Studio.
Unstructuredio Haystack
⭐
11
💙 Unstructured Data Connectors for Haystack 2.0
Wibble
⭐
11
Web Data Frames
Soledata.jl
⭐
11
Manage unstructured and multimodal datasets!
Html_tag_annotator
⭐
8
A Machine Learning tool to create the training dataset very quickly & easily by using a smart chrome extension
Gotz
⭐
8
Gotz - Heavy duty ETL to automate data extraction from tons of HTML pages
Rl3stdlib
⭐
7
The RL3 Standard Library is a collection of modules accessible to a RL3 program to simplify the programming process and removing the need to rewrite commonly used RL3 patterns and predicates.
Content Repository With Dynamic Access Control
⭐
5
Code and walkthrough to build an end-to-end content repository for unstructured data with dynamic access control.
1-29 of 29 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.