Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for extract data
extract-data
x
40 search results found
Node Crawler
⭐
6,610
Web Crawler/Spider for NodeJS + server-side jQuery ;-)
Pymupdf
⭐
3,908
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Meltano
⭐
1,460
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Engauge Digitizer
⭐
841
Extracts data points from images of graphs
Crawly
⭐
790
Crawly, a high-level web crawling & scraping framework for Elixir.
Dataflowkit
⭐
394
Extract structured data from web sites. Web sites scraping.
Traceutility
⭐
176
Extract data from .trace documents generated by Instruments
Receipt Scanner
⭐
161
Receipt scanner extracts information from your PDF or image receipts - built in NodeJS
Resumeparser
⭐
146
A simple resume parser used for extracting information from resumes
Smapr
⭐
77
An R package for acquisition and processing of NASA SMAP data
Html2data
⭐
64
Library and cli for extracting data from HTML via CSS selectors
Web Data Extractor
⭐
54
Extracting and parsing structured data with jQuery Selector, XPath or JsonPath from common web format like HTML, XML and JSON.
Fb_scraper
⭐
52
FBLYZE is a Facebook scraping system and analysis system.
Html Table Extractor
⭐
51
extract data from html table
Insider Trading
⭐
38
This program extracts insider trading data from the sec website and stores it in excel file for the specified time frame.
Gr Eventstream
⭐
35
gr-eventstream is a set of GNU Radio blocks for creating precisely timed events and either inserting them into, or extracting them from normal data-streams precisely. It allows for the definition of high speed time-synchronous c++ burst event handlers, as well as bridging to standard GNU Radio Async PDU messages with precise timing easily.
Trio Plus Data
⭐
25
Extract audio and other data from the Digitech Trio Plus guitar pedal's SD card
Bluebird
⭐
23
Unofficial Python client for Twitter
Pinpoint Digitizer
⭐
19
Open source digitizer application to extract data from plots
Svg2data
⭐
19
A Python module for reading data from a plot provided as SVG file.
Pga
⭐
19
This is a library for making batch request to Google Analytics Core Reporting v3 API and extracting data from Google Analytics property into Python 3 data structures.
Extract Colors Py
⭐
17
Extract colors from an image. Colors are grouped based on visual similarities using the CIE76 formula.
Webhere
⭐
16
HTML scraping for Objective-C.
Pylyrics Extractor
⭐
15
Get Lyrics for any songs by just passing in the song name (spelled or misspelled) in less than 2 seconds using this awesome Python Library.
Pdfix_sdk_example_cpp
⭐
14
Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...
Arksavegametoolkitnet
⭐
14
Library for reading ARK Survival Evolved savegame files using C#.
Serritor
⭐
13
Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.
Tap Dbt
⭐
12
Singer Tap for dbt API v2 built with the Meltano SDK
Dokuextractor
⭐
10
Easily extract data from PDF documents
Sqlitediskexplorer
⭐
9
SQLiteDiskExplorer enables you to explore, catalog, and batch extract SQLite files from disks and removable media.
Pdfix_sdk_example_dotnet
⭐
9
Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...
Unityassetreplacer
⭐
8
A tool to replace data in a Unity Asset Bundle from modified files.
Mdict_reader
⭐
8
Extract data from Octopus mdict (*.mdd, *.mdx) files
Linux_project
⭐
8
a linux lab bash project that focuses on automation and text extraction
Docker Seedbox Rclone Fetch Extract
⭐
8
Dockerised service pulling data from remote seedbox & extracting archives
Jextract
⭐
7
Allows extracting data from DOM
Sypht Elixir Client
⭐
6
An Elixir client for the Sypht API https://sypht.com
Parsers
⭐
6
Collection of parsers written in PHP, Python
Jxldatatableextractor
⭐
5
Extract data as tables from Excel. Search columns by their header or index number. Sets conditions for extracting the rows.
Tool Fastbatchimagecrop
⭐
5
A simple UI tool to batch crop images to prepare datasets from images and videos.
1-40 of 40 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.