Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for apache tika
apache
x
tika
x
40 search results found
Tika
⭐
2,007
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Tika Python
⭐
1,316
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Datashare
⭐
519
A self-hosted search engine for documents.
Sparkler
⭐
401
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Go Tika
⭐
171
Go package for using Apache Tika
Docker Tikaserver
⭐
160
Apache Tika Server as a Docker Image
Pdf2html
⭐
117
pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image for PDF file using Apache PDFBox.
Memex Explorer
⭐
106
Viewers for statistics and dashboarding of Domain Search Engine data
Php Apache Tika
⭐
104
Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats
Imagecat
⭐
84
ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
Tika Docker
⭐
81
Convenience Docker images for Apache Tika Server
Rtika
⭐
52
R Interface to Apache Tika
Phptikawrapper
⭐
52
Simple PHP Wrapper for Apache Tika
Rika
⭐
43
A JRuby wrapper for Apache Tika to extract text and metadata from files of various formats.
Sentimentanalysisparser
⭐
29
Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.
Xltsearch
⭐
28
High-performance, portable and configurable desktop search application / information retrieval system
Nifi Extracttext Processor
⭐
28
Apache NiFi Custom Processor Extracting Text From Files with Apache Tika
Ipfs Tika
⭐
27
Java web application taking IPFS hashes, extracting (textual) content and metadata through Apache's Tika.
Clj Tika
⭐
25
Clojure bindings to Apache Tika project
Solr_exploit
⭐
23
Apache Solr远程代码执行漏洞(CVE-2019-0193) Exploit
Document_search_engine_architecture
⭐
22
📄🚀 Unleash a powerful Document Search Engine with Apache NiFi for lightning-fast, comprehensive text indexing and search.
Tika Dockers
⭐
18
A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for images and video
Etllib
⭐
16
This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.
Rtika
⭐
16
A JRuby wrapper for Apache Tika
Tika Server
⭐
14
Apache Tika Server as a Background Service in Node.js
Tika App Python
⭐
13
Python bindings for Apache Tika
Apache Tika Lambda Layer
⭐
12
AWS Lambda layer containing latest version of Apache Tika
Shangridocs
⭐
11
Document exploration tool
Loophole
⭐
11
记录搭建漏洞环境及漏洞复现
Tika Ner Corenlp
⭐
11
Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser
Dropwizard Tika Server
⭐
10
A DropWizard wrapper around Apache Tika.
Tika Hadoop Mapreduce
⭐
10
Apache Tika integration with Java MapReduce for Hadoop
Jsolr
⭐
8
IMPORTANT NOTE: This repo is no longer maintained and will soon be deleted. Our new repository is located at https://gitlab.com/knowledgearcdotorg/jsolr.
Tika
⭐
8
Docker container to provide Apache Tika RESTful API
Cve 2018 11761
⭐
8
Apache Tika Denial of Service Vulnerability (CVE-2018-11761)
Tikatools
⭐
7
TikaTools is a small wrapper for Apache Tika written in C#
Techarticles
⭐
6
A set of tech articles.
Img2text
⭐
6
Models, and associated helper code for GSOC 2017 project Tensorflow Image to Text in Apache Tika
Ext Tika
⭐
6
A TYPO3 CMS extension that provides Apache Tika functionality
Text Extractor
⭐
6
Tool for extract text from Office and PDFs files as a very, very tiny alternative to Apache Tika
Related Searches
Java Apache (4,365)
Php Apache (2,291)
Javascript Apache (1,546)
Python Apache (1,535)
Shell Apache (1,473)
Docker Apache (1,361)
Ruby Apache (1,290)
Apache Spark (1,207)
Mysql Apache (961)
Apache Kafka (836)
1-40 of 40 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.