Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for tika
tika
x
132 search results found
Leechcrawler
⭐
8
Incremental crawling capabilities for Apache Tika. Crawl content out of e.g. file systems, http(s) sources (webcrawling) imap(s) servers or your own arbitrary data sources. LeechCrawler offers additional Tika parsers providing these crawling capabilities.
Jsolr
⭐
8
IMPORTANT NOTE: This repo is no longer maintained and will soon be deleted. Our new repository is located at https://gitlab.com/knowledgearcdotorg/jsolr.
Hawarp
⭐
7
HAdoop-based Web Archive Record Processing
Tikatools
⭐
7
TikaTools is a small wrapper for Apache Tika written in C#
Cogworks.examinefileindexer
⭐
7
An examine indexer that uses Apache Tika.
Simple_text_extract
⭐
7
📄 SimpleTextExtract attempts to quickly extract text from various file types before resorting to an OCR solution.
Iscc Service
⭐
6
ISCC Web Api Service
Pdf To Algolia Playground
⭐
6
Transform PDFs and other documents into Algolia records
Img2text
⭐
6
Models, and associated helper code for GSOC 2017 project Tensorflow Image to Text in Apache Tika
Text Extractor
⭐
6
Tool for extract text from Office and PDFs files as a very, very tiny alternative to Apache Tika
Refinerycms Elasticsearch
⭐
6
Elasticsearch full text search capabilities for Refinery CMS
Tika Wrapper
⭐
6
Wraps Apache Tika library (http://tika.apache.org/) in order to allow a simple usage and add or improve some features
Semester
⭐
6
Collection of small apps and libraries
Techarticles
⭐
6
A set of tech articles.
Project Matt
⭐
6
Project Matt: Scan your AWS S3 Buckets for PII Data to Guard against GDPR
Masala
⭐
6
A WordPress plugin that puts the full content of uploaded text, PDF, DOC and other files into metadata upon upload.
Hoover Snoop2
⭐
6
Processing system for the search engine service in Liquid Investigations.
Bleve Indexer
⭐
6
A small Dockerized Go program for indexing documents with bleve and Tika.
Tika
⭐
6
Golang client for Apache Tika
Visualize Unstructured Data With Watson
⭐
6
Visualize unstructured data using Watson NLU
Ext Tika
⭐
6
A TYPO3 CMS extension that provides Apache Tika functionality
Springboot Fileserver
⭐
5
Dsci 550 Assignment 2
⭐
5
👨🦰 Large Scale Active Social Engineering Defense (ASED): Multimedia and Social Engineering
Dsci 550 Assignment 1
⭐
5
📧 Analysis of Cyber Phishing Emails: Fraudulent Emails and Social Engineering.
Local Development Setup
⭐
5
Netgen's Local Development Setup
Tika Page Extractor
⭐
5
Tika per page PDF extractor server returning content as JSON.
Tikaserver Ex
⭐
5
JAX-RS Server for Apache Tika
Mitie Resources
⭐
5
easy Mitie-nlp setup to use with MITIE NER enabled in TIKA
Configs
⭐
5
Configuration and deployment script repository
Js Technologies
⭐
5
Tools of the Trade
Lucene Example
⭐
5
Example project to show using Tika with Lucene
Lean
⭐
5
Lucene Tools for Text Analytics
101-132 of 132 search results
< Previous
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.