Leechcrawler

Incremental crawling capabilities for Apache Tika. Crawl content from sources such as file systems, http(s) sources (web crawling), imap(s) servers, or your own arbitrary data sources. LeechCrawler offers additional Tika parsers that provide these crawling capabilities.
Alternatives To Leechcrawler
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language

Fscrawler | 1,279 | 13 months ago | 5 | January 10, 2022 | 145 | apache-2.0 | Java
Elasticsearch File System Crawler (FS Crawler)

Sparkler | 401 | a year ago | 55 | apache-2.0 | Java
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

Memex Explorer | 106 | 8 years ago | 67 | bsd-2-clause | Python
Viewers for statistics and dashboarding of Domain Search Engine data

Harvester | 59 | 7 years ago | 3 | gpl-3.0 | JavaScript
Web crawling and document processing through a usable interface.

Leechcrawler | 8 | 2 years ago | 2 | bsd-3-clause | Java
Incremental crawling capabilities for Apache Tika. Crawl content from sources such as file systems, http(s) sources (web crawling), imap(s) servers, or your own arbitrary data sources. LeechCrawler offers additional Tika parsers that provide these crawling capabilities.