Sitecrawler

Sitecrawler is a Java library for crawling the content of web properties (for example, www.salesforce.com and blogs.salesforce.com). It supports dynamic scaling out of the box, adapting to the available machine resources (CPU, RAM) and network capacity. It also has a plugin structure that lets others write code (plugins) that acts on the crawled pages.
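To make the plugin idea concrete, here is a minimal sketch of what a plugin that acts on crawled pages could look like. Every name in it (`CrawledPage`, `PagePlugin`, `PluginRegistry`, `TitleCollector`, `CrawlerSizing`) is hypothetical and chosen for illustration; none of it is taken from Sitecrawler's actual API, and the worker-count heuristic is only an assumption about how scaling to CPU capacity might be approached.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical types for illustration only; NOT Sitecrawler's real API.

/** A fetched page: its URL and raw HTML body. */
final class CrawledPage {
    final String url;
    final String html;

    CrawledPage(String url, String html) {
        this.url = url;
        this.html = html;
    }
}

/** Callback a plugin implements to act on each crawled page. */
interface PagePlugin {
    void onPage(CrawledPage page);
}

/** Holds registered plugins and hands every crawled page to each of them. */
final class PluginRegistry {
    // Thread-safe list, since a crawler would typically dispatch from many threads.
    private final List<PagePlugin> plugins = new CopyOnWriteArrayList<>();

    void register(PagePlugin plugin) {
        plugins.add(plugin);
    }

    void dispatch(CrawledPage page) {
        for (PagePlugin plugin : plugins) {
            plugin.onPage(page);
        }
    }
}

/** Example plugin: records the <title> of every page that has one. */
final class TitleCollector implements PagePlugin {
    private static final Pattern TITLE =
        Pattern.compile("<title>(.*?)</title>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
    private final List<String> titles = new CopyOnWriteArrayList<>();

    @Override
    public void onPage(CrawledPage page) {
        Matcher m = TITLE.matcher(page.html);
        if (m.find()) {
            titles.add(m.group(1).trim());
        }
    }

    /** Returns a copy of the titles collected so far. */
    List<String> titles() {
        return new ArrayList<>(titles);
    }
}

/** Assumed heuristic: several I/O-bound fetch threads per CPU core, never fewer than one. */
final class CrawlerSizing {
    static int workerCount(int cpuCores, int threadsPerCore) {
        return Math.max(1, cpuCores * threadsPerCore);
    }
}

final class PluginSketchDemo {
    public static void main(String[] args) {
        PluginRegistry registry = new PluginRegistry();
        TitleCollector collector = new TitleCollector();
        registry.register(collector);

        // In a real crawler the page would come from an HTTP fetch; here it is hard-coded.
        registry.dispatch(new CrawledPage("https://www.salesforce.com/",
            "<html><head><title>Home</title></head></html>"));

        int workers = CrawlerSizing.workerCount(Runtime.getRuntime().availableProcessors(), 8);
        System.out.println(collector.titles() + " collected; would use " + workers + " workers");
    }
}
```

The registry keeps the crawler unaware of what plugins do with a page, so new behaviour (indexing, link checking, archiving) can be added without touching the fetch loop. Consult the project's own sources for the real plugin interface and scaling logic.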
Alternatives To Sitecrawler

| Project Name | Stars | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language | Description |
|---|---|---|---|---|---|---|---|---|
| Cyworld Bot | 54 | 4 years ago | | | 6 | MIT | Python | 🤖 Cyworld image crawler |
| Sitecrawler | 18 | 3 years ago | 1 | July 30, 2018 | 3 | BSD-3-Clause | Java | This project; see the description above. |
| Imoocgo | 5 | 6 years ago | | | | | HTML | imoocGo code |