Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Spider | 907 | | | | 6 years ago | | | 3 | gpl-3.0 | Java |
A configurable web spider with an easy-to-use web console | ||||||||||
Xioc | 140 | | | | 4 years ago | 10 | April 19, 2020 | 4 | mit | Go |
Extract indicators of compromise from text, including "escaped" ones. | ||||||||||
Tg_crawler | 61 | | | | 6 years ago | | | 3 | gpl-3.0 | Python |
Just a messy crawler based on tg-cli for Telegram. Now deprecated; please use telegram-export instead. | ||||||||||
Docwire | 31 | | | | 3 months ago | | | 2 | other | C++ |
DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Supports nearly 100 data formats, including email boxes and OCR. Boost efficiency in text extraction, web data extraction, data mining, and document analysis. Offline processing is possible for security and confidentiality. | ||||||||||
Simplechinese | 9 | | | | 3 years ago | | | | mit | Python |
This package integrates many basic Chinese NLP functions, making Python-based Chinese word processing and information extraction simple and convenient. |