Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Spring Boot Quick | 2,282 | 7 months ago | 13 | Java | ||||||
:herb: 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、k3s、k3d、k8s、mybatis加解密插件、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等:pushpin: | ||||||||||
Sparkler | 401 | a year ago | 55 | apache-2.0 | Java | |||||
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark. | ||||||||||
Cc Pyspark | 280 | a year ago | 4 | mit | Python | |||||
Process Common Crawl data with Python and Spark | ||||||||||
Docs | 102 | 5 years ago | 3 | |||||||
《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志 | ||||||||||
Cc Index Table | 78 | 7 months ago | 8 | apache-2.0 | Java | |||||
Index Common Crawl archives in tabular format | ||||||||||
Keywordanalysis | 33 | 6 years ago | ||||||||
Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends | ||||||||||
Engineeringteam | 32 | 5 years ago | 2 | |||||||
와이빅타 엔지니어링팀의 자료를 정리해두는 곳입니다. | ||||||||||
Search_ads_web_service | 27 | 7 years ago | Java | |||||||
Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated] | ||||||||||
Steam_recommendation_system | 25 | 7 years ago | Jupyter Notebook | |||||||
Recommendation System, Collaborative Filtering, Spark, Hive, Flask, Web Crawler, AWS EC2, AWS RDS | ||||||||||
Sparkwarc | 13 | 2 years ago | 4 | January 11, 2022 | apache-2.0 | WebAssembly | ||||
Load WARC files into Apache Spark with sparklyr |