Connect to SQL Server using Apache Spark
You need to download and install the JDBC Drivers for SQL Server:
Microsoft JDBC Drivers 6.0, 4.2, 4.1, and 4.0 for SQL Server
https://www.microsoft.com/en-us/download/details.aspx?displaylang=en&id=11774
Start a Spark shell and make the JDBC SQL Server driver available:
bin/spark-shell --driver-class-path /opt/sqljdbc_4.2/enu/sqljdbc42.jar
Then explore spark-jdbc-mssql.scala, which contains the Spark SQL commands, and modify it accordingly...
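For orientation, here is a minimal sketch of what reading a SQL Server table over JDBC from the Spark shell can look like. It is not the contents of spark-jdbc-mssql.scala. It assumes Spark 2.0 or later (where the shell predefines `spark` as a SparkSession), and the host, port, database, table, user, and the `MSSQL_PASSWORD` environment variable are all placeholders. The helper names `sqlServerUrl`, `connectionProps`, and `loadTable` are hypothetical.

```scala
// Paste into spark-shell started with the driver jar on the classpath.
// Assumes Spark 2.0+, where `spark` (a SparkSession) is predefined by the shell.
import java.util.Properties

// Hypothetical helper: builds a SQL Server JDBC URL from its parts.
def sqlServerUrl(host: String, port: Int, database: String): String =
  s"jdbc:sqlserver://$host:$port;databaseName=$database"

// Hypothetical helper: connection properties for the Microsoft JDBC driver.
def connectionProps(user: String, password: String): Properties = {
  val props = new Properties()
  props.setProperty("user", user)
  props.setProperty("password", password)
  // Driver class shipped in the sqljdbc jar.
  props.setProperty("driver", "com.microsoft.sqlserver.jdbc.SQLServerDriver")
  props
}

// Placeholder connection details; replace with your own server and database.
val url = sqlServerUrl("localhost", 1433, "testdb")
val props = connectionProps("spark_user", sys.env.getOrElse("MSSQL_PASSWORD", ""))

// Nothing connects until this is called; returns the table as a DataFrame.
def loadTable(table: String) = spark.read.jdbc(url, table, props)
```

With a reachable server, something like `val df = loadTable("dbo.customers")` followed by `df.printSchema()` and `df.show(5)` should display the table's schema and first rows (`dbo.customers` is a made-up table name). Reading the password from an environment variable keeps it out of the script and the shell history.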