Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Data Science Ipython Notebooks | 25,242 | 3 months ago | 34 | other | Python | |||||
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. | ||||||||||
Horovod | 13,564 | 20 | 11 | 5 days ago | 77 | June 12, 2023 | 358 | other | Python | |
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. | ||||||||||
It_book | 8,543 | 2 years ago | 7 | |||||||
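This project collects more than a thousand good, commonly used books that I have read or heard about over the years; the book you are looking for may well be here. It covers most books for the internet industry, plus interview experience and practice questions. It includes an artificial intelligence series (the common deep learning frameworks TensorFlow, PyTorch, and Keras; NLP, machine learning, deep learning, and so on), a big data series (Spark, Hadoop, Scala, Kafka, and so on), and a programmer essentials series (C, C++, Java, data structures, Linux, design patterns, databases, and so on). | ||||||||||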
Alluxio | 6,385 | 31 | 51 | 7 hours ago | 69 | July 28, 2023 | 930 | apache-2.0 | Java | |
Alluxio, data orchestration for analytics and machine learning in the cloud | ||||||||||
Bigdl | 4,383 | 10 | 2 days ago | 16 | April 19, 2021 | 826 | apache-2.0 | Jupyter Notebook | ||
Accelerating LLM with low-bit (INT3 / INT4 / NF4 / INT5 / INT8) optimizations using bigdl-llm | ||||||||||
Pipeline | 4,159 | a year ago | 85 | July 18, 2017 | 1 | apache-2.0 | Jsonnet | |||
PipelineAI Kubeflow Distribution | ||||||||||
Tensorflowonspark | 3,851 | 5 | 2 months ago | 32 | April 21, 2022 | 13 | apache-2.0 | Python | ||
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. | ||||||||||
Spark Nlp | 3,434 | 25 | 3 days ago | 128 | August 02, 2023 | 47 | apache-2.0 | Scala | ||
State of the Art Natural Language Processing | ||||||||||
Analytics Zoo | 2,580 | 4 | 19 days ago | 508 | August 21, 2023 | 533 | apache-2.0 | Jupyter Notebook | ||
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray | ||||||||||
Petastorm | 1,614 | 8 | 4 months ago | 86 | February 03, 2023 | 171 | apache-2.0 | Python | ||
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. |
DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
To create a new service based on the SDK:
To update an existing framework to use the latest version of the SDK:
Contributions are welcome! See CONTRIBUTING.
DC/OS SDK is licensed under the Apache License, Version 2.0.