Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
  • Alink component manual: http://alinklab.cn/manual/index.html
  • Alink tutorial: http://alinklab.cn/tutorial/index.html
  • Alink plugin downloader: https://www.yuque.com/pinshu/alink_guide/plugin_downloader

PyAlink

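Package names and versions:

  • PyAlink ships a separate Python package for each Flink version that Alink supports. The pyalink package targets the latest supported Flink version, currently 1.13. The pyalink-flink-*** packages target older Flink versions; currently pyalink-flink-1.12, pyalink-flink-1.11, pyalink-flink-1.10 and pyalink-flink-1.9 are available.
  • The version number of the Python package matches the Alink version, for example 1.6.1.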
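
The naming rule above can be written down as a small lookup. This is only an illustrative sketch: the helper name `pyalink_package_for` is hypothetical and not part of PyAlink, and the version lists simply restate the packages named in the text.

```python
def pyalink_package_for(flink_version: str) -> str:
    """Return the pip package name for a Flink minor version such as "1.12"."""
    latest = "1.13"                            # latest Flink version named in the text
    older = {"1.12", "1.11", "1.10", "1.9"}    # older versions with their own package
    if flink_version == latest:
        return "pyalink"
    if flink_version in older:
        return f"pyalink-flink-{flink_version}"
    raise ValueError(f"no PyAlink package listed for Flink {flink_version}")


print(pyalink_package_for("1.13"))
print(pyalink_package_for("1.11"))
```

The lookup raises an error for unlisted versions rather than guessing a package name.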

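Installation steps:

  1. Make sure Python 3 is available in your environment. Supported versions are 3.6, 3.7 and 3.8.
  2. Make sure Java 8 is installed.
  3. Install with pip: pip install pyalink, pip install pyalink-flink-1.12, pip install pyalink-flink-1.11, pip install pyalink-flink-1.10 or pip install pyalink-flink-1.9.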
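
Before running pip, the first two prerequisites can be checked from Python itself. The sketch below uses only the standard library; the function names are hypothetical. It only checks that a `java` executable is on the PATH, not that it is Java 8.

```python
import shutil
import sys

# Python versions listed as supported in the installation steps.
SUPPORTED_PYTHONS = {(3, 6), (3, 7), (3, 8)}


def python_supported(version_info=sys.version_info) -> bool:
    """True if the (major, minor) pair is one of the supported versions."""
    return tuple(version_info[:2]) in SUPPORTED_PYTHONS


def java_on_path() -> bool:
    """True if a `java` executable can be found on the PATH."""
    return shutil.which("java") is not None


if __name__ == "__main__":
    print("Python version supported:", python_supported())
    print("java found on PATH:", java_on_path())
```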

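Installation notes:

  1. pyalink and pyalink-flink-*** cannot be installed at the same time, and neither can coexist with an older version. If you previously installed pyalink or pyalink-flink-***, uninstall it first with pip uninstall pyalink or pip uninstall pyalink-flink-***.
  2. If installation through pip is slow or fails, change the pip source, or download the whl package directly and install it with pip. The MD5 checksums of the whl packages are:
    • Flink 1.13 (MD5: a10d57a19c53d206d324273f377a1b13)
    • Flink 1.12 (MD5: 82b2395740fbd960895d16350266ab4d)
    • Flink 1.11 (MD5: 5bf901c084b51ebfa13a62489fafc2f2)
    • Flink 1.10 (MD5: e18c620a3a3423407973b8c3d23a02e0)
    • Flink 1.9 (MD5: 2feaed5f159bb8970400eb3f6eafc7e5)
  3. If several Python versions are installed, you may need a specific pip, such as pip3. If you use Anaconda, run the installation from the Anaconda command line.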
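
A downloaded whl file can be compared against the listed MD5 values before it is installed. The following is a minimal sketch using only `hashlib`; the helper names are hypothetical, and the path of the downloaded file is whatever you saved it as.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_md5(path: str, expected: str) -> bool:
    """Compare a file's MD5 with an expected hex string, ignoring case and spaces."""
    return md5_of_file(path) == expected.strip().lower()
```

For example, `matches_md5(path_to_whl, "a10d57a19c53d206d324273f377a1b13")` should return `True` for an intact Flink 1.13 package, where `path_to_whl` stands for the location of your download.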


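Jupyter Notebook is a convenient way to start using PyAlink:

  1. Start Jupyter from the command line with jupyter notebook and create a new Python 3 notebook.
  2. Import the pyalink package: from pyalink.alink import *
  3. Create a local execution environment with useLocalEnv(parallism, flinkHome=None, config=None). Here parallism is the parallelism used for execution, flinkHome is the full path to Flink (usually it does not need to be set), and config holds the configuration parameters accepted by Flink. Output like the following indicates that the environment was initialized successfully:

JVM listening on ***

  4. Write PyAlink code, for example: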
source = CsvSourceBatchOp()\
    .setSchemaStr("sepal_length double, sepal_width double, petal_length double, petal_width double, category string")\
    .setFilePath("https://alink-release.oss-cn-beijing.aliyuncs.com/data-files/iris.csv")
res = source.select(["sepal_length", "sepal_width"])
df = res.collectToDataframe()
print(df)


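In PyAlink, the interfaces of the algorithm components are largely the same as in the Java API: create a component with its default constructor, set parameters with setXXX, and connect it to other components with link/linkTo/linkFrom. Auto-completion in Jupyter Notebook makes this easier to write.

For batch jobs, execution is triggered by methods such as print/collectToDataframe/collectToDataframes on a batch component, or by BatchOperator.execute(). For streaming jobs, call StreamOperator.execute() to start the job.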
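
The pattern described above (default constructor, setXXX, then link) might look like the sketch below. It assumes pyalink is installed, that the iris file from the earlier example is reachable, and that `SelectBatchOp` with `setClause` is available in your PyAlink version; the wrapper function `run_iris_demo` is hypothetical and exists only so the imports happen when it is called.

```python
def run_iris_demo():
    """Select two columns from the iris data set and return them as a DataFrame."""
    # Imported here so that defining the function does not require pyalink.
    from pyalink.alink import useLocalEnv, CsvSourceBatchOp, SelectBatchOp

    useLocalEnv(1)  # local execution environment with parallelism 1

    source = CsvSourceBatchOp() \
        .setSchemaStr("sepal_length double, sepal_width double, "
                      "petal_length double, petal_width double, category string") \
        .setFilePath("https://alink-release.oss-cn-beijing.aliyuncs.com/data-files/iris.csv")

    # link() feeds `source` into the next component and returns that component.
    selected = source.link(SelectBatchOp().setClause("sepal_length, category"))

    # collectToDataframe() triggers execution of the batch job.
    return selected.collectToDataframe()
```

Calling `run_iris_demo()` in a notebook cell starts the local environment and runs the job; nothing is executed until a triggering method such as `collectToDataframe()` is reached.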


Java


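import com.alibaba.alink.operator.batch.BatchOperator;
import com.alibaba.alink.operator.batch.source.CsvSourceBatchOp;
import com.alibaba.alink.pipeline.Pipeline;
import com.alibaba.alink.pipeline.clustering.KMeans;
import com.alibaba.alink.pipeline.dataproc.vector.VectorAssembler;

public class KMeansExample {
    public static void main(String[] args) throws Exception {
        String URL = "https://alink-release.oss-cn-beijing.aliyuncs.com/data-files/iris.csv";
        String SCHEMA_STR = "sepal_length double, sepal_width double, petal_length double, petal_width double, category string";

        // Read the iris data set as a batch source.
        BatchOperator<?> data = new CsvSourceBatchOp()
                .setFilePath(URL)
                .setSchemaStr(SCHEMA_STR);

        // Assemble the four numeric columns into a single vector column.
        VectorAssembler va = new VectorAssembler()
                .setSelectedCols(new String[]{"sepal_length", "sepal_width", "petal_length", "petal_width"})
                .setOutputCol("features");

        // Cluster the feature vectors into three groups, keeping the original label.
        KMeans kMeans = new KMeans().setVectorCol("features").setK(3)
                .setPredictionCol("prediction_result")
                .setPredictionDetailCol("prediction_detail")
                .setReservedCols("category")
                .setMaxIter(100);

        // Fit the pipeline, apply it to the same data and print the result.
        Pipeline pipeline = new Pipeline().add(va).add(kMeans);
        pipeline.fit(data).transform(data).print();
    }
}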

Maven dependencies for Flink 1.13

<dependency>
    <groupId>com.alibaba.alink</groupId>
    <artifactId>alink_core_flink-1.13_2.11</artifactId>
    <version>1.6.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-scala_2.11</artifactId>
    <version>1.13.0</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-planner_2.11</artifactId>
    <version>1.13.0</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-clients_2.11</artifactId>
    <version>1.13.0</version>
</dependency>

Maven dependencies for Flink 1.12

<dependency>
    <groupId>com.alibaba.alink</groupId>
    <artifactId>alink_core_flink-1.12_2.11</artifactId>
    <version>1.6.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-scala_2.11</artifactId>
    <version>1.12.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-planner_2.11</artifactId>
    <version>1.12.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-clients_2.11</artifactId>
    <version>1.12.1</version>
</dependency>

Maven dependencies for Flink 1.11

<dependency>
    <groupId>com.alibaba.alink</groupId>
    <artifactId>alink_core_flink-1.11_2.11</artifactId>
    <version>1.6.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-scala_2.11</artifactId>
    <version>1.11.0</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-planner_2.11</artifactId>
    <version>1.11.0</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-clients_2.11</artifactId>
    <version>1.11.0</version>
</dependency>

Maven dependencies for Flink 1.10

<dependency>
    <groupId>com.alibaba.alink</groupId>
    <artifactId>alink_core_flink-1.10_2.11</artifactId>
    <version>1.6.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-scala_2.11</artifactId>
    <version>1.10.0</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-planner_2.11</artifactId>
    <version>1.10.0</version>
</dependency>

Maven dependencies for Flink 1.9

<dependency>
    <groupId>com.alibaba.alink</groupId>
    <artifactId>alink_core_flink-1.9_2.11</artifactId>
    <version>1.6.1</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-scala_2.11</artifactId>
    <version>1.9.0</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-planner_2.11</artifactId>
    <version>1.9.0</version>
</dependency>
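
The five dependency sets above follow one pattern, which the sketch below restates as data. The helper names are hypothetical. The Flink release numbers and the fact that flink-clients appears only for Flink 1.11 and later are copied from the listings above, not taken from any other source.

```python
# Flink minor version -> Flink release used in the listings above.
FLINK_RELEASES = {
    "1.13": "1.13.0", "1.12": "1.12.1", "1.11": "1.11.0",
    "1.10": "1.10.0", "1.9": "1.9.0",
}
# Versions whose listing also includes flink-clients.
WITH_CLIENTS = {"1.13", "1.12", "1.11"}


def maven_coordinates(flink_minor, alink_version="1.6.1", scala="2.11"):
    """Return (groupId, artifactId, version) tuples for one Flink version."""
    release = FLINK_RELEASES[flink_minor]
    coords = [
        ("com.alibaba.alink", f"alink_core_flink-{flink_minor}_{scala}", alink_version),
        ("org.apache.flink", f"flink-streaming-scala_{scala}", release),
        ("org.apache.flink", f"flink-table-planner_{scala}", release),
    ]
    if flink_minor in WITH_CLIENTS:
        coords.append(("org.apache.flink", f"flink-clients_{scala}", release))
    return coords


def to_xml(coords):
    """Render the tuples as <dependency> elements for a pom.xml."""
    blocks = []
    for group_id, artifact_id, version in coords:
        blocks.append(
            "<dependency>\n"
            f"    <groupId>{group_id}</groupId>\n"
            f"    <artifactId>{artifact_id}</artifactId>\n"
            f"    <version>{version}</version>\n"
            "</dependency>"
        )
    return "\n".join(blocks)


print(to_xml(maven_coordinates("1.12")))
```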

Running Alink algorithms on a Flink cluster


  1. Prepare a Flink cluster:
  wget https://archive.apache.org/dist/flink/flink-1.13.0/flink-1.13.0-bin-scala_2.11.tgz
  tar -xf flink-1.13.0-bin-scala_2.11.tgz && cd flink-1.13.0
  ./bin/start-cluster.sh
  2. Build the Alink package:
  git clone https://github.com/alibaba/Alink.git
  # add <scope>provided</scope> in pom.xml of alink_examples.
  cd Alink && mvn -Dmaven.test.skip=true clean package shade:shade
  3. Run the Java examples:
  ./bin/flink run -p 1 -c com.alibaba.alink.ALSExample [path_to_Alink]/examples/target/alink_examples-1.5-SNAPSHOT.jar
  # ./bin/flink run -p 1 -c com.alibaba.alink.GBDTExample [path_to_Alink]/examples/target/alink_examples-1.5-SNAPSHOT.jar
  # ./bin/flink run -p 1 -c com.alibaba.alink.KMeansExample [path_to_Alink]/examples/target/alink_examples-1.5-SNAPSHOT.jar
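
If you want to submit the examples from a script, the `flink run` invocation above can be assembled as an argument list. This is a sketch only: the helper `flink_run_command` is hypothetical, and the jar path is the placeholder from the commands above, to be replaced with your own checkout location.

```python
import shlex


def flink_run_command(main_class, jar_path, parallelism=1, flink_bin="./bin/flink"):
    """Build the argument list for `flink run -p <n> -c <class> <jar>`."""
    return [flink_bin, "run", "-p", str(parallelism), "-c", main_class, jar_path]


cmd = flink_run_command(
    "com.alibaba.alink.KMeansExample",
    "[path_to_Alink]/examples/target/alink_examples-1.5-SNAPSHOT.jar",  # placeholder path
)
# Show the command as it could be typed in a shell.
print(" ".join(shlex.quote(part) for part in cmd))
```

With a real jar path, the list can be passed to `subprocess.run(cmd, check=True)` from inside the Flink directory.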

