Awesome Open Source

Programming Languages

Dplasma

DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance for distributed systems where each node featuring multiple sockets of multicore processors, and if available, accelerators, using the PaRSEC runtime as a backend.

Categories > Software Performance > High Performance Computing

Suggest Alternative

Stars

6

License

other

Most Recent Commit

4 months ago

Programming Language

C

Categories

Programming Languages > C

Software Performance > High Performance Computing

Machine Learning > Gpu Acceleration

Software Performance > Gpu Computing

Control Flow > Dataflow Programming

Alternatives To Dplasma

Project Name	Stars	Downloads	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
How_to_optimize_in_gpu	346				9 months ago			4	apache-2.0	Cuda
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Vuh	329				6 months ago			19	mit	C++
Vulkan compute for people
Radar Electrooptical Simulation	50				3 months ago				mit	C++
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
Rbcuda	50				5 years ago			4	bsd-3-clause	C
CUDA bindings for Ruby
Radar_electrooptical_simulation	44				3 months ago				lgpl-3.0	Fortran
(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.
Parsec	39				17 days ago			113	other	C
PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation threads to the cores, GPU accelerators, overlaps communications and computations and uses a dynamic, fully-distributed scheduler based on architectural features such as NUMA nodes and algorithmic features such as data reuse.
Tvm Lesson	19				3 years ago			2		Python
动手学习TVM核心原理教程
Gpu Cuda Self Organising Maps	7				a year ago				mit	C++
🧠 💡 📈 A project based in High Performance Computing. This project was built using CUDA (Compute Unified Device Architecture), C++ (C Plus Plus), C, CMake and JetBrains CLion. The scenario of the project was a GPU-based implementation of the Self-Organising-Maps (S.O.M.) algorithm for Artificial Neural Networks (A.N.N.), with the support of CUDA (Compute Unified Device Architecture), using its offered parallel optimisations and tunings. The final goal of the project was to test the several GPU-based implementations of the algorithm against a given CPU-based implementation of the same algorithm and, evaluate and compare the overall performance (speedup, efficiency and cost).
Gpu Normal Computation	7				6 years ago				lgpl-3.0	C++
Performing normal computation for big point clouds on the gpu using openCL
Custen	7				4 years ago			1	apache-2.0	Cuda
CUDA Finite Difference Library

Alternatives To Dplasma

Select To Compare

How_to_optimize_in_gpu ⭐ 346

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

most recent commit 9 months ago

Vulkan compute for people

most recent commit 6 months ago

Radar Electrooptical Simulation ⭐ 50

(REOS) Radar and Electro-Optical Simulation Framework written in C++.

most recent commit 3 months ago

CUDA bindings for Ruby

most recent commit 5 years ago

Radar_electrooptical_simulation ⭐ 44

(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.

most recent commit 3 months ago

PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation threads to the cores, GPU accelerators, overlaps communications and computations and uses a dynamic, fully-distributed scheduler based on architectural features such as NUMA nodes and algorithmic features such as data reuse.

most recent commit 17 days ago

Tvm Lesson ⭐ 19

动手学习TVM核心原理教程

most recent commit 3 years ago

Gpu Cuda Self Organising Maps ⭐ 7

🧠 💡 📈 A project based in High Performance Computing. This project was built using CUDA (Compute Unified Device Architecture), C++ (C Plus Plus), C, CMake and JetBrains CLion. The scenario of the project was a GPU-based implementation of the Self-Organising-Maps (S.O.M.) algorithm for Artificial Neural Networks (A.N.N.), with the support of CUDA (Compute Unified Device Architecture), using its offered parallel optimisations and tunings. The final goal of the project was to test the several GPU

most recent commit a year ago

Gpu Normal Computation ⭐ 7

Performing normal computation for big point clouds on the gpu using openCL

most recent commit 6 years ago

CUDA Finite Difference Library

most recent commit 4 years ago

Suggest An Alternative To dplasma

Alternative Project Comparisons

Dplasma vs How_to_optimize_in_gpu

Dplasma vs Radar Electrooptical Simulation

Dplasma vs Rbcuda

Dplasma vs Radar_electrooptical_simulation

Dplasma vs Parsec

Dplasma vs Tvm Lesson

Dplasma vs Gpu Cuda Self Organising Maps

Dplasma vs Gpu Normal Computation

Dplasma vs Custen

Popular High Performance Computing Projects

Taskflow ⭐ 9,515

A General-purpose Parallel and Heterogeneous Task Programming System

total releases 6latest release May 24, 2022most recent commit 2 days ago

Metaflow ⭐ 7,524

:rocket: Build and manage real-life ML, AI, and data science projects with ease!

dependent packages 25total releases 103latest release December 04, 2023most recent commit 10 days ago

pypi metaflow} Downloads

Tf Quant Finance ⭐ 4,031

High-performance TensorFlow library for quantitative finance.

dependent packages 2total releases 30latest release August 19, 2022most recent commit 6 months ago

pypi tf-quant-finance} Downloads

Fluidx3d ⭐ 2,918

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.

most recent commit 3 months ago

Training and serving large-scale neural networks with auto parallelization.

total releases 13latest release July 04, 2022most recent commit 4 months ago

Popular Gpu Acceleration Projects

Tfjs ⭐ 17,925

A WebGL accelerated JavaScript library for training and deploying ML models.

dependent packages 698total releases 146latest release December 05, 2023most recent commit 3 months ago

npm @tensorflow/tfjs} Downloads

Tensorrt ⭐ 8,908

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

dependent packages 5total releases 4latest release September 25, 2023most recent commit 2 months ago

pypi polygraphy} Downloads

Gpytorch ⭐ 3,337

A highly efficient implementation of Gaussian Processes in PyTorch

dependent packages 79total releases 38latest release June 02, 2023most recent commit 3 months ago

pypi gpytorch} Downloads

A hardware-accelerated GPU terminal emulator focusing to run in desktops and browsers.

dependent packages 3total releases 17latest release December 10, 2023most recent commit 3 months ago

cargo sugarloaf} Downloads

Hedgehog Lab ⭐ 2,357

Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.

most recent commit 3 months ago

Popular Software Performance Categories

Related Searches

C High Performance Computing

C Gpu Acceleration

C Gpu Computing

Gpu Acceleration High Performance Computing

High Performance Computing Linear Algebra Library

C Dataflow Programming

C Linear Algebra Library

Get A Weekly Email With Trending Projects For These Categories

No Spam. Unsubscribe easily at any time.

C

High Performance Computing

Gpu Acceleration

Gpu Computing

Dataflow Programming

Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2024 Awesome Open Source. All rights reserved.