Awesome Open Source
Search results for rocm
101 search results found
Tvm
⭐
11,107
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Cupy
⭐
7,482
NumPy & SciPy for GPU
Deepmd Kit
⭐
1,303
A deep learning package for many-body potential energy representation and molecular dynamics
Stdgpu
⭐
990
stdgpu: Efficient STL-like Data Structures on the GPU
Backend.ai
⭐
449
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.
Hcc
⭐
379
HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform
Syclacademy
⭐
376
SYCL Academy, a set of learning materials for SYCL heterogeneous programming
Rocm Docker
⭐
369
Dockerfiles for the various software layers defined in the Radeon Open Compute Platform
Antares
⭐
355
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.
Alpaka
⭐
319
Abstraction Library for Parallel Kernel Acceleration 🦙
Rocm Arch
⭐
319
A collection of Arch Linux PKGBUILDS for the ROCm platform
Rocblas
⭐
311
Next generation BLAS implementation for ROCm platform
Amdgpu.jl
⭐
257
AMD GPU (ROCm) programming in Julia
K8s Device Plugin
⭐
211
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Hiop
⭐
202
HPC solver for nonlinear optimization problems
Nsimd
⭐
184
Agenium Scale vectorization library for CPUs and GPUs
Aomp
⭐
181
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
Mivisionx
⭐
179
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
Cosma
⭐
174
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Roc Smi
⭐
149
ROC System Management Interface
Rocfft
⭐
143
Next generation FFT implementation for ROCm
Rocprim
⭐
142
ROCm Parallel Primitives
Gpufort
⭐
133
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
Amdovx Core
⭐
132
AMD OpenVX Core, a sub-module of amdovx-modules
Sirius
⭐
112
Domain specific library for electronic structure calculations
Rocm Build
⭐
108
build scripts for ROCm
Rocrand
⭐
99
RAND library for HIP programming language
Amdovx Modules
⭐
96
AMD OpenVX modules, such as neural network inference and 360 video stitching
Automatic1111 Webui Nix
⭐
87
AUTOMATIC1111/stable-diffusion-webui for CUDA and ROCm on NixOS
Demosaic_project
⭐
82
Removing Pixelated Mosaic Censorship using ESRGAN and green_mask_project
Radeon_compute_profiler
⭐
81
The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application's performance.
Aluminum
⭐
79
High-performance, GPU-aware communication library
Nixos Rocm
⭐
76
NixOS support for the ROCm graphics stack (rocm.github.io)
Opencl Amd Fedora
⭐
72
AMD OpenCL userspace drivers for Fedora.
Tensorflow
⭐
63
TensorFlow for the IPU
Atmi
⭐
62
Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provides a consistent, declarative API to create task graphs on CPUs and GPUs (integrated and discrete).
Amdgpunative.jl
⭐
60
Julia interface to AMD/Radeon GPUs
Puzzlelib
⭐
57
Deep Learning framework with NVIDIA & AMD support
Stable Diffusion Rocm Docker
⭐
57
Stable Diffusion Docker image preconfigured for usage with AMD Radeon cards
Pennylane Lightning
⭐
55
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Hipfort
⭐
55
Fortran interfaces for ROCm libraries
Miopengemm
⭐
49
Rocm_lab
⭐
49
Rtl8812au
⭐
48
Linux driver for Realtek 802.11ac based on Realtek's 5.1.5 version
Megray
⭐
46
A communication library for deep learning
Rx580 Rocm Tensorflow Ubuntu20.4 Guide
⭐
46
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
Rpp
⭐
45
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
Trafficvision
⭐
42
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit.
Hetero Mark
⭐
39
A Benchmark Suite for Heterogeneous System Computation
Gcngemm
⭐
39
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
Rocm
⭐
38
Ebuilds to install ROCm on Gentoo Linux
Spfft
⭐
36
Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support
Gtensor
⭐
33
GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.
Experimental_roc
⭐
32
Experimental and Intriguing Tools for ROCm
Quokka
⭐
30
Two-moment AMR radiation hydrodynamics (with self-gravity, particles, and chemistry) on CPUs/GPUs for astrophysics
Rochpcg
⭐
29
HPCG benchmark based on ROCm platform
Rocm Computeabi Doc
⭐
28
ROCm - AMDGPU Compute Application Binary Interface
Tdxminer
⭐
27
Cryptocurrency mining software for AMD GPUs
Roc_shmem
⭐
27
ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
Hsa Docs Amd
⭐
27
HSA-Docs-AMD has been superseded by new ROCm Developer Focused Website.
Hipblaslt
⭐
26
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
Mfakto
⭐
25
Mersenne number trial factoring using OpenCL, primarily for GIMPS: Great Internet Mersenne Prime Search
Ret
⭐
24
ROCm Machine Learning and HPC Stack installer
Rocm Opencl Driver
⭐
22
ROCm OpenCL Compiler Tool Driver
Amazon Ec2 Nice Dcv Samples
⭐
22
AWS CloudFormation templates to provision Linux or Windows EC2 instances with GUI running NICE DCV remote display server. Includes option to install GPU drivers
Spla
⭐
19
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.
Rocgdb
⭐
19
This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.
Rocarrays.jl
⭐
18
Parallel on the ROCks
Tiled Mm
⭐
17
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
Rocm_tensorflow_info
⭐
17
The information on the official ROCm/TensorFlow page is always confusing. This page aims to give accurate information based on knowledge gained from GPUEater infrastructure development.
Hsa Openmp Gcc Amd
⭐
17
This repository contains documentation for setting up an HSA platform, building OpenMP applications with GCC, and running them on an HSA device
Nnvm Rocm
⭐
17
NNVM for ROCm Examples
Rocdbgapi
⭐
14
The AMD Debugger API is a library that provides all the support necessary for a debugger and other tools to perform low level control of the execution and inspection of execution state of AMD's commercially available GPU architectures.
Rocnrdma
⭐
14
ROCm Driver RDMA Peer to Peer Support
Realcaffe2
⭐
13
The repo is obsolete. Use at your own risk.
Hcc Clang Upgrade
⭐
13
stage the upgrade of hcc-clang to clang ToT
Hsa Profiler Amd
⭐
11
This profiler has been superseded by ROCm-Profiler for the ROCm platform, AMD's HSA-enabled GPU computing platform
Rocr_debug_agent
⭐
11
The ROCdebug-agent is a library that can be loaded by ROCm Platform Runtime to provide some debugging functionality.
Rocminstaller
⭐
11
ROCm Install Utilities: rocminstall.py script to install a specific ROCm release version/revision.
Rocm Tvm
⭐
10
Repository for TVM on rocm experiments.
Docker Rocm Xtra
⭐
9
ROCm docker images with fixes/support for extra architectures, such as gfx803/gfx1010.
Hip Performance Optmization On Vega64
⭐
9
14 basic topics for VEGA64 performance optimization
Hcc Example Application
⭐
9
HCC Sample Applications
Myrocm
⭐
8
Navi support for ROCm
Amd Rocm Miner
⭐
8
Dev userland for USB flash drive with full AMD ROCm and blockchain support, plug and chug.
Roctools
⭐
7
Tools for using AMD ROCm with Numba
Rakau
⭐
7
C++17 N-body Barnes-Hut on heterogeneous hardware architectures
Hat
⭐
7
TOML-annotated C header file format for packaging binary files, from Microsoft Research
Cudabrot
⭐
7
A CUDA renderer for the Buddhabrot fractal
Ai Passwords
⭐
7
Password lists generated by deep learning algorithms.
Pytorch Ebuild
⭐
6
Ebuild infrastructure files for PyTorch and some related projects
Epic Boost Miner
⭐
6
Grin Miner for Sapphire RX 570 16GB GPU Cards
Gem5_docker
⭐
6
Run gem5 in Docker, avoiding issues with gem5 in newer OS and gcc versions
Lisk Vanity
⭐
6
A tool to generate short Lisk addresses with GPU support
Rocm Pytorch Gfx803 Docker
⭐
5
A Docker image based on rocm/pytorch with support for gfx803 (Polaris 20-21 (XT/PRO/XL); RX580; RX570; RX560) and Python 3.8
Cudipy
⭐
5
cupy-accelerated DIPY
Rust Hsa
⭐
5
HSA (Heterogeneous System Architecture) bindings for Rust
Cgo22ae Darm Code
⭐
5
Cub Hip
⭐
5
An implementation of CUB on the ROCM stack. This has been replaced by hipCUB.
Xpu
⭐
5
Compile and run C++ code with CUDA, HIP, SYCL or OpenMP.
1-100 of 101 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.