Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for gpu hpc
gpu
x
hpc
x
99 search results found
Arrayfire
⭐
4,318
ArrayFire: a general purpose GPU library.
Fluidx3d
⭐
2,918
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.
Futhark
⭐
2,260
💥💻💥 A data-parallel functional programming language
Chapel
⭐
1,699
a Productive Parallel Programming Language
Compute
⭐
1,490
A C++ GPU Computing Library for OpenCL
Vkfft
⭐
1,394
Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library
Matx
⭐
1,059
An efficient C++17 GPU numerical computing library with Python-like syntax
Gunrock
⭐
908
Programmable CUDA/C++ GPU Graph Analytics
Arrayfire Rust
⭐
706
Rust wrapper for ArrayFire
Onemkl
⭐
528
oneAPI Math Kernel Library (oneMKL) Interfaces
Cccl
⭐
523
CUDA C++ Core Libraries
Devito
⭐
517
DSL and compiler framework for automated finite-differences and stencil computation
Gpubootcamp
⭐
479
This repository consists for gpu bootcamp material for HPC and AI
Arrayfire Python
⭐
406
Python bindings for ArrayFire: A general purpose GPU library.
Occa
⭐
363
Portable and vendor neutral framework for parallel programming on heterogeneous platforms.
Alpaka
⭐
319
Abstraction Library for Parallel Kernel Acceleration 🦙
Umpire
⭐
283
An application-focused API for memory management on NUMA & GPU architectures
Babelstream
⭐
283
STREAM, for lots of devices written in many programming models
Qmcpack
⭐
273
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance portable GPU support
Batch Shipyard
⭐
270
Simplify HPC and Batch workloads on Azure
Nekrs
⭐
205
our next generation code
Jaxfluids
⭐
203
Differentiable Fluid Dynamics Package
Hiop
⭐
202
HPC solver for nonlinear optimization problems
Geos
⭐
197
GEOS Simulation Framework
Monolish
⭐
185
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Libceed
⭐
169
CEED Library: Code for Efficient Extensible Discretizations
Qrack
⭐
161
Comprehensive, GPU accelerated framework for developing universal virtual quantum processors
Arborx
⭐
148
Performance-portable geometric search library
Coreneuron
⭐
122
Simulator optimized for large scale neural network simulations.
Tutorial Multi Gpu
⭐
116
Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial
Training Material
⭐
114
A collection of code examples as well as presentations for training purposes
Pymapd
⭐
113
Python client for OmniSci GPU-accelerated SQL engine and analytics platform
Cf4ocl
⭐
107
C Framework for OpenCL
Omega_h
⭐
105
Simplex mesh adaptivity for HPC
Arbor
⭐
98
The Arbor multi-compartment neural network simulation library.
Cuda_memtest
⭐
93
Fork of CUDA GPU memtest 👓
Entangle
⭐
89
A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.
Holoscan Sdk
⭐
82
The AI sensor processing SDK for low latency streaming workflows
Opencl Benchmark
⭐
81
A small OpenCL benchmark program to measure peak GPU/CPU performance.
Aluminum
⭐
79
High-performance, GPU-aware communication library
Gpu_programming_intro
⭐
74
Parenchyma
⭐
69
An extensible HPC framework for CUDA, OpenCL and native CPU.
Mf Lbm
⭐
63
MF-LBM: A Portable, Scalable and High-performance Lattice Boltzmann Code for DNS of Flow in Porous Media
Parallelreductionsbenchmark
⭐
58
Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!
Pennylane Lightning
⭐
55
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Mgcpp
⭐
55
A C++ Math Library Based on CUDA
Zpc
⭐
51
zenus parallel computing library for zenus physics-based simulations
Ramsesgpu
⭐
50
Astrophysics MHD simulation code optimized for large cluster of GPU
Hpc
⭐
47
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Rpp
⭐
45
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
Gpuinspector.jl
⭐
37
Inspecting GPUs with Julia
Spfft
⭐
36
Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support
Kokkos Remote Spaces
⭐
32
This repository contains Kokkos Remote Spaces, which implements distributed shared memory support for Kokkos.
Ptxprofiler
⭐
30
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
Singularity Tutorial
⭐
30
Singularity 101
Neon
⭐
28
Multi-GPU Framework for Voxel Grid Computations
Simulateqcd
⭐
27
SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it easy for physicists to implement lattice QCD formulas while still providing competitive performance.
Aws Parallelcluster Monitoring
⭐
27
Monitoring Dashboard for AWS ParallelCluster
Arrayfire Java
⭐
27
Java wrapper for ArrayFire
Care
⭐
26
CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Essentials
⭐
26
❤️ CUDA/C++ GPU graph analytics simplified.
Nways_accelerated_programming
⭐
25
N-Ways to GPU Programming Bootcamp
Python For Hpc
⭐
25
Repository for participants of the "Python for HPC" training
Mpibind
⭐
25
Pragmatic, Productive, and Portable Affinity for HPC
Fml
⭐
23
Fused Matrix Library
Pystella
⭐
21
A code generator for grid-based PDE solving on CPUs and GPUs
Inq
⭐
21
This is a mirror. Please check our main website on gitlab.
Dlt
⭐
20
Deep Learning Toolbox for Torch
Bigcomputelabs
⭐
20
Big Compute Learning Labs
Ms Hpc Ai Gpu
⭐
19
resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI
Loki
⭐
18
Freely programmable source-to-source translation for Fortran
Cuda Optimization Guide
⭐
18
Xiao's CUDA Optimization Guide [Active Adding New Contents]
Energy Sdk
⭐
17
GPU Accelerated Building Blocks integrating HPC+AI for Energy Customers
Logan
⭐
17
LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment.
Ppkmhd
⭐
17
MPI+Kokkos implementation of spectral difference method (SDM) high order schemes
Allgebra
⭐
16
Base container for developing C++ and Fortran HPC applications
Community_datasets
⭐
15
Example datasets and dashboards known to work well in OmniSci
Idefix
⭐
15
A new finite volume code designed to run on many architectures, such as GPU, CPU and manycores, using Kokkos.
Carme
⭐
14
Multi-User Software stack for interactive Machine Learning on HPC-Clusters
Nondeterministic Stack Rnn
⭐
13
Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)
Loops
⭐
13
🎃 GPU load-balancing library for regular and irregular computations.
Azhpc Extensions
⭐
12
Metadata for Azure HPC Extensions
Arrayfire Lua
⭐
10
Lua wrapper for ArrayFire
Fmlr
⭐
10
R bindings for the Fused Matrix Library (fml)
Gpuperfcounters
⭐
10
Enabling NVIDIA GPU performance counters in Windows PefMon and Windows HPC Cluster Manager.
Openacc_fortran_examples
⭐
9
Simple OpenACC Fortran Examples
Cpubitonicsort
⭐
9
openMP implementation of parallel bitonic sort
Cuda Pca Jacobi
⭐
9
CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable version of the Jacobi eigenvalue algorithm.
Containers_at_tacc
⭐
9
Containers@TACC Workshop
Fastice.jl
⭐
9
Parallel multi-xPU iterative FastIce flow solvers
Libnomp
⭐
8
libnomp is a loopy based runtime for C programming language to create domain specific compilers.
Rindow Math Matrix
⭐
8
The fundamental package for scientific matrix operation
Rindow Opencl
⭐
7
OpenCL library on PHP extension
Gpu Fpx
⭐
7
A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs
Blaze_cuda
⭐
7
WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze
Gpu Normal Computation
⭐
7
Performing normal computation for big point clouds on the gpu using openCL
Occa.py
⭐
6
OCCA Python API: JIT Compilation for Multiple Architectures
Parallelrandomfields.jl
⭐
6
Efficient parallel random field generator for large 3-D problems.
Micm
⭐
5
A model-independent chemistry module for atmosphere models
Related Searches
Python Gpu (2,801)
C Plus Plus Gpu (1,886)
Gpu Nvidia (1,176)
Gpu Cpu (1,134)
Tensorflow Gpu (990)
Jupyter Notebook Gpu (860)
Deep Learning Gpu (772)
C Gpu (750)
Docker Gpu (523)
Python Hpc (493)
1-99 of 99 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.