Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for gpu parallel
gpu
x
parallel
x
114 search results found
Js
⭐
2,522
turbo.js - perform massive parallel computations in your browser with GPGPU.
Chapel
⭐
1,699
a Productive Parallel Programming Language
Gpu Io
⭐
1,128
A GPU-accelerated computing library for running physics simulations and other GPGPU computations in a web browser.
Ilgpu
⭐
994
ILGPU JIT Compiler for high-performance .Net GPU programs
Learn Cuda Programming
⭐
815
Learn CUDA Programming, published by Packt
Libgrape Lite
⭐
345
🍇 A C++ library for parallel graph processing (GRAPE) 🍇
Veros
⭐
303
The versatile ocean simulator, in pure Python, powered by JAX.
Cudpp
⭐
299
CUDA Data Parallel Primitives Library
Parallelstencil.jl
⭐
270
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
Hemi
⭐
249
Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Oneflow
⭐
237
LargeScale Multiphysics Scientific Simulation Environment-OneFLOW CFD
Hybridizer Basic Samples
⭐
220
Examples of C# code compiled to GPU by hybridizer
Awesome Cuda
⭐
213
This is a list of useful libraries and resources for CUDA development.
Pytorch Multi Gpu Training
⭐
170
整理 pytorch 单机多 GPU 训练方法与原理
Heat
⭐
169
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Nimble
⭐
169
Lightweight and Parallel Deep Learning Framework
Arborx
⭐
148
Performance-portable geometric search library
Rocprim
⭐
142
ROCm Parallel Primitives
Elbencho
⭐
125
A distributed storage benchmark for file systems, object stores & block devices with support for GPUs
Torchmpi
⭐
106
Implements a message passing interface (MPI) wrapper that makes it easy to do massively parallel computations inside the Torch deep-learning framework.
Omega_h
⭐
105
Simplex mesh adaptivity for HPC
Fdtd3d
⭐
98
fdtd3d is an open source 1D, 2D, 3D FDTD electromagnetics solver with MPI, OpenMP and CUDA support for x64, ARM, ARM64, RISC-V, PowerPC architectures
Swiftmetalgpuparallelprocessing
⭐
98
Data Parallel Processing with Swift and Metal on GPU for iOS8 (and beyond)
Balanced Dataparallel
⭐
92
这里是改进了pytorch的DataParallel, 用来平衡第一个GPU的显存使用量
Entangle
⭐
89
A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.
Libgdl
⭐
88
一个移动端跨平台的gpu+cpu并行计算的cnn框架(A mobile-side cross-platform gpu+cpu parallel computing CNN framework)
Cuda Notes
⭐
83
高性能编程 笔记
Cuda Swift
⭐
68
Parallel Computing Library for Linux and macOS & NVIDIA CUDA Wrapper
Ue4_sortingcomputeshader
⭐
61
A compute shader plugin that is capable of sorting positional data in parallel directly on the GPU.
Windflow
⭐
58
A C++17 Data Stream Processing Parallel Library for Multicores and GPUs
Arrayfire Haskell
⭐
58
Haskell bindings to ArrayFire
Parallelreductionsbenchmark
⭐
58
Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!
Pennylane Lightning
⭐
55
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Mpr
⭐
53
Reference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Accfft
⭐
50
A Massively Parallel FFT Library for CPU/GPU
Rth
⭐
50
Norm Matloff's Rth package
Longestedgebisection2d
⭐
48
Longest Edge Bisection Demos
Parallelr
⭐
45
Accelerate R by Parallel Technologies
Gpufilter
⭐
41
GPU Recursive Filtering
Parallel_nms
⭐
40
Parallel CUDA implementation of NON maximum Suppression
Openacc Training Materials
⭐
39
Training materials provided by OpenACC.org.
Gpu Lossless Compression
⭐
39
GPU-Accelerated Lossless Data Compressors Survey
Pytsetlinmachinecuda
⭐
38
Massively Parallel and Asynchronous Architecture for Logic-based AI
Foldscuda.jl
⭐
36
Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
P4
⭐
30
P4: Portable Parallel Processing Pipeline
Cudamergesort
⭐
30
Highly parallel, GPU-accelerated hybrid mergesort with mmap'd IO
Skelcl
⭐
30
SkelCL is a library providing high-level abstractions for alleviated programming of modern parallel heterogeneous systems. SkelCL is a research project developed at the research group parallel and distributed systems at University of Münster which is located in Germany.
Learn Gpgpu
⭐
29
Algorithms implemented in CUDA + resources about GPGPU
Neon
⭐
28
Multi-GPU Framework for Voxel Grid Computations
Simulateqcd
⭐
27
SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it easy for physicists to implement lattice QCD formulas while still providing competitive performance.
Essentials
⭐
26
❤️ CUDA/C++ GPU graph analytics simplified.
Ihrc
⭐
25
Intel Heterogeneous Research Compiler (iHRC)
Mpibind
⭐
25
Pragmatic, Productive, and Portable Affinity for HPC
Lbvh
⭐
23
an implementation of parallel linear BVH (LBVH) on GPU
Hybrid_bc
⭐
22
Hybrid methods for Parallel Betweenness Centrality on the GPU
Learn Cuda
⭐
21
Learning some parallel programming with CUDA
Cudateaching
⭐
20
CUDA based GPU Programming
Gpuexample
⭐
19
GPUExample
Rocarrays.jl
⭐
18
Parallel on the ROCks
Ntrace
⭐
18
GPU ray tracing framework.
Gpuhd
⭐
16
Massively Parallel Huffman Decoding on GPUs
Unity Parallel Gpu
⭐
15
Ies
⭐
14
A package includes various time-domain numerical solvers for the Maxwell's equations.
Opencl Level Set Segmentation
⭐
13
Parallel/GPU level set volume segmentation using OpenCL
Parsecureml
⭐
13
A Parallel Secure Machine Learning Framework on GPUs
Bfs Cuda
⭐
12
Implementation of breadth first search on GPU with CUDA Driver API.
Csrcolor
⭐
12
Efficient and High-quality Graph Coloring on the GPU
Npb Gpu
⭐
12
NAS Parallel Benchmarks for GPU
Nbody6ppgpu Beijing
⭐
11
This is Nbody6++GPU, an N-body star cluster simulation code, maintained by Rainer Spurzem and team.
Torch Parallel Nccl Mps Example
⭐
11
Example of multi-process, multi-GPU training using Torch-parallel, nVidia-nccl, and nVidia-MPS
Parallel_development_community_gpgpu_study
⭐
11
Aleanotebooks
⭐
10
Notebooks for Alea GPU
G Idw
⭐
10
Parallel GPU Inverse Distance Weighting
Gpu Sextractor
⭐
10
Parallel Astronomical Source Extraction tool based on SExtractor
Dynamicppr
⭐
10
The implementation of the paper "Parallel Personalized PageRank on Dynamic Graphs"
Matrix
⭐
10
Matrix is a PHP extension. It can do parallel computing base on CUDA.
Genome Indexing On Gpu
⭐
9
The project involves parallelizing the construction of suffix arrays on the GPU for the purpose of genome indexing.
Gpu Gmres
⭐
9
Parallel GMRES (Generalized Minimal Residual) linear solver on GPU platforms
Cpubitonicsort
⭐
9
openMP implementation of parallel bitonic sort
Cluja
⭐
8
Lets try using Rootbeer to map a Clojure function in parallel on a CUDA GPU
Inpainting Gpu
⭐
8
Image Inpainting implementation. Parallel Computing Accelerated Image Inpainting using GPU CUDA, Theano, and Tensorflow.
Simple And Effective Paraphrastic Similarity
⭐
8
Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations".
Simt_tc
⭐
8
Triangle counting in large graphs using SIMT parallel set intersection on GPU
Rover
⭐
8
ROVER: an open source hybrid-parallel library for volume rendering and simulated radiography
Tsp Gpu
⭐
8
Solving the Travelers Salesman Problem using GPU ( Cuda ) using ANT and GA algorithms
Mc_stretch
⭐
8
Fast, affine-invariant, GPU parallel MCMC.
Abnet
⭐
8
Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"
Surfsara Ptc Python Parallel And Gpu Programming
⭐
8
This repository holds the training material used in the PRACE PTC training at SURFsara entitled: Python Parallel and GPU Programming
Cuda Edu
⭐
8
A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.
Skparallelreduce.js
⭐
7
Three.js GPU parallel reduction library
Parallelpopgen
⭐
7
This is a package of APIs for performing population genetics simulations and analyses in parallel on the GPU.
Image Denoising Using Cufft
⭐
7
A parallel implementation of image denoising using Cuda and cuFFT.
Hillisp
⭐
7
CUDA parallel lisp toy inspired by Connection Machines
Custen
⭐
7
CUDA Finite Difference Library
Partecl Codegen
⭐
7
A tool to generate OpenCL kernels from C programs for the purpose of testing them in parallel on the GPU.
Gpuparallelspatialadaptivekde
⭐
6
GPU-Parallel Spatially Adaptive Kernel Density Estimation for Point Pattern Analysis
Teralens
⭐
6
The fastest gravitational (quasar) microlensing code on the planet. A parallel Barnes-Hut tree code optimized for GPUs, written in OpenCL
Marching_cubes_on_gpu
⭐
6
A marching cube algorithm, that is executed in parallel on the GPU, using compute shaders. This will later enable a highly parallel creation of advanced landscape/terrain structures in potentially real-time (next project).
Parallel_party
⭐
6
A general purpose Python multithreading module offering easy and pythonic access to CPU or GPU parallelization (CUDA)
Warren Data Parallelism
⭐
6
Data-parallel programming with Metal
Related Searches
Python Gpu (2,930)
C Plus Plus Gpu (1,951)
Python Parallel (1,211)
Gpu Nvidia (1,137)
Gpu Cpu (1,105)
C Plus Plus Parallel (1,094)
Tensorflow Gpu (990)
1-100 of 114 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.