| NVIDIA/nccl-tests |
568 |
|
0 |
0 |
over 2 years ago |
0 |
|
63 |
bsd-3-clause |
Cuda |
| NCCL Tests |
| cjmcv/hpc |
47 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
apache-2.0 |
C++ |
| Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. ) |
| tud-zih-energy/lo2s |
42 |
|
0 |
0 |
over 2 years ago |
0 |
|
39 |
gpl-3.0 |
C++ |
| Linux OTF2 Sampling - A Lightweight Node-Level Performance Monitoring Tool |
| mperlet/matrix_multiplication |
41 |
|
0 |
0 |
almost 4 years ago |
0 |
|
1 |
mit |
C |
| Parallel Matrix Multiplication Using OpenMP, Phtreads, and MPI |
| cpraveen/dflo |
30 |
|
0 |
0 |
over 5 years ago |
0 |
|
18 |
|
C++ |
| Discontinuous Galerkin solver for compressible flows |
| ROCmSoftwarePlatform/rccl-tests |
22 |
|
0 |
0 |
over 2 years ago |
0 |
|
8 |
other |
Cuda |
| RCCL Performance Benchmark Tests |
| lanl/libquo |
21 |
|
0 |
0 |
over 2 years ago |
0 |
|
6 |
bsd-3-clause |
C |
| Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications |
| leopoldcambier/tasktorrent |
13 |
|
0 |
0 |
about 5 years ago |
0 |
|
0 |
mit |
C++ |
| A fast shared & distributed memory task-based runtime in C++ |
| mdhim/mdhim-tng |
13 |
|
0 |
0 |
over 8 years ago |
0 |
|
4 |
bsd-2-clause |
C |
| MDHIM - Multi-Dimensional Hashing Indexing Middleware |
| PGAS-community-benchmarks/CFD-Proxy |
10 |
|
0 |
0 |
about 10 years ago |
0 |
|
0 |
other |
C |
| CFD proxy application |