Awesome Open Source
Search results for high performance computing
276 search results found
Taskflow
⭐
9,682
A General-purpose Parallel and Heterogeneous Task Programming System
Metaflow
⭐
7,524
🚀 Build and manage real-life ML, AI, and data science projects with ease!
Tf Quant Finance
⭐
4,031
High-performance TensorFlow library for quantitative finance.
Fluidx3d
⭐
2,918
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.
Alpa
⭐
2,878
Training and serving large-scale neural networks with auto parallelization.
Course
⭐
2,801
High-Performance Parallel Programming and Optimization - Course Slides
Awesome Tensor Compilers
⭐
2,042
A list of awesome compiler projects and papers for tensor computation and deep learning.
Boinc
⭐
1,820
Open-source software for volunteer computing and grid computing.
Blis
⭐
1,741
BLAS-like Library Instantiation Software Framework
Kokkos
⭐
1,644
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
Nnpack
⭐
1,595
Acceleration package for neural networks on multi-core CPUs
Hermit Rs
⭐
1,576
Hermit for Rust.
Mfem
⭐
1,465
Lightweight, general, scalable C++ library for finite element methods
Drake
⭐
1,329
An R-focused pipeline toolkit for reproducibility and high-performance computing
Arraymancer
⭐
1,177
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Neanderthal
⭐
1,034
Fast Clojure Matrix Library
Kernel
⭐
1,031
A Rust-based, lightweight unikernel.
Adaptivecpp
⭐
1,010
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
Envpool
⭐
927
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
Targets
⭐
860
Function-oriented Make-like declarative workflows for R
Vectorious
⭐
781
Linear algebra in TypeScript.
Openmc
⭐
658
OpenMC Monte Carlo Code
Precice
⭐
641
A coupling library for partitioned multi-physics simulations, including, but not restricted to fluid-structure interaction and conjugate heat transfer simulations.
Prometeo
⭐
515
An experimental Python-to-C transpiler and domain specific language for embedded high-performance computing
Geant4
⭐
501
Geant4 toolkit for the simulation of the passage of particles through matter - NIM A 506 (2003) 250-303
Dace
⭐
444
DaCe - Data Centric Parallel Programming
Sundials
⭐
430
Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.
Relion
⭐
400
Image-processing software for cryo-electron microscopy
Pysph
⭐
395
A framework for Smoothed Particle Hydrodynamics in Python
Nrn
⭐
351
NEURON Simulator
Mpi4jax
⭐
348
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡
How_to_optimize_in_gpu
⭐
346
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Aphros
⭐
335
Finite volume solver for incompressible multiphase flows with surface tension. Foaming flows in complex geometries.
Vuh
⭐
329
Vulkan compute for people
Surrogates.jl
⭐
302
Surrogate modeling and optimization for scientific machine learning (SciML)
Pyhpc Benchmarks
⭐
290
A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python 🚀
Feelpp
⭐
287
💎 Feel++: Finite Element Embedded Language and Library in C++
Thread Pool
⭐
284
A modern, fast, lightweight thread pool library based on C++20
Clojurecl
⭐
275
ClojureCL is a Clojure library for parallel computations with OpenCL.
Gpumd
⭐
273
Graphics Processing Units Molecular Dynamics
Qmcpack
⭐
273
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance portable GPU support
Geni
⭐
268
A Clojure dataframe library that runs on Spark
Laser
⭐
262
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
Graphit
⭐
261
GraphIt - A High-Performance Domain Specific Language for Graph Analytics
Metal Flash Attention
⭐
252
Faster alternative to Metal Performance Shaders
Adios2
⭐
241
Next generation of ADIOS developed in the Exascale Computing Program
Opencoarrays
⭐
237
A parallel application binary interface for Fortran 2018 compilers.
Sphinxsys
⭐
236
SPHinXsys provides C++ APIs for physically accurate simulation and optimization. It aims to handle coupled industrial dynamic systems including fluid, solid, multi-body dynamics and beyond. The multi-physics library is based on a unique and unified computational framework by which strong couplings have been achieved for all involved physics.
Libflame
⭐
223
High-performance object-based library for DLA computations
Hpcsharp
⭐
204
High performance algorithms in C#: SIMD/SSE, multi-core and faster
Aspect
⭐
201
A parallel, extensible finite element code to simulate convection in both 2D and 3D models.
Libhermit
⭐
194
HermitCore: A C-based, lightweight unikernel
Intel Qs
⭐
188
High-performance simulator of quantum circuits
Cans
⭐
181
A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
Claymore
⭐
175
Cabana
⭐
172
Performance-portable library for particle-based simulations
The Books Making You Better
⭐
169
A list of time-lasting classic books, which not only help you figure out how it works, but also grasp when it works and why it works in that way.
Heat
⭐
169
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Libceed
⭐
169
CEED Library: Code for Efficient Extensible Discretizations
Mpm
⭐
167
CB-Geo High-Performance Material Point Method
Asl
⭐
166
Advanced Simulation Library - hardware accelerated multiphysics simulation platform.
Hptt
⭐
161
High-Performance Tensor Transpose library
Batchtools
⭐
158
Tools for computation on batch systems
Nonlinearsolve.jl
⭐
154
High-performance and differentiation-enabled nonlinear solvers (Newton methods), bracketed rootfinding (bisection, Falsi), with sparsity and Newton-Krylov support.
Mpl
⭐
154
A C++17 message passing library based on MPI
Accelerator
⭐
150
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Arborx
⭐
148
Performance-portable geometric search library
Methodoflines.jl
⭐
144
Automatic Finite Difference PDE solving with Julia SciML
Librapid
⭐
143
A highly optimised C++ library for mathematical applications and neural networks.
Clustermq
⭐
142
R package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
Torchquad
⭐
142
Numerical integration in arbitrary dimensions on the GPU using PyTorch / TF / JAX
Dash
⭐
133
DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Minstant
⭐
132
Performant time measuring in Rust
Jlsca
⭐
129
Side-channel toolkit in Julia
Vpic
⭐
124
Vector Particle-In-Cell (VPIC) Project
Awesome Distributed Ml
⭐
124
A curated list of awesome projects and papers for distributed training or inference
Tarchetypes
⭐
108
Archetypes for targets and pipelines
Mirai
⭐
106
mirai - minimalist async evaluation framework for R
Fast Poisson Image Editing
⭐
101
A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.
Mads.jl
⭐
100
MADS: Model Analysis & Decision Support
Preallocationtools.jl
⭐
99
Tools for building non-allocating pre-cached functions in Julia, allowing for GC-free usage of automatic differentiation in complex codes
Datablations
⭐
98
Scaling Data-Constrained Language Models
Crew
⭐
97
A distributed worker launcher
Parthenon
⭐
96
Parthenon AMR infrastructure
Psyclone
⭐
92
Domain-specific compiler and code transformation system for Finite Difference/Volume/Element Earth-system models in Fortran
Dftfe
⭐
91
DFT-FE: Real-space DFT calculations using Finite Elements
T8code
⭐
87
Parallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Frontistr
⭐
84
This is the official github mirror repository of FrontISTR, Open-Source Large-Scale Parallel FEM Program for Nonlinear Structural Analysis. Active developments of FrontISTR are hosted on https://gitlab.com/FrontISTR-Commons/FrontISTR.
Geopm
⭐
83
Global Extensible Open Power Manager
Opencl Benchmark
⭐
81
A small OpenCL benchmark program to measure peak GPU/CPU performance.
Bulk
⭐
81
A modern interface for implementing bulk-synchronous parallel programs.
Threadpinning.jl
⭐
79
Readily pin Julia threads to CPU processors
Coldfront
⭐
79
HPC Resource Allocation System
Mt Kahypar
⭐
78
Mt-KaHyPar (Multi-Threaded Karlsruhe Hypergraph Partitioner) is a shared-memory multilevel graph and hypergraph partitioner equipped with parallel implementations of techniques used in the best sequential partitioning algorithms. Mt-KaHyPar can partition extremely large hypergraphs very fast and with high quality.
Product Quantization Tree
⭐
78
GPU-based large scale Approx. Nearest Neighbor Search, accepted at CVPR 2016
Espresso
⭐
77
Notes and tutorials on Density Functional Theory calculation using Quantum Espresso.
Qub_dw_highperformancepython
⭐
76
Code and more for the QUB Development Weeks event 'High Performance Python'
Autooptimize.jl
⭐
75
Automatic optimization and parallelization for Scientific Machine Learning (SciML)
Pencilffts.jl
⭐
72
Fast Fourier transforms of MPI-distributed Julia arrays
Infinity
⭐
68
A lightweight C++ RDMA library for InfiniBand networks.
1-100 of 276 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.