Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for gpu computing
gpu-computing
x
245 search results found
Catboost
⭐
7,564
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Gyroflow
⭐
5,832
Video stabilization using gyroscope data
Thrust
⭐
4,878
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
Tf Quant Finance
⭐
4,031
High-performance TensorFlow library for quantitative finance.
Fluidx3d
⭐
2,918
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.
Lingvo
⭐
2,776
Lingvo
Rio
⭐
2,693
A hardware-accelerated GPU terminal emulator focusing to run in desktops and browsers.
Pai
⭐
2,602
Resource scheduling and cluster management for AI
Nyuziprocessor
⭐
1,863
GPGPU microprocessor architecture
Scimlbook
⭐
1,722
Parallel Computing and Scientific Machine Learning (SciML): Methods and Applications (MIT 18.337J/6.338J)
Pycuda
⭐
1,692
CUDA integration for Python, plus shiny features
Dfdx
⭐
1,502
Deep learning in Rust, with shape checked tensors and neural networks
Kompute
⭐
1,496
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
Bindsnet
⭐
1,446
Simulation of spiking neural networks (SNNs) using PyTorch.
Emu
⭐
1,361
The write-once-run-anywhere GPGPU library for Rust
Awesome Webgpu
⭐
1,197
😎 Curated list of awesome things around WebGPU ecosystem.
Arraymancer
⭐
1,177
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Luxcore
⭐
1,078
LuxCore source repository
Matx
⭐
1,059
An efficient C++17 GPU numerical computing library with Python-like syntax
Tornadovm
⭐
1,054
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
Neanderthal
⭐
1,034
Fast Clojure Matrix Library
Adaptivecpp
⭐
1,010
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
Stdgpu
⭐
990
stdgpu: Efficient STL-like Data Structures on the GPU
Accelerate
⭐
876
Embedded language for high-performance array computations
Kubernetes Gpu Guide
⭐
714
This guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
Cuda Api Wrappers
⭐
692
Thin, unified, C++-flavored wrappers for the CUDA APIs
Deep Learning In Cloud
⭐
670
List of Deep Learning Cloud Providers
Picongpu
⭐
664
Performance-Portable Particle-in-Cell Simulations for the Exascale Era ✨
Blendluxcore
⭐
659
Blender Integration for LuxCore
Cccl
⭐
523
CUDA C++ Core Libraries
Trisycl
⭐
424
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Fast
⭐
386
A framework for high-performance medical image processing, neural network inference and visualization
Mxnet.jl
⭐
365
MXNet Julia Package - flexible and efficient deep learning in Julia
Ginkgo
⭐
354
Numerical linear algebra software package
Vuh
⭐
329
Vulkan compute for people
Compute Shader 101
⭐
323
Sample code for compute shader 101 training
Autodock Gpu
⭐
317
AutoDock for GPUs and other accelerators
Kernelabstractions.jl
⭐
313
Heterogeneous programming in Julia
Clvk
⭐
303
Experimental implementation of OpenCL 3.0 on Vulkan
Ministark
⭐
296
🏃♂️💨 GPU accelerated STARK prover built on @arkworks-rs
Sol R
⭐
279
Open-Source CUDA/OpenCL Speed Of Light Ray-tracer
Clojurecl
⭐
275
ClojureCL is a Clojure library for parallel computations with OpenCL.
Gpufit
⭐
274
GPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Montecarlomeasurements.jl
⭐
247
Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Glvisualize.jl
⭐
239
Visualization library written in Julia and OpenGL
Kernel_tuner
⭐
236
Kernel Tuner
Cuda By Example Source Code For The Book S Examples
⭐
232
CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through working examples.
Opencl Wrapper
⭐
231
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Opt
⭐
221
Opt DSL
Fastflow
⭐
220
FastFlow pattern-based parallel programming framework (formerly on sourceforge)
Prysm
⭐
214
physical optics: integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing...
Jaxfluids
⭐
203
Differentiable Fluid Dynamics Package
Gpur
⭐
202
R interface to use GPU's
Cans
⭐
181
A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
Akari_render
⭐
176
High Performance CPU/GPU Physically Based Renderer
Taichi.js
⭐
175
Modern GPU Compute and Rendering in Javascript
Beatmup
⭐
175
Beatmup: image and signal processing library
Claymore
⭐
175
Clojurecuda
⭐
171
Clojure library for CUDA development
Netsaur
⭐
167
Powerful machine learning, accelerated by WebGPU
Pymfem
⭐
165
Python wrapper for MFEM
Pelemay
⭐
158
Pelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.
Hecbench
⭐
156
Accelerate Llvm
⭐
147
LLVM backend for Accelerate
Numpower
⭐
139
PHP library for efficient scientific computing and array manipulation with GPU support
Dlprimitives
⭐
138
Deep Learning Primitives and Mini-Framework for OpenCL
Gpuclothsimulationinunity
⭐
127
Trying to replicate what this legend did: https://youtu.be/kCGHXlLR3l8
Gpu Utils
⭐
122
A set of utilities for monitoring and customizing GPU performance
Goofit
⭐
120
Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP
Pysnn
⭐
118
Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Jiro Nn
⭐
113
A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.
Openclga
⭐
105
A Python Library for Genetic Algorithm on OpenCL
Gpu Benches
⭐
99
collection of benchmarks to measure basic GPU capabilities
Cuda_memtest
⭐
93
Fork of CUDA GPU memtest 👓
Entangle
⭐
89
A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.
Gpuowl
⭐
89
GPU Mersenne primality test.
Saturn
⭐
86
Saturn accelerates the training of large-scale deep learning models with a novel joint optimization approach.
Deepnet
⭐
84
Deep.Net machine learning framework for F#
Etaler
⭐
82
A flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
Opencl Benchmark
⭐
81
A small OpenCL benchmark program to measure peak GPU/CPU performance.
Cekirdekler
⭐
81
Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Dpctl
⭐
80
Python SYCL bindings and SYCL-based Python Array API library
Opencl Examples
⭐
80
Simple OpenCL examples for exploiting GPU computing
Rtorch.old
⭐
79
PyTorch bindings for R
Petibm
⭐
76
PetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
Raspberrypi_tempmon
⭐
74
System monitoring program for Raspberry pi single board computers written in Python 3.
Graphminer
⭐
68
Graph Pattern Mining
3d Core Raub
⭐
63
An extensible Node.js 3D core for desktop applications
Rtorch
⭐
63
PyTorch bindings for R
Urasterizer
⭐
62
URasterizer: A software rasterizer on top of Unity, accelerated by Job system & Compute Shader
Opensbli
⭐
61
A framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Tensordiffeq
⭐
60
Efficient and Scalable Physics-Informed Deep Learning and Scientific Machine Learning on top of Tensorflow for multi-worker distributed computing
Sushi2
⭐
59
Matrix Library for JavaScript
Parallelreductionsbenchmark
⭐
58
Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!
Windflow
⭐
58
A C++17 Data Stream Processing Parallel Library for Multicores and GPUs
Taichi Fluid
⭐
57
A collection of CFD related resources for Taichi developers.
Heteroflow
⭐
51
Concurrent CPU-GPU Programming using Task Models
Rbcuda
⭐
50
CUDA bindings for Ruby
Ramsesgpu
⭐
50
Astrophysics MHD simulation code optimized for large cluster of GPU
Wmma_extension
⭐
49
An extension library of WMMA API (Tensor Core API)
1-100 of 245 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.