Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for c plus plus avx
avx
x
c-plus-plus
x
165 search results found
Simdjson
⭐
18,323
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Tensorflow Windows Wheel
⭐
3,522
Tensorflow prebuilt binary for Windows
Highway
⭐
3,041
Performance-portable, length-agnostic SIMD with runtime dispatch
Ctranslate2
⭐
2,437
Fast inference engine for Transformer models
Ispc
⭐
2,366
Intel® Implicit SPMD Program Compiler
Cpu_features
⭐
2,321
A cross platform C99 library to get cpu features at runtime.
Embree
⭐
2,201
Embree ray tracing kernels repository.
Xsimd
⭐
2,034
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
Simd
⭐
1,921
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM.
Xbyak
⭐
1,901
a JIT assembler for x86(IA-32)/x64(AMD64, x86-64) MMX/SSE/SSE2/SSE3/SSSE3/SSE4/FPU/AVX/AVX2/AVX-512 by C++ header
Kfr
⭐
1,540
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Croaring
⭐
1,382
Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, and StarRocks
Vc
⭐
1,349
SIMD Vector Classes for C++
Libsimdpp
⭐
1,064
Portable header-only C++ low level SIMD library
Tfhe
⭐
1,028
TFHE: Fast Fully Homomorphic Encryption Library over the Torus
Libdivide
⭐
989
Official git repository for libdivide: optimized integer division
Simdutf
⭐
868
Unicode routines (UTF8, UTF16, UTF32): billions of characters per second using SSE2, AVX2, NEON, AVX-512. Part of Node.js and Bun.
Eve
⭐
764
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Libyuv
⭐
758
Unofficial libyuv mirror. Please submit any issues or PRs upstream.
X86 Simd Sort
⭐
731
C++ template library for high performance SIMD based sorting algorithms
Xorstr
⭐
596
heavily vectorized c++17 compile time string encryption.
Flycv
⭐
513
FlyCV is a high-performance library for processing computer visual tasks.
Ipc
⭐
493
Incremental Potential Contact (IPC) is for robust and accurate time stepping of nonlinear elastodynamics. IPC guarantees intersection- and inversion-free trajectories regardless of materials, time-step sizes, velocities, or deformation severity.
Volk
⭐
491
The Vector Optimized Library of Kernels
Std Simd
⭐
467
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Mipp
⭐
450
MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX, AVX-512 and SVE (length specific).
Hlslpp
⭐
439
Math library using hlsl syntax with SSE/NEON support
Bitmagic
⭐
395
BitMagic Library
Sse Popcount
⭐
297
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Turbo Run Length Encoding
⭐
275
TurboRLE-Fastest Run Length Encoding
Fbow
⭐
274
FBOW (Fast Bag of Words) is an extremmely optimized version of the DBow2/DBow3 libraries.
Flops
⭐
269
How many FLOPS can you achieve?
Libpopcnt
⭐
234
🚀 Fast C/C++ bit population count library
Fastfilter_cpp
⭐
232
Fast Approximate Membership Filters (C++)
Lapjv
⭐
210
Linear Assignmment Problem solver using Jonker-Volgenant algorithm - Python 3 native module.
Fmath
⭐
206
fast log and exp functions for x86/x64 SSE
Nsimd
⭐
184
Agenium Scale vectorization library for CPUs and GPUs
Rasterizer
⭐
183
A state-of-the-art software occlusion culling system
Fastgltf
⭐
182
A modern C++17 glTF 2.0 library focused on speed, correctness, and usability
Compactcnncascade
⭐
148
A binary library for very fast face detection using compact CNNs.
Peculiarlog
⭐
132
Lightweight log filtering tool supporting regular expressions and instant (as-you-type) results.
Sse4 Strstr
⭐
130
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Puyoai
⭐
118
AI for puyo
Penguinv
⭐
117
Computer vision library with focus on heterogeneous systems
Xilinx Tiny Cnn
⭐
115
Avx Turbo
⭐
104
Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts
Magnum Singles
⭐
103
Single-header libraries from the Magnum engine
Raytracer
⭐
99
Physically Based Software Renderer
Yask
⭐
98
YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Fast Slic
⭐
96
20x Real-time superpixel SLIC Implementation with CPU
Base64simd
⭐
95
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Mapmap_cpu
⭐
93
A high-performance general-purpose MRF MAP solver, heavily exploiting SIMD instructions.
Everyculling
⭐
93
This library integrates multiple culling methods into one library.
Cpurasterizer
⭐
92
CPU Based Rasterizer Engine
Basicbitmap
⭐
92
Simple and high-performance and platform independent Bitmap class (34% faster than GDI/GDI+, 40% faster than DDraw)
Cpuid
⭐
89
C++ library for detecting CPU capabilities
Rv
⭐
89
RV: A Unified Region Vectorizer for LLVM
Roaring Node
⭐
89
Roaring for NodeJS
Advent2018 Fast
⭐
86
Advent of Code 2018 optimized solutions in C++
Nfllib
⭐
86
NTT-based Fast Lattice library
Facemask Plugin
⭐
84
(sl)obs filter plug-in for detecting faces and drawing masks on them
Image Processing Algorithm Speed
⭐
83
opencv
Mandelbrotsse
⭐
79
Real-time Mandelbrot zoom via SSE, AVX, OpenMP, CUDA, XaoS...
Cppspmd_fast
⭐
77
Optimized CppSPMD test project: macro control flow, SSE4.1/AVX1/AVX2/AVX2 FMA support
Awesome Simd
⭐
71
A curated list of awesome SIMD frameworks, libraries and software
Veryfasttree
⭐
71
Efficient phylogenetic tree inference for massive taxonomic datasets
Benchmarking Fft
⭐
70
choosing FFT library...
Kmer Db
⭐
68
Kmer-db is a fast and memory-efficient tool for large-scale k-mer analyses (indexing, querying, estimating evolutionary relationships, etc.).
Middle Out
⭐
67
Fast Middle-out Compression for Time-series Data
Robsize
⭐
67
ROB size testing utility
Jitasm
⭐
65
JIT Assembler Library for multiple ISAs
Dictionary
⭐
63
High-performance dictionary coding
Rgb2y
⭐
62
Fastest CPU (AVX/SSE) RGB to grayscale: 2-4x faster than OpenCV. For image processing/computer vision.
Fastops
⭐
61
This small library enables acceleration of bulk calls of certain math functions on AVX and AVX2 hardware. Currently supported operations are exp, log, sigmoid and tanh. The library is designed with extensibility in mind.
Argon2
⭐
58
Implementation of argon2 (i, d, id) algorithms with CPU dispatching
Umesimd
⭐
56
UME::SIMD A library for explicit simd vectorization.
Petar
⭐
56
PeTar is a high-performance N-body code for modelling the evolution of star clusters and tidal streams, including the effect of galactic potential, dynamics of binary and hierarchical system, single and binary stellar evolution.
Chiavdf
⭐
55
Chia VDF utilities
Peakperf
⭐
53
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Dlfb
⭐
52
a cpp Deep Learning Framework for Beginners to READ and LEARN
Radar Electrooptical Simulation
⭐
50
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
Simd Sort
⭐
49
AVX512F and AVX2 versions of quick sort
Hpc
⭐
47
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Objconv
⭐
45
Object file converter This utility can be used for converting object files between COFF/PE, OMF, ELF and Mach-O formats for all 32-bit and 64-bit x86 platforms. Can modify symbol names in object files. Can build, modify and convert function libraries across platforms. Can dump object files and executable files. Also includes a very good disassembler supporting the SSE4, AVX, AVX2, AVX512, FMA3, FMA4, XOP and Knights Corner instruction sets.
Is_utf8
⭐
43
Fast C++ function "is_utf8": checks if the input is valid UTF-8. Made of a single source file. Optimized for ARM NEON, x64 SSE, AVX2 and AVX-512.
Turbo Histogram
⭐
42
Fastest Histogram Construction
Fast Hex
⭐
42
Fast, SIMD hex string encoder and decoder C++ lib and Node.js module
Fastnoise Simd
⭐
42
Perlin style noise sped up with SIMD instructions
Fuzzle
⭐
41
A stateful fuzzing engine.
Instlatx64_demo
⭐
39
InstLatX64_Demo
Parallel Xxhash
⭐
37
Compute xxHash hash codes for 8 keys in parallel
Node Yencode
⭐
36
SIMD accelerated yEnc encoder/decoder and CRC32 calculator for node.js
Dd Avx_v3
⭐
35
Library of High Precision Sparse Matrix Operations Accelerated by SIMD
Fast Filters
⭐
30
Implementation of FIR and IIR filters optimized for SIMD processing
Mutable_rank_select
⭐
29
A SIMD-based C++ library providing rank/select queries over mutable bitmaps.
Fft
⭐
28
FFT (Fast Fourier Transform): SSE, AVX, AVX2
Digitviewer
⭐
28
y-cruncher's Digit Viewer
Ssim
⭐
27
Fast and portable SSIM implementation
Blocking Tutorial
⭐
26
Latch
⭐
26
Fastest CPU implementation of the LATCH 512-bit binary feature descriptor; fully scale- and rotation-invariant
Related Searches
C Plus Plus Cmake (8,712)
C Plus Plus Algorithms (6,194)
Java C Plus Plus (2,629)
C Plus Plus Clang (2,079)
C Plus Plus Gpu (1,948)
C Plus Plus Raspberry Pi (1,718)
C Plus Plus Matrix (1,548)
C Plus Plus Gcc (1,515)
C Plus Plus Game Development (1,488)
C Plus Plus Vector (1,461)
1-100 of 165 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.