Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for cpu simd
cpu
x
simd
x
29 search results found
Xnnpack
⭐
1,641
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Nnpack
⭐
1,595
Acceleration package for neural networks on multi-core CPUs
Highwayhash
⭐
1,452
Fast strong hash functions: SipHash/HighwayHash
Enoki
⭐
975
Enoki: structured vectorization and differentiation on modern processor architectures
Pffft
⭐
167
A fork of Julien Pommier's Pretty Fast FFT (PFFFT) library, with several additions
Compactcnncascade
⭐
148
A binary library for very fast face detection using compact CNNs.
Md5 Simd
⭐
123
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Blake3.net
⭐
120
Blake3.NET is a fast managed wrapper around the SIMD Rust implementations of the BLAKE3 cryptographic hash function.
Penguinv
⭐
117
Computer vision library with focus on heterogeneous systems
Dali_pytorch_demo
⭐
102
Example code showing how to use Nvidia DALI in pytorch, with fallback to torchvision. Contains a few differences to the official Nvidia example, namely a completely CPU pipeline & improved memory usage
Fast Dnn
⭐
78
A fast deep neural network library (CPU) for speech recognition
Cyvlfeat
⭐
74
A thin Cython wrapper around select areas of vlfeat
Dfpsr
⭐
58
Fast realtime softare rendering library for C++14 using SSE/AVX/NEON. 2D, 3D and isometric rendering with minimal system dependencies.
Cpuid.jl
⭐
49
Ask the CPU for cache sizes, SIMD feature support, a running hypervisor, and more.
Basic Simd Processor Verilog Tutorial
⭐
41
Implementation of a simple SIMD processor in Verilog, core of which is a 16-bit SIMD ALU. 2's compliment calculations are implemented in this ALU. The ALU operation will take two clocks. The first clock cycle will be used to load values into the registers. The second will be for performing the operations. 6-bit opcodes are used to select the functions. The instruction code, including the opcode, will be 18-bit.
Node Yencode
⭐
36
SIMD accelerated yEnc encoder/decoder and CRC32 calculator for node.js
Libalgebra
⭐
28
Fast C header-only library for popcnt, pospopcnt, and set algebraic operations
Cpuwhat
⭐
23
Nim utilities for advanced CPU operations: CPU identification, ISA extension detection, bindings to assorted intrinsics
Aviutl Waifu2x Cpu
⭐
23
waifu2x by CPU for AviUtl
Intel Sde Flops
⭐
22
Computing FLOPs with Intel Software Development Emulator (Intel SDE)
Embedsom
⭐
20
Fast embedding ot multidimensional datasets, great for cytometry data
Simd_neuralnet
⭐
11
Feed-forward neural network implementation in C with SIMD instructions
Waterspout
⭐
11
simd abstraction library especially creafted for audio/image manipulation
Cpu Raytracer
⭐
9
Whitted Style CPU Raytracer using SIMD
Vectorforth
⭐
9
SIMD vectorized Forth compiler with CPU based shader application
Cpu Dasher For Android
⭐
8
CPU Dasher for Android
Rakau
⭐
7
C++17 N-body Barnes-Hut on heterogeneous hardware architectures
Neon Sha3_2x
⭐
5
NEON ARMv8 SHA3_2x: 2 times SHA3 or SHAKE128/256 in 01 call. Use In Post-Quantum Cryptography Submission
Vectorizedkernel
⭐
5
Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures
Related Searches
C Cpu (1,679)
C Plus Plus Cpu (1,243)
Python Cpu (1,211)
Gpu Cpu (1,114)
C Plus Plus Simd (440)
Cpu Arm (435)
1-29 of 29 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.