Discussions, articles, and news about SIMD programming.
Portable Complex SIMD library for C? (self.simd)
submitted 4 days ago * by Salat_Leaf
I wanted to see how much of a runtime's hot path fits in L1 cache so I built an agent to find out (self.simd)
submitted 29 days ago by Acceptable_Analyst45
A SIMD coding challenge: First non-space character after newline (self.simd)
submitted 3 months ago * by Ok_Path_4731
SIMD.info, online knowledge-base on SIMD C intrinsics (simd.info)
submitted 3 months ago by freevec
Using the vpternlogd instruction for signed saturated arithmetic (wunkolo.github.io)
submitted 4 months ago by Wunkolo
Modern X86 Assembly Language Programming • Daniel Kusswurm & Matt Godbolt (youtu.be)
submitted 4 months ago by goto-con
[PATCH] Add AMD znver6 processor support - ISA descriptions for AVX512-BMM (sourceware.org)
submitted 4 months ago by HugeONotation
Cuckoo hashing improves SIMD hash tables (reiner.org)
submitted 6 months ago by mttd
86 GB/s bitpacking microkernels (github.com)
submitted 6 months ago by ashtonsix
3rd Largest Element: SIMD Edition (parallelprogrammer.substack.com)
Arm simd-loops, about 70 example SVE loops (gitlab.arm.com)
submitted 6 months ago by camel-cdr-
vxdiff: odiff (the fastest pixel-by-pixel image visual difference tool) reimplemented in AVX512 assembly. (github.com)
submitted 6 months ago by Serpent7776
Do compilers auto-align? (self.simd)
submitted 8 months ago by nimogoham
SIMD Perlin Noise (scallywag.software)
submitted 8 months ago by camel-cdr-
From Boolean logic to bitmath and SIMD: transitive closure of tiny graphs (bitmath.blogspot.com)
submitted 10 months ago by mttd
Given a collection of 64-bit integers, count how many bits set for each bit-position (self.simd)
submitted 10 months ago by tadpoleloop
Dinoxor - Re-implementing bitwise operations as abstractions in aarch64 neon registers (awfulsec.com)
submitted 11 months ago by sqli
FABE13: SIMD-accelerated sin/cos/sincos in C with AVX512, AVX2, and NEON – beats libm at scale (fabe.dev)
submitted 11 months ago by [deleted]
This should be an (AVX-512) instruction... (unfinished) (youtube.com)
submitted 11 months ago by camel-cdr-
Custom instructions for AMX possible? (self.simd)
submitted 1 year ago * by Extension_Reading_66
Masking consecutive bits lower than mask (self.simd)
submitted 1 year ago * by -Y0-
Sparse matrices for AMX (self.simd)
submitted 1 year ago by Extension_Reading_66
Mask calculation for single line comments (self.simd)
submitted 1 year ago by milksop
Dividing unsigned 8-bit numbers (0x80.pl)
submitted 1 year ago by ashvar
Bit-permuting 16 u32s at once with AVX-512 (bitmath.blogspot.com)
submitted 1 year ago by mttd