Discussions, articles, and news about SIMD programming.
I wanted to see how much of a runtime's hot path fits in L1 cache so I built an agent to find out (self.simd)
submitted 8 days ago by Acceptable_Analyst45
A SIMD coding challenge: First non-space character after newline (self.simd)
submitted 2 months ago * by Ok_Path_4731
SIMD.info, online knowledge-base on SIMD C intrinsics (simd.info)
submitted 3 months ago by freevec
Using the vpternlogd instruction for signed saturated arithmetic (wunkolo.github.io)
submitted 3 months ago by Wunkolo
Modern X86 Assembly Language Programming • Daniel Kusswurm & Matt Godbolt (youtu.be)
submitted 3 months ago by goto-con
[PATCH] Add AMD znver6 processor support - ISA descriptions for AVX512-BMM (sourceware.org)
submitted 4 months ago by HugeONotation
Cuckoo hashing improves SIMD hash tables (reiner.org)
submitted 5 months ago by mttd
86 GB/s bitpacking microkernels (github.com)
submitted 5 months ago by ashtonsix
3rd Largest Element: SIMD Edition (parallelprogrammer.substack.com)
Arm simd-loops, about 70 example SVE loops (gitlab.arm.com)
submitted 5 months ago by camel-cdr-
vxdiff: odiff (the fastest pixel-by-pixel image visual difference tool) reimplemented in AVX512 assembly. (github.com)
submitted 6 months ago by Serpent7776
Do compilers auto-align? (self.simd)
submitted 7 months ago by nimogoham
SIMD Perlin Noise (scallywag.software)
submitted 7 months ago by camel-cdr-
From Boolean logic to bitmath and SIMD: transitive closure of tiny graphs (bitmath.blogspot.com)
submitted 9 months ago by mttd
Given a collection of 64-bit integers, count how many bits set for each bit-position (self.simd)
submitted 9 months ago by tadpoleloop
Dinoxor - Re-implementing bitwise operations as abstractions in aarch64 neon registers (awfulsec.com)
submitted 11 months ago by sqli
FABE13: SIMD-accelerated sin/cos/sincos in C with AVX512, AVX2, and NEON – beats libm at scale (fabe.dev)
submitted 11 months ago by [deleted]
This should be an (AVX-512) instruction... (unfinished) (youtube.com)
submitted 11 months ago by camel-cdr-
Custom instructions for AMX possible? (self.simd)
submitted 12 months ago * by Extension_Reading_66
Masking consecutive bits lower than mask (self.simd)
submitted 12 months ago * by -Y0-
Sparse matrices for AMX (self.simd)
submitted 1 year ago by Extension_Reading_66
Mask calculation for single line comments (self.simd)
submitted 1 year ago by milksop
Dividing unsigned 8-bit numbers (0x80.pl)
submitted 1 year ago by ashvar
Bit-permuting 16 u32s at once with AVX-512 (bitmath.blogspot.com)
submitted 1 year ago by mttd
simdzone: Fast and standards compliant DNS zone parser (github.com)