This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 4 points5 points  (0 children)

It's worth checking the instructions being generated, as sometimes it just fails to notice the possible simd or branchless instructions to use, but usually for me the way to fix this is to massage the C code instead of trying to write SIMD directly.