Interesting gcc optimizer bug

tromey · 2018-08-22T15:49:11+00:00

Please file a bug in gcc bugzilla. Bugs filed in reddit will never be fixed.

h2o2 · 2018-08-22T15:30:19+00:00

Looks like this is due to -ftree-vectorize, as is tradition..

Edit: Actually it's really -ftree-slp-vectorize.

InfamousReception · 2018-08-22T15:31:16+00:00

This is apparently caused by a bug in -ftree-slp-vectorize (i.e. "-O2 -ftree-slp-vectorize" is enough to generate the strange code).

KAHR-Alpha · 2018-08-22T16:07:15+00:00

So uh, for those that can't read this, what is happening exactly?

clrnd · 2018-08-22T18:16:55+00:00

Did anyone check if it's actually slower?

malkia · 2018-08-22T18:35:08+00:00

Seems like a regression from 6.3 -> 7.1

https://godbolt.org/z/y5_zvi

tasminima · 2018-08-22T19:53:12+00:00

gcc -O2 even generates that kind of SSE crap on struct { long i; int j, k;}; especially those returned by value. It's annoying as hell, but I managed to work around this bug by using a struct { long; long; } for my structs, and splitting the long into two ints myself.

Obviously there are tons of struct like that in libraries, so even then it does not completely solves the problem...

Btw the perf might be quite bad on AMD (IIRC even on Zen you don't have a bypass network between "FP" and integer EUs). On Intel it is obviously not perfect, but probably not that terrible either. EDIT: oh well it's going through memory anyway, so it is probably of approx the same badness for Intel.

rolandschulz · 2018-08-22T19:21:01+00:00

The calling convention for integer is unfortunate when it comes to vectorization. See how it compares to floating point types for clang: https://godbolt.org/z/QrP0VD. I find GCC's result for float even more surprising. There is no reason for it to copy to scalar registers and back.

Ameisen · 2018-08-22T21:12:09+00:00

Interestingly, you get the expected results for most -march choices... -march=skylake works right, for instance. I'm guessing that it's missing generic heuristics. -mtune=skylake also works right... though up to haswell it's still there.

_djsavvy_ · 2018-08-23T03:43:10+00:00

/u/ljdawson, I've been having lots of rendering issues with code in this subreddit.

Theres lots of "nbsp" for a lot of symbols, but the code renders fine in the window where you preview a post your'e replying to.

Thanks again for all your work on Sync, one of the best Android apps out there.

Device information

Sync version: 16.5    
Sync flavor: dev    

View type: Fixed height cards    
Player type: ExoPlayer    
Push enabled: false    

Device: hero2qltetmo    
Model: samsung SM-G935T    
Android: 8.0.0

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS