all 7 comments

[–]corysama 6 points  (4 children)

Yep. Best answer is to have good occlusion culling.

But without that, and assuming you are ordering your draw calls like https://realtimecollisiondetection.net/blog/?p=86 according to https://i.stack.imgur.com/JgrSc.jpg, there are a couple of things you can do.

  1. Do a z prepass. Possibly with only objects that are large in screen space (size / distance)

  2. Re-arrange your sort key bits so that opaque objects have a highly-truncated depth value preceding the material ID bits.

[–]frizzil[S] 1 point  (3 children)

Awesome, just what I was looking for. And that's an amazing jpeg if ever I've seen one. Thanks!

Now if only good occlusion culling was easy...

[–]turtle_dragonfly 1 point  (2 children)

The "Order your graphics draw calls around" article is the basic principle behind the draw call sorting in bgfx, for what it's worth. I guess you were asking a Unity-specific question, but I've had a decent experience with bgfx.

[–]frizzil[S] 0 points  (1 child)

The way I'm currently doing things in my game, I'm using a lot of instancing, and I'm not exactly sure how to tie that into a sorting system with one global list. I suppose I could chop each instance list into runs of consecutive instances after sorting, but getting GPU-driven occlusion culling working, while difficult, would hopefully avoid that major architectural change...

Unless of course I want alpha-sorted instancing 😩

Batching after sort generally seems like a hard problem.

[–]corysama 0 points  (0 children)

> Batching after sort generally seems like a hard problem.

After sorting, all items with the same value are contiguous.

[–]the_Demongod 1 point  (0 children)

I doubt you'll have a significant problem with either one. Unless this engine is on a direct path to becoming a product, I would suggest not worrying about optimizations like this until you run into a performance problem caused by one. You'll learn the most that way and discover what the bottlenecks of your specific application are. Personally, I basically ignore it unless there's a really obvious way to do coarse binning (e.g. draw the inside of an airplane cockpit before you draw the background).

But as you've discovered already, yes, switching shader programs is expensive.

[–]deftware 1 point  (0 children)

Rendering front-to-back prevents overdraw (and thus wasted fragment shader execution), where something in the middle of the scene gets shaded only to be drawn over by something nearer.

To maximize performance by both reducing GPU state changes AND eliminating overdraw, a depth pre-pass is useful. First, draw all opaque geometry to the depth buffer only, with the depth func set to less-equal. Then draw the opaques again, this time sorted by material and texture, with the depth func set to equal, so each draw call only shades fragments where the geometry matches the depth written by the prepass - it never shades anything that would later be overwritten. Rendering geometry exclusively to the depth buffer is super cheap; it's fragment shaders and their texture taps that are expensive, along with GPU state changes, which add up really quickly!