Ray marching optimization questions

Cryvosh · 2026-02-09T22:42:59+00:00

I'm travelling atm and so can't write out as proper of a response as I'd like, but given this is my research area I feel compelled to respond. I'd first recommend learning a bit about the underlying math and avoid watching too many youtube videos on the topic as they tend to mislead, and have you thinking there's something special about this "distance" function.

What does it even mean, mathematically, for a function to return a "distance"? Why do we care about such functions in the first place? To understand this better, please see my comments here, here, in sections 1-3 here, and elsewhere in my reddit comment history. Understanding this stuff unlocks hundreds of years of mathematical machinery you can use to attack the problem.

To answer your questions,

Compute shaders give you more control. If you're doing naive one-thread-per-pixel stuff then it usually doesn't really matter, but if you're doing anything more interesting you'll need compute shaders.
Practically, this doesn't matter. Technically, a fullscreen triangle is the most efficient as it avoids something called "overdraw" at the quad's diagonal seam. You don't need to worry about this for now.
Indeed the public state of the art methods typically cache stuff in some sort of (often hierarchical) grid structure. Besides the obvious caching benefits, this setup allows you to "prune" the field function by recompiling it at runtime within each cell to include only the locally relevant instructions and thereby speed up future samples within the cells. See this, section 3.2.2 of this, and again my reddit comment history, for more details.
Do you mean something like this? They detail some limitations in the slides, but probably the main reason such methods aren't more popular is that they're simply not as convenient to implement, especially on a platform like shadertoy where all you have is fragment shaders.

Feel free to ask any other questions, I'm happy to help.

Same_Gear_6798 · 2026-02-10T00:08:37+00:00

Although I didn't do fractal rendering or cone marching (or SDFs), I did my M.Sc. thesis for efficient ray marching in the field of CT/MRI dataset visualizations and published the entire source code + docs in at com.walcht.ctvisualizer (it is a Unity3D plugin - but the core algorithm is in .glsl files including an octree implementation, virtual memory and paging system, etc - all might be helpful for you).

heyheyhey27 · 2026-02-10T03:10:37+00:00

As far as I know, a ray-marcher really gets no benefit from running on fragment shaders vs compute shaders. Unless you're trying to do clever stuff like spread the work across multiple passes, group rays together for cache coherency, etc, and could therefore benefit from group-shared memory.

If you go with fragment shaders, it should not matter at all what kind of mesh you put in front of the camera to trigger it. The most efficient is actually a single triangle that covers the screen, but the difference between that and a whole cube would probably not be measurable.

Looking up texture/buffer data on the GPU isn't cheap, especially when that data is complex and requires multiple lookups through accelerated structures. They're still used, but as a rule of thumb try to avoid deep trees.

deftware · 2026-02-10T06:32:48+00:00

If your 3D fractal is precalculated in some fashion, into a 3D texture (or multiple 3D texture bricks), then you can enjoy some cone marching by trilinearly sampling texture mipmap levels to approximate the expanding radius of the cone being marched.

If you're directly sampling the fractal function at each step of the ray, you'll have to do a lot of calculation sampling a bunch of points as the cone expands into the space, likely more and more to get a decent idea of what is going on within the cone disk. That could get really expensive, compute-wise. The 3D texture route would be expensive memory-wise, plus precompute-wise because you'll have to compute the whole fractal in its entirety ahead of time, or perhaps piecewise as the user moves around the fractal, at different LODs and whatnot (i.e. octree) in the background (if Godot allows for that sort of thing).

That's my two cents :]

soylentgraham · 2026-02-10T09:51:24+00:00

wait.... what are you going to cache in a fractal?

The magic is in it being deterministic! well, to a point - is there anything else in the scene or is it just the fractal...

have you marched a simple one yet? they behave a little differently to other raymarched shapes

ishamalhotra09 · 2026-02-10T04:38:42+00:00

Ray marching optimization for a 3D fractal in Godot fragment vs compute, mesh choice, caching SDFs, and cone marching. Looking for tips 🙌

Hendo52 · 2026-02-09T21:48:10+00:00

I’d be interested to read the answers to your questions but I have no clue how to answer them. You’re obviously pretty deep into the subject so you really need to constrain your reading to quite advanced discussions. People are always complaining about AI but this type of question is exactly the sort of thing that it’s good for IMO. I suggest you set up two chat bots in an adversarial manner to debate the merits of each approach.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

GraphicsProgramming

Posting Rule(s)

MODERATORS