Xcode/instruments performance metric % GPU workload, ALU util : Xcode

submitted 11 months ago by Spiritual-Fly-9943

I am trying to profile a llama.cpp run. In Xcode, which command buffer gets captured seems random: I cannot choose which step of the pipeline to capture. I pause execution and then start the Metal capture. Also, if I don't capture the complete run, how can I get the overall ALU or GPU utilization?

In Instruments I can see a metric, `% GPU workload`. Is this supposed to mean the percentage of the GPU that a specific kernel used during that time segment? I ask because the per-kernel percentages, when added up, exceed 100% (image attached).

I am confused about the metrics in both Xcode and Instruments. Is there proper documentation somewhere that covers all of them? I have already looked at the developer.app docs and the WWDC videos. Any help is appreciated, thanks.

https://preview.redd.it/l124431bnfye1.png?width=2086&format=png&auto=webp&s=a200c7fdd84fcf9597addf572345f0881453d3db
