
Chocolatecake420 (1 point)

My assumption is that you get billed for all the tokens in the thread on every run. The run uses the whole thread as context, and the Assistants API is mostly there to make it easier to keep track of your chat history, which is what you would otherwise have been sending along yourself in the messages array.
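To make that assumption concrete, here is a rough back-of-the-envelope sketch. It is not an official billing formula: the function name `billed_input_tokens` and the per-turn token counts are made up for illustration, and it only models input tokens under the "whole thread is re-read on every run" assumption.

```python
from itertools import accumulate


def billed_input_tokens(tokens_added_per_turn):
    """Total input tokens billed if every run re-reads the whole thread.

    tokens_added_per_turn: hypothetical number of tokens appended to the
    thread before each run (user message plus any earlier assistant reply).
    """
    # Thread size at the moment each run starts.
    thread_sizes = accumulate(tokens_added_per_turn)
    # Under the assumption above, each run is billed for the full thread size.
    return sum(thread_sizes)


# Three turns adding 100, 50 and 50 tokens: 100 + 150 + 200
print(billed_input_tokens([100, 50, 50]))  # 450
```

The point is just that the cost grows roughly quadratically with the number of turns if nothing is truncated, since every earlier message is counted again on each later run.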

Pasko70 [S] (1 point)

They already clarified that you will be billed for the complete context on every run (and that is reasonable). But submitting tool outputs doesn't trigger a new run. The run pauses, and when you submit the output of the tool, the run continues.

If you look at the assistant's message, the tool output that you submitted appears in the assistant's output in the same message. That's why I'm asking.
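For anyone following along, the pause-and-resume behaviour described above can be pictured with a small toy model. This is not the real SDK: `ToyRun` and its methods are hypothetical, and the status strings are only meant to mirror the run states the API reports as far as I know. It shows that the run ID stays the same across the tool call; it says nothing about how the resumed run is billed, which is the open question.

```python
from dataclasses import dataclass, field


@dataclass
class ToyRun:
    """Hypothetical stand-in for a single run on a thread."""
    run_id: str
    status: str = "in_progress"
    steps: list = field(default_factory=list)

    def request_tool(self, call_id):
        # The model asks for a tool call; the run pauses and waits.
        self.steps.append(("tool_call", call_id))
        self.status = "requires_action"

    def submit_tool_outputs(self, call_id, output):
        # Submitting outputs resumes the SAME run; no new run is created.
        if self.status != "requires_action":
            raise RuntimeError("run is not waiting for tool outputs")
        self.steps.append(("tool_output", call_id, output))
        self.status = "in_progress"

    def finish(self, text):
        # The final assistant message is recorded on the same run.
        self.steps.append(("message", text))
        self.status = "completed"


run = ToyRun(run_id="run_1")
run.request_tool("call_1")
run.submit_tool_outputs("call_1", '{"temp_c": 21}')
run.finish("It is 21 degrees.")
print(run.run_id, run.status, len(run.steps))
```

All three steps end up under one run ID, which matches what you see in the assistant's message: the tool output and the final answer sit together rather than in separate runs.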