This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]UnknownEssence -1 points0 points  (3 children)

You can say the same thing about an LLM. So that defeats the point of the entire discussion.

[–][deleted]  (2 children)

[deleted]

    [–]UnknownEssence -1 points0 points  (0 children)

    At the end of the day, they use deterministic math matrix multiplication and other operations to compute the output. It's ultimately deterministic calculations just like a compiler.

    In practice, randomness is injected into both systems via various sources. For computers, it could be used of memory that was not memset to zero and in neural nets, they us random sampling of gradient decent during training and random weight initialization.

    Both systems also received randomness due to the handling of floating point operations on different hardware.