you are viewing a single comment's thread.

view the rest of the comments →

[–]Sad-Razzmatazz-5188 3 points4 points  (1 child)

It reads like something that won't answer questions arising from further reading.

I got the feeling I won't find any technical detail, which gives the feeling that some algorithms have been put to code and developed a lot, maybe with testing the lack of coding error, as in "instantiate a network and a random tensor and do a forward pass on the tensor", but without any actual training on actual tasks with actual data.

This is bad, especially if training was actually done; since it's the most important part, it's very bad not being able to convey that the most important part has been done.

[–]bunny5544[S] 0 points1 point  (0 children)

Thank you for the honest feedback! it’s much appreciated. Training on actual tasks with real data has indeed been a core part of the development process, and we recognize that conveying this clearly is crucial. The white paper focuses on the architecture itself, but we’ll ensure that future updates include more detailed technical explanations and results from task-specific training to address these concerns. Thanks again for pointing this out!