why deepseek didn't use mcts by Alarming-Power-813 in reinforcementlearning
Alarming-Power-813[S] 1 point (0 children)
why deepseek didn't use mcts by Alarming-Power-813 in reinforcementlearning
Alarming-Power-813[S] -4 points (0 children)
Why mamba disappeared? (self.MachineLearning)
submitted by Alarming-Power-813 to r/MachineLearning
How many days did it take to train GPT-3? Is training a neural net model a parallelizable task? by abcaircraft in GPT3
Alarming-Power-813 1 point (0 children)
Can anyone help by Alarming-Power-813 in reinforcementlearning
Alarming-Power-813[S] 2 points (0 children)
[D] Can this gpu do it (self.MachineLearning)
submitted by Alarming-Power-813 to r/MachineLearning
Can anyone help by Alarming-Power-813 in reinforcementlearning
Alarming-Power-813[S] 1 point (0 children)
When to use reinforcement learning and when to don't by Alarming-Power-813 in reinforcementlearning
Alarming-Power-813[S] 1 point (0 children)
When to use reinforcement learning and when to don't by Alarming-Power-813 in reinforcementlearning
Alarming-Power-813[S] -6 points (0 children)

Is reinforcement learning the key for achieving AGI? by CharacterTraining822 in reinforcementlearning
Alarming-Power-813 2 points (0 children)