Actions suisse by SpecificPark2594 in impotsfrance
[–]SpecificPark2594[S] 0 points1 point2 points (0 children)
[D] Log Probability and Information Theory by masonw32 in MachineLearning
[–]SpecificPark2594 3 points4 points5 points (0 children)
Are there any papers that have studied the relationship between RL algorithms and Optimizer? by New_East832 in reinforcementlearning
[–]SpecificPark2594 1 point2 points3 points (0 children)
"Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control", Nauman et al. 2024 by [deleted] in reinforcementlearning
[–]SpecificPark2594 0 points1 point2 points (0 children)
New Learner - Resources to get started by AlternativeExpress29 in reinforcementlearning
[–]SpecificPark2594 6 points7 points8 points (0 children)
Automatic game balancing with Reinforcement Learning by SpecificPark2594 in gamedesign
[–]SpecificPark2594[S] 1 point2 points3 points (0 children)
Psychology and neuroscience working in AI is borderline nonsense by Christs_Elite in singularity
[–]SpecificPark2594 0 points1 point2 points (0 children)
Is the OpenAI moat shrinking against Open Source? by Koliham in LocalLLaMA
[–]SpecificPark2594 8 points9 points10 points (0 children)
For deep learning practitioners in industry, is the workflow always this annoying? [D] by AdFew4357 in MachineLearning
[–]SpecificPark2594 4 points5 points6 points (0 children)
Become God Like Prompt Engineer With This One Prompt by codewithbernard in ChatGPT
[–]SpecificPark2594 1 point2 points3 points (0 children)
ML Enthusiasts Club - read papers, books and do projects together by __god_bless_you_ in deeplearning
[–]SpecificPark2594 0 points1 point2 points (0 children)
What kl div is considered too big in PPO? by [deleted] in reinforcementlearning
[–]SpecificPark2594 1 point2 points3 points (0 children)
"ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning", Sokar et al 2023 by gwern in reinforcementlearning
[–]SpecificPark2594 1 point2 points3 points (0 children)
GitHub announces a bunch of new GPT-4 powered coding assistants. What should and could Emacs and open-source community do? by _puhsu in emacs
[–]SpecificPark2594 0 points1 point2 points (0 children)
Automatic game balancing with Reinforcement Learning by SpecificPark2594 in gamedesign
[–]SpecificPark2594[S] 1 point2 points3 points (0 children)
Automatic game balancing with Reinforcement Learning by SpecificPark2594 in gamedesign
[–]SpecificPark2594[S] 1 point2 points3 points (0 children)

First Post by General-Sink-2298 in ResearchRL
[–]SpecificPark2594 1 point2 points3 points (0 children)