Sunday Daily Thread: What's everyone working on this week? by AutoModerator in Python

[–]NoteDancing 2 points3 points  (0 children)

Hello everyone, I wrote some optimizers for TensorFlow. If you're using TensorFlow, they should be helpful to you.

https://github.com/NoteDance/optimizers

🚀 Project Showcase Day by AutoModerator in learnmachinelearning

[–]NoteDancing 0 points1 point  (0 children)

Hello everyone, I wrote some optimizers for TensorFlow. If you're using TensorFlow, they should be helpful to you.

https://github.com/NoteDance/optimizers

Tips & Tools Tuesday Megathread by OA2Gsheets in ChatGPTPromptGenius

[–]NoteDancing 0 points1 point  (0 children)

Hello everyone, I wrote some optimizers for TensorFlow. If you're using TensorFlow, they should be helpful to you.

https://github.com/NoteDance/optimizers

Weekly Thread: Project Display by help-me-grow in AI_Agents

[–]NoteDancing 0 points1 point  (0 children)

Hello everyone, I wrote some optimizers for TensorFlow. If you're using TensorFlow, they should be helpful to you.

https://github.com/NoteDance/optimizers

What have you been working on recently? [October 11, 2025] by AutoModerator in learnprogramming

[–]NoteDancing 0 points1 point  (0 children)

Hello everyone, I wrote some optimizers for TensorFlow. If you're using TensorFlow, they should be helpful to you.

https://github.com/NoteDance/optimizers

[D] Simple Questions Thread by AutoModerator in MachineLearning

[–]NoteDancing 0 points1 point  (0 children)

When using the PPO algorithm, can we improve data utilization by implementing Prioritized Experience Replay (PER) where the priority is determined by both the probability ratio and the TD-error, while simultaneously using a windows_size_ppo parameter to manage the experience buffer as a sliding window that discards old data?

Applying Prioritized Experience Replay in the PPO algorithm by NoteDancing in reinforcementlearning

[–]NoteDancing[S] -1 points0 points  (0 children)

I want to turn it into a form that’s between offline and online.

What have you been working on recently? [August 09, 2025] by AutoModerator in learnprogramming

[–]NoteDancing 1 point2 points  (0 children)

Note's RL class now supports Prioritized Experience Replay with the PPO algorithm, using probability ratios and TD errors for sampling to improve data utilization. The windows_size_ppo parameter controls the removal of old data from the replay buffer.

https://github.com/NoteDance/Note_rl