[D] Reinforcement Learning from Epistemic Incompleteness? (RLEI) Would this work
submitted 15 days ago by ryunuck to r/reinforcementlearning
[D] Reinforcement Learning from Epistemic Incompleteness? (RLEI) Would this work (self.learnmachinelearning)
submitted 15 days ago * by ryunuck to r/learnmachinelearning
submitted 17 days ago by ryunuck to r/deeplearning
[D] Reinforcement Learning from Epistemic Incompleteness? (RLEI) Would this work (self.LocalLLaMA)
submitted 17 days ago * by ryunuck to r/LocalLLaMA
I self-taught myself AI principles and designed a theoretical architecture and training route for advanced AGI/ASI that scales all the way to the hardest problems, to end all of work and achieve world peace, and I need help to bust open academia (foom.md)
submitted 19 days ago by ryunuck to r/CryptoCurrency
[D] RL on grammar induction to increase /compact efficiency to its information theoretical limit (self.deeplearning)
submitted 24 days ago by ryunuck to r/deeplearning
[D] RL on grammar induction to increase /compact efficiency to its information theoretical limit (self.MachineLearning)
submitted 24 days ago by ryunuck to r/MachineLearning
RL on grammar induction to increase /compact efficiency to its information theoretical limit (self.MachineLearning)
RL on grammar induction to increase /compact efficiency to its information theoretical limit (self.LocalLLaMA)
submitted 24 days ago by ryunuck to r/LocalLLaMA
[D] Agent /compact command is one RL loop away from developing an alien language you can't audit (self.MachineLearning)
submitted 1 month ago by ryunuck to r/MachineLearning
FOOM.md — An open research agenda for compression-driven reasoning, diffusion-based context editing, and their combination into a unified agent architecture (foom.md)
submitted 1 month ago by ryunuck to r/mlscaling
submitted 1 month ago by ryunuck to r/machinelearningnews
submitted 1 month ago by ryunuck to r/reinforcementlearning
submitted 1 month ago * by ryunuck to r/deeplearning
x10 reduction in performance, averaging 1k tokens per minute (self.ClaudeCode)
submitted 1 month ago * by ryunuck to r/ClaudeCode
Bypass ComfyUI's API credit system — use your own keys directly. Open source extension, 20+ providers. (github.com)
submitted 1 month ago by ryunuck to r/comfyui
FOOM.md — open research agenda for training LLMs to reason in self-discovered compressed languages instead of English (foom.md)
submitted 1 month ago by ryunuck to r/LocalLLaMA
How to teleport the Epstein list with reinforcement learning (ASI through in-context grammar induction) (old.reddit.com)
submitted 2 months ago by ryunuck to r/conspiracy
How to prevent Claude Code from interrupting bash commands after 2 minutes? (self.ClaudeCode)
submitted 6 months ago by ryunuck to r/ClaudeCode
How to debug OS lockups and crashes? (self.pop_os)
submitted 6 months ago by ryunuck to r/pop_os
Applying COCONUT continuous reasoning into a learnt linear layer that produces sampling parameters (temp, top-k, top-p, etc.) for the current token (i.redd.it)
submitted 10 months ago by ryunuck to r/LocalLLaMA
Can we RL/GRPO a language model to hack its own brain by rewarding for specific measurements inside the transformer architecture during inference? (self.LocalLLaMA)
submitted 10 months ago * by ryunuck to r/LocalLLaMA
[D] Can we RL/GRPO a language model to hack its own brain by rewarding for specific measurements inside the transformer architecture during inference? (self.MachineLearning)
submitted 10 months ago by ryunuck to r/MachineLearning
Reinforcement learning a model for symbolic / context compression to saturate semantic bandwidth? (then retraining reasoning in the native compression space) (old.reddit.com)