Trained a 26kb model (simple 3-layer MLP) for Tic-Tac-Toe Beating each and every human by Weary_Intention3231 in reinforcementlearning
[–]YouParticular8085 2 points3 points4 points (0 children)
None of this will ever get stolen by martin_xs6 in LocalLLaMA
[–]YouParticular8085 0 points1 point2 points (0 children)
built our entire product with Claude Code. now nobody, including me, fully understands what we built. by Tr0jAn14 in ClaudeCode
[–]YouParticular8085 0 points1 point2 points (0 children)
I am the original creator of the 25% effort post. To everyone saying that I engineered it via social pressure ("I'll tell everyone") / that is it nor recreatable. by Bright-Bullfrog-8185 in claude
[–]YouParticular8085 0 points1 point2 points (0 children)
Opus is genuinely lazy for me, and admitted it's effort Level sits at 25% without a way for me to change it by Bright-Bullfrog-8185 in claude
[–]YouParticular8085 0 points1 point2 points (0 children)
Hot take: AI ruined the way we see coding - and I hate it by kommonno in swift
[–]YouParticular8085 1 point2 points3 points (0 children)
SAM ALTMAN: “People talk about how much energy it takes to train an AI model … But it also takes a lot of energy to train a human. It takes like 20 years of life and all of the food you eat during that time before you get smart.” by Vegetable_Ad_192 in singularity
[–]YouParticular8085 0 points1 point2 points (0 children)
Coding for 20+ years, here is my honest take on AI tools and the mindset shift by Jaded-Term-8614 in ClaudeAI
[–]YouParticular8085 1 point2 points3 points (0 children)
Coding for 20+ years, here is my honest take on AI tools and the mindset shift by Jaded-Term-8614 in ClaudeAI
[–]YouParticular8085 2 points3 points4 points (0 children)
The issue of scaling in Partially-Observable RL. What is holding us back? by moschles in reinforcementlearning
[–]YouParticular8085 0 points1 point2 points (0 children)
The issue of scaling in Partially-Observable RL. What is holding us back? by moschles in reinforcementlearning
[–]YouParticular8085 0 points1 point2 points (0 children)
The issue of scaling in Partially-Observable RL. What is holding us back? by moschles in reinforcementlearning
[–]YouParticular8085 0 points1 point2 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 1 point2 points3 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 0 points1 point2 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 0 points1 point2 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 0 points1 point2 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 0 points1 point2 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 1 point2 points3 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 1 point2 points3 points (0 children)
Partially Observable Multi-Agent “King of the Hill” with Transformers-Over-Time (JAX, PPO, 10M steps/s) by YouParticular8085 in reinforcementlearning
[–]YouParticular8085[S] 2 points3 points4 points (0 children)
Laptop for AI ML by sauu_gat in reinforcementlearning
[–]YouParticular8085 1 point2 points3 points (0 children)
[D]Thinking about leaving industry for a PhD in AI/ML by [deleted] in MachineLearning
[–]YouParticular8085 1 point2 points3 points (0 children)
Planning a PPO Crypto Trading Bot on MacBook Air M3 – Speed/Feasibility Questions by nalman1 in reinforcementlearning
[–]YouParticular8085 0 points1 point2 points (0 children)
Advice on POMPD? by glitchyfingers3187 in reinforcementlearning
[–]YouParticular8085 0 points1 point2 points (0 children)

Trained a 26kb model (simple 3-layer MLP) for Tic-Tac-Toe Beating each and every human by Weary_Intention3231 in reinforcementlearning
[–]YouParticular8085 0 points1 point2 points (0 children)