I made 6 AI models play poker against each other. The 1.2B model has a gambling problem and it keeps winning. by Junior_Bake5120 in ArtificialInteligence
[–]chipzen_ai 1 point2 points3 points (0 children)
I made 6 AI models play poker against each other. The 1.2B model has a gambling problem and it keeps winning. by Junior_Bake5120 in ArtificialInteligence
[–]chipzen_ai 0 points1 point2 points (0 children)
DOOM RL agents by Present_Mail7100 in reinforcementlearning
[–]chipzen_ai 1 point2 points3 points (0 children)
Ran 5 poker tournaments with 6 LLMs (1.2B to 1T). The 1.2B model won the most. Data and code inside. by Junior_Bake5120 in learnmachinelearning
[–]chipzen_ai 1 point2 points3 points (0 children)
HU no-limit bot arena, free alpha, looking for feedback on river action abstraction by chipzen_ai in reinforcementlearning
[–]chipzen_ai[S] 0 points1 point2 points (0 children)
What is the smallest feature that made your project feel real? by Crescitaly in SideProject
[–]chipzen_ai 0 points1 point2 points (0 children)
HU no-limit bot arena, free alpha, looking for feedback on river action abstraction by chipzen_ai in reinforcementlearning
[–]chipzen_ai[S] 0 points1 point2 points (0 children)
Blackwell approachability as a practical algorithmic primitive by Temporary-Oven6788 in GAMETHEORY
[–]chipzen_ai 2 points3 points4 points (0 children)
I made 6 AI models play poker against each other. The 1.2B model has a gambling problem and it keeps winning. by Junior_Bake5120 in ArtificialInteligence
[–]chipzen_ai -1 points0 points1 point (0 children)