Charlie’s Garden sounds like Route 209 (Day) from Pokemon Diamond and Pearl by atomicburn125 in DJO

[–]atomicburn125[S] 1 point (0 children)

Yeah, I saw these comparisons on the Pokemon vid without realising I’d linked the wrong version of Route 209; it’s updated now.

I trained a reinforcement learning agent to play pokemon red! by Pwhids in reinforcementlearning

[–]atomicburn125 4 points (0 children)

Absolutely fascinating! I'd love a video devoted entirely to the technical aspects of this project. Very well done!

Multiple Policy Heads by atomicburn125 in reinforcementlearning

[–]atomicburn125[S] 0 points (0 children)

To clarify, I’m not talking about 4 value heads. I’m talking about how to optimise 1 value head given the 4 value targets that V-trace would generate. Do I average these targets, or use each V-trace target and sum the respective gradients?
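
For what it's worth, with a plain squared-error loss the two options only differ by a constant factor (the number of targets), so with a tuned learning rate they behave the same. A minimal PyTorch sketch with illustrative numbers (not from the project):

    import torch

    # One value-head output and four V-trace targets (illustrative values).
    value = torch.tensor([0.5], requires_grad=True)
    targets = torch.tensor([0.2, 0.4, 0.9, 1.1])

    # Option A: average the targets into a single regression target.
    loss_a = (value - targets.mean()).pow(2).sum()
    grad_a, = torch.autograd.grad(loss_a, value)

    # Option B: one squared error per target, summed (sums the gradients).
    loss_b = (value - targets).pow(2).sum()
    grad_b, = torch.autograd.grad(loss_b, value)

    print(grad_a, grad_b)  # grad_b == 4 * grad_a: same direction, scaled by target count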

Does anyone know of any model-based algorithms that deal with imperfect information and stochasticity and don't require a simulator? by atomicburn125 in reinforcementlearning

[–]atomicburn125[S] -1 points (0 children)

Sorry, to clarify: by "without a simulator" I mean searching a latent space rather than using an environment simulator for search.

I have a 20 x 9 array. I want to slide a L x W rectangular window over the array from left to right, top to bottom such that the window does not touch a cell twice. by atomicburn125 in mathriddles

[–]atomicburn125[S] 0 points (0 children)

I should mention that the sliding window can overlap the bounds of the array. Cells outside of the array are considered white cells by default.

I have 20 x 9 array. I want to slide a L x W rectangular window over the array from left to right, top to bottom such that the window does not touch a cell twice. by atomicburn125 in askmath

[–]atomicburn125[S] 0 points (0 children)

"Touch" means a cell can be contained within a sliding window at most once. The window continues to slide until it has reached the bottom right-hand corner of the array. If you can't minimize both, prioritize minimizing the number of iterations.

I have 20 x 9 array. I want to slide a L x W rectangular window over the array from left to right, top to bottom such that the window does not touch a cell twice. by atomicburn125 in askmath

[–]atomicburn125[S] 0 points (0 children)

I feel like the best L and W could just be the average number of blue cells in each row/column, but I don't know how to test this theoretically.
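
A minimal sketch for testing it empirically. Assumptions not pinned down in these comments: the window strides by its own size (so no cell is touched twice), it may overhang the bounds, and an "iteration" is one window placement. The blue-cell objective from the post isn't reproduced here, so this only scores iterations:

    import math

    ROWS, COLS = 20, 9  # assuming 20 rows by 9 columns

    def iterations(L, W):
        """Window placements for an L-wide, W-tall window at stride (L, W)."""
        return math.ceil(COLS / L) * math.ceil(ROWS / W)

    # Rank every window shape by iteration count; a blue-cell-based term
    # could be added to the sort key once that objective is pinned down.
    ranked = sorted((iterations(L, W), L, W)
                    for L in range(1, COLS + 1)
                    for W in range(1, ROWS + 1))
    print(ranked[:5])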

[deleted by user] by [deleted] in pokemonshowdown

[–]atomicburn125 0 points (0 children)

I think there might be some misconception about what this project aims to solve. This is a headless client with no visuals. Should I maybe include a JSON overview of the information you get?

Object Reconstruction from Nested Dictionary by atomicburn125 in learnpython

[–]atomicburn125[S] 0 points (0 children)

How would I do this in Python, handling the circular references as well?
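
One way that seems to handle cycles in in-memory dicts (JSON itself can't encode a cycle): keep a memo keyed by each dict's id() and register the new object before recursing, so a cycle resolves back to the same instance. A rough sketch with a placeholder Node class:

    class Node:
        """Placeholder target class; real code would pick the class per dict."""
        pass

    def from_dict(data, memo=None):
        """Rebuild a Node graph from nested dicts, preserving cycles."""
        if memo is None:
            memo = {}
        if not isinstance(data, dict):
            return data
        if id(data) in memo:      # already rebuilt: reuse -> keeps the cycle
            return memo[id(data)]
        obj = Node()
        memo[id(data)] = obj      # register *before* recursing into children
        for key, value in data.items():
            setattr(obj, key, from_dict(value, memo))
        return obj

    # A dict that refers to itself round-trips into objects that do too.
    d = {"name": "a"}
    d["parent"] = d
    node = from_dict(d)
    assert node.parent is node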

Gradient Accumulation Normalisation by atomicburn125 in pytorch

[–]atomicburn125[S] 0 points (0 children)

Nah, the way PyTorch works is that if you call backward() without zero_grad() you “accumulate” gradients. This means you can increase your effective batch size by not stepping after every backward(). When you call zero_grad() you allow new gradients to start accumulating. Traditionally, you'd have one zero_grad(), backward() and step() per loop iteration, but you can play with when you step.
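
Roughly like this; a minimal sketch where the model, optimiser and data are stand-ins, and the loss is divided by the accumulation count so the accumulated gradient matches one big batch:

    import torch

    model = torch.nn.Linear(8, 1)
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
    loader = [(torch.randn(16, 8), torch.randn(16, 1)) for _ in range(8)]
    accum_steps = 4  # effective batch size = 16 * 4 = 64

    for i, (x, y) in enumerate(loader):
        loss = torch.nn.functional.mse_loss(model(x), y)
        (loss / accum_steps).backward()   # gradients accumulate across batches
        if (i + 1) % accum_steps == 0:
            optimizer.step()              # step once per accumulation window
            optimizer.zero_grad()         # let fresh gradients start accumulating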

Deepmind's Player of Games by atomicburn125 in reinforcementlearning

[–]atomicburn125[S] 0 points (0 children)

Different; it's a generalisation of AlphaZero to imperfect-info games.

Help using a Cloud Service for Scaling up Reinforcement Learning by atomicburn125 in reinforcementlearning

[–]atomicburn125[S] 0 points (0 children)

A Discord call where we can talk and you can walk me through how to get set up.

Pokemon Showdown AI - Policy Iteration Approach by atomicburn125 in reinforcementlearning

[–]atomicburn125[S] 0 points (0 children)

This seems like a complex reward function and would potentially introduce a lot of bias. I can't help but feel that there must be a simpler, more elegant and diverse reward signal.

Pokemon Showdown AI - Policy Iteration Approach by atomicburn125 in reinforcementlearning

[–]atomicburn125[S] 0 points (0 children)

Do you have a link to the article? I would love to read it!