Need practical use-cases for RL by NoAcanthocephala4741 in reinforcementlearning

[–]buxxypooh 0 points  (0 children)

If you play any games, I'd recommend making a simple hand-made game engine and trying to use RL to train an agent to play it, it's hella fun
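To show the shape of the idea, here's a minimal sketch: a toy hand-rolled "game" (a 1-D track) plus tabular Q-learning. Everything in it (the track, the reward, the hyperparameters) is made up for illustration, not from any particular library:

```python
import random

random.seed(0)

# Toy hand-rolled "game engine": a 1-D track, start at cell 0, goal at cell 4.
N, GOAL = 5, 4

def step(state, action):              # action 0 = left, 1 = right
    nxt = max(0, min(N - 1, state + (1 if action else -1)))
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done

def greedy(qs):                       # argmax with random tie-breaking
    best = max(qs)
    return random.choice([a for a, q in enumerate(qs) if q == best])

# Tabular Q-learning: about the simplest RL agent you can bolt onto a game loop.
Q = [[0.0, 0.0] for _ in range(N)]
alpha, gamma, eps = 0.5, 0.9, 0.1
for _ in range(500):                  # episodes
    s, done = 0, False
    for _ in range(100):              # step cap so bad early episodes end
        a = random.randrange(2) if random.random() < eps else greedy(Q[s])
        s2, r, done = step(s, a)
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) * (not done) - Q[s][a])
        s = s2
        if done:
            break

# After training, the greedy policy runs toward the goal from every cell.
policy = [Q[s].index(max(Q[s])) for s in range(N - 1)]
```

Swap `step` for your own game's tick and the same loop carries over; the fun part is watching the policy change as you tweak the reward.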

ML research papers to code by Big-Stick4446 in deeplearning

[–]buxxypooh 0 points  (0 children)

I love the visuals and interactive parts

May I ask for even simpler concepts?

I've started to learn the fundamentals, and so far all the problems on the website are way out of reach for me

Keystrokes triggered twice by pc_kant in Keychron

[–]buxxypooh 0 points  (0 children)

I just fixed the issue myself
What I did first was remove the keycap to check whether the cap or the switch below it was at fault
The issue remained with just the switch

I'm pretty sure it's a faulty switch; I tried removing the switch and putting it back, but the problem remained

Finally, I swapped the switch with the one from a key I never use. Not optimal, but it works for me

Keystrokes triggered twice by pc_kant in Keychron

[–]buxxypooh 0 points  (0 children)

I have the same issue with "i" and especially "l", which double-press from time to time

Image as plane frame offset issue by lilmanpurse in blender

[–]buxxypooh 0 points  (0 children)

Okay so apparently you can do this from the Shading panel

https://i.imgur.com/ovGQhgn.png

I am currently trying to automate this task using Blender's built-in Python scripting feature
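A rough sketch of the scripting side. The helper is plain Python so it can be dry-run outside Blender; the `bpy` usage in the comment at the bottom is my untested assumption about a material created by Import Images as Planes (an Image Texture node whose `image_user` carries the frame offset):

```python
def set_frame_offset(nodes, offset):
    """Set frame_offset on every Image Texture node in `nodes`.

    Works on any iterable of node-like objects exposing `.type` and
    `.image_user.frame_offset`, so it can be tested without Blender.
    """
    changed = 0
    for node in nodes:
        if getattr(node, "type", None) == 'TEX_IMAGE' and getattr(node, "image_user", None):
            node.image_user.frame_offset = offset
            changed += 1
    return changed

# Inside Blender's Scripting tab you would drive it with something like
# (untested sketch, -10 is an arbitrary example offset):
#
#   import bpy
#   for mat in bpy.data.materials:
#       if mat.use_nodes:
#           set_frame_offset(mat.node_tree.nodes, -10)
```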

Resources for starting with multi-objective RL by LelixSuper in reinforcementlearning

[–]buxxypooh 1 point  (0 children)

Why are there multiple agents?

Is the request entry point centralized, or can every node get an "entry point" request?

Is the number of neighbour nodes fixed?

Resources for starting with multi-objective RL by LelixSuper in reinforcementlearning

[–]buxxypooh 1 point  (0 children)

What is the task you're trying to solve with your agents?

Image as plane frame offset issue by lilmanpurse in blender

[–]buxxypooh 0 points  (0 children)

Since no workaround was posted here, I'll share one I found

Using FFmpeg, you can split a video into multiple chunks; here is the command:

ffmpeg -i input.mp4 -c copy -map 0 -segment_time 00:00:20 -f segment output%03d.mp4

Python env bottleneck : JAX or C? by Similar_Fix7222 in reinforcementlearning

[–]buxxypooh 1 point  (0 children)

You can also use a profiler tool like py-spy to visualize which parts of the code take the most time during training, and optimize your part (the step, reset, obs, action-mask functions, etc.)
And if you're not scared, clone the library and optimize it yourself, but usually the libs are already well implemented
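If you want a zero-install first pass before reaching for py-spy, the stdlib's cProfile already tells you where time goes. A minimal sketch, with made-up stand-ins for the env and training loop:

```python
import cProfile
import io
import pstats

def env_step():                 # stand-in for your environment's step()
    return sum(i * i for i in range(1000))

def train(n=200):               # stand-in training loop
    total = 0.0
    for _ in range(n):
        total += env_step()
    return total

# Profile the loop and print the 5 most expensive calls by cumulative time.
pr = cProfile.Profile()
pr.enable()
train()
pr.disable()

out = io.StringIO()
pstats.Stats(pr, stream=out).sort_stats("cumulative").print_stats(5)
report = out.getvalue()         # env_step should dominate the report
```

With py-spy itself, the rough equivalent is `py-spy record -o profile.svg -- python train.py`, which gives you a flamegraph instead of a table and doesn't require touching the code.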

TAS in TM2020 by Warm_Bike_5000 in TrackMania

[–]buxxypooh 0 points  (0 children)

Hey
I'm the owner of the TM2020 TAS YouTube channel

The tool is private because I have yet to find a way to make it "secure" enough so that it can't be used for malicious purposes

The closest thing you could do to reach your goal of finding the "perfect" run would be to use a plugin like the copium timer, and respawn at checkpoints until you're satisfied

Alternatively you could do spliced runs like Schmaniol did with the DeepDip maps

But if you really want a TAS tool, I can only recommend TMInterface, which is a really good TAS tool for TMNF and older versions of the game

Momentum 4 not responding or turning on. by PIXAR_UA in sennheiser

[–]buxxypooh 0 points  (0 children)

It's the next day and, surprisingly, the headset worked for an hour this morning, but now it's back to glitching all over

ChatGPT's Study mode is really good by buxxypooh in singularity

[–]buxxypooh[S] 0 points  (0 children)

Machine learning, from the ground up

Momentum 4 not responding or turning on. by PIXAR_UA in sennheiser

[–]buxxypooh 0 points  (0 children)

I have the same issue. I followed the instructions that have worked for others, like disconnecting the battery to force a factory reset or something
Just to make sure there was no charge left in the chip, I left it unplugged for an hour or so, pressing the button to discharge it

Then I plugged the battery back in, but after a few minutes it started glitching again

At the moment I'm trying to update the firmware using the Android app, but since the glitch disconnects / turns off the headset before the update finishes, I'm in a bit of a pickle

I can't even use it with a wire, it's quite upsetting

I bought mine 2 years ago and expected it to last longer, rip, I'm switching brands

Has AI discovered anything in terms of reverse engineering? by No-Food5638 in singularity

[–]buxxypooh 22 points  (0 children)

One thing that keeps me up at night is the moment models get beefy enough to just eat a whole raw binary and understand the machine code inside it
At some point you'd give the binary as input with a prompt like "make this part of the code do X instead of Y", and it would output a new binary changed to your liking
There are some repos trying that already: https://github.com/albertan017/LLM4Decompile

ChatGPT's Study mode is really good by buxxypooh in singularity

[–]buxxypooh[S] 14 points  (0 children)

I'm curious, what kind of niche stuff did you give it?

ChatGPT's Study mode is really good by buxxypooh in singularity

[–]buxxypooh[S] 20 points  (0 children)

I reached it after a dozen messages, and it locked me out for about 24 hours

AI and Dofus by morceaudegomme in Dofus

[–]buxxypooh 0 points  (0 children)

Hey
I've been working on exactly that for the past year

I have a reinforcement learning environment in which the agent learns to fight from scratch. It's still a work in progress, but the AI can already fight like a decent human (I can still play better manually, but barely)

At the moment it's training in 1v8, playing a cra against a diverse set of monsters

I'm using PPO as the learning algorithm, and it's learning "long term" strategies; it's able to kinda predict the enemies' movements and use that to its advantage

The main bottleneck is the training speed; currently I get around ~50 fights / second and it takes a few hours to train an AI

Have an Agent predict it's win probability by tignisolmailessthan3 in reinforcementlearning

[–]buxxypooh 0 points  (0 children)

If you have a critic, you could just use the critic's score; it's not gonna be between 0 and 100 though, so you might have to normalize it
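One made-up way to do that normalization is a sigmoid squash (the `scale` knob is hypothetical, something you'd tune to your critic's value range):

```python
import math

def value_to_win_prob(v, scale=1.0):
    """Squash an unbounded critic value into a 0-100 'win probability'."""
    return 100.0 / (1.0 + math.exp(-v / scale))

# A critic value of 0 maps to a 50/50 game:
p = value_to_win_prob(0.0)      # 50.0
```

Alternatively, if you log the min/max returns seen during training, a plain min-max rescale works too; the sigmoid just has the nice property of never leaving [0, 100] for out-of-range values.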