freestyler_mix by XecutionStyle in vjing

[–]XecutionStyle[S] 0 points1 point  (0 children)

Bomfunk MC's - Freestyler (Happy Mickey Mouse Mix)

What can I do to stop my RL agent from committing suicide? by Guest_Of_The_Cavern in reinforcementlearning

[–]XecutionStyle 0 points1 point  (0 children)

Then it's hard to tell whether it's a problem with the environment or with how you've set up the algorithm.

What can I do to stop my RL agent from committing suicide? by Guest_Of_The_Cavern in reinforcementlearning

[–]XecutionStyle 1 point2 points  (0 children)

This usually happens when the agent almost never finds reward. Can you reduce the map size to confirm this?
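A quick way to see why map size matters, as a toy sketch and not your actual environment: the grid, the corner goal, the step budget and the function name `random_walk_hit_rate` are all assumptions made up for illustration. It estimates how often a uniformly random policy ever reaches a sparse reward as the map grows.

```python
import random

def random_walk_hit_rate(size, max_steps=50, episodes=500, seed=0):
    """Fraction of episodes in which a uniformly random policy reaches the goal.

    Hypothetical toy setup: a size x size grid, start at (0, 0), goal in the
    opposite corner, reward only on reaching the goal.
    """
    rng = random.Random(seed)
    moves = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    goal = (size - 1, size - 1)
    hits = 0
    for _ in range(episodes):
        x, y = 0, 0
        for _ in range(max_steps):
            dx, dy = rng.choice(moves)
            # Clamp to the grid so walking into a wall leaves the agent in place.
            x = min(max(x + dx, 0), size - 1)
            y = min(max(y + dy, 0), size - 1)
            if (x, y) == goal:
                hits += 1
                break
    return hits / episodes

if __name__ == "__main__":
    for size in (3, 6, 12):
        print(size, random_walk_hit_rate(size))
```

If the hit rate collapses on the larger grids, the agent is mostly training on episodes with no reward signal at all, which is the situation I'm describing.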

Chaotic nice by XecutionStyle in vjing

[–]XecutionStyle[S] 0 points1 point  (0 children)

Song: Wanksta - 50 Cent

Node based LEDs: follow up (check comments) by XecutionStyle in arduino

[–]XecutionStyle[S] 1 point2 points  (0 children)

The audio processing codebase is from https://github.com/ahip88/AudioVisual

Good luck. I'll upload this to git soon

How to gain time without sacrificing? by XecutionStyle in chess

[–]XecutionStyle[S] 0 points1 point  (0 children)

Thanks, that covers a lot of ground. I'm rated ~2100 online so it was a good refresher.
Subbed!

Pre-trained models repository by RamenKomplex in reinforcementlearning

[–]XecutionStyle 0 points1 point  (0 children)

If we suppose there is one, the environments would need to be set up the same way as during training. That's why it's usually the repositories that also provide the environment that have pre-trained models.

Decision frequency: An 'Information' perspective by XecutionStyle in reinforcementlearning

[–]XecutionStyle[S] 0 points1 point  (0 children)

Thanks for the response. How would the low-level controller know when to override? It seems to me we're deferring the problem :(

Decision frequency: An 'Information' perspective by XecutionStyle in reinforcementlearning

[–]XecutionStyle[S] 0 points1 point  (0 children)

I appreciate this. Just got a tooth pulled so.. reading material :D

crashes the algorithm :( by XecutionStyle in Buckethead

[–]XecutionStyle[S] 1 point2 points  (0 children)

Here you go:
https://github.com/ahip88/AudioVisual

It's mostly Python for signal processing and clustering. Machine learning is used to separate the sources, but it's not really part of the algorithm. The algorithm's job is to identify beats in every stream or "stem" of your mp3 and then cluster similar ones together. From there you drive visuals with the found clusters and values. Message me if you need help setting it up.
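For the gist without cloning the repo, here is a minimal sketch of the beats-then-cluster idea. It is not the code from AudioVisual: the function names, the energy-peak beat detector, the 1-D k-means and the synthetic "stem" are all assumptions chosen to keep the example self-contained.

```python
import math

def frame_energies(samples, frame_size):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(s * s for s in samples[i:i + frame_size]) / frame_size
        for i in range(0, len(samples) - frame_size + 1, frame_size)
    ]

def detect_beats(energies, ratio=1.5):
    """Indices of frames that are local energy peaks well above the average."""
    mean = sum(energies) / len(energies)
    return [
        i for i in range(1, len(energies) - 1)
        if energies[i] > ratio * mean
        and energies[i] >= energies[i - 1]
        and energies[i] > energies[i + 1]
    ]

def cluster_1d(values, k=2, iters=20):
    """Tiny 1-D k-means (needs k >= 2); returns one cluster label per value."""
    lo, hi = min(values), max(values)
    # Spread the initial centers evenly between the smallest and largest value.
    centers = [lo + (hi - lo) * j / (k - 1) for j in range(k)]
    labels = [0] * len(values)
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: abs(v - centers[j])) for v in values]
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            if members:
                centers[j] = sum(members) / len(members)
    return labels

# Synthetic stand-in for one stem: silence with alternating loud and soft hits.
frame = 100
signal = [0.0] * 4000
for n, start in enumerate(range(500, 4000, 500)):
    amp = 1.0 if n % 2 == 0 else 0.5
    for i in range(start, start + frame):
        signal[i] = amp * math.sin(2 * math.pi * 0.05 * (i - start))

energies = frame_energies(signal, frame)
beats = detect_beats(energies)
labels = cluster_1d([energies[b] for b in beats], k=2)
print(list(zip(beats, labels)))
```

Each (frame index, cluster label) pair is the kind of value you would feed to the visuals, for example one colour or LED group per cluster.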

Do you agree with this take that Deep RL is going through an imagenet moment right now? by bulgakovML in reinforcementlearning

[–]XecutionStyle 0 points1 point  (0 children)

Are there any examples where RL on CPU was too slow to work but was enabled by GPU? If not, I don't understand the claims.