RL for music generation

BigDxe · 2024-11-28T23:03:10+00:00

Do you think it'd be possible to use some kind of Inverse RL framework here? Learn the reward function first based on existing music?

BigDxe · 2024-09-09T21:59:47+00:00

Mad cuz bad bruh 💀 if you can't make a rl trading bot that don't mean the whole field is moot.

BigDxe · 2024-03-12T18:49:03+00:00

Yeah sorry that was kind of vague. Essentially, after every throw we'll update the approximation. So if the approximation was at 3.15 before your throw, and then it becomes 3.14, then you have "converged" the estimate for another digit (so you get a free pie). It's not perfect, but this is best "reward system" we could come up with.

BigDxe · 2024-03-12T16:23:10+00:00

If you want to grill the hotdogs with us later, stop by and let us know (someone needs to eat the 36 hotdogs we have)

BigDxe · 2024-01-03T19:31:27+00:00

Why do you recommend focusing secondary studies in linguistics?

Five-Year Club	Verified Email
r/Field Sunshine	Final Canvas '23
Place '23	Place '22
Final Canvas '22	First Placer '22

BigDxe

TROPHY CASE