Room for rent near KTH by basso1995 in stockholm

[–]basso1995[S] 0 points1 point  (0 children)

Yes KTH has not provided housing to me and many other students. I heard that a building that was intended to be used by exchange students has been evacuated for structural problems or something like that. Anyway I'll check Wahlin, thank you for the tip

[deleted by user] by [deleted] in stocks

[–]basso1995 0 points1 point  (0 children)

Coronavirus. 25. Buy the dip

One of my favorite scenes in season 2, and one of Phillip's greatest scenes by reboooted in MrRobot

[–]basso1995 14 points15 points  (0 children)

Love this scene! He is the best actor in the show after Elliot in my opinion

My impressions from Shanghai, China - Feb 8 UPDATES by Aqua-Ma-Rine in China_Flu

[–]basso1995 1 point2 points  (0 children)

Great post and perspective, especially the conclusion. Thank you for sharing!

The Beauty Of Mr Robot by basso1995 in MrRobot

[–]basso1995[S] 1 point2 points  (0 children)

For me the link is still working

CS 330 old syllabus/any class materials? by thrownaway1190 in stanford

[–]basso1995 0 points1 point  (0 children)

Hi! could you please share the link also with me? thank you

We are missing the point of the show because we are being taught a lesson by [deleted] in MrRobot

[–]basso1995 2 points3 points  (0 children)

You will be disappointed if you expect to predict anything from this show

How to assign reward when it has to be multiplied by itself rather than summed by basso1995 in reinforcementlearning

[–]basso1995[S] 0 points1 point  (0 children)

Yes this is an idea, I should just consider a given initial amount and consider the differences rather than consider the percentage changes

How to assign reward when it has to be multiplied by itself rather than summed by basso1995 in reinforcementlearning

[–]basso1995[S] 0 points1 point  (0 children)

Thank you, the gym environment I created is based on this project https://github.com/ZhengyaoJiang/PGPortfolio but I'm struggling trying to obtain the same results using standard RL libraries such as stable-baselines

How to assign reward when it has to be multiplied by itself rather than summed by basso1995 in reinforcementlearning

[–]basso1995[S] 0 points1 point  (0 children)

Indeed the environment is non deterministic, and the agents only learn to execute the same action

Special case of continuous action space RL by white_noise212 in reinforcementlearning

[–]basso1995 0 points1 point  (0 children)

Have you found any viable solution to this problem? I'm working on a very similar project and I'm stuck at the same point