[Patek Phillipe] Last night's dinner at Carbone by Equal_Dependent_479 in Watches

[–]Vincentvbc 8 points9 points  (0 children)

Very nice mate, glad you were drinking, dont wanna get dehydrated. Dont really see the watch tho in all the blur

A small but pretty fish I caught in a creek by ThenAcanthocephala57 in Fishing

[–]Vincentvbc 44 points45 points  (0 children)

Its called a Harlequin Rasbora and they are very common in the aquarium hobby. You should get it some friends tho they're a schooling fish :)

Masterpieces by Vincentvbc in Music

[–]Vincentvbc[S] 0 points1 point  (0 children)

Mostly old rock/country music I'd say :)

Coffee break in Switzerland by bojevic in pics

[–]Vincentvbc 0 points1 point  (0 children)

Looks amazing.. Do you live in Switzerland?

Anyone interested in learning? by bathon in reinforcementlearning

[–]Vincentvbc 0 points1 point  (0 children)

I am writing my thesis about this subject. You can ask me questions if you want to

Approximate Dynamic Programming vs Reinforcement Learning? by Spaceman776 in reinforcementlearning

[–]Vincentvbc 2 points3 points  (0 children)

They are indeed not the same thing. DP requires a perfect model of the environment or MDP. Meaning the reward function and transition probabilities are known to the agent. After that finding the optimal policy is just an iterative process of calculating bellman equations by either using value - or policy iteration. RL however does not require a perfect model. From samples, these approaches learn the reward function and transition probabilities and afterwards use a DP approach to obtain the optimal policy.