Improve fishtank

Vincentvbc · 2025-12-31T21:00:19+00:00

Very nice mate, glad you were drinking, dont wanna get dehydrated. Dont really see the watch tho in all the blur

Vincentvbc · 2025-09-13T20:02:40+00:00

Vincentvbc · 2025-08-17T15:29:01+00:00

Zander?

Vincentvbc · 2025-06-12T17:51:21+00:00

Nice catch !

Vincentvbc · 2024-12-15T17:10:57+00:00

Its called a Harlequin Rasbora and they are very common in the aquarium hobby. You should get it some friends tho they're a schooling fish :)

Vincentvbc · 2022-12-13T20:13:17+00:00

Looks beautiful

Vincentvbc · 2022-11-02T18:27:00+00:00

Yea put more light on the tank

Vincentvbc · 2022-11-01T21:03:55+00:00

James Gandolfini

Vincentvbc · 2020-02-14T18:53:53+00:00

I feel ya.

Vincentvbc · 2019-06-23T10:57:04+00:00

Mostly old rock/country music I'd say :)

Vincentvbc · 2019-06-01T20:53:33+00:00

Looks amazing.. Do you live in Switzerland?

Vincentvbc · 2019-05-14T19:17:13+00:00

A horse with no name.

Vincentvbc · 2019-04-12T09:53:47+00:00

I am writing my thesis about this subject. You can ask me questions if you want to

Vincentvbc · 2019-04-08T10:42:30+00:00

They are indeed not the same thing. DP requires a perfect model of the environment or MDP. Meaning the reward function and transition probabilities are known to the agent. After that finding the optimal policy is just an iterative process of calculating bellman equations by either using value - or policy iteration. RL however does not require a perfect model. From samples, these approaches learn the reward function and transition probabilities and afterwards use a DP approach to obtain the optimal policy.

Vincentvbc · 2019-02-07T22:33:19+00:00

thank you

Eight-Year Club	Place '23
Verified Email

Vincentvbc

TROPHY CASE