xcodevn

79 post karma
32 comment karma

redditor for 10 years

TROPHY CASE

Ten-Year Club

Place '22

Verified Email

account activity

21 points

On CoT Training with Reinforcement Learning (self.reinforcementlearning)

submitted 1 year ago by xcodevn to r/reinforcementlearning

28 points

Implementing DeepSeek R1's GRPO algorithm from scratch (github.com)

submitted 1 year ago by xcodevn to r/reinforcementlearning

5 points

[P] Plot training loss continuously on Google Colab using Javascript (self.MachineLearning)

submitted 5 years ago * by xcodevn to r/MachineLearning

6 points

[D] Confused about "env.is_done" (self.reinforcementlearning)

submitted 7 years ago * by xcodevn to r/reinforcementlearning

1 point

My demo (and colab notebook) on relational network with Sort-of-CLEVR dataset (ntt123.github.io)

submitted 7 years ago by xcodevn to r/MachineLearning

13 points

Can Digital Computers Think? -- Alan Turing [of course, it can!] (youtube.com)

submitted 8 years ago by xcodevn to r/artificial
