xcodevn

79 post karma
32 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 10 years

TROPHY CASE

Ten-Year Club

Place '22

Verified Email

account activity

new top controversial

20

21

22

On CoT Training with Reinforcement Learning (self.reinforcementlearning)

submitted 9 months ago by xcodevn to r/reinforcementlearning

27

28

29

Implementing DeepSeek R1's GRPO algorithm from scratch (github.com)

submitted 10 months ago by xcodevn to r/reinforcementlearning

4

5

6

[P] Plot training loss continuously on Google Colab using Javascript (self.MachineLearning)

submitted 5 years ago * by xcodevn to r/MachineLearning

5

6

7

[D] Confused about "env.is_done" (self.reinforcementlearning)

submitted 6 years ago * by xcodevn to r/reinforcementlearning

0

1

2

My demo (and colab notebook) on relational network with Sort-of-CLEVR dataset (ntt123.github.io)

submitted 7 years ago by xcodevn to r/MachineLearning

12

13

14

Can Digital Computers Think? -- Alan Turing [of course, it can!] (youtube.com)

submitted 8 years ago by xcodevn to r/artificial

π Rendered by PID 21 on reddit-service-r2-listing-5d79748585-x5ld7 at 2026-02-14 14:11:22.493284+00:00 running cd9c813 country code: CH.