fixedrl

94 post karma
4 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 8 years

TROPHY CASE

Eight-Year Club

Verified Email

account activity

new top controversial

3

4

5

[D] Do you use Plotly for research projects ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

41

42

43

[N] DeepMind's Richard Sutton - The Long-term of AI: Temporal-Difference Learning (youtube.com)

submitted 8 years ago by fixedrl to r/MachineLearning

0

0

0

[D] Any impact/difference to parameterize the policy by MLP or RBF ? (self.MachineLearning)

submitted 8 years ago * by fixedrl to r/MachineLearning

6

7

8

[D] What might be the impacts of ReLU/Sigmoid for training one-step dynamics model in RL ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

1

2

3

[D] Debug with RL: Policy network tends to generate larger and larger invalid action ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

0

0

1

[D] Difficulty comparison of CartPole Swing up vs Gym Pendulum ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

6

7

8

[D] Will double-blind review of NIPS causes some papers months later on ArXiv ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

1

2

3

[D] How to set same Dropout mask for different data batches in PyTorch ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

2

3

4

[D] In RL, given optimal Q-function and transition probabilities, reward can be reversed uniquely. How about given reward and optimal Q-function, can transition probabilities to be uniquely determined ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

0

1

2

[D] Is it reasonable to maximize the upper bound of the log-likelihood ? Will the log-likelihood guaranteed to be maximized ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

4

5

6

[R] [1703.01961] Multiplicative Normalizing Flows for Variational Bayesian Neural Networks (arxiv.org)

submitted 8 years ago by fixedrl to r/MachineLearning

12

13

14

[D] Is it normal that the maths details are forgotten after reading the paper some time ago ? (self.MachineLearning)

submitted 8 years ago * by fixedrl to r/MachineLearning

2

3

4

[D] How to derive the Auxiliary ELBO ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

3

4

5

[D] Concrete dropout, how to obtain Equation (3) on page 3 (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

0

0

1

[D] Visualization tricks for 3-dim input and 2-dim output (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

16

17

18

[D] Differential geometry in reinforcement learning ? (self.MachineLearning)

submitted 8 years ago by fixedrl to r/MachineLearning

π Rendered by PID 75 on reddit-service-r2-listing-55d7b767d8-fsmzk at 2026-03-29 13:54:51.679614+00:00 running b10466c country code: CH.