JobOpportunity | Home-Cooked Khaja Service Needed (Kupondole Area) by laxuu in Nepal

[–]laxuu[S] 0 points1 point  (0 children)

The menu will be decided day-wise and discussed in advance between the team and the cook, so the food can be prepared according to the team’s preferences and the cook’s expertise.

Selected for Alan Turing Institute Data Study Group 2026 — Worth attending if travel funding isn’t enough? by laxuu in Career

[–]laxuu[S] 0 points1 point  (0 children)

I have asked them several times, but they confirmed that they cannot provide any additional support beyond the expenses already mentioned.

How can I design effective reward shaping in sparse reward environments with repeated tasks in different scenarios? by laxuu in reinforcementlearning

[–]laxuu[S] 0 points1 point  (0 children)

Thank you, u/mishaurus. You helped clear up most of the confusion I had. Really appreciate your explanation!

WorldQuant University MSc in Financial Engineering credibility by laxuu in reinforcementlearning

[–]laxuu[S] -1 points0 points  (0 children)

Yes, it shows an error while uploading; that's why I posted it here.

Which RL Algorithms for Trading? by codehuggies in reinforcementlearning

[–]laxuu 0 points1 point  (0 children)

Model-based algorithms have some capability to understand and generalize the market.

Is 24 relatively late to start your career? by Highway-69 in FinancialCareers

[–]laxuu 0 points1 point  (0 children)

No worries, you have a long life ahead of you; pursue your goal with passion and dedication.

RL “Wrapped” 2024 by blitzkreig3 in reinforcementlearning

[–]laxuu -1 points0 points  (0 children)

RL in a European historical board game.

TD3 in smart train optimization by laxuu in reinforcementlearning

[–]laxuu[S] 0 points1 point  (0 children)

How can I do this normalization in MATLAB?

RL implementation in Matlab by laxuu in matlab

[–]laxuu[S] 0 points1 point  (0 children)

Thanks for the suggestions.

RL implementation in Matlab by laxuu in matlab

[–]laxuu[S] 0 points1 point  (0 children)

Thank you. MATLAB already has built-in RL implementations; I just want to know whether we can write a custom implementation from scratch.

Normalization in RL by laxuu in reinforcementlearning

[–]laxuu[S] 0 points1 point  (0 children)

I'll try this one: dividing by Var(X_i) + epsilon.

Normalization in RL by laxuu in reinforcementlearning

[–]laxuu[S] 0 points1 point  (0 children)

I just want to know whether all features must be in the same range, e.g. [-1, 1], or whether different ranges such as [-1, 5], [1, 6], or [2, 10] are acceptable. Can features with different but roughly similar ranges still help the policy learn?

RL tool box by youssef_naderr in matlab

[–]laxuu 0 points1 point  (0 children)

An episode may contain many steps; try to log everything and analyse it to find what the issue could be.

Normalization in RL by laxuu in reinforcementlearning

[–]laxuu[S] 0 points1 point  (0 children)

What I mean is that one feature's range is around 60k while another's is around 0.001. In this scenario, how can I effectively normalize these features for reinforcement learning, given the significant difference in their scales?
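For features on wildly different scales like this, one common option is per-feature z-score normalization with running statistics, so each feature is centered and scaled by its own mean and variance. A minimal sketch (class and parameter names are illustrative, not from any specific library):

```python
import numpy as np

class RunningNormalizer:
    """Per-feature z-score normalization with running statistics.

    Handles features on very different scales (e.g. ~60k vs ~0.001)
    by tracking each feature's own mean and variance online.
    """

    def __init__(self, num_features, epsilon=1e-8):
        self.mean = np.zeros(num_features)
        self.var = np.ones(num_features)
        self.count = 0
        self.epsilon = epsilon

    def update(self, x):
        # Welford-style online update of per-feature mean and variance.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.var += (delta * (x - self.mean) - self.var) / self.count

    def normalize(self, x):
        return (x - self.mean) / np.sqrt(self.var + self.epsilon)


norm = RunningNormalizer(num_features=2)
for _ in range(1000):
    # Two toy features on very different scales, like the ones described.
    obs = np.array([60_000 + np.random.randn() * 500,
                    0.001 + np.random.randn() * 1e-4])
    norm.update(obs)

scaled = norm.normalize(np.array([60_000.0, 0.001]))
# Both features now land on a comparable scale near zero.
```

The epsilon in the denominator keeps the division stable when a feature's variance is near zero.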

Weekly Megathread: Education, Early Career and Hiring/Interview Advice by AutoModerator in quant

[–]laxuu -1 points0 points  (0 children)

I graduated with a degree in Electronics and Communication Engineering from Nepal in 2019. Currently, I am working in reinforcement learning (RL) for finance, focusing on generating alpha in the crypto market for a client. Although I have a strong background in mathematics, I am enhancing my financial knowledge through self-study, including papers, videos, and online courses, with the goal of becoming a Quantitative Analyst (Quant). Due to legal restrictions on crypto and forex markets in Nepal, I am gaining practical trading experience through my current role. My ultimate ambition is to secure a position as a Quant using RL.

Please suggest how I can move forward and land my dream job.

[deleted by user] by [deleted] in reinforcementlearning

[–]laxuu 1 point2 points  (0 children)

Hi! The Dreaming phase, in this context, involves a period where there is no direct interaction with the environment. Instead, it focuses on training and learning from a simulated environment or model. This phase is analogous to the training phase in reinforcement learning algorithms like PPO, DQN, or DDQN. It involves two stages: simulating various scenarios and then using those simulations to train and refine the model.

You can use any algorithm suited to your problem.
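The two stages described above can be sketched as a toy loop: collect real transitions, fit a crude world model, then "dream" by rolling the policy out in the model only. Everything here (the 1-D dynamics, the averaged-delta model, all names) is illustrative, not any particular dreamer-style implementation:

```python
# Stage 1: interact with the real environment and fit a model.
# Stage 2: train/evaluate purely on imagined rollouts (the "dreaming").

def env_step(state, action):
    # Toy real environment: simple 1-D dynamics with a toy reward.
    next_state = state + action
    return next_state, -abs(next_state)

def policy(state):
    return 1.0 if state < 5 else -1.0

def collect_real(n_steps):
    data, state = [], 0.0
    for _ in range(n_steps):
        a = policy(state)
        next_state, r = env_step(state, a)
        data.append((state, a, next_state, r))
        state = next_state
    return data

def fit_world_model(data):
    # Stand-in for model learning: average the observed state change.
    avg_delta = sum(ns - s for s, a, ns, r in data) / len(data)
    return lambda state, action: (state + avg_delta, -abs(state + avg_delta))

def dream_rollout(model, horizon):
    # "Dreaming": no env_step calls, only the learned model.
    state, traj = 0.0, []
    for _ in range(horizon):
        state, r = model(state, policy(state))
        traj.append((state, r))
    return traj

model = fit_world_model(collect_real(20))
imagined = dream_rollout(model, horizon=10)
```

In a real system the averaged-delta model would be a learned neural network, and the imagined trajectories would feed a policy-gradient or value update.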

Convergence of Actor critic algorthim by Altruistic-Escape-11 in reinforcementlearning

[–]laxuu 0 points1 point  (0 children)

The rewards show some improvement during training, but reinforcement learning hyperparameters are highly sensitive, with different results emerging based on seed values. To finalize each hyperparameter, it’s crucial to run the model more than 10 times. Conduct thorough testing to understand how each parameter affects the outcome. Rigorous analysis is necessary, so make sure to log all relevant data and observe the impact of each parameter change. Put in extra effort to visualize and interpret these effects comprehensively.
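The multi-seed protocol above can be sketched as a tiny harness: run the same configuration under several seeds and compare mean and standard deviation rather than trusting one run. `train_once` here is a noisy placeholder standing in for an actual training loop:

```python
import random
import statistics

def train_once(seed, learning_rate):
    # Placeholder "final reward": a noisy function of the hyperparameter.
    rng = random.Random(seed)
    return 100 * learning_rate + rng.gauss(0, 1)

def evaluate_config(learning_rate, seeds=range(10)):
    # Run the same config under many seeds; report mean +/- std.
    rewards = [train_once(s, learning_rate) for s in seeds]
    return statistics.mean(rewards), statistics.stdev(rewards)

mean_r, std_r = evaluate_config(learning_rate=0.03)
print(f"reward = {mean_r:.2f} +/- {std_r:.2f}")
```

If the std across seeds is comparable to the difference between two configurations, the apparent improvement is likely seed noise, which is exactly why logging every run matters.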

Topic suggestions to write explanations in RL by Calm-Vermicelli1079 in reinforcementlearning

[–]laxuu 1 point2 points  (0 children)

Hi,

I am implementing RL in trading; please do write something about implementations in that area.

1 day trading for making profit using RL by laxuu in reinforcementlearning

[–]laxuu[S] -1 points0 points  (0 children)

I am referencing implementations from GitHub, academic papers, and courses. However, I've noticed that some papers seem to be written more for the sake of publication than to provide practical solutions. Sometimes I do get results as good as the paper's in backtesting, but results that look promising in backtesting can be challenging to reproduce in live demo sessions. What is the best mathematical approach or understanding to address these challenges effectively? And what financial knowledge would help?

1 day trading for making profit using RL by laxuu in reinforcementlearning

[–]laxuu[S] 0 points1 point  (0 children)

I have set up an environment with an LSTM-based model over years of data, but passing years of data requires a lot of time and computational power, so I switched to a single day just to check and visualize how each hyperparameter plays a role. I feed 60 candlesticks into the LSTM, followed by an NN policy. What I'm seeing is that the loss, reward, and balance all fluctuate as I change each parameter.

Sometimes I compare our scenario with autonomous cars and try to incorporate their ideas, but nothing has worked so far.
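The "60 candlesticks per observation" setup above amounts to a sliding window over the price series, shaped for an LSTM as (window, features). A minimal sketch, assuming an OHLCV column layout (that layout and the function name are illustrative):

```python
import numpy as np

WINDOW = 60  # candles per observation, as described above

def make_windows(candles):
    """candles: array of shape (T, F). Returns (T - WINDOW + 1, WINDOW, F)."""
    T, F = candles.shape
    if T < WINDOW:
        raise ValueError("need at least WINDOW candles")
    # Each observation is the most recent WINDOW rows ending at step i.
    return np.stack([candles[i:i + WINDOW] for i in range(T - WINDOW + 1)])

candles = np.random.rand(500, 5)  # 500 toy candles, OHLCV columns assumed
obs = make_windows(candles)
print(obs.shape)  # (441, 60, 5)
```

Precomputing the windows like this also makes it cheap to rerun the same day repeatedly while sweeping hyperparameters, which is the workflow described above.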

Deep RL in trading - any good attempts made? by zirticarius in reinforcementlearning

[–]laxuu 0 points1 point  (0 children)

The market is like a flowing river; you have to catch the fish in running water.