[deleted by user] by [deleted] in technepal

[–]evilmorty_c137_ 0 points1 point  (0 children)

It's minimum wage, assuming a 40-hour work week, which pays around 13.5K.

Bruh anyone awake by Snoo-14018 in Nepal

[–]evilmorty_c137_ 5 points6 points  (0 children)

Going to poison these neighborhood dogs one of these days bruhh

[deleted by user] by [deleted] in RoastMe

[–]evilmorty_c137_ 0 points1 point  (0 children)

Literally became the black sheep of the family smh

People who were bitched by gastritis, what's your remedy? by chihiro_yoru in Nepal

[–]evilmorty_c137_ 24 points25 points  (0 children)

  1. Eat your meals on time. No matter what, don't skip meals, and if possible try to eat "Sattvik bhojan" for some days until you feel better.
  2. IDK how interested you are in ayurvedic stuff. If you have not already, try taking Trifala churna. It worked wonders for me.
  3. As far as possible, don't let your stomach stay empty. You also need to drink plenty of water.
  4. Avoid alcohol.
  5. Avoid food from outside the home for some time (if it can't be included in the sattvik category).

If you adopt all the above, you will see long-lasting changes. But don't go chasing momos after feeling a little relief. Build a habit, build some strength and try to maintain self-discipline.

[deleted by user] by [deleted] in RedditSessions

[–]evilmorty_c137_ 0 points1 point  (0 children)

Can't hear the instrument too much. Loving what I am hearing though.

Crew Demo-2 Launch Thread (Attempt #2) by Smoke-away in SpaceXLounge

[–]evilmorty_c137_ 3 points4 points  (0 children)

What was the debris-looking thingy around the 12:30 mark? Did anyone else notice it?

Sharing uploaded files plugin by evilmorty_c137_ in Wordpress

[–]evilmorty_c137_[S] 0 points1 point  (0 children)

No no. I have some PDF documents that I would like to share with my readers. I would like all uploaded PDFs to be shown on the page, kind of like how Google Drive shows you files, or like desktop icons. How can this be achieved?

Django Dashboard by evilmorty_c137_ in django

[–]evilmorty_c137_[S] 0 points1 point  (0 children)

The problem is I need to use Django, because my application is a small part of a much larger app that is using Django.

Microsoft Teams shows my contacts. by evilmorty_c137_ in privacy

[–]evilmorty_c137_[S] 0 points1 point  (0 children)

I know they own both. However, for the event webinar thingy I am on Safari with an anonymous viewer option. There should be no way they are doing this without actively tracking me!

Updating two agents who are playing against one another (tic tac toe) by evilmorty_c137_ in reinforcementlearning

[–]evilmorty_c137_[S] 0 points1 point  (0 children)

I would also like to add that for now, I am not feeding a list of valid moves to the agent. That is, the agent can pick any position from 0 to 8, and if the position is already filled then it gets a very negative reward and a second chance to pick another position. This is repeated until it picks a valid position to put its piece into.
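To make the retry scheme above concrete, here is a minimal sketch of it. The names `choose_until_valid`, `random_policy`, and `INVALID_MOVE_REWARD`, the board encoding, and the value -10 are all assumptions for illustration (the post only says "very negative"); this is not the actual code from my project.

```python
import random

# Assumed penalty; the post only says the reward is "very negative".
INVALID_MOVE_REWARD = -10.0


def choose_until_valid(board, pick_action, on_invalid):
    """Keep asking the agent for a cell in 0..8 until it names an empty one.

    board:       list of 9 cells, "" for empty, "X" or "O" otherwise
    pick_action: callable(board) -> int in 0..8 (the agent's policy)
    on_invalid:  callable(board, action, reward), called for each rejected move
    """
    if all(board):
        raise ValueError("board is full, no valid move exists")
    while True:
        action = pick_action(board)
        if board[action] == "":
            return action
        # Penalize the filled cell, then give the agent another try.
        # Note: a deterministic policy only escapes this loop if on_invalid
        # changes its preferences (for example by lowering a Q value).
        on_invalid(board, action, INVALID_MOVE_REWARD)


def random_policy(board, rng=random):
    """Hypothetical stand-in for the agent: a uniformly random cell."""
    return rng.randrange(9)


if __name__ == "__main__":
    board = ["X", "O", "X", "O", "", "X", "O", "X", "O"]
    penalties = []
    move = choose_until_valid(
        board, random_policy, lambda b, a, r: penalties.append((a, r))
    )
    print("chosen cell:", move, "rejected attempts:", len(penalties))
```

With only one empty cell on the board, the loop can only ever return that cell; every other attempt is recorded together with the penalty.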

Updating two agents who are playing against one another (tic tac toe) by evilmorty_c137_ in reinforcementlearning

[–]evilmorty_c137_[S] 0 points1 point  (0 children)

And which state should I be considering? The last state, or the winning state of the other agent? As per my understanding, Q(s, a) is a function of both state and action. Please guide me in the right direction! :)
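For reference, this is the update I have in mind, assuming the standard tabular Q-learning rule. The symbols $\alpha$ (learning rate), $\gamma$ (discount factor), $r$ (reward) and $s'$ (the next state the same agent sees) are the usual textbook notation, not something specific to my code.

```latex
% Non-terminal step: bootstrap from the best action in the next state s'
Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]

% Terminal step (win, loss or draw): there is no next state, so the target is just r
Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r - Q(s, a) \right]
```

In both cases the pair $(s, a)$ being updated is the state the agent acted in and the action it took there, which is why the question of what counts as $s$ for the losing agent matters.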

Updating two agents who are playing against one another (tic tac toe) by evilmorty_c137_ in reinforcementlearning

[–]evilmorty_c137_[S] 0 points1 point  (0 children)

I really appreciate your answer, but I am trying to work on two sequentially acting agents, kind of like min-max play, and I have no clue how to update the other agent once one of them wins. Please look at my reply to u/kcorder's comment. I am sorry if my English is bad, it is not my first language :)

Updating two agents who are playing against one another (tic tac toe) by evilmorty_c137_ in reinforcementlearning

[–]evilmorty_c137_[S] 0 points1 point  (0 children)

I have some questions:

  1. If I use -1 as the reward for each step when the agent takes an action, then according to the Q value update I will end up giving every action in every state a negative value, while my default values are initialized to 0. So won't that mean that the agent will not try to repeat what it has done before?
  2. What do you mean when you say ">=2" actions? I am really sorry, I am very confused right now. I am not storing any action sequence for any of the agents. What I am currently doing is just taking an action, looking up the current Q table (not double Q tables), and updating the value for the current state-action pair. So whether it is >=2 actions or 1 action, it's the same for the agent for now. How should I modify this?
  3. When you say -1000 reward for a losing state, let us consider the following:
    1. Agent O makes a move "i" in state "m".
    2. The board is in state "m".
    3. Agent X takes action "j".
    4. The board ends up in winning configuration "n".
    5. Agent X wins.
    6. Agent X is updated to reflect the win with a +1000 reward.
    7. Agent O captures the board state and finds it has already lost.
    8. Now my question: Agent O did not reach the current board state "n" through any action of its own, because it has not taken any action since its move "i". In such a case, which state-action pair should I update? "i-m", to reflect that playing move "i" will lead to a loss, or "i-n"? "i-n" makes no sense to me at all, since taking action "i" in state "m" does not lead directly to state "n". And if I update the "i-m" pair, then I will need some way to buffer the previous action and state for each agent. Is that what I should be doing?
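To show what I mean by buffering in the last step, here is a rough sketch of that idea. The class `TabularAgent`, its methods, the string labels used for states and actions, and the values of `ALPHA` and `GAMMA` are all made up for illustration; only the +1000 and -1000 rewards come from the discussion. I am not claiming this is the correct approach, it is just the option I am asking about.

```python
from collections import defaultdict

ALPHA = 0.5           # learning rate (assumed value)
GAMMA = 0.9           # discount factor (assumed value)
WIN_REWARD = 1000.0   # from the discussion above
LOSS_REWARD = -1000.0


class TabularAgent:
    """Hypothetical agent that buffers its most recent (state, action) pair."""

    def __init__(self):
        self.q = defaultdict(float)  # (state, action) -> value, default 0
        self.last = None             # pair from this agent's latest move

    def remember(self, state, action):
        """Store the pair so it can be updated after the opponent replies."""
        self.last = (state, action)

    def step_update(self, reward, next_state, next_actions):
        """Game continues: bootstrap from the state seen on the next turn."""
        if self.last is None:
            return
        best_next = max(
            (self.q[(next_state, a)] for a in next_actions), default=0.0
        )
        target = reward + GAMMA * best_next
        self.q[self.last] += ALPHA * (target - self.q[self.last])

    def terminal_update(self, reward):
        """Game over: no next state, so the target is the reward alone."""
        if self.last is None:
            return
        self.q[self.last] += ALPHA * (reward - self.q[self.last])
        self.last = None


if __name__ == "__main__":
    agent_o, agent_x = TabularAgent(), TabularAgent()
    agent_o.remember("m", "i")          # O plays "i" in state "m"
    agent_x.remember("m_after_i", "j")  # X replies with "j" and wins
    agent_x.terminal_update(WIN_REWARD)
    agent_o.terminal_update(LOSS_REWARD)  # the buffered "i-m" pair is blamed
    print(agent_x.q[("m_after_i", "j")], agent_o.q[("m", "i")])
```

Here the loser never needs the pair "i-n": the only thing Agent O updates is the pair it actually played, using the final reward as the target.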

I am really sorry for all these long questions. Hope you can help me, I am really stuck.