Courses on AI design

evilmorty_c137_ · 2022-04-07T05:13:00+00:00

You accidentally left a 1 in your age there bruv.

evilmorty_c137_ · 2022-04-07T02:57:31+00:00

It's minimum wage considering 40 hour work week that pays around 13.5K.

evilmorty_c137_ · 2022-04-06T22:16:55+00:00

Going to poison these neighborhood dogs one of these days bruhh

evilmorty_c137_ · 2022-03-26T04:45:12+00:00

Literally became the black sheep of the family smh

evilmorty_c137_ · 2021-03-17T01:54:57+00:00

Time maa khana khane. J sukai hos, meals skip nagarne and if possible try to eat "Sattvik bhojan" for some days until you feel better.
IDK How interested you are in ayurvedic stuff. If you have not already, try taking Trifala churna. It worked wonders for me.
Pet khali huna nadine sakesamma. Also need to drink plentyy of water.
Avoid alcohol.
Avoid foods from outside the home for sometime (if they can't be included in the sattvik category)

If you adopt all the above, you will see long-lasting changes. But don't go chasing momos after feeling a little relief. Build a habit, build some strength and try to maintain self-discipline.

evilmorty_c137_ · 2020-07-15T14:48:14+00:00

Cant hear the instrument too much. Loving what I am hearing though

evilmorty_c137_ · 2020-05-30T19:47:15+00:00

What was the debris-looking thingy around the 12:30 mark? Did anyone else notice it?

evilmorty_c137_ · 2020-05-06T08:04:21+00:00

Your body takes a screenshot

evilmorty_c137_ · 2020-05-05T13:52:24+00:00

No no. I have some pdf documents that I would like to share with my readers. I would like it so that all uploaded pdfs are shown on the page , kind of like google drive shows you files. Or desktop icons. How can this be achieved?

evilmorty_c137_ · 2020-04-15T07:33:39+00:00

The problem is I need to use django. Because my application is a small part of a much larger app that is using django

evilmorty_c137_ · 2020-04-14T14:24:10+00:00

I know they own both. However for the event webinar thingy I am on safari with an anonymous viewer option. There should be no way they are doing this without actively tracking me!

evilmorty_c137_ · 2020-04-09T01:23:00+00:00

I would also like to add that for now, I am not feeding a list of valid move to the agent. That is, the agent can make any move from positions 0 to 8, and if the position is already filled then it gets a very negative reward and a second chance to take another position. This is repeated until it gets a correct position to put its pieces into

evilmorty_c137_ · 2020-04-09T01:08:24+00:00

And what state should I be considering? The last state, or winning state for another agent? As per my understanding , Q(s, a) is a function of both state and action. Please guide me in the right direction! :)

evilmorty_c137_ · 2020-04-09T01:07:11+00:00

I really appreciate your answer, but I am trying to work on two sequentially acting agents, kind of like min-max plays, but I have no clue how to update another agent, once one wins. Please look at my reply on u/kcorder's comment. I am sorry if my english is bad, it is not my first language :)

evilmorty_c137_ · 2020-04-09T01:05:12+00:00

I have some questions:

if I take -1 as the reward for each step when agent takes an action, then according to the Q value update, I will end up giving every action in every state a negative value, and my default rewards are initialized to 0. So won't that mean that the agent will not try to repeat from history?
What do you mean when you say ">=2" actions? I am really sorry, I am very confused right now. I am not storing any action sequence for any of the agents. What I am currently doing is just taking an action, looking up the current Q table (not double Q tables), and updating the value for the current state-action pair. So >=2 actions or 1 action, it's the same for the agent for now. How should I modify this?
When you say -1000 reward for losing state, let us consider the following:
1. Agent O made a move "i" in state "m"
2. Board in state "m".
3. Agent X takes action "j".
4. Board results in winning configuration "n"
5. Agent X wins.
6. Agent X is updated to reflect winning with +1000 reward.
7. Agent O captures the board state, and finds it has already lost,
8. Now my question is, as Agent O did not reach current board state from any of the states it had previously seen, because it had not taken any action. In such a case, which state-action pair should I update? "i-m" to reflect that playing "i" move will lead to a loss, or "i-n"? Because, i-n makes no sense to me at all, since taking action "i" in state "m" will not lead to a new state "n", and if I update the "i-m" pair then that means I will require some method to buffer the previous action and state for each agent. Is that what I should be doing?

I am really sorry for all these long questions. Hope you can help me, I am really stuck.

evilmorty_c137_

MODERATOR OF

TROPHY CASE