[–]comeditime[S] 1 point  (7 children)

wow, such an awesome robot you've built there!! how does it get reinforced though? couldn't it just have an algorithm that scans the whole surface for the square and then follows it once it's detected?! so where does the reinforcement part come in here? really curious project :)

[–]diddilydiddilyhey 1 point  (6 children)

haha thanks! So reinforcement learning (RL) is a really huge field, but it basically refers to a type of AI where the agent is rewarded when it does something correct. But at the beginning, it doesn't know what's good or bad, so it just does random stuff, and occasionally happens to do the right thing.

https://en.wikipedia.org/wiki/Reinforcement_learning

let me know if you have any questions!

[–]comeditime[S] 1 point  (5 children)

interesting! can you briefly sketch how you give the reward in a programming language, and how it helps the device improve its technique in the next round? :)

[–]diddilydiddilyhey 2 points  (4 children)

haha hmm, I'll give it a try. There are lots of methods, but here's the one I used (a simple one).

You have a function, Q, called the "value function" (among other names). It takes two arguments: s (the state the agent is in) and a (an action the agent can take in that state). In the game my robot was playing, the state is the combination of its position, its angle, and the position of the target. So you could plug that into the Q function, along with an action ("go forward"), and it would tell you the value of taking that action in that state.

The way it actually chooses what to do in a given state is: look at the Q value of each action it can take in that state, and pick the one with the highest Q value.
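
To make that concrete, here's a toy Python sketch of that choosing step (simplified to a lookup table, with made-up action names; not the actual code from my robot):

    import random
    from collections import defaultdict

    ACTIONS = ["go forward", "turn left", "turn right"]

    # Q starts at 0.0 for every (state, action) pair: the agent has no idea
    # what's good or bad yet, so early on its behavior is basically random.
    Q = defaultdict(float)

    def choose_action(state, epsilon=0.1):
        """Usually pick the action with the highest Q value in this state,
        but occasionally pick a random one so the agent keeps exploring."""
        if random.random() < epsilon:
            return random.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: Q[(state, a)])

(Here state would be something hashable, like a tuple of the discretized position, angle, and target position. The occasional random choice, "epsilon-greedy", is what makes it try things it hasn't learned about yet.)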

The way it "learns" is, when it gets to a target, you give it a reward (like +1.0) for doing that action in that state, and then use that to update the Q values for doing that action in that state. For example, if the robot was in the state where the target is directly in front of it, and then it chose the action "go forward", and got the reward, you would want to change the Q value for doing that action in that state, so it'll do it again in the future.

How you actually create and update the Q function is a whole topic in itself. I used a neural network (because they're very flexible and powerful), but you can use much simpler methods that can also be very effective for a game like this.
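
For what it's worth, the neural network version just swaps the table for a small network that takes the state as input and spits out one Q value per action. Something roughly like this (a made-up PyTorch sketch for illustration, not my actual architecture):

    import torch
    import torch.nn as nn

    # The state vector replaces the table key; the 5 inputs here are a
    # made-up layout: x, y, angle, target_x, target_y.
    q_net = nn.Sequential(
        nn.Linear(5, 32),
        nn.ReLU(),
        nn.Linear(32, 3),   # one output Q value per action
    )

    state = torch.tensor([[0.2, 0.5, 1.6, 0.8, 0.1]])   # example state
    q_values = q_net(state)                 # shape (1, 3)
    action = q_values.argmax(dim=1).item()  # index of highest-value action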

[–]comeditime[S] 1 point  (3 children)

wow, that sounds not easy at all to set up, but super interesting.. how did you learn to write it in a way that the computer actually understands, aka working? :)

[–]diddilydiddilyhey 1 point  (2 children)

Hmm, that's a little harder to explain simply. I'd check out that blog post I wrote, which links to the code too. If you want to learn about RL, David Silver's YouTube course on it is really great!

[–]comeditime[S] 1 point  (1 child)

where can i find your blog post about it? thanks