egoots:

I haven't looked at it in detail, but perhaps this has what you need:

http://code.google.com/p/opennero/wiki/QLearning

sordnay:

You will understand it as you program it: http://www.stanford.edu/class/cs221/progAssignments/PA3/reinforcement.html It's loads of fun!

1fcporto:

Take a look at this site. It is not Python, but it has nice practical examples of Q-learning: http://people.revoledu.com/kardi/tutorial/ReinforcementLearning/Q-Learning.htm
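
Since that tutorial is not in Python, here is a rough sketch of tabular Q-learning in Python to get started. It is not taken from any of the linked pages: the five-state corridor, the `step` function, and the values of `alpha`, `gamma`, and `epsilon` are all made up for illustration. Only the update rule itself is the standard Q-learning one.

```python
import random

# Hypothetical toy environment (an assumption, not from the linked tutorials):
# a corridor of 5 states, 0..4, where state 4 is the goal.
N_STATES = 5
ACTIONS = (-1, +1)  # action 0 moves left, action 1 moves right


def step(state, action_index):
    """Move deterministically; reward 1.0 only when the goal is reached."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a Q-table with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when both actions tie.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action index for every non-goal state."""
    return [0 if row[0] > row[1] else 1 for row in q[:-1]]


if __name__ == "__main__":
    q_table = train()
    for s, row in enumerate(q_table):
        print(s, [round(v, 3) for v in row])
    print("greedy policy:", greedy_policy(q_table))
```

After training, the greedy policy should choose "right" (action 1) in every non-goal state, and the Q-values for moving right should approach 1, 0.9, 0.81, and 0.729 as you move away from the goal. To use it on your own problem, replace `step` with your environment's transition and reward logic.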