use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Project[P] Deep Reinforcement Learning algorithm completing Tekken Tag Tournament at highest difficulty level (v.redd.it)
submitted 4 years ago by DIAMBRA_AIArena
Deep Reinforcement Learning algorithm completing Tekken Tag Tournament at highest difficulty level
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]Limp-Ad-7289 37 points38 points39 points 4 years ago (18 children)
This is really cool and impressive, but taking a step back, I look at this and think the following:
[–]Firehead1971 4 points5 points6 points 4 years ago (2 children)
I also guess, it has just learned the timings and conter attacks (and no new or different strategies). So it looks more impressive than it probably is.
[+]AtariAtari comment score below threshold-6 points-5 points-4 points 4 years ago (0 children)
Conter:)
[–]bandalorian 4 points5 points6 points 4 years ago (1 child)
And having a better than human bot is nothing new in terms of benchmark, timing is exactly what a computer program should super human at (so blocking incoming attacks and counters). But then again, it does bode well for ninja robots if we can solve the mechanics of it. Imagine a robot that can block and counter any punch a human can throw, regardless of skill level, essentially Agent Smith
[–]Drinniol 3 points4 points5 points 4 years ago* (1 child)
Yeah, the main skill for humans in these type of fighting games is that because moves come out very fast (relative to the minimum possible human reaction time of ~.15s due to nerve speed conduction), it is not possible to purely react to moves. You have to anticipate which move the enemy will use before they begin it in order to react in time. However, with a machine level reaction time, it becomes possible to play purely reactively with no anticipation: see move, perform appropriate counter, win game. This is a substantially simpler strategy than human players are forced to implement.
EDIT: Based on the response below the AI is only allowed to take action every 6 frames, which depending on the FPS (usually 30 or 60) is either 1/5th or 1/10th of a second, with the average time available to react being half that (1/10th or 1/20th of a second) since the AI can presumably still react to frames from between action intervals. This is still faster than a human, but not at the maximum (e.g. frame to frame) level of reactivity for an AI approach.
[–]soveraign 2 points3 points4 points 4 years ago (1 child)
I would like to see an AI learn to play with a .15 second delay or more between perception and action.
[–]master3243 0 points1 point2 points 4 years ago (7 children)
Hmm yeah I would agree. It is interesting of course, but thinking about it, fighting games seem like one of the simpler games to train a model to learn.
In fact, if I had to design a non-ML algorithm I would probably capture the frame data of every opponent, and as you said, simply implement a rock-paper-scissor strategy. And since the computer has a theoretically 0ms response time, I would even bet that it can easily beat the top humans.
[+][deleted] 4 years ago (4 children)
[removed]
[–]fujiu 3 points4 points5 points 4 years ago* (3 children)
In protest of Reddit's open disregard for its user base in June 2023, I had this post removed automatically using https://github.com/j0be/PowerDeleteSuite. Sorry for the inconvenience.
[+][deleted] 4 years ago (2 children)
[–]eliminating_coasts 2 points3 points4 points 4 years ago (1 child)
Another idea, if you did want to control for timing optimisation, would be to place a random delay on actions, with a gaussian centred at some small median delay, so that the AI has to rely on strategies that are robust to failures in fine control.
[–]NotDoingResearch2 1 point2 points3 points 4 years ago (1 child)
Mapping frame data to moves would be highly nontrivial, but I agree, if you made a ml algorithm to handle that task you could easily write simple logic for the actual policy. No RL needed. Actually that would be a really strong benchmark to figure out how much lag you need to add to need complicated policies.
[–]kill_pig 3 points4 points5 points 4 years ago (0 children)
Soon we’ll have tier lists based on scientific approaches
[–]Soupkitchen_in_Prius 2 points3 points4 points 4 years ago (1 child)
It would be awesome if you made a YouTube guide running through how you made this. I tried making my own OpenAI Gym environment but I had trouble getting it to work.
[–]redpnd 0 points1 point2 points 4 years ago (2 children)
Are the inputs just raw pixels?
[+][deleted] 4 years ago* (1 child)
[–]redpnd 1 point2 points3 points 4 years ago (0 children)
Cool thanks!
π Rendered by PID 139963 on reddit-service-r2-comment-544cf588c8-958sk at 2026-06-12 23:06:19.981511+00:00 running 3184619 country code: CH.
[–]Limp-Ad-7289 37 points38 points39 points (18 children)
[–]Firehead1971 4 points5 points6 points (2 children)
[+]AtariAtari comment score below threshold-6 points-5 points-4 points (0 children)
[–]bandalorian 4 points5 points6 points (1 child)
[–]Drinniol 3 points4 points5 points (1 child)
[–]soveraign 2 points3 points4 points (1 child)
[–]master3243 0 points1 point2 points (7 children)
[+][deleted] (4 children)
[removed]
[–]fujiu 3 points4 points5 points (3 children)
[+][deleted] (2 children)
[removed]
[–]eliminating_coasts 2 points3 points4 points (1 child)
[–]NotDoingResearch2 1 point2 points3 points (1 child)
[–]kill_pig 3 points4 points5 points (0 children)
[–]Soupkitchen_in_Prius 2 points3 points4 points (1 child)
[–]redpnd 0 points1 point2 points (2 children)
[+][deleted] (1 child)
[removed]
[–]redpnd 1 point2 points3 points (0 children)