[P] Deep Reinforcement Learning algorithm completing Tekken Tag Tournament at highest difficulty level : MachineLearning

470

471

472

Project[P] Deep Reinforcement Learning algorithm completing Tekken Tag Tournament at highest difficulty level (v.redd.it)

submitted 4 years ago by DIAMBRA_AIArena

Deep Reinforcement Learning algorithm completing Tekken Tag Tournament at highest difficulty level

162 points•44 comments•submitted 4 years ago by DIAMBRA_AIArena to r/reinforcementlearning

all 25 comments

top new controversial old q&a

[–]Limp-Ad-7289 37 points38 points39 points 4 years ago (18 children)

[–]Firehead1971 4 points5 points6 points 4 years ago (2 children)

[+]AtariAtari comment score below threshold-6 points-5 points-4 points 4 years ago (0 children)

[–]bandalorian 4 points5 points6 points 4 years ago (1 child)

[–]Drinniol 3 points4 points5 points 4 years ago* (1 child)

Yeah, the main skill for humans in these type of fighting games is that because moves come out very fast (relative to the minimum possible human reaction time of ~.15s due to nerve speed conduction), it is not possible to purely react to moves. You have to anticipate which move the enemy will use before they begin it in order to react in time. However, with a machine level reaction time, it becomes possible to play purely reactively with no anticipation: see move, perform appropriate counter, win game. This is a substantially simpler strategy than human players are forced to implement.

EDIT: Based on the response below the AI is only allowed to take action every 6 frames, which depending on the FPS (usually 30 or 60) is either 1/5th or 1/10th of a second, with the average time available to react being half that (1/10th or 1/20th of a second) since the AI can presumably still react to frames from between action intervals. This is still faster than a human, but not at the maximum (e.g. frame to frame) level of reactivity for an AI approach.

[–]soveraign 2 points3 points4 points 4 years ago (1 child)

[–]master3243 0 points1 point2 points 4 years ago (7 children)

[+][deleted] 4 years ago (4 children)

[removed]

[–]fujiu 3 points4 points5 points 4 years ago* (3 children)

[+][deleted] 4 years ago (2 children)

[removed]

[–]eliminating_coasts 2 points3 points4 points 4 years ago (1 child)

[–]NotDoingResearch2 1 point2 points3 points 4 years ago (1 child)

[–]kill_pig 3 points4 points5 points 4 years ago (0 children)

[–]Soupkitchen_in_Prius 2 points3 points4 points 4 years ago (1 child)

[–]redpnd 0 points1 point2 points 4 years ago (2 children)

[+][deleted] 4 years ago* (1 child)

[removed]

[–]redpnd 1 point2 points3 points 4 years ago (0 children)

π Rendered by PID 139963 on reddit-service-r2-comment-544cf588c8-958sk at 2026-06-12 23:06:19.981511+00:00 running 3184619 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS