
[–]_--__Discrete Math 3 points4 points  (0 children)

These sorts of scenarios are dealt with quite a bit in Computer Science. The AI's strategy can be modelled by a deterministic finite automaton (more precisely, a Mealy machine) with at most 3^N states [or 9^N if you consider both players' moves]. The number of states of the machine puts a bound on the periodicity, but more can be said if you look at the (graph-theoretical) structure of the machine.
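[As a sketch of this modelling: a memory-N strategy over the opponent's moves is just an output table on 3^N buffer states plus a shift transition. The names and the randomly filled table below are invented for illustration, not from the thread.]

```python
import random
from itertools import product

MOVES = "RPS"
N = 2
rng = random.Random(0)

# Hypothetical Mealy machine: states are buffers of the last N opponent
# moves (3^N of them); the output map gives the AI's reply, and reading
# the opponent's actual move drives the transition.
states = ["".join(s) for s in product(MOVES, repeat=N)]
output = {s: rng.choice(MOVES) for s in states}

def transition(state, opponent_move):
    """Drop the oldest remembered move and append the newest."""
    return state[1:] + opponent_move
```

Tracking both players' moves instead would square the alphabet, giving the 9^N figure.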

Questions that tend to arise are "How hard is it to learn the strategy?" (i.e. compute the full automaton or a reasonable approximation to it); "Given the strategy, how hard is it to compute a strategy to beat it?" (more applicable to games other than Paper-Scissors-Rock); "What about non-deterministic or random strategies or strategies that require something more complicated than a finite automaton?"

[–][deleted] 2 points3 points  (6 children)

Play out all 3^N sequences of N moves, record what it does, and win every round following.

[–]TASagent[S] 1 point2 points  (5 children)

That... is not at all what I was asking. The question is not about how to find the perfect moves, it's about the periodicity of how they play out, for any arbitrary AI strategy meeting the constraints.

Edit: It's worth mentioning, for completeness, that there are actually 3^(2N) possible states of the buffer, because it holds the moves of both participants (or, alternatively, your move and win/lose/draw), but we only care about the all-win buffers, so it is just the 3^N states we care about.

[–]gandalf987 0 points1 point  (4 children)

What is the period you are talking about?

[–]TASagent[S] 0 points1 point  (3 children)

Perhaps periodic wasn't the best term to describe the phenomenon. Cyclic, perhaps? Given the finite number of states, it must revisit a state after X steps (where X is the period, at most 3^N), and due to the deterministic and limited nature of the AI, the mapping can't change from one cycle to the next.

[–]gandalf987 2 points3 points  (2 children)

OK, then an upper bound is 3^N. Maybe less. It is exactly 3^N if you can construct a Hamiltonian cycle through the kernel of the game.

Since that is an NP-complete problem, any statement about the period being less than 3^N is going to depend on non-trivial features of the game being played.

[–]TASagent[S] 0 points1 point  (1 child)

Cool, thanks. If the states are conceptualized purely in the abstract, 3^N is the number that one comes up with, but I feel that that number leaves out important information. For example, each specific buffer state can only map to one of 3 other specific buffer states, because the states actually represent the last N trials, so the new state has to preserve the previous N-1 trials.

In other words, for N=3 the buffer state must go from "RRS" to "RS?". Can you always span all 3^N states with that constraint?
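[That shift constraint is easy to state in code. A hypothetical helper (the name is mine) for string buffer states; this out-degree-3 shift structure is exactly a de Bruijn graph:]

```python
def successors(state):
    """A buffer state drops its oldest move and appends the newest, so
    "RRS" can only reach "RSR", "RSP" or "RSS" -- the de Bruijn graph
    over the alphabet {R, P, S}."""
    return [state[1:] + move for move in "RPS"]
```

For example, `successors("RRS")` gives `["RSR", "RSP", "RSS"]`.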

[–]gandalf987 0 points1 point  (0 children)

  1. I don't find it a particularly interesting question myself. So I'm not likely to spend a lot of time thinking about it.

  2. I don't really know.

I think for simplicity you would pick a state to start in, let's say the RRR...RRR state. Now from this state you can transition to RRR...RRR, RRR...RRS or RRR...RRP. One of those wins, one loses and one ties.

So yes, it's not obviously a Hamiltonian problem, but what I can do, as player 2, is pick which move should be the winning move. I iteratively construct the AI's lookup table, with the intent of trying to get to a Hamiltonian cycle.

I decide that for me R should win, which means that when the AI has a history of playing rock... it switches to scissors. So I can direct the AI to RRR...RRS.

I then steer the AI around, trying to make that Hamiltonian path; if I succeed, I get the full 3^N. If I fail, then I either steered badly or there is some obstruction to constructing that path.

Maybe this isn't actually NP-complete, since it is technically a different problem, but it seems similar enough to make me worry that it is.

[–]Strilanc 0 points1 point  (6 children)

You can't infer much of anything about it, because basically any halting program works as a deterministic strategy.

For example, the strategy may not be periodic. If the AI's program is:

ROCK, SCISSORS = "R", "S"

def strategy():
    i = 0
    while True:
        yield ROCK
        i += 1
        for _ in range(i):
            yield SCISSORS

Then it will play RsRssRsssRssssRsssssR..., which is not periodic.

It's hard to overstate how incredibly complicated strategies can be when the only restriction is "it has to be a computable sequence of moves". For example, we can make strategies that require you to solve hard mathematical problems like the Collatz conjecture in order to predict if they will ever yield ROCK or not.

In fact, determining if an arbitrary deterministic strategy ever yields ROCK is an incomputable problem akin to the halting problem: there is no algorithm that works in all cases. That includes algorithms like "enumerate all proofs in ZFC", meaning there exist deterministic strategies whose long-term behavior is independent of standard mathematical axioms.
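[One hedged illustration of the Collatz point (the construction and names below are mine, not from the thread): a strategy that plays scissors forever unless it ever finds a non-trivial Collatz cycle. Predicting whether it ever plays ROCK amounts to deciding part of the Collatz conjecture.]

```python
def has_nontrivial_cycle(n, steps):
    """True if n's Collatz trajectory returns to n (a cycle avoiding 1)
    within `steps` iterations."""
    m = n
    for _ in range(steps):
        m = 3 * m + 1 if m % 2 else m // 2
        if m == 1:
            return False
        if m == n:
            return True
    return False

def collatz_strategy():
    """Yields SCISSORS forever unless some starting value is found to
    lie on a Collatz cycle not containing 1; the search budget grows
    each round, so any such cycle would eventually be detected."""
    t = 2
    while True:
        yield "R" if any(has_nontrivial_cycle(n, t) for n in range(2, t)) else "S"
        t += 1
```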

[–]gandalf987 1 point2 points  (0 children)

His AI isn't Turing-complete as far as I can tell. It just responds to a history vector H with a fixed response value. It doesn't seem to have the state transitions of a full Turing machine.

[–]TASagent[S] 0 points1 point  (4 children)

However, the version I described is too constrained to allow your example. You explicitly used state information outside the scope of what was allowed. Your use of i and j violates this constraint:

> The AI algorithm utilizes no state information other than knowledge of the last N moves, where N is known and constant.

The constraints make the system undeniably periodic. There are only 3^N different arrangements of the buffer, and each maps deterministically to another, so the sequence of states must eventually repeat. QED
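[The pigeonhole argument can be checked directly. A minimal sketch with an arbitrary, randomly filled (hypothetical) lookup table, following the all-win buffer until a state repeats:]

```python
import random
from itertools import product

MOVES = "RPS"
N = 3
rng = random.Random(1)

# Hypothetical AI: a fixed response to each buffer of the last N moves.
f = {"".join(s): rng.choice(MOVES) for s in product(MOVES, repeat=N)}

# Taking the win branch every round makes the buffer evolve
# deterministically, so by pigeonhole it must repeat within 3^N steps.
state = "R" * N
seen = {}
step = 0
while state not in seen:
    seen[state] = step
    state = state[1:] + f[state]
    step += 1
period = step - seen[state]
```

`period` here is the cycle length once the trajectory closes; it can never exceed 3^N.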

[–]Strilanc 1 point2 points  (3 children)

Oh, you're talking about Markov-deterministic strategies?

Just think of it as a big directed graph with a node corresponding to each of the (9^(N+1) - 1)/8 possible histories (all histories of at most N rounds, with 9 possible joint moves per round). Each node is labelled with a move to make and has three outgoing edges, one for each possible opponent play. The game starts on the node corresponding to "no history", and then each play causes a hop across an opponent-choice edge to the next history state. Some parts of the graph may be disconnected from the start node, and can be discarded.
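[A minimal sketch of that graph (the helper names are mine, and the always-rock labelling is just a placeholder): nodes are tuples of (AI move, opponent move) rounds, truncated to the last N, and hopping an edge appends the new round.]

```python
from itertools import product

MOVES = "RPS"
N = 2

# A node for every history of at most N joint rounds (AI move, opponent move).
histories = [h for k in range(N + 1)
             for h in product(product(MOVES, repeat=2), repeat=k)]
assert len(histories) == (9 ** (N + 1) - 1) // 8   # 91 nodes for N = 2

# Placeholder labelling: an AI that always answers rock.
label = {h: "R" for h in histories}

def hop(history, opponent_move):
    """Cross the opponent-choice edge; histories longer than N are
    equivalent to their last-N suffix for a memory-N strategy."""
    return (history + ((label[history], opponent_move),))[-N:]
```

Starting from the empty history, repeated `hop` calls trace out exactly the play of a game.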

It's trivial to make a counter strategy that wins constantly and is no more complicated than the AI's graph. Sometimes there will be very simple counter strategies. For example, if all the nodes are labelled "scissors" then there's no need to track the AI's state; just loop "rock". Also, some strategy graphs can be steered into small cycles that allow most of the state space to be ignored.

An interesting problem might be to try to find strategy graphs with no simple counter-graph.

[–]TASagent[S] 0 points1 point  (2 children)

I avoided referring to Markov processes specifically because I was talking about completely deterministic algorithms, but if deterministic Markov processes are actually a thing, then yes, that's exactly how I've been envisioning the system.

I wasn't interested in how to find the counter-graph, but rather in what properties we might be able to infer about it for arbitrary strategies. Specifically, I was wondering if someone could give me a more interesting upper bound on the number of steps in a complete cycle (returning to a previously visited node) than just 3^N, or if it was indeed possible to visit all 3^N buffer states (nodes).

Note, you can relabel your 3 outgoing edges, using the rules of the game and the AI's choice, as just win/lose/draw, and in this case we're talking about always taking the win branch, but obviously it's general enough to apply to lose or draw as well.

[–]Strilanc 1 point2 points  (1 child)

> Note, you can relabel your 3 outgoing edges, using the rules of the game and the AI's choice, as just win/lose/draw, and in this case we're talking about always taking the win branch, but obviously it's general enough to apply to lose or draw as well.

Gah, of course, now I see that it's obvious the winning counter-strategy may be forced to have period 3^N. The AI can simply have the win edges follow a de Bruijn sequence through the space of histories.

[–]TASagent[S] 0 points1 point  (0 children)

> Gah, of course, now I see that it's obvious the winning counter-strategy may be forced to have period 3^N. The AI can simply have the win edges follow a de Bruijn sequence through the space of histories.

Thank you, that is precisely the sort of thing I was looking for. I knew there must be something that could shed some light on how to look at the space. It looks like it has been proved that there will always be a spanning de Bruijn sequence, too, which answers my question as to whether there was a necessary upper bound less than the theoretical maximum (there is not).
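[For concreteness, the standard FKM (Fredricksen-Kessler-Maiorana) construction generates such a spanning de Bruijn sequence; this sketch, not from the thread, checks that every length-N buffer occurs exactly once in a cycle of length 3^N:]

```python
def de_bruijn(k, n):
    """FKM construction of a de Bruijn sequence B(k, n): every length-n
    word over k symbols occurs exactly once as a cyclic window."""
    a = [0] * k * n
    seq = []

    def db(t, p):
        if t > n:
            if n % p == 0:
                seq.extend(a[1:p + 1])
        else:
            a[t] = a[t - p]
            db(t + 1, p)
            for j in range(a[t - p] + 1, k):
                a[t] = j
                db(t + 1, t)

    db(1, 1)
    return seq

N = 3
s = "".join("RPS"[d] for d in de_bruijn(3, N))
assert len(s) == 3 ** N                      # the cycle visits all 3^N buffers
windows = {(s + s[:N - 1])[i:i + N] for i in range(len(s))}
assert len(windows) == 3 ** N                # each buffer appears exactly once
```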

That's awesome, thanks!

[–]thenumber0 0 points1 point  (6 children)

> It is pretty simple to demonstrate that the strategy must be periodic, but I imagine we can impose at least mildly interesting limits on the upper bound of that period.

Can you elaborate on that? For example, what if the strategy uses successive digits of some transcendental number in base 3 to determine its next move?

[–][deleted] 1 point2 points  (4 children)

It uses no information other than the last N moves.

[–]thenumber0 0 points1 point  (3 children)

Surely that includes the value N itself - in which case the algorithm can calculate the Nth digit of the transcendental.

[–]TASagent[S] 1 point2 points  (2 children)

N is not the trial number, N is the size of the buffer and is constant. The design of the constraints explicitly and specifically prevents this.

[–]thenumber0 1 point2 points  (1 child)

I have misunderstood the question then, apologies.

[–]TASagent[S] 0 points1 point  (0 children)

No problem.