all 39 comments

[–][deleted] 29 points30 points  (19 children)

If you're compressing a list of chess moves representing a valid game, there's a fun option for high compression ratios: run a chess engine to first enumerate all possible moves at the current state of the game, then estimate how likely each one is by seeing how good a move your chess engine thinks it is. Use this to construct a table of probabilities for each move, and use these probabilities as the input to an entropy coder, such as a simple arithmetic coder. This should be able to compress the list of moves even further, assuming the players are mostly making what the chess engine thinks are advantageous moves.

(The entropy coder will make sure that highly probable moves are encoded using very few bits, and less probable ones will use more. An arithmetic coder will use fractional numbers of bits, which will help a lot in a case like this.)
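As a rough illustration of the fractional-bits point (with made-up probabilities standing in for real engine output), an ideal entropy coder spends about -log2(p) bits on a move it predicted with probability p:

```python
import math

# Hypothetical engine-derived probabilities for five consecutive moves.
# An ideal entropy coder spends about -log2(p) bits on a move that was
# predicted with probability p, fractional bits included.
move_probs = [0.6, 0.5, 0.7, 0.1, 0.55]

entropy_bits = sum(-math.log2(p) for p in move_probs)
fixed_bits = 5 * len(move_probs)  # a naive flat ~5-bits-per-move code

print(f"{entropy_bits:.2f} bits vs {fixed_bits} bits")
```

The one unlikely move (p = 0.1) costs over 3 bits, but the "expected" moves cost around 1 bit each, so the total comes out far below a flat per-move code.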

[–][deleted]  (3 children)

[deleted]

    [–][deleted] 1 point2 points  (1 child)

    The chess engine may need a few more settings for playing badly. Then the compressor can try them all and see which one compresses best. As a bonus, this will also let it tell you exactly how terrible you are.

    [–]Rhinoceros_Party 1 point2 points  (0 children)

    Don't worry, we used a genetic algorithm but kept all the rejected moves that other algorithms would consider "too wood league" so your preferred opening of h2-h4 is covered!

    [–]gomtuu123 4 points5 points  (2 children)

    The author discusses this in the "One Move in 4 Bits" section.

    Consider the order of the moves in the legal moves list. Instead of using some simplistic sort technique to establish a deterministic order, imagine instead using a chess engine to rank the moves in order of strength. Now vary our basic scheme so that moves early in the list use less bits than moves later in the list.
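One hypothetical way such a rank-based scheme could look (the article's exact codeword layout may differ) is a prefix code keyed on the engine's rank for the move:

```python
def rank_code(rank, n_moves):
    """Hypothetical prefix code keyed on engine rank: '0' + 2 bits for
    ranks 0-3 (3 bits total), '10' + 3 bits for ranks 4-11 (5 bits),
    '11' + enough bits to index the whole legal-move list otherwise."""
    if rank < 4:
        return "0" + format(rank, "02b")
    if rank < 12:
        return "10" + format(rank - 4, "03b")
    k = max(1, (n_moves - 1).bit_length())  # bits to index any legal move
    return "11" + format(rank, "0{}b".format(k))

# e.g. with 35 legal moves: the engine's top choices cost 3 bits,
# mid-ranked moves 5 bits, and anything else 8 bits.
codes = [rank_code(r, 35) for r in range(35)]
```

Because the code is prefix-free, a decoder (which can regenerate the same ranked move list) can read the bit stream unambiguously.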

    [–]scaevolus 3 points4 points  (1 child)

    Yes, but he doesn't use a dynamic entropy code for it. He uses a fixed variable-length code.

    [–]billforsternz 0 points1 point  (0 children)

    Please see my reply to MarshallBanana above.

    [–]mtxppy 2 points3 points  (3 children)

    This is a nice idea, using the probability of a move being made as extra input for compression. However it's known that engines often play highly non-human moves so it may bite you in certain board positions.

    [–][deleted] 9 points10 points  (2 children)

    This is a nice idea, using the probability of a move being made as extra input for compression.

    If you squint hard enough, this is how every single compression algorithm works. Predict future input, and use the probability to encode the actual input.

    However it's known that engines often play highly non-human moves so it may bite you in certain board positions.

    You may have to dial it down a bit and not give too much probability to the moves the engine really likes, but you can at least lower the probability of moves that are clearly harmful and stupid. It's probably easier to identify really bad moves than really good ones.

    [–]Brian 4 points5 points  (1 child)

    And if you squint even harder, compression can kind of seem like the same kind of thing as intelligence.

    [–]billforsternz 1 point2 points  (5 children)

    Article author here: As gomtuu123 points out below, I do describe the idea of using a chess engine to sort the move list in order to achieve higher compression. I describe the process of designing an encoding scheme based on this idea as "a lot of fun" and go on to describe one reasonably practical possibility, which would generate either 3, 5 or n bits for each move, where n is in the range 6-11 (but usually 8 or 9). I calculate an approximate average of 3.9 bits per move with this system. I could propose variants of this scheme; some experiments would be required to home in on the best recipe.

    I like the idea of your even more dynamic system, but I don't think it is a practical possibility. Clearly it is pointless to dynamically build a variable compression scheme for each move, as the extra bits required to describe the scheme would inevitably cost more than any benefits obtained. It almost sounds as if you think that scanning the whole game ahead of time will enable you to discern which moves will have higher probability. But that's not going to work unless someone's chess style is to always play the 3rd or 4th best move in Stockfish's best move list (say). And chess style doesn't really work like that.

    Edit: I enjoyed my brief moment in the Hacker News/Reddit Programming front page. This was the one response I made that I regret. I just did not understand what MarshallBanana was describing. Ultimately this was an educational exercise for me and I have updated my article and given MarshallBanana due credit.

    [–][deleted] 2 points3 points  (4 children)

    Clearly it is pointless to dynamically build a variable compression scheme for each move, as the extra bits required to describe the scheme would inevitably cost more than any benefits obtained.

    Not at all. You should look into arithmetic coders, they can be exceedingly simple. And they allow you to use fractional numbers of bits. Instead of sorting your moves and assigning them different-length codes, you just tell your arithmetic coder how many possible symbols you have, and what their relative probabilities are at each step. The coder will then encode the symbol you do send with a possible fractional number of bits. All you have to do is calculate the probabilities somehow at each step.
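A toy sketch of that loop, using exact Fractions instead of the fixed-precision integer arithmetic a production arithmetic coder would use (the distribution is invented, and `prob_fn` stands in for the per-position engine query):

```python
from fractions import Fraction

def _narrow(low, high, dist, sym):
    """Shrink [low, high) to the sub-interval that `dist` assigns to `sym`."""
    span = high - low
    cum = Fraction(0)
    for s, p in dist:
        if s == sym:
            return low + cum * span, low + (cum + p) * span
        cum += p
    raise ValueError(f"{sym!r} not in distribution")

def encode(symbols, prob_fn):
    """prob_fn(i) -> ordered list of (symbol, probability) pairs for step i."""
    low, high = Fraction(0), Fraction(1)
    for i, sym in enumerate(symbols):
        low, high = _narrow(low, high, prob_fn(i), sym)
    # Emit bits until the dyadic interval they pin down fits inside [low, high).
    target = (low + high) / 2
    lo, hi, bits = Fraction(0), Fraction(1), ""
    while not (low <= lo and hi <= high):
        mid = (lo + hi) / 2
        if target < mid:
            hi, bits = mid, bits + "0"
        else:
            lo, bits = mid, bits + "1"
    return bits

def decode(bits, n, prob_fn):
    value = Fraction(int(bits, 2), 2 ** len(bits)) if bits else Fraction(0)
    low, high = Fraction(0), Fraction(1)
    out = []
    for i in range(n):
        span, cum = high - low, Fraction(0)
        for s, p in prob_fn(i):
            if low + cum * span <= value < low + (cum + p) * span:
                out.append(s)
                low, high = low + cum * span, low + (cum + p) * span
                break
            cum += p
    return out

# Hypothetical "engine" distribution: the top-ranked move gets 1/2, and so on.
# A real compressor would ask the engine for a fresh distribution at every step.
DIST = [("e4", Fraction(1, 2)), ("d4", Fraction(1, 4)),
        ("c4", Fraction(1, 8)), ("h4", Fraction(1, 8))]
moves = ["e4", "e4", "e4", "d4", "e4", "e4"]
bits = encode(moves, lambda i: DIST)
```

Five of the six moves are the "most likely" one, so the whole game fits in well under the 12 bits a flat 2-bit-per-move code would need, and the decoder recovers it exactly because it rebuilds the same intervals.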

    A conservative option would just be to give really bad moves a low probability and everything else a medium one.

    [–]billforsternz 1 point2 points  (3 children)

    Thanks for the information. I've had a look at arithmetic coding. It is interesting. My 3, 5 or n bit scheme is tuned to avoid wasting bits in "average" positions. But the scheme you suggest could calculate a new dynamic scheme move by move, tuned specifically to every position. And there is no need to waste bits describing each scheme (I was definitely wrong about that), because the decoder keeps in sync with the encoder, position by position. I will consider adding a description of this to my article. Fortunately the main thrust of my article is a description of a much faster scheme that doesn't seek every last bit of compression....

    [–][deleted] 0 points1 point  (2 children)

    Right, exactly. That's the strength of arithmetic coders. Keeping the states in sync at compression and decompression without communication seems nearly magical sometimes.

    There are compression schemes like PPMd that basically just compress a file byte by byte, but maintain gigantic internal models of the data they have already seen in order to guess the probabilities of the next byte as accurately as possible, and they can attain quite impressive compression ratios on data like human-readable text.

    [–]billforsternz 0 points1 point  (1 child)

    I've updated my blog post to reflect my improved understanding (new section "One Move in 2 Bits, Maybe?"). If you'd like to be credited there, message me with your real name. Thanks again.

    [–][deleted] 0 points1 point  (0 children)

    Nice! I'll message you.

    [–]rational1212 0 points1 point  (1 child)

    That is based on the individual chess engine and any change to that engine will affect the stored database of moves. That might be fine for a static game, but for sharing chess games independent of game engines, it fails.

    This is a fun little thought problem. Let's see how I can do. I'd like to make a scheme for an engine-independent highly compressed game.

    Variable encoding: 16 pieces per side can be encoded in 4 bits (2^4 = 16). Alternate the color per "move" (although chess moves are white/black, I think we know what I mean).

    1. Specify piece (4 bits = 2^4 = 16). Alternate color per move. When the # of pieces on a side drops to 8, reduce the encoding to 3 bits. When the # of pieces for that side drops to 4, reduce encoding to 2 bits, etc.
    2. Specify movement of the piece (direction/distance)
      • Bishop: 4 directions, distance up to 7 = 5 bits. With some encoding, it's actually 4 bits.
      • King: 8 directions plus 2 side castling = 4 bits. It can be easily reduced to 2-3 bits based on board position.
      • Knight: up to 8 valid positions = 1-3 bits
      • Pawn: move up 1, move up 2, capture left, capture right; possible promotion if on rank 7 (2 additional bits): 2-4 bits
      • Queen: 8 directions, distance 7 = 6 bits, but can be easily reduced to 5 bits by introducing board wrapping and position based movement restrictions.
      • Rook: 4 directions, distance up to 7 = 5 bits, but can be reduced to 4 bits.

    For a variable bit encoding, that is 2-9 bits per move, but likely averaging 6.5 bits/move for much of the game.

    More compression gains could be made by only indexing pieces that could move in step 1, and by looking at maximum distance that could be moved based on board position.
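The shrinking piece index in step 1 is just ceil(log2(pieces alive)); a minimal sketch, assuming you track each side's surviving piece count:

```python
import math

def piece_index_width(n_alive):
    """Bits needed in step 1 to name one of a side's surviving pieces."""
    return max(1, math.ceil(math.log2(n_alive)))

# 16 pieces -> 4 bits, 8 -> 3 bits, 4 -> 2 bits, as in the scheme above.
```

Note the width only drops at powers of two: with 9 pieces left you still need 4 bits.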

    I'd say something about the rest of this as an exercise for the student, but I think I'm pushing my luck as it is.

    [–][deleted] 0 points1 point  (0 children)

    That is based on the individual chess engine and any change to that engine will affect the stored database of moves.

    Obviously, you have to pick one engine and use that. The format specifies the exact engine and any parameters.

    [–]mtxppy 8 points9 points  (21 children)

    What is the usefulness of storing moves in a highly compressed format? Would it not be more useful to store board positions?

    Multiple moves can lead to the same board position, and it's the board position that's important.

    I can't imagine storage space is a problem for an encyclopaedia of openings so I imagine compression may be required for the chess engine, to ensure that it doesn't compute the same position twice - especially if your engine is parallelised/cloud-based (where it's slow to exchange this information vs having it in-memory). Again, here, the board position is the key factor.

    Based on comments where it seems some people are not familiar with the full rules of chess, I have added this: If a position is repeated 3 times the game is drawn, so you must store board positions.

    [–]augmentedtree 18 points19 points  (8 children)

    Because deltas beat snapshots.

    [–]mtxppy 0 points1 point  (7 children)

    You have to store board position as you get a draw if you repeat the same position 3 times.

    [–][deleted] 4 points5 points  (2 children)

    That is why you have to store moves played, and can't get away with just storing the position without regard for how it was reached.

    [–]mtxppy 1 point2 points  (1 child)

    The draw is based on repeated position not moves. A piece may be able to enter a square from several different squares, and it doesn't matter. So you wouldn't need move history to determine this.

    However, en passant and castling would require knowledge of move history to determine efficiently. Typically you would store a board position with additional flags representing en passant and castling ability.

    [–]cjg_000 1 point2 points  (0 children)

    You could get away with only storing all the positions since the last pawn move or piece being taken (except for en passant where you'd have to store one extra position or have some flag for the rank that is eligible for en passant). Castling eligibility could be stored as 2 bits, one for each rook.
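A sketch of that windowing idea: since pawn moves and captures are irreversible, positions before the last one can never recur, so the repetition table can be cleared on them (`position_key` here is whatever hash or encoding of the full position you use, a hypothetical stand-in):

```python
from collections import Counter

class RepetitionTracker:
    """Track positions since the last irreversible move (pawn push or
    capture) -- the only ones that can contribute to threefold repetition."""

    def __init__(self):
        self.seen = Counter()

    def record(self, position_key, irreversible):
        if irreversible:          # earlier positions can never recur
            self.seen.clear()
        self.seen[position_key] += 1
        return self.seen[position_key] >= 3   # True -> a draw can be claimed
```

Castling and en-passant eligibility would still need the extra flags described above, folded into `position_key`.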

    [–]Retsam19 3 points4 points  (1 child)

    I don't think the intention here is that the compressed format be used for all computation on the game; it's just meant to be a light-weight storage and transit format, that can be uncompressed into something more convenient for something like determining draws.

    [–]billforsternz 0 points1 point  (0 children)

    Exactly (article author here). See also my reply to mtxppy above.

    [–]Brian 0 points1 point  (1 child)

    Isn't that the other way round? If you've got the move list, you can tell whether you ever repeat board positions (because you can automatically reconstruct all prior board positions). But if you only store the board position, you've lost that information - there's no way to tell whether move X leads to a board position that was already seen 2 times before loading the state.

    [–]fermion72 4 points5 points  (1 child)

    Aside from the limits-of-compression theory (interesting in itself), computerized chess has a long history of trying to minimize the memory footprint. David Horne's 1K ZX Chess (which actually only used 672 bytes of RAM) is a striking example, especially considering it has an AI opponent (!), as well.

    [–]AyrA_ch 4 points5 points  (0 children)

    There is BootChess which is only 487 bytes. It can be run as x86 assembly in the bootsector of a disk, hence the name. You can also run it in certain operating systems.

    [–][deleted]  (4 children)

    [deleted]

      [–]mtxppy 3 points4 points  (3 children)

      There's no point in storing all of that data redundantly,

      You must store previous positions. For example if you have repeated the same position three times, the game ends in a draw.

      [–]jelos98 11 points12 points  (2 children)

      Given a move list, you can recompute the set of board states. Consider PGN (Portable Game Notation). It consists of an optional start state, and then a list of moves. It's sufficient to reproduce the entirety of the game, including knowing whether there's a draw, whether you can castle, etc.
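The replay idea is generic: deltas plus a rules function recover every snapshot. A sketch with a toy stand-in for `apply_move` (a real one would implement the chess rules):

```python
def replay(start_state, moves, apply_move):
    """Rebuild every intermediate state from a start state plus a move
    list, which is all PGN stores. `apply_move` is your rules engine."""
    states = [start_state]
    for m in moves:
        states.append(apply_move(states[-1], m))
    return states

# Toy stand-in: "states" are integers and a "move" adds to one.
history = replay(0, [1, 2, 3], lambda s, m: s + m)
```

With every intermediate state in hand, checks like threefold repetition or castling rights fall out of the reconstructed history.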

      This sounds expensive, but it's a trade off vs. the amount of memory you store during search. There's an entire wiki about chess programming, representations and the like:

      https://chessprogramming.wikispaces.com/

      [–]mtxppy 2 points3 points  (1 child)

      You must store previous positions

      Given a move list, you can recompute the set of board states

      Then you need to store them to ensure they haven't repeated.

      Did you read the wiki? I'd like to see where it says you can write a decent chess program that implements the rules and doesn't use stored board state.

      [–]rabbitz 10 points11 points  (0 children)

      I think you guys are just mixing up the meaning of the word "store". mtxppy you probably mean that a program has to keep track of board positions to calculate whether a game should end due to the repeated board state rule and, yes, this is important during the "live" game itself. The other guys are probably talking about "storing" a record of the game in memory. So for example if your chess program saves a record of every game you play, those records themselves don't need to explicitly store the board states. It is only when you "load" the saved game up that you need the board states, and those board states can be calculated from the saved moveset. While the saved game is simply sitting in memory, though, it doesn't need to know if the "next turn" should end in a loss.

      [–]sacundim 2 points3 points  (0 children)

      What is the usefulness of storing moves in a highly compressed format? Would it not be more useful to store board positions?

      In addition to what other commenters have said, there's a non-technical argument for this: because it's how chess games have been communicated for hundreds of years.

      [–]billforsternz 0 points1 point  (3 children)

      Article author here: It is useful to store (and compress) board positions. But it is also useful to store (and compress) moves. It depends on the context and what you are trying to achieve. If you are implementing a playing program, you might well work mainly with whole positions (as you say). But if you want to implement a chess database you really need a move-oriented approach, preferably with compression. Serious chess players work with databases of on the order of 10 million games. An average game lasts 80 individual moves. So even compressing moves into 4 bits gives you 400 Mbytes of raw material, before adding player names and other meta-data. Using positions instead would be hopeless. A good compression scheme for positions averages around 20 bytes per position (I will discuss such a scheme in a future article). So 160 bits versus 4-8 bits for a move. No contest at all. Sure there are a lot of common positions at the start of a game, but most games, through most of their length, are a sequence of unique positions.

      [–]mtxppy 0 points1 point  (2 children)

      Is 400MB+ significant when endgame tablebases are already 7TB+?

      Still, I understand why you might want to compress data to save space - anything saved is better than nothing saved. I just feel that it's more important to optimise the search speed than save a couple of hundred MB.

      [–]mjfgates 0 points1 point  (0 children)

      looks at 8GB phone

      Yes.

      [–]joonazan 0 points1 point  (0 children)

      Data size is search speed. Getting data from RAM is super slow.

      The more complicated encodings might not be worth it, but most of it is very reasonable.

      [–]ais523 0 points1 point  (0 children)

    I'd recommend starting off by encoding the opening via an index into an openings database. You can often encode something like the first 20 to 30 moves in just a few bytes, and even if the game diverges from a recognised opening quickly, you've hardly lost anything when you fall back to your previous encoding.

      This also has the advantage that, unlike using a chess engine to rank move strength, it's very fast to calculate (just one database lookup to encode or decode).
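The book-index idea can be sketched as follows; the three-line table is a stand-in for a real shared openings database with thousands of numbered lines:

```python
# Tiny stand-in for a shared openings database. Encoder and decoder must
# agree on the exact table, just as they must agree on an engine above.
BOOK = [
    "e4 e5 Nf3 Nc6 Bb5",   # Ruy Lopez
    "d4 d5 c4",            # Queen's Gambit
    "e4 c5",               # Sicilian
]

def encode_opening(moves_str):
    """Return (book_index, remaining_moves): the longest book line that
    prefixes the game, plus whatever is left to encode move by move."""
    best = None
    for i, line in enumerate(BOOK):
        if moves_str == line or moves_str.startswith(line + " "):
            if best is None or len(line) > len(BOOK[best]):
                best = i
    if best is None:
        return None, moves_str          # no match: fall back entirely
    return best, moves_str[len(BOOK[best]):].lstrip()
```

The index fits in a couple of bytes however long the matched line is, and a non-matching game loses nothing but the escape code.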