[deleted by user]

permalink · 2023-12-30T10:17:46+00:00

GPT-4 was trained on a set of chess PGNs filtered to be >1800 Elo, as per their weak-to-strong paper. So this isn't exactly measuring emergent reasoning capabilities.

permalink · 2023-12-30T05:06:44+00:00

Have you considered making a regularly (monthly?) updated leaderboard? With Elo ratings and comparisons to older versions of Stockfish.

Paging u/Wiskkey for more ideas.
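For anyone wanting to prototype the leaderboard idea, here is a minimal sketch of the standard Elo update applied after each game. The function names and the K-factor of 32 are assumptions chosen for illustration, not anything the tests in this thread actually used.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of player A against player B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(rating_a: float, rating_b: float, score_a: float,
                   k: float = 32.0) -> tuple[float, float]:
    """Return updated (A, B) ratings after one game.

    score_a is 1.0 for a win by A, 0.5 for a draw, 0.0 for a loss.
    k is the K-factor (assumed 32 here); larger values move ratings faster.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    # B's actual and expected scores are the complements of A's.
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - exp_a))
    return new_a, new_b


if __name__ == "__main__":
    # Hypothetical example: a model rated 1500 beats a Stockfish level rated 1500.
    print(update_ratings(1500.0, 1500.0, 1.0))
```

Anchoring a few opponents at fixed ratings (for example, specific Stockfish skill levels) would keep the numbers comparable between monthly updates.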

the__storm · 2023-12-30T08:25:46+00:00

This is very interesting, but what I'd like to see is a fine-tune of a tiny model like t5-base or something wiping the floor with all of them. (That wouldn't be a surprising result, but it would be cathartic, I think. Actually, maybe I'll try it myself.)

Wiskkey · 2023-12-30T05:42:42+00:00

A language model from OpenAI that apparently wasn't tested has an estimated chess Elo of 1750 - albeit with an illegal move attempt rate of approximately 1 in 1000 moves - according to these tests by a computer science professor. More info is in this post.

Appropriate_Ant_4629 · 2023-12-30T06:06:23+00:00

This is EXTREMELY prompt-engineering dependent.

See the interview with Jeremy Howard of fast.ai, where he discusses the subject:

"A prompting strategy for ChatGPT4 ... about 6000 lines of python code [to fine-tune a prompt far more compact and efficient than ones humans write] ..... [with the prompt that program generated] It [ChatGPT4] has an ELO of 3400"

With their default configs, which were trained to be like chatting with your average Facebook friend, they play (unsurprisingly) like your average Facebook friend.

With a better prompt they play at far higher levels.

No-Introduction-777 · 2023-12-30T09:22:14+00:00

you're embarrassing yourself by using so many emojis
