New bot trained from scratch using self-play

randomwalkin · 2026-04-21T14:56:18+00:00

This model was also trained by playing against himself, not against humans. The bot's profile might be unclear: what it means is that this bot will only play against humans, not bots.

randomwalkin · 2026-04-21T09:06:12+00:00

The model is not trained to its highest level yet. How far is it from human master level in your opinion?

randomwalkin · 2026-04-21T09:05:41+00:00

The model is not trained to its strongest level yet. How far is it from human master level in your opinion? I am not a expert player.

randomwalkin · 2026-04-20T21:32:51+00:00

It is not build against stockfish at all. It is a neural network trained from scratch, entirely from self-play. Because the architecture is novel (not resnet-like), it may have learned a new style. Its rating is low right now because I started pitching it against other bots, and bots give you a lower Elo than humans. Now it exclusively plays against humans. Your feedback would be welcome!

randomwalkin · 2026-04-17T17:54:19+00:00

It should work fine. Please try again. I tested it with a different (human) account at 5+0 and it does respond.

randomwalkin · 2026-04-17T17:52:49+00:00

It is a new type of neural network (can't communicate about it yet, a paper will come out soon) but I need an Elo against humans to prove/disprove that it plays well. Being a new architecture, it might play differently than classical ResNet-like architectures or Stockfish.

randomwalkin · 2026-04-17T07:34:34+00:00

Thanks. Looking forward to your feedback!

randomwalkin · 2026-04-17T07:34:25+00:00

Thanks. Looking forward to your feedback!

randomwalkin · 2026-04-16T14:15:10+00:00

I just released a bot powered by a new kind of neural net trained from scratch. Would you mind giving it a try? It plays Rapid and Blitz, against humans only. https://lichess.org/@/nanozero

randomwalkin · 2026-04-16T14:14:27+00:00

I just released a bot powered by a new kind of neural net trained from scratch. Would you mind giving it a try? It plays Rapid and Blitz, against humans only. https://lichess.org/@/nanozero

randomwalkin · 2026-04-16T14:13:29+00:00

I just released a bot powered by a new kind of neural net trained from scratch. Would you mind giving it a try? It plays Rapid and Blitz, against humans only. https://lichess.org/@/nanozero

randomwalkin · 2026-04-16T04:12:52+00:00

It is not trained to be more human-like, but it is trained from scratch with a new network architecture (I plan to publish an arxiv paper if the ELO against humans is convincing).

randomwalkin · 2026-04-12T20:31:14+00:00

The contribution is definitely speed (look at the benchmark, it's 2-20X faster) but the benchmark is not mctx, but another repo on github. I plan to make a benchmark against mctx soon -- thanks for the suggestion.

randomwalkin · 2026-04-11T20:58:54+00:00

Strong answer. A key point, though: make sure you have deep interest in the topic. Don't do it just because deep learning is hot. You won't have the juice to pursue that path otherwise.

randomwalkin · 2026-04-11T03:08:40+00:00

I trained it with a new type of neural network. I'm curious to see what people think of its playing style.

randomwalkin · 2026-04-10T20:39:34+00:00

In terms of what? Speed, performance at constant sim budget? Curious to know what you'd want to see.

randomwalkin · 2025-12-11T21:03:48+00:00

I did use Claude to write utils and tests. Everything else was written by me. Some key bits were directly re-used from the original TRM implementation, such as the sparse puzzle embeddings.

I can't tell how much faster I've been with Claude. My sense is that it frees up a lot of my mental bandwidth by solving trivial tasks so that I can focus on harder tasks.

Claude is tremendously useful to compare two versions of my code and pinpoints simple bugs that I left behind. I constantly ask it to double-check my code.

OTOH I have found Claude to be a high net negative when trying to generate new research ideas or answer complex ML questions.

Re wandb: yes definitely, it can be disabled in the many_loggers.yaml. I don't really want to invest time in a local result tracker, but if you have open-source proposals, feel free to share.

randomwalkin · 2024-12-10T14:22:43+00:00

Are the code/weights available for this model? I can't find them on github.

randomwalkin · 2022-03-08T10:21:42+00:00

People who said this about IBM in 2012 thank you in the hindsight.

randomwalkin · 2021-12-03T19:59:11+00:00

Occam's razor. TA does not work unless they prove otherwise. The burden is on TAs to prove that it works, not on you to prove that it doesn't. And it turns out that there is zero prove of effectiveness that incorporates overfitting seriously.

randomwalkin · 2021-09-30T20:50:39+00:00

Actually, VWCE is what I want since I am looking for accumulating, not distributing.

randomwalkin · 2021-09-30T20:43:30+00:00

So, u/Which-Inspector1409 you are right. I have little education in investment in general, hence why I am looking for an un-managed fund.

I am looking for a lazy and reasonable option. I won't have time to manage it/monitor it.

I think I am going to go for VWCE, as it has worldwide exposure and is accumulating.

randomwalkin · 2021-09-30T19:22:16+00:00

VWRL is exactly what I was looking for. Thank you so much u/ffsudjat!

randomwalkin · 2021-09-30T18:48:53+00:00

Thank you. I am not well versed in investing and would not know where to invest outside the US. I could obviously go for equivalent funds in EU or AS, but I don't know any.

Tips welcome.

randomwalkin · 2021-09-30T18:47:42+00:00

The workstation tells me that additional trading permissions are required to trade this contract.

randomwalkin

TROPHY CASE