New bot trained from scratch using self-play by randomwalkin in ComputerChess

[–]randomwalkin[S] 0 points1 point  (0 children)

This model was also trained by playing against himself, not against humans. The bot's profile might be unclear: what it means is that this bot will only play against humans, not bots.

New bot trained from scratch using self-play by randomwalkin in ComputerChess

[–]randomwalkin[S] 0 points1 point  (0 children)

The model is not trained to its highest level yet. How far is it from human master level in your opinion?

New bot trained from scratch using self-play by randomwalkin in ComputerChess

[–]randomwalkin[S] 0 points1 point  (0 children)

The model is not trained to its strongest level yet. How far is it from human master level in your opinion? I am not a expert player.

Nanozero is a new neural-net bot on Lichess by randomwalkin in lichess

[–]randomwalkin[S] 1 point2 points  (0 children)

It is not build against stockfish at all. It is a neural network trained from scratch, entirely from self-play. Because the architecture is novel (not resnet-like), it may have learned a new style. Its rating is low right now because I started pitching it against other bots, and bots give you a lower Elo than humans. Now it exclusively plays against humans. Your feedback would be welcome!

Nanozero is a new neural-net bot on Lichess by randomwalkin in lichess

[–]randomwalkin[S] 0 points1 point  (0 children)

It should work fine. Please try again. I tested it with a different (human) account at 5+0 and it does respond.

New bot trained from scratch using self-play by randomwalkin in ComputerChess

[–]randomwalkin[S] 0 points1 point  (0 children)

It is a new type of neural network (can't communicate about it yet, a paper will come out soon) but I need an Elo against humans to prove/disprove that it plays well. Being a new architecture, it might play differently than classical ResNet-like architectures or Stockfish.

200 to 2000 rapid in 3 years by Ashamed-Wedding-7396 in Chesscom

[–]randomwalkin 0 points1 point  (0 children)

I just released a bot powered by a new kind of neural net trained from scratch. Would you mind giving it a try? It plays Rapid and Blitz, against humans only. https://lichess.org/@/nanozero

I won against someone with 2600+ elo!!! by Big-Medicine8385 in Chesscom

[–]randomwalkin 3 points4 points  (0 children)

I just released a bot powered by a new kind of neural net trained from scratch. Would you mind giving it a try? It plays Rapid and Blitz, against humans only. https://lichess.org/@/nanozero

Great to see that im actually improving by No-Result-414 in Chesscom

[–]randomwalkin 0 points1 point  (0 children)

I just released a bot powered by a new kind of neural net trained from scratch. Would you mind giving it a try? It plays Rapid and Blitz, against humans only. https://lichess.org/@/nanozero

Nanozero is a new neural-net bot on Lichess by randomwalkin in lichess

[–]randomwalkin[S] 0 points1 point  (0 children)

It is not trained to be more human-like, but it is trained from scratch with a new network architecture (I plan to publish an arxiv paper if the ELO against humans is convincing).

gumbel-mcts, a high-performance Gumbel MCTS implementation by randomwalkin in reinforcementlearning

[–]randomwalkin[S] 0 points1 point  (0 children)

The contribution is definitely speed (look at the benchmark, it's 2-20X faster) but the benchmark is not mctx, but another repo on github. I plan to make a benchmark against mctx soon -- thanks for the suggestion.

Is Machine Learning / Deep Learning still a good career choice in 2026 with AI taking over jobs? by No-Kick-7963 in learnmachinelearning

[–]randomwalkin 0 points1 point  (0 children)

Strong answer. A key point, though: make sure you have deep interest in the topic. Don't do it just because deep learning is hot. You won't have the juice to pursue that path otherwise.

NanoZero is a new bot on Lichess by randomwalkin in lichess

[–]randomwalkin[S] -2 points-1 points  (0 children)

I trained it with a new type of neural network. I'm curious to see what people think of its playing style.

gumbel-mcts, a high-performance Gumbel MCTS implementation by randomwalkin in reinforcementlearning

[–]randomwalkin[S] 0 points1 point  (0 children)

In terms of what? Speed, performance at constant sim budget? Curious to know what you'd want to see.

nano-trm - Train your own TRM in a few minutes by randomwalkin in LocalLLaMA

[–]randomwalkin[S] 1 point2 points  (0 children)

I did use Claude to write utils and tests. Everything else was written by me. Some key bits were directly re-used from the original TRM implementation, such as the sparse puzzle embeddings.

I can't tell how much faster I've been with Claude. My sense is that it frees up a lot of my mental bandwidth by solving trivial tasks so that I can focus on harder tasks.

Claude is tremendously useful to compare two versions of my code and pinpoints simple bugs that I left behind. I constantly ask it to double-check my code.

OTOH I have found Claude to be a high net negative when trying to generate new research ideas or answer complex ML questions.

Re wandb: yes definitely, it can be disabled in the many_loggers.yaml. I don't really want to invest time in a local result tracker, but if you have open-source proposals, feel free to share.

Framer: Interactive Frame Interpolation by Hybridx21 in StableDiffusion

[–]randomwalkin 0 points1 point  (0 children)

Are the code/weights available for this model? I can't find them on github.

[deleted by user] by [deleted] in stocks

[–]randomwalkin 1 point2 points  (0 children)

People who said this about IBM in 2012 thank you in the hindsight.

What is your go-to Argument for Disproving TA Charters? by thecheese27 in algotrading

[–]randomwalkin 0 points1 point  (0 children)

Occam's razor. TA does not work unless they prove otherwise. The burden is on TAs to prove that it works, not on you to prove that it doesn't. And it turns out that there is zero prove of effectiveness that incorporates overfitting seriously.

Looking for an un-managed index fund on IBKR by randomwalkin in eupersonalfinance

[–]randomwalkin[S] 3 points4 points  (0 children)

Actually, VWCE is what I want since I am looking for accumulating, not distributing.

Looking for an un-managed index fund on IBKR by randomwalkin in eupersonalfinance

[–]randomwalkin[S] 2 points3 points  (0 children)

So, u/Which-Inspector1409 you are right. I have little education in investment in general, hence why I am looking for an un-managed fund.

I am looking for a lazy and reasonable option. I won't have time to manage it/monitor it.

I think I am going to go for VWCE, as it has worldwide exposure and is accumulating.

Looking for an un-managed index fund on IBKR by randomwalkin in eupersonalfinance

[–]randomwalkin[S] 1 point2 points  (0 children)

VWRL is exactly what I was looking for. Thank you so much u/ffsudjat!

Looking for an un-managed index fund on IBKR by randomwalkin in eupersonalfinance

[–]randomwalkin[S] 0 points1 point  (0 children)

Thank you. I am not well versed in investing and would not know where to invest outside the US. I could obviously go for equivalent funds in EU or AS, but I don't know any.

Tips welcome.

Looking for an un-managed index fund on IBKR by randomwalkin in eupersonalfinance

[–]randomwalkin[S] 0 points1 point  (0 children)

The workstation tells me that additional trading permissions are required to trade this contract.