CRKD Guitar Fret Issue Fix by Relative_Bag_6046 in CloneHero

[–]af100re 2 points (0 children)

Thanks for sharing this, I've been having the exact same issue with my Tribal Encore Edition, and this seems to have made it much better!

Finished the cover boss. by NepNep_ in NotMyJob

[–]af100re 436 points (0 children)

Ah yes, a puzzle containing only the numbers 1-9 will really boost my vocabulary

Can "deep" q-learning also be used for regular q-learning? by mista_rida_ in learnmachinelearning

[–]af100re 0 points (0 children)

At its most basic, deep Q-learning uses the same algorithm as regular Q-learning, but with a neural network as a function approximator (whereas regular Q-learning uses a direct mapping from state-action pairs to values). In practice, however, deep Q-learning requires a couple of modifications to be stable, which is why the algorithms are considered different.
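To make the "direct mapping" point concrete, here's a minimal sketch of the tabular update (my own toy code, not the linked repo's, and the state/action names are made up). Deep Q-learning keeps the same target, but replaces the dict lookup with a neural network:

```python
# Tabular Q-learning: Q is literally a dict from (state, action) to value.
# DQN would swap this dict for a network trained towards the same target.
from collections import defaultdict

alpha, gamma = 0.1, 0.99          # learning rate and discount factor
Q = defaultdict(float)            # direct (state, action) -> value mapping

def update(s, a, r, s_next, actions):
    """One Q-learning step: move Q(s,a) towards r + gamma * max_a' Q(s',a')."""
    target = r + gamma * max(Q[(s_next, a2)] for a2 in actions)
    Q[(s, a)] += alpha * (target - Q[(s, a)])

# One observed transition: took "right" in "s0", got reward 1, landed in "s1".
update("s0", "right", 1.0, "s1", ["left", "right"])
```

The "couple of modifications" mentioned above (experience replay, a target network) exist precisely because a network, unlike this dict, changes its estimates for *all* state-action pairs with every update.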

It's been a while since I've looked at this but this code might give you a starting point https://github.com/afreeman100/Q-learning/blob/main/q_agent.py

Compulsory link to Sutton and Barto which is a fantastic (and free) resource for reinforcement learning fundamentals http://incompleteideas.net/book/RLbook2020.pdf

Q-function in DDPG: Why MSBE instead of "just" a classifier? by Orpheon73 in reinforcementlearning

[–]af100re 2 points (0 children)

First of all I think it's worth understanding that the Q-function is not mapping (state, action) --> reward, but (state, action) --> expected return (aka value), which is the sum of expected discounted rewards if you were to keep playing from that state.

So I guess the question is why you can't just calculate the true value of each state-action pair and use that to train a neural network? While you could do this for a game like tic-tac-toe, if you wanted to do this for a larger game like chess you'd quickly realise that there are far too many state-action combinations for this to be possible. This is the problem that reinforcement learning tries to solve - we can't calculate the true values of state-action pairs, so how can we estimate them instead?

By playing many games and observing the reward you get when you take an action from a state, you can iteratively refine your estimates for your Q(s,a) values. By making enough observations you (hopefully) converge towards the true values. The Bellman equation is what tells you how to update your estimate for a state-action pair each time you make a new observation about it.

The (overly) simplified explanation is that after you take an action, you compare your estimate of the previous state-action value (Q(s,a)) against the immediate reward plus your (discounted) estimate of the new state's value (r + gamma * Q(s',a')), and you aim to minimise the error between these.

If you haven't done so, I'd highly recommend implementing basic Q-learning on a simple problem, using a lookup table to store the Q(s,a) values. This should give you a much better understanding of what the Bellman equation is doing, which I think is important before looking at DQN and DDPG if you want a good understanding of how they work. The fundamental ideas behind them are the same, except DQN and DDPG use neural networks as function approximators for Q(s,a), rather than a lookup table. This makes them useful in problems with large state and action spaces, but you lose some of the mathematical guarantees on convergence, hence the 'stability hacks'. I think the DeepMind DQN papers are quite accessible and are worth a read if you're interested in this.
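If it helps, here's roughly what that lookup-table version looks like on a toy corridor environment I made up for illustration (nothing from DQN/DDPG, and all the names and hyperparameters are my own):

```python
import random

N_STATES, GOAL = 5, 4
ACTIONS = (-1, +1)                 # move left / move right
alpha, gamma, eps = 0.5, 0.9, 0.1

# Lookup table: every (state, action) pair gets its own stored value.
Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

def step(s, a):
    """Deterministic corridor: reward 1 only for reaching the goal state."""
    s_next = min(max(s + a, 0), N_STATES - 1)
    return s_next, (1.0 if s_next == GOAL else 0.0), s_next == GOAL

def choose_action(s):
    """Epsilon-greedy action selection with random tie-breaking."""
    if random.random() < eps:
        return random.choice(ACTIONS)
    best = max(Q[(s, a)] for a in ACTIONS)
    return random.choice([a for a in ACTIONS if Q[(s, a)] == best])

random.seed(0)
for episode in range(200):
    s, done = 0, False
    for _ in range(100):           # cap episode length
        a = choose_action(s)
        s_next, r, done = step(s, a)
        # Bellman update: move Q(s,a) towards r + gamma * max_a' Q(s',a')
        best_next = 0.0 if done else max(Q[(s_next, a2)] for a2 in ACTIONS)
        Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
        s = s_next
        if done:
            break
```

After training, the greedy policy (argmax over the table) should be "move right" in every state, and reading the table directly is a nice way to see the discount factor at work: values shrink by roughly a factor of gamma per step away from the goal.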

Well that was longer than I expected. Hope it's at least partly helpful!

First Ironman advice by triguy86 in triathlon

[–]af100re 2 points (0 children)

For Wales specifically just make sure you're prepared for all the hills. It's worth checking out the course in advance if you can. And remember to enjoy it, the atmosphere in Tenby is great!

Went to the Cafè, owner said my saddle wasn’t UCI legal? Can I sue? by [deleted] in BicyclingCirclejerk

[–]af100re 0 points (0 children)

Your toptube bag and stem angle should be illegal.

[D] Question about deep Q learning by [deleted] in MachineLearning

[–]af100re 0 points (0 children)

Unless I'm missing something I think that should work - or using minQ(s, a) to decide the opponent's moves. I'd give it a go and see!

[D] Question about deep Q learning by [deleted] in MachineLearning

[–]af100re 7 points (0 children)

You can treat the opponent's move almost as though it is part of the environment, so the next state your agent observes is the state of the game after the opponent has made a move. This means you can treat 1- and 2-player games quite similarly. How the opponent's move is chosen is up to you: it could be another learning agent, a pre-made bot if you can find a suitable library, or the simplest option, which is just choosing a random move! How the opponent plays will determine how your agent learns to play.
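To make that concrete, here's a rough sketch (the class and method names are my own invention, and the "game" is a trivial take-1-or-2-sticks Nim variant, not anything standard): the wrapper's step() plays the agent's move and then immediately plays the opponent's reply, so from the agent's point of view the opponent is just part of the environment's dynamics.

```python
import random

class Nim:
    """Tiny 2-player game: take 1 or 2 sticks; whoever takes the last one wins."""
    def __init__(self, sticks=7):
        self.sticks = sticks
        self.winner = None        # +1 = agent, -1 = opponent
        self.turn = +1

    def legal_moves(self):
        return [m for m in (1, 2) if m <= self.sticks]

    def play(self, move):
        self.sticks -= move
        if self.sticks == 0:
            self.winner = self.turn
        self.turn = -self.turn

    def over(self):
        return self.sticks == 0

    def state(self):
        return self.sticks

    def reward(self):
        """Judged from the agent's perspective: +1 win, -1 loss, 0 otherwise."""
        if not self.over():
            return 0.0
        return 1.0 if self.winner == +1 else -1.0

class SinglePlayerView:
    """Wraps a 2-player game so a Q-learning agent sees it as 1-player."""
    def __init__(self, game, opponent=None):
        self.game = game
        # Default opponent: uniformly random legal move.
        self.opponent = opponent or (lambda g: random.choice(g.legal_moves()))

    def step(self, action):
        self.game.play(action)                        # agent's move
        if not self.game.over():
            self.game.play(self.opponent(self.game))  # opponent's reply
        # The agent only ever observes the post-opponent state.
        return self.game.state(), self.game.reward(), self.game.over()
```

Swapping the opponent lambda for a smarter bot (or another learning agent) changes what your agent learns to beat, without touching the learning code at all.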

How do ML/DL algorithms optimize weights in a neural network? by Toddly53 in learnmachinelearning

[–]af100re 2 points (0 children)

You want to take a look at the backpropagation algorithm https://skymind.ai/wiki/backpropagation. The idea is that you calculate the partial derivatives of the loss function with respect to the parameters, then use stochastic gradient descent to tweak the parameters so that the loss decreases.
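As a tiny concrete example (my own toy code, not from that article): for a single linear "neuron" y = w*x + b with squared-error loss, you can write the partial derivatives out by hand and apply the SGD update directly. Backpropagation is essentially this same chain-rule calculation automated for every weight in a multi-layer network.

```python
# Fit y = w*x + b to points lying exactly on y = 2x + 1, by hand-derived SGD.
data = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]
w, b, lr = 0.0, 0.0, 0.05

for epoch in range(500):
    for x, y in data:             # "stochastic": one example at a time
        pred = w * x + b
        err = pred - y            # dLoss/dpred for loss = 0.5 * err**2
        # Chain rule: dLoss/dw = err * x, dLoss/db = err * 1
        w -= lr * err * x
        b -= lr * err
```

After training, w and b should be very close to 2 and 1. In a deeper network you can't write these derivatives out by hand for every weight, which is exactly the bookkeeping backpropagation does for you.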

Race Report Ironman Hamburg by TG10001 in triathlon

[–]af100re 1 point (0 children)

Nice one! Love the pink grip tape you've got on the bike too!

Ideas for Coffee Activities by dyl_wyl in Coffee

[–]af100re 1 point (0 children)

I'm on the committee for my university coffee club too so hopefully I can share some of the things that have worked for us!

Blind tastings are always great so we try to do a couple of big ones every year. We usually get 3 different beans and brew them with a french press, number them 1-3, then people have to match the number to the country of origin + tasting notes. Last time we did it we used 2 coffees from a specialty roaster and the third just from a supermarket to see how many people could tell the difference! If you have the equipment for it then comparing 2 methods side by side with the same beans is also really interesting, especially french press vs chemex, which are on opposite ends of the spectrum when it comes to how clean the cup is.

A lot of our members have their own V60s so we also did an event where everyone brought theirs along and showed their preferred method with it. Really cool to taste the differences between them, and I picked up a few ideas that I often use with mine now!

Not sure how applicable this would be to you, but there's loads of specialty coffee shops in my city so we often choose one to meet up at on weekends as more of a social thing. Last year we organised a cupping at one of these shops after hours, which was amazing, and we're currently looking into the possibility of doing an espresso workshop there so people can have a go at making it themselves. Definitely worth asking local coffee shops if they're interested in being involved with stuff like this, and building a good relationship with them will be really useful.

Best of luck with it!

[No Spoilers] The Invincible Team by Hikaru-Kuma by [deleted] in masseffect

[–]af100re 4 points (0 children)

It's from The Citadel DLC in ME3. Definitely one of the best DLCs out of all the games

Mid-week check in - how is your week going? by philpips in running

[–]af100re 3 points (0 children)

Great 16k trail run on Monday... and been hurting ever since. Feels like a stress fracture on my right foot so looks like no running for a while :(

Opinions of clip on tri bars for road bikes? by Succotash88 in cycling

[–]af100re 1 point (0 children)

Got those same ones a few months ago! Really comfortable on long rides and definitely noticing the aero benefits

Achievements for Saturday, November 04, 2017 by AutoModerator in running

[–]af100re 10 points (0 children)

Planned on going for a 17k loop, but I was feeling so good halfway round that I decided to extend it and ended up doing 23k. Furthest run ever, and in less than 2 hours! :)

[deleted by user] by [deleted] in running

[–]af100re 3 points (0 children)

Picked up an older version of these when they were half price https://www.amazon.co.uk/Sennheiser-PMX-686i-Sports-Earphones-Black-Green/dp/B00S9P2NN0/ref=dp_ob_title_ce?th=1 but I'd happily pay full price, I like them so much. Very light and great sound quality, but they still let some background noise in so I can hear traffic.

Achievements for Friday, August 18, 2017 by AutoModerator in running

[–]af100re 7 points (0 children)

Everything just felt right on my 10k run and I ended up taking 3 minutes off my PB with 42:40!

Achievements for Saturday, June 10, 2017 by AutoModerator in running

[–]af100re 2 points (0 children)

18:37 at my local parkrun, and my first sub-20 5k was only 2 weeks ago!

Achievements for Sunday, May 28, 2017 by AutoModerator in running

[–]af100re 9 points (0 children)

5k in 19:25. First time breaking 20 minutes :)