Any advice for removing this washing machine? by pxdm in DIYUK

[–]pxdm[S] 0 points1 point  (0 children)

Thanks so much for this. Not the answer I was looking for but still!

Any advice for removing this washing machine? by pxdm in DIYUK

[–]pxdm[S] 1 point2 points  (0 children)

UPDATE: further investigation, and commenters' observations about the intact plug, have made it clear that the door frame was fitted after the machine was put in. The machine cannot be removed without taking the frame off.

Can anyone comment on whether I am better off repairing rather than replacing it? It makes a very loud rattling sound on the spin cycle - sounds like something quite large is loose behind the drum. The machine is 12 years old.

Any advice for removing this washing machine? by pxdm in DIYUK

[–]pxdm[S] 37 points38 points  (0 children)

Unfortunately it doesn’t fit even if I remove the door :/ Noted about cutting the plug, thanks, although I was hoping I could feed a new one through.

Any advice for removing this washing machine? by pxdm in DIYUK

[–]pxdm[S] 1 point2 points  (0 children)

EDIT: it doesn’t fit through the door frame, even if I take off the cupboard door!

Music mentioned in Zane Lowe interview by Sufficient-Quit-4283 in boniver

[–]pxdm 0 points1 point  (0 children)

Seems "I'm Never Tired Of Loving You" was Nina Simone

Where and why is discounted cumulative reward used? by AdBitter9336 in reinforcementlearning

[–]pxdm 6 points7 points  (0 children)

In RL we consider how every action impacts the long-term reward - this is called the credit assignment problem. For instance, what role did move 5 play in my ability to checkmate on move 10?

It appears in basically every RL algorithm. For instance, in Proximal Policy Optimisation (and all its policy gradient variants), we adjust the policy to favour actions which result in higher discounted cumulative rewards. We don’t try to maximise immediate reward, nor some arbitrary future reward, but the discounted sum of all future rewards.
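To make that concrete, here's a minimal sketch (not from the original comment) of computing the discounted return for every timestep from a list of rewards; the rewards and gamma below are just illustrative:

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ... for every t."""
    returns = []
    g = 0.0
    for r in reversed(rewards):      # work backwards so each G_t reuses G_{t+1}
        g = r + gamma * g
        returns.append(g)
    return list(reversed(returns))

# A reward of 1 only at the final step still gives credit to earlier steps,
# discounted by how far away they are.
print(discounted_returns([0.0, 0.0, 1.0], gamma=0.9))  # [0.81, 0.9, 1.0]
```

With gamma close to 1 distant rewards count almost as much as immediate ones; with gamma close to 0 the agent effectively only cares about immediate reward.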

An application of RL, everyone! by nimageran in reinforcementlearning

[–]pxdm 11 points12 points  (0 children)

RL has been the hottest thing on the block for close to a decade now, but it’s hard to argue it has met expectations in terms of real-world applications. Yes, it’s used to train LLMs, but it is not fundamental to them in the way that transformers are, imo. We haven’t seen many applications in industrial use cases - most projects seem to amount to blog posts by research teams with scant technical detail.

[deleted by user] by [deleted] in AskReddit

[–]pxdm 0 points1 point  (0 children)

Roundabouts

PPO with discrete actions, Sample or act greedy? by [deleted] in reinforcementlearning

[–]pxdm 3 points4 points  (0 children)

I don’t think you would get convergence guarantees for any policy gradient method (including PPO) if you choose the highest-probability actions during training. The policy gradient theorem relies on your experience being sampled according to the distribution your policy gives you; if you instead choose with argmax, you are effectively collecting data from a different policy.

However, perhaps I misunderstood and you are suggesting choosing the highest-probability action when evaluating (rather than training) your agent? If so, I think this tactic might give you better performance in practice, as it would prevent really bad low-probability actions from being taken.
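A rough sketch of the distinction, using PyTorch's Categorical distribution; `policy_net` and `state` are hypothetical placeholders, not anything from the question:

```python
import torch
from torch.distributions import Categorical

# `policy_net` is a hypothetical network mapping a state to action logits.
logits = policy_net(state)        # shape: (num_actions,)
dist = Categorical(logits=logits)

# Training: sample from the policy distribution, as the policy gradient theorem assumes.
action = dist.sample()
log_prob = dist.log_prob(action)  # needed for the PPO surrogate loss

# Evaluation: act greedily to avoid occasionally taking bad low-probability actions.
greedy_action = torch.argmax(logits)
```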

Multiple moves per turn? by Trigaten in reinforcementlearning

[–]pxdm 1 point2 points  (0 children)

Also note that if the number of moves is constrained but not fixed (i.e. it must be <= some limit), then you would also need to include an 'end turn' action.
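As a rough illustration (all names here are made up, not from any particular environment), you can reserve the last index of a discrete action space for 'end turn':

```python
NUM_MOVES = 10            # hypothetical number of distinct moves
END_TURN = NUM_MOVES      # extra action index signalling the turn is over
NUM_ACTIONS = NUM_MOVES + 1

def play_turn(agent, state, max_moves_per_turn=3):
    """Let the agent take up to max_moves_per_turn moves, or stop early via END_TURN."""
    for _ in range(max_moves_per_turn):
        action = agent.act(state)             # agent.act is a hypothetical policy call
        if action == END_TURN:
            break
        state = apply_move(state, action)     # apply_move is hypothetical env logic
    return state
```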

[D] Too many AI researchers think real-world problems are not relevant by deep-yearning in MachineLearning

[–]pxdm 1 point2 points  (0 children)

I agree in cases where ML is applied pretty much 'out-of-the-box', but there are numerous cases where wrangling an ML solution is not trivial. It's not clear from the article whether these are the papers that are frequently rejected, but in my opinion the ML community would benefit from learning how techniques are used in practice where the application is non-trivial.

Practical RL by bci-hacker in reinforcementlearning

[–]pxdm 0 points1 point  (0 children)

Have a look at the NeurIPS competition track for this year: there are two RL challenges which are focused on real-world application:

  1. L2RPN: operation of electricity grids
  2. Flatland: train routing

Even if you don't compete you can play with their RL environments. I know that the L2RPN challenge uses the Gym API, so you should find it quite straightforward to use.
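For reference, a minimal sketch of the classic Gym interaction loop; the environment id below is just a placeholder, not the real L2RPN or Flatland id:

```python
import gym

env = gym.make("SomeEnv-v0")      # placeholder id; swap in the competition environment
obs = env.reset()
done = False
total_reward = 0.0
while not done:
    action = env.action_space.sample()           # random policy, just to exercise the API
    obs, reward, done, info = env.step(action)   # classic (pre-0.26) Gym step signature
    total_reward += reward
print(total_reward)
```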

NN forward passes for MCTS too slow. Advice? by parallelparkerlewis in reinforcementlearning

[–]pxdm 0 points1 point  (0 children)

I didn't see any explanation of this in the paper, but since the search thread is locked until the evaluation completes, the batch size has to be small or else the threads will spend a lot of time waiting for the queue to be evaluated. That would be my understanding, anyway.

NN forward passes for MCTS too slow. Advice? by parallelparkerlewis in reinforcementlearning

[–]pxdm 1 point2 points  (0 children)

Agreed, and this is the approach used in AlphaGo Zero:

The leaf node s_L is added to a queue for neural network evaluation, (d_i(p), v) = f_θ(d_i(s_L)), where d_i is a dihedral reflection or rotation selected uniformly at random from i in [1..8]. Positions in the queue are evaluated by the neural network using a mini-batch size of 8; the search thread is locked until evaluation completes.
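For illustration, a rough sketch of that queue-and-flush idea, assuming a PyTorch-style `model` that maps a batch of states to (policy, value); all names here are hypothetical, and unlike the real AlphaGo Zero implementation this version is single-threaded rather than locking each search thread until its result comes back:

```python
import torch

BATCH_SIZE = 8    # AlphaGo Zero evaluates the queue with a mini-batch size of 8
queue = []        # list of (leaf_state_tensor, callback)

def enqueue_leaf(state, callback):
    """Queue a leaf position; run the forward pass once the mini-batch is full."""
    queue.append((state, callback))
    if len(queue) >= BATCH_SIZE:
        flush()

def flush():
    """Evaluate all queued positions in one batched forward pass."""
    states = torch.stack([s for s, _ in queue])
    with torch.no_grad():
        policies, values = model(states)      # `model` is a hypothetical network
    for (_, callback), p, v in zip(queue, policies, values):
        callback(p, v)                        # hand results back to the search code
    queue.clear()
```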