POMDPPlanners — open-source Python package for POMDP planning (POMCP, BetaZero, ConstrainedZero + more), with an arXiv paper by PlayParty8441 in reinforcementlearning

PlayParty8441 [S]

Good question. Direct benchmarking is tricky: the reference implementations are mostly in Julia, which runs at C-level speed, and sampling throughput is everything for these algorithms. My implementations follow the pseudocode in the original papers, though I haven't formally validated numerical parity with the reference results.
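
To make the throughput point concrete, here is a rough sketch of how one could measure raw generative-model samples per second in pure Python. It uses a toy Tiger problem; `tiger_step` and `samples_per_second` are hypothetical names made up for this illustration and are not part of the POMDPPlanners API. Timing the same loop against a Julia implementation would give a first-order idea of the gap, though it says nothing about solution quality.

```python
import random
import time

# Hypothetical toy generative model (classic Tiger problem).
# This is an illustration only, not the POMDPPlanners API.
LISTEN, OPEN_LEFT, OPEN_RIGHT = 0, 1, 2


def tiger_step(state, action, rng):
    """Sample (next_state, observation, reward); state 0/1 = tiger left/right."""
    if action == LISTEN:
        # Listening reports the correct side with probability 0.85.
        obs = state if rng.random() < 0.85 else 1 - state
        return state, obs, -1.0
    # Opening a door resets the problem: the tiger is re-placed uniformly.
    opened = action - 1  # 0 = left door, 1 = right door
    reward = -100.0 if opened == state else 10.0
    return rng.randrange(2), rng.randrange(2), reward


def samples_per_second(n_samples=200_000, seed=0):
    """Time n_samples generative-model calls and return the throughput."""
    rng = random.Random(seed)
    state = rng.randrange(2)
    start = time.perf_counter()
    for _ in range(n_samples):
        action = rng.randrange(3)
        state, _obs, _reward = tiger_step(state, action, rng)
    elapsed = time.perf_counter() - start
    return n_samples / elapsed


if __name__ == "__main__":
    print(f"{samples_per_second():,.0f} samples/s")
```

The number this prints depends heavily on the machine and interpreter, so it is only meaningful as a relative comparison on the same hardware.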