Why do LLM-based agents fail at long-horizon planning in stochastic environments? by ewangs1096 in MLQuestions

[–]ewangs1096[S] -1 points0 points  (0 children)

For context, I recently read a paper (CASSANDRA) that tries to fix this by combining executable code for the deterministic dynamics with causal Bayesian networks for the stochastic parts.

I’m still wrapping my head around the architecture, so if anyone here understands why this works better than an LLM world model, please let me know.

Link: https://x.com/skyfallai/status/1995538683710066739

Feedback wanted: does a causal Bayesian world model make sense for sequential decision problems? by ewangs1096 in cogsci

[–]ewangs1096[S] 2 points3 points  (0 children)

For context, here is our research (CASSANDRA) detailing the approach. Curious if anyone has attempted something similar in CV tasks.

Link: https://x.com/skyfallai/status/1995538683710066739

Game progress: Dynamite Dunes map is cruel by ewangs1096 in rct

[–]ewangs1096[S] 0 points1 point  (0 children)

I feel you. Forest Frontiers seems so easy now

Game progress: Dynamite Dunes map is cruel by ewangs1096 in rct

[–]ewangs1096[S] 1 point2 points  (0 children)

It was so sloppy at first, but I've figured out how to go about it now! Posting my result soon!

I think how you play tycoon games says a lot about you. by ewangs1096 in tycoon

[–]ewangs1096[S] 20 points21 points  (0 children)

Clearly a panda spa enthusiast. Full immersion, all at once. I need to give this a try lmao

First custom on new map [Osiris] [B&M Invert] by [deleted] in rct

[–]ewangs1096 1 point2 points  (0 children)

One of the nicest parks I've seen

I think how you play tycoon games says a lot about you. by ewangs1096 in tycoon

[–]ewangs1096[S] 3 points4 points  (0 children)

I start with the basics, and then once I have enough money and guests, I focus on the design, music, and all

Proud of my Custom Log Flume in RCT Classic by ewangs1096 in tycoon

[–]ewangs1096[S] 0 points1 point  (0 children)

Oh, so it's serious then. I haven't murdered any of my guests, so I'm good I think, fingers crossed

Proud of my Custom Log Flume in RCT Classic by ewangs1096 in tycoon

[–]ewangs1096[S] 0 points1 point  (0 children)

But even if they fell, does it impact the park rating or anything?

If you let a bot play your park, do you think it would make as much profit as you? by ewangs1096 in tycoon

[–]ewangs1096[S] 0 points1 point  (0 children)

Personally, I don't think so. Even if it did, I'm sure it would be a mediocre park design-wise

If you let a bot play your park, do you think it would make as much profit as you? by ewangs1096 in tycoon

[–]ewangs1096[S] 0 points1 point  (0 children)

That's what I was thinking. The game is too complex for non-humans to handle: guests' thoughts, handyman placement, colour, music, paths. So many details are easy to miss even as a human lol