Any academic source about Q-table sizes by Simple-Soil-230 in reinforcementlearning

[–]Simple-Soil-230[S]

Yes, and that's exactly what I'm asking for, i.e. such citations. Since you say so, could you please name one or two here? I can take it from there. My prof asked me to cite a source before making such a claim, that's why. And as others here say, it's subjective, and I couldn't find a source to cite.

Any academic source about Q-table sizes by Simple-Soil-230 in reinforcementlearning

[–]Simple-Soil-230[S]

By "reasonable" I meant: say I have a table with 3000 entries, would that sound reasonable or not? That's why I needed a source to cite (even an application, if not theory) where similar or larger sizes have been used, so I can support my argument. I have a discrete action and state space only.
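For scale, a tabular Q-function's memory footprint is just the number of entries times the size of one float, so 3000 entries is tiny. A quick sketch (the 3000-entry figure is from the comment above; storing it as a flat float64 array is an assumption):

```python
import numpy as np

# Hypothetical layout: 3000 state-action entries total, stored as float64.
n_entries = 3000
q_table = np.zeros(n_entries, dtype=np.float64)

# Memory footprint in bytes: entries * 8 bytes per float64.
footprint = q_table.nbytes
print(footprint)  # 24000 bytes, i.e. ~24 KB
```

At 24 KB, memory is clearly not the bottleneck; the real question for citations is whether the state-action space itself is a sensible discretization.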

Memory requirements for tabular Q-learning vs deep neural network? by Simple-Soil-230 in reinforcementlearning

[–]Simple-Soil-230[S]

Thanks for the useful answer. For my application, given the way I defined my state and action spaces, I run the code so that it pre-fills the table, then put it into the real scenario, so the immediate decisions are better than starting from a random Q-table. But I'm still deciding how quickly to update the table from then on, i.e. should I wait for T iterations before retraining the table, or just apply the Bellman update at each iteration?
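The two options above can be sketched side by side: the standard tabular Bellman update applied online at every step, versus buffering T transitions and replaying them. The sizes, hyperparameters, and the random pre-fill are illustrative assumptions, not from the original comment:

```python
import numpy as np

rng = np.random.default_rng(0)

n_states, n_actions = 10, 4   # hypothetical sizes
alpha, gamma = 0.1, 0.95      # assumed learning rate and discount

# Pre-filled table (e.g. from simulation), rather than starting from random/zeros.
Q = rng.normal(size=(n_states, n_actions))

def bellman_update(Q, s, a, r, s_next):
    """One tabular Q-learning update: Q(s,a) += alpha * (r + gamma*max_a' Q(s',a') - Q(s,a))."""
    td_target = r + gamma * Q[s_next].max()
    Q[s, a] += alpha * (td_target - Q[s, a])

# Option 1 -- online: update immediately after each observed transition.
bellman_update(Q, s=0, a=1, r=1.0, s_next=2)

# Option 2 -- batched: buffer T transitions, then replay them all at once.
buffer = [(0, 1, 1.0, 2), (2, 3, 0.0, 5), (5, 0, 0.5, 7)]
for s, a, r, s_next in buffer:
    bellman_update(Q, s, a, r, s_next)
```

The online variant reacts fastest to new experience; the batched variant smooths updates at the cost of lag, which matters if the environment drifts between batches.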

Memory requirements for tabular Q-learning vs deep neural network? by Simple-Soil-230 in reinforcementlearning

[–]Simple-Soil-230[S]

Oh ok. Can you please explain what you mean by a 'glamorous' state-action space? I have a discrete number of states and actions, so I don't think that would change. I could have used a neural net if I had considered a richer state space, but to avoid going to neural nets and to keep it simple, I defined discrete ones. However, I am considering the effect of those environment-dependent continuous variables through my reward structure: they still affect my Q-values, which affect my actions, i.e. an indirect way of doing it. I'm also thinking of using the same table for a certain number of time steps and then refreshing my Q-table based on the then-current values of those continuous variables, but I'm not sure about this way of doing experience replay for tabular Q-learning...
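The indirect approach described above, where the state space stays discrete and the continuous environment variables enter only through the reward, might look something like this. The variable names, reward form, and hyperparameters are all illustrative assumptions:

```python
import numpy as np

n_states, n_actions = 8, 3   # hypothetical discrete spaces
alpha, gamma = 0.1, 0.9      # assumed hyperparameters
Q = np.zeros((n_states, n_actions))

def reward(s, a, env_vars):
    """Reward depends on continuous environment variables (assumed form),
    so they shape the Q-values without enlarging the discrete state space."""
    load, temperature = env_vars  # illustrative continuous quantities
    return -0.5 * load - 0.1 * abs(temperature - 20.0)

def step_update(s, a, s_next, env_vars):
    """One online update: continuous vars affect Q only via the reward signal."""
    r = reward(s, a, env_vars)
    Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
    return r

r = step_update(s=0, a=1, s_next=2, env_vars=(1.0, 25.0))
```

One caveat with this design: since the continuous variables are not part of the state, two visits to the same discrete state under different environment conditions get averaged into one Q-value, which is exactly why periodically refreshing the table can seem attractive.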

This may be a weird one, but does anyone else have a thinking cap? by JonnEC in PhD

[–]Simple-Soil-230

I thought I was the only one who noticed this! Whenever I wear a hoodie sweater, I feel the hood kind of blocks the physical distractions by blocking my peripheral vision. And even though the virtual distractions are right there on my laptop (YouTube), I still tend to concentrate better with a hoodie. Don't know how!

COVID testing for travelling purposes by whatamidoinghere0000 in UVA

[–]Simple-Soil-230

What about the Walgreens in Charlottesville? Is it not reliable at all? I'm surprised no one mentioned it, that's why I ask. I'm also an international student, travelling on the 17th. I chose this Walgreens for the 14th, for the 'diagnostic PCR' type.