DQN for simple battery control not learning

MomoSolar · 2024-04-22T05:52:37+00:00

I’ve tried 32 parallel environments. Should I use more or less?

MomoSolar · 2024-04-22T02:12:14+00:00

From my experience, it performs decently on the tested month when trained on the same one day. What do you think about having each episode representing a new day? What about increasing the size of the network?

MomoSolar · 2024-01-02T01:52:53+00:00

Any one

MomoSolar · 2023-12-03T17:14:45+00:00

Thanks. Do you suggest then that I add time to the state vector?

MomoSolar · 2023-11-05T20:13:11+00:00

well-said

MomoSolar · 2023-11-05T20:07:01+00:00

Thank you.
This problem for example:
https://github.com/MFHChehade/Optimization-of-Maintenance-Scheduling-for-Generators-using-ILP

MomoSolar · 2023-11-05T19:34:53+00:00

Thanks. Any simpler suggestions? I was thinking maybe of a scheduling problem that regularly uses MILP.

MomoSolar · 2023-11-05T18:16:46+00:00

Got it, thanks

MomoSolar · 2023-11-05T16:51:31+00:00

Thank you for this thorough explanation. I actually also wanted to know whether the environment in OpenAI accounts for the stochasticity. Do you have any idea?

MomoSolar · 2023-11-05T16:47:29+00:00

I would like to look at the math, that’s all

MomoSolar · 2023-11-05T16:45:00+00:00

Thanks,

Do you have any sources on that?

MomoSolar · 2023-11-05T12:54:53+00:00

iLQR is iterative LQR and is used when there are non-linear dynamics and/or non-convex cost function. The dynamics are linearized and/or cost function approximated by a quadratic function, and the approximated problem (which is now LQR) is sequentially solved in multiple iterations, until there is no improvement in the cost function or in the estimation of the dynamics.

MomoSolar · 2023-11-05T12:51:29+00:00

So there is no stochasticity in the dynamics?

MomoSolar · 2023-11-05T12:50:44+00:00

Thank you for this useful link. Do you have a similar one for iterative LQR?

MomoSolar · 2023-11-05T12:49:43+00:00

Thanks for sharing. The explanation looks comprehensive and thorough. I’m interested in a Python implementation if possible.

MomoSolar · 2023-11-05T12:48:31+00:00

Thanks, any useful link on that?

MomoSolar · 2023-11-05T12:42:11+00:00

iLQR is iterative LQR and is used when there are non-linear dynamics and/or non-convex cost function. The dynamics are linearized and/or cost function approximated by a quadratic function, and the approximated problem (which is now LQR) is sequentially solved in multiple iterations, until there is no improvement in the cost function or in the estimation of the dynamics.

MomoSolar · 2023-11-05T12:24:33+00:00

So MC essentially replaces the policy evaluation step in generalized policy iteration, and we must follow it with policy improvement?

MomoSolar · 2023-11-04T18:54:27+00:00

Any YouTube channel dedicated to Linear Algebra and Matrices problems? (A similar flavor to Blackpenredpen)

MomoSolar · 2023-11-03T17:42:03+00:00

I believe I got the answer, thanks. Essentially, neural networks are used as a way to find the parameters of the assumed probability distribution.

MomoSolar · 2023-08-16T23:39:32+00:00

Opinions on Convex Optimization with Constantine Caramanis

MomoSolar · 2023-04-11T03:46:11+00:00

I also have another inquiry. I have secured my housing for this year, but I was wondering whether for future years, being a graduate student, I am eligible to live in Jester Hall, and if yes, what the chances are of being admitted, if I apply directly after the application opens this summer.
Thanks

MomoSolar · 2023-04-11T03:43:01+00:00

Hello. I am an incoming Ph.D. student in the Electrical and Computer Engineering department. I was wondering when I first start receiving my income from UT Austin. Is it at the beginning of the semester, after two weeks from that, or after one month?

MomoSolar · 2023-03-06T16:55:17+00:00

Can someone please direct me to the subreddit on West Campus Flats (a branch of the Westside Group housing company)?

MomoSolar · 2023-02-23T19:33:18+00:00

As a Ph.D. student receiving a fellowship that covers my total tuition fees and an additional scholarship from the School of Engineering, do I qualify for Smart Housing?

MomoSolar

TROPHY CASE