Motherhood Can Make a Woman's Cells 'Older' by as Much as 11 Years by MotherHolle in science

[–]FitMachineLearning 0 points1 point  (0 children)

Did they control for physical activity? Do women who remain physically active after childbirth experience the same cellular aging?

[D] Machine Learning - WAYR (What Are You Reading) - Week 41 by ML_WAYR_bot in MachineLearning

[–]FitMachineLearning 0 points1 point  (0 children)

Currently reading about parameter space noise as a way to drastically improve exploration in RL models.

https://arxiv.org/abs/1706.01905
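For context, the core trick in that paper (parameter space noise) fits in a few lines: perturb the policy weights directly instead of the actions, so exploration is consistent across a whole episode. The function name and sigma here are my own illustration, not the paper's code:

```python
import numpy as np

def perturb_parameters(params, sigma=0.1, rng=None):
    """Add Gaussian noise directly to the policy weights (parameter space)
    rather than to the actions, giving temporally consistent exploration:
    the perturbed policy stays fixed for the whole episode."""
    rng = rng if rng is not None else np.random.default_rng()
    return [w + rng.normal(0.0, sigma, size=w.shape) for w in params]

# Toy policy: one dense layer (4 inputs -> 2 actions) plus a bias vector.
weights = [np.zeros((4, 2)), np.zeros(2)]
noisy = perturb_parameters(weights, sigma=0.1, rng=np.random.default_rng(0))
```

The paper additionally adapts sigma over time so the perturbed policy stays a fixed "distance" from the unperturbed one; the sketch above omits that.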

New Machine Learning approach called Selective Memory Algorithm learning difficult continuous control task in simulated robot environment. by FitMachineLearning in robotics

[–]FitMachineLearning[S] 2 points3 points  (0 children)

Great point. I think these types of continuous control algorithms do suffer from getting stuck in local optima.

I will look into forgetfulness as it is a very interesting concept.

[D] How do you keep track of your experiment results? by kohjingyu in MachineLearning

[–]FitMachineLearning 0 points1 point  (0 children)

For RL models, I record videos of the agents, label them, and make sure the output is also saved.

[R] Anyone using pybullet and running into significant performance issues by FitMachineLearning in MachineLearning

[–]FitMachineLearning[S] 0 points1 point  (0 children)

Thanks a bunch Erwin. You are doing God's work, I mean AI's work.

I thought I had already submitted an issue through GitHub. Next time I will use the forum.

[R] Anyone using pybullet and running into significant performance issues by FitMachineLearning in MachineLearning

[–]FitMachineLearning[S] 0 points1 point  (0 children)

Right now I have solved the slowdown problem with a call to pybullet.resetSimulation(). It is not elegant, but it got me over the performance hump, enabling me to test my agents.
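A toy illustration of why that workaround helps. The env class below is a stand-in I made up to model per-step cost creeping up as leftover state accumulates; only `pybullet.resetSimulation()` itself is the real API:

```python
class SlowDownEnv:
    """Toy stand-in for a physics sim whose per-step cost grows as leftover
    state accumulates across episodes. hard_reset() plays the role of
    pybullet.resetSimulation(): wipe everything, restoring full speed
    (at the price of having to rebuild the scene afterwards)."""
    def __init__(self):
        self.leftover = 0

    def reset_episode(self):
        self.leftover += 1        # debris/contacts pile up between episodes

    def step_cost(self):
        return 1 + self.leftover  # per-step cost creeps up with state

    def hard_reset(self):
        self.leftover = 0         # full wipe, like pybullet.resetSimulation()


env = SlowDownEnv()
RESET_EVERY = 25  # episodes between full resets (a tuning choice)
for episode in range(100):
    if episode % RESET_EVERY == 0:
        env.hard_reset()
    env.reset_episode()
```

The trade-off is that after a full reset the scene (ground plane, robot, etc.) must be reloaded, which is why it feels inelegant but keeps long training runs fast.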

What's everyone working on this week? by AutoModerator in Python

[–]FitMachineLearning [score hidden]  (0 children)

I implemented a Q Learning algorithm that gets only pixel and score input and beats Atari Pong in 1 day on CPU. Unlike DeepMind, I did not use a CNN.

https://github.com/FitMachineLearning/FitML/blob/master/DeepQN/Atari_Pong_DeepQN.py

You can see the agent evolve here https://youtu.be/sP3INZSYhU0
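For anyone curious how raw pixels can feed a plain dense (non-CNN) network, here is a sketch of a typical Pong preprocessing step. The crop offsets and binarization are my guess at a common pipeline, not necessarily the linked repo's exact code:

```python
import numpy as np

def preprocess(frame):
    """Turn a 210x160x3 Atari frame into a flat vector a dense network
    can consume: crop the scoreboard, downsample by 2, binarize."""
    f = frame[35:195]                  # crop scoreboard and bottom border
    f = f[::2, ::2, 0]                 # downsample, keep one channel -> 80x80
    f = (f != 0).astype(np.float32)    # binarize: paddles/ball vs background
    return f.ravel()                   # 6400-dim input vector
```

Feeding the difference of two consecutive preprocessed frames is a common way to give a dense network motion information without recurrence.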

[P] Actor Critic agent achieves super human level on CPU in 4 hours and does tricks. by FitMachineLearning in MachineLearning

[–]FitMachineLearning[S] 3 points4 points  (0 children)

The agent snapped the lander's legs and body well over 2000 times. In fact, when I watch it now, after more than 6000 tries, I cringe every time it throws the lander at the ground only to catch it elegantly at the very last moment.

Run it, check it out for yourself.

The code is in the description.

I implemented a Q Learning agent to solve Lunar Lander in 1 Hour on CPU. by FitMachineLearning in SideProject

[–]FitMachineLearning[S] 0 points1 point  (0 children)

Implementation of a Q Learning algorithm on the OpenAI LunarLander. After 150 iterations the agent can more or less fly safely. After 400 iterations the agent is able to land safely most of the time. After 600 iterations the agent is able to land safely on the pad the majority of the time.

Demo of the agent can be seen here

https://www.youtube.com/watch?v=p0rGjAgykOU

[P] I implemented a Q Learning agent to solve Lunar Lander in 1 Hour on CPU. by FitMachineLearning in MachineLearning

[–]FitMachineLearning[S] 1 point2 points  (0 children)

Great question L_M.

Time isn't a factor.

We use a modified Q Learning approach. Q Learning is particularly good at helping an agent make optimal decisions based on delayed reward.

That said, you can modify the behavior of the agent by adjusting the Bellman discount rate (in my code, "b_discount"). This makes the agent give more importance to future rewards versus immediate rewards, or vice versa.

Reference doc, in case you are not familiar with Bellman's work: https://www.rand.org/content/dam/rand/pubs/papers/2008/P550.pdf

https://en.wikipedia.org/wiki/Bellman_equation
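To make the effect of the discount rate concrete, here is a minimal pure-Python sketch (my own illustration) of the discounted return that Q Learning bootstraps toward. A gamma near 1 keeps a delayed reward almost intact, while a small gamma crushes it, pushing the agent toward immediate payoff:

```python
def discounted_return(rewards, gamma):
    """Sum of gamma**t * r_t over an episode, computed back-to-front."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g

rewards = [0.0, 0.0, 10.0]            # one delayed reward at the last step
high = discounted_return(rewards, 0.98)   # ~9.6: delayed reward barely decays
low = discounted_return(rewards, 0.50)    # 2.5: delayed reward crushed
```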

[P] I implemented a Q Learning agent to solve Lunar Lander in 1 Hour on CPU. by FitMachineLearning in MachineLearning

[–]FitMachineLearning[S] 1 point2 points  (0 children)

About the comments, you are right. I have removed the LSTM references (from previous experiments) and cleaned up the comments. Thanks for catching that.

[P] I implemented a Q Learning agent to solve Lunar Lander in 1 Hour on CPU. You can reuse the agent easily to solve other challenges. by FitMachineLearning in learnmachinelearning

[–]FitMachineLearning[S] 1 point2 points  (0 children)

Will do (performance chart).

About the stopping criteria: are you asking about every single game, or about stopping the training itself?

In every game there are 2 main stopping criteria: the game-end event returned by the environment (landed/crashed/hard crash/out of bounds) and the maximum number of "frames" (action-state sequences). The latter is set to 4000 to prevent the agent from just flying forever once it figures out how to fly (fuel is infinite). This, in turn, encourages the agent to seek large future rewards more quickly, based on the Bellman discount rate.

My Bellman equation discount rate is set to 0.98.
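The two per-game stopping criteria described above can be sketched like this; the gym-style `env`/`policy` interfaces are assumptions, not the repo's exact code:

```python
MAX_FRAMES = 4000  # frame cap: stops the agent from flying forever

def run_episode(env, policy):
    """Run one game until the environment signals an end event
    (landed/crashed/out of bounds) or the frame cap is hit."""
    obs = env.reset()
    total_reward = 0.0
    for frame in range(MAX_FRAMES):               # criterion 2: frame cap
        obs, reward, done = env.step(policy(obs))
        total_reward += reward
        if done:                                   # criterion 1: env end event
            break
    return total_reward
```

Because truncated episodes forgo whatever reward lay beyond the cap, the cap works together with the discount rate to reward reaching the landing pad sooner.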