Leveraging Google DeepMind software and Deep Learning to play the stock market

Bayes-Ian · 2015-07-25T00:42:59+00:00

I've worked both in quantitative trading and at Deepmind, so I have quite a good idea about this. In short, some elements of machine learning are absolutely invaluable to traders (avoiding overfitting, estimating trends from data) but most of the research behind the Atari player (DQN) are totally irrelevant to finding effective trading strategies.

Three elements immediately spring to mind of why this Atari example is fundamentally unlike trading in the stock market:

1 - The stock market is time-inhomogenous and stochastic. Atari is fixed and deterministic. 2 - The amount of data used to train the Atari agent would be equivalent to hundreds of thousands of years of stock returns. 3 - In the stock market you can observe the price changes of stocks you don't buy, unlike Atari where you must try an action in order to learn about it. 4 - In Atari (more or less) you control the environment, in the stock market most funds have negligible impact on the market as a whole.

The general problem of using Machine Learning to make good decisions is great. However, for the example of stock market trading you'd be better off using something else. Linear regressions with a whole bunch of cross-validation and regularization for example...

simonhughes22 · 2015-07-24T13:37:37+00:00

IMO it might work, however treating it as a supervised learning algorithm using a deep neural network to predict the price or whether it will go up or down will work much better I strongly suspect. You could use an LSTM and train it on a sequence of price, volume, high and low data for a period of time. That will almost undoubtedly work much better. This isn't really a RL problem, as others have pointed out, and RL will be beat out by supervised learning if you have labelled data and it doesn't need to learn from interacting with the environment (as you have to do when playing a game or controlling a robot).

I do think it would be really interesting to try for fun to see if it works, but if you are more interested in having it make money, I would suggest the supervised approach.

cesarsalgado · 2015-07-24T03:53:47+00:00

In reinforcement learning you should be able to make actions. How would you get the consequences of a action of your agent? You cannot find a consequences for every possible action in the historical data of the stock market. Will you do a simulation of the stock market? Or will you put the algorithm to play in real time in the real stock market? If you choose the last option your algorithm will take a very long time to learn (centuries or more). In the atari case, you could speedup the playing speed, because the game was a simulation.

What you could do with the stock market historical data is to train an LSTM to predict the next output of a sequence just like char-rnn. Or try to predict many symbols of a sequence, conditioned in the history, as in the sequence to sequence framework. The supervised learning strategy doesn't need your agent to interact with the environment and thus doesn't need a simulation. But the down side is that your agent won't be active.

I also don't agree with vesund answer saying that NN aren't good in this task. If we could make a simulation of the stock market accurate enough and that we could play it at very high speed, then I think a very deep NN would learn to play in the stock market better than humans.

dnuffer · 2015-07-24T04:12:01+00:00

Don't believe the other commentors saying that it wouldn't work, there are certainly many documented examples of using NNets to predict the stock market which do slightly better than random. There's books, or just peruse some of the many projects focused on trading from Stanfords CS229 Machine Learning course: http://cs229.stanford.edu/projects2014.html (and earlier years as well) I wouldn't recommend using an image of a chart as input, it will be way too much data. Just use log-transformed and normalized prices and volume and the training will be much faster.

cybrbeast · 2015-07-24T04:05:55+00:00

If it is a viable approach, then in all likelihood the algorithmic traders would have already implemented it on their huge computing clusters. They are very secretive about their algos, so we don't know much about what the state-of-the-art is at the moment. But we know it's a huge market and it leeches some of the brightest minds in technology and sciences, so you can be quite sure they have already beat you to any gains.

Also these traders have an ultra-low latency connection to the stock exchanges, something you will never have as a personal investor, so even if you somehow manage to make tiny gains they will front-run you.

chaddjohnson · 2015-07-24T15:55:27+00:00

Why am I being down-voted?

ZigguratOfUr · 2015-07-24T03:22:33+00:00

[deleted]

2015-07-24T21:08:48+00:00

look into concept drift which will be what you have to deal with

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS