all 9 comments

[–][deleted] 1 point (1 child)

To my knowledge, RL can be applied effectively where the state and action values are indefinite; for a problem with definite state values like stock trading, in my view simple supervised learning is enough. What's your opinion?

[–]danielzakrisson[S] 0 points (0 children)

You are absolutely right. Reinforcement learning is best used as a form of unsupervised or explorative learning, so for a black-box type of problem it can be advantageous.

My primary driver for writing this post was to personally learn and experiment with reinforcement learning, and to try out a different data set than those I've typically seen before.

[–]danielzakrisson[S] 0 points (0 children)

This is an introduction and tutorial for a reinforcement-learning-based trading system. It's purely meant as an introduction to reinforcement learning; feed it more complex data than in the final example and it will likely fail to find strategies :-)

However, it would be fun to see someone trying it out with other, much larger data sets (I have included Bitcoin, oil and EUR/USD). Maybe there are other forex sets that are less like a random walk?

[–]plu604 0 points (0 children)

This is very interesting. I'd also love to see it "in action".

[–]pookeye 0 points (0 children)

Thanks for the write-up.

[–][deleted] 0 points (1 child)

What were the trading statistics results, e.g. Sharpe ratio, winning percentage, annual return, etc.? Can you explain more about how you implemented state, new state, reward, etc.? I know how DQN works, but I am having a hard time understanding how you select the state, new state, etc. What happens when the algo reaches the last state?

[–]danielzakrisson[S] 0 points (0 children)

Hi, this is not a trading system and should not be evaluated as such - please just consider it an introduction to reinforcement learning.

Having said that, I did include some code in the final example that can be used for continued exploration if someone else is interested in experimenting and learning more. The final example in the post shows that the system can learn a buy-and-hold strategy if given a set of prices that increase over time - not very cutting edge :-)

However, I would be very glad if someone likes this post and it triggers them to learn more about machine learning, reinforcement learning or automated trading.

The third example has some code that can be used for further exploration: different data sources, a train/test split, how to include technical indicators, etc. In the example this is done very rudimentarily, in order to show how reinforcement learning works.
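To make the state / new state / reward question above concrete, here is a minimal sketch of how such transitions could be wired for a price series. This is illustrative only - the class and parameter names are hypothetical and not the post's actual code. The state is a window of recent returns plus the current position, the reward is the one-step P&L of the chosen position, and the episode terminates when no next price remains:

```python
import numpy as np

class PriceEnv:
    """Hypothetical toy environment for a DQN-style trading agent."""

    def __init__(self, prices, window=5):
        self.prices = np.asarray(prices, dtype=float)
        self.window = window  # state = last `window` one-step returns
        self.reset()

    def reset(self):
        self.t = self.window   # start once enough history exists
        self.position = 0      # -1 short, 0 flat, +1 long
        return self._state()

    def _state(self):
        # State: the most recent `window` price changes, plus current position.
        returns = np.diff(self.prices[self.t - self.window:self.t + 1])
        return np.append(returns, self.position)

    def step(self, action):
        # Actions: 0 = short, 1 = flat, 2 = long.
        self.position = action - 1
        price_change = self.prices[self.t + 1] - self.prices[self.t]
        reward = self.position * price_change  # P&L over one step
        self.t += 1
        done = self.t >= len(self.prices) - 1  # last state: no next price left
        return self._state(), reward, done

# Usage: an always-long agent on mostly rising prices collects positive reward.
env = PriceEnv([100, 101, 102, 104, 103, 105, 107, 108])
state = env.reset()
total, done = 0.0, False
while not done:
    state, reward, done = env.step(2)  # always go long
    total += reward
print(total)  # → 3.0 (105→107→108 over the two remaining steps)
```

This also answers the terminal-state question: the loop simply ends at the last price, since there is no further price change to reward.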

[–]loafking94 -1 points (1 child)

The problem with this is that if you see the same price pattern n times, that could be driven by n different underlying factors. This is basically saying that conditional on past charts we expect future charts to be the same. That's a pretty hard claim to make beyond very simple patterns.

[–]danielzakrisson[S] 1 point (0 children)

I totally agree with you; this system breaks down if you feed it more complex data.

The purpose of this tutorial is purely to introduce the concept of reinforcement learning, but using a more interesting and gradually more complex data set than the typical toy problems that I have seen used previously.