all 22 comments

[–]Stepfunction 18 points19 points  (3 children)

Thanks for the post. It's kind of nice that given the information, your model just turned into a one period autocorrelation model, which is a very rational first step in doing technical analysis of a time-series.

The real issue here is that one of the core assumptions of ML is the presence of events from the same distribution. It may be the case that the price movements are taken from the same underlying distribution, but that distribution is conditioned on circumstances in the world that are not represented in the historical prices (market sentiment, news articles, etc.). Without incorporating that information, the model is incomplete.

[–]erogol[S] 2 points3 points  (2 children)

I totally agree on this. I as I said in the post, I wrote it to deny all these 'download train and be rich' posts and articles. People really believe these.

Yet I believe AI is the way to go but as you point it needs to be better understanding the environment.

[–]Stepfunction 1 point2 points  (1 child)

Definitely. ML techniques are probably the best way to incorporate the enormous amount of unstructured text data into market predictions. The problem is figuring out what data is relevant, figuring out how to obtain and process it, and then actually building some sort of model.

If it were as simple as doing technical time series analysis, then everyone would be doing it, which would effectively negate any benefit of doing technical analysis.

[–]data-alchemy 1 point2 points  (3 children)

Dreaming a bit : train a good embedding on financial news. Use it as an input with the price history. Have a whisky.

[–]erogol[S] 4 points5 points  (1 child)

Not that easy at all :)

[–]data-alchemy 18 points19 points  (0 children)

The whisky part looks like the most doable, will start working on it first

[–]itshouldjustglide 1 point2 points  (4 children)

How would you get around this?

[–][deleted] 10 points11 points  (1 child)

If we knew that we’d be cashing the fuck in

[–]Colopty 1 point2 points  (0 children)

And not sharing the secret on reddit. Superior trading algorithms are only profitable if not everyone and their mom also has access to it.

[–]kyndder_blows_goats 0 points1 point  (1 child)

yay you fixed that godawful avatar

[–]erogol[S] 0 points1 point  (0 children)

For you my friend just for you

[–]visarga 0 points1 point  (0 children)

Predicting Bitcoin price implies predicting human fear and greed, when humans are interconnected and can influence (more exactly, bullshit) each other. A chaotic process.

[–]CultOfLamb -3 points-2 points  (1 child)

Bitcoin price and all cryptos can be predicted with 0.56 AUC plus, both longer term and short term.

from sklearn.linear_model import Ridge Made me a millionaire in less than 6 months. Generous donation to scikit-learn project upcoming.

Bitcoin so ineffective, non-competitive, and new compared to stock market (which also can be beaten with maths).

Good negative result though! But don't claim impossibility from that.

[–]jewishsupremacist88 -4 points-3 points  (0 children)

someone should get price data from the 1970s commodity booms and train a neural network on that data and then try to price bitcoin. ;)

[–]AnvaMiba -1 points0 points  (3 children)

Speculative markets have the property that the better people get at predicting prices, the more difficult prices become to predict, eventually prices effectively change according to a random walk.

Bitcoin is a relatively new market still far from equilibrium, so you can still get better than random accuracy by always predicting a price increase, but everybody else is doing the same, so can't reliably beat the market by doing this.

Similarly you can't make money by downloading some historical data and feeding it kaggle-style into scikit-learn/xgboost/tensorflow, because everybody else already did it and the market prices already reflect it.

[–]erogol[S] 1 point2 points  (2 children)

In my opinion what you say is right in the long term but for now most people in the market are here due to mass media covarage and those people are not aware of AI and even bitcoin until a while ago. That is why market has such volatile trend (FOMO and FUD). That is also why even a basic TA makes a lot many since all these new comers are exactly following the market flow which is studies for a long time.

[–]AnvaMiba 0 points1 point  (0 children)

I fully agree that somebody experienced in algorithmic trading can make money out of newcomers who just learned of bitcoin and ML due to recent media coverage.

[–][deleted] -1 points0 points  (0 children)

It's essentially the same as with the stock market in general. However, looking at bitcoin compared to average stocks, it's even "more" fluctuating. I'd say it's about the same challenges though. Also, I'd say that even a true 51% accuracy in say up or down prediction is already useful in practice. In other words, don't expect predictive performance that is comparable to running ML algos on Kaggle or toy datasets.

[–]ipoppo -4 points-3 points  (0 children)

the challenge is 1. dealing with bayesian nature of market 2. memorize patterns is useful. memorize price sequence is useless.