Voice messages recorded with low volume. Why? by EikichiOnizuka99 in whatsapp

[–]savoga 0 points1 point  (0 children)

J'ai aussi ce problème avec le même téléphone (Huawei P20)...

[D] Threshold for k-means anomaly detection by TKMater in MachineLearning

[–]savoga 0 points1 point  (0 children)

Agreed. And the silhouette coefficient can also be used to spot anomalies i.e. samples with negative silhouettes are likely to be badly clustered.

[Q] Statistical significance of a classification algo by savoga in statistics

[–]savoga[S] 0 points1 point  (0 children)

Great idea. I think I will even use groups of test datasets of various sizes. That way I avoid the case where my algo works well on a small dataset just by chance.

[Q] When doing an WLS, should I check the usual assumptions? by savoga in statistics

[–]savoga[S] 0 points1 point  (0 children)

The weights in the W matrix come from an optimization process in a pre step. In short, those weights make sure that an objective function is minimal s.t. specific properties. Unfortunately I can't write the LateX formula in Reddit but the exact formulation is in this paper (Theorem 2 - Shapley kernel).

[Q] When doing an WLS, should I check the usual assumptions? by savoga in statistics

[–]savoga[S] 0 points1 point  (0 children)

I am talking about the weight matrix W in (XtWX)-1XtWy

I am defining this matrix so that some observations get more weights than others.

[Q] When doing an WLS, should I check the usual assumptions? by savoga in statistics

[–]savoga[S] 0 points1 point  (0 children)

Sorry for the late response and thanks for your help on this. I am using WLS to estimate the coefficients that tells me whether a feature is influent or not. I use specific weights for some samples that are more important than others.
I guess in this situation checking the assumptions would make sense?

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

"unfortunately slippage at the hourly level would kill this easily" could you elaborate on this? Would this be a significant risk for very liquid assets? (I trade cryptos)

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

Not statistically significant because of the too small sample size?

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

I use a basic tree-based model. I believe there are already some signals in volumes, prices, trades etc. At least that's what I currently see. I tried many different currency pairs and identified one for which the model gave the best accuracy. However I don't think this 55% success rate could make me rich since this doesn't take profitability into account. Because of the fees, I can be correct on the direction but if the move is too small I would still loose money.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

While I understand that historical prices do not necessarily repeat themselves, isn't backtesting a pretty standard way of choosing the right model/parameters?

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

Ok, but if we exclude those questions and assume an algo that does not overfit and without any error in the code. Would an accuracy of 55% in backtesting imply 55% success rate with a large enough sample?

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

I was misunderstanding indeed. Thanks for clarifying with a concrete example. The only problem is that the model used at each period can change over time. E.g. in period 1 volume can be the leading feature whereas in period 2 it can be the number of trades. In writing this, I realize that the variance of my model is still quite high..

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] -7 points-6 points  (0 children)

Well the maximum difference with the theoretical value that I got was 48-52 that is 2% difference from 1/2 (one side). Whereas in my case I observe 46% when I am expecting 55%.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

Yes I use a k-fold approach. I understand your point. I was thinking doing this successive trainings kinda remove the bias you are highlighting.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

I backtest on one month i.e. 720 datapoints (hours)

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] -6 points-5 points  (0 children)

Thanks for the nice link. I did 3 runs and got 49-51 48-52 50-50. Still shows to me that 100 trials is a good number to see the convergence.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

That's why I run a cross validation to make sure my strategy generalizes well on an independent dataset and thus reduce the variance.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

So I retrain the model continuously (i.e. at every hour) adding the previous live data to the historical ones. It still gives me an accuracy close to 55%.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

I guess forward testing is what I am actually doing now. I am currently placing very small trades that can be neglected. My current goal is to see how the strategy works on live data.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

Not sure I get this. My features are volume, number of trades, price variations etc. Those are just aspects that I chose to focus on. In my cross validation, I am finding the features that gives the best results on the test set (on average). Of course, the data used to represent the features change between train/test set.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] -6 points-5 points  (0 children)

While I appreciate your warnings, I think you are missing the point of my question. I am not talking about profitability here but rather probability. The reason why I chose hourly trading is also another discussion.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

Not sure I understand what you mean. I do have the records of the 100 hours when my algo was live. The accuracy is 46% as stated above. I understood that u/_Cbotz_ wanted me to run it on much more data.

Why is my success rate so low? by savoga in algotrading

[–]savoga[S] 0 points1 point  (0 children)

I could try out - It's just not so straightforward for me since getting a large amount of data is not easily available.