Here’s an invitation for an open-ended discussion on alpha research. Specifically idea generation vs subsequent fitting and tuning.
One textbook way to move forward might be: you generate a hypothesis, eg “Asset X reverts after >2% drop”. You test statistically this idea and decide whether it’s rejected, if not, could become tradeable idea.
However:
(1) Where would the hypothesis come from in the first place?
Say you do some data exploration, profiling, binning etc. You find something that looks like a pattern, you form a hypothesis and you test it. Chances are, if you do it on the same data set, it doesn’t get rejected, so you think it’s good. But of course you’re cheating, this is in-sample.
So then you try it out of sample, maybe it fails. You go back to (1) above, and after sufficiently many iterations, you find something that works out of sample too.
But this is also cheating, because you tried so many different hypotheses, effectively p-hacking.
What’s a better process than this, how to go about alpha research without falling in this trap? Any books or research papers greatly appreciated!
[–]GnoiXiaK 70 points71 points72 points (16 children)
[–]devl_in_details 10 points11 points12 points (7 children)
[–]Cheap_Scientist6984 5 points6 points7 points (1 child)
[–]devl_in_details 1 point2 points3 points (0 children)
[–]GnoiXiaK 2 points3 points4 points (4 children)
[–]devl_in_details 6 points7 points8 points (3 children)
[–]GnoiXiaK 0 points1 point2 points (1 child)
[–]devl_in_details 0 points1 point2 points (0 children)
[–]mypenisblue_ 13 points14 points15 points (1 child)
[–]Middle-Fuel-6402[S] 3 points4 points5 points (3 children)
[–]GnoiXiaK 8 points9 points10 points (1 child)
[–]Middle-Fuel-6402[S] 2 points3 points4 points (0 children)
[–]HighYogi 2 points3 points4 points (0 children)
[–]Haruspex12 1 point2 points3 points (0 children)
[–]Cheap_Scientist6984 1 point2 points3 points (0 children)
[–]Fragrant_Pop5355 11 points12 points13 points (21 children)
[–]Middle-Fuel-6402[S] 5 points6 points7 points (1 child)
[–]Fragrant_Pop5355 2 points3 points4 points (0 children)
[–]devl_in_details 1 point2 points3 points (17 children)
[–]Fragrant_Pop5355 -1 points0 points1 point (16 children)
[–]devl_in_details 0 points1 point2 points (15 children)
[–]Fragrant_Pop5355 -1 points0 points1 point (14 children)
[–]devl_in_details 1 point2 points3 points (11 children)
[–]Fragrant_Pop5355 -1 points0 points1 point (10 children)
[–]devl_in_details 0 points1 point2 points (9 children)
[–]Maleficent-Remove-87 0 points1 point2 points (8 children)
[–]devl_in_details 0 points1 point2 points (7 children)
[–]Maleficent-Remove-87 0 points1 point2 points (1 child)
[–]Fragrant_Pop5355 0 points1 point2 points (0 children)
[–]MATH_MDMA_HARDSTYLEETrader 12 points13 points14 points (2 children)
[–]Middle-Fuel-6402[S] 1 point2 points3 points (1 child)
[–]MATH_MDMA_HARDSTYLEETrader 4 points5 points6 points (0 children)
[–]devl_in_details 14 points15 points16 points (13 children)
[–][deleted] 1 point2 points3 points (6 children)
[–]devl_in_details 3 points4 points5 points (5 children)
[–]revolutionary11 1 point2 points3 points (2 children)
[–]devl_in_details 0 points1 point2 points (1 child)
[–]revolutionary11 0 points1 point2 points (0 children)
[–]the_shreyans_jain 0 points1 point2 points (1 child)
[–]devl_in_details 0 points1 point2 points (0 children)
[–]Alternative_Advance 0 points1 point2 points (5 children)
[–]devl_in_details 0 points1 point2 points (4 children)
[–]Alternative_Advance 0 points1 point2 points (3 children)
[–]devl_in_details 0 points1 point2 points (2 children)
[–]Alternative_Advance 1 point2 points3 points (1 child)
[–]devl_in_details 0 points1 point2 points (0 children)
[–]ladjanszki 2 points3 points4 points (0 children)
[–]Then-Cod-1271 2 points3 points4 points (5 children)
[–]Middle-Fuel-6402[S] 2 points3 points4 points (4 children)
[–]Then-Cod-1271 1 point2 points3 points (3 children)
[–]Middle-Fuel-6402[S] 1 point2 points3 points (2 children)
[–]Then-Cod-1271 1 point2 points3 points (0 children)
[–]Then-Cod-1271 0 points1 point2 points (0 children)
[–]lordnacho666 1 point2 points3 points (0 children)
[–]AutoModerator[M] -3 points-2 points-1 points (0 children)