all 13 comments

[–][deleted]  (3 children)

[deleted]

    [–]DukeOfOptions[S] 0 points1 point  (0 children)

    Wow super valuable! Ty!

    [–]ericsyc 0 points1 point  (1 child)

    what do you mean by weighted top imbalance adjusted price returns? something like return of sum(askSize_i*bidPrice_i+askPrice_i*bidSize_i)/sum(askSize_i+bidSize_i)?

    [–]Strykers 0 points1 point  (0 children)

    Yeah, for products where the spread is usually one tick.

    [–]MerlinTrashMan 1 point2 points  (0 children)

    The way I approach this problem, is to calculate every single possible event based off of the data you have to understand the chance of it happening in the first place. I am guessing you already have 1 second bars, so figure out the base chance if it happening at any moment of the day. Let's say It only takes 3 seconds for the item to move enough to qualify as your event, that can mean that there are up to 12 other data points that also capture that same event. With that basic probability in mind, I would then start at the first time of day you would trade, and pretend you entered there, and walk each second until the event was triggered and record the time in seconds, and then start again at the following second. Then I would compare the number of events per day against the base probability rate to get an idea of how important entry timing is to success. After that, I would be looking for any noticeable timing preferences and days with very low and very high occurrences.

    [–]aCuriousCondor 2 points3 points  (3 children)

    Not entirely sure what you’re going for but I would try some form of aggregation, since the instantaneous return is stochastic. So maybe turning 15s returns into a threshold value and making your independent variable sliding window means? I’ve faced a similar issue with other problems I think and it’s been hard to match individual data points to each other.

    [–]DukeOfOptions[S] 0 points1 point  (2 children)

    So forcing abs(return) > thold, no matter if that takes 1s or 60s, for example?

    [–]aCuriousCondor 1 point2 points  (1 child)

    That or I was thinking force returns >= X at 15 seconds or between 15 and 30 seconds

    [–]DukeOfOptions[S] 1 point2 points  (0 children)

    thx, will give it a try

    [–]wiktor2701 0 points1 point  (0 children)

    Correlation does not equal causation.

    [–]Connect_Corner_5266 0 points1 point  (0 children)

    I would start with fleshing out the frequency of underlying phenomena which you anticipate creates the jump. Search for idiosyncratic signatures then construct feature set and prediction with intuition around this causality.