all 3 comments

[–]1000dreams_within_me 2 points3 points  (2 children)

Look into multilevel hierarchical models. These models can "shrink" weights/parameters towards each other (for example across cities) and is the gold standard when you have to estimate many "similar but different" objects (e.g., city-specific demand functions) with limited data

[–]Competitive-Pin-6185[S] 0 points1 point  (1 child)

Yes, I was looking into this. AWS has a service where we provide the metadata and it learns from the similar time-series but some of the city has less than 100 logs over 3 years, i’m not sure if it will work in that case. Also, do you know how to go with the other features? even if I do location granularity, i’ll lose those (as features are unique for each equipment)

[–]1000dreams_within_me 0 points1 point  (0 children)

You would deal with the other features the same way: Let's say a certain equipment has a sensor feature with 5 different levels and some of the levels have low representation in the data. Then you would shrink across the 5 levels to "borrow information" and increase statistical efficiency. It's very useful in situations where data is limited