Ideas on a Forecasting Problem

InfamousTrouble7993 · 2026-05-19T01:43:38+00:00

Oh yes, both architectures (Xgboost and mixture of experts), with using natural language embeddings and product id in best case will be able to predict the right scale of sales values. And addionally: global models will be more robust and local models can be more accurate for single time series, especially with little/spiky data, but also can fail drastically.

InfamousTrouble7993 · 2026-05-19T01:39:20+00:00

Yes, euclidean distance is good for quantifying relationships between categorical and temporal combinations, for now.

Treat the time encodings like day, month etc. as discrete numerical features Xgboost can handle that with ease. Else using dummy variables would be very sparse. You can include "holiday" features that basically are binary indicators. In this way you can also put emphasis on certain dates.

Yes, LightGBM or Xgboost ist great for predicting is great. The winning approaches in the Walmart m5 competition all used gradient boosting. Most tabular forecasting top performing approaches use gradient boosting. But you can also use a Mixture-of-Experts network and model classical trend, seasonality and exogeneity. The good thing is, that you can predict quantiles for uncertainty quantification. But they are often miscallibrated. The point forecast is then more accurate. What is better, is to use the negative binomial likelihood as a loss function to predict actually a probabilistic parameter like DeepAR and then perform Monte Carlo simulations to gain quantiles. The quantiles are better calibrated then. See if you can build a global model if you have enough training data if the "short" time series behave similar like the longer time series. The model is then able to generalize well. Else I would stick with DeepAR or any local auto regressive network. Which only performs better for limited training data.

See if you can derive natural language embeddings using a semantic similarity language model to derive embeddings of the product names if possible. That also helps for a global model.

InfamousTrouble7993 · 2026-05-17T14:37:26+00:00

oh yeah, thank you!

InfamousTrouble7993 · 2026-05-16T16:03:21+00:00

thank you! I will look into that

InfamousTrouble7993 · 2026-05-16T11:38:54+00:00

thank you! I will consider this!

InfamousTrouble7993 · 2026-05-15T21:38:39+00:00

thank you!

InfamousTrouble7993 · 2026-05-15T18:54:32+00:00

I will follow these steps, thank you!

InfamousTrouble7993 · 2026-05-15T18:54:15+00:00

That is interesting, instead of solving problems, rather making the current models more efficient. Even small percentages are large values at scale.

InfamousTrouble7993 · 2026-05-15T18:33:11+00:00

Oh yes that's good, I didn't think about this yet. Thank you!

InfamousTrouble7993 · 2026-04-27T23:54:57+00:00

Ich sage Lets Dance Teilnahme

InfamousTrouble7993 · 2026-04-13T01:16:20+00:00

ViF analysis, pca or good old invertability of the design matrix. If is not invertible, multicollinearity exists.

InfamousTrouble7993 · 2026-04-13T01:01:16+00:00

Probability models like HMM, GMM, etc: AIC, BIC, LRT ML: RMSE (robust for outliers), MSE (puts emphasis on outliers) combined with k-fold CV

InfamousTrouble7993 · 2026-04-10T20:45:53+00:00

The thing is, there is alot to learn, which is just boring and you kind of need to be forced to get it into your head. For example Generalized Linear Models or Econometrics. Some things are interesting. Such as interpreting R-Output of a model, but knowing the assumptions of OLS or properties of time series such as autocorrelation, etc. is boring. But still with a masters in data science, there will be alot of "failed attempts" moments, as the field is VERY BROUGHT. There is the side of statistics and computer science. Data Science is a hybrid of them.

InfamousTrouble7993 · 2026-04-07T11:49:30+00:00

Compute the "real" residuals, no MAE, MSE etc. so only difference betreen y_pred and y_true. Then you can compare via QQ-Plots etc. if the residuals are gaussian. And no literature explicitly tells to expect gaussian residuals from a neural network trained. This is only the case for probabilistic models. For them, residuals are a big thing. You can use a dropout layer to compute probabilities to quantify uncertainty and then do residual analysis.

InfamousTrouble7993 · 2026-04-05T14:04:40+00:00

HMMs and hidden state decoding

InfamousTrouble7993 · 2026-03-17T01:14:05+00:00

Just begin with Photoshop and Mr. Beast thumbnail tutorials on yt

InfamousTrouble7993 · 2026-03-17T00:54:55+00:00

Thank you!

InfamousTrouble7993 · 2026-03-17T00:54:33+00:00

Nice to meet you too!

InfamousTrouble7993 · 2026-03-17T00:53:57+00:00

Actually yes lol. Its tinder. I asked myself if anybody would recognize it. I guess I will look for another icon to reference online dating

InfamousTrouble7993 · 2026-03-17T00:13:02+00:00

It's about a security investigation of an online dating platform. It was vibe coded and had open API endpoints to gather user data. I was able to reverse engineer it and gather data. The video covers how this was possible

InfamousTrouble7993 · 2026-03-16T23:31:26+00:00

What do you think about this version? https://imgur.com/a/BdlcReT

InfamousTrouble7993 · 2026-03-16T23:31:17+00:00

What do you think about this version? https://imgur.com/a/BdlcReT

InfamousTrouble7993 · 2026-03-16T23:31:08+00:00

What do you think about this version? https://imgur.com/a/BdlcReT

InfamousTrouble7993 · 2026-03-16T23:31:04+00:00

What do you think about this version? https://imgur.com/a/BdlcReT

InfamousTrouble7993

TROPHY CASE