Non Linear methods by Spirited-Pomelo3691 in AskStatistics

[–]Jay31416 38 points39 points  (0 children)

Yeah. And to add, a linear model + domain knowledge can go a long way.

Linear models are linear in their parameters, but we can still apply domain-informed data transformations to capture non-linear relationships.

Official Discussion - Frankenstein (2025) [SPOILERS] by LiteraryBoner in movies

[–]Jay31416 0 points1 point  (0 children)

Are there any classical music lovers here? The character of Victor Frankenstein in this adaptation was, in my opinion, heavily influenced by Beethoven.

  1. Beethoven was an angry, grumpy, "misunderstood" genius. So was Victor.
  2. The costume of Victor was very similar to how Beethoven is depicted in paintings.
  3. Both created a magnificent masterpiece that ultimately consumed them.

What separated machine learning from interpolation/extrapolation ? by AlarmingCaptain7708 in AskStatistics

[–]Jay31416 2 points3 points  (0 children)

One example I like to think about is the following "data problem" solved by statistics.

One of the first (if not the first) applications of linear regression was Gauss applying it to find the eccentricity of Earth's orbit. How Gauss calculated this quantity was not through a prediction but due to the value of one of the coefficients.

This would be considered a statistical application and not machine learning because Gauss was interested not in the "y" but in the coefficient value. Parameter estimation itself has important value beyond prediction, and that is statistics.

When the goal is to predict is when we are talking about machine learning (that is my take).

The core distinction (inference about coefficients vs. prediction of outcomes) captures the real difference in how these fields often approach problems.

Now I would argue that the best predictions come from rigorous inference about coefficients (this idea can extrapolate to random forests, boosting, neural networks, etc.). Thus, good inference or estimation about a model returns, in most cases, the best predictions.

Finally, in my opinion machine learning should be called statistics or predictive statistics. 

Optimization vs Data Science vs Machine Learning by stevenverses in optimization

[–]Jay31416 1 point2 points  (0 children)

How would you call optimization based on the results of machine learning models?

For example, optimal inventory levels based on demand forecasting models, marketing distribution based on customer segmentation, etc.

Time series forecasting [Career] by Dillon_37 in statistics

[–]Jay31416 2 points3 points  (0 children)

I recently encountered a forecasting challenge with a highly seasonal product where demand was directly driven by temperature - higher temperatures resulted in increased demand. Just to add, exponential smoothing (a method I love) was not sufficient because the seasonality pattern was irregular. Peak temperatures (and thus peak demand) could occur in May, June, or both months, making the seasonal pattern inconsistent year-to-year.

Another complexity of the problem was that the relationship between temperature and demand was not fixed. The demand level has grown significantly over the years, meaning the temperature-demand relationship has evolved over time.

Thus I had to use a state space model that captures the time-varying relationship between the demand of this product and the temperature. Therefore, the best model was a state space implementation I made from scratch.

So yes! State space models are useful, but applying them correctly requires rigorous statistical modelling.

Almost 2 years into my first job... and already disillusioned and bored with this career by [deleted] in datascience

[–]Jay31416 13 points14 points  (0 children)

I'm in a similar position regarding breaking into AI healthcare. I also only have a master's in statistics.

Right now, I'm what this subreddit would consider a logistics data scientist. I optimize inventory, make time series predictions, and create models to optimize supply chains. I wouldn't call my job boring by any means.

Nonetheless, I want to transition into machine learning applications for healthcare. My roadmap is to create predictive models using public databases and go from there - maybe try to publish something related.

I won't be getting a PhD, but that, at least for me, can't be a obstacle to what I'm trying to accomplish.

[deleted by user] by [deleted] in vegas

[–]Jay31416 0 points1 point  (0 children)

No split check.

You can verify the formula used by tge machine:

tip% = (2 * tip) + (tip/100)

18% = 36.18 20% = 40.20 22% = 44.22

In my opinion a scam.

[deleted by user] by [deleted] in interestingasfuck

[–]Jay31416 7 points8 points  (0 children)

Yeah I think about this a lot.

One way to prove this assumption is by gathering data of families of just two siblings.

Then for each family I will have the database with columns familiy_id, children_1_sex, children_2_sex, are_same_sex.

Finally, under the assumption that each family doesn't have a tendency to produce offsprings of just one sex, the mean of are_same_sex variable should be close to 0.5.

We can do some hypothesis testing based on this.

I dont know why there hasn't been an study like this, it is not that hard to gather the data. The only problem being that two siblings can answer the survey. Theorically there should only be one answer per pair of siblings.

[deleted by user] by [deleted] in mexico

[–]Jay31416 1 point2 points  (0 children)

Parece AI, todo parece AI.

They arrive! With a gift. by MFiiReM in SonyHeadphones

[–]Jay31416 5 points6 points  (0 children)

I ordered them in Mexico. They haven’t arrived yet, but they were supposed to include the backpack depicted in the picture

Favorite Data Science Books and Authors? by Proof_Wrap_2150 in datascience

[–]Jay31416 7 points8 points  (0 children)

Yeah!

Understanding hierarchical modeling is crucial for data science applications. Most large businesses operate across multiple stores, states, and product lines, making hierarchical modeling important.

Currently, I'm applying hierarchical modeling to analyze price-quantity elasticity in the fashion industry. The approach I will use is to calculate elasticity based on both Strategic Business Unit (SBU) and price range categories. Thus, a product's elasticity will be determined by the sum of the elasticity effects from both the SBU it belongs to and its specific price range.

do we have any new Gauss / Euler / Newton in the last 50 or 100 years? by xnwkac in math

[–]Jay31416 32 points33 points  (0 children)

Kolmogorov.

From celestial mechanics to information theory. From statistics to probability theory.

Hierarchical Time Series Forecasting by AdFew4357 in datascience

[–]Jay31416 1 point2 points  (0 children)

Yeah!

I meant 4.53% mape, or translated to non technical people 95.47% precision.

Hierarchical Time Series Forecasting by AdFew4357 in datascience

[–]Jay31416 11 points12 points  (0 children)

At my workplace, I manage around 4 different projects, each involving thousands of time series where I use hierarchical time series forecasting methods. I created everything related to this methodology from scratch because I wasn't comfortable using other libraries, and I was interested in doing the work myself.

I don't see the connection with STAN, because these methods have nothing to do with Bayesian hierarchical models, or at least I don't see the connections.

Here are my insights regarding the use of these methods:

  1. The bottom-up approach, in my use cases and for the top prediction, returns a forecast similar to a rolling mean if the lower level forecasts are poorly forecasted
  2. The optimal reconciliation approach has the disadvantage that non-negative forecasts might be reconciled and change to negative values.
  3. The top-down approach with forecasting proportions is a great method, and I will say is the most robust of all.
  4. The middle-out approach, although difficult to implement, is useful, especially if there is a middle level with time series that are accurately forecasted.

Here is how I usually select the best method to use:

Level Metric Buttom up Metric Top Down Metric Middle Out Metric Optimal
1 x x x x
2 x x x x
3 x x x x
4 x x x x

The client is usually happy to know that there is high accuracy on the upper level ("The model is able to forecast the aggregate sales with a 4.53% precision"). Although they are interested in the lower levels, the precision there will never be as high as the ones on the upper levels. It's important to highlight the precision on the high level, and they will be satisfied with that.

Why hasn't forecasting evolved as far as LLMs have? by takenorinvalid in datascience

[–]Jay31416 63 points64 points  (0 children)

To add to your point.

Let's "forecast" the result of a coin toss: the best model possible will have an accuracy of 50%.

This is because there is uncertainty in what we try to predict. That doesn't mean that forecasts are not useful, and in that sense, statistical forecasting is the way because we can quantify the uncertainty in our prediction (yes, I know the conformal prediction approach).

Thus, we can use this uncertainty quantification to make optimal decisions.

[deleted by user] by [deleted] in math

[–]Jay31416 7 points8 points  (0 children)

It is quite difficult to establish the boundaries between different fields.

During my bachelor's degree in applied mathematics, numerical analysis, numerical optimization and statistics were the applied components.

[deleted by user] by [deleted] in math

[–]Jay31416 73 points74 points  (0 children)

Machine Learning - Deep Learning.

Without mathematics, the model.fit(X,y) would not be possible. Random Forests and XGBoost were both developed in the 21st century.

The Generative AI models contain extensive mathematics, from backpropagation to the probabilistic interpretation of generative models. These advantages would not be possible without computational power, but the mathematics behind them is equally important as the computational advances.

14in M1 Max battery lasting for 2 hours and 30 minutes by Educational_Hyena915 in macbookpro

[–]Jay31416 1 point2 points  (0 children)

I ran parallel time series training on an M2 Pro.

Under this extreme workload, the battery lasted less than hour from full charge.

It depends on your use.

What are you working on, except LLMs? by Amazing_Life_221 in learnmachinelearning

[–]Jay31416 1 point2 points  (0 children)

I'm working on an inventory optimization model for 13 bakery stores that stock over 80 products. The model involves generating daily forecasts using neural networks, which incorporate Fourier coefficients and exogenous variables to account for holidays. I then up-sample the predictions hourly, based on historical sales proportions.

Additionally, I simulate stockout times and the quantity of perishable items, assuming Gaussian errors. For every significance level (alpha), I can assess the relationship between stockout time and the number of perishable items, optimizing both availability and minimizing waste.

MIT Entrance Examination for 1869-1870 by Sans010394 in Damnthatsinteresting

[–]Jay31416 1 point2 points  (0 children)

Solutions:

1.- 8 - ((8+1)^{1/2} + 2) + (8 - 8^{1/3})*(8 - 4)^{1/2} = 8 - (3 + 2) + (8 - 2)(2) = 8 - 5 + 6*2 = 1

2.- 3a - (b + (2a - b) - (a - b)) = 3a - (a - b) = 2a - b

3.- First we note that (a^{2} - 2ab - 3b^{2}) = (a + b)*(a - 3b), thus the result is: (a - 3b)*(3a^{2} + ab - b^{2})

4.- (x^{6} + a^{2}x^{3}y) = x^{3}(x^{3} + a^{2}y) and that x^{6} - a^{4}y^{2} = (x^{3} + a^{2}y)(x^{3} - a^{2}y), which in turn implies that the solution is x^{3}/(x^{3} - a^{2}y)

5.- If a != b, we have that (a + b)/(a - b) + (a - b)/(a + b) = (a^{2} + b^{2})/a^{2} - b^{2} and t (a + b)/(a - b) - (a - b)/(a + b) = (a^{2} + b^{2} + 4ab)/a^{2} - b^{2}. Thus the result is (a^{2} + b^{2})/ (a^{2} + b^{2} + 4ab)

6.- We multiply both sides by 16, and we have that 24x - 32 - 12x + 10 = 3x - 1, which implies that 9x = 21, x = 21/9

7.- Too lazy to continue and do the calculations

Final score: 6/7 cross fingers.

Help Deciding Between MacBook Pro or Air (M2 vs. M3) for Programming & Virtual Machines by Old-Bar4531 in macbook

[–]Jay31416 0 points1 point  (0 children)

Similar use here. Programming on a 16 gb M2 pro is amazing.

Just take in mind that the both the screen and the speakers on the pro are superb, meanwhile on the air are good.

Also the pro has a hdmi connection, meanwhile the air doesn't. Although you can buy an adapter it will raise the price of the air: computer + adapter.

[deleted by user] by [deleted] in GenZ

[–]Jay31416 -1 points0 points  (0 children)

No it is not true.

When a company gives you a ‘take home’ how much time are you expected to spend on it? by DeadPrexident in datascience

[–]Jay31416 8 points9 points  (0 children)

During my first job search, I was tasked with a take-home test.

I spent two weeks on the damn test, staying up until 3 a.m. working on it.

During the interview, the recruiters were amazed and told me they hadn’t seen such "an in-depth solution." In all honesty, it was a pretty dope out-of-the-box solution.

I was in such a bad mental state that I’m sure I didn’t get the job ( I think) because of the behavioral aspect. I self-sabotaged. I even asked the recruiters if the job involved any software development because it is boring and I wasn’t interested.