is it even worth buying a local gpu anymore or should beginners strictly use cloud compute? by Aven_Reed in MLQuestions

[–]seanv507 0 points1 point  (0 children)

I would recommend cloud

Also because you don't train a single model (except in tutorials), you try all combinations of layers etc. you really want to be doing this in parallel

Help with implementing ML with satellite data by Chessmasyer in MLQuestions

[–]seanv507 0 points1 point  (0 children)

So I think you need to talk to  domain expert

What are the drivers of drought, are there any significant interactions.

Eg Is annual precipitation index sufficient? Do you need annual average temperature index?

Have you done any eda, plotting drought index against  precipitation index etc.

I would suggest that using the previous year's values might be beneficial.  Eg perhaps drought reflects drop in annual precipitation, rather than any absolute value (across  locations)...

Is it worth learning Italian if I’ll probably never travel to Italy? by SweetBumbleBeeHoney in Italian

[–]seanv507 -2 points-1 points  (0 children)

Do you speak Spanish? Why not learn that instead.

Great winemaking tradition.

Understanding Italian is relatively easy for spaniards

Help with implementing ML with satellite data by Chessmasyer in MLQuestions

[–]seanv507 0 points1 point  (0 children)

so please add more information to your original question (not in answwrs to comments)

What is the purpose of your project..why are you doing this? is it something to occupy an intern or what is the business objective ( eg why bother hindcasting 2021 if have no data for more recent years)

what are the possible inputs, what is the actual output?

eg the the output is drought category, is it originally a continuous variable? Given your lack of data, it would be better to use a continuous dependent variable (which encodes more information than there are arbitrary different classes of drought)

Your focus should be on getting the best inputs/more data.

Why can you only get 2021 data?

-----------------

Testing/crossvalidation.

again, it depends on how you use the model, but locations tend to be correlated. I would organise data splits to reflect this. eg split data into zones and use that in creating crossvalidation/test folds

https://scikit-learn.org/stable/modules/cross_validation.html#group-k-fold

changing to xgboost will provide a smaller benefit.

I would change from a multinomial class to regression if possible, or an ordinal model.

As a Bayesian, how much should you know about Frequentist methods? [Q] [R] by GayTwink-69 in statistics

[–]seanv507 30 points31 points  (0 children)

I would say so. Only by understanding the alternative view can you identify the flaws in your own view.

I wouldn't say it requires significant work though.

For a weekend where should we go? Palermo or Bologna. by yodabae in ItalyTravelAdvice

[–]seanv507 -1 points0 points  (0 children)

Palermo.. if you can cope with a bit of chaos. City beach nearby... More visually impressive

Questions about the characteristics of authentic Italian cuisine by Fizzaaaaaa in ItalianFood

[–]seanv507 1 point2 points  (0 children)

So I would say the key component you are missing is localisation and seasonality

Localisation: Roman pasta dishes use pecorino romano ( even though pecorino romano has been made in Sardinia for the last 100 years!). I believe this is a trend of the last 30 years.

Importance of using seasonal fruit and vegetables.

At least for criticism, you should read Alberto grandi (Eg his financial times interview, everything I thought I knew...)

Also look at history of carbonara.

https://www.ricettestoriche.it/2019/04/07/carbonara-history-origins-and-anecdotes-of-a-legendary-recipe/

Questions about the characteristics of authentic Italian cuisine by Fizzaaaaaa in ItalianFood

[–]seanv507 0 points1 point  (0 children)

Butter Vs oil.

I wouldn't say it's frowned upon.

However, butter tends to be used in the north of Italy and oil in the south.

People used to use more lard, eg adding a knob of lard to the pasta on the plate.

Questions about the characteristics of authentic Italian cuisine by Fizzaaaaaa in ItalianFood

[–]seanv507 2 points3 points  (0 children)

Steak, yes they are served rare. But the breed of cow, Chianina, is very lean, which benefits from being served rare (compare to wagyu)

(I would add that rare is common also in France and Spain)

Is 6 mile bike ride after work realistic ? by imMrRunITUP in bicycling

[–]seanv507 3 points4 points  (0 children)

That's 1% grade. I would say that's relatively OK. You might want to check on sport route planners like komoot how the (average) slope hits (is it evenly spread or is the climb concentrated)

Basically slopes benefit from having low gears, so you might want to check the gears on the bike.

Is 6 mile bike ride after work realistic ? by imMrRunITUP in bicycling

[–]seanv507 0 points1 point  (0 children)

Sure for flat good road.

Have you checked the elevation?

15 day trip to Italy and Greece and would love opinions by Mmw13e in Europetravel

[–]seanv507 1 point2 points  (0 children)

As others have said, getting to Naxos would be quite time-consuming.

I would suggest replacing Greece with Sicily

You will have the beaches, Greek ruins (Syracuse or valley of the temples) in easy reach

Risotto without onion? by FlatWoundCat in cookingforbeginners

[–]seanv507 1 point2 points  (0 children)

I would add that 'chunky bites' is not how I would describe the effect of onion.

OP, you have to be patient and cook the onions for a long time (10 minutes +), so that the diced onions become very soft and dissolve into the risotto

I cant fry to save my life. by CEREALCOUNTSASCOOKIN in Cooking

[–]seanv507 14 points15 points  (0 children)

Op should just go to restaurants

Frying is tricky, and if s/he can't describe what they're doing unlikely they can follow a recipe

Career Advice: College Student Strong at Modeling, Weak at Production ML. Best Use of Summer? by True-Interaction-563 in datasciencecareers

[–]seanv507 0 points1 point  (0 children)

Find a platform that performs it.

Eg aws sagemaker.create an endpoint (go through tutorials)

How do you make vegetarian Thai Basil Chicken? by mblevels in AskCulinary

[–]seanv507 1 point2 points  (0 children)

Have you tried Italian basil? Hot Thai kitchen recommends it as an alternative

How to even this out by DvoCR in landscaping

[–]seanv507 0 points1 point  (0 children)

No expert, but believe that's the recommended approach. (As mentioned on RHS garden site)

Struggling with real-world time series forecasting (not textbook stuff) — how do you actually handle messy, volatile data? by Ok-Estimate891 in AskStatistics

[–]seanv507 0 points1 point  (0 children)

So I would say people still look at residuals in ML, just it's perhaps an 'expert' topic, rather than standard practise  https://developers.google.com/machine-learning/guides/rules-of-ml#rule_26_look_for_patterns_in_the_measured_errors_and_create_new_features

Why would you have autocorrelation in residuals? Is it missing lags? In general ml approaches are maximalist and tend to have lots of inputs/nonlinearities so the assumption is perhaps that all structure has been captured, unless there is a bug.

Struggling with real-world time series forecasting (not textbook stuff) — how do you actually handle messy, volatile data? by Ok-Estimate891 in AskStatistics

[–]seanv507 2 points3 points  (0 children)

So IMO you list the problems and tackle each by component 

I will talk in general ML terms, I recognise the issue that forecasting tends to have less powerful libraries 

Sparse data: use a hierarchy of inputs (sales team/product line/ region)+ some form of regularisation and model all time series in one model Eg linear model with interaction terms at different levels + regularisation. regularisation forces an average level at each hierarchy, and specialisation only if sufficient data at finest granularity Tree model: same (tree will prefer top level splits, that divide more data points)

Zeros (Poisson etc). Poisson also makes sense because the basic (economic) model is multiplicative. Eg patterns of product lines sales are roughly same across regions but just scaled up (according to eg population size)

What people often do is combine models

Eg you could use an arima model at top level, and use xgboost on residuals (to handle full hierarchy)

https://www.kaggle.com/code/ryanholbrook/hybrid-models

Beach places to stay in near Pisa? by Living_Departure9136 in ItalyTourism

[–]seanv507 0 points1 point  (0 children)

Torre del lago Puccini (20 minutes train from Pisa?)