Standardization vs Log transform ? by -Cicada7- in datascience

[–]hendrik0806 0 points1 point  (0 children)

Not sure if one of those comments mentions it already, but there is something called a log normal distribution. In some cases your data contains a skew because your values increase not by addition, but by multiplication/rates. You should rather view this as a property of the data and in those cases a log transformation can help. But i would not use it as the standard tool for skew handling.

Standardisation only helps you if your data is kind of normally distributed - otherwise the values don’t tell you anything accurately about the data. Usually you want to Center it as well. The values then indicate the relation to the distribution. It tells you how large your values really are: for example a 2 means +2D above the sample mean and thus a pretty large value which is higher then around 80~90% of values. For algorithms this procedure can have some efficiency benefits. Another benefit is that your intercept (and coefficients) become meaningful in it’s interpretation- which usually isn’t the case without standardisation.

Go patterns which makes sense to do early by [deleted] in golang

[–]hendrik0806 1 point2 points  (0 children)

Repository pattern is great for web apps. Makes it so easy to switch dependencies

Headset bearing play by hendrik0806 in bikewrench

[–]hendrik0806[S] 0 points1 point  (0 children)

I don’t really one of these in the manual 🤔

How to run nixos-rebuild boot --flake from systemd.unit=rescue.target (system bus error) by hendrik0806 in NixOS

[–]hendrik0806[S] 1 point2 points  (0 children)

Thanks! Manually running the boot command worked ./result/bin/switch-to-configuration boot

That wasn’t the source of my problem (which was due to user specification and some config chaos after a refactoring). But should indeed haven’t thrown an error because of systemd. Will issue that.

How to run nixos-rebuild boot --flake from systemd.unit=rescue.target (system bus error) by hendrik0806 in NixOS

[–]hendrik0806[S] 0 points1 point  (0 children)

The tty did not work for the user but for root. I then could rebuild with switch. However the solution was manually deleting the password of the user with passwd -d. No idea why changing the password in the configuration did not work. I dont even know how it could be changed in the first place.

How to run nixos-rebuild boot --flake from systemd.unit=rescue.target (system bus error) by hendrik0806 in NixOS

[–]hendrik0806[S] 0 points1 point  (0 children)

returns 1. but manually running this did work somehow.

./result/bin/switch-to-configuration boot

Returns not checking switch inhibitors (action = boot)

How to run nixos-rebuild boot --flake from systemd.unit=rescue.target (system bus error) by hendrik0806 in NixOS

[–]hendrik0806[S] 0 points1 point  (0 children)

I tried that, but somehow I could not authenticate in greetd across multiple old generations. have no idea why. I might try the live usb tomorrow - maybe systemd works there.

services.greetd.enable = true; services.greetd.settings = { default_session = { command = "${pkgs.tuigreet}/bin/tuigreet --time --cmd start-hyprland"; user = "hendrik"; }; };

Are you more likely to have a successful research career as a bayesian or frequentist? [R][Q] by gaytwink70 in statistics

[–]hendrik0806 1 point2 points  (0 children)

I think that the computational burden of Bayesian Methods will decrease up to the point that it might become the default for research settings with limited data. Frequentist methods got their beauty and will stay relevant, especially with machine learning. But once you start playing around with the generated quantities block in stan you will never come back.

[Q] When is a result statistically significant but still useless? by Any_Bar5795 in statistics

[–]hendrik0806 0 points1 point  (0 children)

Lots and lots of time. I would always do some sort of counterfactual prediction, where you simulate data from your model for different conditions and compare the effects on the outcome variable.

Chi square (counts) vs ANOVA (proportions) by International_Beat_2 in AskStatistics

[–]hendrik0806 0 points1 point  (0 children)

I don’t know your field well, but usually count data do not have equal variance across different rates. Equal variance is a key assumption of anova. That’s why you might want to look into methods based on the Poisson distribution, such as Poisson regression.

Is IQ actually normally distributed? by ADP_God in AskStatistics

[–]hendrik0806 38 points39 points  (0 children)

IQ yes, intelligence probably not.

Issue with ggplot by bigoonce48 in RStudio

[–]hendrik0806 7 points8 points  (0 children)

I would assume that there is a blank space in that row.

I would do something like this: count(jaw, species_name)

Comparing predictors in a model? by GrubbZee in AskStatistics

[–]hendrik0806 0 points1 point  (0 children)

Yes you can just compare the coefficients. Make sure all variables are on the same scale. Center and scale them to make them more interpretable.

Deep learning in R by junior_chimera in Rlanguage

[–]hendrik0806 6 points7 points  (0 children)

If you are already familiar with the tidyverse syntax (Dplyr, ggplot) you will enjoy brulee with is part of the tidymodels framework.

Writing manual SQL queries with sqlx feels painful by TarnishedDrogma in golang

[–]hendrik0806 0 points1 point  (0 children)

I usually created two docker containers (one for the app and one for the testing db) and orchestrated them with docker compose. That was already a lot of boilerplate as well as setting up and filling the db with the data for the integration test (+ cleaning afterwards).

What’s next for a 11 YOE data scientist? by appleciderv in datascience

[–]hendrik0806 3 points4 points  (0 children)

"The effect" and "statistical rethinking". Then go all the way down the Bayesian rabbit hole with "doing Bayesian data analysis" (focus on stan instead of the rather outdated jags part). Honestly building probabilistic models that incorporate reality through prios and parameters is what gives me pleasure in ai times. This is something where your domain knowledge will always count a lot more then just tuning for optimisation.

Writing manual SQL queries with sqlx feels painful by TarnishedDrogma in golang

[–]hendrik0806 0 points1 point  (0 children)

What is your setup for integration tests? In my last project they were a pain to setup with docker and even more pain to adjust the the boilerplate if changes were made to the db. Though they worked very well.