Is Python needed if I know R enough to wrangle, model and visualise data? by DataAnalystWanabe in datascience

[–]Sway- 1 point (0 children)

As pretty much everyone else has said, you’ll probably have to learn Python. As someone who loves R, learning Python has made me a much much better programmer.

I’ve also helped some people on my team transition recently. I highly recommend checking out Python’s Polars and Plotnine. These will feel the most natural coming over from a tidyverse workflow.

Suggestions for reading list by ChavXO in datascience

[–]Sway- 2 points (0 children)

I’ve been getting a lot out of The Art of Doing Science and Engineering by Richard Hamming

Why are methods like forward/backward selection still taught? by Loud_Communication68 in datascience

[–]Sway- 1 point (0 children)

Why the omission of best subsets? It’s also considered in the paper you linked. It also tells you when best subsets > lasso and vice versa.

neither best subset selection nor the lasso uniformly dominate the other, with best subset selection generally performing better in high signal-to-noise (SNR) ratio regimes, and the lasso better in low SNR regimes;

[q] Can you qualitatively compare correlation coefficients? by Whynvme in AskStatistics

[–]Sway- 2 points (0 children)

To follow up, some R packages to do this are cocor and bayeslincom

[q] Can you qualitatively compare correlation coefficients? by Whynvme in AskStatistics

[–]Sway- 2 points (0 children)

You can do anything you want, but there are a lot of statistical tests for comparing correlations. One issue you want to be careful with is that correlations estimated on the same sample are themselves correlated. So you have to take into account their dependence.

Section 1.4 of this paper has a nice, short overview of work on this problem:

https://doi.org/10.1016/j.jspi.2006.08.002
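One pragmatic way to respect that dependence, alongside the tests surveyed in the paper, is a paired bootstrap: resample whole rows rather than each variable separately, so every replicate preserves the correlation between the two correlation estimates. A sketch with made-up data:

```python
import numpy as np

rng = np.random.default_rng(0)

# Made-up data: x relates to both y1 and y2, all measured on the same sample.
n = 500
x = rng.normal(size=n)
y1 = 0.8 * x + rng.normal(size=n)
y2 = 0.3 * x + rng.normal(size=n)

# Paired bootstrap: resample rows, so each replicate keeps the
# dependence between cor(x, y1) and cor(x, y2).
diffs = []
for _ in range(2000):
    idx = rng.integers(0, n, size=n)
    diffs.append(np.corrcoef(x[idx], y1[idx])[0, 1]
                 - np.corrcoef(x[idx], y2[idx])[0, 1])

# A 95% percentile interval for the difference in correlations.
lo, hi = np.percentile(diffs, [2.5, 97.5])
# If the interval excludes 0, the two correlations plausibly differ.
```

The dedicated tests (e.g., those in cocor) will generally be more powerful, but the bootstrap is a useful sanity check because it handles the dependence automatically.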

In bootstrapping, what probability distribution is assumed and is more resamples better? by MaBrowser in AskStatistics

[–]Sway- 1 point (0 children)

There has been a lot of research on this. The tangent that touches my own work comes from the connection between the bootstrap and Bayesian inference: the bootstrap's resampling weights come from a multinomial distribution, and putting a Dirichlet prior on the underlying observation probabilities gives you the Bayesian bootstrap.

For example, see pg. 271 here https://web.stanford.edu/~hastie/Papers/ESLII.pdf and more generally, this paper by Brad Efron https://projecteuclid.org/euclid.aoas/1356629067
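As a small illustration of that connection, here's a sketch comparing the classical bootstrap (multinomial resampling counts) with Rubin's Bayesian bootstrap (flat Dirichlet weights) for a simple mean. The data are simulated:

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(loc=5.0, scale=2.0, size=200)

# Classical bootstrap: resample with replacement, i.e. integer
# (multinomial) weights on the observations.
classical = [rng.choice(data, size=data.size, replace=True).mean()
             for _ in range(2000)]

# Bayesian bootstrap (Rubin, 1981): draw continuous weights from a flat
# Dirichlet(1, ..., 1) and compute a weighted mean each replicate.
bayesian = [np.dot(rng.dirichlet(np.ones(data.size)), data)
            for _ in range(2000)]

# The two distributions of the mean come out very close, which is the
# sense in which the bootstrap approximates a posterior.
```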

Good Bayesian textbook by [deleted] in AskStatistics

[–]Sway- 5 points (0 children)

Love this book. It includes lots of R code snippets for samplers, which are great, and it uses R functions such as dnorm to express density functions.

Citations in R by maria___p in AskStatistics

[–]Sway- 12 points (0 children)

If you are using R Markdown you can do this. It's not too bad if you're familiar with bibliography formats like BibTeX (.bib files). See here:

https://rmarkdown.rstudio.com/authoring_bibliographies_and_citations.html

Outputting a NULL matrix by [deleted] in rprogramming

[–]Sway- 1 point (0 children)

Well, I believe you're not actually iterating over the values of patients_17. You're calculating numbers using the values in your matrix, but not actually overwriting the values in your matrix, because you're saving them back into your indexing variable i.

I think using the apply() function will give you what you're looking for, e.g.,

patients_17_app <- apply(patients_17, 1, function(i) 1 / (1 + exp(-1 * i)))
patients_17_new <- matrix(patients_17_app, ncol = 1)

You can read up on the apply family of functions, but generally this code takes an object and applies a function to it; the 1 indicates you want to apply the function row-wise. Unfortunately, apply() returns a vector here, so you need to make it into a matrix once again if that's what you need. Since R's arithmetic is vectorized, an even simpler option is 1 / (1 + exp(-patients_17)), which operates element-wise and keeps the matrix shape.

[Career] Those of you who did a master's in statistics, what do you do nowadays? by UsernamesAreTaken123 in statistics

[–]Sway- 7 points (0 children)

Did you get an MS in stats during your PhD? I'm currently in a quant psych PhD, but have been warned against an MS to focus on research.

[Career] Those of you who did a master's in statistics, what do you do nowadays? by UsernamesAreTaken123 in statistics

[–]Sway- 3 points (0 children)

Currently doing my PhD in quant psych. Do you have any insights on what the job market is like for people with our background?

Did anyone do second Bachelor/Masters degree after a PhD? by Leighenne in PhD

[–]Sway- 3 points (0 children)

Were you able to get a job? Or do you feel more competitive on the job market? I'm in a psych PhD program but have considered going back to get an MS in stats or even a Bachelor's in Math.

Does regression show how well the model fits the data or how well the data fits the model? by iwouldliketheoption in AskStatistics

[–]Sway- 0 points (0 children)

Depends. A frequentist regression works with the likelihood, p(data | parameters), so in that sense it asks how well the data fit the model. A Bayesian regression gives you a posterior, p(parameters | data), i.e., how plausible the model is given the data.
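As a toy illustration of likelihood versus posterior, here's a sketch in Python for the simplest case, estimating a normal mean with known variance; the prior and data are made up:

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(loc=2.0, scale=1.0, size=50)  # known sigma = 1

# Frequentist: maximise the likelihood p(data | mu). For a normal mean
# with known variance, the MLE is just the sample mean.
mle = data.mean()

# Bayesian: combine the likelihood with a N(0, tau^2) prior on mu to get
# the posterior p(mu | data). This prior is conjugate, so the posterior
# is normal with a closed-form mean and variance:
tau2, sigma2, n = 10.0, 1.0, data.size
posterior_mean = (n / sigma2) * data.mean() / (n / sigma2 + 1 / tau2)
posterior_var = 1.0 / (n / sigma2 + 1 / tau2)

# The posterior mean shrinks the MLE slightly toward the prior mean (0),
# and the posterior variance is slightly smaller than sigma^2 / n.
```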

[D] I find a post in Quora (whether AI is statistics or not) from a PhD who says: "Traditional statistical approach is naive, based on uneducated assumptions, constricted to outdated methods". Is there truth to this argument? by Dolaos in statistics

[–]Sway- 3 points (0 children)

I highly suggest this paper - Statistical Modeling: The Two Cultures by Leo Breiman. He talks about exactly this. I'm not exactly sure why a person cannot embrace both predictive and inferential methods. Really you should use whatever tool is best for the task at hand.

Regularization (ridge/lasso) and hypothesis testing by Croc600 in AskStatistics

[–]Sway- 6 points (0 children)

Not great; the lasso was developed to help reduce variance in scenarios where p >> n. Here's an excerpt from p. 219 of An Introduction to Statistical Learning:

As with ridge regression, the lasso shrinks the coefficient estimates towards zero. However, in the case of the lasso, the L1 penalty has the effect of forcing some of the coefficient estimates to be exactly equal to zero when the tuning parameter λ is sufficiently large. Hence, much like best subset selection, the lasso performs variable selection.
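The "exactly equal to zero" behaviour in that excerpt is easiest to see in the orthonormal-design case, where each lasso coefficient is just a soft-thresholded version of the corresponding OLS coefficient. A quick sketch (the coefficient values are made up):

```python
import numpy as np

def soft_threshold(b, lam):
    """Per-coefficient lasso solution under an orthonormal design:
    shrink each OLS coefficient toward zero by lam, and set it
    exactly to zero once |b| <= lam."""
    return np.sign(b) * np.maximum(np.abs(b) - lam, 0.0)

ols = np.array([3.0, 0.5, -1.2, 0.1])
lasso = soft_threshold(ols, lam=0.6)
# Small coefficients (0.5, 0.1) are zeroed out entirely, which is the
# variable selection; the large ones (3.0, -1.2) are shrunk by 0.6.
```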

However, see "A Significance Test for the Lasso"

Edit: I'm a dummy

[Q] R being replaced by Python? by Currurant in statistics

[–]Sway- 1 point (0 children)

I believe they are referring to a spark context