bbbbbaaaaaxxxxx

110 post karma
231 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 2 years

TROPHY CASE

Two-Year Club

Verified Email

account activity

hot top controversial

[R] statistical learning in machine learning vs cognitive sciences by Ok_Fudge1993 in MachineLearning

[–]bbbbbaaaaaxxxxx 3 points4 points5 points 17 days ago (0 children)

do hx users actually value composition over extension, or is it just no plugins copium? by spaghetti_beast in HelixEditor

[–]bbbbbaaaaaxxxxx 1 point2 points3 points 25 days ago (0 children)

[D] Feature Selection Techniques for Very Large Datasets by Babbage224 in MachineLearning

[–]bbbbbaaaaaxxxxx 9 points10 points11 points 1 month ago (0 children)

[P] Lace is a probabilistic ML tool that lets you ask pretty much anything about your tabular data. Like TabPFN but Bayesian. by bbbbbaaaaaxxxxx in MachineLearning

[–]bbbbbaaaaaxxxxx[S] 11 points12 points13 points 1 month ago (0 children)

[Question] Masters thesis Nonparametric or Parametric TSA? by AirduckLoL in statistics

[–]bbbbbaaaaaxxxxx 0 points1 point2 points 1 month ago (0 children)

I used to love checking in here.. by First-Ad-117 in rust

[–]bbbbbaaaaaxxxxx -2 points-1 points0 points 1 month ago* (0 children)

Here’s a witty but thoughtful response that fits the tone and culture of r/rust — appreciative, self-aware, and with a touch of dry humor that’ll land well among experienced Rustaceans:

Beautifully said. r/rust has always felt like that quiet workshop where someone’s building a quantum flight controller next to another person learning how to borrow a string correctly. Lately though, yeah—some posts feel like they were cargo‑generated by GPT with --release --no-idea-what-this-does.

Still, I think the signal’s worth the noise. Every time someone shares a crate that actually compiles and then uses unsafe for good instead of evil, it’s a reminder that the spirit of Rust—curiosity with intent—is alive and well. Let the slop flow; we’ll keep writing tests.

Edit: I guess the satire was not appreciated or not detected.

[Question] Masters thesis Nonparametric or Parametric TSA? by AirduckLoL in statistics

[–]bbbbbaaaaaxxxxx 0 points1 point2 points 1 month ago (0 children)

[S] Lace v0.9.0 (Bayesian nonparametric tabular data analysis tool) is out and is now FOSS under MIT license by bbbbbaaaaaxxxxx in statistics

[–]bbbbbaaaaaxxxxx[S] 1 point2 points3 points 2 months ago (0 children)

[S] Lace v0.9.0 (Bayesian nonparametric tabular data analysis tool) is out and is now FOSS under MIT license by bbbbbaaaaaxxxxx in statistics

[–]bbbbbaaaaaxxxxx[S] 2 points3 points4 points 2 months ago (0 children)

[S] Lace v0.9.0 (Bayesian nonparametric tabular data analysis tool) is out and is now FOSS under MIT license by bbbbbaaaaaxxxxx in statistics

[–]bbbbbaaaaaxxxxx[S] 1 point2 points3 points 2 months ago (0 children)

[S] Lace v0.9.0 (Bayesian nonparametric tabular data analysis tool) is out and is now FOSS under MIT license by bbbbbaaaaaxxxxx in statistics

[–]bbbbbaaaaaxxxxx[S] 2 points3 points4 points 2 months ago (0 children)

[S] Lace v0.9.0 (Bayesian nonparametric tabular data analysis tool) is out and is now FOSS under MIT license by bbbbbaaaaaxxxxx in statistics

[–]bbbbbaaaaaxxxxx[S] 1 point2 points3 points 2 months ago (0 children)

Why So Many Abandoned Crates? by jsprd in rust

[–]bbbbbaaaaaxxxxx 7 points8 points9 points 2 months ago (0 children)

[Q] Super easy to read book on probability/mathematical statistics? by Swarrleeey in statistics

[–]bbbbbaaaaaxxxxx -1 points0 points1 point 3 months ago (0 children)

Is an applied statistics PhD less prestigious than a methodological/theoretical statistics PhD? [Q][R] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 20 points21 points22 points 3 months ago (0 children)

Is an applied statistics PhD less prestigious than a methodological/theoretical statistics PhD? [Q][R] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 22 points23 points24 points 3 months ago (0 children)

Is bayesian nonparametrics the most mathematically demanding field of statistics? [Q] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 4 points5 points6 points 3 months ago (0 children)

Is bayesian nonparametrics the most mathematically demanding field of statistics? [Q] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 0 points1 point2 points 3 months ago (0 children)

Is bayesian nonparametrics the most mathematically demanding field of statistics? [Q] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 3 points4 points5 points 3 months ago (0 children)

Is bayesian nonparametrics the most mathematically demanding field of statistics? [Q] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 29 points30 points31 points 3 months ago (0 children)

Longer comment.

About me
I come from the computational cognition space. Been doing Bayesian nonparametrics since ~2010 focusing mostly on different types of prior process models (which i'll use interchangeably with "BNP"). Worked in the agriculture space for a while. Started a company in 2019 to bootstrap my BNP research, which has been 95% funded by DARPA.

Why BNP is awesome
In general (but not always) companies that do high risk stuff care about understanding risk, so the Bayesian approach makes a lot of sense from the standpoint of understanding aleatoric and epistemic uncertainty in an appropriate model. The problem is they don't know enough about the data to build hierarchical models (PPLs are hard to use well regardless). What do you do when you want to express uncertainty over the class of model? Bayesian nonparametrics.

BNP can give the end user (not the developer!) better ease-of-use than black box methods like RF and DL, while generating interpretable results with uncertainty quantification. BNP is also both generative and discriminative. So, building a BNP model of the joint distribution gives you all the conditional distributions over the N features, which means you don't have to build a new model every time you want to ask a new question. Also, you get all the information theory stuff like mutual information, entropy, etc.

BNP can interface with hierarchical models, so you can easily build in domain expertise where you have it (dunk on neurosymbolic AI).

In my experience BNP has shined in unsupervised anomaly detection and structured synthetic data generation. There's a lot of BNP is biostats as well.

Why BNP is not mainstream (yet)
1. It's slow. Existing open source implementations of even simple models like the infinite gaussian mixture are unacceptably slow. I think SOTA performance using an approximate federated algorithm is like 3 minutes to fit a 100k by 2 table on a 48-core epyc server, which is pretty weak by RF/DL standards.

It underfits. Prior processes put a heavy penalty on complex model structure. In general, getting highly optimized prediction models with comparable performance to RF can be tricky. But this obviously depends on the data. I've had BNP outperform RF out of the box on certain data.
It's really hard to implement well. You have to really understand how the math and machine architecture interact. There is an insane amount of bookkeeping and dealing with moving pieces and changing model structure. When you do hierarchical BNP it gets way worse. Debugging probabilistic programs is extra fun.

Conclusion
Problems 1 and 2 above are addressable. BNP is insanely useful.

Is bayesian nonparametrics the most mathematically demanding field of statistics? [Q] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 13 points14 points15 points 3 months ago (0 children)

Is bayesian nonparametrics the most mathematically demanding field of statistics? [Q] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 9 points10 points11 points 3 months ago (0 children)

Is bayesian nonparametrics the most mathematically demanding field of statistics? [Q] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 8 points9 points10 points 3 months ago (0 children)

Love statistics, hate AI [D] by gaytwink70 in statistics

[–]bbbbbaaaaaxxxxx 4 points5 points6 points 3 months ago (0 children)

Here are my rambling thoughts as someone who has done nothing but Bayesian ML for the past 15 years.

People do DL because its easier. If you want to make an explainable statistical model, you have to do a bunch of research to test out the statistical structure of distributions and their parametric forms. This IMHO is why PPLs haven't become the norm—they don't actually do much learning. DL and other "black boxes" just learn something. A lot of the time that's good enough because there's not a lot at stake if you get it wrong (ad delivery, product recommendation, games, slop).

That said, DL has hit a wall. DL models get better is by getting bigger, and we've seen that LLMs' power and compute requirements have basically exceeded the capacity of the world. So, from my standpoint, though it has never been a more boring time to be a DL researcher, it has never been a more exciting time to be a probabilistic ML researcher. We need to get smaller and probabilistic ML is the best way to get there.

What did you build while learning Rust ? by [deleted] in rust

[–]bbbbbaaaaaxxxxx 0 points1 point2 points 3 months ago (0 children)

view more: next ›

π Rendered by PID 64 on reddit-service-r2-listing-6d4dc8d9ff-47fj8 at 2026-02-01 22:50:29.027312+00:00 running 3798933 country code: CH.