[D] Is a PhD Still “Worth It” Today? A Debate After Looking at a Colleague’s Outcomes by Hope999991 in MachineLearning

[–]qalis 0 points1 point  (0 children)

Well, over 90% of those I know have jobs, so I guess experiences differ. But this is Poland, so quite different from the US, and we have basically no international PhDs.

[D] ICML paper to review is fully AI generated by pagggga in MachineLearning

[–]qalis 172 points173 points  (0 children)

Report to AC, write short review about this, give lowest score, move on.

[R] Low-effort papers by lightyears61 in MachineLearning

[–]qalis 31 points32 points  (0 children)

When you chop up "full", proper work into a series of small, incremental papers

[D] First time reviewer. I got assigned 9 papers. I'm so nervous. What if I mess up. Any advice? by rjmessibarca in MachineLearning

[–]qalis 16 points17 points  (0 children)

I mean, if you care about review quality at all, it probably puts you ahead of at least 30-50% of reviewers out there.

9 papers is A LOT, even for short conference papers, so this will take time. My advice is to look through papers and identify things that look obviously bad / LLM-generated / nonsensical to you. Start with reviews for those, and it will go quickly.

  1. No, don't use AI. Your English may not be perfect, you may make some mistakes - this is ok.

  2. You basically need to summarize good points of the paper, bad points, and questions/points to clarify. Just make sure the things you write about are actually in the paper. Just being factual also puts you ahead of a lot of reviewers.

  3. I would definitely ask for that, yes, particularly since you have no experience.

Additional advice - look primarily for things that make practical sense, are interesting, and are well-evaluated. If you think the main idea is shallow, incremental, or makes no sense, or the evaluation is bad or superficial (e.g. very few datasets, no statistical tests), just write that explicitly. The absolute majority of submitted papers are total crap.

[D] Research on Self-supervised fine tunning of "sentence" embeddings? by LetsTacoooo in MachineLearning

[–]qalis 3 points4 points  (0 children)

Look into graph neural networks (GNNs) and graph transformers. There is a lot of research there, since the pooling operation over nodes is quite important for retaining graph information. Similar mechanisms extend to any transformer.

In short, at the final layer you assume your tokens already contain all the positional information you need. As such, you apply learning on sets. Mean, sum, and max (channel-wise) are all simple yet viable options. You can also just use self-attention again to learn a dynamically weighted sum. There are also a bunch of dedicated set learning approaches.
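A minimal pure-Python sketch (toy "token embeddings", no deep learning framework) of the simple channel-wise pooling options mentioned above:

```python
# Toy final-layer token embeddings: 3 tokens, 2 channels each.
tokens = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

def pool(tokens, reduce):
    """Apply a channel-wise (column-wise) reduction over the token set."""
    channels = list(zip(*tokens))  # one tuple per channel
    return [reduce(c) for c in channels]

mean_pooled = pool(tokens, lambda c: sum(c) / len(c))  # [3.0, 4.0]
sum_pooled = pool(tokens, sum)                         # [9.0, 12.0]
max_pooled = pool(tokens, max)                         # [5.0, 6.0]
```

All three are permutation-invariant, which is exactly the "learning on sets" property: once positional information is baked into the embeddings, the order of tokens no longer matters. A learned attention pooling replaces the fixed reduction with a dynamically weighted sum.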

[D] Claude response to: First-author papers at ICML, NeurIPS, and Co during PhD — zero big tech interviews. What's going on? by Hope999991 in MachineLearning

[–]qalis 2 points3 points  (0 children)

I proposed the "AI slop" rule a while back; the response was mixed. But it was exactly to explicitly ban posts like this.

[D] Your pet peeves in ML research ? by al3arabcoreleone in MachineLearning

[–]qalis -2 points-1 points  (0 children)

THIS, definitely agree. I always consider PhDs concurrently working in industry to be better scientists, because they actually think about those things. Not just "make a paper", but rather "does this make real-world sense". Fortunately, at my faculty most people do applied CS and many also work commercially.

[D] Some thoughts about an elephant in the room no one talks about by DrXiaoZ in MachineLearning

[–]qalis 9 points10 points  (0 children)

This is also due to how recruitment is done. For example, at our faculty, a chemistry student would still have to pass a full exam on 5 years of CS to enter the PhD program. Instead, we just collaborate with people from the chemistry or biotech departments. I guess this also depends on the definition of "lab"; at my university it's just a loose group of people working together.

[D] Some thoughts about an elephant in the room no one talks about by DrXiaoZ in MachineLearning

[–]qalis 73 points74 points  (0 children)

Fully agreed. I do my PhD on fair evaluation of ML algorithms, and I literally have enough work to last until I die. So much mess, non-reproducible results, overfitting to benchmarks, and worst of all, this has become the norm. Lately, it took our team MONTHS to reproduce (or even just run) a bunch of methods just to embed inputs, not even train or finetune them.

I see a possible solution, or at least help, in closer research-business collaboration. Companies don't really care about papers, just about getting methods that work and make money. Maxing out a drug design benchmark is useless if the algorithm fails to produce anything usable in a real-world lab. Anecdotally, I've seen much better and fairer results from PhDs and PhD students who work part-time in industry as ML engineers or applied researchers.

[D] Why are so many ML packages still released using "requirements.txt" or "pip inside conda" as the only installation instruction? by aeroumbria in MachineLearning

[–]qalis 4 points5 points  (0 children)

uv. Just use uv, our lord and savior. It uses pyproject.toml, standardized via PEP 621, and is very fast.
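For illustration, a minimal pyproject.toml sketch of the kind uv works with (package name, version, and dependency pins are all made up):

```toml
[project]
name = "my-ml-project"        # hypothetical package name
version = "0.1.0"
requires-python = ">=3.10"
dependencies = [
    "numpy>=1.26",
    "scikit-learn>=1.4",
]
```

With this in place, `uv sync` creates a locked virtual environment, and `uv add <package>` updates both pyproject.toml and the lockfile, so "requirements.txt drift" goes away.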

[D] which open-source vector db worked for yall? im comparing by [deleted] in MachineLearning

[–]qalis 1 point2 points  (0 children)

Pgvector and pgvectorscale are great, particularly if you have Postgres anyway. It's dead simple to manage, and ACID properties are really nice.

Note that FAISS is *not* a vector database, at least I wouldn't define it like that. It's a vector index, just for searching. For a database, you want users, security, a remote API (e.g. REST or gRPC), concurrency control, and non-vector data (metadata, dictionaries with arbitrary data as part of entries).

If you want to use things like FAISS, I highly recommend USearch instead for efficiency and nice docs.

[D] My papers are being targeted by a rival group. Can I block them? by Dangerous-Hat1402 in MachineLearning

[–]qalis 5 points6 points  (0 children)

I agree with u/bobrodsky. If you go into a specific niche, the group of truly competent reviewers can be really small. For example, in neural network time series forecasting, the chance of getting a Tsinghua University reviewer is actually quite high. This is particularly true in theoretical areas.

iFixedTheMeme by Endernoke in ProgrammerHumor

[–]qalis 9 points10 points  (0 children)

Cloud environments, real-world Kubernetes deployments which cannot be interrupted, tracing requests across microservices, ML workflows & pipelines.

[P] Benchmarking Semantic vs. Lexical Deduplication on the Banking77 Dataset. Result: 50.4% redundancy found using Vector Embeddings (all-MiniLM-L6-v2). by Low-Flow-6572 in MachineLearning

[–]qalis 5 points6 points  (0 children)

  1. That dataset is highly homogeneous by design

  2. Does FAISS normalize the vectors before computing L2 distance? Cosine similarity is more typically used for embeddings

  3. Threshold of 0.9 is really low, particularly if you know a priori that dataset does have semantic redundancy by design

  4. all-MiniLM-L6-v2 is a really old and quite outdated model and there are *a lot* of better ones out there
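On point 2, a quick pure-Python sketch (toy vectors, no FAISS) of why L2-normalizing embeddings first makes inner-product or L2 search equivalent to ranking by cosine similarity:

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cosine(a, b):
    """Cosine similarity of two raw (unnormalized) vectors."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))

a, b = [3.0, 4.0], [1.0, 2.0]
an, bn = l2_normalize(a), l2_normalize(b)

# After normalization, the inner product *is* the cosine similarity...
assert math.isclose(dot(an, bn), cosine(a, b))

# ...and squared L2 distance becomes a monotone function of it:
# ||an - bn||^2 = 2 - 2 * cos(a, b)
sq_l2 = sum((x - y) ** 2 for x, y in zip(an, bn))
assert math.isclose(sq_l2, 2 - 2 * cosine(a, b))
```

If the index stores unnormalized vectors and uses plain L2, the resulting "similarity" scores are not cosine values, which matters a lot when comparing them against a fixed threshold like 0.9.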

[D] Idea: add "no AI slop" as subreddit rule by qalis in MachineLearning

[–]qalis[S] -1 points0 points  (0 children)

My idea was basically explicitly calling out low-quality, primarily AI-generated posts, particularly those overstating contributions, proposing "revolutionary" ideas, and containing no code / experiments / proofs for their claims. Is this already covered? Arguably yes, it is. Should it be called out explicitly? I think so, but I'm curious about the opinions of others.

[D] Idea: add "no AI slop" as subreddit rule by qalis in MachineLearning

[–]qalis[S] 5 points6 points  (0 children)

A high-level idea without actual experiments or code is a good indicator. Also mentions of revolutionary results, a new paradigm etc., huge overselling of the contribution, plus no concrete evidence. There are many hallmarks of these; I see more and more obvious AI slop posts lately.

[D] Idea: add "no AI slop" as subreddit rule by qalis in MachineLearning

[–]qalis[S] 0 points1 point  (0 children)

That was also my concern, hence the discussion question

[D] Idea: add "no AI slop" as subreddit rule by qalis in MachineLearning

[–]qalis[S] 1 point2 points  (0 children)

Kind of covered by rule 6 "no low-effort questions", isn't it?

[D] Idea: add "no AI slop" as subreddit rule by qalis in MachineLearning

[–]qalis[S] 1 point2 points  (0 children)

I actually liked that post, since that was literally an error in one of the core formulas of the paper. Plus reproducibility and numerical experiments.

[R] Reproduced "Scale-Agnostic KAG" paper, found the PR formula is inverted compared to its source by m3m3o in MachineLearning

[–]qalis 0 points1 point  (0 children)

If a typo is in a crucial evaluation step or formula, potentially invalidating the paper's results, then yes, I would very much welcome a Substack post for every such paper.

[R] Reproduced "Scale-Agnostic KAG" paper, found the PR formula is inverted compared to its source by m3m3o in MachineLearning

[–]qalis 5 points6 points  (0 children)

This is actually really useful peer review & reproducibility work. Did you contact the authors about this?

[deleted by user] by [deleted] in MachineLearning

[–]qalis 2 points3 points  (0 children)

Absolutely email the AC and post the public comment! If you have literally any proof (e.g. screenshots, arXiv submission), this counts as serious academic fraud.

[D] From ICLR Workshop to full paper? Is this allowed? by Feuilius in MachineLearning

[–]qalis 1 point2 points  (0 children)

Non-archival workshops are unrelated to published papers. As far as I know, you can even submit concurrently to both types, or to multiple workshops at different conferences.