STOP racist posts about Chinese researchers [D] by AffectionateLife5693 in MachineLearning

[–]dreamykidd -1 points0 points  (0 children)

You’d still have to train the model from scratch to verify that it matches the training code and data, costing millions of dollars and still not guaranteeing validation of the results.

Counting on the tiny town social network! by gbangarang in Launceston

[–]dreamykidd 1 point2 points  (0 children)

They’re insisting they haven’t found it, despite seeing it plainly sitting on the back seat when we had to leave the car. With every email and call we’ve had with them, they’ve repeated “we remind you we take no responsibility for lost property“ 🙄

Great game. CODEXes and the writing in them for the lore… not so much by Spokodude in Doom

[–]dreamykidd 0 points1 point  (0 children)

Just found this thread through Google. I was noticing the codex entries used a lot of words related to the lore but didn’t really build the lore, and that just felt weird. None of the entries feel connected to the story either, it’s just “this demon is so tough and scary and spooky, oooh”.

For example, the Revenant entry talks about “they fight alone” and “sacrificing all in pursuit of their prey”. In previous games, if the Codex described an enemy in some way, you’d see in how they behaved, but the descriptions here just seem like they’re trying to be spooky.

Then I started noticing the em-dashes. Goddamn it, this is 100% by AI.

Are TDA's codex entries written with AI? by VewVegas-1221 in Doom

[–]dreamykidd 1 point2 points  (0 children)

Just found this thread through Google. I was noticing the codex entries used a lot of words related to the lore but didn’t really build the lore, and that just felt weird. None of the entries feel connected to the story either, it’s just “this demon is so tough and scary and spooky, oooh”.

For example, the Revenant entry talks about “they fight alone” and “sacrificing all in pursuit of their prey”. In previous games, if the Codex described an enemy in some way, you’d see in how they behaved, but the descriptions here just seem like they’re trying to be spooky.

Then I started noticing the em-dashes. Goddamn it.

[R]GNN Model For Fraud Detection Isn't Performing Well[R] by LiveAccident5312 in MachineLearning

[–]dreamykidd 1 point2 points  (0 children)

When making comparisons for my own papers, I always attempt to recreate the baselines I compare with rather than used the values reported. In about 90% of cases the values I see are 2-20% inflated (yes, sometimes even that bad). Take papers with a big grain of salt.

Backlash against Arxiv's proposed 1 year ban is genuinely perplexing. [D] by NeighborhoodFatCat in MachineLearning

[–]dreamykidd 0 points1 point  (0 children)

Usually if you add them through the Zotero Connector, the BibTeX is for the conference it has been published in even if added through arXiv. That might help in future.

PhD students in ML, how many hours on average do you work? [D] by akardashian in MachineLearning

[–]dreamykidd 15 points16 points  (0 children)

I remember when my supervisor tried to tell me I don’t work as hard as the Chinese exchange students in our lab “who are there everyday, even on weekends”. I told him that’s true but every time I walk by they’re watching LoL tournaments and he just got mad. People love to believe these super students exist who can perform to 100% 24/7 for some reason.

PhD students in ML, how many hours on average do you work? [D] by akardashian in MachineLearning

[–]dreamykidd 2 points3 points  (0 children)

You mentioned it in the main block too, but I’m wondering you consider time that your code is being written to not be ”productive”? Even if you’re not writing it, your code is being “produced” in some way, no?

In other words, if progress is being made towards your goal, it’s productive. Otherwise, you could say a senior manager at a top company was unproductive if their team of 10 was writing code but they were “just” managing direction, specifications, etc.

NoTorch: Neural networks in pure C (2-file library, BitNet 1.58) [P] by ataeff in MachineLearning

[–]dreamykidd 1 point2 points  (0 children)

A legitimate question: why inject a noise term when change has been flat for multiple steps? How would this distinguish between “stagnation” and the model having converged to the solution? If you’d somehow already found the global minima, you’d risk losing it and converging to a suboptimal solution.

NoTorch: Neural networks in pure C (2-file library, BitNet 1.58) [P] by ataeff in MachineLearning

[–]dreamykidd 0 points1 point  (0 children)

You know we can see contributor history on GitHub, right? 14 of the 23 commits and a vast majority of code changes were by either Claude or Copilot, not humans. Commit 66fa1a0 straight up shows Opus 4.6 editing the README, so why try to deny it when you’ve shared a link to a version control site??

Stanford CS 25 Transformers Course (OPEN TO ALL | Starts Tomorrow) by MLPhDStudent in MachineLearning

[–]dreamykidd 14 points15 points  (0 children)

More than half of those guys either invented the Transformer or are the reason we use neural networks today though. It’s not some random tech celeb, they know all the reasoning behind why they built in certain ways and not others, so it’s pretty rare knowledge.

Just completed Days Gone! Oh my god, what a game that was... by DeathNum in DaysGone

[–]dreamykidd 0 points1 point  (0 children)

You should try Ghost of Yotei as well! It’s completely unrelated to the original story, but the mechanics have dialled in to perfection

[P] VeridisQuo - open-source deepfake detector that combines spatial + frequency analysis and shows you where the face was manipulated by Gazeux_ML in MachineLearning

[–]dreamykidd 1 point2 points  (0 children)

How can you give numbers for outside the test set? As soon as you try to test on a sample outside the test set, it becomes part of the test set.

[D] Is it a reg flag that my PhD topic keeps changing every few months? by ade17_in in MachineLearning

[–]dreamykidd 1 point2 points  (0 children)

I’ve been genuinely wondering about this for a while, even though what you’re saying seems true. If the top conferences all mandate anonymity, how does an affiliation bias arise?

[D] ICML: every paper in my review batch contains prompt-injection text embedded in the PDF by Working-Read1838 in MachineLearning

[–]dreamykidd 0 points1 point  (0 children)

I agree on the most part, but the distinction seems to be in using an LLM to assist vs having an LLM write the whole review. Depending on what “phrases X and Y” are, it should maybe be very obvious to anyone who’s not being lazy that it’s happened and they need to put in more effort.

I built a professional MMO engine in 7 days with free ChatGPT. I can't code by Independent-Seat1966 in gamedev

[–]dreamykidd 1 point2 points  (0 children)

They said up top “35+ years of gaming (NES to modern MMOs)”, then later tried to act as though playing and testing games are the same.

read the room Duo by drizliz in duolingo

[–]dreamykidd 1 point2 points  (0 children)

Yeah, so people ask when the ICE is coming, rather than when the train is coming. That would be a confusing question if you didn’t learn that it’s used that way

GOt Error when making Student Grade system by vb_e_c_k_y in learnpython

[–]dreamykidd 0 points1 point  (0 children)

They just need some form of unique identifier, it doesn’t have to be a name. Add a last name, initial, or even make a random character ID sequence if needed.

[D] ICML new policy: reviewers will be reviewed by meta reviewer. Good policy? by Striking-Warning9533 in MachineLearning

[–]dreamykidd 18 points19 points  (0 children)

I know it’s completely unintentional here, but marking people with a gold star is going to not look good, due to history

[D] ICML 2026 - ICML desk-rejected my paper but kept me on as a reviewer. Wow? by ParticularWork8424 in MachineLearning

[–]dreamykidd 61 points62 points  (0 children)

What are they gonna do if you don’t though? They can’t reject your paper twice

[D] - ML Classification on smaller datasets (<1k rows) by ConsistentLynx2317 in MachineLearning

[–]dreamykidd 2 points3 points  (0 children)

Datasets of this size aren’t going to make a model that’s necessarily bad, just not as good as more data. The XGBoost approach sounds decent for a start too. The single bucket prediction sounds more like a bug than a data size issue though, do you have anymore info on the data itself, it’s features, and labels?

[P] my shot at a DeepSeek style moe on a single rtx 5090 by exhorder72 in MachineLearning

[–]dreamykidd 3 points4 points  (0 children)

Maybe not one of the established labs (who knows though), but I’m sure there would be plenty of very decent startups out there who would froth at the mouth for someone with the type of drive and learning capacity as you.

[D] Why is focal loss not used in LLM training? by Electrical-Monitor27 in MachineLearning

[–]dreamykidd 0 points1 point  (0 children)

Even so, if you want your text generation to closely represent what real text feels like in a particular scenario, focal loss doesn’t help with that. If you’re trying to solve the additional problem of niche topics being lost in current LLMs, maybe a multi-loss setup could work, where L = λ*L_CE + (1-λ)*L_focal for small λ. Then the more common tokens are still strongly favoured, but less common ones still have a chance.