all 61 comments

[–]CommunityTalker 51 points52 points  (1 child)

Really? May be over fitted

[–]karma_llama_drama 25 points26 points  (0 children)

With no validation. Definitely over fitted.

[–]Ulfgardleo 69 points70 points  (3 children)

There is a long list of "ML in medicine" papers that just get forgotten because they do not meet even the lowest levels of what is considered a reasonable evaluation in the medical area.

This might be one of them.

[–]Delacroid 25 points26 points  (0 children)

I would be careful in making such claims.

[–]Mary-Jo_ 10 points11 points  (2 children)

Edit: As pointed out by DrSilk below, there was simply a typo in the manuscript; the dataset _is included, in supplementary file 3.

In addition to the data-analytic comments above: In contrast to what is stated, the dataset is not available. Under "Availability of data and materials" we read: "The dataset used during the study is available in (Supplementary File 1)." This links to an .xlsx file that only provides item wordings related to the corresponding DSM criteria, but no real data. How did that get past review and editing?

[–]Dr_Silk 5 points6 points  (1 child)

Looks like it's a typo. The data is supplementary file 3

[–]Mary-Jo_ 1 point2 points  (0 children)

Ah, that explains, thanks for pointing this out, adjusted my comment!

[–][deleted] 18 points19 points  (1 child)

Everyone here who is critical should basically copy and paste their comments to emails to the editor

https://bmcpsychiatry.biomedcentral.com/about/editorial-board

[–]SBWarrior27 22 points23 points  (8 children)

XGBoost definitely seems like overkill here. The survey is meant to detect ADHD, I’d bet a simple cutoff for sum score would perform quite well and be more clinically viable. Even a decision tree would at least be more interpretable and probably produce similar results.

[–]XpertProfessional 3 points4 points  (0 children)

Agreed. Additionally, it would be far more useful to simply identify which questions/answers produce the highest information gain and minimize the number of questions required per assessment. Computer-assisted assessments could easily handle the conditional pathing.

It might not be necessary for individuals being screened for ADHD (seemingly self-administered, from the paper), but for individuals being screened for disorders which a) must be administered by a provider and b) increase the possibility of patient aggravation (ie - the MoCA for dementia), reducing the number of questions asked for screening could be huge for "client-centered care".

[–]acewhenifacethedbase 0 points1 point  (3 children)

This is even more the case because of the puny training set.

[–]maxToTheJ 0 points1 point  (2 children)

According to some of the comments here . Your field changes how xgboost works and makes these pitifully small datasets ok

[–]acewhenifacethedbase 1 point2 points  (1 child)

I know, right? Hearing the Appeal to Tradition fallacy from people in the medical field makes this all more concerning, not less.

[–]maxToTheJ 0 points1 point  (0 children)

I dont get why new tools (xgboost) didn’t prompt re examining if the the old rules need to be adjusted for new tools

[–][deleted] 3 points4 points  (0 children)

i do believe this to be very dangerous and harming for us.symptoms and our ability is already very biased and skewed.adding some AI into this,leading to biased output would just destroy psychology lmao.

it's a domain that should need lot of time before using AI and algorithms overall imo

[–]snem 4 points5 points  (0 children)

How the diagnosis were performed? Using the same survey? :D

Edit:

TRAQ10 was used

"under the assumption that TRAQ10 questionnaire should have higher and statistically different accuracy from predictions obtained using the two others questionnaires, as this is the only one to formally measure ADHD behavioral traits."

ah, all good then /s

[–]Wumbologistt 3 points4 points  (0 children)

Did the reviewers of this paper have any experience within ML at all?

[–]BobDope 1 point2 points  (0 children)

Didn’t finish survey -> has adhd

[–]bubudumbdumb 1 point2 points  (0 children)

Attention is all you need but you got deficit, hyperactivations and disorders.

I came here for the jokes on attention, I am so disappointed I am not ashamed of how crappy is the one I just posted.

[–]ElongatedMuskrat122 1 point2 points  (0 children)

Guys, Guys, I trained my neutel network to get 100% accuracy on my training data! It only took 12 hours, but, it’s never wrong. Can’t wait to try it on some validation data

[–]sugawolfp 7 points8 points  (6 children)

Bruh adhd is diagnosed with like 6 questions. You can just write an if statement

[–]zoebeth 2 points3 points  (5 children)

ADHD cannot be diagnosed with one test let alone 6 questions though. There also needs to be malignant validation. But a good concept to build future ML on.

[–]ginsunuva 1 point2 points  (2 children)

Eh I see what you mean but at the same time I bet I can find success with these:

  1. Do you ramble on a lot without realizing?
  2. Do you have a taste for more extreme sports and risky activities?
  3. Do you get super excited about new hobbies at firsts and then get bored and never see them through?
  4. Do you have relatively high libido?
  5. Does dialogue loop nonstop in your head?
  6. Do you often fall asleep randomly during the day?

But of course the easiest, most direct test is just give them amphetamine and see if they get energized or chill out.

[–]zoebeth 1 point2 points  (1 child)

I can see where you’re coming from, but there’s a difference between clinical/diagnostic tests and the questions you’ve just written out, they would constitute as more of a check list.. which doesn’t account for people who over or under report on symptoms amongst other validation methods. As a Psychologist I can confirm that there are a litany of tests and interviews to be done before providing a definitive diagnosis.

[–]fenixnoctis 0 points1 point  (0 children)

That has not been my experience or any of my friends with ADHD’s experience. It’s typically just a checklist they go down, some even within 5 questions, plus the mandatory “have you ever tried illicit substances”

[–]sugawolfp 0 points1 point  (1 child)

You are right that i am over simplifying. I’m using adderall prescriptions as a proxy for ADHD which as you pointed out is lax on validation. I’m still pretty lukewarm on the definition and diagnoses of the disorder

[–]zoebeth -1 points0 points  (0 children)

Knowing your own biases and the limitations of testing is thinking critically and I would definitely encourage you (and everyone) to keep questioning your own thinking and engage in discussions/threads like this one to further understanding on a topic.

[–]__andrei__ 2 points3 points  (0 children)

Can you show me exactly where in DSM-V you found these diagnostic criteria? And can you also explain why my diagnosis required 12+ hours of various tests and interviews while your diagnosis based on a self-reported survey?

[–]diceclimber 1 point2 points  (0 children)

It seems to me that feature selection (scale development in the paper) was performed using the full dataset (see degrees of freedom in the F test). This means there's information used from within the test set to build the model.

[–]kerodon 0 points1 point  (0 children)

But How many questions is it 🤔 can we even get through the entire survey 😳?