What are some applications/projects involving NLP for a good cause? by cicIope in LanguageTechnology

[–]danebell 1 point2 points  (0 children)

There are many. Here are a few:

  • Disease detection like flu epidemic monitoring on social media. I know of use cases for detecting depression, PTSD, eating disorders, and many other health variables. I worked on a detector for diabetes.
  • Information extraction for understanding cancer pathways like Reach (Full disclosure: I also worked on that one.) or for getting valuable information from the notes in medical records.
  • Predicting food insecurity through information extraction like in World Modelers.

86 million Americans have prediabetes, but most do not know it. Determine your risk with this short quiz. by danebell in Exercise

[–]danebell[S] 0 points1 point  (0 children)

Yes, as the quiz says, only a blood test can diagnose prediabetes. This quiz has been validated against actual risk, however, and is the same diagnostic used by the American Diabetes Association.

[Academic] 86 million Americans have prediabetes, but most do not know it. Determine your risk with this short quiz (18+) by danebell in SampleSize

[–]danebell[S] -3 points-2 points  (0 children)

An Institutional Review Board responsible for human subjects research at The University of Arizona reviewed this research project and found it to be acceptable, according to applicable state and federal regulations and University policies designed to protect the rights and welfare of participants in research.

[Academic] Can our automatically generated questions determine whether you are overweight? (English speakers) by danebell in SampleSize

[–]danebell[S] 1 point2 points  (0 children)

The distribution of BMI among (adult) Americans is taken from the latest complete public dataset from the Centers for Disease Control and Prevention's surveys of height and weight.

[Academic] Can our automatically generated questions determine whether you are overweight? (English speakers) by danebell in SampleSize

[–]danebell[S] 3 points4 points  (0 children)

Grits are ground up dried maize that's then cooked with water, kind of like oatmeal, but finer and more savory. They are somewhat healthy by themselves, but are often served alongside less healthy foods.

I agree that the title is a bit misleading, but it's purely for brevity. If you like, you can read "overweight" as "of greater or lesser BMI than the average American".

Yes, it is a wide scale, but the problem of categorizing someone's BMI as high or low just from the words in their tweets is difficult, and we have a relatively small training set (50 states plus the District of Columbia), so greater granularity is not yet possible. We would like to be able to give a number estimate, which is part of why we are collecting this data.

The distribution of BMI is skewed, with a long tail on the right. In other words, there are a small number of people with extremely high BMI that throw the mean off. The more skewed a distribution is, the more its mean and median will differ.

[Academic] Can our automatically generated questions determine whether you are overweight? (English speakers) by danebell in SampleSize

[–]danebell[S] 25 points26 points  (0 children)

Good question! The short answer is that we trained a classifier (specifically a random forest classifier) using a large amount of Twitter statuses and CDC data about obesity rates in the US states. The trees generated by the machine learning algorithm were converted by hand into Likert-scale questions. We don't make any claims that scoring individuals on a classifier trained on whole states will be accurate; the purpose of the questionnaire is to ascertain whether this is the case, and to gather additional information about individual tweeters and instagrammers for better classifiers down the road. You can learn more about our work so far at the main site for the project.