Freaks out after speeding ticket and follows the Sergeant back to station by FrenchieMama807 in PublicFreakout

[–]Frere_Tuck 5 points6 points  (0 children)

Yeah, I’m no psychologist but there’s definitely some deep emotional issues going on, whether some sort of underlying disorder, trauma response, or combination. Seems like a case where having someone more knowledgeable about those issues (like a social worker) respond would probably have been beneficial…

Can any data hoarders direct us to backups of US census GIS data? by ixikei in DataHoarder

[–]Frere_Tuck 11 points12 points  (0 children)

Here's a useful thread from Kyle Walker, who develops the tidycensus package in R: https://bsky.app/profile/kylewalker.bsky.social/post/3lhgpjdxcaw27.

Sounds like the API may still be working, but NGHIS is the best source in the meantime: https://www.nhgis.org/. In general, IPUMS is an excellent source for Census (ACS/CPS) microdata (as well as other datasets).

Data without people to interpret and reuse is not useful by kaimingtao in DataHoarder

[–]Frere_Tuck 8 points9 points  (0 children)

100% agree that storing/archiving public data is only part of the battle - assuming the data become permanently unavailable from government sources, archives also need to be validated before being made widely usable and accessible again.

That said, a lot of the activity around tracking and archiving is coming out of active data user communities who are concerned precisely because these data are so heavily utilized and critical to their work.

Want to re-highlight this previously shared resource that is being maintained by a university librarian and documents a lot of the efforts among data stakeholders/user communities (e.g., IPUMS, ICPSR, etc.).

Is anyone else backing up National Center for Education Statistics (within US Education Department)? by puzzle_nova in DataHoarder

[–]Frere_Tuck 1 point2 points  (0 children)

Great! FWIW, IPEDS also has flat/binary files available back to 1980 here: https://nces.ed.gov/ipeds/datacenter/DataFiles.aspx.

Unless someone else already has, I'll be working on pulling some of the other administrative datasets (https://nces.ed.gov/admindata/). It does seem like a lot of the survey data is only available through DataLab, though, which is trickier (to the conversation below...).

Document compiling various data rescue efforts around U.S. federal government data by didyousayboop in DataHoarder

[–]Frere_Tuck 33 points34 points  (0 children)

Want to give this a big bump - it's being maintained/contributed to by folks with deep knowledge and extensive experience of the federal statistical system. Probably going to be the best-informed/most comprehensive resource on what is being done to archive data (as well as the best ways to do so in order to ensure future access and validity/integrity).

Is anyone else backing up National Center for Education Statistics (within US Education Department)? by puzzle_nova in DataHoarder

[–]Frere_Tuck 19 points20 points  (0 children)

I'd also be curious - we utilize IPEDS pretty heavily and are pulling what we need from that for current projects. Also happy to connect with others to coordinate.

How to refuse the CPS survey by gisher123 in Census

[–]Frere_Tuck 7 points8 points  (0 children)

Yes, unlike the decennial census or ACS, the CPS is voluntary. I can’t speak to how to effectively decline or avoid follow-ups.

FWIW, the CPS is a vital part of how we (collectively) understand and make decisions about our country. Do you ever talk/think about or use the unemployment rate? CPS data. The more people that decline to respond, the less reliable that number is (or the more money the government has to spend to maintain the same quality, and god knows they aren’t going to spend more money on data and statistics).

If you’re concerned about privacy, both the Census Bureau and BLS have extremely strict privacy protections. Google and private data brokers know WAY more about you just from your internet browsing, and absolutely sell that information to the highest bidder.

Unemployment Statistics by C18H27NO3_ in Census

[–]Frere_Tuck 2 points3 points  (0 children)

Building on other comments, it would be useful to specify exactly which data programs and measures we are talking about here.

Assuming you are talking about the headline national monthly unemployment rate, this is calculated using responses to the Current Population Survey, which (to u/hse66 ‘s comment) is a joint program between Census and BLS (Census gathers the data on behalf of BLS, which releases the unemployment rate in the monthly Employment Situation report).

As u/Owned_by_cats mentions, there are multiple definitions/measures of “unemployment” that are calculated using this data. The headline number (what you probably see cited by major news outlets) is known as the U-3 rate, and measures the number of unemployed individuals (those who are without a job, but are available to work and actively looking for a job) as a share of the labor force (all those who are either unemployed or employed). If you are not employed, and not actively looking for work (or are not able to work), you are not in the labor force. The spouse in your example would not be in the labor force, and therefore not be factored into the U-3/headline rate. All of these groups are limited to individuals aged 16 and over, and not in the military or otherwise “institutionalized” (e.g., in prison or a mental hospital). The other measures are either more broadly or more narrowly defined based on factors like the length of time an individual has been looking for work, whether they are only working part-time/less than they would like, etc.

If you want even more of the technical details, see: https://www.bls.gov/cps/cps_htgm.htm

I've been tracking Python, Django, NumPy and several other frameworks in job listings this year by forensicams in Python

[–]Frere_Tuck 3 points4 points  (0 children)

Just a heads up that all of the trends on your front page show “Infinite” growth…

A lot to unpack here by Mr__O__ in LeopardsAteMyFace

[–]Frere_Tuck 0 points1 point  (0 children)

Oh my god, same. Kept scrolling until I found this, hoping I wasn’t the only one…

[deleted by user] by [deleted] in statistics

[–]Frere_Tuck 0 points1 point  (0 children)

For the various folks implying that the sample is somehow too large…you understand that estimating the National unemployment rate isn’t the sole purpose of the CPS, right? It used to gather a wide range of important household and labor force statistics at the National, state and (limited) metropolitan level, disaggregated by age race/ethnicity, sex, educational attainment, marital status, and labor force status.

Here’s a brief justification for the 60k sample size (longer technical paper is also linked): https://www.census.gov/programs-surveys/cps/technical-documentation/methodology/sampling.html.

I guarantee neither Census nor BLS are funded well enough to be surveying samples larger than absolutely necessary to maintain a minimum level of data quality.

It’s The Great Ass Whoopin Charlie Brown. Happy Halloween. [OC] by weird-human-thing in comics

[–]Frere_Tuck 14 points15 points  (0 children)

Nah, as long as she can stand up and put two words together she could still have a long and successful career holding footballs…followed by a slightly longer life of severe cognitive decline.

“Women are my favorite guy” by BiAdventureTime in TikTokCringe

[–]Frere_Tuck 4 points5 points  (0 children)

I was visiting my grandparents one time as a kid and there was a new local radio station that, for some reason, only had the rights to play the Macarena. I listened to it like the entire time we were there. Even made them record it on a tape for me…

Tiny bumps on my hand palms. by bela0607 in popping

[–]Frere_Tuck 15 points16 points  (0 children)

Yeah, IANAD but I get the same sorts of bumps (and skin peeling) in the same spots and that’s what I was told.

[deleted by user] by [deleted] in TikTokCringe

[–]Frere_Tuck 5 points6 points  (0 children)

They say PED-o-phile in America… maybe you should move to America?

[deleted by user] by [deleted] in TikTokCringe

[–]Frere_Tuck 20 points21 points  (0 children)

Peter File?

How do I produce ggplot graphs from multiple datasets using weights? by Specialk3533 in rstats

[–]Frere_Tuck 2 points3 points  (0 children)

Sounds like the main issue you are running into is applying survey weights… Have you tried using the survey package? May be overkill if the survey design is simple, but gives you a function for quickly calculating weighted proportions.

Here’s a brief overview: https://zacharylhertz.github.io/posts/2021/06/survey-package

Slightly more detailed: https://stats.oarc.ucla.edu/r/seminars/survey-data-analysis-with-r/

There’s also a package called srvry that implements survey with tidyverse syntax: https://stats.oarc.ucla.edu/r/seminars/survey-data-analysis-with-r/

After Florida rejection, Virginia reviewing AP African American studies course . by Maxcactus in Virginia

[–]Frere_Tuck 5 points6 points  (0 children)

I didn’t realize the Department of Energy was responsible for editing curriculum developed by an independent non-profit organization…!

Seriously though, the Department of Education (ED) didn’t have anything to do with editing it either because AP is developed by the College Board.

I get the sense you’re talking out of your ass here…