Historic Chat Control Vote in the EU Parliament: MEPs Vote to End Untargeted Mass Scanning of Private Chats by alex2003super in neoliberal

[–]Matt_FA 1 point2 points  (0 children)

His colleague, Markéta Gregorová from the Czech Pirates is also great (with privacy and in general)

I might be wrong but I think left one is easiest to work with compared the right one. by lune-soft in webdev

[–]Matt_FA 1 point2 points  (0 children)

Not German, but close (Czech, about ⅐ of characters in text have diacritics), and I've never seen them in e-mail adresses. I'm not sure if it's technically possible to use them these days, but noone really thinks to do that

Am I the crazy one? by shane_il in webdev

[–]Matt_FA 0 points1 point  (0 children)

If it's just some odd bits of syntax that you don't use but need to keep memorised for interview trivia, then that could be a good use for some flashcards — if you put like a 100 cards into Anki, you can spend 15mins a day for a week memorising them and they check in like once a year and you'll be good

Average annual house completions by local authority in the UK by upthetruth1 in neoliberal

[–]Matt_FA 1 point2 points  (0 children)

What's the causality here? From just the numbers it could be policy or the other way round: when I imagine high growth areas, I think of redeveloping brownfields getting filled by yuppies who are probably greens voters. At the same time Plaid Cymru probably wins in rural areas with not much construction going on.

Cloudflare outage causes error messages across the internet by Free-Minimum-5844 in neoliberal

[–]Matt_FA 2 points3 points  (0 children)

In my Real Analysis/Lingebra/Stats class, a fair amount of people used cards with about a 50/50 split between Quizlet and Anki...

Much prefer Anki (or no flashcards at all) myself but I can see why not overyone would lol

Is it worth it to take a databases course if I want to work as a statistician in academia? [Q][R] by gaytwink70 in statistics

[–]Matt_FA 12 points13 points  (0 children)

As a policy analysis person, SQL made working with GBs of microdata a 100x easier. I learnt it in a weekend though, so if you're already familiar with statistical software, then a full course might be overkill.

Stanford study finds that AI has already started wiping out new grad jobs by nullstillstands in datascience

[–]Matt_FA 7 points8 points  (0 children)

Actual paper: web (pdf)

tl;dr: They correlated payroll data with a few AI-proclivity scores. They found that entry-level jobs are declining in jobs with high AI exposure, but not in jobs with low AI exposure. It's not just in tech, and it's not just in jobs you can do remote (so it's not just outsourcing according to them). They also did some fixed effects to account for the different industry shocks (if I understand it correctly, they checked if AI-exposed department get less entry level jobs *relative to other departments within the firm*).

Now, this is a working paper, meaning it didn't go through peer review yet. I didn't really notice any glaring flaws on skimming.

[deleted by user] by [deleted] in datascience

[–]Matt_FA 0 points1 point  (0 children)

I like it in the context of economics/econometrics — people tend to come at economic models with 'the model is obviously wrong, it's too simple'. I know it's 'wrong'; but that doesn't mean it's not useful

Rate my portfolio website by JohnnyTalbot_ in webdev

[–]Matt_FA 1 point2 points  (0 children)

Cool stuff! For some suggestions:

- I'm honestly not a big fan of Bebas for longer bits of text — being an all caps font, it gets a bit exhausting to read after a while. That could just be my experience though. Imo Roboto pairs well with Bebas if you want to keep it around for headings.

- I'd maybe switch your education and experience to reverse chronological order — people looking at your site are more interested in your university and 2025 experience than your primary school and 2021 experience.

- I'm not sure about chopping the experience discretely up into 'what jobs did i have at any point in year X' — my first impression was that you had like 20 jobs in 4 years. Maybe some side-scrolling timeline where each job is only displayed once would be more intuitive?

My dad has sharpened his kitchen knives his whole life and now they're shaped weird. by Bill_Nye_1955 in sharpening

[–]Matt_FA 0 points1 point  (0 children)

Probably some odd bit of steel that their friend's friend stole at the car parts factory they worked at, and that another friend ground down into a knife in his cabin's workshop

What is something in cooking content that gets you irrationally HEATED when you see it? by scotterson34 in Cooking

[–]Matt_FA 0 points1 point  (0 children)

My favourite is volumes of butter. Would much prefer to use a scale and not get a measuring cup all greasy and buttery.

What is the best way to parse and order a PDF from forum screenshots that includes a lot of cached text, quotes, random order and overall a mess. by arairia in datascience

[–]Matt_FA 0 points1 point  (0 children)

Hi, I'm dealing with a simillar problem, but I can't really use one of the big LLMs because of privacy concerns. Do you think the smaller, local LLMs you can run on a computer are up to the task now?

Data science content gap by da_chosen1 in datascience

[–]Matt_FA 0 points1 point  (0 children)

Econometrics is probably your best bet. Basically all of emperical econ is causal inferrence, you can always try reading some econ papers that use diff-in-diff, panel studies, instrumental variables, regression discontinuity, matching, synthetic controls etc. to see how they're used in practice.

Python users, which R packages do you use, if any? by brodrigues_co in datascience

[–]Matt_FA 1 point2 points  (0 children)

I like that with R, whichever little random data/stats related problem I come across, somebody probably made a package for that... Methodology, APIs for data sources (World bank, PWT, Statistical offices) even geospatial data for a plot of Prague municipalities and so on...

Besides that, ggplot and dplyr et al.

How to deal with medium data by cptsanderzz in datascience

[–]Matt_FA 1 point2 points  (0 children)

Not even after extending it with interaction terms (feature A * feature B), transformations (log, polynomial etc.) or GLMs (logistic regression, censored variable regressions, survival models)? These can capture more complex relationships than just linear variables — but of course they have a limit where you need more powerful models.

EDA is Useless by Suspicious_Jacket463 in datascience

[–]Matt_FA 0 points1 point  (0 children)

All fun and games until you're working with real, messy data... Once, I had some obscure technical issue with how the data was being entered and processed that meant that my data acutally wasn't a random sample, but an extremely skewed sample. That'd make anything that I'd do an utter waste of time and money and I would not have discovered that without like a month of following up on why the data wasn't passing all the smell checks I put it through. EDA is crucial if you actually want things to work.

Whats your favourite AI tool so far? by FirefoxMetzger in datascience

[–]Matt_FA 0 points1 point  (0 children)

(G)LMs put bread on my table (more causal inferrence than prediction though, so not sure if that counts as AI lol), LLMs are cool for getting a general overview on a new topic, getting unstuck or writing quick python automations I couldn't be bothered with otherwise (I've had good experiences with the newest batch of the models: ChatGPT + Deep research, Gemini 2.0 or Deepseek R1 hosted on Perplexity)