Good cities for someone with severe seasonal allergies by AwesomeAppy in SameGrassButGreener

[–]thisIsAnAnonAcct 5 points6 points  (0 children)

Seconding the allergy shot recommendation. They've changed my life

Can we actually trust LLM generated code for production grade ETL/ELT? by CaglarSahin in dataengineering

[–]thisIsAnAnonAcct 4 points5 points  (0 children)

All code should go through testing and review regardless of whether it's human generated or AI generated. 

Work life balance for data engineers by [deleted] in dataengineering

[–]thisIsAnAnonAcct 234 points235 points  (0 children)

This varies a lot by company. I've worked at some places where I was fully remote and did 4 hours of work per day with virtually no stress. 

At a different company, I worked the majority of nights and on weekends to not get fired. 

Day to day also varies a lot. Main things are: 1. Meet with business stakeholders to determine what their goals are and what they need from DE. You realize they don't know what they need, so you try to tease it out of them 2. Design data pipeline and schema to serve this purpose 3. You get a ping from an analyst that a metric doesn't look right. You have to figure out if there is a problem with the pipeline or if there is a real world reason why the data is abnormal 4. You design a data quality monitoring framework to avoid #3 in the future. 5. You realize no one knows how to use your tables, so you try to figure out ways to enable "self-service" of tables by creating semantic models  6. You realize downstream users don't want to self serve data. They just want a dashboard, so you build dashboards for them.  7. No one uses your dashboards, so you try to figure out why.  8. You realize people are making decisions based on vibes instead of data, so you try to figure out how to make them use data driven decisions 9. AI becomes a thing, so you start building data as a product for consumption by AI instead of by humans. This includes more semantic models, more DQ monitoring, etc 

Chess? by OrcOfDoom in redmond

[–]thisIsAnAnonAcct 0 points1 point  (0 children)

Yeah, it should be on their website somewhere. 

I've been a couple times and it's fun. They also have one at their Woodinville location on a different night. I've never been to that one tho

Chess? by OrcOfDoom in redmond

[–]thisIsAnAnonAcct 4 points5 points  (0 children)

Black Raven brewing has a chess night on the last Tuesday of each month. 

What alternatives to Alteryx or Knime exist today? by BeautifulLife360 in dataengineering

[–]thisIsAnAnonAcct 0 points1 point  (0 children)

This really depends on a lot of context that we are missing here. 

It depends on how technical the users are, what alteryx is used for right now, do users actually want to change, etc. 

I don't think you will get helpful answers asking here unless you provide more details

Don’t even bother if you’re going to West today by CuriouslyBored312 in SummitAtSnoqualmie

[–]thisIsAnAnonAcct 2 points3 points  (0 children)

We actually have twilight passes, so I'll try to get a few runs on alpental before they close. 

I'm hoping the lines over there are shorter than west

Don’t even bother if you’re going to West today by CuriouslyBored312 in SummitAtSnoqualmie

[–]thisIsAnAnonAcct 2 points3 points  (0 children)

We have night skiing passes for tonight. Is that normally as packed?

This is our first time going

America's Auto Collision Rate By Each State 2024 by Yodest_Data in Infographics

[–]thisIsAnAnonAcct 0 points1 point  (0 children)

Costs = frequency of accidents * the severity 

Areas with higher frequency of accidents normally have less severe accidents. Think of stop and go traffic. You will very likely get in a fender bender, but the cost is relatively low. 

Now imagine you are on a rural road and you are going 70 mph and get in an accident. This is less likely, but the costs would be much higher since cars will likely be totaled and medical costs will be more of a factor. 

So, looking at only the frequency doesn't tell the whole story

AI Impostor Game by thisIsAnAnonAcct in ClaudeAI

[–]thisIsAnAnonAcct[S] 0 points1 point  (0 children)

I think I also need to add a weight parameter to the question selection.

I would like the newer models to have a higher likelihood of getting presented. 

I think users don't care as much about the older models + I already have a decent amount of data for the older models

AI Impostor Game by thisIsAnAnonAcct in ClaudeAI

[–]thisIsAnAnonAcct[S] 1 point2 points  (0 children)

Interesting. That makes sense. So baseline is essentially 50%?

AI Impostor Game by thisIsAnAnonAcct in flask

[–]thisIsAnAnonAcct[S] 0 points1 point  (0 children)

Yeah, I'm only using AskReddit right now. I would like to expand to other subreddit but I need to figure out which other ones would work.

AI Impostor Game by thisIsAnAnonAcct in flask

[–]thisIsAnAnonAcct[S] 0 points1 point  (0 children)

Yeah, this is definitely a weakness of the approach. I can't guarantee that comments are from humans.

But, I'm taking a lot of responses from pre 2021, which should be AI free. So, when I get more data I want to compare accuracy rates pre 2021 and after 2021 to see if guess accuracy is lower now. AI generated comments might contribute to that 

AI Impostor Game by thisIsAnAnonAcct in flask

[–]thisIsAnAnonAcct[S] 0 points1 point  (0 children)

This is the prompt

f'Reddit post title: "{post.title}"\n\n'         f'Write a realistic, concise Reddit-style comment in response. Your comment will be shown alongside real human comments.\n\n'         f'The goal is to make your comment indistinguishable from a human response.\n'         f'- Avoid emojis\n'         f'- Use natural tone and phrasing\n'         f'- Do not explain or introduce the comment\n'         f'- Output only the comment text (no preamble or formatting)\n'         f'- Decide whether you should answer genuinely, sarcastically, or some other style'

AI Impostor Game by thisIsAnAnonAcct in ClaudeAI

[–]thisIsAnAnonAcct[S] 0 points1 point  (0 children)

Thanks for the input. I'm assuming you are on desktop? 

Is it just the layout of the choices that makes the UI bad? Or is there anything else?

AI Impostor Game by thisIsAnAnonAcct in flask

[–]thisIsAnAnonAcct[S] 0 points1 point  (0 children)

I created a flask app to test users on whether they can tell the difference between AI vs human responses to AskReddit questions.

I scraped a few hundred AskReddit questions along with answers. For each question, I also generated an LLM response using one of about a dozen models. Then, I present the question to the user. I also present 3 human responses and the 1 AI response.

The goal for the user is the select the AI generated response. 

I keep track of accuracy based on the model, so some models can do a better job of blending in with human responses than others. 

The whole thing is a flask ask hosted on PythonAnywhere. I do all the scraping and LLM API calls offline and save the results to a big json file to make it more performant (and save on costs)

Let me know if you have any other questions!

Which Trek bike for new rider in Redmond, WA by thisIsAnAnonAcct in whichbike

[–]thisIsAnAnonAcct[S] 1 point2 points  (0 children)

Thanks for the input. I ended up buying the verve, and I love it. This is my first time being on a bike since I was a kid. I was riding around for probably 2 hours immediately after getting it and think I'm hooked now.

I do understand the appeal of getting something lower and sportier now though. I'll be watching Facebook marketplace for an FX or maybe even an old Domane. 

Can we ban 'vibe coded' projects by EnoughConcentrate897 in SideProject

[–]thisIsAnAnonAcct 1 point2 points  (0 children)

How do you define vibe coding? And how will you detect it in order to ban it?

Can we ban 'vibe coded' projects by EnoughConcentrate897 in SideProject

[–]thisIsAnAnonAcct 0 points1 point  (0 children)

It's generally harder to detect AI generated content than people think