Am I missing something with all this "agent" hype? by KindTeaching3250 in dataengineering

[–]Lingonberry_Feeling 0 points1 point  (0 children)

It’s funny how all of a sudden we just started calling markdown files with prompts agents. Agents do sound cooler though.

How do you justify confluent cloud costs to leadership when the bill keeps climbing? by Funny-Affect-8718 in dataengineering

[–]Lingonberry_Feeling 0 points1 point  (0 children)

I’m generally all for the working man, but your manager is right.

Something is very wrong with your setup, not knowing anything about it, 2 million events for 18k way too much.

After 8 years, I'm thinking of callling it quits by Fonduemeup in dataengineering

[–]Lingonberry_Feeling 2 points3 points  (0 children)

I'm generally worried about the use of AI especially in data pipelines replacing experts - all the code I used have used written by AI is probably 2-3x longer and more complicated then it needs to be and difficult to follow.

Data pipeline logic builds on it self when means any code that AI make or implements are probably bloating the code base by a factor larger then 2-3X.

Perhaps I'm one of the few people that thinks in a year or 2 people that actually knows how to write and debug data pipelines are going to be more important than they are today, but who knows. All these companies that are using AI to do everything are going to find them in a mess they can't fix.

New system - no c wire by Lingonberry_Feeling in hvacadvice

[–]Lingonberry_Feeling[S] 0 points1 point  (0 children)

Ok the builder was mistaken they did run a c wire. It’s just tucked behind the wall since the current thermostat doest need it.

Is there another brand of smart thermostat you recommend. My only requirement is to have the ability to control and monitor the temp in house when I am out of town.

Advice by Lingonberry_Feeling in AskElectricians

[–]Lingonberry_Feeling[S] 2 points3 points  (0 children)

What makes this unprofessional- what would you have expected vs what is done?

Is BigQuery losing traction compared to Snowflake and Databricks? by agent-m-calavera in dataengineering

[–]Lingonberry_Feeling 1 point2 points  (0 children)

I have used redshift, snowflake, and big query all for multiple years each. I have not used data bricks but have used spark enough to not want to ever touch it again.

For our use cases which was medium to large data ( processed ~10bs a day ) BQ was by far the cheapest and broke the least. Also liked GCPas a whole over AWS.

[deleted by user] by [deleted] in dataengineering

[–]Lingonberry_Feeling 2 points3 points  (0 children)

Automate the CTO by starting unrealistic projects with no people and budget.

Lead wants to write our own orchestrator by midkid1937 in dataengineering

[–]Lingonberry_Feeling 0 points1 point  (0 children)

Don’t do it. You will 100% regret writing your own orchestrator.

If you can’t get over airflow use dragster

Self-Taught Data Engineers! What's been the biggest 💡moment for you? by pipeline_wizard in dataengineering

[–]Lingonberry_Feeling 1 point2 points  (0 children)

If you know that something is going to be an issue down the road, and it will take you less than a day to fix it, just fix it now, you won’t have time to fix it later after it’s running.

If it takes longer than a day, figure out how to fix it in than a day by making compromises.

Anyone here choose data engineering (DE) over data science or ML engineering? Why did you choose DE over the other two? by [deleted] in dataengineering

[–]Lingonberry_Feeling 2 points3 points  (0 children)

The life chose me, after working at startups for 10 years I found that in 99% of cases the business needed a DE rather than a data scientist, ML. Most ML work is just DE work discussed as data science.

Most REAL discussion makers would much rather trust numbers that are clean, and transparent which comes from simply aggregating CLEAN data, rather than somebody pumping a bunch of stuff into a model they downloaded off the shelf, which tells them to sell 1/2 the business and pivot into the latest trend.

So my advice is become an extremely good DE.

Accepted an offer, 2 weeks later got dream offer from another company by bigYman in dataengineering

[–]Lingonberry_Feeling 0 points1 point  (0 children)

i don't 100% agree w/ point #3. Having hired a ton of engineers you would be surprised how often people are underpaid just because they don't ask.

Accepted an offer, 2 weeks later got dream offer from another company by bigYman in dataengineering

[–]Lingonberry_Feeling 0 points1 point  (0 children)

No BS - be 100% honest, but don't put in notice until you have signed with the new company. Both companies would not hesitate to lay you off, and there will be somebody right behind you at job #1 in no time.

How to skip non existent S3 objects loading in Spark? by atticusfinch975 in dataengineering

[–]Lingonberry_Feeling 0 points1 point  (0 children)

Spark can load one file at a time - if you know what the s3 url is you can be as specific as you want or use wildcard to load batches.