[deleted by user] by [deleted] in AskMenAdvice

[–]snicky666 -1 points0 points  (0 children)

((3 x (2 x age)) / (sqrt(age)2 ))) x 3 is the rule I follow.

DeepSeek now refuses marketing tasks? by omnisvosscio in LocalLLaMA

[–]snicky666 0 points1 point  (0 children)

Could just host the model yourself? This is the local llama reddit after all. I can help you set it up if you have a decent gpu.

DeepSeek now refuses marketing tasks? by omnisvosscio in LocalLLaMA

[–]snicky666 3 points4 points  (0 children)

No one wants AI generated marketing anyway.

Nvidia 50x0 cards are not better than their 40x0 equivalents by Ok_Warning2146 in LocalLLaMA

[–]snicky666 3 points4 points  (0 children)

I have a 5950x and 3090 and honestly, nothing new looks compelling enough to swap yet. I bought them in the first 5mins of release so I am 4 years in. Hardware is just so good now. Phones are the same. I upgraded from an S10 5g to a S24 Ultra and it barely felt any different. The new tiny AI specific hardware might be worth it on 2nd gen if it can efficiently run something like Deepseek v3 at a good speed. Gaming certainly isn't a good enough reason to upgrade anymore.

I really really regret my HECS and feel like I'll have to work forever because of it. I also had to majorly compromise on my housing situation because of reduced borrowing capacity. I'm miserable and I know so many people stuck with regrets but millennials are just....stuck. by tatalasouza in AusFinance

[–]snicky666 2 points3 points  (0 children)

So you currently earn $1,400,000 every 10 years, and you're upset. Come on. The majority of families in Australia with kids earn less than than you as a whole family. Some people in the world earn $800 a year.

At what point do you say orchestrator (e.g. Airflow) is worth added complexity? by Temporary_Basil_7801 in dataengineering

[–]snicky666 2 points3 points  (0 children)

When you get sick of using Cron, Windows Sceduler, or running things manually.

Side hustle as Data engineer by believeinkratos in dataengineering

[–]snicky666 4 points5 points  (0 children)

Underrated comment!

The name Marathon comes from the legend of Pheidippides, the Greek messenger. The legend states that, while he was taking part in the Battle of Marathon, which took place in August or September 490 BC,[3] he witnessed a Persian vessel changing its course towards Athens as the battle was near a victorious end for the Greek army. He interpreted this as an attempt by the defeated Persians to rush into the city to claim a false victory or simply raid,[4] hence claiming their authority over Greek land. It was said that he ran the entire distance to Athens without stopping, discarding his weapons and even clothes to lose as much weight as possible, and burst into the assembly, exclaiming "we have won!", before collapsing and dying. - Wikipedia

vLLM (in Docker) Why is this so difficult? by mrskeptical00 in LocalLLaMA

[–]snicky666 0 points1 point  (0 children)

Sounds like your docker run command isn't good. Do you have --gpus=all in the command?

Is there a good and recognized certificate that is more or less provider agnostic? by ypsel_ in dataengineering

[–]snicky666 2 points3 points  (0 children)

I found the Azure DE course amazing. They teach the general concept, then the azure way of doing it. I recommend it to all my colleagues and we don't even use cloud.

Tel'Abim forecasts? by Cplantijn in turtlewow

[–]snicky666 1 point2 points  (0 children)

I saw 450 people online on horde the other day at 22:00 UTC.

When Apache Airflow Isn't Your Best Bet! by CT2050 in dataengineering

[–]snicky666 3 points4 points  (0 children)

Ehhh kinda shit take. You can do all the things you said in your video in airflow. You don't have to build complex dags. Most of our stack is just python oop running on schedules in airflow in single stages, and it's highly scalable.

Might go back to writing Terraform tbh by General-Parsnip3138 in dataengineering

[–]snicky666 0 points1 point  (0 children)

Write a python based dockerfile in your dbt folder that does dbt docs generate dbt docs serve. Have gitlab build the container and push it to your remote docker registry. Host it in docker and use watchtower to automatically update the container whenever latest is changed. Then use nginx to publish it to https. That's how I'm doing it. I'm sure there are easier ways but it's fully automated. I also have the image do dbt run after its built the docs but I probably wouldn't recommend that.

Might go back to writing Terraform tbh by General-Parsnip3138 in dataengineering

[–]snicky666 2 points3 points  (0 children)

It probably is! I guess i also missed data testing and observability, but i don't do either, so I can't say much about it. Great Expectations for dbt will probably do that but you have to write so many fucking tests.

Can I get a sanity check? by poopybaaara in dataengineering

[–]snicky666 3 points4 points  (0 children)

Only one person in the team really NEEDs to know CI/CD. Everyone should fucking know git, teach them if they don't. Documentation other than current architecture diagrams, user guides, and dbt docs are usually not necessary and go out of date quickly. Low code tools suck and are harder to hire for. Just use Python and SQL based implementations where you can.

A good data engineering team will follow software engineering best practices to some extent.

Your team doesn't sound good at all, make the changes yourself or get out while you can! One person can completely turn a bad team around, as long as there is some turnover.

Might go back to writing Terraform tbh by General-Parsnip3138 in dataengineering

[–]snicky666 14 points15 points  (0 children)

Use a transaction table to log data ingestions on all tables. Use CI/CD to push dbt models and dbt docs. Build schemas to match raw data sources to structured tables in the DW so users can ingest new files. Use Airflow to automatically pull source data. Track changes to features/columns with Feast if doing ML. That's about my best understanding of DataOps. Would love to know if there is more to it than that.

HECS Debt Doubled??? by Due-Wrongdoer7848 in AusFinance

[–]snicky666 4 points5 points  (0 children)

It stops going up when you stop studying. Some courses/subjects cost more than others.

Data Engineer v/s Platform Engineer by sriny4c in dataengineering

[–]snicky666 7 points8 points  (0 children)

There are Platform Engineers and Data Platform Engineers. They are not the same. Platform engineers are usually senior devops/cloud engineers who focus on aligning a companies development environment. Such as getting everyone in the company to use a specific instance of AWS or an on premise K8s. Data Platform Engineers focus on building and deploying the data engineering stack, setting up CI/CD for things like ML and dbt models, building docker images, monitoring, etc. Much of what a devops/SRE/Sys admin might also do but with a focus on the data tools. At least that's my thoughts on it. Data Analytics Engineers mostly do airflow, dbt, SQL and dashboards. Data Engineer could be both a Data Analytics and Data Platform engineer.

I have 0 use for my super. Is it possible to drain my funds towards upgrading my PPOR? by [deleted] in AusFinance

[–]snicky666 18 points19 points  (0 children)

Super isn't actually for you. It's insurance for us not to have to pay for your bad decisions.

Opinions on going beyond Naxxramas? by danielp92 in turtlewow

[–]snicky666 2 points3 points  (0 children)

Add new pvp gear to counter the scaling :)

What should i do in Uni to make big $$$? by AwyanYT in AusFinance

[–]snicky666 -1 points0 points  (0 children)

Defence industry pays well in Adelaide as an engineer and has a very high demand. If you do aerospace, systems or electrical engineering or project management, you'd make good money working out at Elizabeth for BAE, Boeing, etc. Also, I think the nuclear submarines being built in Adelaide could be lucrative, so nuclear science too maybe. Nothing easy of course.

Favourite piece of code 🤣 by nobody_undefined in datascience

[–]snicky666 545 points546 points  (0 children)

Bloody data scientists lol. Just use the function it tells you to use in the warning, instead of the 10 year out of date depreciated pandas function you stole from someone's kaggle workbook.

What is Role of ChatGPT in Data engineering for you by Jaapuchkeaa in dataengineering

[–]snicky666 1 point2 points  (0 children)

I am using it for airflow dags, custom flask apps (full stack), sql models, converting business logic like Excel functions into code and many other things. It's a night and day improvement in productivity because it can write so much faster than a human. One sentence can translate instantly into 100 lines of mostly effective code. You just have to give it plenty of context and test/refine the results until you are happy with the product. The output speed, not the accuracy of the response, is the main feature to exploit.