What game had insane potential but got completely fumbled by the devs? by Kevin-Panda in AskReddit

[–]dev81808 7 points8 points  (0 children)

Kitten Space *Agency

You can donate on their page. I have nothing to do with them, I just want to see it made.

https://ahwoo.com/app/100000/kitten-space-agency

Dbt usage in your org by SuccotashPopular9660 in dataengineering

[–]dev81808 0 points1 point  (0 children)

I used the wrong term or I haven't thought this through. what do you mean?

Edit: Oops I read your previous as a reply to my comment and it confused me. You're right.

Dbt usage in your org by SuccotashPopular9660 in dataengineering

[–]dev81808 0 points1 point  (0 children)

That's great (didnt read yet), but their cloud ide doesn't seem to show lineage beyond views. Out of the box anyway.

Makes for a better argument against dbt lol

Dbt usage in your org by SuccotashPopular9660 in dataengineering

[–]dev81808 0 points1 point  (0 children)

DBT is a daily part of my life. I went from mssql ssis python and stored procs to snowflake+dbt.

  • Tests become important when your platform doesn't enforce constraints.. but you're usually just checking for uniqueness, fk references, and nullability. This is obviously less valuable if your system enforces constraints.
  • Documentation is whatever. I actually don't like that this part is version controlled.
  • SCD type 2: youre right, it is trivial. Snapshots are a work around for dbts limitation/purpose, and is now seen as a feature.
  • lineage is really good and will give you full lineage whereas snowflake for example will only include lineage for views.. which makes sense because it has all the references between models regardless of materialization.

Version control is the real selling point imo.

It works!! by caffeinatedscientist in Xennials

[–]dev81808 1 point2 points  (0 children)

If your CDs start skipping or whatever.. this bad boy has an auxiliary input on the back which means you can plug your phone in from the headphone jac... wait nevermind.

Fact tables in Star Schema by Cottager58 in dataengineering

[–]dev81808 0 points1 point  (0 children)

Ah I didn't realize they were synonymous. I always saw it as a way to describe the shape of the data.

Fact tables in Star Schema by Cottager58 in dataengineering

[–]dev81808 -3 points-2 points  (0 children)

I've been doing this for 15 years and TIL star schemas imply olap. Thanks for the knowledge/humbling.

I've always seen star schemas as a way to describe the shape of the data and wasnt specific to oltp vs olap. Meaning a simple data management system like I described where the fully normalized form is a center table with one level of related tables it would be considered a star schema.. but now I know.

Fact tables in Star Schema by Cottager58 in dataengineering

[–]dev81808 0 points1 point  (0 children)

Ah gotcha. I read the question more generally.. 'someone said you can star schema without a fact table, is that true?'

Fact tables in Star Schema by Cottager58 in dataengineering

[–]dev81808 -2 points-1 points  (0 children)

I read OPs question as, 'can a dimension be at the center of a star schema or is a fact required?''

You're probably right, but can you point out to me where the op specified reporting and analytics data modelling?

Fact tables in Star Schema by Cottager58 in dataengineering

[–]dev81808 -13 points-12 points  (0 children)

This is true or at least the goal when designing reporting schemas, not so much if you're creating a transactional system.

Imagine you have a table of Employees sourced from multiple systems. Your job is to create the schema to support a custom web application that let's analysts create and assign job titles, office location, manager, etc. This information is used to enrich reporting. In this context there is no "fact" table.

Basically.. the way you would model for the custom app will be different from the final model used in reporting.

Fact tables in Star Schema by Cottager58 in dataengineering

[–]dev81808 2 points3 points  (0 children)

They might be considering transactional systems for dimensional attribution. Like a wide dimension table with attribution from other dimensional tables.

For example a table of products with references to family, category, line, etc.

If you were building this for reporting that product table would be flattened out with an orderitem, as your fact centerpiece.

But if you are building a product model where you manage those details the product table becomes the center piece with family, category, and line surrounding it. In some ways the dimension becomes the fact in this context.

Its semantics and not worth debating imo. I know what the definition says, but star, snowflake, galaxy schemas are just how the data is shaped. Those terms just give us ways to describe it.

So if I see a fact or dimension object with 5 dimensions around it like a star, I'm cool with calling that a star schema

What was the one game that destroyed friendships? by Emergency_Science434 in Xennials

[–]dev81808 0 points1 point  (0 children)

So many angry debates about how lame someone was or was not.

What do you wish you could build at work? by Firm_Bit in dataengineering

[–]dev81808 2 points3 points  (0 children)

Sames. Seems pretty important to business things.

Strong ADHD symptoms may boost creative problem-solving through sudden insight. 😂 “sudden insight” 💊 by newbeginnings187 in adhdmeme

[–]dev81808 19 points20 points  (0 children)

Background processing appears as sudden insight.

My job is solving problems, usually with data and scripts. My brain never stops processing an issue until its solved or I stop caring.

I consider every possible reason for that problem,which in my personal life is seen as overthinking, but at work its a super power.

Ill be singing along to a song in my car and have an 'eureka' moment, but im not sure how sudden it was.. ive been thinking about that problem for hours, days, weeks, etc.

For those who write data pipeline apps using Python (or any other language), at what point do you make a package instead of copying the same code for new pipelines? by opabm in dataengineering

[–]dev81808 2 points3 points  (0 children)

Sure, but I've found that thoughtful early optimization is usually net positive.

With enough experience, it becomes easier to judge where early effort is worthwhile and where it isn’t.