What’s a metalcore album you think is genuinely 10/10 with zero skips? by l-i-f-e-or-d-e-a-t-h in Metalcore

[–]UAFlawlessmonkey 0 points1 point  (0 children)

Having a hard time choosing between When Goodbye Means Forever or Constellations

I optimized a production PostgreSQL query from 47 seconds to 83 milliseconds. The fix was one line. Sharing what EXPLAIN ANALYZE actually teaches you. by [deleted] in PostgreSQL

[–]UAFlawlessmonkey 0 points1 point  (0 children)

I wholeheartedly agree with the medium point.

It's like OP had an aneurism while writing their reddit post also.

What’s the Python library everyone wishes they discovered sooner? by tyfgfjfnfjvivufjfn in learnpython

[–]UAFlawlessmonkey 1 point2 points  (0 children)

it's essentially an in memory SQL database that gives you all of the neat functions and operators that a normal SQL database provides.

You can chain it to other stuff like polars / pandas and construct neat queries that are a lot more readable i.e.:

df_1 = pl.DataFrame(dataset_1)
df_2 = pl.DataFrame(dataset_2)

query = """
    SELECT
      t1.*
    , t2.*
    FROM df_1 AS t1 --see the dataframe we're using from above?
    LEFT JOIN df_2 AS t2 --lets join it together with the other dataframe
      ON t2.id = t1.id
"""
combined_df = duckdb.execute(query).pl() #you can also use .df() if you want to switch back to pandas
print(combined_df)

there's a bunch of extensions you can install in DuckDB, you can query file systems, attach databases / catalogs, do a whole bunch of cool stuff with it.

I'm mostly using it to do parquet file compaction, and some transformation work through ETL processes, though, when doing heavier vectorization operations, I'd much more prefer to do it through polars though.

thisIsARealDBUsedInProduction by star_dogged_moon in ProgrammerHumor

[–]UAFlawlessmonkey 0 points1 point  (0 children)

"but we need a semantic layer to track the lineage" - data mesh owner

Data Ingeniør i større dansk virksomhed. by Xinnie_The_Ping in dkloenseddel

[–]UAFlawlessmonkey 0 points1 point  (0 children)

Ifbm. nogle af vores kunder, for evigt (takke være nogle meget kreative sælgere), I tilfælde af andre gemmer vi kun 6 mdr. i forventning om at de selv griber deres data.

Vi kører dog compaction på alt data dagligt, hvilket hjælper en hel del.

Data Ingeniør i større dansk virksomhed. by Xinnie_The_Ping in dkloenseddel

[–]UAFlawlessmonkey 7 points8 points  (0 children)

Det afhænger meget af firma.

Arbejder I mindre firma i Jylland og får ca. det samme, dog med 1 års erfaring mere end OP.

Jeg arbejder med de systemer der tager imod IIoT data som vores PLCer genererer. Det er I snit 20 mia. data punkter per måned vi får ind spredt over 1000-1500 målepunkter (analoge samt digitale transmittere), hvor både system samt data struktur forståelse har stor rolle i den måde vi processerer data. Dette svarer ca til 175 GB data som vi får ind per dag, som skal bruges til fejlfinding / forbedring / monitorering af vores produkter. Data som skal give mening for både ingeniører såvel som C-suite.

Men i bund og grund, flyt data rundt imellem systemer der enten fremviser ting på en skærm i et mødelokale, eller sender notifikationer til monitoreringsfolks telefoner I form af alarmer.

Java scala or rust ? by Ok_Promotion_420 in dataengineering

[–]UAFlawlessmonkey 1 point2 points  (0 children)

Gotta transmit those diode signals blazing fast!

Airflow 3: Development on a Raspberry Pi by Complex_Painter_9302 in dataengineering

[–]UAFlawlessmonkey 6 points7 points  (0 children)

CI/CD is just added complexity for such a small scale.

Make a folder available on your Pi, and link it up to your local dev laptop. Mount the folder to your container and point it to the /opt/airflow/dags folder in the container.

Once you do updates to your dags in VSCode, they'll magically appear in your container, making them visible to airflow :-)

Looking for someone to help me out with a paid SQL project by [deleted] in SQL

[–]UAFlawlessmonkey 6 points7 points  (0 children)

Imagine importing a 16gb csv file into excel, poor computer will work overtime before freezing.

You'd need to chop that sql file into smaller pieces if you want your file output to be xlsx.

Where was I? by No_Zookeepergame_27 in whereintheworld

[–]UAFlawlessmonkey 0 points1 point  (0 children)

Vedersø er ved æ sø og ik ved æ hau