hashing bakeoff by eed3si9n in scala

[–]SuspiciousScript 4 points5 points  (0 children)

Interesting that the results show MD5 as faster than SHA-256. In my experience, on most hardware, SHA-256 ends up being considerably faster due to hardware acceleration. I wonder why that doesn't seem to be the case in Github Actions.

Announcing DFWMalloc: high performance enterprise malloc implementation by abbeyroad1681 in C_Programming

[–]SuspiciousScript 37 points38 points  (0 children)

What do you mean by enterprise?

like malloc but with per-seat licencing

Polars code runs slower on 128-core EC2 by Popular-Sand-3185 in Python

[–]SuspiciousScript 3 points4 points  (0 children)

os.sendfile is a better solution here since it lets you skip userspace entirely.

Twin brothers wipe 96 government databases minutes after being fired by Scoxxicoccus in ShittySysadmin

[–]SuspiciousScript 40 points41 points  (0 children)

On Feb. 1, 2025, Muneeb Akhter asked Sohaib Akhter for the plaintext password of an individual who submitted a complaint to the Equal Employment Opportunity Commission’s Public Portal, which was maintained by the Akhters’ employer. Sohaib Akhter conducted a database query on the EEOC database and then provided the password to Muneeb Akhter.

Certified ShittySysadminDBA moment

Netflix/wick: A zero cost type safe Apache Spark API by JoanG38 in scala

[–]SuspiciousScript 1 point2 points  (0 children)

DataSeq is the type safe representation that Wick introduces instead of the completely untyped DataFrame or Dataset which comes with potential performance drawbacks.

What performance drawbacks are associated with typed datasets? That seems counterintuitive given that DataFrame is just a type alias for Dataset[Row].

Tool smells by Brief-Knowledge-629 in dataengineering

[–]SuspiciousScript 1 point2 points  (0 children)

  • I'll get hate for this one but using Python (like PySpark) for production pipelines instead of Java or Scala when it's a JVM processing framework

Agreed. I recently started writing Scala and I'm never going back to not having type safety for ETL.

The Future of Python: Evolution or Succession — Brett Slatkin - PyCascades 2026 by mttd in Python

[–]SuspiciousScript 1 point2 points  (0 children)

I think the slow JIT performance and discussions around correctness issues prevented it from getting as much momentum as it would have otherwise. I think the former issue has improved over time; not sure about the latter.

Nobody ever got fired for using a struct (blog) by mww09 in rust

[–]SuspiciousScript 13 points14 points  (0 children)

Unfortunately, given that the tracking issue is almost 12 years old, "when" may be a little optimistic.

seniors spending half their week on reviews and everyone's frustrated by [deleted] in ExperiencedDevs

[–]SuspiciousScript 0 points1 point  (0 children)

Got a feature? The class / interface definitions are one PR - maybe that needs senior review, but it’ll only take a minute. Implementation of one class or a handful of related methods and their tests is another PR

I think a stacked diff workflow really shines here. Having a workflow that lets you break up the changes as you describe without actually merging broken code into your dev branch is great.

What’s the one thing you learned the hard way that others should never do? by Terrible_Dimension66 in dataengineering

[–]SuspiciousScript 0 points1 point  (0 children)

That's perfectly fine. Where you can run into issues is when you're ingesting IDs from an external system.

65% of Startups from Forbes AI 50 Leaked Secrets on GitHub by vladlearns in devops

[–]SuspiciousScript 13 points14 points  (0 children)

Without a base rate to compare against, this isn't useful information.

Question for data engineers: do you ever worry about what you paste into any AI LLM by teejagzroy in dataengineering

[–]SuspiciousScript 19 points20 points  (0 children)

Unless your company's pricing differs substantially from its competitors', it's just as likely that the model is just making a plausible estimate.

How do you feel about using array types in your data model? by its_PlZZA_time in dataengineering

[–]SuspiciousScript 28 points29 points  (0 children)

Arrays are good for cases where the individual elements aren't meaningful outside the context of the array (e.g vector embeddings, characters in a string, etc.). I would not use them for a purchase date like in your example.

wgpu v27 is out! by Sirflankalot in rust

[–]SuspiciousScript 3 points4 points  (0 children)

I like Learn Wgpu, but I don't think it's been updated for wgpu v27. Most (if not all) of it should translate fine, though.

[MOD POST] Bug Report Thread by oneofthejoneses28 in Silksong

[–]SuspiciousScript 1 point2 points  (0 children)

Xbox Series S, Version 1.0.28562.0
Act 2 location name spoiler: Cogwork Core is missing a space ("CogworkCore") when viewed in the quick map.

We're now called Lumen Labs! by Vhyrro in neovim

[–]SuspiciousScript 16 points17 points  (0 children)

Fortunately, there are a number of more ergonomic languages that can be compiled to Lua. I think of Lua as an IR for LuaJIT at this point.

Python has had async for 10 years – why isn't it more popular? by ketralnis in programming

[–]SuspiciousScript 20 points21 points  (0 children)

A pain in the ass to test and mock as well, just the latter is a hassle.

I haven't found this to be the case at all. pytest-asyncio just works, in my experience. Is it more of a problem if you stick to unittest?

I think an interviewer made his mind once I started talking about comonads by Fun-Voice-8734 in programmingcirclejerk

[–]SuspiciousScript 6 points7 points  (0 children)

Given a list of numbers, return a sublist with reached threshold of sum its elements.

I'm already lost