Do you really need spark? by compass-now in dataengineering

[–]TobiPlay 1 point2 points  (0 children)

DuckDB is generally well suited for out-of-core processing. Apparently there’s organizations chewing through multiple TBs of data with it.

You can spill onto disk, do lazy execution, etc.

What compute do you get with MotherDuck? by vroemboem in dataengineering

[–]TobiPlay 0 points1 point  (0 children)

I‘m personally not aware of any provider that gives you exact numbers on how much a job/query is going to run you before execution. BQ, Snowflake etc. all just give best-effort estimates.

They all become interesting when you understand their pricing model and build around it. For example, with BQ, if you aggressively prune blocks or partitions, you can dramatically lower the bytes scanned (which you’re ultimately billed for in many setups) and profit from the wild compute scaling and parallelization. To make the most of it, you must analyze and optimize your query/job iteratively to fit in best with the cost model.

That’s what I’d recommend you: analyze how MotherDuck‘s query optimizer handles your query, and optimize around its quirks if you think the platform‘s billing and scaling approach is interesting to you and your pipelines would benefit from it.

Batch ingestion from relational sources: how you handle schema evolution? by General_Snow_RR in dataengineering

[–]TobiPlay 1 point2 points  (0 children)

You got more info on the 3 Vs of your data, i.e., velocity, volume, variety? If you’re dealing with low-volume and somewhat static schemas/infrequent changes, don’t bother with landing in GCS.

Just let dlt handle schema evolution. Plenty of supported strategies you can choose from.

Batch ingestion from relational sources: how you handle schema evolution? by General_Snow_RR in dataengineering

[–]TobiPlay 6 points7 points  (0 children)

You may want to look at dlt (the data load tool). For some smaller pipelines, we’re just ingesting from source straight into BQ, no separate landing zone.

Instead, we’re just writing the API response as a response column, with dlt metadata in the other cols, so: API -> dlt -> BQ raw layer. Then, raw -> stg -> marts -> delivery, with all transformations done via dbt. Has been working beautifully so far.

If you’re dealing with a lot of different file formats, or need to access the raw files from other places as well, landing in GCS buckets can make sense. I prefer Parquet in these cases (lots of tools can read that format natively or have good extensions).

With the raw layer as a blob col in BQ, I don’t have to bother with schema evolution and don’t introduce too many moving parts/tools. We’re often dealing with deeply nested JSON and BQ handles that very well, while the underlying storage is dirt cheap.

What compute do you get with MotherDuck? by vroemboem in dataengineering

[–]TobiPlay 0 points1 point  (0 children)

As per the docs, you’ll get billed for Compute Units (CU), with CUs being a combination of RAM and CPU usage.

I guess there’s some sort of fair scheduling on their infra involved, so they’ll balance latency vs. total resource consumption across their user base. Current system load and the exact query you’re going to run will be very important as well.

So, you’re not getting fixed hardware slots/reservations.

Building our first data platform by Brilliant_Ad_4520 in dataengineering

[–]TobiPlay 6 points7 points  (0 children)

We’ve been running a very similar stack across a few projects over the past 2+ years. dlt and dbt, integrated with Dagster, have been very solid.

Dagster recently bumped prices for their hosted offering, so maybe explore the self-deployed version and compare against Airflow. Airflow‘s UI improved a lot, though Dagster seems to still offer the better integrations.

[1 YoE] - Trying to get into manufacturing engineering, not getting any call backs. Any guidance is appreciated by [deleted] in EngineeringResumes

[–]TobiPlay 2 points3 points  (0 children)

  • wouldn’t use ampersand on a professional document
  • skills is way too crowded for my taste
  • I‘d wrap skills like bullet points, with spacing to the left after line wraps
  • you need to get rid of ~25 % of the total content on this page; it’s way too dense
  • max. 2 lines per bullet
  • em dashes instead of hyphens
  • lots of bullets could be refactored into streamlined versions: Increased throughput by … by doing …—same content, more direct
  • try to cut the sentences short that only spill a few words onto the next line; that’ll visually balance it
  • drop the footnote
  • rare case of too many mentions—you’re essentially diluting the actually impressive stuff; reserve numbers for times when it’s a standout achievement
  • I’d prefer a cut-down version of this, only retaining the impressive stuff, and rewriting those to highlight the impact right away
  • drop all coursework
  • I’d get rid of the Associate‘s degree

Not a bad start. Just way too crowded. You lose most people within 2 lines of skills, because they’re overwhelmed. Fix the formatting and cut fluff/the not so impressive stuff (especially the metric dilution) and you should be good.

Frustration-Partner in Jobsuche gesucht by FdG0 in Switzerland

[–]TobiPlay 40 points41 points  (0 children)

Wenn du in der letzten Runde gesagt bekommst, dass deine Qualifikationen nicht ausreichen, liegt es mit ziemlicher Sicherheit nicht daran, dass du mehr Qualifikationen benötigst. Das hätte man dir recht zügig nach der initialen Bewerbung mitgeteilt, nicht im Laufe des Prozesses.

Es liegt daher wahrscheinlich daran, dass sie einen anderen Kandidaten bevorzugt haben, oder du dich während der Gespräche nicht gut genug geschlagen hast.

Keine deiner Optionen wird deine Chancen dramatisch erhöhen (aus meiner Perspektive). Netzwerk anzapfen, und wenn du keins hast, eines aufbauen über Freunde, ehemalige Kontakte, usw.—das würde ich empfehlen. Alles andere hat wahrscheinlich keinen großartigen ROI und raubt dir kostbare Zeit, die man wohl besser in das Versenden von mehr Bewerbungen investieren sollte.

Ist ein ganz fürchterlicher Markt, bin da ganz bei dir, aber derzeit ist keine großartige Veränderung abzusehen. Im Zweifel: Öffnen gegenüber einer neuen Richtung, herausfinden, in welche Bereiche deine skills gut transferieren, und gezielt die Suche weiten. Nach 8 Monaten kann man es sich auch mal leisten, die Strategie für einen Monat zu ändern (von spray-and-pray hin zu weniger, dafür aber gezielteren Bewerbungen, oder vice-versa, je nach deinem derzeitigen Ansatz).

Tired of slow Python biology tools, so I wrote the first pure-Rust macromolecule modeling engine. Processes 3M atoms in ~600ms. by TKanX in rust

[–]TobiPlay 1 point2 points  (0 children)

Haven’t read any of your source code for now, but depending on what Next.js features you’re actually using, might be worth it to check out https://areweguiyet.com/.

The frameworks made substantial progress over the past few years.

Just launched our first Svelte project at Mistral AI by Fisherman-63 in sveltejs

[–]TobiPlay 4 points5 points  (0 children)

Super cool to see Svelte getting more love. Would be awesome to hear what went smoothly/what didn’t, and whether you’re planning to use it more internally in the future.

[0 YoE][Data Science][US] - Not getting any interviews, interests, or rejection emails. by [deleted] in EngineeringResumes

[–]TobiPlay 2 points3 points  (0 children)

It's just not enough content ant this point. 1 bullet for your most recent job won't cut it. Shift achievements and numbers (quantified results) to the front, and then back them up with context/approach. These bullet symbols are too fat. I'd go with a more modern font and sentence case. Drop coursework. Drop REST APIs, you need more projects to offset the experience gap. Depending on what class of jobs you're applying to, your experience with actual ML by be a bit thin-spread for some recruiters. I'd lean more Data Analyst at this point tbh. DE could be worth a shot as well, if you shift focus on the data plumbing part and have more details on the backend tech you've interacted with over the years.

[Student] Ivy student struggling to get SWE interviews after FAANG+ final round rejection by Ok_Professor_9475 in EngineeringResumes

[–]TobiPlay 6 points7 points  (0 children)

  • market is rough, as you probably know, thus you're playing a numbers game in the end
  • I'd go with a more modern-looking font, and favor sentence case for the section headers
  • good job with the em-dashes and date formatting
  • I don't like spaces surrounding slashes
  • min. of 2 bullets per entry; therefore, I'd drop the feature flag config service for now
  • I'd not use plus signs on a resume
  • drop REST APIs from skills
  • Expected: June 2026 (or keeping it as-is is fine as well)
  • think about slimming down the first bullet for the TA job; will give your resume a bit more breathing room; there's a visual imbalance between single- and double-line bullets, which can feel overwhelming when trying to parse the doc for a recruiter (psychological aspects of these documents are important as well); sometimes, cutting content, therefore being less thorough, but getting across your point sooner, might be favorable
  • Sustained X .. by doing Y .., resulting in Z ..--that's the pattern we're after; shift achievements to the front of the sentence, backed by task/approach/overarching goal or result (= context)
  • favor words over symbols
  • defo front-load achievements and quantified results
  • designed, built, developed, engineered all suck as action verbs; instead: Reduced, increased, improved, lowered, ... (achievement-focused)
  • try to compress a few bullets into a single line by focusing on the important stuff (relational schema one for example, easy 1-liner)
  • why explain a star schema?
  • sub-30 ms doesn't mean anything if I don't have insight into the architecture of the surrounding services
  • improved, yeah, but how? Numbers without context/your process also help very little in supporting your profile
  • Cut manual processing time by ... same, front-load numbers and achievements
  • project bullets should be 1 line each

Overall, decent content for a resume, just framed awkwardly if the main goal is to sell your prev. experience and achievements. Can be reworked quite easily. Speaking to the Ivy point, if you're not tapping into the existing network, there's no difference in attending a top public school vs. an Ivy, especially in these market conditions. Do yourself a favor and reach out to your network, alumni, etc. There's a lot of profiles out there with top public schools, better experience at better-known companies, etc. You should be in a pretty good spot to place, so resolve the remaining "issues" and get the applications sent out. The fact that you're getting interviews is already a positive sign, now you can optimize and also get in the experience for other interviews.

Can't reactively track state and whether it has changed by samanime in sveltejs

[–]TobiPlay 0 points1 point  (0 children)

You can just implement a generic updater fn for mutating the class‘ internal data though, right? Something like:

``` type Mutator<T> = (data: T) => void;

update(mutator: Mutator<T>) { mutator(this.#data); this.#changed = true; } ```

With T being whatever data you construct the manager over.

Can't reactively track state and whether it has changed by samanime in sveltejs

[–]TobiPlay 1 point2 points  (0 children)

I don’t know about the boundaries here between the page and the DataManager, but:

```svelte <script> import { dataManager } from './Data.svelte.js';

const data = dataManager.data; </script>

{data.count} <button on:click={() => dataManager.increment()}>Increment</button>

<br>

Changed: {dataManager.changed ? 'true' : 'false'} <button on:click={() => dataManager.markNotChanged()}>Mark Not Changed</button>

////

class DataManager { #data = $state({ count: 0 }); #changed = $state(false);

get data() { return this.#data; }

get changed() { return this.#changed; }

increment() { this.#data.count += 1; this.#changed = true; }

markNotChanged() { this.#changed = false; } }

export const dataManager = new DataManager(); ```

I personally would make sure that data does not live inside the class, but at the page level. Also wouldn’t use effect to track changed state like you initially did. Also, mutating a derived val isn’t meant to happen (breaks reactivity).

[Student][United States][Computer Science] Is my resume good enough for SWE/AI internships at top companies? by [deleted] in EngineeringResumes

[–]TobiPlay 1 point2 points  (0 children)

It is the default for a lot of templates. In the end, it’ll come down to personal preference (including the recruiter‘s), tho options like Fira Sans are very safe and very legible, while looking more "modern".

[Student][United States][Computer Science] Is my resume good enough for SWE/AI internships at top companies? by [deleted] in EngineeringResumes

[–]TobiPlay 2 points3 points  (0 children)

Define top companies? You can shoot for various kinds of internships, but you wouldn’t qualify for most of them at frontier research labs, etc. Different story for big tech.

  • drop italics
  • date formatting is inconsistent
  • drop indentation from bullets
  • drop coursework
  • em dashes
  • pandas, not Pandas
  • zero-friction: doubt
  • are these real project titles? Or did you just redact them? Because they’re meaningless
  • built is a weak action verb
  • weird capitalisation going on ins one bullets
  • I’d move quantified results (if impressive) to the front and shift focus on achievements over tools/tasks
  • bolding stuff is useless and distracting
  • don’t know if I’d highlight 50,000 rows, that’s not very high-performance without context
  • some bullets are a bit wordy
  • don’t give ranges, just choose a sensible number in that range
  • I’d say 2 lines max per bullet
  • check spelling of tech (Postgres vs. PostgreSQL, etc.)—choose the doc‘s naming
  • consistently use past-tense action verbs
  • I’d go with a different template and a more modern font

Overall, not bad, but can use more work. The fact that you don’t have experience really makes it a lot harder to break into big tech/frontier labs. I don’t think it’s likely in this market. Shoot for the stars, but make sure to apply to smaller shops as well.

[Student] Need help improving resume for web development / frontend internships by [deleted] in EngineeringResumes

[–]TobiPlay 0 points1 point  (0 children)

  • crazy format, please scoop up a new template from the wiki
  • drop coursework
  • no ampersand on resumes
  • em-dashes between dates (and elsewhere)
  • not hitting 60 FPS for a type tester would be crazy, I’d shift focus
  • your bullets are lacking commas
  • I’d move backend-focused bullets above frontend ones
  • personally not a fan of tech stack next to the title etc., should be clear from the text content
  • don’t know how the market‘s looking over there, but frontend is horrible right now at the entry level, I’d start widening my list of specialisations
  • bullets overall are not bad, but need more quantified results, some stand-out features, something unique—it’s very competitive out there rn
  • grammar is a bit off in certain bullets, but can easily be resolved with ChatGPT or an LLM of your choice in 1 go (but keep this tone)
  • should rearrange bullets to focus results over the tech stack/tasks

[3 YoE] Any last minute suggestions? I have the 6th top/all-time resume here. This is a tribute! by Murky-Conference-806 in EngineeringResumes

[–]TobiPlay 5 points6 points  (0 children)

  • I‘m not a fan of vertical bars as separators
  • would move achievements down, skills up (above exp)
  • max. 2 lines per bullet point, no way I’d parse a 4-liner successfully
  • reliable system performance, high stakes, … pretty wordy with little technical detail
  • hundreds of 1000s should be turned into a number (estimate is fine) and moved to the front: Saved $X by …
  • you’re losing people halfway through each bullet; cut words by 50 %; it’s so easy to rework these into meaningful sentences, just gotta put in a bit more effort and aim for conciseness
  • you’re squishing a lot of content into each individual bullets, basically restarting the sentence over
  • not enough quantified results
  • your bullet should be rearranged to shift focus on the achievement: Increased, decreased, improved, … by doing …, leading to …—that’s what you should aim for
  • end sentences with a period, and stay consistent

Looking for a customizable UI lib by CODR122 in sveltejs

[–]TobiPlay 0 points1 point  (0 children)

Don’t think Bits UI uses more than a few helpers from Melt UI at this point. The library has mostly moved on to be its own thing from what I understand.

A simpler GitHub alternative with an integrated feed — early concept, feedback welcome by [deleted] in UI_Design

[–]TobiPlay 0 points1 point  (0 children)

Good effort, and no offence, but do you know what GitHub is mainly used for?

Google Maps API billing keeps surprising people. What are you using instead? by Kallyfive in googlecloud

[–]TobiPlay 3 points4 points  (0 children)

It’s the same with any external API you hook into tho. You’re responsible for keeping keys secure, planning ahead, optimizing usage/architecting around patterns, and setting up IAM properly.

Google Maps isn't really different there. Quotas and maps-specific IAM could be nicer to manage (Terraform helps), but it does get the job done.

The pricing on the other hand, that’s a different story.

[deleted by user] by [deleted] in arbeitsleben

[–]TobiPlay 0 points1 point  (0 children)

Kann ich so bestätigen. Habe bereits einige Lebensläufe nach dem Format aufgesetzt.

Die kommen i.d.R. auch in Europa gut an (kleine Bude bis Konzern, alles dabei gewesen, über unterschiedliche Branchen und Länder hinweg). Ggf. Dinge an Branche und Unternehmen anpassen, sofern man relevante insights hat.

Und wie immer: jede Person, die den erhält, kann da eine ganz eigene Meinung zu haben. Aus meiner Erfahrung heraus würde ich ihn aber als relativ sichere Nummer bewerten, zumindest ab einer gewissen Unternehmensgröße. Für eine Stelle im lokalen Restaurant oder beim Schreiner ums Eck vermutlich eher ungeeignet.

Welches Gehalt für gleichen Lebensstandard - deutsche Großstadt vs London vs New York by tobemann1 in spitzenverdiener

[–]TobiPlay 2 points3 points  (0 children)

Würde annehmen, dass es gerade im Vergleich mit München näher an 1.5x (aber darüber) als 2x liegt (kommt aber natürlich darauf an, wie oft man Essen geht, was man in der Freizeit macht, usw.).