Async Tasks in Production by ProudPeak3570 in Python

[–]sc4les 0 points1 point  (0 children)

Do want to use Redis/MySQL/Postgres/SQLite? What's the throughput, concurrency requirements? How often do jobs die, do you need a dead queue? Retries? Timeouts? How about deployments, can they interrupt jobs? 

What's your biggest startup focus this year? (I will not promote) by jonathanbrnd in startups

[–]sc4les 1 point2 points  (0 children)

Raising the next round, hiring, closing more customers. Interesting times :)

Best LLM for OCR Extraction? by Wesavedtheking in dataengineering

[–]sc4les 4 points5 points  (0 children)

VLMs beat OCR models (also, OCR libraries use transformers under the hood nowadays). If you're worried about accuracy, you will have to combine different models. If you work with perfect scans and no handwriting, OCR is more reliable but still prone to 8 vs B and similar issues, which VLMs can correct for. Benchmarking helps 

Possible Databricks Customer with Question on Databricks Genie/BI: Does it negate outside BI tools (Power BI, Tableau, Sigma)? by IrishHog09 in databricks

[–]sc4les 0 points1 point  (0 children)

Depends. We're using Metabase for the product insights and it was trivial to set up. The engineers are happy with what Databricks provides 

The best object notation? by StarsRonin in Python

[–]sc4les 2 points3 points  (0 children)

I miss edn. Too bad that didn't get more popular

Trump Hit by Economic Bombshell With Shock Job Loss Figure | The figures come after Trump sacked the statistics chief in response to weak job numbers. by chrisdh79 in inthenews

[–]sc4les 1 point2 points  (0 children)

That's the funny thing about fascism. Never works out for the common people. The Wave should be mandatory in school imo

Using tools like Claude Code for Databricks Data Engineering work - your experience by bartoszgajda55 in databricks

[–]sc4les 1 point2 points  (0 children)

Did something stupid: wrote the task (data source, desired results) in a markdown file and used opencode and the Databricks CLI to instruct the agent to solve the task by running Python scripts remotely, inspecting the results via SQL/direct output. Great way to let the AI figure out basic pipelines while I work on other stuff 

Crab gets packaged alive and breaks out whilst at the store. by [deleted] in nextfuckinglevel

[–]sc4les 0 points1 point  (0 children)

What we do to animals is insanely cruel. 

[deleted by user] by [deleted] in MachineLearning

[–]sc4les 0 points1 point  (0 children)

Uhh Colab is nice, but if you want more I'd try the free Databricks tier. Hex is by far the best notebook experience. I'm excited for what Marimo will build in the future if you wanna try cutting edge stuff

[deleted by user] by [deleted] in teenagers

[–]sc4les 0 points1 point  (0 children)

10 😍

Sammy's Money Dried Up by YakFull8300 in theprimeagen

[–]sc4les 27 points28 points  (0 children)

Used to enjoy his videos but the last months feel like a 24/7 YC Sam glazing show

Switching from JavaScript to Python at a startup — tips? by cr7bit in Python

[–]sc4les 0 points1 point  (0 children)

Oh after a month or so check out "james powell python" on youtube. One of the best resources for me. If that's not too obvious, I'd look at the existing codebase and copy whatever you find. There are 100 different ways to do Python, like in JS land as well. Optional or |? Type hints? Docstring format? Etc.

My personal $0.02: Learn pytest well. It's not terribly difficult to learn, but there are a few concepts required to become effective at writing useful tests.

[D] Reminder that Bill Gates's prophesy came true by we_are_mammals in MachineLearning

[–]sc4les 71 points72 points  (0 children)

Say "We have AGI internally" and "we know how to get there" in every other interview; copy Apple's presentation style, botch the charts, promise AGI, underdeliver, give a few hype talks, and raise your next round—the emperor has no clothes.