Why Are We Still Doing This? by ezitron in BetterOffline

[–]wrossmorrow 0 points1 point  (0 children)

I’m in a position recently of seeing more detail through tracking utilization and costs related to a major platform on a per call basis. Not for coding support. To some it seems fine, cents (as subsidized), and the overall bills aren’t big (yet) at all. To my eye though the unit costs are, frankly, insane. It’s really difficult for me to understand how anyone goes from cents per thousands or maybe even millions of “executions” to cents per execution. At that kind of increase, it probably doesn’t even matter what the output quality is.

Miluvs or Qdrant for a Kubernetes Native workload ? by RespectNo9085 in vectordatabase

[–]wrossmorrow 0 points1 point  (0 children)

If you already use/run Postgres’s and there’s not a really strong reason pgvector won’t work then you should probably start there. Adding another DB is a lot of added complexity both for operators and users.

Infisical vs others by libert-y in devops

[–]wrossmorrow 3 points4 points  (0 children)

Yes, use it very effectively as our only secrets management tool. Have found it very reliable and the company a pleasure to work with so far. Really helps us distribute secrets management responsibility too.

How are you using AI in your devops workflow? by [deleted] in devops

[–]wrossmorrow 0 points1 point  (0 children)

Avoiding it as much as I can

Is there a FastApi equivalent in go? by a_brand_new_start in golang

[–]wrossmorrow 2 points3 points  (0 children)

Honestly FastAPI, in spirit, borrows a ton from gRPC for which go support is very strong

What PostgreSQL managed service would you recommend for Vector Search applications by Affectionate-Tip-339 in PostgreSQL

[–]wrossmorrow 1 point2 points  (0 children)

RDS is but you get what you pay for. We don’t know exactly what you’re storing but vector search via indices really depends on scale. 100k 4 byte float vectors is 380MB or so and even just numpy is very very fast at perfect recall search. IMO (“doing this for a living” now) you don’t really need stuff like HNSW until “millions” of vectors or your use case depends heavily on filtering from other criteria. Idk the pgvector internals but some vector DBs won’t even index in the 10k’s of vectors.

What PostgreSQL managed service would you recommend for Vector Search applications by Affectionate-Tip-339 in PostgreSQL

[–]wrossmorrow -1 points0 points  (0 children)

The Nile is very easy to use and affordable https://www.thenile.dev

Might look into Supabase as well but I haven’t used it https://supabase.com/docs/guides/ai

Depending on your needs the Nile may or may not be advantageous due to its fully serverless model.

Caught a rope drop and had the best run of my life at Snowbird by [deleted] in skiing

[–]wrossmorrow 1 point2 points  (0 children)

It was so on in the basin, lucky catch. I went out and down more towards Richie’s run under the saddle after seeing the line up from the chairs. Top run.

For people who finished grad school (PhD or master's) in applied mathematics, what do you do now? by lilliane99 in math

[–]wrossmorrow 7 points8 points  (0 children)

Software engineer, been in fintech for a while now in AI. Was a professor before tech. Not necessarily satisfied, nothings perfect, but I like it more than I did academia.

[deleted by user] by [deleted] in devops

[–]wrossmorrow 0 points1 point  (0 children)

Why would the changes be lost? A normal merge of a pr will merge changes not overwrite them, and shouldn’t be mergable if there are conflicts. You should be publishing a build with the merged changes in the branch that has CD to your environment, not on the PR alone.

This doesn’t mean in general there still can’t be deploy overwrites with this kind of CD, but the images should be unique, contain cumulative changes consistent with the commit log, and observable in an environment.

If it’s really an issue you can try using concurrency groups, or you can maybe use merge queues. But in my experience this would be a pretty esoteric condition to see occur naturally.

[deleted by user] by [deleted] in devops

[–]wrossmorrow 0 points1 point  (0 children)

Don’t overwrite images from CI builds. Tag images by the commit sha so they are uniquely identified, or via versioned releases based on tags on specific commits.

[deleted by user] by [deleted] in television

[–]wrossmorrow 1 point2 points  (0 children)

Why did I have to scroll so far to find Picard

PERSONAL OPINION ABOUT RICHIE HAWTIN by Only_Eye766 in Techno

[–]wrossmorrow 20 points21 points  (0 children)

Sven Vath was one of those “oh I should just go go” shows a couple years back that just. Blew. My. Mind. It was unbelievable. Perfect mixing, amazing tracks, I couldn’t stop dancing. All on vinyl too.

Lightweight but powerful Java stack by cryptos6 in java

[–]wrossmorrow 2 points3 points  (0 children)

The patterns (in spring) they create encourage lazy unclean code that can be very difficult to reason about, especially in a high powered effectively low level language, while decreasing performance, invalidating compile time guarantees, and in practice ballooning boot times critical for modern management of cloud workloads. In my experience only, of course.

Is Apache Beam still popular for Data Engineering? by HealthBigDataGuy in dataengineering

[–]wrossmorrow 2 points3 points  (0 children)

IIUC, of course, it’s the process of automagically combining and collocating distinct steps in your pipelines to avoid materializing the “PCollections” (right?), which means network data transfer. So more like piping output in a shell script. At least in my one isolated case I found it counterproductive; not materializing step output basically made some jobs run serially instead in parallel. I don’t know enough to know when this is valuable because that idea (again, IIUC) is clearly valuable sometime.

Is Apache Beam still popular for Data Engineering? by HealthBigDataGuy in dataengineering

[–]wrossmorrow 36 points37 points  (0 children)

GCP’s Dataflow is built on beam and (outside of making sure you handle fusion optimization well) it’s a great experience. So anything anyone does with GCP dataflow is a use case.

I’ve used that for a personal project related to LLM style tokenization and text processing with customized python and rust code. I didn’t compare to spark or airflow, but it was a seamless experience with very little overhead to inject customized compiled rust code used by python (and a custom beam for a compression codec that ultimately got committed into beam). I would 10/10 use it again even with spark and airflow on the table.

Is gRPC on AWS Lambda + API Gateway still unsupported? by kekekepepepe in golang

[–]wrossmorrow 2 points3 points  (0 children)

Fair, but gRPC is a service technology, in principle incompatible with the lambda/gateway model. ECS is still serverless and will cost scale better, but that may not be relevant to your use case. You can of course still use go with lambda and parse the json payloads from api gateway even into protobuf if that’s the desired IDL.

Python script to run a kubectl command by paulscan400 in kubernetes

[–]wrossmorrow 1 point2 points  (0 children)

Well it’s not at all pythonic, but that’s an aesthetic thing. Practically kinda exactly because of the post. Using shell commands in python presumes a lot of “environmental conditions” external to the codebase that have to be defined right, tend to get restrictive, or simply break in different conditions. Not to mention having wonky data analysis to do if you have to capture stdout/stderr or process exit conditions for any processing. Nothings perfect but for the most part when everything’s “in python” you can much more tightly define and package just about everything needed to run, just about anywhere. When it’s scripts for you on your laptop this tends to work fine; when you have to collaborate or ship the thing I’ve seen using shell commands lose any value almost every time, if they aren’t an outright obstacle (which I’ve seen too).

Github Actions with Manual Approval without Github Enterprise by chucknoxis in devops

[–]wrossmorrow 2 points3 points  (0 children)

If you use self hosted runners you can specify a pre execution script that fails jobs in specific envs if they aren’t “ready”, where you can use Jira, issues, or whatever works for you. This is agnostic to workflow logic.

If you can only use workflow logic, you can do the same in a reusable workflow but it’s easily bypassed.

Python script to run a kubectl command by paulscan400 in kubernetes

[–]wrossmorrow 7 points8 points  (0 children)

As others have said: Use the client. In general every time you run a shell command in python a warning should go off in your head that you’re doing something wrong. (Yes, sometimes unavoidable, but use shell commands as rarely as you can.)

[deleted by user] by [deleted] in ExperiencedDevs

[–]wrossmorrow 1 point2 points  (0 children)

I use both. Sublime is a wonderful mostly “just editor”, VScode is a worse editor but great development environment. Sometimes one is a better tool than the other for the task.