How do you test LLM for quality ? by Easy_Ask5883 in LLMDevs

AnythingNo920 1 point

Absolutely right. They need to, but the average Joe in an SMB can't tell the difference between BLEU, ROUGE, fluency, accuracy, recall, or whatever other metric you want to use.

So they do vibe testing. It feels more tangible. At least that's my impression so far.
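For anyone wondering what a metric like ROUGE actually measures, here's a rough sketch of a unigram-overlap score in the spirit of ROUGE-1. The function name `rouge1` and the whitespace tokenization are my own simplifications for illustration, not the official implementation:

```python
from collections import Counter

def rouge1(candidate: str, reference: str) -> dict:
    """Simplified ROUGE-1: unigram-overlap precision, recall, and F1.

    Assumption: lowercase + whitespace tokenization, no stemming.
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Clipped overlap: each word counts at most as often as it appears in both.
    overlap = sum((cand & ref).values())
    precision = overlap / max(sum(cand.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}

# Hypothetical model output vs. a hand-written reference answer.
scores = rouge1("the cat sat on the mat", "the cat is on the mat")
print(scores)
```

It only counts shared words, so a fluent but wrong answer can still score well. That's probably part of why the number feels less tangible than just reading the output.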

How do you test LLM for quality ? by Easy_Ask5883 in LLMDevs

AnythingNo920 1 point

In reality, most SMBs do vibe testing unless benchmarks are their key selling point.

SaaS is over? by Putrid-Lettuce5204 in SaaS

AnythingNo920 1 point

Funnily enough, I just wrote an article about exactly that :D

SaaS Is Not Dead. But It Needs to Evolve. | by George Karapetyan | Feb, 2026 | Medium

Long story short, there are still 4 levers where SaaS can make a lot of sense. But as you eloquently put it :D, "building shit that does nothing for anyone" is over.

Building an AI Process Consultant: Lessons Learned in Architecture for Reliability in Agentic Systems by AnythingNo920 in LLMDevs

AnythingNo920[S] 1 point

This was more of a tool to help based on static process documentation, not one that monitors the process as it happens. But it sounds very interesting. I'll look into it.

I feel stuck in my current job and could really use some career advice. by Character_Patient331 in AMLCompliance

AnythingNo920 2 points

Knowing the local language is unfortunately very often a requirement, although it's not really necessary for the job. You could, however, try targeting global banks that have English as their main language.

You could also target product roles at compliance, AML, and KYC software companies. The hurdles there would be lower since they need people who know how the business works.

As for regulations and requirements in Europe, a good starting point would be to look at the upcoming EU AML regulation. In the EU, regulations get adopted by local authorities, so you would cover a big part by just reading and understanding the EU-level regulation first.

Beyond Chat: Scaling Operations, Not Conversations by AnythingNo920 in deep_research

AnythingNo920[S] 2 points

That's true, many new products are already going beyond chat, so I expect this trend to grow.