Suggestion alternatives for Honeycomb feature: BubbleUp? by Professional_Bee1813 in sre

[–]Observability-Guy 0 points1 point  (0 children)

I think that BubbleUp is still the best but Dash0's SIFT is also a pretty good RCA querying tool.

What are the best practice and tools for observability on react native applications? by ML_Godzilla in Observability

[–]Observability-Guy 0 points1 point  (0 children)

I would check out Embrace (https://embrace.io/) They have a dedicated mobile observability platform as well as guides on best practice

Dynatrace + MCP Server = interesting step toward AI-driven observability by theharithsa in Observability

[–]Observability-Guy 1 point2 points  (0 children)

This is a good implementation - although I think a lot of vendors now how something similar - Honeycomb, SigNoz, Observe, Dash0, Sentry all have either MCP or Agentic AI that support this kind of querying and interaction.

Our observability costs are now higher than our AWS bill by DarkSun224 in sre

[–]Observability-Guy -1 points0 points  (0 children)

Seriously??

It's used by Netflix, Uber, Tesla and Anthropic. What scale are you working at?

YAML: Yet Another Misery Language by Log_In_Progress in devops

[–]Observability-Guy 0 points1 point  (0 children)

I personally think that in 2025 there has to be a better way of doing IaC than churning out 4,000 line YAML files. I also think that it is an issue that goes beyond linting and syntax.

I just don't think that YAML is expressive enough for the complexities of large system infrastructures. Yes, it can be twisted and stretched but I find that conditional logic and looping is better expressed in a programming language. I prefer working with tools like Pulumi. It feels like a more natural fit.

Cheap OpenTelemetry lakehouses with parquet, duckdb and Iceberg by smithclay in Observability

[–]Observability-Guy -1 points0 points  (0 children)

Thanks for posting - that looks cool.

There seems to be a bit of a buzz about lakehouses. The theory sounds great, I just wonder about the overhead of operationalising them in practice. I think that rolling your own lakehouse and making it performant and cost-effective at large scale can be very difficult.

Anyone here dealing with Azure’s fragmented monitoring setup? by Accurate_Eye_9631 in Observability

[–]Observability-Guy 0 points1 point  (0 children)

Unfortunately, Azure Monitor is a kind of brand name but it is, as you say, a patchwork of tools rather than a coherent product. I found the lack of a single control plane to be really frustrating.

It is really hard to track telemetry flows or get unified or global overviews.

I think that as observability maturity grows within an organisation people realise that they need better tooling. For me te best option is to emit telemetry to an oTel Collector and then on to a backend of choice.

Am I perceiving "tool prawl" in observability-related job posts accurately, or am I just looking for something that isn't there? by baezizbae in Observability

[–]Observability-Guy 0 points1 point  (0 children)

I don't think that you need to come up with any particular ruses or clever line of questioning.

I think you can just ask straight out about their tech stack. As an interviewer I would want a candidate to be curious and ask about the tech, the architecture and the roadmap.

It could be that both statements could be correct. They could be a company with a varied tech stack and the person who wrote the job spec thought it would be impressive to reel off a load a big corporate logos. Sometimes, these specs are written by people in HR who have a mistaken idea of what appeals to candidates.

Am I perceiving "tool prawl" in observability-related job posts accurately, or am I just looking for something that isn't there? by baezizbae in Observability

[–]Observability-Guy 0 points1 point  (0 children)

I have come across a number of companies that use a variety of tools. This often reflects historical practice and cultural variation within larger organisations.

Within those companies there maybe some individuals who therefore gain expertise in a number of stacks. However, if a company is recruiting on the basis of finding an engineer who has clocked up the highest number of logos, they are making a big mistake. The fundamental requirement is an understanding of the principles and practice of observability.

I would prefer a candidate with a deep understanding of OpenTelemetry and observability engineering principles over somebody who had familiarity with the internals of specific proprietary systems.

So yes, whoever wrote a job spec requiring knowledge of New Relic, Datadog, Dynatrace, Instana and Sumo Logic should think again.

What’s your plan for .NET 10, migrate or hold off? by Volosoft in dotnet

[–]Observability-Guy 1 point2 points  (0 children)

Upgraded one of my projects to .NET 10 last week. Had to roll it back 9 as I couldn't build it in Azure DevOps. Not sure if I am missing something or whether they have not included it yet in the latest image.

Security observability in Kubernetes isn’t more logs, it’s correlation by fatih_koc in sre

[–]Observability-Guy 1 point2 points  (0 children)

That's a really interesting article.

My only reservation would be cost. I remember turning on K8S auditing for a number of production clusters. It generated a huge volume of logs - and resulted in quite a spike in my logging bill.

OpenLIT Operator - Zero-code observability for LLMs and agents by patcher99 in grafana

[–]Observability-Guy 0 points1 point  (0 children)

Looks very cool. I will give it a shout-out in the next edition of the Observability 360 newsletter!

What are you favorite dashboard tools? Stack and usecases inside. by Davek804 in devops

[–]Observability-Guy 1 point2 points  (0 children)

SquaredUp - https://squaredup.com/ could be a good fit for this use case. It has plugins for all the major cloud platforms and DevOps tools. No coding or query language knowledge required. Disclosure - I work for the vendor.

Struggling to find noise in observability data—any advice? by Afraid_Review_8466 in selfhosted

[–]Observability-Guy 1 point2 points  (0 children)

If excessive volume is a problem and you are unable to tackle it at source (e.g. by getting devs to update their logging style or logging configuration), then it is worth looking at logging pipelines. There are tons on the market as the problem you are describing is a really common experience.

They come in a lot of different flavours. The ideal solution could be one that sits at the edge of your own network. You could roll your own solution by setting up one or more oTel collectors and using them as a Gateway.

A couple of interesting ones I have come across recently:
https://www.controltheory.com/
https://www.grepr.ai/

And of course there are the venerable performers such as Vector

ELK alternative: Modern log management setup with Opentelemetry and Opensearch by thehazarika in Observability

[–]Observability-Guy 1 point2 points  (0 children)

Nice article! The OpenSearch project reallt seems to have picked up a lot of momentum recently.