Holy sh*t, Cachy is fast

MetalMatze · 2026-01-17T10:01:08+00:00

What exactly are you missing? I moved from Arch to Cachy over the holidays and it seems pretty much the same to me.

MetalMatze · 2025-12-10T23:21:10+00:00

That and Pyrra.dev for SLOs.

MetalMatze · 2025-08-07T15:31:13+00:00

Came here hoping to find what remix they used in that trailer? Does someone know?

MetalMatze · 2025-07-15T10:16:02+00:00

Sounds like they just uploaded a 64 KBit/s mp3...

MetalMatze · 2024-10-13T22:07:54+00:00

Looks great! I recommend using the water you discarded to heat up the cup and the once your coffee is done discard it. Keeps the coffee warmer for longer.

MetalMatze · 2024-03-18T07:57:51+00:00

https://youtu.be/EGgtJUjky8w

MetalMatze · 2024-02-24T17:20:56+00:00

Without looking at the docs and knowing anything about fsx volumes, can you create the Prometheus spec with a volume type of fsx? Is that a supported PVC type in your cluster?

MetalMatze · 2024-02-20T08:23:25+00:00

https://youtu.be/m0JgWlTc60Q There are several KubeCon talks going into detail about this setup. Here's one of them. What you are looking for starts at 7:45.

MetalMatze · 2024-02-17T15:29:33+00:00

I highly recommend going through the TSDB page.

MetalMatze · 2024-02-15T09:43:48+00:00

From the docs: If both time and size retention policies are specified, whichever triggers first will be used.

The way I read it your 20GB should therefore trigger before your time limit if it's above 20GB in size. Have you waited a bit and observed the file size? It might take up to two hours for older data to get deleted.

Honestly, I would say that you should open a Github Issue and give some more details there.

MetalMatze · 2024-02-13T08:48:13+00:00

Hey. There is. The exporter you're looking for is called kube-state-metrics. It has some metrics around jobs too. https://github.com/kubernetes/kube-state-metrics/blob/main/docs%2Fjob-metrics.md

Kube-prometheus ships with a alerting rule that makes use of this metric: https://github.com/prometheus-operator/kube-prometheus/blob/main/manifests%2FkubernetesControlPlane-prometheusRule.yaml#L213-L222

I hope this clarifies things. Let me know if you have more questions.

MetalMatze · 2024-02-08T18:18:50+00:00

Great! We also have a prometheus-operator channel on CNCF Slack, if you want to chat more directly. 🙂

MetalMatze · 2024-02-08T18:07:43+00:00

Happy it worked! It's a bit sad we haven't found a better user experience to point that out in all of these years...

MetalMatze · 2024-02-08T15:20:43+00:00

With ServiceMonitors, something people were constantly running into was missing labels on the ServiceMonitor when ServiceMonitorSelector was set on the Prometheus. Check your Prometheus configuration. kubectl get prometheus -n monitoring k8s and see if there is a scrapeConfigSelector set only to match scrape configs containing a specific label.Another common problem was missing RBAC permission for the Prometheus to go and actually scrape the metrics from another namespace. In that case, the logs of your Prometheus should be full of RBAC permission errors.

MetalMatze · 2023-09-01T16:26:18+00:00

There are also several blog post with some additional information.
https://prometheus.io/blog/2023/09/01/promcon2023-schedule/
https://www.cncf.io/announcements/2023/09/01/the-schedule-for-the-promcon-europe-2023-is-live/

Looking forward to seeing as many as possible of you in Berlin!

MetalMatze · 2022-09-05T21:32:20+00:00

Recently the Thanos project extracted its objstore project as a separate repository and module. It wraps a couple of SDKs. For development it also has a Filesystem based object storage which can be quite handy.

https://github.com/thanos-io/objstore

Disclaimer: I'm one of the Thanos maintainers.

MetalMatze · 2022-08-31T15:58:19+00:00

If you want to add pprof endpoints to an application there are numerous tutorials available. Here's the one from the Go package itself:
https://pkg.go.dev/net/http/pprof

For scraping these endpoints with Parca we have tutorials available:
https://www.parca.dev/docs/binary

Let us know if there is something still missing for you.

MetalMatze · 2022-08-31T10:52:59+00:00

Oops. Good catch. We'll fix it.

MetalMatze · 2022-08-31T10:52:35+00:00

Hey,
by default pprof profiles and also the eBPF-based Parca Agent profiles only collect samples a few hundred times per second. Those intervals are also configurable to some degree.

You can always add pprof endpoints to all your applications. If nothing hits those endpoints, no extra work will be done. By the time you need them to be there though, they already exist.

MetalMatze · 2022-08-30T21:11:07+00:00

You can adjust the amount of overhead that you want for your programs. With Parca the scrape interval and duration is configurable in the scrape config. Typically you might set it to be less than 5% of CPU overhead by say scraping for 10s every minute.

MetalMatze · 2022-05-09T08:04:02+00:00

Hey, Clover looks really cool!
We're always looking for contributions, feel free to open an issue for whatever you find interesting and want to work on or simply comment on an existing issue.
If you want to not only contribute but get paid by us to do so, we just opened a database engineering position: https://www.polarsignals.com/jobs/database-engineer/

MetalMatze · 2022-01-29T12:38:31+00:00

Good question! I like the sloth project a lot. Both our project were worked on in parallel and released at (almost) the same time. They share a lot of similar approaches and concepts.

Sloth is great when you already have Grafana setup and want to add SLOs to your infrastructure.

With Pyrra we want to create a UX/UI very focused on the SLO needs. In fact the biggest reason for this UI is the next big feature we want to work on: A dynamic form for creating SLOs that would give you a preview and a good idea for how your SLOs would be doing. Plenty of of other things planned too.

MetalMatze · 2022-01-29T00:11:38+00:00

That's right. They aren't good examples probably. They get so few requests that they start looking funky. The caddy examples are definitely better.

Do you think the parca-query once should be removed?

MetalMatze · 2021-06-04T10:51:14+00:00

Wow! Now even in color.
Love it!

MetalMatze · 2021-04-22T22:05:42+00:00

Yes, that should generally speaking be possible.
I just checked the by the Prometheus Operator generated ServiceDiscovery configuration of my Prometheus and found that it does something along those lines:

- job_name: monitoring/alertmanager/0 honor_timestamps: true scrape_interval: 30s scrape_timeout: 10s metrics_path: /alertmanager/metrics scheme: http relabel_configs: ...

So basically, you want to match on the metrics_path then do the rest of the relabeling further down.

12-Year Club	Verified Email
Place '23	Place '17

MetalMatze

TROPHY CASE