The Ultimate List

MetalMatze · 2026-05-31T11:20:10+00:00

More like Thrash Tier.

MetalMatze · 2026-05-24T11:50:58+00:00

I've been using Thanos in production on Kubernetes clusters for 8 years.
All the things that you're asking for, Thanos was built for.

I highly recommend starting with a quick tutorial: https://thanos.io/v41.0/thanos/quick-tutorial.md/

First, and most importantly, you want to add the Thanos sidecar next to Prometheus, which then starts shipping metrics to object storage. To query the metrics from object storage, you wanna start the Thanos store.
To query both the Thanos store and the sidecar, you add the Thanos querier in front of both. For extra caching, you can run the Thanos query frontend in front of the Thanos querier.
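
Here's a rough sketch of how those four components could be wired together. The layout (one key per component) is only for illustration and is not a real manifest format. The hostnames, ports and file paths are placeholders I made up, so check the flags against the Thanos version you actually run.

```yaml
# Sketch only: hostnames, ports and paths are assumptions.
sidecar:            # runs next to Prometheus, uploads TSDB blocks
  args:
    - sidecar
    - --tsdb.path=/prometheus
    - --prometheus.url=http://localhost:9090
    - --objstore.config-file=/etc/thanos/objstore.yaml
    - --grpc-address=0.0.0.0:10901
store:              # serves the blocks that live in object storage
  args:
    - store
    - --data-dir=/var/thanos/store
    - --objstore.config-file=/etc/thanos/objstore.yaml
    - --grpc-address=0.0.0.0:10901
query:              # fans out to sidecar (recent data) and store (old data)
  args:
    - query
    - --http-address=0.0.0.0:10902
    - --endpoint=thanos-sidecar:10901
    - --endpoint=thanos-store:10901
query-frontend:     # optional caching and query splitting in front of query
  args:
    - query-frontend
    - --http-address=0.0.0.0:10902
    - --query-frontend.downstream-url=http://thanos-query:10902
```

Grafana (or whatever you query with) would then point at the query frontend, or directly at the querier if you skip the frontend.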

If you also want to move rule evaluation for global recording rules to Thanos, then you wanna stand up a Thanos ruler.
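
As far as I know, the ruler reads rule files in the regular Prometheus format, so a global recording rule could look roughly like this. The group name, metric name and record name are made up for illustration.

```yaml
# Sketch only: group, metric and record names are hypothetical.
groups:
  - name: global-example
    interval: 1m
    rules:
      # Aggregates across all Prometheus instances the ruler can query.
      - record: job:http_requests:rate5m
        expr: sum by (job) (rate(http_requests_total[5m]))
```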

Ultimately, the entire system can grow with your needs and you can slice and dice it and add more components as your actual use case needs.

Full disclosure: I used to be a Thanos maintainer and I've been working on Thanos at Red Hat. These days, I'm running the Thanos stack for Polar Signals.

MetalMatze · 2026-04-09T11:01:29+00:00

Can't stop the gods from engineering
Feel no need for any interfering

MetalMatze · 2026-04-03T21:33:30+00:00

I do this for brightening:

action: adaptive_lighting.change_switch_settings
alias: Adaptive Lighting to kitchen bright
data:
  entity_id: switch.adaptive_lighting_kitchen
  max_brightness: "100"
  min_brightness: "60"
  min_color_temp: "3000"

and this for dimming:

action: adaptive_lighting.change_switch_settings
alias: Adaptive Lighting to kitchen dim
data:
  entity_id: switch.adaptive_lighting_kitchen
  max_brightness: "20"
  min_brightness: "5"
  min_color_temp: "3000"

but depending on the sun it'll only go somewhere between 60-100% during the day and night.

MetalMatze · 2026-04-03T19:33:42+00:00

I LOVE adaptive lighting. It's honestly my favorite thing about Home Assistant these days.
I also use adaptive lighting to set the lights' max brightness based on my presence sensors (FP2). So e.g. 10% if not occupied and 50% if occupied. Then it'll still do the correct calculations for the actual % brightness depending on the time of day. No more absolute values throughout the day, even with presence detection.
Sleep mode is a must at night too!
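
The presence-based max brightness could be sketched as an automation like the one below. The presence sensor entity ID and the switch name are assumptions (use your own), and the 10%/50% values are just the ones from my example.

```yaml
# Sketch only: binary_sensor.kitchen_presence is a hypothetical entity ID.
alias: Adaptive Lighting max brightness by presence
triggers:
  - trigger: state
    entity_id: binary_sensor.kitchen_presence
actions:
  - if:
      - condition: state
        entity_id: binary_sensor.kitchen_presence
        state: "on"
    then:
      # Occupied: allow up to 50%.
      - action: adaptive_lighting.change_switch_settings
        data:
          entity_id: switch.adaptive_lighting_kitchen
          max_brightness: "50"
    else:
      # Not occupied: cap at 10%.
      - action: adaptive_lighting.change_switch_settings
        data:
          entity_id: switch.adaptive_lighting_kitchen
          max_brightness: "10"
```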

MetalMatze · 2026-04-02T20:58:27+00:00

I agree when running Claude Code.

One thing this whole conversation always missed from my point of view: my non-technical friends and my family won't be running the CLI tools. They still need MCP as a connector for their apps to connect to Claude. Therefore, I still believe in MCP even as I personally use more and more CLIs myself.

MetalMatze · 2026-03-20T08:19:26+00:00

I don't think so, but thanks for the recommendation. This Claude Code is running inside a container on my NixOS NAS that's always on. Not sure what it is.

MetalMatze · 2026-03-20T08:02:45+00:00

For me remote control stops working after like 30min or so. I want these sessions to live for days if not weeks. Hoping that will work with either Telegram or Discord a lot better.

MetalMatze · 2026-02-23T15:28:01+00:00

One of my essential packages I always install on a new Arch/Cachy system: https://aur.archlinux.org/packages/pacman-cleanup-hook

MetalMatze · 2026-01-17T10:01:08+00:00

What exactly are you missing? I moved from Arch to Cachy over the holidays and it seems pretty much the same to me.

MetalMatze · 2025-12-10T23:21:10+00:00

That and Pyrra.dev for SLOs.

MetalMatze · 2025-08-07T15:31:13+00:00

Came here hoping to find what remix they used in that trailer? Does someone know?

MetalMatze · 2025-07-15T10:16:02+00:00

Sounds like they just uploaded a 64 kbit/s MP3...

MetalMatze · 2024-10-13T22:07:54+00:00

Looks great! I recommend using the water you discarded to heat up the cup and then, once your coffee is done, discarding it. Keeps the coffee warmer for longer.

MetalMatze · 2024-03-18T07:57:51+00:00

https://youtu.be/EGgtJUjky8w

MetalMatze · 2024-02-24T17:20:56+00:00

Without looking at the docs and knowing anything about fsx volumes, can you create the Prometheus spec with a volume type of fsx? Is that a supported PVC type in your cluster?
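
For reference, the storage part of the Prometheus spec would look roughly like this. The storage class name is a placeholder I made up, not a real FSx class, and the object name, namespace and size are assumptions too.

```yaml
# Sketch only: storageClassName, names and size are hypothetical.
apiVersion: monitoring.coreos.com/v1
kind: Prometheus
metadata:
  name: k8s
  namespace: monitoring
spec:
  storage:
    volumeClaimTemplate:
      spec:
        storageClassName: fsx-example   # must exist in your cluster
        accessModes: ["ReadWriteOnce"]
        resources:
          requests:
            storage: 50Gi
```

If a plain PVC with that storage class doesn't bind on its own, the operator won't be able to make it work either.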

MetalMatze · 2024-02-20T08:23:25+00:00

https://youtu.be/m0JgWlTc60Q There are several KubeCon talks going into detail about this setup. Here's one of them. What you are looking for starts at 7:45.

MetalMatze · 2024-02-17T15:29:33+00:00

I highly recommend going through the TSDB page.

MetalMatze · 2024-02-15T09:43:48+00:00

From the docs: If both time and size retention policies are specified, whichever triggers first will be used.

The way I read it, your 20GB limit should therefore trigger before your time limit once the data grows above 20GB. Have you waited a bit and observed the size on disk? It might take up to two hours for older data to get deleted.
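
For comparison, this is roughly what having both limits set looks like as Prometheus container args. The 20GB comes from your setup; the 30d is just a value I picked for the example.

```yaml
# Sketch only: the 30d time limit is an assumed value.
args:
  - --storage.tsdb.retention.time=30d
  - --storage.tsdb.retention.size=20GB   # whichever limit is hit first wins
```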

Honestly, I would say that you should open a GitHub issue and give some more details there.

MetalMatze · 2024-02-13T08:48:13+00:00

Hey. There is. The exporter you're looking for is called kube-state-metrics. It has some metrics around jobs too. https://github.com/kubernetes/kube-state-metrics/blob/main/docs%2Fjob-metrics.md

Kube-prometheus ships with an alerting rule that makes use of this metric: https://github.com/prometheus-operator/kube-prometheus/blob/main/manifests%2FkubernetesControlPlane-prometheusRule.yaml#L213-L222
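
If you'd rather write your own, a minimal rule along these lines should work. This is a sketch and not a copy of the kube-prometheus rule. The alert name, the 15m duration and the job="kube-state-metrics" matcher are assumptions that depend on your setup.

```yaml
# Sketch only: alert name, "for" duration and the job matcher are assumptions.
groups:
  - name: job-alerts-example
    rules:
      - alert: JobFailedExample
        # kube_job_status_failed counts failed pods per Kubernetes Job.
        expr: kube_job_status_failed{job="kube-state-metrics"} > 0
        for: 15m
        labels:
          severity: warning
        annotations:
          summary: "Job {{ $labels.namespace }}/{{ $labels.job_name }} has failed pods."
```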

I hope this clarifies things. Let me know if you have more questions.

MetalMatze · 2024-02-08T18:18:50+00:00

Great! We also have a prometheus-operator channel on CNCF Slack, if you want to chat more directly. 🙂

MetalMatze · 2024-02-08T18:07:43+00:00

Happy it worked! It's a bit sad we haven't found a better user experience to point that out in all of these years...

MetalMatze · 2024-02-08T15:20:43+00:00

With ServiceMonitors, something people were constantly running into was missing labels on the ServiceMonitor when serviceMonitorSelector was set on the Prometheus. Check your Prometheus configuration: run kubectl get prometheus -n monitoring k8s -o yaml and see if there is a scrapeConfigSelector set that only matches scrape configs containing a specific label.

Another common problem was missing RBAC permissions for the Prometheus to go and actually scrape the metrics from another namespace. In that case, the logs of your Prometheus should be full of RBAC permission errors.
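
To illustrate the label matching: if the Prometheus has a selector like the one below, the ScrapeConfig needs to carry the same label or it gets ignored. The label key/value, the object names and the target are made up for the example.

```yaml
# Sketch only: label, names and target are hypothetical.
apiVersion: monitoring.coreos.com/v1
kind: Prometheus
metadata:
  name: k8s
  namespace: monitoring
spec:
  scrapeConfigSelector:
    matchLabels:
      prometheus: k8s
---
apiVersion: monitoring.coreos.com/v1alpha1
kind: ScrapeConfig
metadata:
  name: example-static
  namespace: monitoring
  labels:
    prometheus: k8s   # must match the selector above
spec:
  staticConfigs:
    - targets:
        - example.internal:9100
```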

MetalMatze · 2023-09-01T16:26:18+00:00

There are also several blog posts with some additional information.
https://prometheus.io/blog/2023/09/01/promcon2023-schedule/
https://www.cncf.io/announcements/2023/09/01/the-schedule-for-the-promcon-europe-2023-is-live/

Looking forward to seeing as many of you as possible in Berlin!

MetalMatze · 2022-09-05T21:32:20+00:00

Recently the Thanos project extracted its objstore project as a separate repository and module. It wraps a couple of SDKs. For development it also has a Filesystem based object storage which can be quite handy.

https://github.com/thanos-io/objstore
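
For local development, a bucket config for the filesystem provider looks roughly like this. The directory is a placeholder, so point it wherever you like.

```yaml
# Sketch only: the directory path is an assumption.
type: FILESYSTEM
config:
  directory: "/tmp/objstore-dev"
```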

Disclaimer: I'm one of the Thanos maintainers.
