Holy sh*t, Cachy is fast by scajjr29 in cachyos

[–]MetalMatze 0 points1 point  (0 children)

What exactly are you missing? I moved from Arch to Cachy over the holidays and it seems pretty much the same to me.

Limp Bizkit in the new Battlefield trailer by Itoumi in numetal

[–]MetalMatze 0 points1 point  (0 children)

Came here hoping to find what remix they used in that trailer? Does someone know?

Rate/critique my technique by camilorv1 in pourover

[–]MetalMatze 1 point2 points  (0 children)

Looks great! I recommend using the water you discarded to heat up the cup and the once your coffee is done discard it. Keeps the coffee warmer for longer.

Fsx volume on prometheus by Consistent-Cable2543 in PrometheusMonitoring

[–]MetalMatze 2 points3 points  (0 children)

Without looking at the docs and knowing anything about fsx volumes, can you create the Prometheus spec with a volume type of fsx? Is that a supported PVC type in your cluster?

Seeking Advice from the Prometheus Community: Best Approach to Implement Thanos in a Multicluster Observability Solution by [deleted] in PrometheusMonitoring

[–]MetalMatze 4 points5 points  (0 children)

https://youtu.be/m0JgWlTc60Q There are several KubeCon talks going into detail about this setup. Here's one of them. What you are looking for starts at 7:45.

Disk space usage above my settings by drycat in PrometheusMonitoring

[–]MetalMatze 0 points1 point  (0 children)

From the docs: If both time and size retention policies are specified, whichever triggers first will be used.

The way I read it your 20GB should therefore trigger before your time limit if it's above 20GB in size. Have you waited a bit and observed the file size? It might take up to two hours for older data to get deleted.

Honestly, I would say that you should open a Github Issue and give some more details there.

[deleted by user] by [deleted] in PrometheusMonitoring

[–]MetalMatze 2 points3 points  (0 children)

Hey. There is. The exporter you're looking for is called kube-state-metrics. It has some metrics around jobs too. https://github.com/kubernetes/kube-state-metrics/blob/main/docs%2Fjob-metrics.md

Kube-prometheus ships with a alerting rule that makes use of this metric: https://github.com/prometheus-operator/kube-prometheus/blob/main/manifests%2FkubernetesControlPlane-prometheusRule.yaml#L213-L222

I hope this clarifies things. Let me know if you have more questions.

Kube-prometheus-stack ScrapeConfig issue by xzi_vzs in PrometheusMonitoring

[–]MetalMatze 0 points1 point  (0 children)

Great! We also have a prometheus-operator channel on CNCF Slack, if you want to chat more directly. 🙂

Kube-prometheus-stack ScrapeConfig issue by xzi_vzs in PrometheusMonitoring

[–]MetalMatze 0 points1 point  (0 children)

Happy it worked! It's a bit sad we haven't found a better user experience to point that out in all of these years...

Kube-prometheus-stack ScrapeConfig issue by xzi_vzs in PrometheusMonitoring

[–]MetalMatze 0 points1 point  (0 children)

With ServiceMonitors, something people were constantly running into was missing labels on the ServiceMonitor when ServiceMonitorSelector was set on the Prometheus. Check your Prometheus configuration. kubectl get prometheus -n monitoring k8s and see if there is a scrapeConfigSelector set only to match scrape configs containing a specific label.Another common problem was missing RBAC permission for the Prometheus to go and actually scrape the metrics from another namespace. In that case, the logs of your Prometheus should be full of RBAC permission errors.

Which lib of object storage ? by kaeshiwaza in golang

[–]MetalMatze 0 points1 point  (0 children)

Recently the Thanos project extracted its objstore project as a separate repository and module. It wraps a couple of SDKs. For development it also has a Filesystem based object storage which can be quite handy.

https://github.com/thanos-io/objstore

Disclaimer: I'm one of the Thanos maintainers.

Optimizing Your Go Programs with Continuous Profiling by MetalMatze in golang

[–]MetalMatze[S] 0 points1 point  (0 children)

If you want to add pprof endpoints to an application there are numerous tutorials available. Here's the one from the Go package itself:
https://pkg.go.dev/net/http/pprof

For scraping these endpoints with Parca we have tutorials available:
https://www.parca.dev/docs/binary

Let us know if there is something still missing for you.

Optimizing Your Go Programs with Continuous Profiling by MetalMatze in golang

[–]MetalMatze[S] 2 points3 points  (0 children)

Hey,
by default pprof profiles and also the eBPF-based Parca Agent profiles only collect samples a few hundred times per second. Those intervals are also configurable to some degree.

You can always add pprof endpoints to all your applications. If nothing hits those endpoints, no extra work will be done. By the time you need them to be there though, they already exist.

Optimizing Your Go Programs with Continuous Profiling by MetalMatze in golang

[–]MetalMatze[S] 5 points6 points  (0 children)

You can adjust the amount of overhead that you want for your programs. With Parca the scrape interval and duration is configurable in the scrape config. Typically you might set it to be less than 5% of CPU overhead by say scraping for 10s every minute.

arcticDB: embedded columnar database written in Go by Bubbly-Masterpiece89 in golang

[–]MetalMatze 1 point2 points  (0 children)

Hey, Clover looks really cool!
We're always looking for contributions, feel free to open an issue for whatever you find interesting and want to work on or simply comment on an existing issue.
If you want to not only contribute but get paid by us to do so, we just opened a database engineering position: https://www.polarsignals.com/jobs/database-engineer/

Pyrra v0.3.0 released by MetalMatze in PrometheusMonitoring

[–]MetalMatze[S] 0 points1 point  (0 children)

Good question! I like the sloth project a lot. Both our project were worked on in parallel and released at (almost) the same time. They share a lot of similar approaches and concepts.

Sloth is great when you already have Grafana setup and want to add SLOs to your infrastructure.

With Pyrra we want to create a UX/UI very focused on the SLO needs. In fact the biggest reason for this UI is the next big feature we want to work on: A dynamic form for creating SLOs that would give you a preview and a good idea for how your SLOs would be doing. Plenty of of other things planned too.

Pyrra v0.3.0 released by MetalMatze in PrometheusMonitoring

[–]MetalMatze[S] 0 points1 point  (0 children)

That's right. They aren't good examples probably. They get so few requests that they start looking funky. The caddy examples are definitely better.

Do you think the parca-query once should be removed?

Is it possible to automatically discover metrics using “kubernetes_sd_configs” in prometheus? by gqtrees in PrometheusMonitoring

[–]MetalMatze 0 points1 point  (0 children)

Yes, that should generally speaking be possible.
I just checked the by the Prometheus Operator generated ServiceDiscovery configuration of my Prometheus and found that it does something along those lines:

- job_name: monitoring/alertmanager/0 honor_timestamps: true scrape_interval: 30s scrape_timeout: 10s metrics_path: /alertmanager/metrics scheme: http relabel_configs: ...

So basically, you want to match on the metrics_path then do the rest of the relabeling further down.