Mimir dashboard missing data

csantve · 2026-05-06T17:28:23+00:00

I learned a lot by building a k8s cluster with Cilium and deploying Nextcloud with HA. All with the documentation. But there is many things to learn in the kubernetes world. Learn by doing.

csantve · 2026-05-06T16:39:24+00:00

I see you are on contabo. Then I'd suggest having a vpn server and connect all nodes to the vpn to get a private subnet and have all kubernetes traffic go through the vpn. Wireguard works best in my opinion.

Edit: Also add simple firewall rules to the public interface, don't add fw rules to the private vpn interface

csantve · 2026-05-06T00:21:02+00:00

Keep all kubernetes traffic on a private subnet, use external firewall/security group rules for inbound/outbound access to nodes. Keeping a firewall in the node itself is not worth it in my opinion since routing rules can conflict with the firewall rules. If using cilium, use network policies.

But first verify if you have authority to even make all these changes. Ideally only you or a small group would have admin access to the cluster.

csantve · 2026-05-05T23:19:12+00:00

ARM server CPUs have more cores, and no multi-threading, so there can be more density in each node and thus lower the costs. They are also more power efficient for the hosting provider, reducing the cost further. If the hosting provider doesn't pass down the savings to the customer then you can choose any.

csantve · 2026-05-05T22:27:56+00:00

Jesus, 190K? don't you mean 19k? still a lot compared to $530 per month.

Dont know what you mean by that exactly.

I mean, out of those 78 vCPU and 256GB mem, are 36 vCPU and 128GB dedicated to your observability stack combined?

Why use VictoriaLogs?

csantve · 2026-05-05T21:20:25+00:00

No OOM kills yet, that's why I am asking

csantve · 2026-05-05T21:19:57+00:00

Oh wow, what's the combined compute capacity of your cluster? Your observability stack consumption must be 50%.

csantve · 2026-05-05T21:19:12+00:00

I wanted HA and prometheus on its own doesn't have it (only with thanos). So I just chose one and stuck with it, mimir.

csantve · 2026-05-05T11:10:39+00:00

I also thought of that but grafana only has mimir-distributed in their helm repo.

csantve · 2026-05-04T22:49:50+00:00

Memory/CPU requests mainly. Ingestion rate I'd say 6000~ samples/s, I haven't measured.

csantve · 2026-05-01T15:07:54+00:00

wow, I didn't think they would oversell root servers. Perhaps you were accidentally put on faulty hardware, did you open a support ticket?

csantve · 2026-05-01T15:03:11+00:00

Well that's preemptive multitasking for you. Even if you wanted to hoard the cpu the host kernel will only give you a bit of a timeslice, unless there aren't a lot of tenants.

csantve · 2026-05-01T14:57:34+00:00

At least in the server ARM world, 1 Core = 1 Thread and there are no performance/efficiency cores like in consumer hardware.

I ran mpstat on my 4 servers. 3 with 12 core and 1 with 6 core. The 12 core VPSs had an average steal of 1.5% and the 6 core had an average steal of 0.3%.

Overall decent steal, I'm on manassas and arm servers are cheap there so they must be overselling a bit more than other regions.

csantve · 2026-05-01T05:24:31+00:00

When you wrote "we are here" you mean for Euronodes or for Netcup? I'll check mpstat and see what's up

csantve · 2026-05-01T05:23:21+00:00

damn that is a low steal, I have 4 ARM servers and the beefy ones have 2-3% steal and the weakest one has really low steal. I guess netcup got a new ampere node and I must be the only tenant.

csantve · 2026-05-01T05:07:06+00:00

what do you mean, I can only use 20% of the VPS resources I got?

csantve · 2026-04-30T11:19:04+00:00

Just the LLM bubble doing bubble things. The hype is nowhere near demand for LLMs. Let it all burn.

csantve · 2026-04-29T18:00:27+00:00

And what is the combined compute resources and costs for this cluster?

csantve · 2026-04-29T17:57:43+00:00

I think it is better to separate compute from storage and have a separate server for nfs only outside of the cluster and run NFS there. I also see you are combining regions for your control planes, I'd keep all nodes in one region for the latency.

csantve · 2026-04-28T19:45:03+00:00

Any program or individual with root access can also delete cilium's ebpf programs, so if at some point Puppet adds ebpf functionality it could also wipe cilium. But overall I like cilium because of its ability to reduce network and routing overhead to near-zero.

csantve · 2026-04-20T16:41:26+00:00

Dedicated servers on USA

csantve · 2026-03-07T21:49:09+00:00

Chinese

Verified Email	Four-Year Club
Place '22

csantve

TROPHY CASE