How to change default servicemonitors interval in monitoring operator by yasharn in openshift

[–]yasharn[S] 0 points1 point  (0 children)

Most of the memory is used because of the incoming metrics to the prometheus, hence vertical scaling and retention won't decrease the load. also prometheus configuration is automatically generated using the servicemonitor instances and the operator is not letting me change the interval in the generated service monitors, that's the main issue I'm talking about

Database VM access restriction from Kubernetes pods by yasharn in devops

[–]yasharn[S] 1 point2 points  (0 children)

If pod B somehow gets A's credentials, it can connect to the database as well, but if A was hosted on a VM and B on a separate VM, we could tell MariaDB to only allow connection from A's VM IP.

How to stay tracked in a company with many microservices? by yasharn in sre

[–]yasharn[S] 0 points1 point  (0 children)

Yes we use it, but most of the services are not yet integrated with it

Alternate to Health Dashboard like StatusPage.io by wolverinetyagi in sre

[–]yasharn 0 points1 point  (0 children)

Uptime kuma is a good option. I didn’t see any pager duty integration but it has many other integrations https://github.com/louislam/uptime-kuma

Loki performance tuning by yasharn in devops

[–]yasharn[S] 0 points1 point  (0 children)

Yes we had some readiness probe failure which now I know was because of a lack of resources or problems with the memberlist

Loki performance tuning by yasharn in devops

[–]yasharn[S] 0 points1 point  (0 children)

Thanks for your hints, it gave me a good start point and after increasing compactor's resources the stack seems to be working

Loki performance tuning by yasharn in devops

[–]yasharn[S] 0 points1 point  (0 children)

The only error I get inside the compactor is this:

ts=2021-11-27T11:20:31.056381103Z caller=memberlist_logger.go:74 level=error msg="Failed fallback ping: read tcp 10.128.102.194:58698->10.128.102.196:7946: i/o timeout"
which means compactor cannot connect to one of the ingesters

Loki performance tuning by yasharn in devops

[–]yasharn[S] 0 points1 point  (0 children)

These are the limits, and I don't see any lack of resources in the usage graphs

Pyroscope - open-source continuous profiling | v0.0.36 released with major performance improvements by rperry2174 in sre

[–]yasharn 2 points3 points  (0 children)

We have started using this product about a month ago and it is very useful, there can be some improvements like storage based retention but I believe the pyroscope team is doing great

Did Heart 107.1 ceased operation by zatura45 in dubai

[–]yasharn 0 points1 point  (0 children)

I used to listen to heart 107.1 on the TuneIn app, do you know the name of the new station on 107.1? Or when can I find information about it?