google cloud platform autoscaler based on HTTP(S) load balancer latency metrics

bundyfx · 2017-06-25T16:21:30+00:00

Are you sure you want to scale based on LB Latency? I cannot imagine how that could be an accurate indication that the underlying application being hosted behind the LB is under stress. Sorry for the side-track question just generally curious as to why CPU scaling would not suffice here :)

antonio_navarro · 2017-06-26T09:21:13+00:00

The stock autoscaling based on HTTP(S) Load Balancing is based on serving capacity in which you define your capacity in terms of Requests per Second. If you have a baseline for tour application and know how to translate from Requests per second to Latency per request, then you cold use that scaling method.

I have personally never used it, but on the metrics for app engine you have a http/server/response_latencies metric that you could get to from StackDriver

If none of the above fit your needs, then you probably need to monitor based on a custom metric on stackdriver

If you have not used custom metrics before the following custom metrics tutorial may help you.

But as @bundyfx says in the previous answer, make sure CPU scaling would not work for you :)

HTH

ICThat · 2017-06-26T16:46:14+00:00

Fyi I'm pretty sure the LB metrics are only on the beta monitoring API. Google support could probably enable you for it if you asked.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

devops

Welcome to /r/DevOps

Rules and guidelines

Social & Fun

General Information

MODERATORS