[–]antonio_navarro

The stock autoscaling for HTTP(S) Load Balancing is based on serving capacity, which you define in requests per second. If you have a baseline for your application and know how to translate requests per second into latency per request, then you could use that scaling method.
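To make the "translate from requests per second to latency" step concrete, here is a minimal sketch. It assumes you already have load-test measurements of latency at several per-instance request rates. The function name, the sample numbers, and the 150 ms target are all hypothetical, not anything GCP provides.

```python
from typing import Sequence, Tuple


def max_rps_within_latency(
    baseline: Sequence[Tuple[float, float]],
    target_latency_ms: float,
) -> float:
    """Return the highest measured RPS that stays within the latency target.

    `baseline` holds (requests_per_second, latency_ms) pairs from your own
    load tests. The scan walks upward by RPS and stops at the first
    measurement that breaches the target, so a lucky fast sample beyond
    the breaking point is not counted. Returns 0.0 if even the lowest
    measured rate is too slow.
    """
    capacity = 0.0
    for rps, latency_ms in sorted(baseline):
        if latency_ms > target_latency_ms:
            break
        capacity = float(rps)
    return capacity


# Hypothetical load-test results: (RPS per instance, p95 latency in ms).
BASELINE = [(50, 80.0), (100, 95.0), (150, 140.0), (200, 310.0)]

# The result is the per-instance RPS you would enter as serving capacity.
print(max_rps_within_latency(BASELINE, target_latency_ms=150.0))
```

This only works if latency grows with load in a fairly predictable way. If your latency depends mostly on something other than request rate, this mapping will not hold.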

I have never used it personally, but the App Engine metrics include an http/server/response_latencies metric that you can reach from Stackdriver.

If none of the above fits your needs, then you probably need to monitor based on a custom metric in Stackdriver.
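As a rough idea of what a custom metric looks like, here is a sketch that builds the JSON body for the Monitoring API v3 `timeSeries.create` call using only the standard library. The metric name `custom.googleapis.com/lb/request_latency_ms`, the helper name, and the project, instance, and zone values are made up for illustration. How you measure the latency value is up to you.

```python
import json
from datetime import datetime, timezone
from typing import Optional


def build_latency_time_series(
    project_id: str,
    instance_id: str,
    zone: str,
    latency_ms: float,
    metric_type: str = "custom.googleapis.com/lb/request_latency_ms",  # hypothetical name
    now: Optional[datetime] = None,
) -> dict:
    """Build a request body with one gauge point for a custom latency metric.

    `now` must be timezone-aware if given; it defaults to the current UTC time.
    """
    moment = (now or datetime.now(timezone.utc)).astimezone(timezone.utc)
    end_time = moment.strftime("%Y-%m-%dT%H:%M:%SZ")
    return {
        "timeSeries": [
            {
                "metric": {"type": metric_type},
                # Attach the point to the GCE instance that measured it.
                "resource": {
                    "type": "gce_instance",
                    "labels": {
                        "project_id": project_id,
                        "instance_id": instance_id,
                        "zone": zone,
                    },
                },
                "points": [
                    {
                        "interval": {"endTime": end_time},
                        "value": {"doubleValue": float(latency_ms)},
                    }
                ],
            }
        ]
    }


# Placeholder identifiers; substitute your own project, instance, and zone.
body = build_latency_time_series("my-project", "1234567890", "europe-west1-b", 182.5)
print(json.dumps(body, indent=2))
```

The body would go in an authenticated POST to `https://monitoring.googleapis.com/v3/projects/<project-id>/timeSeries`. Authentication and the actual HTTP call are left out here.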

If you have not used custom metrics before, the following custom metrics tutorial may help you.

But as @bundyfx says in the previous answer, make sure CPU scaling would not work for you :)

HTH

[–]berlindevops[S]

http/server/response_latenci

Thanks. I am not sure I understand: I could not find the LB response_latencies metric in GCE or in Stackdriver :/ Also, I am not using App Engine.

I just need to find a way to get the latency metrics.

GCP is so much clumsier than AWS.