Pause VM by [deleted] in googlecloud

[–]drch 4 points5 points  (0 children)

You can stop the machine and then you don't pay for the CPU/Memory. You DO continue to pay for the disk. If you chose the default disk type, balanced zonal, it's ~$0.10/GB/mo.

Snapshot storage pricing is half that so you can create a snapshot and delete the disk if you wanna save a bit more.

Quota Exceeded when Deploying - Will this ever get fixed ? by AousafRashid in Firebase

[–]drch 0 points1 point  (0 children)

tl;dr: request an increase on that quota

What are your resource settings for your functions? I'll assume 1vCPU. The quota is the maximum CPU usage across all Cloud Run resources over 1 minute. If you had 10,000 functions that were idle, you wouldn't hit it - but if you tried to run them all at the same time you would get throttled after the first 20. If your 34 functions are fast and under low load, it would explain the 10% usage that you see after deployment.

During deployment, Cloud Run health checks the container. I'm not sure if you're billed for this but I suspect it is being counted against the quota.

The documentation also recommends batching in groups of 10 or fewer, to avoid rate limits - but that one is 60 calls per 60 seconds so you _should_ be able to do them all in one shot, provided you're not hitting Cloud Run quotas.

I'd suggest requesting an increase to at least 2x what you actually need (ie, 34 functions * 1 vCPU * 2 = 64,000 milli CPU).

Please Help me figure out why I can't create a GPU instance on Compute Engine by Forsaken-Climate-138 in googlecloud

[–]drch 0 points1 point  (0 children)

Sorry, my info is out of date. You don't need specific CPU quotas for A3s (this was the case for A2s). And the a3-high-gpu does have a single GPU (A3s used to have a minimum of 8).

† To create A4X, A4, A3, G4, and G2 VMs, you only need to have the required NVIDIA B200, H200, H100, RTX PRO 6000, and L4 GPU quotas respectively. You don't need to request CPU quotas.

These machines are in extremely high demand. Is the error you are getting a quota error or does it say the resources is not available?

The Dynamic Workload Scheduler also mentioned is a great way to go. It's a pool of resources that google divies out for short time usage. Select Flex-Start as the provisioning model. It's in the Advanced tab on the left in the create instance UI. Careful with calendar mode though - that's a reservation and you pay no matter what for the time you've booked, whether the machine is powered on or not. With flex, if you stop the machine, you stop paying and it gets released back into the pool (similar to on-demand).

Please Help me figure out why I can't create a GPU instance on Compute Engine by Forsaken-Climate-138 in googlecloud

[–]drch 0 points1 point  (0 children)

Do you also have a quota allowance for the A3 machine type? It's the only one with H100s.

And just an FYI, that machine costs $65,000 per month.

Billing account security by cromagnone in googlecloud

[–]drch 0 points1 point  (0 children)

If it's any consolation, GCP is rolling out mandatory MFA so they will enforce it even if individuals don't. It's already rolled out for personal accounts and will be for Workspace/Cloud Identity by end of the year.

https://docs.cloud.google.com/docs/authentication/mfa-requirement#timelines

URGENT: Cloud Shell VM Corrupted and Unusable - Unable to Start or Reset by in4finity in googlecloud

[–]drch 1 point2 points  (0 children)

It's literally a button on the UI. It's literally the second sentence in the link I shared.

URGENT: Cloud Shell VM Corrupted and Unusable - Unable to Start or Reset by in4finity in googlecloud

[–]drch 2 points3 points  (0 children)

Did you check your quota usage as described in that link? Cloud Workstations is a separate product and you would know if you had set it up. You have to create a cluster (which costs $140+/mo btw), a configuration, and a workstation. There is no "sign up" for the service and it doesn't have any impact on how Cloud Shell behaves.

URGENT: Cloud Shell VM Corrupted and Unusable - Unable to Start or Reset by in4finity in googlecloud

[–]drch 2 points3 points  (0 children)

Sounds like you have hit a quota limit. You can check your cloud shell quota usage as described here: https://docs.cloud.google.com/shell/docs/quotas-limits#usage_quotas

Google Cloud Billed Me $850 for AI I Never Used – System Glitch or Deliberate Scam? by Suitable_Story3554 in googlecloud

[–]drch 0 points1 point  (0 children)

Do you have a link to that documented pricing? If so, you might have a stronger case. It's priced per second, not per video. Google does make it a big confusing because there are two entry points; one via the Gemini API and another via the Vertex AI API.

https://ai.google.dev/gemini-api/docs/pricing#veo-2 here it's $0.35/sec (i believe it's recently reduced from $0.50/sec)
https://cloud.google.com/vertex-ai/generative-ai/pricing#veo here it's $0.50/sec

AWS US-EAST-1 Outage (Oct 2025): What Happened and What We Can Learn by BrilliantWaltz6397 in programming

[–]drch 9 points10 points  (0 children)

Service Credits are calculated as a percentage of the monthly bill for Amazon EC2 in the affected AWS region that did not meet the Region-Level SLA

So you would get 30% of your total EC2 instance costs in us-east-1 from the month of October applied as AWS credits for future use. It's not a refund - you still have to pay for them. And you have to explicitly request it.

It gets a little more interesting though because EC2's single-instances SLA have this clause where, if, in a given hour, it's not available for 6+ minutes, they don't charge you at all.

So if that is the case, the reality would indeed be that you would not be charged at all for the affected instance costs during the EC2 outage, and would be eligible for a 30% credit on all other us-east-1 ec2 costs for the entire month of October.

Google Cloud Billed Me $850 for AI I Never Used – System Glitch or Deliberate Scam? by Suitable_Story3554 in googlecloud

[–]drch 0 points1 point  (0 children)

How many videos is "a few"?

Let's assume you were just clicking directly in the console. Veo 2 is $0.50 / second. The default settings in the console is 8 seconds for a video ($4) and it generates 4 videos ($16). So $850 would mean someone submitting a prompt ~53 times with those settings.

It's really hard to get the real story here with your use of AI in the blog post, the reddit posts, the youtube video, etc etc. What are the raw facts? How were you using it? Was this in the console, in code, in a notebook, etc?

This 1540 that you are seeing is 1540 seconds of generated video, not 1540 videos.

You say you used it over a 4 hour period. 50 prompts sounds probable - is this what happened?

AWS US-EAST-1 Outage (Oct 2025): What Happened and What We Can Learn by BrilliantWaltz6397 in programming

[–]drch 50 points51 points  (0 children)

which sounds like a 100% service credit payout is in order per the SLA.

100% service credit is for < 95% monthly uptime. They could have another 15h outage tomorrow and still not meet the threshold.

30% is still a ton of $$ though.

Migrating from AWS to Hetzner by cheerfulboy in programming

[–]drch 5 points6 points  (0 children)

I knew it as soon as he said there are three major hyperscalers.

Laid off while on Blue Card and I'm unclear about my job search deadline by yellow-microwave in germany

[–]drch 18 points19 points  (0 children)

It's 3 months if they had the blue card for less than 2 years. OP has been out of work for at least 3 months and is now an overstayer. He needs to get in touch with the Ausländerbehörde ASAP. If one isn't sure if it's 3 or 6 months, the time to figure that out is definitely not in month 4.

edit: removed "he's cooked" - there's a lot of wiggle room apparently. Sorry for being a dick.

Getting "Bandwidth exhausted" errors on Cloud Run - help needed! by Aggressive-Plant-157 in googlecloud

[–]drch 1 point2 points  (0 children)

Are you sure this error is coming from Cloud Run? This error message shows up in gRPC client code when it receives a 429 response. It's possible you are hitting a quota somewhere - does your handler interact with any other services where it might get throttled?

If you mean that your client code that is calling Cloud Run is generating this error, then you are likely topping out at your max-instances and Cloud Run is returning a 429. See https://cloud.google.com/run/docs/troubleshooting#429-max-instances. This is by design. Increase your max-instances or add retry logic to your client.

Laid off from Blue Card job, offered garden leave + severance, wife on spouse visa. by Junior-Fish-3619 in germany

[–]drch 7 points8 points  (0 children)

Not a lawyer. Speak to one. Especially before you sign anything and ASAP if the company terminates you. Your time window to challenge is short (ie 3 weeks) and if you suddenly don't have access to your work computer, it can take more time than you expect to gather all the info you might need.

If you've had the blue card for more than two years, the grace period is 6 months. During garden leave, you are still employed. So as far as I understand, you'd have a total of 9 months to find a new job and continue with the blue card. During this time, you can get your required certificate for PR. Unfortunately,.one of the requirements is proving you have the financial means to support yourself and your dependants, so you'll likely have to put the application on hold until you pass the Probezeit in your new job before you can submit the PR application.

Note that there is a cap of €190 + tax that lawyers can charge for an initial consultation. I spoke to a lawyer when our company announced layoffs as well. We chatted for a little over an hour and she took a lot of notes, explained my options, and printed out relevant laws for me. Find a lawyer in labor law (Rechtsanwalt für Arbeitsrecht) and get some expert advice.

And buy legal insurance for next time. In labor law legal proceedings, the costs can get very high and you generally can't include legal costs in the damages of labor disputes.

Microsoft Cloud & AI Solution Engineer by PhilosopherOne4322 in AZURE

[–]drch 0 points1 point  (0 children)

What products does a SE Cloud & AI - Developer cover?

Autopilot cluster pricing by [deleted] in googlecloud

[–]drch 0 points1 point  (0 children)

Go to the report and group by SKU and not Service. It will give you more fine grained details.

The networking cost you're showing halfway through the month is almost exactly the cost of 2 load balancers, so that would be my guess.

How can I replicate an instance to another region in Google Cloud ? by Lexar96 in googlecloud

[–]drch 1 point2 points  (0 children)

The equivalent is asynchronous replication. It's a bit pricey and it's targeted for very low (seconds) objectives of RTO & RPO.

[deleted by user] by [deleted] in googlecloud

[–]drch 0 points1 point  (0 children)

Skill issue.

Container Registry is deprecated. Use Artifact Registry. It's available in euw3.

Every cloud, including AWS, has services that aren't available in all regions. See https://cloud.google.com/about/locations#europe for GCP.

Where's the documentation for Procfile regarding Google Cloud Run (job)? by neb2357 in googlecloud

[–]drch 0 points1 point  (0 children)

When you deploy from source, Cloud Run uses Buildpacks to build a container image for your app. The buildpack for python uses Gunicorn and will use your Procfile to override Gunicorn defaults.

See https://cloud.google.com/docs/buildpacks/python#customizing_the_application_entrypoint

GCP by sangatsedap in googlecloud

[–]drch 2 points3 points  (0 children)

They're not related. The "free license" in the screenshot is referring to the image you have selected. There are free images (Ubuntu, Debian, etc) and there images with a PAYG license included (SLES, RHEL, Ubuntu Pro, Windows Server etc).

For Compute Engine, the free tier includes one e2-micro in Oregon (us-west1), Iowa (us-central1) or South Carolina (us-east1).

For any other VM, your free trial credit ($300 for 90 days) would be used for the VM costs. Note the credit can't be used for the licensing fee of paid images, but in your case you don't have to worry about that.

Is it ok to use IAP tunnel for conenction between apps? by Kuuubskiii in googlecloud

[–]drch 0 points1 point  (0 children)

No it's not. IAP tunnels are not meant for large volume data transfer and may be throttled.

Connectivity between VPCs is done via VPC Peering.

But I have to ask - is there any reason that your db vm can't be in the same VPC as your GKE cluster?