gabe_monroy's comments

gabe_monroy · 2025-06-04T15:20:21 1749050421

this is something we are working on. stay tuned!

gabe_monroy · 2025-06-04T15:16:45 1749050205

Heard on this feedback. While not quite a hard cap, I'd also point to https://cloud.google.com/billing/docs/how-to/budgets which many customers are having success with for this use case.

gabe_monroy · 2025-06-04T15:14:42 1749050082

i am biased, but i agree :)

icedchai · 2025-06-04T15:32:51 1749051171

hah. I looked at your comments and saw you were a google VP! I've migrated some small systems from AWS to GCP for various POCs and prototypes, mostly Lambda and ECS to Cloud Run, and find GCP provides a better developer experience overall.

gabe_monroy · 2025-06-04T16:03:41 1749053021

love that you're enjoying the devex. we put a lot of sweat into it, especially in services like cloud run.

ashishb · 2025-06-04T18:53:33 1749063213

Yeah, anyone who uses GCP and AWS thoroughly will agree that GCP is a superior developer experience.

The problem is continuous product churn. This was discussed at length at https://news.ycombinator.com/item?id=41614795

gabe_monroy · 2025-06-04T15:14:09 1749050049

google vp here: we appreciate the feedback! i generally agree that if you have a strong understanding of your static capacity needs, pre-provisioning VMs is likely to be more cost efficient with today's pricing. cloud run GPUs are ideal for more bursty workloads -- maybe a new AI app that doesn't yet have PMF, where you really need that scale-to-zero + fast start for more sparse traffic patterns.

jakecodes · 2025-06-04T17:02:30 1749056550

Appreciate the thoughtful response! I’m actually right in the ICP you described — I’ve run my own VMs in the past and recently switched to Cloud Run to simplify ops and take advantage of scale-to-zero. In my case, I was running a few inference jobs and expected a ~$100 bill. But due to the instance-based behavior, it stayed up the whole time, and I ended up with a $1,000 charge for relatively little usage.

I’m fairly experienced with GCP, but even then, the billing model here caught me off guard. When you’re dealing with machines that can run up to $64K/month, small missteps get expensive quickly. Predictability is key, and I’d love to see more safeguards or clearer cost modeling tooling around these types of workloads.

gabe_monroy · 2025-06-04T17:04:49 1749056689

Apologies for the surprise charge there. It sounds like your workload pattern might be sitting in the middle of the VM vs. Serverless spectrum. Feel free to email me at (first)(last)@google.com and I can get you some better answers.

ashishb · 2025-06-04T18:38:56 1749062336

> But due to the instance-based behavior, it stayed up the whole time, and I ended up with a $1,000 charge for relatively little usage.

Indeed. IIRC, if you get a single request every 15 mins (~100 requests a day), you will pay for Cloud Run GPU for the full day.

krembo · 2025-06-04T17:53:35 1749059615

How does that compare to spinning up some ec2s with amazon trainium gpus?

mgraczyk · 2025-06-04T20:57:59 1749070679

Depending on your model, you may spend a lot of time trying to get it to work with Trainium

krembo · 2025-06-17T17:11:41 1750180301

Why is that? Can you explain?

mgraczyk · 2025-06-17T17:16:54 1750180614

The trainium toolchain is not as mature as GPU. Your model may fail to compile out of the box, and even if it does it may be slow and require you to dig into details for reasonable training/inference performance

Sn0wCoder · 2025-06-04T22:15:00 1749075300

Has this changed? When I looked pre-ga the requirements were you need to pay for the CPU 24x7 to attach a GPU so that is not really scaling to zero unless this requirement has changed...

ashishb · 2025-06-04T22:46:21 1749077181

Speaking from my experience, it does scale to zero except you pay for 15 mins after the last request.

So if you get all your requests in a 2 hours window then that's great. It will scale to zero for rest of the 22 hours.

However, if you get at least one request every 15 mins then you will pay for 24 hours and it is ~3X more expensive then equivalent VM on Google Cloud.

Sn0wCoder · 2025-06-06T16:29:57 1749227397

OK thanks will check out the options again, if it does scale to zero (including CPU) that will make it more reasonably priced.

gabe_monroy · 2025-06-04T15:08:22 1749049702

i'm the vp/gm responsible for cloud run and GKE. great to see the interest in this! happy to answer questions on this thread.

gabe_monroy · on Aug 19, 2022

Gabe with DO here. That's a bummer. I hope our support team is able to get your access restored. If you get stuck feel free email me: gabe@

migueltarga · on Aug 19, 2022

Thank you Gabe, I sent you an email.

gabe_monroy · on Aug 19, 2022

Received. I confirmed the team is working the issue. Stay tuned.

migueltarga · on Aug 19, 2022

Thank you very much for helping on this, Much Appreciated!

I would love to see improvements on the "Social Authentication" process, making the third party ID as primary identifier, instead of the email.

gabe_monroy · on Aug 19, 2022

I hear you. I’ll talk to the team about how we can do better.

earleybird · on Aug 19, 2022

I work in the public service sector. When we're architecting OAuth2/OIDC integrations we specify using a unique identifier like a guid or some otherwise immutable id as the federation id. This way other attributes that may be ephemeral can change at will. It's not always easy determining this but it's worth it.