Licensing decision aside, I don’t feel a lot of sympathy for Hashicorp here. I think this is different than other scenarios where big MSPs sell tools based 99% on open source software.
This service will largely be used for deployments on Google Cloud, for which Google invests a lot of development effort in maintaining their own provider. It’s not like there’s not already significant contribution from Google to the code base.
This isn’t really a service. All it does is deploy an infrastructure template into your GCP project. It won’t largely be for deployments on Google Cloud. It’s for automating Terraform, and whatever providers the customer wants to use.
You, the customer, pay for everything deployed, and Google just pre-connects it to all their services for monitoring and maintenance. It would be like if Route53 on AWS was free, but it deployed a VM in your account, added gateways and nats, opened ports, etc., so that it all ran on discrete infrastructure for just you and you got charged for everything and had to do all the scaling.
If you’ve used their Apache Airflow product (Cloud Composer), it’s basically the same thing. With Cloud Composer, they are setting up an Airflow node and cluster on your behalf, in your account, that you pay for, and connecting it to their services.
This is no different than going to a consulting company and asking them to set up and maintain a Terraform automation platform in your account, which Hashicorp said was allowed. Google isn’t reselling it as a product. They’re setting it up on their platform on your behalf and giving it to you.
And there’s no reason you couldn’t switch it out to OpenTF.
They may well do, but to be honest my fundamental point extends beyond just Google. Hashicorp benefits extensively from third parties maintaining their own providers.
So from a product perspective they basically manage tfstate, you get to use “pre-packaged and recommended” providers (not clear if, say, the AWS provider is allowed), and they slapped IAM onto it? Seems like another one of those Frankenstein cloud designs…
This is definitely half baked IMO. Not a whole lot of benefits over running TF with Cloud Build and a storage bucket. You definitely have more control and a better UX that way than with the weird hoops you jump through to set this up.
There are numerous comments here about the licensing change and that this could be related, but HashiCorp announced this at Google Cloud Next (and on their own blog), so it seems like a fairly standard partnership arrangement.
- the situation: you build cloud infrastructure via CLI or console
- the problem: if something gets deleted, or if you make a bunch of changes and want to undo them, or if you need to make a change to a lot of stuff at once, or if you want to copy what you've built to a new region, how do you do it? or, if you're on a team, how do you as a team make and track changes to your cloud infrastructure?
- terraform as a solution: you describe your cloud infrastructure as config files written in terraform's HCL language (not yaml). terraform can figure out what is different between what's in your cloud infrastructure and what your config files say it should look like. and, it can make changes to your cloud to e.g. build it from scratch, make wide-ranging changes, make a copy of it, etc. (a tiny example is sketched below, after this list)
- since your config files are code, you can also create a repo and do PRs to make and track changes to your cloud infrastructure as it evolves over time
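here's roughly what one of those config files looks like, as a minimal sketch (the project, bucket name, and values are made up for illustration):

    # main.tf -- illustrative only; project and bucket names are placeholders
    provider "google" {
      project = "my-example-project"
      region  = "us-central1"
    }

    # declare the bucket you want to exist; terraform diffs this against
    # its recorded state and the real world, then plans the changes needed
    resource "google_storage_bucket" "logs" {
      name     = "example-logs-bucket"
      location = "US"
    }

running "terraform plan" shows the diff between this description and what actually exists; "terraform apply" makes the changes.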
Appreciate the answer. In hindsight my use of the word scripts was insufficient.
Looked at the TF code; my solution implements similar functionality to handle AWS CRUD ops. What I avoid is all the DSL parsing and such.
For me an AWS account is a struct with fields of AWS SDK resource types, which it seems is what TF resources map to (they handle a lot more so there's more to it, but kind of sort of if I squint just right). Either way I'm going to duplicate either the internal logic or chunks of DSL per project; I'd rather avoid the context switch between syntaxes and "learning the TF ecosystem".
What Terraform brings to the table for us is the capability of calculating the delta between "I want these resources" and "these resources are actually there", by keeping a separate state (stored e.g. as JSON in S3) to compare against your code: the world as it should be versus how it actually is. That saves us from reimplementing all of that.
Why not just write idempotent resource creation? Because Terraform also uses that state to calculate a "plan" that shows the diff between your changes and reality, which really helps to figure out what will happen to your RDS when executing, especially when more abstraction (in the form of Terraform modules) is involved.
We also used Terraform in a situation where writing custom code would have been "prettier" but would have required writing this actual-vs-desired-state logic ourselves; Terraform saved us that work.
The DSL of Terraform is sometimes quite cumbersome though, as it's derived from JSON and not an actual programming language.
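For reference, pointing Terraform at remote state like that is just a backend block; a minimal sketch with made-up bucket/key/table names:

    terraform {
      backend "s3" {
        bucket         = "example-tfstate-bucket"
        key            = "prod/terraform.tfstate"
        region         = "us-east-1"
        dynamodb_table = "example-tf-locks" # optional table for state locking
      }
    }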
To your last point, yeah, I think Terraform gets really painful when you have to do something involving derived values in a loop. Also, computed values in general don't have a great story around them (which is not necessarily Terraform's fault, but rather a symptom of what you are provisioning).
A simple example of what I mean by computed values: let's say you want to provision a k8s cluster on top of a network. The k8s provider might want the network name/id, which you could normally get by setting it upstream. The problem is you can't plan the network creation and the k8s cluster in a single pass, because you don't get the network name until it's actually provisioned. You actually need to apply the network TF first to get the inputs you need to plan the cluster. Meaning not only do you need to run TF twice, you also can't E2E plan infra provisioning.
If anyone has a solution/pattern for the above (or more generally how to chain these modules together when this limitation exists) I’m all ears
Can your example be solved by having the k8s cluster resource reference the network resource’s “name” attribute?
Doing that allows Terraform to create both resources in one plan/apply step, and it also helps Terraform understand the dependency between the resources so that they are created in the correct order.
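Something like this rough sketch (names and values are invented, and a real GKE cluster needs more configuration than shown here):

    resource "google_compute_network" "main" {
      name                    = "example-network"
      auto_create_subnetworks = true
    }

    resource "google_container_cluster" "main" {
      name     = "example-cluster"
      location = "us-central1"

      # referencing the network resource's attribute creates an implicit
      # dependency, so a single plan/apply handles both in the right order;
      # the value just shows up as "(known after apply)" in the plan
      network = google_compute_network.main.name

      initial_node_count = 1
    }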
The point of TF is working with your infrastructure declaratively. You write down what you want, how it should be integrated with each other, how IAM should be set up. And then that is what you get.
For me using terraform is quicker than clicking things up or using CLI even when initially developing things, as if something goes wrong I can just destroy the state and re-apply the terraform config.
> For me using terraform is quicker than clicking things up or using CLI even when initially developing things
then you obviously have every single resource name, required value, and relationship between them memorized, because in my experience post-facto encoding of something into TF can be valuable to the organization, but trying to _discover_ the providers, IAM, and required fields to achieve a desired outcome is crawling over broken glass compared to click-ops-ing something in place
Hell, there's even a browser extension to record the AWS calls so one can at least see what was done later for replay, but with GCP they have their own sneaky RPC something-something encoding so that idea's off the table
> then you obviously have every single resource name, required values, and relationships between them memorized
I don't.
> crawling over broken glass as compared to click-ops-ing something in place
I do not experience reading the terraform provider docs like crawling over broken glass but okay.
I program in Python too, and it's not like I have memorized every class and function either, but still, I and millions of other people somehow manage to get by.
Is there anything particularly painful about working with the Google Cloud Terraform provider? If there isn't, I would rather use OpenTF with that provider and manage state myself.
In my experience, running a plan with the Google provider is much less likely to catch a bad value than with the AWS provider.
Subjectively, the AWS provider will at least validate that fields have valid values during the plan step. The Google provider doesn't seem to validate actual values until apply, and then you get a failure.
I can’t help but feel… sad about this. I only recently picked up Terraform and am astounded that this is what goes as coding in the infrastructure world. I was coming from Ansible so there was only improvement to be had, but man did Terraform let me down so far.
It (well, the provider) doesn’t validate fields until apply. That’s just so… sad. How is that acceptable? It’s like a car without a steering wheel, and people just go along with it.
It's not really Terraform's fault. Terraform provides the capability to do all kinds of validations before running an apply, but it's up to the providers to implement the validations. If the provider doesn't implement the validation, then it's not there.
It gets hairier when you delve into the details. The provider is typically an official provider that wraps some company's API, so that company ought to have a good set of validations, since it's their own API, right? Wrong. The team that writes the Terraform provider is typically different from the team that creates API methods, and the API methods themselves don't typically expose "dry-run" style functionality, so there's little for the team writing the Terraform provider to check. Meanwhile, the business doesn't care - the Terraform provider checkbox is already checked and validations/dry-running isn't a feature that affects revenue.
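On the config side Terraform does at least expose hooks for pre-apply checks, e.g. variable validation blocks (provider-level validation is a separate mechanism in the provider's schema, which is what I mean above). A made-up illustration:

    variable "machine_type" {
      type    = string
      default = "e2-medium"

      # rejected at plan time, before any API call is made
      validation {
        condition     = can(regex("^(e2|n2)-", var.machine_type))
        error_message = "The machine_type must be an e2-* or n2-* type."
      }
    }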
Do you know how hard/tedious/pointless it is to write client side evaluations for everything you do on the server? The documentation for the Google Cloud provider is shit though and absolutely should be improved.
How is a terraform plan different from a dry run?
I always mentally mapped terraform plan == dry run to validate what changes will be made. Your comment throws a wrench into that understanding.
I just recently used it and found it significantly more verbose than the AWS provider. Which is unfortunate, because I've actually grown quite fond of GCP.
As someone who doesn't have any experience with GCP, but does have experience with AWS, I would have thought this would already be a thing.
I know AWS doesn't have managed TF support - and as a former AWS ProServe employee I know that they thought about it but didn't do it because they didn't want to step on Hashicorp's business.
AWS does have its own hosted IaC service, CloudFormation, which was introduced before TF was a thing.
Which is why it's all the more puzzling Google didn't spring for OpenTF here. They single-handedly could have proven it as the fork of choice but instead they're paying into HashiCorp's bad decision?
Google has a history of welcoming products onto their platform without taking ownership or responsibility for maintaining them. It does the same thing with Elastic, Confluent, and Redis. I'm guessing Hashicorp is deploying infrastructure on Google Cloud on the customers' behalf. It's not costing Google anything because they're not reselling it. AWS does something similar with its Marketplace, but customers prefer the AWS full-service product rather than adding it to their environment by network peering or deploying templates in their account.
Is that really relevant? CloudFormation is just another primitive at this point.
It's not much different from high-level code compiling down to machine code. The benefit of writing high level code isn't that machine code is entirely gone. The benefit is instead that as a user you can mostly forget that machine code exists.
I see a roughly even split of people using CFn, Terraform and (Python) CDK.
AWS shot themselves in the foot by making the Python version of CDK second-tier after Typescript; IaC is still done by DevOps people far more often than application people, and DevOps people use Python.
Another gripe is the number of services and new features which launch without CFn support, which also blocks CDK support; when Terraform supports a new platform feature before the vendor's own tools do, that's a sign the product teams are being driven by the wrong metrics.
And as far as TF supporting services before CFT goes: guess which is easier for an AWS employee to do - getting the CF service team to support a new service, or just contributing to Terraform's open source project?
I know of at least one service where the service team introduced the needed APIs and then an employee of AWS wrote the TF provider and contributed to the project before AWS’s own internal team added it to CFT.
Source: former AWS ProServe employee. I am not referring to myself as the author.
Terraform really drove CFN to even pretend to care about resource support. It’s typically day 1 now, but only because NAT Gateways took so long it was embarrassing for AWS.
Idk if I buy this. I’m mostly a backend service developer, but my team manages our own infra using CDK and we love it. And we’re glad they used typescript as it’s a fantastic language.
> Python version of CDK second-tier after Typescript
What? It's not second-tier at all, every single feature in Python is in sync with TypeScript, the library versions are in sync, and the docs for all the languages are auto-generated. They're not second-tier, they're 100% single tier.
IME quite a few of those auto-generated docs (mostly the examples) have slight inaccuracies, where the code snippet is a mangled mix of Python and TS.
Also, the tooling you need to drive CDK is all TS-based, which means I now need to think about NPM and Node versions occasionally, which are not relevant in any other part of my workflow.
Admittedly as someone in scientific computing I am unusually far from the JS/TS ecosystem - but all I can tell you is that it feels second tier as a user.
Presumably GitHub/GitLab support is coming, but this is quite a limited product at the moment. It doesn’t even support their own Cloud Source Repositories.
You can version the Terraform configuration, either in a public Git repository or in a Cloud Storage bucket. Use Object Versioning to version configurations in a storage bucket.
This seems likely to be a partnership between Google and Hashicorp; from what I understand, most GCP integrations like Redis are actually partnerships rather than Google just running the OSS independently. Potential license and trademark implications make it seem like a particularly bad time to try the latter with Terraform.
As others have mentioned, this is not all that useful of a service - it doesn't seem to even have the concept of a "plan", let alone any approval system, and seems to only allow for the most basic of workflows. Given that TF can already store state in a bucket with proper locking, I can't come up with any potential benefit this provides over using Terraform directly.
The main question is whether this will be improved in the future or is intentionally just "Terraform Trial Edition" with terms of partnership preventing anything encroaching on Terraform Cloud. Perhaps for the former, the trial is important to understand usage to better inform revenue sharing for a future improved product.
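For comparison, the do-it-yourself setup alluded to above is only a few lines; a sketch with made-up names (the GCS backend handles state locking on its own):

    terraform {
      backend "gcs" {
        bucket = "example-tfstate-bucket" # pre-created Cloud Storage bucket
        prefix = "env/prod"               # path prefix for the state object
      }
    }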
I was really hoping this would come with rollback support for deployments. That, to me, would be the big advantage of having gcp manage the tfstate. It doesn’t look like that’s currently supported, but maybe that’s coming in the future.
Yeah if I have multiple terraform projects in one repo it would be helpful to have a UI show deployments for each project with a list of rollouts so I can see which one I want to go back to. For SREs it might not be as helpful but if you delegate some scoped terraform module creation to the developers themselves it’d help them to have a simple UI for rollbacks
Ah yeah I see what you mean. At my last (big tech fwiw) shop as we were transitioning to GitOps, we ran into this issue a lot. My current gig we're too small for this to be a big issue for us but I remember the pain well.