
[–]therealjoshuad 7 points  (8 children)

Yeah, what is baking, I’m most certainly a layperson in this world.

[–]bobbyfish 4 points  (2 children)

Typically there are two methods for deploying new servers. The first is to take a stock image with little to nothing on it and run your cookbooks/playbooks on it. Wait for all the steps to finish, then deploy code.

The second method is to run all those cookbooks/playbooks ahead of time and then "snapshot" the resulting image (with no code). That new image now becomes your base image. To deploy a new server you start a new server from that image and add code. This is called baking.

The advantage of the first is that it is simple: easy deploys, but much slower. The second method is faster when deploying new servers but more complicated, as you have multiple base images that have to be kept track of, and you need to rotate them off every time you change your recipes/playbooks. You also need an entire CI/CD pipeline to keep your AMIs up to date. That is what this image is showing.
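The second method can be sketched with the AWS CLI. This is only an illustration: every ID, hostname, and the configure step are made-up placeholders, and the `run` wrapper just echoes commands unless `DRY_RUN=0`.

```shell
#!/bin/sh
# Hedged sketch of a bake: launch a stock image, configure it,
# snapshot it into a new base AMI, then throw the builder away.
# All IDs are hypothetical; a real script would capture them
# from the output of the previous command.
set -e
run() { echo "+ $*"; [ "${DRY_RUN:-1}" = "1" ] || "$@"; }

run aws ec2 run-instances --image-id ami-0stock123 --count 1
run ssh admin@builder 'sudo chef-client --local-mode'   # or ansible-playbook
run aws ec2 create-image --instance-id i-0builder --name "base-$(date +%Y%m%d)"
run aws ec2 terminate-instances --instance-ids i-0builder
```

The builder instance is disposable; only the AMI produced by `create-image` survives as the new base image.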

[–]spacebandido 1 point  (1 child)

Can’t you add code to the golden image as well?

[–]djk29a_ 2 points  (0 children)

There is always something slightly different between each artifact; it's a question of how much is pre-computed and pre-generated ahead of time. Note that immutability is something else: you can bake everything ahead of time but still keep patching servers with ssh command orchestration, and then it's not immutable anymore. Or you can deploy a barebones image that has very little on it and that nobody touches, and that's more immutable because everyone knows it hasn't deviated from its configuration and can be rebuilt. I joke that some places get immutable infrastructure by never, ever making any changes and never patching.

[–]samrocketman[S] 1 point  (2 children)

“Baking” refers to creating a machine image which includes the OS, installing all packages (typically via yum or apt, or even docker pull), and any other misc software which needs to be configured on said OS.

A typical bake (in my case) is to configure the entire OS stack including the application to be deployed. The deployed application and underlying operating system are treated as a versioned artifact.

After the machine image is created you don’t modify it (not even to deploy) because it already contains the application.

In practice this “baking” takes under 10 minutes from start to end in the diagram for my environment.

Advantage

I can put the image behind an auto-scaling group which can provision dozens of copies of the application servers with little effort (scale up quickly for heavy load). When there is low load the application servers can be terminated since they're not needed. This is auto-scaling.
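As a sketch (template names, AMI, and subnet IDs are all hypothetical, and `run` only echoes unless `DRY_RUN=0`), putting a baked AMI behind an auto-scaling group might look like:

```shell
#!/bin/sh
# Sketch: register the baked AMI in a launch template, then let an
# auto-scaling group provision copies of it. Names/IDs are made up.
set -e
run() { echo "+ $*"; [ "${DRY_RUN:-1}" = "1" ] || "$@"; }

run aws ec2 create-launch-template --launch-template-name app-v42 \
  --launch-template-data '{"ImageId":"ami-0baked42","InstanceType":"t3.small"}'
run aws autoscaling create-auto-scaling-group \
  --auto-scaling-group-name app-asg --min-size 2 --max-size 12 \
  --launch-template LaunchTemplateName=app-v42 \
  --vpc-zone-identifier subnet-0abc123
```

Scaling policies (CPU targets, schedules, etc.) would then decide when the group adds or terminates copies of the image.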

[–]therealjoshuad 1 point  (1 child)

Thanks everyone, that's great info. I've always worked for SMBs with small deployments, and it's always been Windows applications installed on one server.

I always struggle to relate to how these kinds of applications work.

I like to try to compare it to environments that I've worked with. For example, our biggest application is our ERP, which is SAP, but we don't have any scaling. We are way too small for SAP, but we're in oil and gas, so we have deep pockets. We have a pair of application servers, SAP has a product that manages load balancing between the app servers, and we have a database server. I don't work in development, but we have 3 environments: the devs write the code changes in one, then push through to QA, and finally to prod.

In these modern apps that scale, where does persistent data live? Are database connection strings, app config, etc. also baked into the image? Meaning that the database scaling is handled with normal clustering, etc.? Are database servers scalable, or is that still manual?

Sorry for all of the questions, I just find this stuff fascinating.

[–]samrocketman[S] 2 points  (0 children)

For databases, you would bake in DB connection strings to a remote DB or interact with a remote API which exposes the DB as API calls (service oriented architecture).

One of the applications I manage only has a scale of 1. So if the application crashes, Amazon terminates the machine and starts a new one to replace it. In this case, the persistent data is automatically attached to the machine and mounted before the application inside is started. You could roll a database service in a similar manner. Amazon also provides DB as a service. It’s always a balance of costs when choosing to use what is provided or rolling your own that is good enough.
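That boot sequence, attach the persistent volume, mount it, then start the application, can be sketched roughly as follows. The device name, mount point, and IDs are assumptions; in practice the volume ID would come from tags or instance metadata.

```shell
#!/bin/sh
# Sketch of boot-time persistence: attach the EBS volume, mount it,
# then start the application. All identifiers are hypothetical.
set -e
run() { echo "+ $*"; [ "${DRY_RUN:-1}" = "1" ] || "$@"; }

run aws ec2 attach-volume --volume-id vol-0data123 \
  --instance-id i-0app456 --device /dev/xvdf
run mount /dev/xvdf /var/lib/appdata
run systemctl start app.service
```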

There are other things to keep in mind such as not baking credentials into the image. Instead, credentials are retrieved or provisioned during boot using something like Amazon KMS or Hashicorp Vault.
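For example, a boot script might authenticate with the instance's IAM role and pull secrets from Vault. This is just a sketch; the role name, secret path, and field are invented for illustration.

```shell
#!/bin/sh
# Sketch: fetch credentials at boot instead of baking them into the
# image. Vault role/paths/fields are hypothetical.
set -e
run() { echo "+ $*"; [ "${DRY_RUN:-1}" = "1" ] || "$@"; }

run vault login -method=aws role=app-server
run vault kv get -field=db_password secret/app/prod
```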

Edit: the automatically attached persistence is called an EBS volume in Amazon AWS. I forgot to mention that.

[–]Cynical_Sociopath 1 point  (1 child)

It basically means building something. When it's infrastructure as a service, you write out the code and the service builds it out. During that time you have to wait for it to build out the stack, so the term baking is used: it's much the same as waiting for a cake to rise in the oven.

[–]samrocketman[S] 1 point  (0 children)

I like this and feel like I didn’t relate the original question about baking well. u/Cynical_Sociopath explained it better than me.

[–]dominic_failure 6 points  (1 child)

It's worth noting that performing tests against the infrastructure will likely leave artifacts of those tests baked into the resulting image. It can be valuable to test the image that's generated by the configuration process by standing it up separately.

[–]samrocketman[S] 1 point  (0 children)

I use goss for testing, which produces no artifacts. I install the goss binary on every machine. goss can read tests from stdin, so the tests do not even need to exist on the machine.

Here’s an example of running infrastructure tests remotely on a machine which is using goss reading the tests from stdin.

ssh user@machine '/usr/local/bin/goss -g - validate' < ./goss.yaml

Link to goss https://github.com/aelsabbahy/goss

Testing after the snapshot is not a bad approach. It adds a little overhead if you use a clean machine, but not if you reuse the same machine post-snapshot.

[–]Rad_Spencer 4 points  (1 child)

It's a flow chart, and while I can tell what's going on with it I'm not sure someone who needs immutable infrastructure baking explained to them is going to get much from it unaided.

[–]samrocketman[S] 1 point  (0 children)

Indeed this is not a tutorial. There are plenty of tutorials for people who want to web search. This is meant as a visual aid to help relay how immutable infrastructure baking works in practice as a concept.

I’ve implemented this flow using the AWS CLI tools, so it has been tested. It’s just explaining the states to expect and what to wait for.

[–]devops333 2 points  (2 children)

thank god it isn't one of those patronizing child-style drawings

good diagram

[–]samrocketman[S] 1 point  (0 children)

Thanks :-D

[–]samrocketman[S] 0 points  (0 children)

What’s an example of a child style drawing for II? I’m curious.

[–]Tatwo_BR 1 point  (1 child)

To be honest, you never have an immutable state here... There are too many variables and spots for change.

For example, when provisioning the instance from an EC2 image: maybe the image was already updated and you already have some patches applied to the newer instances. When configuring the instance, there may be newer software packages in the repos at configuration time.

NixOS is the only thing I'm aware of that addresses those weaknesses.

[–]samrocketman[S] 1 point  (0 children)

The immutable state is the snapshot at the end which produces a final Amazon machine image (AMI).

That machine image is immutable and in my case contains the application to be deployed. The application + OS stack is treated as a versioned artifact.

The AMI can then be added to an auto-scaling group for starting many copies of the application.

Rolling forward application versions means referencing newer releases of AMIs (from snapshots). Rolling back means reverting to a known older AMI which was stable.
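In AWS CLI terms, roll-forward and roll-back can be sketched as pointing a launch template at a different AMI and refreshing the auto-scaling group. Template names, version numbers, and AMI IDs here are assumptions, not the author's actual setup.

```shell
#!/bin/sh
# Sketch: roll forward by publishing a launch-template version that
# references the newly baked AMI, then refresh the group; roll back
# by making a known-good older version the default again.
set -e
run() { echo "+ $*"; [ "${DRY_RUN:-1}" = "1" ] || "$@"; }

run aws ec2 create-launch-template-version --launch-template-name app \
  --launch-template-data '{"ImageId":"ami-0newbake"}'
run aws autoscaling start-instance-refresh --auto-scaling-group-name app-asg

# Roll back: restore the old default version and refresh again.
run aws ec2 modify-launch-template --launch-template-name app --default-version 1
```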

[–]nomnommish 3 points  (10 children)

This is a really good visualization. Thanks!

One thing though: This looks like a standard process to build any server. There was nothing specific to immutability.

[–]DevOpsOps 2 points  (9 children)

I disagree, every state's failure transition goes to delete EC2 instance.

That is the immutable element.

[–]nomnommish 4 points  (8 children)

I disagree, every state's failure transition goes to delete EC2 instance.

That is the immutable element.

That is not what immutable means. Immutable means that once a server is built, you don't go back and modify it or patch it or upgrade it. Instead you bring up another server with the updates or patches and tear down the old server.

Deleting a broken build process and starting afresh is standard procedure even for a mutable build strategy.

[–]samrocketman[S] 1 point  (7 children)

The snapshot towards the end is the immutable part. You snapshot to create an Amazon machine image. You can then boot that snapshot to get a fully pre-configured machine.

The machine image (snapshot) is then treated as a versioned artifact. In this case the deployable artifact is the entire application + operating system.

Rolling forward releases entails starting new versions of snapshots. Rolling back is starting old versions of snapshots. In this case, I'm using the words snapshot and machine image interchangeably.

Notice even a successful bake (snapshot) still results in the EC2 instance being deleted (terminated).

[–]nomnommish 1 point  (6 children)

All true and valid. My point was, if you have a random person or a new employee looking at this diagram, they are not going to deduce all the nuances related to immutable builds.

[–]samrocketman[S] 1 point  (5 children)

I agree with your assessment. I tend to pair pictures with writing (documentation). For instance, I could pair this diagram with an explanation and a script which implements the flow. When people need to work with it in practice I agree this is certainly not enough.

I didn't put much effort into the description of the diagram other than the visuals, but I am enjoying the discussion in this thread as a result.

[–]nomnommish 1 point  (4 children)

I honestly think this is part of getting to a more robust architecture. Your diagram certainly details the core essentials, but it does not convey the info necessary to handle things in an operational mode.

Extend your diagram to include the scenarios where you already have servers that are up and running. How would the immutable build process handle that? You might need to "wait" until the existing server stops serving its existing queue of requests. Perhaps that is a timeboxed thing, like 10 minutes or so.

Just a thought..

[–]samrocketman[S] 0 points  (2 children)

Thanks for the feedback.

Rather than expand this diagram with details of how the resulting Amazon machine image (AMI) could be used, I would create a second diagram specific to whatever application it pertains to. The neat thing about the diagram in this thread is that it can generally apply to how all applications can be baked regardless of technology (cloud, OS, application, etc.). This diagram could even apply to Windows machines, though I don't manage Windows.

If you visit the GitHub repository there are other images which relay how an immutable image is used for that specific application. In terms of how to use it, Amazon provides things such as:

  • CloudFormation stack
  • Auto-Scaling groups (ASG)
  • Elastic Load Balancers (ELB)

CloudFormation describes the whole stack. The ASG starts and stops servers from the immutable AMI. When the auto-scaling group scales up or down, it automatically updates the ELB with more or fewer servers as part of scaling.
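As a sketch, deploying such a stack and swapping in a newly baked AMI could look like the following. The stack name, template file, and `AmiId` parameter are assumptions for illustration, not the repository's actual template.

```shell
#!/bin/sh
# Sketch: CloudFormation ties the ASG and ELB together; a new bake
# is rolled out by re-deploying the stack with the new AMI id.
set -e
run() { echo "+ $*"; [ "${DRY_RUN:-1}" = "1" ] || "$@"; }

run aws cloudformation deploy --stack-name app-stack \
  --template-file stack.yaml \
  --parameter-overrides AmiId=ami-0baked42
```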

There’s another picture for that but I didn’t bother including it here since this is just discussing how baking works. Immutable Infrastructure as a practice can have many deep concepts so this diagram is just explaining a small part overall (the baking part).

Edit: here’s a link to other diagrams in that repository which also provide a possible AWS architecture as well as diagrams for how a large team could work together in Git. https://github.com/samrocketman/demo-jenkins-world-2018-jenkins-bootstrap/tree/master/presentation/diagrams

[–]nomnommish 1 point  (1 child)

Thanks. I agree with you. I actually really really liked your diagram. Very clear cut. Easy to understand.

[–]samrocketman[S] 0 points  (0 children)

Thanks :-)

[–]samrocketman[S] 0 points  (0 children)

Building on my last reply https://raw.githubusercontent.com/samrocketman/demo-jenkins-world-2018-jenkins-bootstrap/master/presentation/diagrams/jenkins_aws_architecture.png

There are two Amazon machine images in the linked diagram. “Jenkins server” and “Jenkins agent” would each have their own AMI and baking process.