Kinnary Jangla

Accelerating Development Velocity Using DockerDocker Across Microservices

../images/465114_1_En_BookFrontmatter_Figa_HTML.png

Kinnary Jangla

San Francisco, CA, USA

ISBN 978-1-4842-3935-3e-ISBN 978-1-4842-3936-0

https://doi.org/10.1007/978-1-4842-3936-0

Library of Congress Control Number: 2018962734

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.

Trademarked names, logos, and images may appear in this book. Rather than use a trademark symbol with every occurrence of a trademarked name, logo, or image, we use the names, logos, and images only in an editorial fashion and to the benefit of the trademark owner, with no intention of infringement of the trademark. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights.

While the advice and information in this book are believed to be true and accurate at the date of publication, neither the author nor the editors nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein.

Distributed to the book trade worldwide by Springer Science+Business Media New York, 233 Spring Street, 6th Floor, New York, NY 10013. Phone 1-800-SPRINGER, fax (201) 348-4505, e-mail orders-ny@springer-sbm.com, or visit www.springeronline.com. Apress Media, LLC is a California LLC and the sole member (owner) is Springer Science+Business Media Finance Inc (SSBM Finance Inc). SSBM Finance Inc is a Delaware corporation.

To all those engineers who struggle with ramp-up curves on new software tools!

Introduction

The idea of writing this book occurred to me while I was ramping up on Docker during my first year at Pinterest. There is a lot of content on the Internet, but it is unstructured and sometimes incorrect and inaccurate. This book will help you to understand the fundamentals of Docker. To understand anything in depth, it’s best to start with basic concepts. Over the past years, the needs of tech companies have evolved significantly. This book will help you understand the need for Docker in the software industry and how Docker has managed to ease the growing pains of this industry.

I have tried to structure this book by explaining the fundamentals before going into anything specific to Docker. I hope that helps you understand the fundamentals of Docker.

My hope, too, is that this book is useful to both students and engineers who want to ramp up on Docker quickly.

The following sections provide a snapshot of the book.

Chapter 1 : Containers

This chapter focuses on what Docker is all about. It’s about containers. But what are containers and how do they differ from virtual machines? Why does Docker make use of container technology and what are the benefits of that? What are the advantages and challenges of containerization? At the end of this chapter, you will have learned the underlying technology of Docker.

Chapter 2 : Docker

This chapter focuses on how the software industry evolved and what gave rise to the need for containers and, therefore, Docker. In this chapter, you’ll learn the history of Docker, in addition to some of its basic use cases today.

Chapter 3 : Monolith vs. Microservices

Because this book focuses on debugging microservices using Docker, this chapter talks about the evolution of microservices, the differences between monolith and microservices, and the advantages and challenges of both. It will help you understand why debugging becomes significantly difficult when you’re dealing with multiple services that all need to talk to one another.

Chapter 4 : Docker Basics

This chapter is all about taking the first few steps to begin working with Docker. This section discusses the terminology used in the Docker world, the underlying architecture of Docker, how to install Docker, and some basic Docker commands. This chapter is your go-to to step foot into Docker land.

Chapter 5 : Docker Images

This chapter goes deep into what Docker images are and how they’re created. It examines Dockerfiles, which is where all the instructions to build Docker images are located. Then it goes into how to build Docker images and, finally, into Docker containers in depth. I would encourage you to take some extra time to understand the role of Dockerfiles, Docker images, and Docker containers. I’d also advise acquiring a thorough understanding of this chapter.

Chapter 6 : Docker Compose

This chapter is devoted to the Docker Compose tool. This links all the services and helps in running an application from end to end. Here you’ll learn all aspects of Docker Compose: how to install it, how to use it, and what happens behind the scenes.

Chapter 7 : Debugging Microservices Using Docker

This is what the book has been leading to. This chapter is the core and longest chapter of this book. It explains what distributed environments are and their challenges. It later goes into depth about how to debug an end-to-end real-world use case, by explaining different related debugging techniques.

Chapter 8 : Advanced Docker Use Cases

After exploring how to debug an application, based on the microservices architecture, this chapter discusses some advanced use cases of Docker. It talks about the use of Docker in a production environment, orchestration using Docker, and offers some tips and tricks to help you with the software.

Acknowledgments

Writing a book requires teamwork. I’m lucky to have found a team of thorough tech reviewers such as Michael Erwin and James Markham, who revised my content thoroughly to ensure that this book is completely most up to date. Thanks, Apress, for the opportunity, and Nancy Chen, for all the hard work of coordination and keeping me on schedule.

This book took a long time to complete. In the past few months, I wanted to give up multiple times. It was my husband’s push and support that ultimately got me to the finish line. I can never thank you enough, Abhinav Vora.

Thank you to all my family and friends for being so patient and understanding of the lack of time and attention I was able to devote to you these past months. Your support and motivation kept me going.

Chapter 1: Containers 1

What and Why? 1

Containers vs. Virtual Machines 3

Pros and Cons of Containerizing Applications 5

Running an Application on a Host Machine 5

Running an Application on a Virtual Machine 6

Advantages of Using Containers 6

Challenges of Using Containers 7

Summary 8

Chapter 2: Docker 9

History 9

What Is Docker? 11

The Docker Runtime and Orchestration Engine 12

Docker Images 12

Dockerfiles 13

Why Should You Use Docker? 13

Docker’s Key Use Cases 14

Configuration Management 14

Code Pipeline Management 14

Developer Productivity 14

Faster Deployment 15

Application Isolation 15

Continuous Integration and Continuous Deployment 16

Consistent Environments Across Machines 16

Summary 17

Chapter 3: Monolith vs. Microservices 19

Evolution of Microservices 20

Comparing Monoliths and Microservices 22

Challenges with Microservices 24

Summary 26

Chapter 4: Docker Basics 27

Terminology 28

Architecture 29

Docker Platform 30

Docker Engine 30

Docker Architecture 32

Docker Objects 33

Docker Hub 36

Installing Docker 38

Basic Docker Commands 43

docker container run 43

docker container create 43

docker container start 44

docker container stop 44

docker image build 45

docker image pull 46

docker search 46

docker image ls 48

docker container ps 48

docker container rm 49

docker container inspect 49

Summary 53

Chapter 5: Docker Images 55

Docker Images 55

Dockerfile 56

Creating a Sample Dockerfile 56

Building Images with Dockerfile 60

Docker Containers 71

Attaching and Detaching from a Docker Container 73

Summary 76

Chapter 6: Docker Compose 77

What Is Docker Compose 77

Installing Docker Compose 81

Usage 82

docker-compose up 82

docker-compose build 83

docker-compose config 83

docker-compose kill 84

docker-compose restart 84

docker-compose ps 84

docker-compose logs 84

docker-compose start 85

docker-compose stop 85

docker-compose pause 85

docker-compose run 86

Behind the Scenes and an Example 87

Summary 97

Chapter 7: Debugging Microservices Using Docker 99

Distributed Environments 100

Advantages of Distributed Systems 100

Scalability 100

Reliability and Availability 100

Autonomy 101

Challenges of Distributed Systems 101

Heterogeneity 101

Concealing the Complexity 102

Concurrency 102

Scalability 102

Failure Handling 103

Debugging 103

Sample Real-World End-to-End Use Case 104

Debugging 128

Summary 133

Chapter 8: Advanced Docker Use Cases 135

Docker in Production Environments 136

Managing Docker Images 136

Docker in Cloud 137

Security and Network 137

Load Balancing 138

Deployment 138

Service Discovery 138

Log Management 139

Monitoring Docker Containers 139

Managing Databases 139

Orchestration Using Docker 140

Advanced Use Cases 141

Tips and Tricks 142

Summary 144

Index 145

About the Author and About the Technical Reviewer

About the Author

Kinnary Jangla

has worked in the tech industry for a dozen years and is currently an engineering manager at Pinterest in the Ads division. Previously she worked on the machine learning Homefeed infrastructure team where she used Docker to develop the debugging framework.

Kinnary previously worked at Uber and Microsoft, is the author of three books, and holds six patents. You can follow her on Twitter at @kjangla .

About the Technical Reviewer

Michael Irwin

is an application architect at Virginia Tech who is striving to modernize how software is developed and run on campus, by driving the adoption of Docker-based workloads, CI/CD pipelines, the public cloud, single-page applications, and more. As a Docker Captain and Community Leader (Meetup Organizer), he has the opportunity to share his expertise and experiences with others but also learn how others are using the latest technologies. When developing, he writes code in Node, Java (Java EE mostly), and JavaScript but actively contributes to projects written in other languages and frameworks. He’s blessed to have a beautiful wife and four daughters.

Kinnary JanglaAccelerating Development Velocity Using Dockerhttps://doi.org/10.1007/978-1-4842-3936-0_1

1. Containers

Kinnary Jangla¹

(1)

San Francisco, CA, USA

A container is any receptacle or enclosure for holding a product used in storage, packaging, and shipping.
Wikipedia, “Container,” https://en.wikipedia.org/wiki/Container , 2018.

In this chapter, you will learn the basics of containers and how they are used in the software industry. You will also see how containers differ from virtual machines and discover some of the pros and cons of using containers. This chapter puts you on the path to learning about Docker in depth.

What and Why?

You can’t work in a software company today and not hear about software containers: Docker, Kubernetes, Mesos, and a host of others. But before we dive into any of this, let’s look at what really changed in the world that led to the need for containers.

When you run a program on your machine in a certain environment, and the environment that supports your program on a production machine is not identical, problems arise. You test using a certain version of the programming language, and it runs a different version in production, so something weird happens, owing to the lack of forward or backward compatibility. Alternatively, you rely on a certain version of an SSL library, and a different version is installed in production. The network topology or the security policies might be different. These inconsistencies can cause all sorts of problems. Let’s take a step back. What is a container in the traditional sense of the word, and how can containers solve this problem?

“A container is any receptacle or enclosure for holding a product used in storage, packaging, and shipping,” right? Now let’s apply this to software.

The concept of container technology uses this same paradigm of shipping containers in transportation. The idea is that before shipping containers were invented, manufacturers had to be prepared to ship goods in a wide variety of modes—ships, trains, or trucks—with different sized containers and packaging. By standardizing the shipping container, goods could be seamlessly transferred among shipping methods, without any additional preparation. Before the advent of this standard, shipping anything in bulk was a complicated, laborious process.

The promise behind software containers is essentially the same. Instead of shipping via a full operating system (OS) and your software (and maybe the software that your software depends on), you simply pack your code and its dependencies into an image that can then run anywhere, and because these are usually pretty small, you can pack lots of containers onto a single computer.

Put simply, a container consists of an entire runtime environment: an application, plus all the dependencies, libraries and other binaries, and configuration files needed to run it, bundled into one package. By containerizing the application platform and its dependencies, differences in OS distributions and underlying infrastructure are abstracted away.

By allowing software code to be prepped in ready-made software containers, the code can quickly be moved around to run on servers running the Linux OS or be connected to run a distributed app in the cloud. This approach also has the benefit of speeding up the testing process and building large, scalable cloud applications. While this approach has been around in software development circles for many years, it has recently become more popular with the growth of Linux and cloud computing. Earlier projects taking the container approach have included BSD Jails, Solaris Zones, and Unix V7.

Containers vs. Virtual Machines

Heard the terms virtualization or virtual machine? First, what are virtual machines (VMs)? In the present day and age, when collaborating and working remotely have become commonplace, virtualization is key. Historically, as server processing power and capacity increased, bare metal applications weren’t able to exploit the new abundance in resources. Thus, VMs were born, designed by running software on top of physical servers, to emulate a particular hardware system.

At the heart of it, a VM is an app! Typically called hypervisor, it emulates an OS. Hypervisor is a program that enables you to host several different VMs on a single hardware. Everything in the VM is self-contained, and it typically has all the capabilities of the OS it is imitating.

Sounds like a fake computer, doesn’t it? However, there are some important distinctions. A VM is indeed entirely virtual, in that it doesn’t have any hardware of its own, except for the storage drive it comes from. More modern and complex VMs are supported by server setups.

Virtualization services are usually provided by specific companies, such as VMware, for example.

How do containers compare to VMs, though? Are they the same thing? When do you use what? And what is the key difference, really?

VMs take up a lot of system resources. Each VM runs not just a full copy of an OS but a virtual copy of all the hardware that the OS requires to run. This quickly adds up to a lot of RAM and CPU cycles. In contrast, all that a container requires is enough of an OS, supporting programs and libraries, and system resources to run a specific program.

What this means in practice is that you can put many more applications on a single server with containers than you can with a VM.

OS virtualization has grown in popularity over the last decade, to enable software to run predictably and well when moved from one server environment to another. But containers provide a way to run these isolated systems on a single server or host OS.

Containers sit on top of a physical server and its host OS, for example, Linux or Windows. Each container shares the host OS kernel. Binaries and libraries are the only elements created from scratch. Containers are thus exceptionally “light”—they are only megabytes in size and take just seconds to start, as opposed to gigabytes and minutes for a VM.

Containers also reduce management overhead. Because they share a common OS, only a single OS requires care and feeding for bug fixes, patches, and so on. This concept is similar to what we experience with hypervisor hosts: fewer management points but slightly higher fault domain. In short, containers are lighter weight and more portable than VMs.

VMs and containers differ in several ways, but the primary difference is that containers are isolated processes running on an OS that are implemented using namespaces. With VMs, the hardware is virtualized to run multiple OS instances. Containers’ speed, agility, and portability make them yet another tool to help streamline software development.

Figure 1-1 provides a comparison of containers and VMs.

../images/465114_1_En_1_Chapter/465114_1_En_1_Fig1_HTML.jpg — Figure 1-1
Containers vs. virtual machines

Pros and Cons of Containerizing Applications

Let us start with understanding how applications are run traditionally. That will help us understand what containerization is not.

Running an Application on a Host Machine

Traditionally, you would install an application on a host computer and run it directly from a host computer’s file system. The environment this application runs in would include the host’s file system, network interfaces, ports, devices, etc. To get the application working, you would additionally require other packages that your application depends upon. You might also want different versions of the same package running on your system.

Besides this, running multiple instances of your service on the host computer might get tricky, because the application might bind to a particular network port by default; other services might bind to the same network port; the service might have to read configuration files on service startup; etc.

Running an Application on a Virtual Machine

Running an application on a VM can overcome some of the drawbacks of running applications directly on the host OS. A VM also runs on the host, but it has its own kernel, file system, network interfaces, etc. This makes it easy to keep almost everything inside the OS separate from the host.

Because a VM is a separate entity, you don’t have the same issues of inflexibility that arise from running an application directly on hardware. You could run an application ten times on the host by starting up ten different VMs. The service on each VM could listen on the same port number and not cause a conflict, because each VM could have a different IP address, as if it’s a different computer altogether, except that it’s not.

Likewise, if you have to shut down a host computer, you could either migrate the VM to another host (if your virtualization environment supports it) or just shut it down and start it again on the new host.

The downside of running each instance of an application in a VM is the resources it consumes. Your application might require only a few megabytes of disk space to run, but the entire VM could consume many gigabytes of space. Also, the startup time and CPU consumption of the VM is almost sure to be higher than the application itself would consume.

Containers offer an alternative to running applications directly on the host or in the VM, which can make the applications faster, more portable, and more scalable.

Advantages of Using Containers

Containers offer both efficiency of resources and flexibility of usage. While VMs take up several gigabytes of space, containers are sized within the range of tens to hundreds of megabytes. A server can host significantly more containers than VMs because of the lack of the need to run multiple copies of OSs. Flexibility comes from the container being able to carry all the files it needs with it. As with an application running in a VM, it can have its own configuration files and dependent libraries, as well as its own network interfaces that are distinct from those configured on the host. So, a containerized application is easier to move around than its directly installed counterparts, and it doesn’t have to contend for such resources as port numbers, because each container they run in has separate network interfaces.

Because the container can hold the application and its dependencies it requires to run, the startup time, disk space consumption, and processing power is much lower than those of a VM. Containers also don’t have a separate kernel, as does a VM. Using containers can decrease the time required for development, testing, and deployment of applications and services. Testing and bug tracking also become less complicated, because there is no difference between running your application on a test server vs. production.

Containers are a very cost-effective solution and can potentially help you decrease your operating and development costs. Container-based virtualization is a great option for microservices, developer operations, and continuous deployment.

Challenges of Using Containers

One of the main disadvantages of container-based virtualization compared to traditional VMs is security. Containers share the kernel and other components of the host OS. This means that containers are less isolated from each other than VMs, which have their own OS. If there is a vulnerability in the kernel, it can jeopardize the security of all containers. VMs only share the hypervisor, which makes them less prone to attacks than the shared kernels of containers.

While VMs with any kind of OS can reside next to each other on the same server, you must start a new server, to be able to run containers with different OSs. For complex enterprise applications, this can be a serious constraint.

In addition to that, deploying containers in a sufficiently isolated way while maintaining an adequate network connection can be tricky too. Also, containers, as they are designed, cannot see other containers by default. So, what happens when you want your container to work closely with another container? For example, what if your service requires access to a database server?

Some of these problems are addressed by Docker, which you will read about in the next chapter.

Summary

This chapter described the basics of containers, their use in the software industry, and how they differ from VMs. It also described the difference between running an application on a host machine vs. a VM vs. a container. It discussed the advantages and challenges of using containers.

This chapter has put you on a path along which you can start from scratch, if you’re new to the world of virtualization, by comparing the differences between all options available today and the reasons the software industry has moved toward containerization rather than other available options.

Kinnary JanglaAccelerating Development Velocity Using Dockerhttps://doi.org/10.1007/978-1-4842-3936-0_2

2. Docker

Kinnary Jangla¹

(1)

San Francisco, CA, USA

Docker is another term for longshoreman. Longshoreman: a person employed by a port to load and unload ships.
https://www.collinsdictionary.com/us/dictionary/english/docker

In the last chapter, you saw what containers are and the differences between them and virtual machines (VMs). You also read about some of the advantages of containers and the challenges of using them.

Docker provides a solution to some of the problems posed by containers. But why did Docker become so successful only in recent years? Let’s look into that a little .

In this chapter, you will learn about the evolution of Docker and the reasons for its wide adoption by the software industry. You will learn some basics of Docker, some basic use cases for it, and some of its main components. We’ll dive deeper into this in the future chapters.

History

As new as containerization and Docker might sound to you, the intriguing wrinkle is that they’re really not new. The idea of containers has been around since the early days of Unix, with the chroot command. Rings a bell? Docker software was originally built on Linux containers, which were introduced in 2008.

As you should know from having read Chapter 1, containerized applications share a common operating system (OS) kernel, eliminating the need for each instance to run its own separate system. An application can be deployed in seconds and uses a lot fewer resources than hypervisor-based virtualization. However, because applications rely heavily on a common OS kernel, this approach can work only for applications that share the exact OS version. Docker found a way to address this limitation.

Docker was released as an open source project by dotCloud, Inc., in 2013. dotCloud is a San Francisco–based technology startup founded by the French-born American developer and entrepreneur Solomon Hykes. It relies heavily on namespaces and cgroups to ensure resource isolation and to package an application along with its dependencies, which are mostly Linux kernel features. It is this clustering of dependencies into a package that lets an application run across different platforms and still support a level of portability. This also allows developers to develop in the language of their choice, on a platform of their choice. This flexibility is what attracted a lot of interest in recent years.

Docker became extremely famous in many fast-growing companies that were trying to build test and dev environments for developers that could replicate production systems in many ways. Today, Docker is used by some well-known companies, including PayPal, Spotify, Yelp, and Pinterest, which are finding value in the software.

Let’s look at a time line of Docker milestones, according to the Container Journal. Docker source code was released as an open source software in March 2013. Needless to say, everyone had access to it after that. About a year later, Docker built the libcontainer framework, which it switched to. Around the same time, demand for orchestration tools increased, as Docker kept getting popular. In order for Docker containers to scale, orchestration frameworks are key. In June 2014, Google introduced Kubernetes, which helped Docker scale. Later that year, Amazon’s EC2 container service, which is a cloud-based container as a service, was offered. In June 2015, the open container initiative, which promotes open standards related to containers, was launched. A year later, Docker acquired a small company working on unikernels technology called Unikernels. By June of 2016, Docker had become very popular with the container ecosystem. It included the Swarm orchestrator in its platform, even though it was replaceable. Later that year, Docker started supporting all versions of Microsoft Windows. By 2016, Docker was extremely successful, and major companies began using it extensively for their most important use cases.

Now that we’ve reviewed how Docker became a success in the industry, let’s dive deeper into what Docker is and what use cases it solves.

What Is Docker?

Docker is the name of the company that produces the software called Docker. It is also the open source project that is now called Moby. When someone refers to Docker, he or she can be referring to any of these three things. Let’s try to understand a bit about each of them.

Docker is a software that runs on Linux and Windows. It is a tool designed to make it easier to create, deploy, and run applications, by using containers. The software is developed in the open, as part of the Moby open source project on GitHub.

Docker is a tool that is mainly designed for developers, so that they can focus on developing on their choice of platform, without having to worry about the OS the application will eventually run on. It allows them to run end-to-end workflow without having to get into services they don’t understand. In other words, it helps them to obtain a clearer view of the entire stack fairly easily. Additionally, running Docker containers has no additional memory overhead, so multiple Docker containers running multiple services creates very low overhead.

Understanding the different parts of Docker will help us get a good overview of everything Docker is made of before we dive deeper into any of it. The Docker architecture is explained in detail in Chapter 4.

The Docker Runtime and Orchestration Engine

The Docker engine is the infrastructure plumbing software that runs and orchestrates containers. This means that all the Docker, Inc., and third-party products plug into the Docker Engine and build around it. It is combined with a workflow for building and managing your application stacks. It is this underlying client-server technology that builds and runs containers using Docker’s components and services. It is made up of the Docker daemon, a server that is a type of long-running program; a REST API, which specifies interfaces that programs can use to talk to the daemon and tell it what to do; and the CLI, the command-line interface that talks to the Docker daemon through the API. Many docker applications use the underlying API and CLI.

In other words, the Docker Engine is the program that creates and runs the Docker container from the Docker image file. So, next, let’s take a quick look at what a Docker image file is.

Docker Images

A Docker image is not just a file; it is more of a file system. This file system is composed of multiple layers, and each layer contains a file of the contents for that layer that cannot be changed. In other words, it is immutable. It is essentially a snapshot of a Docker container.

Docker images are created with the build command. They produce a container and are stored in a Docker registry. Images can become fairly large quite quickly. Therefore, they are designed to be composed of layers of other images, allowing a minimal amount of data to be sent when transferring images over a network.

To explain this more clearly with a programming metaphor, if an image is a class, then a container is an instance of a class—a runtime object. Containers are lightweight and portable encapsulations of an environment in which you can run applications.

An image is created using a Dockerfile. Let’s see what a Dockerfile is. Later on, we’ll learn how to build a Docker image from a Dockerfile in detail, in Chapter 5. For now, let’s take a quick look at what Dockerfiles are all about.

Dockerfiles

Everything starts with a Dockerfile. It is a text document that contains a set of instructions or commands to assemble an image that are understood by the build engine.

The Dockerfile defines what goes in the environment inside your container. Access to resources, mapping volumes, passing arguments, copying files that must be inside your container, etc., go into this file. After creating the Dockerfile, you will have to build it to create the image of the container. The image is just the snapshot of all the executed instructions in the Dockerfile. Once you have this application image built, you can expect it to run across any machine using the same kernel.

Why Should You Use Docker?

Docker provides application isolation with little overhead. By saving space with the low memory footprint, it has some powerful advantages.

Primarily, you can benefit from the extra layer of abstraction (in which code and its dependencies are packed together) offered by Docker. Another significant advantage is that you can have many more containers running on a single machine than you can with virtualization alone, owing to Docker’s lightweight nature.

Another significant advantage is that containers can be spun and shut down within seconds. The Docker FAQ has a good overview of what Docker adds to traditional containers.

Let’s look at some of the key uses.

Docker’s Key Use Cases

Here are some of the key use cases that Docker supports that promote consistency of environments.

Configuration Management

Simplifying configurations is one of the primary use cases of Docker. One of the features it provides is the ability to run any application or platform with its own config on any OS or other infrastructure. Docker provides the capability of clubbing your environment with your configuration into code, packaging it, and deploying it.

Code Pipeline Management

When you have simplified your application configuration, code management becomes a lot simpler as a result. Code lives in many different environments before it reaches a point at which it can be shipped. It first lives in the developer’s machine, where it is tested, then it goes to test environments, where it might be deployed on test machines. Only after that does it reach the production servers.

All these environments vary in infrastructure, settings, configuration, etc. With Docker, a consistent environment is provided across these different phases, which in turn ease the development and deployment process. The ease with which Docker images can be spun helps you to maintain consistency across runtime environments.

Developer Productivity

As mentioned earlier, the life cycle of shipping an application goes through numerous phases, starting from the developer machine all the way to the production servers. At all points, we mostly strive to ensure a consistency between test and production environments.

To achieve this, every service must reflect how it will run in the production environment. For that to be possible, test environments require all the dependent services that end up taking huge amounts of space.

Docker comes in handy here by allowing a bigger number of services to run simultaneously, by not adding to the memory footprint. Docker’s shared code volumes make it available to the container’s host OS, which helps to support low memory usage.

This works amazingly well for developers, because they can use the code editor of their choice on a platform of their choice to develop the application, without worrying about the OS the application will run in on a production setting. This also helps developers avoid getting into the nitty gritty of services they don’t really understand but still enables them to test their end-to-end scenarios, which implicitly helps them understand the full stack better.

Faster Deployment

Prior to the existence of VMs, spinning up new hardware was a very cumbersome and time-consuming process. With VMs, that process became slightly easier, and with Docker, it became exponentially easier.

Creating and destroying Docker containers, bringing up a new container, etc., become extremely simple with Docker, not to mention less costly, which in turn allows for better resource allocation.

Application Isolation

When multiple microservices power up an application, it is very likely that these services depend on common libraries and packages, but possibly different versions of them. If you were to start an application on a single machine, getting all these services up and running to kick-start the application would practically be impossible, owing to the version conflicts of the various dependencies.

For that reason, isolating these microservices in their own environments, with only their dependencies and configurations that don’t conflict with other services, lets that service run independently. Setting up all these microservices in their independent Docker containers and having these containers communicate with each other seems like an ideal solution to getting an application up and running seamlessly.

Continuous Integration and Continuous Deployment

Docker has the ability to do image versioning. This means that you can set up your Docker containers to pull new code from your code repository, build it, package it in a Docker image, and push this new image to your image repository. Your deployment tool can then pull the newest image from your image repository, deploy it to your test environments, and, finally, promote it to your production environments. You could do this either every time there is new code in your repository or at a certain frequency, depending on how often you require your code to be deployed.

Consistent Environments Across Machines

How often have you observed that something works on your coworkers’ machines but not on yours? Docker helps you prevent this situation completely, by setting consistent environment variables and configuration settings in the image file, so that your and your coworkers’ machines look the same, without any other variables that can affect the run of an application or service.

Summary

In this chapter, you learned how Docker evolved, how it went from being an open source project in 2013 to acquiring unikernels to running natively on Windows. You saw what requirements of the software industry gave rise to the wide adoption of Docker. You also learned some basics of Docker and its components. We’ll dive deeper into this in future chapters.

Finally, you learned some of the key use cases of Docker, ranging from code pipeline management to faster deployments to increasing developer productivity. These are just some of the use cases of Docker that are widely applied across the software industry.

In the next chapter, you will learn about the differences between monoliths and microservices and when and why you use one vs. the other. You will see how to use Docker with microservices, as well.

Kinnary JanglaAccelerating Development Velocity Using Dockerhttps://doi.org/10.1007/978-1-4842-3936-0_3

3. Monolith vs. Microservices

Kinnary Jangla¹

(1)

San Francisco, CA, USA

A monolith is “a large single upright block of stone, especially one shaped into or serving as a pillar or monument.”
Oxford Living Dictionaries, s.v. “monolith,” https://en.oxforddictionaries.com/definition/monolith , accessed October 1, 2018.

Docker provides a solution to some of the problems posed by containers. But why did Docker become so successful only in recent years? Let’s delve into that a bit.

In the previous chapter, you learned about the evolution of Docker and the reasons for its wide adoption by the software industry. You also learned some basic use cases of Docker and its components.

In this chapter, I will consider the evolution of the microservices architecture. You’ll see how challenges posed by a monolith system, such as difficulty in continuous deployments, testing, scalability, etc., were solved by adopting a microservices architecture. You will also learn about the challenges of a microservices architecture and how application isolation enabled by Docker can come to the rescue.

Before we get into microservices, however, let’s first understand how microservices and service-oriented architectures are related. Both are architectures based on distributed systems, but there are some fundamental differences.

Microservices architecture is a kind of service-oriented architecture. In both architectures, services have a certain responsibility. These services can be developed independently on different tech stacks, and in both architectures, developers must deal with the complexity of a distributed system. However, microservices architecture splits an application into multiple different services that can be independently developed, scaled, tested, and deployed, whereas in a service-oriented architecture, services are provided to other application components. A service-oriented architecture must be deployed as a monolith, and all services must follow the same communication protocol.

Now let’s look at how microservices evolved.

Evolution of Microservices

Before we go into learning how microservices evolved, let’s first look into challenges presented by monoliths, because that is what contributed to the need for microservices architecture.

A monolith application is a single, self-contained software application in which all components of the application, including the user interface and the data access code, are all tightly coupled into a single program.

While a monolith service is simple to implement, test, deploy, and perhaps even scale, there are many other challenges that can arise as the complexity of the software application increases. Here are some of the challenges:

It becomes more and more difficult to test different pieces of the application independently.
Continuously deploying the entire application becomes tedious.
If you change a piece of code in a certain area, you will have to deploy the entire service, which could seem quite long and unnecessary.
A software bug in any module can bring the entire service down. Monoliths have single points of failure, which are very difficult to debug.
As the size of a monolith application increases, the startup time of the application keeps increasing with it.
To adapt new frameworks and technologies in your monolith app that uses a single stack, you must rewrite the entire application.

To mitigate all of these potential pitfalls, microservices architecture was born.

A microservices architecture is one in which a monolith is split into multiple smaller services that operate independently of each other but are interconnected. Each microservice is an independent service or an independent application. Different microservices in an application can be built on different software stacks and implement their own architecture. What’s more, in a microservices architecture, each microservice can additionally implement its own database schema, as required, instead of sharing a single database schema. It can also use a database that best suits its need. As a matter of fact, microservices should use their own databases and database schema; otherwise, the dependency on shared databases and schemas doesn’t really allow the services to be independent. Figure 3-1 shows two services using MySQL but different instances of it. The monolith is broken down into multiple services, each of which uses its own database.

../images/465114_1_En_3_Chapter/465114_1_En_3_Fig1_HTML.jpg — Figure 3-1
Microservices architecture in which an application is broken down into multiple services, and each service uses its own independent database

Microservices have many advantages over monoliths. A microservice architecture deals with the complexity issue of a monolith, for which it helps in dividing a single application into multiple components. This makes understanding as well as maintaining the code base a lot easier. Because the services operate independently, they can be developed using a framework that best suits the need. This gives developers a lot of flexibility, as they are free to choose what works best. Different modules can be deployed independently of one another. Services can also be scaled, as required. Testing independent services becomes easier as well, owing to the modularity that comes with a microservices architecture.

Comparing Monoliths and Microservices

Table 3-1 provides a consolidated view of a monolith vs. a microservices architecture.

Table 3-1

Differences Between Monolith and Microservices

	Monolith	Microservices Architecture
1. Maintenance	Maintenance grows in complexity as the application does.	It is easier to maintain microservices, as they are modular and independent.
2. Deployment	Continuous deployment becomes very difficult as the monolith keeps growing.	Deployment of individual services is easier, and services can be deployed as and when required.
3. Testing	Testing the entire monolith becomes a pain.	Testing individual components is much easier.
4. Startup time	As the monolith grows in size, the startup time increases with it.	Startup times of individual services are much faster, because they are smaller in size.
5. Adoption of newer technologies	A monolith is written in a single language, uses a single database, and is averse to adopting newer technologies.	Developers are free to choose the technologies to build their microservices. Each microservice can also use a database that best suits its needs. Microservices architecture allows you to take advantage of the latest available technologies.
6. Scalability	It’s much harder to scale a complex monolith.	Microservices can be scaled on demand, as and when needed.

Challenges with Microservices

While microservices address many issues with monoliths, they introduce many other kinds of problems that present a challenge. With a microservices architecture, you are dealing with all challenges that come with a distributed system. For example, because services in a microservices architecture are interconnected, inter-service communication must occur, and for that, a single, reliable, and consistent communication channel must be established, for example, using HTTP.

Multiple services mean more management of those services. All of these must be independently managed for their health and maintenance. These services have to be frequently updated and upgraded to meet the newest versions of the dependencies they use.

Microservices might have their own logging mechanisms. This might result in lots of unstructured and potentially unmanaged data. Retrieving logs can become confusing with gigabytes of available logging data.

Finding the root cause of a failure in a certain workflow might be very tedious to debug. In order to debug an entire workflow, you might have to get multiple services up and running and then test them end to end, in order to know where the bug exists, because the logic is distributed, as is the data. There could also be cyclic dependencies between services, which can be very difficult to deal with while debugging the root cause of a failure.

Last, the most significant issues are those related to versioning. When more than one service depends on certain libraries or packages, but only different versions of those libraries, it becomes tricky to get these services up and running. How can you have two versions of the same dependency on your machine? If you can’t have that, how can you manage getting these services up and running, either in a production system or in your debugging environment?

For example, imagine a spellchecker application with three different microservices: service A, service B, and service C. When the user enters a word to check the spelling of, the request is sent to service A, which depends on JavaScript version 1.8.5, Python version 2.7, and Flask version 0.12.4. Service B takes the request from service A, checks the spelling against a dictionary, and sends it to service C. In order to get service B up and running, you need Flask version 0.10.3. Service C takes this spelling and writes it to a database for records. Service C depends on Python version 2.1.

Table 3-2 shows the dependencies required on your machine, to get these services up and running successfully.

Table 3-2

Service dependencies

Service A	Service B	Service C
JavaScript v1.8.5	-	-
Python v2.7	-	Python v2.1
Flask v0.12.4	Flask v0.10.3	-

As you see, getting service A and service B running on the same machine is practically impossible, because they both require a different version of Flask. Similarly, getting service A and service C running successfully on a single machine is also impossible, owing to the different versions of Python.

This is one of the most prevalent and widely seen problems in the software industry. A common solution might be to update your services to use the same version of a certain dependency. But in a complex application with thousands of microservices, this becomes extremely difficult to keep track of. So, what is a good solution here? Docker.

In the preceding example, if you isolate service A, service B, and service C in their own environment and let them run independently and, at the same time, enable inter-process communication between them, they would not conflict with one another. Docker enables exactly this!

In the next few chapters, I will delve into how exactly this problem can be solved with the help of Docker, in addition to the many other advantages of using Docker to solve related problems.

Summary

In this chapter, you saw how the microservices architecture was born and how it evolved. The many challenges that came with monolith services were solved by the microservices architecture.

You also saw the differences between a monolith and a microservices architecture. You saw how as an application grows in size and complexity, a monolith poses many problems, such as difficulties with continuous deployments, testing, scalability, startup, etc. These are elegantly solved by a microservices architecture.

Last, you saw that with a microservices architecture come all the challenges of a distributed system. You saw how getting multiple services up and running can be quite challenging, if they rely on different versions of the same dependencies. Application isolation comes in very handy here. And Docker can help us with that.

In the next chapter, I will get into the basics of Docker and explain the nitty-gritties, including related terminologies, its architecture, how to install Docker, and some basic commands to use to get started.

Kinnary JanglaAccelerating Development Velocity Using Dockerhttps://doi.org/10.1007/978-1-4842-3936-0_4

4. Docker Basics

Kinnary Jangla¹

(1)

San Francisco, CA, USA

Essential foundations, starting points, and fundamentals

In this chapter, we will look into the Docker terminology that has been used in the previous chapters of this book and which I will continue to use in future chapters.

You’ll see the different components of the Docker architecture, including the Docker Engine, Docker Hub, Docker clients, Docker host, and Docker registries. You’ll see how different Docker objects are created by the Docker daemon and how Docker Hub can be used to pull existing Docker images and buy, sell, or distribute images for free.

Additionally, you will learn how to install Docker on the Mac operating system (OS) platform.

I will examine more closely some of the basic Docker commands, providing an example of the use of each command, so that you can play around with it and then follow it with your own example.

Terminology

Before you begin to approach the fundamentals of Docker, it is important to learn the associated lingo. Following are certain keywords and phrases that you will come across frequently, now that you’re on the path to becoming a Docker expert!

Image: A Docker image is a bundle of all the dependencies and configurations that an application depends on to run successfully. An image is this package that runs inside a container. Once an image is created, it cannot be changed. In other words, a Docker image is immutable.
Container: A Docker container is a lightweight instance of a Docker image. It is a running process that has been isolated using namespaces and uses the image for its root file system.
Dockerfile: A Dockerfile is a text file that contains instructions to build a Docker image.
Building a Dockerfile: This refers to building the instructions in the Dockerfile, in order to create a Docker image that can then run inside a Docker container.
Compose: This refers to a command-line tool that operates on one or more files that are a composition of multiple Dockerfiles of different applications/services, in a sense. With the Compose tool, you can run a single YAML file and get the images build to create and have them all running together.

Architecture

Before mastering Docker, let’s get into how it all works behind the scenes, to get a solid understanding of how it really works and how its different components interact with one another.

To begin with, let’s look at Docker’s different components:

Docker platform
Docker Engine
Docker architecture
- Docker client
- Docker daemon
- Docker registries
Docker objects
- Images
- Containers
- Services
Docker Hub

As you have seen in the previous chapters, some of the advantages of Docker are process- and application-level isolation, portability, and ease of deployment and testing. Many different components come into play to support these scenarios. So, let’s delve into the components one at a time.

Docker Platform

Docker provides a platform to bundle dependencies and other information, such as environment variables, configurations, settings, etc., into a single isolated environment. Owing to this isolation, dependencies across applications do not interfere with each other, and, hence, multiple applications can run inside their own containers. These containers can all run simultaneously on a single host machine. Because containers are different than virtual machines (VMs), in that they don’t need a hypervisor later and can run directly on the host machine’s kernel, a lot more containers can run on a single hardware machine than if you were to use VMs.

The Docker platform also provides the ability to manage your containers, allowing you to develop and test your applications using containers. When ready, you can also deploy your application in its production environment, using containers.

Docker Engine

The Docker Engine is a client-server application. It consists of the following three parts, as shown in Figure 4-1.

1.
A server process, also known as a daemon process. This is a background process that is continuously running and constantly listening to the REST API interface for any commands to process.
2.
A REST API interface that programs can talk to, in order to communicate with the Docker daemon. This can be accessed by an HTTP client.
3.
A client that is a command-line interface (CLI).

../images/465114_1_En_4_Chapter/465114_1_En_4_Fig1_HTML.jpg — Figure 4-1
Docker Engine architecture

The way to get anything done using Docker is through the Docker client, via the CLI or a script composed of commands. The client then communicates these commands, via the REST API, to the Docker daemon, which is the server. The Docker daemon then gets the job done. It creates such Docker objects as images, containers, volumes, etc.

Let’s look more extensively into Docker’s client-server architecture.

Docker Architecture

The Docker system mainly consists of the Docker client, daemon, and registry (Figure 4-2).

../images/465114_1_En_4_Chapter/465114_1_En_4_Fig2_HTML.jpg — Figure 4-2
Docker client-server architecture

Docker Client

The Docker Client is the primary way in which most users interact with Docker. When you run commands using the CLI, these commands are then sent to the Docker daemon, using the Docker API interface. The Docker daemon or the dockerd then executes these commands and creates relevant Docker objects. The Docker client has the ability to communicate with multiple Docker daemons.

Docker Daemon

The Docker daemon is a server process that is persistent in nature and runs in the background. It continuously listens to the REST API interface and looks for any incoming requests to process commands. The daemon can listen to the API interface using different socket types, such as Unix, TCP (transmission control protocol), and FD (file descriptor) .

Docker Registries

The images created by the Docker daemon must be stored at a certain location, for ease of access. The Docker registry is this location. There are public registries, such as the Docker Hub, that can be used by anyone. By default, Docker looks for images on the Docker Hub, but this can be configured to use your private registry as well.

Commands such as Docker pull retrieve the required images from your configured registry and Docker push pushes the image to this same configured registry.

From a Docker store, you can buy, sell, or distribute images for free. You can then use these images to deploy an application in your test or production environment.

Let’s move forward a bit and look at the different objects of Docker that have been referenced multiple times in this book so far.

Docker Objects

With the use of Docker, different objects are generated, mostly by the Docker daemon. Some of these objects are images, containers, services, and storage.

Images

A Docker image is a read-only file system that contains instructions to create a container that can run an application. Most of the time, a Docker image is based on another image and is customized. You could either use existing images published in public registries, such as the Docker Hub, or create your own image.

A Dockerfile is used to build a Docker image. A Dockerfile contains simple instructions that can be understood by the Docker daemon, to create the image and run it.

Docker images are layers that correspond to each instruction in the Dockerfile. A part of what makes a Docker image super lightweight is that when you modify a part of the Dockerfile, only that layer is modified, rather than the entire image.

Containers

A Docker container is an instance of an image. An image runs inside a container. You can manage a container using stop, start, and delete commands. Multiple containers can be connected to one another through a network. They can be connected to storage, and they can also talk to one another.

As you have seen in Chapter 1, containers are much more lightweight than VMs, owing to their startup times being very fast.

In order to create a container, an image, in addition to the container’s configuration and settings, is provided. When a container is deleted, everything related to the container is also deleted, including state and storage.

The Docker run command is used to run a container. When you run this command, the following things happen:

1.
The Docker image is pulled from the configured registry.
2.
A new Docker container is created.
3.
A local file system is allocated to that container, to enable creation and modification of files and directories in its local file system.
4.
The container is connected to the default network, unless you configure a networking option. A container is assigned an IP address.
5.
Docker starts running the container and attaches it to your local terminal. This allows you to interact with this container.
6.
You can stop or remove the container, using your terminal input, at any time.

Services

In a distributed application, different functionalities of the app constitute different services. For example, if you are building an application for suggestions based on keywords entered by the user, you might want a front-end service that takes the word and sends it to the service that verifies the legitimacy of the word. This might, in turn, go to another service that might execute an algorithm, in order to generate the suggestions, etc., which are then returned to the service.

These are all different services on different Docker containers that sit behind different Docker daemons. These Docker daemons are all connected through the network and interact with each other. To the user, this might look like a single application that runs, but behind the scenes, these are multiple services that make the entire application function.

All these services work together as a swarm, managed by different managers and workers. Each swarm contains a Docker daemon. These daemons communicate with each other using the Docker API.

A Docker Compose YAML file is used to get all these services up and running together. Later, in Chapter 6, you will see how to use the Docker Compose tool in detail.

Docker Hub

Docker Hub is the primary location for storage of Docker images. It is a cloud-based public registry from which you can pull images or push images to. It also links to Docker Cloud. It is a centralized store for image discovery and distribution. By default, Docker is configured to use this public registry.

A user can buy or sell Docker images from the Docker Hub. Alternatively, a user can also distribute Docker images for free on the hub. A user can search for Docker images using the Docker Hub user interface or the CLI.

kinnaryjangla@dev-abc: docker search alpine

NAME DESCRIPTION STARS OFFICIAL AUTOMATED

alpine A minimal Docker image based on Alpine Linux... 4203 [OK]

mhart/alpine-node Minimal Node.js built on Alpine Linux 379

anapsix/alpine-java Oracle Java 8 (and 7) with GLIBC 2.28 over A... 346 [OK]

gliderlabs/alpine Image based on Alpine Linux will help you wi... 177

frolvlad/alpine-glibc Alpine Docker image with glibc (~12MB) 162 [OK]

alpine/git A simple git container running in alpine li... 46 [OK]

kiasaki/alpine-postgres PostgreSQL docker image based on Alpine Linux 42 [OK]

zzrot/alpine-caddy Caddy Server Docker Container running on Alp... 32 [OK]

easypi/alpine-arm AlpineLinux for RaspberryPi 30

davidcaste/alpine-tomcat Apache Tomcat 7/8 using Oracle Java 7/8 with... 30 [OK]

byrnedo/alpine-curl Alpine linux with curl installed and set as ... 17 [OK]

etopian/alpine-php-wordpress Alpine WordPress Nginx PHP-FPM WP-CLI 15 [OK]

hermsi/alpine-sshd Dockerize your OpenSSH-server upon a lightwe... 12 [OK]

davidcaste/alpine-java-unlimited-jce Oracle Java 8 (and 7) with GLIBC 2.21 over A... 11 [OK]

hermsi/alpine-fpm-php Dockerize your FPM PHP 7.2 upon a lightweigh... 10 [OK]

alpine/socat Run socat command in alpine container 10 [OK]

graze/php-alpine Smallish php7 alpine image with some common ... 9 [OK]

yobasystems/alpine-xen-orchestra Xen Orchestra running on Alpine Linux [docke... 8 [OK]

masterroshi/xmrig-alpine Cryptonote CPU Miner wrapped in a Alpine Doc... 8

spotify/alpine Alpine image with `bash` and `curl`. 5 [OK]

tenstartups/alpine Alpine linux base docker image with useful p... 5 [OK]

functions/alpine Alpine Linux / BusyBox with the OpenFaaS wat... 4

govuk/gemstash-alpine Gemstash server running on Alpine 3 [OK]

casept/alpine-amd64 A basic alpine linux image. 0

smartentry/alpine alpine with smartentry 0 [OK]

Now that we have looked behind the scenes at how Docker actually operates, let’s see how to install it.

Installing Docker

There are two Docker editions available to install.

Docker Community Edition (CE) : This works for small communities or individual developers looking to get started and experiment with Docker.
Docker Enterprise Edition (EE) : This is meant for enterprises that use Docker to ship business-critical applications that need to scale.

For the purposes of this book, let’s look at how to install the Docker CE.

Docker CE is available for both the Mac and Windows OSs. It is also available to Amazon Web Services and Microsoft Azure.

Let’s look at how to install Docker CE on the Mac OS platform. There are some system requirements to meet before you can install Docker on your machine. You will need a Mac machine model that is at least from 2010 or later. In addition, you will need at least 4GB of RAM.

1.
Go to the Docker store at https://store.docker.com/editions/community/docker-ce-desktop-mac and click Get Docker, from the right-side pane, as seen in Figure 4-3.
Figure 4-3
Getting the Docker Community Edition for Mac
2.
Once you have the dmg file on your machine, double-click it and drag Moby the whale to the Applications folder, as shown in Figure 4-4.
Figure 4-4
Drag Moby to your Applications folder
3.
In the Applications folder, double-click the Docker app, as seen in Figure 4-5.

../images/465114_1_En_4_Chapter/465114_1_En_4_Fig5_HTML.jpg — Figure 4-5
Docker icon as seen in the Applications folder

Authorize Docker.app with your system password, after you launch it. You will need admin access to launch the different Docker components.

4.
The Moby whale on the status bar on the top, as shown in Figure 4-6, indicates that Docker is now running.
Figure 4-6
Docker icon on the status bar
5.
If you have successfully installed the app, you will also see a pop-up with a success message, next steps, and tips, as shown in Figure 4-7.

../images/465114_1_En_4_Chapter/465114_1_En_4_Fig7_HTML.jpg — Figure 4-7
Successful installation of Docker shows a pop-up with next steps

To dismiss this pop-up, click the whale on the top status bar.

6.
Right-clicking the whale on the status bar will give you options to set or modify your preferences, as shown in Figure 4-8.
Figure 4-8
Right-click Docker menu on the status bar icon

7.
Check About Docker, to ensure you have the latest version.

Now that we have Docker installed and running on our machines, let’s take a look at some basic Docker commands, so that you can play around and experiment with them.

Basic Docker Commands

Following are some basic Docker commands that you can start playing with.

docker container run

This runs a command in a new container. When a user runs the Docker run command, it isolates the containers in its environment and the configuration within its own local file system.

The Docker run command specifies an image, in order to run that image inside a container.

The basic docker container run command looks like this:

docker container run [OPTIONS] IMAGE [COMMAND] [ARG...]

Image is the existing image you want to run inside the container. With docker container run [OPTIONS], the developer can modify the defaults of the images. Some options types are

-d: You can choose to let the container run in the background, in detached mode, or in the foreground. By default, when -d is not specified, the container runs in the foreground.
-a: The foreground mode lets you attach your local console to the process’s (running inside the container) standard input output.

docker container create

The docker container create command lets you create a new container from an existing image that has been built previously. This is shown following. The –t command stands for “tty,” which sets a pseudo time of the container to live, and the -I command stands for “interactive” and keeps the standard input open, even if it’s not attached.

Usage:

docker container create [OPTIONS] IMAGE [COMMAND] [ARG...]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker container create –t -i myApp bash

38001kjhasd7qhs8whs7sh38729wajsh352191j888dhasg2

kinnaryjangla@dev-abc:~/code/test$

docker container start

The docker container start command lets you start a new container or a container that has been previously stopped, as shown here. The –t flag stands for “tty” and is used to give the container a pseudo time to live. The -I flag keeps the standard input open, even when it’s not attached.

Usage:

docker container start [OPTIONS] CONTAINER [CONTAINER...]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker container create –t -i myApp bash 38001kjhasd7qhs8whs7sh38729wajsh352191j888dhasg20

kinnaryjangla@dev-abc:~/code/test$ docker start –a -i 38001kjhasd

root@38001kjhasd:/mnt/myApp #

docker container stop

The docker container stop command lets you stop a currently running container.

Usage:

docker container stop [OPTIONS] CONTAINER [CONTAINER...]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker container stop 38001kjhasd

docker image build

The docker image build command builds the docker image using the instructions in the Dockerfile.

Usage:

docker image build [OPTIONS] PATH | URL | -

Example:

kinnaryjangla@dev-abc:~/code/test$ docker image build myApp/.

Sending build context to Docker daemon 1.649MB

Step 1/6 : FROM openjdk:8

---> ef09cb43251e

Step 2/6 : ENV CONFIG_FILE config/myApp.dev.properties HEAP_SIZE 4G LOG4J_CONFIG_FILE config/log4j.dev.properties NEW_SIZE 2G JAVA_COMMAND java

---> Using cache

---> 09c7e98f7c49

Step 3/6 : WORKDIR /opt/myApp

---> Using cache

---> 3c29b8fa2f25

Step 4/6 : ARG ARTIFACT_PATH=target/myApp-0.1-SNAPSHOT-bin.tar.gz

---> Using cache

---> c563d2e7990c

Step 5/6 : ADD $ARTIFACT_PATH /opt/myApp/

Successfully built bd6110589d1b

docker image pull

The docker image pull command pulls an image from a docker registry.

Usage:

docker image pull [OPTIONS] NAME[:TAG|@DIGEST]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker image pull alpine

Using default tag: latest

latest: Pulling from library/alpine

8e3ba11ec2a2: Pull complete

Digest: sha256:7043076348bf5040220df6ad703798fd8593a0918d06d3ce30c6c93be117e430

Status: Downloaded newer image for alpine:latest

kinnaryjangla@dev-abc:~/code/test$

docker search

You can search for Docker images using the docker search command.

Usage:

docker search [OPTIONS] TERM

Example:

kinnaryjangla@dev-abc:~/code/test$ docker search alpine

NAME DESCRIPTION STARS OFFICIAL AUTOMATED

alpine A minimal Docker image based on Alpine Lin... 4203 [OK]

mhart/alpine-node Minimal Node.js built on Alpine Linux 379

anapsix/alpine-java Oracle Java 8 (and 7) with GLIBC 2.28 over... 346 [OK]

gliderlabs/alpine Image based on Alpine Linux will help you ... 177

frolvlad/alpine-glibc Alpine Docker image with glibc (~12MB) 162 [OK]

alpine/git A simple git container running in alpine ... 46 [OK]

kiasaki/alpine-postgres PostgreSQL docker image based on Alpine Linux 42 [OK]

zzrot/alpine-caddy Caddy Server Docker Container running on A... 32 [OK]

easypi/alpine-arm AlpineLinux for RaspberryPi 30

davidcaste/alpine-tomcat Apache Tomcat 7/8 using Oracle Java 7/8 wi... 30 [OK]

byrnedo/alpine-curl Alpine linux with curl installed and set a... 17 [OK]

etopian/alpine-php-wordpress Alpine WordPress Nginx PHP-FPM WP-CLI 15 [OK]

hermsi/alpine-sshd Dockerize your OpenSSH-server upon a light... 12 [OK]

davidcaste/alpine-java-unlimited-jce Oracle Java 8 (and 7) with GLIBC 2.21 over... 11 [OK]

hermsi/alpine-fpm-php Dockerize your FPM PHP 7.2 upon a lightwei... 10 [OK]

alpine/socat Run socat command in alpine container 10 [OK]

graze/php-alpine Smallish php7 alpine image with some commo... 9 [OK]

yobasystems/alpine-xen-orchestra Xen Orchestra running on Alpine Linux [doc... 8 [OK]

masterroshi/xmrig-alpine Cryptonote CPU Miner wrapped in a Alpine D... 8

spotify/alpine Alpine image with `bash` and `curl`. 5 [OK]

tenstartups/alpine Alpine linux base docker image with useful... 5 [OK]

functions/alpine Alpine Linux / BusyBox with the OpenFaaS w... 4

govuk/gemstash-alpine Gemstash server running on Alpine 3 [OK]

casept/alpine-amd64 A basic alpine linux image. 0

smartentry/alpine alpine with smartentry 0 [OK]

docker image ls

The docker image ls command is used to list all the Docker images on the host machine.

Usage:

docker image ls [OPTIONS] [REPOSITORY[:TAG]]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker image ls

REPOSITORY TAG IMAGE ID CREATED SIZE

ubuntu latest cd6d8154f1e1 3 days ago 84.1MB

openjdk 7 bd6110589d1b 4 days ago 472MB

alpine latest 11cd0b38bc3c 2 months ago 4.41MB

docker container ps

The docker container ps command is used to list all containers running on the host.

Usage:

docker container ps [OPTIONS]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker container ps

CONTAINER ID IMAGE COMMAND CREATED

e55ce4b2e4f5 alpine "./bin/docker_run_..." 6 days ago

119b4b5eed95 ubuntu "./bin/docker_run_..." 6 days ago

docker container rm

The docker container rm command is used to remove one or more containers. You cannot remove a running container without the –f flag to force it, which first stops the container and then removes it. In order to do that, you must first stop the container, using docker container stop <container-id>. This command execution is shown following:

Usage:

docker container rm [OPTIONS] CONTAINER [CONTAINER...]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker container stop e55ce4b2e4f5

kinnaryjangla@dev-abc:~/code/test$ docker image rm 119b4b5eed95

docker container inspect

This command allows you to inspect the details of a container.

Usage:

docker container inspect [OPTIONS] CONTAINER [CONTAINER...]

Example:

kinnaryjangla@dev-abc:~/code/test$ docker container inspect f9d4b5c9aa49

[

{

"Id": "f9d4b5c9aa49fb22b23ae0d377236e1da80ceebc14c67550e36f6c0345eb2062",

"Created": "2018-08-03T06:11:54.181815872Z",

"Path": "/test/bin/entry_point.sh",

"Args": [],

"State": {

"Status": "running",

"Running": true,

"Paused": false,

"Restarting": false,

"OOMKilled": false,

"Dead": false,

"Pid": 30163,

"ExitCode": 0,

"Error": "",

"StartedAt": "2018-08-03T06:11:58.414063235Z",

"FinishedAt": "0001-01-01T00:00:00Z"

"Image": "sha256:b657c637b59170b7ea275d0af93fed7b89b1c286aeacd5438052955911a89d7a",

"ResolvConfPath": "/var/lib/docker/containers/f9d4b5c9aa49fb22b23ae0d377236e1da80ceebc14c67550e36f6c0345eb2062/resolv.conf",

"HostnamePath": "/var/lib/docker/containers/f9d4b5c9aa49fb22b23ae0d377236e1da80ceebc14c67550e36f6c0345eb2062/hostname",

"HostsPath": "/var/lib/docker/containers/f9d4b5c9aa49fb22b23ae0d377236e1da80ceebc14c67550e36f6c0345eb2062/hosts",

"LogPath": "/var/lib/docker/containers/f9d4b5c9aa49fb22b23ae0d377236e1da80ceebc14c67550e36f6c0345eb2062/f9d4b5c9aa49fb22b23ae0d377236e1da80ceebc14c67550e36f6c0345eb2062-json.log",

"Name": "/webapp_selenium-chrome_1",

"RestartCount": 0,

"Driver": "overlay",

"MountLabel": "",

These are some basic commands you can start to explore. Let’s go through a little end-to-end Hello World example.

In the following example, we’ll pull an existing Hello World image, run it, and view the images and containers.

1.
First, pull the hello-world Docker image. This will pull the image from the Docker Hub registry.
kinnaryjangla@dev-abc:~/code/test$ docker image pull hello-world
Using default tag: latest
Latest: Pulling from library/hello-world
9bbdshfg673e39ja: Pull complete
Digest: sha256: fkjdh7t6dauadubiadadia8dya98777da9fiudfhd9a86fidfbdfid8fydisch
Status: Downloaded newer image for hello-world:latest
2.
Use docker images to view the image that was just pulled, as shown following.
kinnaryjangla@dev-abc:~/code/test$ docker image ls
REPOSITORY TAG IMAGE ID CREATED SIZE
Hello-world latest ekjad89sjdfd 2 months ago
1.85kB
3.
Now run the hello-world image, which will run this image inside a new container, as follows.
kinnaryjangla@dev-abc:~/code/test$ docker container run hello-world
latest: Pulling from library/hello-world
d1725b59e92d: Pull complete
Digest: sha256:0add3ace90ecb4adbf7777e9aacf18357296e799f81cabc9fde470971e499788
Status: Downloaded newer image for hello-world:latest
Hello from Docker!
This message shows that your installation appears to be working correctly.
To generate this message, Docker took the following steps:
1. The Docker client contacted the Docker daemon.
2. The Docker daemon pulled the "hello-world" image from the Docker Hub.
(amd64)
3. The Docker daemon created a new container from that image which runs the executable that produces the output you are currently reading.
4. The Docker daemon streamed that output to the Docker client, which sent it to your terminal.
To try something more ambitious, you can run an Ubuntu container with:
$ docker run -it ubuntu bash
Share images, automate workflows, and more with a free Docker ID:
https://hub.docker.com/
For more examples and ideas, visit:
https://docs.docker.com/get-started/

This preceding example lets you go through an end-to-end scenario of pulling an existing Docker image, viewing the image, and running the image.

In the next chapter, we’ll take a closer look at how to create Docker images using Dockerfiles and run these images inside Docker containers.

Summary

In this chapter, we looked in detail at the Docker terminology that has been commonly used in the previous chapters of this book and that will continue to be used in future chapters.

We also examined the different components of the Docker architecture, including the Docker Engine, Docker Hub, Docker clients, Docker hosts, and Docker registries. We also saw how different Docker objects are created by the Docker daemon. We saw how Docker Hub can be used to pull existing Docker images and buy, sell, or distribute images for free.

Additionally, you saw how to install Docker on the Mac OS platform in detail.

We looked at some basic Docker commands, with sample usage and examples of each command, so that you can explore them further. We then walked through a simple end-to-end example of pulling the existing Hello World image and running it.

In the next chapter, I’ll go more into detail on how to build an image from a Dockerfile and run it inside a container.

Kinnary JanglaAccelerating Development Velocity Using Dockerhttps://doi.org/10.1007/978-1-4842-3936-0_5

5. Docker Images

Kinnary Jangla¹

(1)

San Francisco, CA, USA

A Docker image is an immutable read-only file system that is a snapshot of the entire package of an application, including the dependencies, configuration, and settings.

In this chapter, you’ll learn about Dockerfile and its basics. We’ll build images using Dockerfiles and then view the running images. We’ll then run these images inside a Docker container, and you’ll discover how to attach the container to our local terminal input/output.

Docker Images

As mentioned previously, Docker images are read-only and immutable and created with the docker image build command . They are stored inside a Docker registry and run inside a container. Images can become quite large very quickly. Therefore, they are designed to be composed of layers of other images, allowing a minimal amount of data to be sent when transferring images over a network. So, you can build your own customized image on top of an existing image. When you modify that image, new layers are added that contain your changes.

As for Docker containers, you’ll learn about them in more detail later in this chapter, but to summarize with a programming metaphor, if an image is a class, then a container is an instance of a class, that is, a runtime object. While images are lightweight and portable encapsulations of an environment, containers are the running instances of images.

Furthermore, a Docker image is created using a Dockerfile. Let’s see what a Dockerfile is. Later on, you’ll learn how to build a Docker image from a Dockerfile.

Dockerfile

Everything Docker begins with a Dockerfile. The Dockerfile is the instruction set on how to build an image. It the basis on which your entire Docker container is built. It specifies all the configuration settings environment variables, volumes to be mounted, the base image to build on top of, the list of dependencies, etc. All this is then bundled into an image that then runs inside the container.

A Dockerfile must be built to create the Docker image of an application. The image is just the “compiled version” of the source code that lives inside the Dockerfile. The Dockerfile is a text file that contains a set of instructions or commands that are then assembled into an image.

Creating a Sample Dockerfile

Let’s create a sample Dockerfile next. To begin, create a file called Dockerfile inside a directory called docker.

kinnaryjangla@dev-abc:~/code/docker$ vim Dockerfile

Build your Dockerfile using the following commands. Replace the LABEL maintainer email with your e-mail address.

#This is a sample image

FROM ubuntu

LABEL maintainer="email@example.com"

RUN apt-get update

RUN apt-get install –y nginx

CMD ["echo", "Hello World!"]

Let’s look at the instructions in the preceding Dockerfile.

1.
The first line, #This is a sample image, is a comment. You can add other comments to the Dockerfile for readability using the # command.
2.
The FROM keyword is used to tell Docker which base image you want to build your customized image on top of. This instruction is mandatory.
3.
LABEL is a non-executable instruction used to indicate the author of the Dockerfile.
4.
The RUN instruction is used to execute a command on top of an existing image. That in turn creates another layer with the results of the execution of the command on top of the image. For example, if there is a precondition to install PHP before running an application, you can run appropriate commands to install PHP on top of the base image (say, Ubuntu), as shown following.
FROM ubuntu
RUN apt-get update && update apt-get install –y php
5.
The CMD command doesn’t execute anything during the build time. It just specifies the intended command for the image. The difference between the CMD and the RUN command is that RUN actually executes the command during build time. If you have multiple CMD instructions in the Dockerfile, only the last one will take effect.

Following are some other commands that can come in handy when creating the Dockerfile:

ENV: This instruction can be used to set the environment variables in the container as shown following.

#Default environment variables requires to run service, can be overridden by docker run
ENV CONFIG_FILE=config/config.service.test.properties \
         HEAP_SIZE=6G \
         LOG4J_CONFIG_FILE=config/log4j_local.xml \
         NEW_SIZE=4G
COPY: This instruction is used to copy the files and directories from a specified source to a specified destination (in the file system of the container), as follows.

COPY conditions.txt /usr/tmp
ADD: The ADD instruction is like the COPY instruction. It has some additional features, such as support for remote URLs. The COPY instruction is more readable, so if you don’t need the extra supported features that ADD provides, it’s recommended that you use the COPY instruction instead. See the following usage. Tar or zip files will be auto-expanded when you add one to a source destination.

ADD http://www.xyz.com/sample.tar.xz /usr/src
WORKDIR: This is used to set the currently active directory for other instructions, such as RUN, CMD, ENTRYPOINT, COPY, and ADD. See the following paragraph for a usage example.
If you provide a relative path as the WORKDIR, it will be taken as relative to the path of the previous WORKDIR instruction.
WORKDIR /user
WORKDIR home
USER: This is used to set the UID (or username) to use when running the image or any subsequent commands. See the following usage.

USER daemon
VOLUME: This instruction specifies a path in which data should be persisted longer than the life of the container. See the following usage.

VOLUME /data
ENTRYPOINT: This command is the primary command of your Docker image.

This command is set in such a way that whenever you run the image, the ENTRYPOINT command will be executed every time.

You can also pass arguments here, but they are optional. You can pass them when you run the image with something such as docker run <image-name>.

Also, all the elements specified using CMD will be overridden, except the arguments. They will be passed to the command specified in ENTRYPOINT. Following is a sample usage.

CMD "Hello World!"

ENTRYPOINT echo

Save this file, and in the next section, you’ll see how to build an image from this Dockerfile.

Building Images with Dockerfile

As you’ve learned so far, Docker images are immutable, read-only file systems. Images can be based on other existing images that can be pulled from Dockerfile. This makes modifying them a lot easier, because the only thing that changes is the layer that gets modified. This also prevents images from becoming extremely large in size.

In the previous section, we created a Dockerfile called Dockerfile with some basic instructions and saved it in a directory called docker.

Let’s continue to build an image from the Dockerfile created in the previous section. From the docker directory, run the command docker image build. The . builds the Dockerfile within the directory.

When you run this command for the first time, you’ll see a long list of packages being pulled, because we’re building our image on top of the Ubuntu image.

I am going to divide the output in multiple sections, to make it easier to read. You should be able to see this entire output, if your image is built successfully.

As per the Dockerfile, each instruction is built sequentially. In the following sequence, you see first (Step 1/5) some images get pulled successfully from the base Ubuntu image. Step 2/5 assigns the author of the image to the image. In Step 3/5, the apt-get update command runs on top of the base Ubuntu image.

kinnaryjangla@dev-abc:~/code/docker$ docker image build .

Sending build context to Docker daemon 2.048kB

Step 1/5 : FROM ubuntu

latest: Pulling from library/ubuntu

124c757242f8: Pull complete

9d866f8bde2a: Pull complete

fa3f2f277e67: Pull complete

398d32b153e8: Pull complete

afde35469481: Pull complete

Digest: sha256:de774a3145f7ca4f0bd144c7d4ffb2931e06634f11529653b23eba85aef8e378

Status: Downloaded newer image for ubuntu:latest

---> cd6d8154f1e1

Step 2/5 : LABEL maintainer "kijangla@example.com"

---> Running in 2d6e3abeff60

---> b7df3b688aca

Removing intermediate container 2d6e3abeff60

Step 3/5 : RUN apt-get update

---> Running in 8bd46979c5fa

Moving forward as part of Step 3/5, a bunch of other packages are installed as apt-get update is executed.

Step 3/5 : RUN apt-get update

---> Running in 8bd46979c5fa

Get:1 http://security.ubuntu.com/ubuntu bionic-security InRelease [83.2 kB]

Get:2 http://archive.ubuntu.com/ubuntu bionic InRelease [242 kB]

Get:3 http://security.ubuntu.com/ubuntu bionic-security/universe Sources [17.4 kB]

Get:4 http://archive.ubuntu.com/ubuntu bionic-updates InRelease [88.7 kB]

Get:5 http://security.ubuntu.com/ubuntu bionic-security/multiverse amd64 Packages [1363 B]

Get:6 http://security.ubuntu.com/ubuntu bionic-security/main amd64 Packages [203 kB]

Get:7 http://archive.ubuntu.com/ubuntu bionic-backports InRelease [74.6 kB]

Get:8 http://archive.ubuntu.com/ubuntu bionic/universe Sources [11.5 MB]

Get:9 http://security.ubuntu.com/ubuntu bionic-security/universe amd64 Packages [69.0 kB]

Get:10 http://archive.ubuntu.com/ubuntu bionic/main amd64 Packages [1344 kB]

Get:11 http://archive.ubuntu.com/ubuntu bionic/universe amd64 Packages [11.3 MB]

Get:12 http://archive.ubuntu.com/ubuntu bionic/multiverse amd64 Packages [186 kB]

Get:13 http://archive.ubuntu.com/ubuntu bionic/restricted amd64 Packages [13.5 kB]

Get:14 http://archive.ubuntu.com/ubuntu bionic-updates/universe Sources [70.4 kB]

Get:15 http://archive.ubuntu.com/ubuntu bionic-updates/universe amd64 Packages [226 kB]

Get:16 http://archive.ubuntu.com/ubuntu bionic-updates/main amd64 Packages [401 kB]

Get:17 http://archive.ubuntu.com/ubuntu bionic-updates/multiverse amd64 Packages [3925 B]

Get:18 http://archive.ubuntu.com/ubuntu bionic-backports/universe amd64 Packages [2807 B]

Fetched 25.9 MB in 3s (8072 kB/s)

Reading package lists...

---> e8081b840106

Removing intermediate container 8bd46979c5fa

Furthermore, Step 4/5 gets executed where the apt-get install -y nginx command runs. As a part of this run command, it builds a dependency tree and installs more packages.

Step 4/5 : RUN apt-get install –y nginx

---> Running in 4e8613ee2337

Reading package lists...

Building dependency tree...

Reading state information...

The following additional packages will be installed:

fontconfig-config fonts-dejavu-core geoip-database libbsd0 libexpat1

libfontconfig1 libfreetype6 libgd3 libgeoip1 libicu60 libjbig0

libjpeg-turbo8 libjpeg8 libnginx-mod-http-geoip

libnginx-mod-http-image-filter libnginx-mod-http-xslt-filter

libnginx-mod-mail libnginx-data libxau6 libxdmcp6 libxml libxpm4

libxslt1.1 multiarch-support nginx-common nginx-code ucf

Suggested packages:

Libgd-tools geoip-bin fcgiwrap nginx-doc ssl-cert

The following NEW packages will be installed:

fontconfig-config fonts-dejavu-core geoip-database libbsd0 libexpat1

libfontconfig1 libfreetype6 libgd3 libgeoip1 libicu60 libjbig0

libjpeg-turbo8 libjpeg8 libnginx-mod-http-geoip

libnginx-mod-http-image-filter libnginx-mod-http-xslt-filter

libnginx-mod-mail libnginx-data libxau6 libxdmcp6 libxml libxpm4

libxslt1.1 multiarch-support nginx-common nginx-code ucf 0 upgraded, 35 newly installed, 0 to remove and 8 no upgraded.

Need to get 16.1 MB of archives.

Soon after, it will install some additional archives.

Need to get 16.1 MB of archives