Docker for newbies š³
Install Docker if you havenāt already and follow along for a gentle introduction to the world of containers āØš³ Weāll be using this FastAPI project for demonstration purposes but the principles will apply to most applications.
Why do we use Docker?
Thereās two fundamental reasons why teams use Docker:
- Creating reproducible development and production environments ā chances are that in a team not everyone will use the exact same machine and/or OS e.g. some of us might use Windows, Mac or Linux, we might use different MacBook models etc. Because of these differences, it is hard to guarantee that our application will behave exactly the same when run locally e.g. some dependencies might be incompatible in a particular machine. Docker solves this problem, by āsimulatingā a machine with some specific properties (memory, OSā¦).
- Deploying containers is more efficient than deploying virtual machines ā before Docker was around, teams would typically deploy instances of their applications in so called āvirtual machinesā. Docker instead, creates ācontainersā which share some of the resources such as the OS among each other.
Images & Containers:
So what is actually going on when we use Docker? Docker allows us to create running instances of our applications, called ācontainersā. Containers are created based on images, which are defined in Dockerfiles. You can think of these like classes and objects in OOP languages.
Image (class) ā Container (instance)
An image is first built and a container can then run based on that image. If a change is added to an image, it must first be re-built for the changes to be applied next time a container is run.
In the following Dockerfile, we use the FROM
statement to build our image on top of the python:3.9-slim-bullseye base image, which uses the Debian linux distribution:
FROM python:3.9-slim-bullseye
ENV PYTHONDONTWRITEBYTECODE 1
ENV PYTHONUNBUFFERED 1
WORKDIR /code
COPY requirements.txt /code/
RUN pip install -r /code/requirements.txt
COPY /scripts/install-wait-for-it.sh .
RUN /bin/sh install-wait-for-it.sh
COPY . /code
ENV PYTHONPATH "${PYTHONPATH}:/code"
EXPOSE 8000
CMD bash -c "alembic upgrade head && uvicorn app.main:app --host 0.0.0.0 --port $PORT --reload"
Again, this is pretty similar to class inheritance in an OOP language. When we run this image, our code will behave as if it was running in a Debian Linux machine. As a result, this also guarantees that our application will behave the same when deployed to production (since it will also run as a Docker container). In addition, we install our requirements, expose port 8000 and execute a command for running our migrations using alembic (in order to create our database tables) and also to run our backend server.
Deployment:
When changes are deployed, we usually push images to a storage entity called a registry. Depending on our cloud provider, this might be GCR (Google) or AWS Container Registry (Amazon) among others. Often, teams will use an additional tool called Kubernetes to manage the deployment and behaviour of Docker containers in an entity called a Kubernetes cluster.
Docker Compose:
Most applications will require multiple Docker containers instead of just one. Docker-compose allows us to define and run many containers inside a single network. We can also define dependencies. This is what a docker-compose.yml file is for. Here we define containers under the services section. Notice how the backend container depends on the database container. This is because our API requires a database server running in the background. In the example we have defined an additional container for running our test suite. In addition, docker-compose makes it easier to define volumes (a type of persistence), ports or environment variables.
version: "3.9"
services:
db:
container_name: db
image: postgres:13.3-alpine
env_file:
- .env
volumes:
- ./config/postgresql.conf:/etc/postgresql.conf
ports:
- "5432:5432"
command: postgres -c config_file=/etc/postgresql.conf
backend:
container_name: backend
build: .
env_file:
- .env
volumes:
- .:/code
ports:
- "8000:8000"
depends_on:
- db
restart: always
python_test:
container_name: tests
build: .
profiles:
- python_test
env_file:
- .env
volumes:
- .:/code
restart: always
command:
- pytest
*Make sure you have the latest docker-compose version installed on your system. This docker-compose file makes use of profiles which are not available in previous versions. Profiles are used here so that the python_test container is not run when we execute docker-compose up. Instead we want to run this container explicitly as we will see below.
Basic Commands
Pull an image from docker registry
docker pull image_nameBuild an image from a dockerfile
docker build docker_file_nameRun a container (will pull and build if image is not found)
docker run image_nameRun a command on a running container
docker exec container_name or container_id command_nameList all built images
docker images
(notice the size and creation date of each image)List all running containers
docker psRun all containers defined as services in docker-compose.yml
docker-compose up
(The images are built the first time we run this)Check container logs
docker logs container_name or container_idStop all running containers
docker-compose down
Introducing the Makefile
Sometimes it might be convenient to chain some of these commands during the development process (stop all running containers, build the images again, run all containers with the new images, migrate the database so that our tables are createdā¦etc.) One way of achieving this is using a Makefile. We can define new commands based on all these smaller actions. Hereās an example:
start: ## Start project containers defined in docker-compose
docker-compose up -d
stop: ## Stop project containers defined in docker-compose
docker-compose stop
destroy: ## Destroy all project containers
docker-compose down -v --rmi local --remove-orphans
clean: ## Delete all volumes, networks, images & cache
docker system prune -a --volumes
test: recreate ## Run tests and generate coverage report
docker-compose build python_test
docker-compose run python_test
recreate: destroy start ## Destroy project containers and start them again
With make test, we stop all running containers, rebuild their images, run the containers again with any new changes and run our test suite.
*The make command is a unix tool for running Makefiles and is not available on Windows. Thereās a few workarounds such as using NMAKE.
Bringing everything together
Clone the example repository if you havenāt already and try running docker-compose up -d and then docker ps. You can then visit the API docs at http://localhost:8000/docs. This request will in turn show up in the backend logs, which you can check with docker logs backend. The output will look something like the following:
INFO: Will watch for changes in these directories: ['/code']
INFO: Uvicorn running on http://0.0.0.0:8000 (Press CTRL+C to quit)
INFO: Started reloader process [8] using watchgod
INFO: Started server process [10]
INFO: Waiting for application startup.
INFO: Application startup complete.
INFO: 172.20.0.1:58780 - "GET /docs HTTP/1.1" 200 OK
INFO: 172.20.0.1:58780 - "GET /openapi.json HTTP/1.1" 200 OK
With Docker Desktop we can check the status of our containers:
Container logs are very useful for debugging what goes on inside our application. You can also check these through Docker desktop:
Similarly, we also log all SQL statements in our postgres container:
And if we want to interact with our local database directly we can also run:
docker exec -it db psql -h db -d test_db -U postgres
*Here the -h stands for host, -d stands for database and -U for user. We defined all of these in the .env file.
This will prompt us for the password, which we have also set as an environment variable in our .env file. We can then use the \dt postgres command to list all existing tables and check that our alembic migration script worked correctly:
Password for user postgres:
psql (13.3)
Type "help" for help.test_db=# \dt
List of relations
Schema | Name | Type | Owner
--------+-----------------+-------+----------
public | alembic_version | table | postgres
public | books | table | postgres
public | users | table | postgres
(3 rows)
And thatās it! You should now have a working knowledge of how to interact with Docker containers as well as some background info on why teams use them. You can check Dockerās official documentation for more information on the different image instructions that are available. If you have an existing project, try to dockerize it and use some of the commands in this tutorial. Building something yourself is normally the best way of learning. Good luck!