Kubernetes terms - a topological approach

Coming from plain old Docker, it's confusing to navigate all the different terms used in Kubernetes - especially because a lot of them already have different meanings in other contexts. Here's an attempt to introduce the basic Kubernetes terms one at a time. I will try to base every definition on terms already introduced, but occasionally a definition may refer to a term explained just below.

Terms are highlighted when they refer to their definition within the Kubernetes context. If a term is not highlighted, read it in its usual sense.

Container runtime

When running Docker containers, a container runtime is needed (similar to the JVM for Java and the .NET runtime for C#). Docker uses containerd, which in turn delegates to the low-level runtime runc. Alternatives exist at both levels, such as CRI-O in place of containerd and crun in place of runc.

Resource

Any type of object residing in your cluster (e.g. pods, deployments, services, ingresses).

Manifest

The YAML (or JSON) document that describes a resource.
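As a rough illustration, a manifest is just structured data. The sketch below builds a minimal Pod manifest as a Python dictionary and prints it as JSON, which Kubernetes accepts alongside YAML. The name `web` and the image `nginx:stable` are placeholders chosen for this example, not something from a real setup.

```python
import json

# A minimal Pod manifest expressed as plain data.
# "web" and "nginx:stable" are placeholder values for illustration only.
pod_manifest = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "web", "labels": {"app": "web"}},
    "spec": {
        "containers": [
            {
                "name": "web",
                "image": "nginx:stable",
                "ports": [{"containerPort": 80}],
            }
        ]
    },
}


def to_json(manifest: dict) -> str:
    """Serialize a manifest to JSON text."""
    return json.dumps(manifest, indent=2)


if __name__ == "__main__":
    print(to_json(pod_manifest))
```

If the output is saved to a file such as pod.json, it could be applied with `kubectl apply -f pod.json`.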

Pod

The smallest deployable Kubernetes object, wrapping one or more containers. It is an abstraction over the container, decoupling the rest of Kubernetes from the container runtime. If there is more than one container in a pod, there is usually one main container plus one or more sidecar containers (e.g. a web server and a log handler).

Service

Because pods are considered ephemeral and may die at any time, it is very inconvenient to connect directly to a pod's IP address. Services therefore act as a stable front for a group of pods, forwarding and load balancing traffic to the pods behind them. There are different kinds of services (e.g. ClusterIP, NodePort and LoadBalancer), each with its own use cases.
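To make the grouping idea concrete: a service usually finds its pods through a label selector. The toy model below is a sketch under simplifying assumptions, not how Kubernetes implements it. The `Pod` and `ToyService` classes are made up for this example, and round-robin is used only because it is easy to follow (the real selection strategy depends on the proxy mode).

```python
import itertools
from dataclasses import dataclass, field


@dataclass
class Pod:
    name: str
    ip: str
    labels: dict = field(default_factory=dict)


def select_pods(pods, selector):
    """Return the pods whose labels contain every key/value pair in the selector."""
    return [
        p for p in pods
        if all(p.labels.get(key) == value for key, value in selector.items())
    ]


class ToyService:
    """Toy model: one stable entry point in front of a changing set of pod IPs."""

    def __init__(self, selector):
        self.selector = selector
        self._counter = itertools.count()

    def route(self, pods):
        """Pick a backend pod IP in round-robin order among the current matches."""
        backends = select_pods(pods, self.selector)
        if not backends:
            raise RuntimeError("no pods match the selector")
        return backends[next(self._counter) % len(backends)].ip


if __name__ == "__main__":
    pods = [
        Pod("web-a", "10.0.0.1", {"app": "web"}),
        Pod("web-b", "10.0.0.2", {"app": "web"}),
        Pod("db-a", "10.0.0.3", {"app": "db"}),
    ]
    service = ToyService({"app": "web"})
    for _ in range(3):
        print(service.route(pods))
```

Callers only ever talk to the service, so a pod can be replaced by a new one with a different IP without any caller noticing.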

Ingress

A resource that manages external access to services in the cluster. It routes incoming HTTP and HTTPS traffic, and can handle SSL/TLS termination and load balancing. Requires an ingress controller to work (basically a reverse proxy, e.g. NGINX).

Application

A collection of different Kubernetes resources that together make up what is needed to run your app, including auxiliary services (e.g. a Postgres database and Redis). Not strictly a Kubernetes term, but used for example in Google Kubernetes Engine, and ArgoCD introduces it as a custom resource.

Deployment

An overarching resource for the deployment of pods. For availability and performance reasons, it's possible to run pods with replicas, for example two or more instances of a Node web application. In these cases, services will serve as load balancers. For stateful applications (e.g. databases), a separate but similar resource called a StatefulSet is used instead.
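As a sketch of what replicas look like in practice, the helper below builds a Deployment manifest as a Python dictionary. The function name `deployment_manifest`, the app name `node-app` and the image `node:20` are assumptions made for this example. The field layout follows the `apps/v1` Deployment schema, where the selector must match the labels in the pod template.

```python
import json


def deployment_manifest(name: str, image: str, replicas: int) -> dict:
    """Build a minimal apps/v1 Deployment manifest for a single-container pod."""
    labels = {"app": name}
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": name},
        "spec": {
            # Number of identical pods the deployment should keep running.
            "replicas": replicas,
            # The selector must match the labels in the pod template below.
            "selector": {"matchLabels": dict(labels)},
            "template": {
                "metadata": {"labels": dict(labels)},
                "spec": {"containers": [{"name": name, "image": image}]},
            },
        },
    }


if __name__ == "__main__":
    # Placeholder values: two replicas of a hypothetical Node web application.
    print(json.dumps(deployment_manifest("node-app", "node:20", 2), indent=2))
```

A service selecting `app: node-app` would then spread traffic over both replicas.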

Node (worker node)

A virtual or physical machine connected to a cluster. A cluster can distribute its applications over several nodes - both for availability and resource reasons. You may, for example, want to dedicate one node to a specific application.

Kubelet

A process running on each node that starts and manages containers on that node.

Kube proxy

A process running on each node that maintains the network rules (e.g. iptables or IPVS) which forward traffic addressed to a service to one of that service's pods, on any node in the cluster. By default it does not prefer pods on the same node, although services can be configured to keep traffic node-local. Can be replaced by other networking implementations (e.g. GKE Dataplane V2 uses eBPF).

Cluster

The overarching, top-level entity in Kubernetes. Consists of a control plane and one or more worker nodes.

Control plane

Used to manage the Kubernetes cluster. Consists of the four components listed below. These components may run on dedicated machines, on the same machines as one or more of your nodes, or somewhere in the cloud (as with managed Kubernetes offerings from cloud providers). They may also be replicated for availability and performance reasons.

API Server (kube-apiserver)

The REST API used to manage your cluster.
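To give a feel for the REST API, the sketch below shows how the URL path for a namespaced resource is laid out. The helper `resource_path` is a hypothetical name for this example. The path convention itself (core resources under `/api`, grouped resources such as `apps/v1` under `/apis`) is how the API server organizes its endpoints.

```python
def resource_path(api_version: str, plural: str, namespace: str, name: str = "") -> str:
    """Build the API server path for a namespaced resource or collection."""
    # Core resources use apiVersion "v1" and live under /api;
    # grouped resources (e.g. "apps/v1") live under /apis.
    prefix = "/api" if "/" not in api_version else "/apis"
    path = f"{prefix}/{api_version}/namespaces/{namespace}/{plural}"
    return f"{path}/{name}" if name else path


if __name__ == "__main__":
    print(resource_path("v1", "pods", "default"))
    print(resource_path("apps/v1", "deployments", "default", "web"))
```

A GET on the first path lists the pods in the `default` namespace, and a GET on the second fetches a single deployment named `web` (a placeholder name).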

Scheduler (kube-scheduler)

Makes sure that pods are matched to nodes so that kubelet can run them.
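The matching can be pictured as two steps: filter out nodes that cannot fit the pod, then score the remaining ones and pick the best. The toy scheduler below is a heavily simplified sketch of that idea. The `Node` and `PodRequest` classes and the "most free CPU left" scoring rule are assumptions for this example, not the real kube-scheduler policy.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    free_cpu_millicores: int
    free_memory_mib: int


@dataclass
class PodRequest:
    name: str
    cpu_millicores: int
    memory_mib: int


def schedule(pod, nodes):
    """Pick a node for the pod, or return None if no node has room."""
    # Filter: keep only nodes with enough free CPU and memory.
    feasible = [
        n for n in nodes
        if n.free_cpu_millicores >= pod.cpu_millicores
        and n.free_memory_mib >= pod.memory_mib
    ]
    if not feasible:
        return None  # in a real cluster the pod would stay Pending

    # Score: prefer the node with the most CPU left after placement.
    best = max(feasible, key=lambda n: n.free_cpu_millicores - pod.cpu_millicores)

    # Reserve the resources so later pods see the updated capacity.
    best.free_cpu_millicores -= pod.cpu_millicores
    best.free_memory_mib -= pod.memory_mib
    return best.name


if __name__ == "__main__":
    nodes = [Node("node-1", 500, 1024), Node("node-2", 2000, 4096)]
    print(schedule(PodRequest("web", 1000, 512), nodes))
```

The scheduler only decides where a pod should run; actually starting the containers is left to the kubelet on the chosen node.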

Controller Manager (kube-controller-manager)

Control process that regulates the state of your cluster, initiating changes when the current state differs from the desired state (e.g. new applications are added, or the replica count of a deployment has changed). It makes its changes through the API server, which the kubelets on the different nodes watch and act on.
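The comparison of current and desired state can be sketched as a small function. This is a minimal illustration of the idea for a replica count only. The function name `reconcile` and the action tuples are made up for this example, and a real controller runs this kind of comparison continuously rather than once.

```python
def reconcile(desired_replicas: int, running: list) -> list:
    """Return the actions needed to move from the current state to the desired state."""
    diff = desired_replicas - len(running)
    if diff > 0:
        # Too few pods: request one new pod per missing replica.
        return [("create", None)] * diff
    if diff < 0:
        # Too many pods: delete the surplus ones at the end of the list.
        return [("delete", name) for name in running[desired_replicas:]]
    return []  # current state already matches the desired state


if __name__ == "__main__":
    print(reconcile(3, ["web-a"]))
    print(reconcile(1, ["web-a", "web-b", "web-c"]))
    print(reconcile(2, ["web-a", "web-b"]))
```

When a pod dies, the running list shrinks, and the next pass produces a create action that brings the count back up.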

etcd

The configuration store for the cluster. A conceptually simple, distributed key-value store used to store and maintain the state of the cluster. Only the API server talks to it directly; all other components read and write cluster state through the API server.

Kubectl

Official CLI tool for managing your cluster. Uses the API Server.

Tutorial Kubernetes Docker


Øystein Blixhavn
