- Kubernetes Basics
- K8s Capacity
- Kubernetes best practices
- K8s Architecture
- Kubernetes Networking
- K8s monitoring using Prometheus
Container orchestration is all about managing the life cycles of containers, especially in
large, dynamic environments.
Container orchestration can be used to perform many tasks, some of which include:
- Provisioning and deployment of containers
- Scaling up or removing containers to spread application load evenly
- Movement of containers from one host to another if there is a shortage of resources
- Load balancing and service discovery between containers
- Health monitoring of containers and hosts
There are many container orchestration solutions available; some of the popular ones include:
- Docker Swarm
- Kubernetes
- Apache Mesos
- Elastic Container Service (AWS ECS)
There are also various managed container orchestration platforms available, like EKS, AKS, and GKE.
- Kubernetes is a container orchestration tool.
- It is a portable, extensible, open-source platform for managing containerized workloads and services that facilitates both declarative configuration and automation.
- The name Kubernetes originates from Greek, meaning helmsman or pilot.
- K8s as an abbreviation results from counting the eight letters between the "K" and the "s".
- Google open-sourced the Kubernetes project in 2014. Kubernetes combines over 15 years of Google's experience running production workloads at scale with best-of-breed ideas and practices from the community.
K8s follows a master/worker node architecture: the master node manages the worker nodes and the workloads (Pods) in the cluster. The master node (also referred to as the control plane) makes global decisions about the cluster (for example, scheduling) and detects and responds to cluster events, while worker nodes host the Pods that are the components of the application workload.
When you deploy Kubernetes, you get a cluster.
A Kubernetes cluster consists of the components that represent the control plane and a set of worker machines, called nodes, that run containerized applications.
Every cluster has at least one worker node.
Control plane: The control plane manages the worker nodes and the Pods in the cluster. In production environments, the control plane usually runs across multiple computers and a cluster usually runs multiple nodes, providing fault-tolerance and high availability.
Worker nodes: The worker node(s) host the Pods that are the components of the application workload.
The control plane's components make global decisions about the cluster (for example, scheduling), as well as detecting and responding to cluster events (for example, starting up a new pod when a deployment's replicas field is unsatisfied).
- kube-apiserver
- etcd
- kube-scheduler
- kube-controller-manager
- cloud-controller-manager
Control plane components can be run on any machine in the cluster. However, for simplicity, set up scripts typically start all control plane components on the same machine, and do not run user containers on this machine.
A control plane setup that runs across multiple VMs provides K8s high availability (HA).
- The API server is the front end for the Kubernetes control plane.
- The core of Kubernetes' control plane is the API server and the HTTP API that it exposes.
- Users, the different parts of your cluster, and external components all communicate with one another through the API server.
- kube-apiserver is designed to scale horizontally—that is, it scales by deploying more instances.
You can run several instances of kube-apiserver and balance traffic between those instances.
- The kubectl command's job is to connect to kube-apiserver and communicate with it using the Kubernetes API.
- kube-apiserver also authenticates incoming requests, determines whether they are authorized and valid, and manages admission control.
- But it’s not just kubectl that talks with kube-apiserver. In fact, any query or change to the cluster’s state must be addressed to the kube-apiserver.
- The API Server is the only Kubernetes component that connects to etcd; all the other components must go through the API Server to work with the cluster state.
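For example, kubectl is only a convenience wrapper over that HTTP API; assuming a working kubeconfig, you can issue a raw request to the API server yourself:
kubectl get --raw /api/v1/namespaces/default/pods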
etcd is a distributed reliable key-value store.
In a Linux environment, all the configuration is stored in the /etc directory. The name etcd is inspired by that, with a "d" appended for "distributed".
etcd reliably stores the configuration data of the Kubernetes cluster, representing the state of the cluster (what nodes exist in the cluster, what pods should be running, which nodes they are running on, and a whole lot more) at any given point of time.
- A Kubernetes cluster stores all its data in etcd.
- Any node crashing or process dying causes values in etcd to be changed.
- Anything you create with kubectl create / kubectl run creates an entry in etcd.
Note: You never interact directly with etcd; instead, kube-apiserver interacts with the database on behalf of the rest of the system. etcdctl is the command-line tool to interact with etcd, but it is recommended not to use it directly. The API Server is the only Kubernetes component that connects to etcd.
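For illustration only (per the note above, direct access is not recommended), listing the keys Kubernetes stores in etcd would look roughly like this, assuming etcdctl v3 and kubeadm-style certificate paths:
ETCDCTL_API=3 etcdctl --endpoints=https://127.0.0.1:2379 \
  --cacert=/etc/kubernetes/pki/etcd/ca.crt \
  --cert=/etc/kubernetes/pki/etcd/server.crt \
  --key=/etc/kubernetes/pki/etcd/server.key \
  get /registry --prefix --keys-only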
kube-scheduler is the control plane component that watches for newly created Pods with no assigned node, and selects a node for them to run on.
But it doesn’t do the work of actually launching Pods on Nodes. Instead, whenever it discovers a Pod object that doesn’t yet have an assignment to a node, it
chooses a node and simply writes the name of that node into the Pod object.
Factors taken into account for scheduling decisions include (see the example after this list):
- individual and collective resource requirements
- hardware/software/policy constraints
- affinity specifications (which cause groups of Pods to prefer running on the same node)
- anti-affinity specifications (which ensure that Pods do not run on the same node)
- data locality
- inter-workload interference
- deadlines
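For example, a Pod like this hypothetical sketch gives the scheduler both resource requirements and an anti-affinity constraint to weigh:
apiVersion: v1
kind: Pod
metadata:
  name: web
  labels:
    app: web
spec:
  containers:
  - name: web
    image: nginx
    resources:
      requests:          # individual resource requirements
        cpu: 250m
        memory: 128Mi
  affinity:
    podAntiAffinity:     # keep app=web Pods off nodes that already run one
      requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app: web
        topologyKey: kubernetes.io/hostname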
Control plane component that runs controller processes. kube-controller-manager has a broader job: it continuously monitors the state of the cluster through kube-apiserver. Whenever the current state of the cluster doesn't match the desired state, kube-controller-manager will attempt to make changes to achieve the desired state. It's called the "controller manager" because many Kubernetes objects are maintained by loops of code called controllers. These loops of code handle the process of remediation.
Logically, each controller is a separate process, but to reduce complexity, they are all compiled into a single binary and run in a single process. Some types of these controllers are:
- Node controller: Responsible for noticing and responding when nodes go down.
- Job controller: Watches for Job objects that represent one-off tasks, then creates Pods to run those tasks to completion (see the Job sketch after this list).
- Endpoints controller: Populates the Endpoints object (that is, joins Services & Pods).
- Service Account & Token controllers: Create default accounts and API access tokens for new namespaces.
cloud-controller-manager manages controllers that interact with the underlying cloud providers.
For example, if you manually launched a Kubernetes cluster on Google Compute Engine, the cloud-controller-manager would be responsible for bringing in Google Cloud features like load balancers and storage volumes when you needed them.
If you are running Kubernetes on your own premises, or in a learning environment inside your own PC, the cluster does not have a cloud controller manager.
Node components run on every node, maintaining running pods and providing the Kubernetes runtime environment.
kubelet is an agent that runs on each node in the cluster. It makes sure that containers are running in a Pod. The kubelet takes a set of PodSpecs that are provided through various mechanisms and ensures that the containers described in those PodSpecs are running and healthy.
When the kube-apiserver wants to start a Pod on a node, it connects to that node’s kubelet.
The kubelet uses the container runtime to start the Pod, monitors its lifecycle (including readiness and liveness probes), and reports back to kube-apiserver.
The kubelet doesn't manage containers which were not created by Kubernetes.
kube-proxy is a network proxy that runs on each node in your cluster, implementing part of the Kubernetes Service concept. kube-proxy maintains network rules on nodes. These network rules allow network communication to your Pods from network sessions inside or outside of your cluster. kube-proxy uses the operating system packet filtering layer if there is one and it's available. Otherwise, kube-proxy forwards the traffic itself.
The container runtime is the software that is responsible for running containers. Kubernetes supports several container runtimes: Docker, containerd, CRI-O, and any implementation of the Kubernetes CRI (Container Runtime Interface).
At v1.19, K8s supports clusters with up to 5000 nodes.
More specifically, it supports configurations that meet all of the following criteria:
Object | Limit
---|---
Nodes | 5000
Total Pods | 150000
Total Containers | 300000
Pods per node | 100
- Multiple virtual cluster boundaries within one physical cluster
- Provides a scope for names
- Divides cluster resources between namespaces
- Useful for namespace-specific access control (see the example after this list)
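For example, a namespace can be created imperatively or declaratively (team-a is a hypothetical name):
kubectl create namespace team-a
or:
apiVersion: v1
kind: Namespace
metadata:
  name: team-a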
- Set appropriate requests and limits for your containers
- Set resource limits for namespaces using ResourceQuotas (see the sketch after this list)
- When CPU usage reaches the limit, the container is throttled
- When memory usage reaches the limit, the container is OOM-killed (and Pods may be evicted under node memory pressure)
- Proper requests and limits save cost as well
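A minimal ResourceQuota sketch (hypothetical names and values) that caps the total requests and limits allowed in the team-a namespace:
apiVersion: v1
kind: ResourceQuota
metadata:
  name: team-a-quota
  namespace: team-a
spec:
  hard:
    requests.cpu: "4"      # sum of all container CPU requests in the namespace
    requests.memory: 8Gi
    limits.cpu: "8"        # sum of all container CPU limits in the namespace
    limits.memory: 16Gi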
Detect CPU/memory waste by utilizing the metrics server. Example tools:
- CloudWatch Container Insights (AWS)
- Kubecost
- CloudHealth
- kubernetes-resource-report
- k8s security
- Application security
- Pod, Namespace, Node
- RBAC, IRSA
- DevSecOps: DevOps + security, i.e. securing the container DevOps lifecycle
- Authorization
- Scan repository
- Scan running container
- Security compliance
- FedRAMP, HIPAA, SOC, etc.
- Application security
- Detective controls
- collect and analyze audit logs
- Alarm on certain behavior
- Understand the k8s termination lifecycle (see the sketch after this list)
- Use rolling updates
- Make the app handle graceful termination
- Incident response
- Identify and isolate
- Run load and penetration testing
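A minimal sketch of the termination-lifecycle settings mentioned above (the name and sleep duration are hypothetical): Kubernetes runs the preStop hook, then sends SIGTERM, and sends SIGKILL only after the grace period expires.
apiVersion: v1
kind: Pod
metadata:
  name: web
spec:
  terminationGracePeriodSeconds: 30   # time allowed before SIGKILL
  containers:
  - name: web
    image: nginx
    lifecycle:
      preStop:
        exec:
          command: ["sh", "-c", "sleep 5"]   # give the app time to drain in-flight requests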
What is Ingress? An API object that manages external access to the services in a cluster, typically HTTP. Ingress may provide load balancing, SSL termination, and name-based virtual hosting. Ingress exposes HTTP and HTTPS routes from outside the cluster to services within the cluster. Traffic routing is controlled by rules defined on the Ingress resource.
An Ingress may be configured to give Services externally-reachable URLs, load balance traffic, terminate SSL / TLS, and offer name-based virtual hosting. An Ingress controller is responsible for fulfilling the Ingress, usually with a load balancer, though it may also configure your edge router or additional frontends to help handle the traffic.
An Ingress does not expose arbitrary ports or protocols. Exposing services other than HTTP and HTTPS to the internet typically uses a service of type Service.Type=NodePort or Service.Type=LoadBalancer.
You must have an Ingress controller to satisfy an Ingress. Only creating an Ingress resource has no effect. You may need to deploy an Ingress controller such as ingress-nginx. You can choose from a number of Ingress controllers.
- What is ingress?
- Ingress YAML configuration
- When do you need ingress?
In order for the Ingress resource to work, the cluster must have an ingress controller running. Unlike other types of controllers, which run as part of the kube-controller-manager binary, ingress controllers are not started automatically with a cluster.
Kubernetes as a project supports and maintains AWS, GCE, and nginx ingress controllers.
AKS Application Gateway Ingress Controller is an ingress controller that configures the Azure Application Gateway.
Set up ingress on minikube with the NGINX ingress controller: https://kubernetes.io/docs/tasks/access-application-cluster/ingress-minikube/
What is an ingress class? When multiple ingress controllers are deployed, an ingress class ties an Ingress to a specific ingress controller.
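With the networking.k8s.io/v1beta1 API used in the example below, this is commonly done with the kubernetes.io/ingress.class annotation (newer API versions use spec.ingressClassName instead), roughly:
metadata:
  annotations:
    kubernetes.io/ingress.class: nginx   # ties this Ingress to the NGINX ingress controller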
- Install ingress controller in minikube
- Create Ingress resource
- Create service
- Create Pods
Install ingress controller in minikube
minikube addons enable ingress
kubectl get pods -n kube-system
This automatically starts the Kubernetes NGINX implementation of the ingress controller.
Create Ingress rule/resource
apiVersion: networking.k8s.io/v1beta1
kind: Ingress
metadata:
  name: dashboard
  namespace: k8s-dashboard
spec:
  rules:
  - host: dashboard.com
    http:
      paths:
      - backend:
          serviceName: dashboard   # the Service has the same name as the Ingress and is of type ClusterIP
          servicePort: 80
kubectl apply -f ingress.yml
Check the ADDRESS assigned to the ingress; if it is not assigned yet, wait for some time.
kubectl get ingress
To resolve dashboard.com to the IP, add an entry to /etc/hosts:
192.168.0.5 dashboard.com
Prometheus provides monitoring and alerting through email or Slack. Grafana is for visualization.
ELK is for logging.
Kibana and tabview are alternatives to Grafana.
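As a minimal illustration (not a complete setup), a Prometheus scrape configuration can use Kubernetes service discovery to find targets; this hypothetical prometheus.yml fragment discovers cluster nodes:
scrape_configs:
- job_name: kubernetes-nodes
  kubernetes_sd_configs:
  - role: node   # discover every node in the cluster as a scrape target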
Ref: https://github.com/marcel-dempers/docker-development-youtube-series/tree/master/kubernetes/rbac (this is forked) https://www.youtube.com/watch?v=jvhKOAyD8S8
- Roles and RoleBindings are namespace-level.
- ClusterRoles and ClusterRoleBindings are cluster-level and grant permissions across all namespaces.
Note: Kubernetes has no concept of users; it trusts certificates that are signed by its CA.
Managed K8s offerings like AKS, EKS, or GKE use an OAuth authentication mechanism.
A Role can only be used to grant access to resources in a single namespace. A RoleBinding ties a Role to a subject (user/group/service account), as in the sketch below.
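A minimal sketch (the namespace, user, and object names are hypothetical): a Role that can read Pods in the dev namespace, and a RoleBinding that grants it to user jane.
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: pod-reader
  namespace: dev
rules:
- apiGroups: [""]           # "" is the core API group
  resources: ["pods"]
  verbs: ["get", "list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: read-pods
  namespace: dev
subjects:
- kind: User
  name: jane                # matched against the CN of a CA-signed client certificate
  apiGroup: rbac.authorization.k8s.io
roleRef:
  kind: Role
  name: pod-reader
  apiGroup: rbac.authorization.k8s.io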
https://github.com/bhaaskara/azure-aks-kubernetes-masterclass/tree/master/21-Azure-AKS-Authentication-and-RBAC/21-03-Kubernetes-RBAC-with-AzureAD-on-AzureAKS
The kubectl command-line tool uses kubeconfig files to find the information it needs to choose a cluster and communicate with the API server of a cluster.
Use kubeconfig files to organize information about clusters, users, namespaces, and authentication mechanisms.
By default, kubectl looks for a file named config in the $HOME/.kube directory. You can specify other kubeconfig files by setting the KUBECONFIG environment variable or by setting the --kubeconfig flag.
Note: A file that is used to configure access to clusters is called a kubeconfig file. This is a generic way of referring to configuration files. It does not mean that there is a file named kubeconfig.
Warning: Only use kubeconfig files from trusted sources. Using a specially-crafted kubeconfig file could result in malicious code execution or file exposure. If you must use an untrusted kubeconfig file, inspect it carefully first, much as you would a shell script.
A context binds a user to a cluster and a namespace, as in the sketch below.
(This section is incomplete.)
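A minimal kubeconfig sketch (all names, the server URL, and the certificate paths are hypothetical): the context ties user jane to dev-cluster with a default namespace of dev.
apiVersion: v1
kind: Config
clusters:
- name: dev-cluster
  cluster:
    server: https://dev.example.com:6443
users:
- name: jane
  user:
    client-certificate: /path/to/jane.crt
    client-key: /path/to/jane.key
contexts:
- name: jane@dev
  context:
    cluster: dev-cluster
    user: jane
    namespace: dev
current-context: jane@dev
Switch contexts with:
kubectl config use-context jane@dev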
Deployment.yml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: php-apache
spec:
  selector:
    matchLabels:
      run: php-apache
  replicas: 1
  template:
    metadata:
      labels:
        run: php-apache
    spec:
      containers:
      - name: php-apache
        image: k8s.gcr.io/hpa-example
        ports:
        - containerPort: 80
        resources:
          requests:
            cpu: 500m
          limits:
            cpu: 1000m
HPA.yml
apiVersion: autoscaling/v1
kind: HorizontalPodAutoscaler
metadata:
  name: php-apache
  namespace: default
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: php-apache
  minReplicas: 1
  maxReplicas: 10
  targetCPUUtilizationPercentage: 50
Note: the HPA needs the metrics server to be configured.
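Assuming the two manifests above are saved as Deployment.yml and HPA.yml, they can be applied and observed with:
kubectl apply -f Deployment.yml
kubectl apply -f HPA.yml
kubectl get hpa php-apache --watch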
A probe is a periodic check that monitors the health of an application. Kubernetes evaluates application health via probes and can automatically restart unhealthy containers.
Applications can become unreliable for a variety of reasons, for example:
- Temporary connection loss
- Configuration errors
- Application errors
Developers can use probes to monitor their applications. Probes make developers aware of events such as application status, resource usage, and errors.
There are currently three types of probes in Kubernetes:
- Startup probe
- Readiness probe
- Liveness probe
A startup probe verifies whether the application within a container has started. Startup probes run before any other probe and, until the startup probe finishes successfully, all other probes are disabled. If a container fails its startup probe, then the container is killed and follows the pod's restartPolicy.
This type of probe is only executed at startup, unlike readiness probes, which are run periodically.
The startup probe is configured in the spec.containers.startupProbe attribute of the pod configuration.
Readiness probes determine whether or not a container is ready to serve requests. If the readiness probe returns a failed state, then Kubernetes removes the IP address for the container from the endpoints of all Services.
Developers use readiness probes to instruct Kubernetes that a running container should not receive any traffic. This is useful when waiting for an application to perform time-consuming initial tasks, such as establishing network connections, loading files, and warming caches.
The readiness probe is configured in the spec.containers.readinessProbe attribute of the pod configuration.
Liveness probes determine whether or not an application running in a container is in a healthy
state. If the liveness probe detects an unhealthy state, then Kubernetes kills the container and tries to redeploy it.
The liveness probe is configured in the spec.containers.livenessProbe attribute of the pod configuration.
Kubernetes provides five options that control these probes:
Name | Mandatory | Description | Default Value
---|---|---|---
initialDelaySeconds | Yes | Determines how long to wait after the container starts before beginning the probe. | 0
timeoutSeconds | Yes | Determines how long to wait for the probe to finish. If this time is exceeded, then Kubernetes assumes that the probe failed. | 1
periodSeconds | No | Specifies the frequency of the checks. | 10
successThreshold | No | Specifies the minimum consecutive successes for the probe to be considered successful after it has failed. | 1
failureThreshold | No | Specifies the minimum consecutive failures for the probe to be considered failed after it has succeeded. | 3
The liveness probe passes when the app itself is healthy, but the readiness probe additionally checks that each required back-end service is available. This helps you avoid directing traffic to Pods that can only respond with error messages.
If your container needs to load large data sets or configuration files, or run migrations during startup, you can use a startup probe (see the sketch below).
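Following the style of the examples below, a startup probe sketch for a slow-starting app might look like this (the /health endpoint and thresholds are hypothetical):
_...contents omitted..._
startupProbe:
  httpGet:
    path: /health
    port: 8080
  failureThreshold: 30   # tolerate up to 30 failed checks...
  periodSeconds: 10      # ...10 seconds apart, i.e. up to 300s to start
_...contents omitted..._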
Startup, readiness, and liveness probes can check the health of applications in three ways: HTTP checks, container execution checks, and TCP socket checks.
An HTTP check is ideal for applications that return HTTP status codes, such as REST APIs.
An HTTP probe uses GET requests to check the health of an application. The check is successful if the HTTP response code is in the range 200-399.
The following example demonstrates how to implement a readiness probe with the HTTP check method:
_...contents omitted..._
readinessProbe:
  httpGet:
    path: /health   # (1)
    port: 8080
  initialDelaySeconds: 15   # (2)
  timeoutSeconds: 1   # (3)
_...contents omitted..._
1. The readiness probe endpoint.
2. How long to wait after the container starts before checking its health.
3. How long to wait for the probe to finish.
Container execution checks are ideal in scenarios where you must determine the status of the container based on the exit code of a process or shell script running in the container.
When using container execution checks, Kubernetes executes a command inside the container. Exiting the check with a status of 0 is considered a success. All other status codes are considered a failure.
The following example demonstrates how to implement a container execution check:
_...contents omitted..._
livenessProbe:
  exec:
    command:   # (1)
    - cat
    - /tmp/health
  initialDelaySeconds: 15
  timeoutSeconds: 1
_...contents omitted..._
1. The command to run and its arguments, as a YAML array.
A TCP socket check is ideal for applications that run as daemons, and open TCP ports, such as database servers, file servers, web servers, and application servers.
When using TCP socket checks Kubernetes attempts to open a socket to the container. The container is considered healthy if the check can establish a successful connection.
The following example demonstrates how to implement a liveness probe by using the TCP socket check method:
_...contents omitted..._
livenessProbe:
  tcpSocket:
    port: 8080   # (1)
  initialDelaySeconds: 15
  timeoutSeconds: 1
_...contents omitted..._
1. The TCP port to check.
To configure probes on a deployment, edit the deployment's resource definition. To do this, you can use the kubectl edit or kubectl patch commands. Alternatively, if you already have a deployment YAML definition, you can modify it to include the probes and then apply it with kubectl apply.
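For example, a TCP liveness probe could be patched onto the php-apache deployment defined earlier with a strategic merge patch (a sketch, not the only way):
kubectl patch deployment php-apache -p '{"spec":{"template":{"spec":{"containers":[{"name":"php-apache","livenessProbe":{"tcpSocket":{"port":80},"initialDelaySeconds":15}}]}}}}'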