kubernetes

Kubernetes is an open-source platform that automates the deployment, scaling, and management of containerized applications. It acts as an orchestrator, ensuring your containers run reliably across clusters of machines, handling networking, storage, and updates without downtime.

Namespaces

Create namespace

Using cli

 kubectl create namespace tests 

Or using yaml

apiVersion: v1
kind: Namespace
metadata:
  name: namespace-name
  labels:
    name: namespace-name
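
Apply the manifest (a minimal usage sketch; the file name ./namespace.yaml is an assumption):

kubectl apply -f ./namespace.yaml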

namespace stuck on delete

nuke finalizers:

NAMESPACE="NAMESPACE_NAME"

kubectl get namespace ${NAMESPACE} -o json \
  | jq '.spec.finalizers = []' \
  | kubectl replace --raw /api/v1/namespaces/${NAMESPACE}/finalize -f -
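
If the namespace still refuses to go away, it usually still contains resources. A sketch that lists everything left inside it (uses the same NAMESPACE variable):

kubectl api-resources --verbs=list --namespaced -o name \
  | xargs -n 1 kubectl get --show-kind --ignore-not-found -n ${NAMESPACE}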

Pods

Create a pod

Example: create an Ubuntu pod for interactive tty access:

apiVersion: v1
kind: Pod
metadata:
  name: ubuntu-test
  namespace: tests
spec:
  #### deploy to a specific node
  nodeName: chimera-gluten
  containers:
    - name: ubuntu-test
      image: ubuntu
      # In Kubernetes, the pod stays alive as long as PID 1 is running.
      # With these options:
      # - It does not exit automatically.
      # - It waits for user input forever.
      # - It behaves like an interactive shell session.
      command: ["sh"]  # PID 1 = interactive shell
      stdin: true # keep STDIN open
      tty: true # allocate a terminal

      volumeMounts:
        - name: data
          mountPath: /data

  volumes:
    - name: data
      persistentVolumeClaim:
        claimName: data-pvc
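
Apply the manifest and attach to the pod (a sketch; the file name ./ubuntu-test.yaml is an assumption):

kubectl apply -f ./ubuntu-test.yaml
kubectl attach -it ubuntu-test -n tests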

Create an ubuntu pod and execute a command:

apiVersion: v1
kind: Pod
metadata:
  name: ubuntu-ls-test
  namespace: tests
spec:
  restartPolicy: Never # runs only once, no retry on error
  # nodeName: "serverExample01" # restrict to a specific node
  containers:
    - name: ubuntu-seaweedfs-test
      image: ubuntu
      command: ["bash", "-c"]
      args: 
        - "ls -lah /data"

      volumeMounts:
      - name: data
        mountPath: /data

  volumes:
    - name: data
      persistentVolumeClaim:
        claimName: data-pvc
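
Apply it and read the command output from the pod logs (file name is an assumption):

kubectl apply -f ./ubuntu-ls-test.yaml
kubectl logs ubuntu-ls-test -n tests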

Get Pod

Get pod name by label app:

POD_NAME=$(kubectl get pod -l app=myAppName -n appNamespace -o jsonpath='{.items[0].metadata.name}')
echo $POD_NAME

Get pod name by text in its description, for example find it by IP:

 kubectl get pods -A -o wide | grep 10.0.3.224

delete Pod

kubectl delete pod -n appNamespace -l app=myAppName

OOMKilled

list all OOMKilled pods:

kubectl get events --all-namespaces | grep -i "OOMKilled"
kubectl get pods --all-namespaces \
-o jsonpath='{range .items[*]}{.metadata.namespace}{" "}{.metadata.name}{" "}{.status.containerStatuses[*].lastState.terminated.reason}{"\n"}{end}' \
| grep OOMKilled
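
For a single suspect pod, the last termination reason can also be read directly from its description (placeholder names):

POD_NAME="???"
NAMESPACE="???"
kubectl describe pod ${POD_NAME} -n ${NAMESPACE} | grep -A 5 "Last State"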

Attach to a pod

Attach connects your terminal to the main process of the container (PID 1), or another running process if specified.

Use it when you want to:

  • see the raw output of the main process
  • send input directly to the main process

kubectl attach -it myPodName -n appNamespace
POD_NAME=$(kubectl get pod -l app=myAppName -n appNamespace -o jsonpath='{.items[0].metadata.name}')
kubectl attach -it ${POD_NAME} -n appNamespace

Run command on pod

# sh
MY_APP_NAME=???
NAMESPACE=???
POD_NAME=$(kubectl get pod -l app=$MY_APP_NAME -n $NAMESPACE -o jsonpath='{.items[0].metadata.name}')
kubectl exec -n $NAMESPACE -it ${POD_NAME} -- sh
# bash
POD_NAME=$(kubectl get pod -l app=myAppName -n appNamespace -o jsonpath='{.items[0].metadata.name}')
kubectl exec -it ${POD_NAME} -- bash 
# execute a command like ls
POD_NAME=$(kubectl get pod -l app=myAppName -n appNamespace -o jsonpath='{.items[0].metadata.name}')
kubectl exec -it ${POD_NAME} -- ls /

Persistent volumes

find the persistent volume used by a PVC

NAMESPACE=???
PVC_NAME=???
PV_NAME=$(kubectl get pvc $PVC_NAME -n $NAMESPACE -o jsonpath='{.spec.volumeName}')
echo "${PV_NAME}"

Patch pv - change to retain policy

PV_NAME="???"
kubectl patch pv $PV_NAME \
  -p '{"spec":{"persistentVolumeReclaimPolicy":"Retain"}}'

Patch pv - remove finalizers

PV_NAME="???"
kubectl patch pv $PV_NAME \
  -p '{"metadata":{"finalizers": null}}'

kubectl

kubectl is the command-line tool used to interact with Kubernetes clusters. Think of it as the “remote control” for Kubernetes: it lets you deploy applications, inspect resources, and manage cluster operations directly from your terminal.

Helper pods

network testing

kubectl run -i --tty dns-test --namespace tests --image=busybox --restart=Never -- sh
kubectl delete pod dns-test --namespace tests || true

Example using yaml and hostNetwork:

  • Create Pod
apiVersion: v1
kind: Pod
metadata:
  name: dns-test
  namespace: tests
spec:
  hostNetwork: true
  containers:
  - name: dns-test
    image: busybox
    command: ["sh"]
    stdin: true
    tty: true
  • Attach to Pod
kubectl attach -it dns-test -n tests
  • Execute command inside pod.
nslookup google.com
  • Delete pod
kubectl delete pod dns-test --namespace tests

Resources

List all resources:

kubectl get all -n kube-system | grep traefik
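
Note that kubectl get all only covers a handful of built-in kinds; to enumerate every resource type the cluster knows about (including CRDs), kubectl api-resources can be used:

kubectl api-resources
kubectl api-resources --namespaced=true -o name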

List service accounts:

kubectl get serviceAccount --all-namespaces

Service Accounts

List all:

kubectl get serviceAccount --all-namespaces

Get Service Account Token:

kubectl get secret <secret_name> -o jsonpath='{.data.token}' | base64 -d
kubectl get secret <secret_name> -o jsonpath='{.data.token}' | base64 -d > ./service-account-secret-base64
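
On Kubernetes 1.24 and newer a token Secret is no longer created automatically for every ServiceAccount; a short-lived token can be requested instead (placeholder names):

kubectl create token <serviceaccount_name> -n <namespace>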

Get Cluster certificate Base64:

kubectl config view --raw -o jsonpath='{.clusters[0].cluster.certificate-authority-data}' 

Secrets

Manifest - Opaque / Base64

apiVersion: v1
kind: Secret
metadata:
  name: secret-name
  namespace: namespace-name
type: Opaque
data:
  SERVER_ADDRESS: MTI3LjAuMC4x # 127.0.0.1 BASE64

Manifest - StringData

apiVersion: v1
kind: Secret
metadata:
  name: secret-name
  namespace: namespace-name
stringData:
  SERVER_ADDRESS: 127.0.0.1
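
A sketch of creating the same secret from the CLI instead of a manifest (same placeholder names as above):

kubectl create secret generic secret-name -n namespace-name \
  --from-literal=SERVER_ADDRESS=127.0.0.1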

Inline with heredoc and environment variables

SERVER_ADDRESS=127.0.0.1
kubectl apply -f - <<EOF
apiVersion: v1
kind: Secret
metadata:
  name: secret-name
  namespace: namespace-name
stringData:
  SERVER_ADDRESS: ${SERVER_ADDRESS}
EOF

envsubst

yaml secret template:

# ./secret.yaml
apiVersion: v1
kind: Secret
metadata:
  name: secret-name
  namespace: namespace-name
stringData:
  SERVER_ADDRESS: ${SERVER_ADDRESS}

export SERVER_ADDRESS="127.0.0.1"
envsubst < ./secret.yaml | kubectl apply -f -

env file and envsubst:

#---
# ./.env
# content:
# SERVER_ADDRESS=127.0.0.1
#---
set -a
source ./.env
set +a
envsubst < ./secret.yaml | kubectl apply -f -
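
To verify what was actually stored (placeholder names from the template above):

kubectl get secret secret-name -n namespace-name \
  -o jsonpath='{.data.SERVER_ADDRESS}' | base64 -d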

nodes

Get nodes info:

kubectl get nodes -o wide

remove annotation:

kubectl annotate node <NODE_NAME> <ANNOTATION_NAME>-

taints

get node taints:

kubectl describe node <NODE_NAME> | grep -i taint

add taint

NODE="????"
TAINT="infra.mydomain.com/dedicated=role:NoSchedule"
kubectl taint nodes ${NODE} ${TAINT}
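
For a pod to be scheduled onto the tainted node it needs a matching toleration. A minimal sketch reusing the taint above (the pod name and the tests namespace are assumptions):

kubectl apply -f - <<EOF
apiVersion: v1
kind: Pod
metadata:
  name: toleration-test
  namespace: tests
spec:
  tolerations:
    - key: "infra.mydomain.com/dedicated"
      operator: "Equal"
      value: "role"
      effect: "NoSchedule"
  containers:
    - name: toleration-test
      image: busybox
      command: ["sh"]
      stdin: true
      tty: true
EOF

Note that a toleration only allows scheduling on the tainted node; nodeName or node affinity is still needed to force the pod there.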

remove taint

NODE="chimera-deepstate"
TAINT="infra.mydomain.com/dedicated=role:NoSchedule"
kubectl taint nodes ${NODE} ${TAINT}-

control plane - NoSchedule

NODE="????"
kubectl taint nodes ${NODE} node-role.kubernetes.io/control-plane=:NoSchedule

Official *.kubernetes.io taints

Node condition taints (automatic):

  • node.kubernetes.io/not-ready - Node is NotReady
  • node.kubernetes.io/unreachable - Node unreachable
  • node.kubernetes.io/out-of-disk - Node out of disk
  • node.kubernetes.io/memory-pressure - Memory pressure
  • node.kubernetes.io/disk-pressure - Disk pressure
  • node.kubernetes.io/network-unavailable - Network unavailable
  • node.kubernetes.io/unschedulable - Node was cordoned
  • node.kubernetes.io/ready - Node is ready (rarely used as taint)

Eviction taints (used by kubelet):

  • node.kubernetes.io/pid-pressure - Too many processes
  • node.kubernetes.io/unschedulable - Node cordoned
  • node.kubernetes.io/taint-effect-no-execute - NoExecute taints

Role taints (official, safe to use):

  • node-role.kubernetes.io/control-plane - Control-plane node
  • node-role.kubernetes.io/master - Legacy control-plane

Everything else in *.kubernetes.io is reserved and should not be used.

cordon

NODE="????"
kubectl cordon ${NODE}

Marks a node as unschedulable.

  • No new pods will be scheduled on that node
  • Existing pods are not affected
  • Even after a reboot, existing pods return to the same node
  • Used for temporary maintenance (updates, debugging, prep for draining; see the drain sketch below)
  • Kubernetes automatically adds the taint: node.kubernetes.io/unschedulable:NoSchedule

uncordon

NODE="???"
kubectl uncordon ${NODE}

Reverses the cordon.

  • The node becomes schedulable again
  • New pods can land on it
  • Existing pods remain untouched
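
For real maintenance, cordoning is usually followed by a drain, which also evicts the running pods (a sketch; flags vary slightly between kubectl versions):

NODE="???"
kubectl drain ${NODE} --ignore-daemonsets --delete-emptydir-data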

statefulset

statefulset - Set Replicas

kubectl patch statefulset <statefulset-name>  \
  -p '{"spec":{"replicas":0}}'

Deployment

Deployment - Set Replicas

DEPLOYMENT_NAME="???"
# example with 0 to "disable" deployment
kubectl scale deployment ${DEPLOYMENT_NAME} --replicas=0
kubectl patch deployment <deployment-name> \
  -p '{"spec":{"replicas":0}}'

Deployment - rollout restart

NAME="???"
NAMESPACE="???"
kubectl rollout restart deployment $NAME -n $NAMESPACE
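
To follow the restart until it completes (same variables):

kubectl rollout status deployment $NAME -n $NAMESPACE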

Daemonset

Daemonset - rollout restart

NAME="???"
NAMESPACE="???"
kubectl rollout restart daemonset $NAME -n $NAMESPACE

certs

list all certs

kubectl get cert -n default

get cert end date

kubectl get secret certificate-name-tls -o "jsonpath={.data['tls\.crt']}" | base64 --decode | openssl x509 -enddate -noout

service accounts

Get service account token:

kubectl get secret continuous-deploy -o jsonpath='{.data.token}' | base64 -d

Services DNS Name

Kubernetes automatically provides DNS names for Services and Pods, and CoreDNS serves these records. This allows workloads to communicate using stable, predictable names instead of changing IP addresses.

<service-name>.<namespace>.svc.<cluster-domain>
<SERVICE_NAME>.<NAMESPACE>.svc.cluster.local

Example: test-services.services.svc.cluster.local
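
A quick way to verify resolution from inside the cluster (a sketch; the dns-check pod name and the placeholders are assumptions):

kubectl run -it --rm dns-check --namespace tests --image=busybox --restart=Never \
  -- nslookup <SERVICE_NAME>.<NAMESPACE>.svc.cluster.local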

core-dns

Overrides

apiVersion: v1
kind: ConfigMap
metadata:
  name: coredns-custom
  namespace: kube-system
data:
  empty.server: |
    # empty file, to remove warning: No files matching import glob pattern: /etc/coredns/custom/*.server
  split_dns.override: |
    rewrite name exact requested.domain.tldr target.domain.tldr
    # wild card. works like *.requested.domain.tldr 
    rewrite name regex (.*)\.requested\.domain\.tldr target.domain.tldr
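
CoreDNS does not always pick up custom overrides immediately; assuming it runs as the usual coredns Deployment in kube-system, a restart forces it to reload the ConfigMap:

kubectl rollout restart deployment coredns -n kube-system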

Remove warning from logs

[WARNING] No files matching import glob pattern: /etc/coredns/custom/*.server
[WARNING] No files matching import glob pattern: /etc/coredns/custom/*.override
  1. Apply on kubernetes
apiVersion: v1
kind: ConfigMap
metadata:
  name: coredns-custom
  namespace: kube-system
data:
  log.override: |
    #
  stub.server: |
    #

Custom Resource Definitions

  • Definition: A Custom Resource Definition (CRD) is an extension of the Kubernetes API.

  • Purpose: They allow you to define new resource kinds (e.g., Database, Backup, FooBar) that behave like native Kubernetes objects.

  • Analogy: By default, Kubernetes understands objects like Pods and Services. With CRDs, you can add your own object types and manage them with kubectl just like built-in resources (a minimal example follows)
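
A minimal CRD sketch (the foobars.example.com group, names, and schema are made up for illustration):

kubectl apply -f - <<EOF
apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
  name: foobars.example.com
spec:
  group: example.com
  scope: Namespaced
  names:
    plural: foobars
    singular: foobar
    kind: FooBar
  versions:
    - name: v1
      served: true
      storage: true
      schema:
        openAPIV3Schema:
          type: object
          properties:
            spec:
              type: object
              properties:
                message:
                  type: string
EOF

# kubectl now understands the new kind (empty until an instance is created)
kubectl get foobars -A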

List traefik CRDS:

kubectl get crds | grep traefik

k3s

K3s is a lightweight, certified Kubernetes distribution designed to run in resource-constrained environments such as edge devices, IoT appliances, and small servers. It simplifies installation and operation by packaging Kubernetes into a single small binary, while still being fully compliant with the Kubernetes API.

🌐 What K3s Is

  • Definition: K3s is a simplified Kubernetes distribution created by Rancher Labs (now part of SUSE) and maintained under the CNCF.
  • Purpose: It's built for environments where full Kubernetes (K8s) is too heavy, like Raspberry Pis, edge servers, or CI pipelines.
  • Size: The entire distribution is packaged into a binary under ~70MB.

Install / Setup

Default master installation:

curl -sfL https://get.k3s.io | sh -

Install specific version and disable:

  • flannel (alternative example calico, cilium)
  • servicelb (alternative example metallb, cilium)
  • traefik (then install using helm chart or custom manifests for better control)
curl -sfL https://get.k3s.io | INSTALL_K3S_VERSION=v1.33.3+k3s1 INSTALL_K3S_EXEC="--flannel-backend=none \
--disable-network-policy \
--disable=servicelb \
--disable=traefik" \
 sh -
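
Joining an additional node as an agent uses the same installer, pointed at the server (placeholders for the server address and token; the node token lives at /var/lib/rancher/k3s/server/node-token on the server):

curl -sfL https://get.k3s.io | K3S_URL=https://<SERVER_IP>:6443 K3S_TOKEN=<NODE_TOKEN> sh -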

prune old images

Prune old images; run this on the Kubernetes host node:

crictl rmi --prune

check system logs

sudo journalctl -u k3s-agent --since "1h ago" --reverse --no-pager | more
sudo journalctl -u k3s-agent --since "1 hour ago" --reverse | grep -i "Starting k3s-agent.service" 
sudo journalctl -u k3s --reverse | grep -i "Starting k3s.service"

Workarounds & Fixes

Failed unmounting var-lib-rancher.mount on reboot

When running K3s with /var/lib/rancher on a separate disk.

K3s and containerd often leave behind mount namespaces and overlay layers that block clean unmounting during shutdown. This causes slow reboots and errors like:

Failed unmounting var-lib-rancher.mount
  1. Create the cleanup service

    nano /etc/systemd/system/rancher-cleanup.service
    

    Paste:

    
    [Unit]
    DefaultDependencies=no
    Before=shutdown.target umount.target
    
    [Service]
    Type=oneshot
    ExecStart=/bin/sh -c '/bin/umount -l /var/lib/rancher || true'
    
    [Install]
    WantedBy=shutdown.target
    
    

    Why this works

    • DefaultDependencies=no strips the implicit dependencies so the unit can be ordered freely during shutdown.
    • Before=umount.target guarantees it executes before systemd tries to unmount anything.
    • umount -l detaches the filesystem immediately, even if containerd still holds namespaces.
    • || true prevents harmless “not mounted” errors from blocking shutdown.
  2. Reload systemd

    systemctl daemon-reload
    
  3. Enable the cleanup service

    systemctl enable rancher-cleanup.service
    
  4. Reboot to test:

    reboot
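
After the reboot, whether the cleanup unit ran during the previous shutdown can be checked from the previous boot's journal (assumes persistent journald storage):

journalctl -b -1 -u rancher-cleanup.service --no-pager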
    

klipper-lb

klipper-lb is the tiny, built-in load balancer that k3s uses to give each agent a local, stable endpoint for talking to the Kubernetes API server. Instead of exposing a full external load balancer, k3s runs this lightweight component on 127.0.0.1:6444, and it simply forwards traffic from the agent to the control-plane node (or rotates between multiple servers in an HA setup). It exists to make k3s simpler to deploy: no extra software and no external LB. The warning below can appear at startup even though the cluster continues working normally.

troubleshooting

log: warning - Error starting load balancer: listen tcp 127.0.0.1:6444: bind: address already in use.

rm -rf /var/lib/rancher/k3s/agent/etc/klipper-lb 
systemctl restart k3s-agent

Containerd state

This procedure simulates a fresh node joining the cluster. It deletes all containerd runtime state but does not remove the node from the cluster.

  1. Stop k3s (server or agent)

  2. Delete containerd state

    sudo rm -rf /var/lib/rancher/k3s/agent/containerd
    sudo rm -rf /var/lib/containerd
    
  3. Start k3s

What this does:

  • Removes all images, snapshots, and container metadata
  • Forces k3s to repull every image through CRI → mirrors → Harbor
  • Simulates a fresh node rebuild
  • Node identity, certificates, and cluster membership remain intact
  • Workloads are rescheduled normally
  • This is the correct method to validate offline rebuild capability and ensure Harbor mirrors are complete.

If all container images are provided locally (Example: through Harbor proxy caches), then the entire containerd image store becomes fully ephemeral. This means:

  • /var/lib/rancher/k3s/agent/containerd
  • /var/lib/containerd

contain no unique or irreplaceable data, so they can be excluded from backups.

host cli

host cli - check port usage

# example: port 32329
ss -ltnp | grep 32329

kill all connections

ss -K dst SERVER_IP:6443

cert-manager

Clear stale ACME challenges and orders created before DNS fixes.
These objects are temporary and safe to delete — they do NOT remove or affect existing valid certificates.
After cleanup, cert-manager automatically recreates fresh challenges using the corrected DNS configuration.

kubectl delete challenge -A --all
kubectl delete order -A --all
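
To watch cert-manager recreate the challenges and confirm certificates become Ready again:

kubectl get challenges -A --watch
kubectl get certificates -A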

Removing cert-manager Metadata from Secrets

When migrating clusters or taking manual control of TLS certificates, you may need to fully detach a Secret from cert-manager. cert-manager uses labels and annotations to track ownership, ACME challenge state, and renewal configuration. If these remain, cert-manager may attempt to “adopt” or overwrite the Secret.

This guide shows how to safely remove all cert-manager metadata so the Secret becomes unmanaged.

View Secrets Managed by cert-manager:

kubectl get secrets -A --show-labels | grep cert-manager

This lists Secrets that contain cert-manager labels or annotations.

Remove cert-manager Labels and Annotations:

SECRET_NAME=chimera-limbosolutions-com-tls
NAMESPACE=ignition-provisioner

# Remove cert-manager annotations
kubectl annotate secret ${SECRET_NAME} -n ${NAMESPACE} \
  cert-manager.io/alt-names- \
  cert-manager.io/common-name- \
  cert-manager.io/ip-sans- \
  cert-manager.io/issuer-group- \
  cert-manager.io/issuer-kind- \
  cert-manager.io/issuer-name- \
  cert-manager.io/uri-sans- \
  cert-manager.io/certificate-name- \
  acme.cert-manager.io/http-domain- \
  acme.cert-manager.io/dns-domain- \
  kubectl.kubernetes.io/last-applied-configuration-

# Remove cert-manager controller labels
kubectl label secret ${SECRET_NAME} -n ${NAMESPACE} \
  controller.cert-manager.io/fao- \
  controller.cert-manager.io/owner-kind- \
  controller.cert-manager.io/owner-name- \
  controller.cert-manager.io/owner-group-

After this cleanup, the Secret is fully detached from cert-manager and will no longer be renewed, validated, or overwritten.

Verify Cleanup:

kubectl get secrets -A --show-labels | grep cert-manager

If the Secret no longer appears, it is now unmanaged.
