Solutions to Homework 10 - Kubernetes
This repository contains solutions to Homework 10 from the Machine Learning Zoomcamp 2025 course.
In this homework, we'll deploy the lead scoring model from homework 5.
We already have a Docker image for this model - we'll use it for deploying the model to Kubernetes.
Clone the course repo if you haven't:

```
git clone https://github.com/DataTalksClub/machine-learning-zoomcamp.git
```
Go to the `course-zoomcamp/cohorts/2024/05-deployment/homework` folder and
execute the following:

```
docker build -f Dockerfile_full -t zoomcamp-model:3.13.10-hw10 .
```

Run it to test that it's working locally:

```
docker run -it --rm -p 9696:9696 zoomcamp-model:3.13.10-hw10
```

And in another terminal, execute the q6_test.py file:

```
python q6_test.py
```

You should see this:

```
{'conversion_probability': <value>, 'conversion': False}
```

Here `<value>` is the probability of getting a subscription. You need to choose the right one.
- 0.29
- ✅ 0.49
- 0.69
- 0.89
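For reference, the `conversion` flag in the response is typically derived from the probability with a decision threshold. A minimal sketch of that relationship (the 0.5 threshold and the helper below are assumptions for illustration; check q6_test.py and the service code for the actual logic):

```python
def to_response(probability, threshold=0.5):
    # Hypothetical reconstruction of the service's JSON response shape:
    # the flag flips to True once the probability reaches the threshold.
    return {
        "conversion_probability": probability,
        "conversion": probability >= threshold,
    }

# The correct answer, 0.49, sits just below the assumed threshold,
# which is consistent with the response showing 'conversion': False.
print(to_response(0.49))
```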
Now you can stop the container running in Docker.
You need to install:
- `kubectl` - https://kubernetes.io/docs/tasks/tools/ (you might already have it - check before installing)
- `kind` - https://kind.sigs.k8s.io/docs/user/quick-start/
What's the version of kind that you have?
Use `kind --version` to find out.
Answer: 0.26.0
Now let's create a cluster with kind:
```
kind create cluster
```

And check with `kubectl` that it was successfully created:

```
kubectl cluster-info
```

What's the smallest deployable computing unit that we can create and manage
in Kubernetes (kind in our case)?
- Node
- ✅ Pod
- Deployment
- Service
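For reference, a Pod is the unit you'd describe in the smallest possible manifest. A minimal hypothetical example (the name, image, and port below are simply the ones used elsewhere in this homework; we'll actually manage the Pod through a Deployment instead):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: subscription
spec:
  containers:
  - name: subscription
    image: zoomcamp-model:3.13.10-hw10
    ports:
    - containerPort: 9696
```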
Now let's test if everything works. Use kubectl to get the list of running services.
What's the Type of the service that is already running there?
- NodePort
- ✅ ClusterIP
- ExternalName
- LoadBalancer
To be able to use the Docker image we previously created (`zoomcamp-model:3.13.10-hw10`),
we need to register it with kind.
What's the command we need to run for that?
- `kind create cluster`
- `kind build node-image`
- ✅ `kind load docker-image`
- `kubectl apply`
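With the winning option, the full command for our image would be (image tag taken from the build step above):

```
kind load docker-image zoomcamp-model:3.13.10-hw10
```

This makes the locally built image available to the kind cluster's nodes without pushing it to a registry.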
Now let's create a deployment config (e.g. `deployment.yaml`):

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: subscription
spec:
  selector:
    matchLabels:
      app: subscription
  replicas: 1
  template:
    metadata:
      labels:
        app: subscription
    spec:
      containers:
      - name: subscription
        image: <Image>
        resources:
          requests:
            memory: "64Mi"
            cpu: "100m"
          limits:
            memory: <Memory>
            cpu: <CPU>
        ports:
        - containerPort: <Port>
```

Replace `<Image>`, `<Memory>`, `<CPU>`, `<Port>` with the correct values.
What is the value for <Port>?
Answer: 9696
Apply this deployment using the appropriate command and get a list of running Pods. You can see one running Pod.
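The "appropriate command" here is `kubectl apply`. Assuming the config above was saved as deployment.yaml, the two steps look like:

```
kubectl apply -f deployment.yaml
kubectl get pods
```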
Let's create a service for this deployment (`service.yaml`):

```yaml
apiVersion: v1
kind: Service
metadata:
  name: <Service name>
spec:
  type: LoadBalancer
  selector:
    app: <???>
  ports:
  - port: 80
    targetPort: <PORT>
```

Fill it in. What do we need to write instead of `<???>`?
Answer: subscription
Apply this config file.
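Assuming the service config was saved as service.yaml, applying and checking it looks like:

```
kubectl apply -f service.yaml
kubectl get services
```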
We can test our service locally by forwarding port 9696 on our computer to port 80 on the service:

```
kubectl port-forward service/<Service name> 9696:80
```

Run q6_test.py (from homework 5) once again to verify that everything is working.
You should get the same result as in Question 1.
Now we're going to use a HorizontalPodAutoscaler (HPA for short) that automatically updates a workload resource (such as our deployment), with the aim of automatically scaling the workload to match demand.
Use the following command to create the HPA:
```
kubectl autoscale deployment subscription --name subscription-hpa --cpu-percent=20 --min=1 --max=3
```

You can check the current status of the new HPA by running:

```
kubectl get hpa
```

The output should be similar to this:

```
NAME               REFERENCE                 TARGETS   MINPODS   MAXPODS   REPLICAS   AGE
subscription-hpa   Deployment/subscription   1%/20%    1         3         1          27s
```

The TARGETS column shows the average CPU consumption across all the Pods controlled by the corresponding deployment.
Current CPU consumption is about 0% as there are no clients sending requests to the server.
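The scaling decision behind these numbers can be approximated by the HPA's proportional rule: desired replicas = ceil(current replicas × observed utilization / target utilization), clamped to the `--min`/`--max` bounds. A simplified sketch (the real controller also applies tolerances and stabilization windows):

```python
import math

def desired_replicas(current, cpu_pct, target_pct=20, lo=1, hi=3):
    # Simplified HPA rule: scale proportionally to observed vs. target
    # CPU utilization, then clamp to the autoscaler's min/max bounds.
    want = math.ceil(current * cpu_pct / target_pct)
    return max(lo, min(hi, want))

print(desired_replicas(1, 1))    # idle cluster: stays at 1 replica
print(desired_replicas(1, 60))   # 60% vs 20% target: scales to 3
print(desired_replicas(3, 5))    # load gone: scales back down to 1
```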
Note: In case the HPA instance doesn't run properly, try to install the latest Metrics Server release from the `components.yaml` manifest:

```
kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml
```
Let's see how the autoscaler reacts to increasing the load. To do this, we can slightly modify the existing
q6_test.py script by putting the operator that sends the request to the subscription service into a loop:

```python
from time import sleep

import requests

# url and client are already defined in q6_test.py
while True:
    sleep(0.1)
    response = requests.post(url, json=client).json()
    print(response)
```

Now you can run this script.
Run `kubectl get hpa subscription-hpa --watch` to monitor how the autoscaler performs.
Within a minute or so, you should see the CPU load increase, and then more replicas appear.
What was the maximum number of replicas during this test?
- 1
- 2
- ✅ 3
- 4
Note: It may take a few minutes for the number of replicas to stabilize. Since the amount of load is not controlled in any way, the final number of replicas may differ from the initial one.
- Submit your results here: https://courses.datatalks.club/ml-zoomcamp-2025/homework/hw10
- If your answer doesn't match options exactly, select the closest one. If the answer is exactly in between two options, select the higher value.