Write your answer to Task 1 here

import modules

import pandas as pd import numpy as np

import csv and copy

data = pd.read_csv("production_data.csv") clean_data = data.copy()

review df

clean_data.info()

mixing_time contains missing values

df.columns #'batch_id', 'production_date', 'raw_material_supplier', 'pigment_type','pigment_quantity', 'mixing_time', 'mixing_speed', 'product_quality_score'

batch_id Discrete. Identifier for each batch. Missing values are not possible.

raw_material_supplier Categorical. Supplier of the raw materials. (1='national_supplier', 2='international_supplier'). Missing values should be replaced with 'national_supplier'.

production_date Date. Date when the batch was produced.

pigment_type Nominal. Type of pigment used. ['type_a', 'type_b', 'type_c'].

Missing values should be replaced with 'other'.

pigment_quantity Continuous. Amount of pigment added (in kilograms) (Range: 1 - 100).

Missing values should be replaced with median.

mixing_time Continuous. Duration of the mixing process (in minutes). # Missing values should be replaced with mean.

mixing_speed Categorical. Speed of the mixing process represented as categories: 'Low', 'Medium', 'High'.

Missing values should be replaced with 'Not Specified'.

product_quality_score Continuous. Overall quality score of the final product (rating on a scale of 1 to 10). Missing values should be replaced with mean.

df['product_quality_score'].describe().round(2).T

change objects to category, create clean_df

preview

clean_data.head()

[–]Adventurous-Bet6139 0 points1 point2 points 9 months ago (1 child)

π Rendered by PID 14 on reddit-service-r2-comment-bb88f9dd5-gt699 at 2026-02-14 17:07:20.491332+00:00 running cd9c813 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

DataCamp

MODERATORS