15 KNN - Problem Statement
15 KNN - Problem Statement
Instructions:
Please share your answers filled in-line in the word document. Submit code separately
wherever applicable.
Hints:
1. Business Problem
1.1. What is the business objective?
1.1. Are there any constraints?
2. Work on each feature of the dataset to create a data dictionary as displayed in the below
image:
2.1 Make a table as shown above and provide information about the features such as its data type
and its relevance to the model building. And if not relevant, provide reasons and a description of the
feature.
3. Data Pre-processing
3.1 Data Cleaning, Feature Engineering, etc.
4. Exploratory Data Analysis (EDA):
4.1. Summary.
4.2. Univariate analysis.
4.3. Bivariate analysis.
5. Model Building
5.1 Build the model on the scaled data (try multiple options).
5.2 Perform KNN and use cross validation techniques to get optimum K value.
5.3 Train and test the model and perform cross validation techniques. Compare accuracies, precision
and recall and explain them in the documentation.
5.4 Briefly explain the model output in the documentation.
1. A glass manufacturing plant uses different earth elements to design new glass materials
based on customer requirements. For that, they would like to automate the process of
classification as it’s a tedious job to manually classify them. Help the company achieve its
objective by correctly classifying the glass type based on the other features using KNN
algorithm.