0% found this document useful (0 votes)

2K views

KIIT Deemed To Be University: A Project Report

This document is a project report on housing price prediction submitted to KIIT Deemed to be University by 5 students under the guidance of Prof. Bhaswati Sahoo. It includes an introduction to the project, literature survey of previous work on housing price prediction using machine learning algorithms, software requirements specification, system design, testing, implementation details using a housing dataset from Ames, Iowa, screenshots of the project, and conclusions. The objective is to identify important variables and define the best regression model to predict housing prices by analyzing over 1500 property sales between 2006-2012 described by 26 explanatory variables.

Uploaded by

SHADAB NADEEM

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2K views

KIIT Deemed To Be University: A Project Report

Uploaded by

SHADAB NADEEM

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 33

A PROJECT REPORT

“HOUSING PRICE
PREDICTION”

Submitted to
KIIT Deemed to be University

In Partial Fulfillment of the Requirement for the Award of

BACHELOR’S DEGREE IN COMPUTER

SCIENCE & ENGINEERING
BY

SOMYA RAJ SINHA 1606309

DEEPESH RATHORE 1606349
SWATI LALL 1606397
TUSHAR 1606398
NIDHI AGRAWAL 1606443
UNDER THE GUIDANCE OF
PROF. BHASWATI SAHOO

SCHOOL OF COMPUTER ENGINEERING

KALINGA INSTITUTE OF INDUSTRIAL TECHNOLOGY
BHUBANESWAR, ODISHA - 751024
J u l y 2019
KIIT Deemed to be University
School of Computer Engineering
Bhubaneswar, ODISHA 751024

CERTIFICATE
This is certify that the project entitled
“HOUSING PRICE
PREDICTION”

submitted by

SOMYA RAJ SINHA 1606309

DEEPESH RATHORE 1606349
SWATI LALL 1606397
TUSHAR 1606398
NIDHI AGRAWAL 1606443

is a record of bonafide work carried out by them, in the partial fulfillment of the
requirement for the award of Degree of Bachelor of Engineering (Computer Sci- ence
& Engineering OR Information Technology) at KIIT Deemed to be university,
Bhubaneswar. This work is done during year 2018-2019, under our guidance.

Date: 05 /07 /19

(Prof. BHASWATI
SAHOO)
Acknowledgement
We are profoundly grateful of Prof. Bhaswati Sahoo for her expert guidance and
continuous encouragement throughout the project right from its commencement to
its completion.

SOMYA RAJ SINHA

DEEPESH RATHORE
SWATI LALL
TUSHAR
NIDHI AGRAWAL

School of Computer Engineering, KIIT, BBSR

ABSTRACT
Considering the fact that buying of houses is not a seasonal activity,but a regular
thing and describing homes and their variance with price is of utmost interest and
importance, using the linear regression model in Python,we are analyzing the
pricing patterns and identifying the features affecting the price of a house ,we are
predicting the price of houses of a city.

Broadly, this paper finds the solution to the question of how house prices are
affected by housing characteristics (both internally, such as the number of
bathrooms, bedrooms, etc. and externally, such as schools, or parks, etc. in the
neighbourhood). Using data from Kaggle, a prominent dataset website, this paper
utilizes both the Linear Regression model, to briefly predict house prices. This
paper also identifies the important attributes in housing price prediction such as
comparable houses, sold price, price per square foot, year in which the house is
sold, building type and bedroom, etc.

It also sees the variations of the sales price with the existing as well as derived
features for a more brief and easy to understand prediction through six graphs. We
seek answers to different questions on capabilities of data set and through the
scatterplot ,we show the comparison between actual price and predicted price and
arrive on a range of price which sees the most number of sales.

School of Computer Engineering, KIIT, BBSR

Contents: Page No.
1. Introduction …………………………………….1
2. Literature Survey ………………………………4
3. Software Requirements Specification………….5
3.1 Introduction………………………….………...5
3.2 Objective………………………………………..5
3.3 Problem Statement…………………………….5
3.4 System Overview………………………………6
3.5 Hardware Requirement……………………….6
3.6 Software Requirement……………………… .6
3.6.1 Technical Specification………………………6
4. System Design……………………………………8
5. System Testing………………………………… 9
6. Project Planning…………………………………10
7. Implementation…………………………………..11
7.1 Data set and features………………………… 11
7.2 Output label………………………………… 11
7.3 Data preprocessing and cleaning…………… 12
7.4 Data visualization and feature Engineering… 13
7.5 Applying the model and checking accuracy… 16
8. Screenshots of Project………………………… 17
8.1 Graphs for data visualization…………………17
8.2 Feature Selection………………………………18
8.3 Checking correlation………………………… 20
8.4 Model and Prediction………………………… 20
9. Conclusion and future scope……………………21
References...………………………………… 22
Appendix……………………………………. 23

School of Computer Engineering, KIIT, BBSR

Chapter 1

Introduction
The fact that buying of houses is not a seasonal activity,but a regular thing and
describing homes and their variance with price is of utmost interest and
importance, using the linear regression model in Python,we are analyzing the
pricing patterns and identifying the features affecting the price of a house ,we
are predicting the price of houses of a city.

Objective of this project is to identify the most important variables and to define
the best regression model for predicting the housing prices in Ames, Iowa. The
data set used for the project purposes, describes 1500 residential property sales
in Ames, Iowa between 2006 and 2012. It contains 26 explanatory variables
describing every aspect of the home. Continuous variables determine the various
area dimensions such as the size of the living area, the basement while discrete
variables quantify the number of rooms, baths, kitchens, parking spots etc.
Nominal variables typically describe the various types or classes of dwellings,
materials and locations such as the name of the neighborhood, the garage type,
the sale type etc. Ordinal variables typically rate the quality and condition of
different house parts and utilities. The fact that the data-set was over
parameterized and heterogeneous lead to the following hardships and increased
the difficulty of the analysis.

School of Computer Engineering, KIIT, BBSR Page 1

School of Computer Engineering, KIIT, BBSR Page 2
School of Computer Engineering, KIIT, BBSR Page 3
Chapter 2

Literature Survey
This section focuses on the most popular and relevant methods used for
predicting the housing prices. Many research has been done to practice the
prediction of the housing prices of different cities considering the different
attributes for each city. Methods like Linear Regression, Random Forest,SVM
and also other machine learning algorithms are used to predict the prices of the
house.

One of the famous research paper was written by An Nguyen. This paper
explores the question of how house prices in five different counties are affected
by housing characteristics (both internally, such as the number of bathrooms,
bedrooms, etc. and externally, such as public schools’ scores or the walkability
score of the neighborhood). This paper also identifies the four most important
attributes in housing price prediction across as assessment, comparable houses’
sold price, listed price and number of bathrooms.

The machine learning algorithms used in this paper are Random Forest and
Support Vector Machine (SVM) to do the prediction of houses in Zillow, Trulia,
and Red-fin.
Using a data-set of 1,457 houses from 5 different counties scraped from Zillow,
Trulia and Red-fin, this paper addresses the following questions:
1. Can the models propose in this paper outperform or get close to Zillow’s
prediction score baseline?
2. Can the overestimate to underestimated house ratio be reduced?
3. What are the most important attributes that affect the sold price?

For Hunt (TX), SVM outperforms the baseline by 3.2%. Random Forest outputs
close predictions scores to the baseline with the data-set from Cowlitz (WA) and
Montgomery (IL). Moreover, results suggest that using one single set of 10
attributes for all counties will not change the models’ accuracy scores by a lot in
comparison to using different sets of attributes for different counties.

School of Computer Engineering, KIIT, BBSR Page 4

Chapter 3
Software Requirements Specification

1.1 Introduction:
Through housing price prediction a user can predict the price of a house by
providing certain information about the house such as number of bedroom,
number of bathroom, kitchen area, living area, parking lot and various other
attributes.After providing these information we analyse the data and do data
engineering and select the relevant features to predict the price of the house.
3.2 Objective:
Objective of this project is to identify the most important variables and to define
the best regression model for predicting the housing prices in Ames, Iowa. The
data set used for the project purposes, describes 1500 residential property sales
in Ames, Iowa between 2006 and 2012. It contains 26 explanatory variables
describing every aspect of the home. Continuous variables determine the various
area dimensions such as the size of the living area, the basement while discrete
variables quantify the number of rooms, baths, kitchens, parking spots etc.
Nominal variables typically describe the various types or classes of dwellings,
materials and locations such as the name of the neighborhood, the garage type,
the sale type etc. Ordinal variables typically rate the quality and condition of
different house parts and utilities. The fact that the data-set was over
parameterized and heterogeneous lead to the following hardships and increased
the difficulty of the analysis.
3.3 Problem Statement:
Let's take a real estate company that has a dataset containing the prices of
properties. It wants to utilize the data to optimise the sale prices of the properties
based on important features.
Essentially, the company wants to —
 Identify the variables affecting house prices.
 Design a linear model that quantitatively relates house prices with variables
or factors such as number of rooms, area, number of bathrooms, etc.
 Know the accuracy of the model, i.e. how well these features can predict
house prices.

School of Computer Engineering, KIIT, BBSR Page 5

3.4 System Overview:

Figure 1 : Rough System Architecture

3.5 Hardware Requirement

Hardware Requirement
RAM : 8 GB
SYSTEM TYPE : 64-Bit Operating System,x64-based processor

3.6 Software Requirement

Software Specification
Operating System:Windows 10

3.6.1 TECHNICAL SPECIFICATION

The technical tools used in making this project include the following:

Python3: Python is an interpreted high-level programming language for

general-purpose programming. Created by Guido van Rossum and first
released in 1991, Python has a design philosophy that emphasizes code
readability, notably using significant whitespace. It provides constructs that
enable clear programming on both small and large scales.
School of Computer Engineering, KIIT, BBSR Page 6
Anaconda: is a free and open source distribution of the Python and R
programming languages for data science and machine learning related
applications (large-scale data processing, predictive analytics, scientific
computing), that aims to simplify package management and deployment.
Package versions are managed by the package management system conda,
which makes it quite simple to install, run, and update complex data science
and machine learning software libraries like Scikit-learn, TensorFlow, and
SciPy.

Jupyter Notebook: The Jupyter Notebook is an open-source web application

that allows you to create and share documents that contain live code, equations,
visualizations and narrative text. Uses include: data cleaning and
transformation, numerical simulation, statistical modeling, data visualization,
machine learning, and much

School of Computer Engineering, KIIT, BBSR Page 7

Chapter 4

System Design

School of Computer Engineering, KIIT, BBSR Page 8

Chapter 5

System Testing
Test Cases and Test Results

Test Test Case Title Test Condition System Behavior Expected Result
ID

T01 Accuracy Check on test data 76% 90%

School of Computer Engineering, KIIT, BBSR Page 9

Chapter 6
Project Planning
6.1 DATA COLLECTION :
We have collected the data set from Kaggle on Ames housing prices.

6.2 DATA CLEANING :

6.2.1 Removing null values.

6.2.2 Imputing missing values.

6.3 DATA PRE PROCESSING :

6.3.1 Handling categorical values.

6.3.2 Converting to proper data types.
6.3.3 Eliminating Constant and Quasi-constant columns.

6.4 DATA VISUALIZATION :

6.4.1 Plotting the graph between different columns to visualize the relation or trend between each
columns.
6.4.2 Plotting the heat map and visualizing the correlation between different columns, and
eliminating the columns having higher correlation (>0.8).

6.5 APPLYING LINEAR REGRESSION :

Calculating the accuracy of the model.

School of Computer Engineering, KIIT, BBSR Page 10

Chapter 7

Implementation

7.1 DataSet and Features

Sale Price, Bedrooms, Bathrooms, Kitchens, Ground Living Area, Year Sold, Year Built,
Garages are the main data set we are working on to train our model and to predict the
prices all these datas were obtained from Kaggle.

7.1.1 HOUSE AGE

The age of house has a great impact on it’s price. We have derived a feature of house age
by subtracting Year Sold and Year built. This feature will let us know that how the prices
are varying with the increase in its age. It would help us in predicting our model.

7.1.2 HOUSE TYPE

This feature is derived using the Building Type and Garages. It is used to know the
variation of prices with respect to the type of building(1BHK, Duplex, 3BHK) and the
numbers of cars can be parked in that building’s garage.

7.1.3 BATHROOMS

In the dataset we were given different types of bathroom of a house like full bathroom, half
half bathroom and bathroom in basement. So we combined all these into one column as
they all comes under bathrooms, and also bathrooms are something which everyone looks
for while buying a house. So it is very helpful in predicting the price of houses.

7.1.4 Ground Living Area

The Ground Living Area shows the area in Square foot in which the house is built. The
price of houses is almost dependent on this, as they are directly proportional.

7.2 OUTPUT LABEL

7.2.1 Sale Price

Sales price is the output label here as we have to predict the Sales price of the houses
considering the different attributes and features from the given dataset.

School of Computer Engineering, KIIT, BBSR

Page 11
7.3 DATA PREPROCESSING AND CLEANING

The data set of Ames Housing Price has been used which was taken from Kaggle. We have
cleaned and preprocessed the data by checking out the NULL values and corelation(>=0.8)
between the columns. We also have to deal with the categorical variables so, the attributes
which were not significant in predicting the price was also removed.

We can see that we have many Null values data, so we will replace them and also we have
Some constant and Quasi-Constant attributes we will also remove them as they won’t be
any helpful in prediction.
We will copy all the attributes which are useful in other data set in pandas(python).

Now, we will check for corelation between the attributes and deal with it.

Page 12
Here, from the corelation matrix we can see that OverAllQual is highly corelated with Sales Price.

7.4 DATA VISUALIZATION AND FEATURE ENGINEERING

7.4.1 Plotting the graph of Sales vs Sales Price

With this graph we can visualize the No. Of sales which is happening in a given price range.

Page 13
7.4.2 Plotting the graph of Ground Living Area vs Sales Price

With the help of this scatter plot we can visualize the outliers and also the variation in prices
with respect to the living area which is in square foot.

7.4.3 Plotting the graph of House Age vs Sales Price

We have made a new feature called house age(Year Sold - Year Built) to see the variation
in price trend of houses, this graph will help us to see how the price varies when the
age of house is more.

7.4.4 Plotting the graph of HouseType vs Sales Price

We have made a new feature out of Building Type and Garage Cars to see the price trend of houses
depending on the type of building and number of cars the garage can park.

We have done encoding of Building type to convert it from categorical variable to numerical variable.

Page 14
7.4.5 Plotting the graph of Bathroom vs Sales Price

Page 15
7.5 APPLYING THE MODEL AND CHECKING ACCURACY

Using the Linear Regression Model we are getting an accuracy something near to 76%.

Page 16
NAME OF PROJECT

Chapter 8

Screenshots of Project

8.1 Graphs for Data Visualization

Page 17
8.2 Feature Selection

School of Computer Engineering, KIIT, BBSR Page 18

Page 19
8.3 CHECKING CORELATION

8.4 MODEL AND PREDICTION

Page 20
Chapter 9
Conclusion and Future Scope

9.1 Conclusion
By analysing the pricing patterns and identifying the features affecting the
price of a house we are predicting the price of houses of that city.
It is observed by creating a scatter plot between the actual price and
observed price that the houses costing in between 1000000$ to
2000000$ are predicted quite accurately and is also observed that the
houses costing between 1500000$-200000$ are sold the most.

9.2 Future Scope

Once our model has been trained on a given set of data, it can now be
used to make predictions on new sets of input data. The model has learned
what the best questions to ask about the input data are, and can respond
with a prediction for the target variable. We can use these predictions to
gain information about data where the value of the target variable is
unknown — such as data the model was not trained on.

Page 21
References
 Kaggle.com

 Wikipedia.com

 Google.com

 Balsamiq.cloud : wireframe used as image

 Towardsdatascience.com

Page 22
Appendix-I
STUDENT'S CONTRIBUTION TO THE PROJECT

NAME OF STUDENT Somya Raj Sinha

ROLL NO 1606309

PROJECT TITLE Housing price prediction

ABSTRACT OF THE Considering the fact that buying of houses is not a seasonal
PROJECT (WITHIN 80 activity,but a regular thing and describing homes and their
WORDS) variance with price is of utmost interest and importance,using
linear regression model in Python,we are analyzing the pricing
patterns and identifying the features affecting the price of a
house ,we are predicting the price of houses of a city.

CONTRIBUTION
1. CONTRIBUTION TO Contributed in the report regarding project planning and
THE PROJECT implementation along with screenshot of project.
REPORT

2. CONTRIBUTION Derived features from the existing set of features and plotted
DURING the bar graph for house age(in years) vs sales price($).
IMPLEMENTATION

3. CONTRIBUTION FOR Histogram,scatterplot and bar graph for house age(in years) vs
THE PROJECT sales price($).
DEMONSTRATION /
PRESENTATION

SIGNATURE OF STUDENT

Page 23
SIGNATURE OF GUIDE
Appendix-II
STUDENT'S CONTRIBUTION TO THE PROJECT

NAME OF STUDENT Deepesh Rathore

ROLL NO 1606349

PROJECT TITLE Housing price prediction

CONTRIBUTION
4. CONTRIBUTION TO Contributed in system design and testing along with a section
THE PROJECT of screenshot of project.
REPORT

5. CONTRIBUTION Plotted scatterplot of sale price against living area and bar
DURING graph for different features against sale price.
IMPLEMENTATION

6. CONTRIBUTION FOR Two bar graphs of house type and no of bathrooms vs sales
THE PROJECT price along with a scatterplot showing comparison between
DEMONSTRATION / predicted price and actual price.
PRESENTATION

SIGNATURE OF STUDENT

Page 24
SIGNATURE OF GUIDE
Appendix-III
STUDENT'S CONTRIBUTION TO THE PROJECT

NAME OF STUDENT Swati Lall

ROLL NO 1606397

PROJECT TITLE Housing price prediction

CONTRIBUTION
7. CONTRIBUTION TO Contributed in introduction,software requirement specification
THE PROJECT and a section of screenshot of project.
REPORT

8. CONTRIBUTION Plotted scatter plot for the comparison of actual and predicted
DURING price and histogram of sale price against no of sales.
IMPLEMENTATION

9. CONTRIBUTION FOR Problem statement,motivation for the same and Matplotlib.

THE PROJECT
DEMONSTRATION /
PRESENTATION

SIGNATURE OF STUDENT

Page 25
SIGNATURE OF GUIDE
Appendix-IV
STUDENT'S CONTRIBUTION TO THE PROJECT

NAME OF STUDENT Tushar

ROLL NO 1606398

PROJECT TITLE Housing price prediction

CONTRIBUTION
10. CONTRIBUTION TO Contributed in conclusion and future scope along with
THE PROJECT screenshot of project.
REPORT

11. CONTRIBUTION Applied Linear regression model.

DURING
IMPLEMENTATION

12. CONTRIBUTION FOR Applicability of the data,conclusion of the prediction and

THE PROJECT future outcomes expected from this project.
DEMONSTRATION /
PRESENTATION

SIGNATURE OF STUDENT

Page 26
SIGNATURE OF GUIDE
Appendix-V
STUDENT'S CONTRIBUTION TO THE PROJECT

NAME OF STUDENT Nidhi Agrawal

ROLL NO 1606443

PROJECT TITLE Housing price prediction

CONTRIBUTION
13. CONTRIBUTION TO Contributed in data pre-processing,data cleaning and a section
THE PROJECT of screenshot of project.
REPORT

14. CONTRIBUTION Data cleaning and pre - processing.

DURING
IMPLEMENTATION

15. CONTRIBUTION FOR Tools used for the project along with the reason and Linear
THE PROJECT regression model in Python.
DEMONSTRATION /
PRESENTATION

SIGNATURE OF STUDENT

Page 27
SIGNATURE OF GUIDE
Page 28

Product Team Cialis - Getting Ready To Market - Case Solution
No ratings yet
Product Team Cialis - Getting Ready To Market - Case Solution
6 pages
Intuit
No ratings yet
Intuit
44 pages
Seam 4 Cargo Handling Stowage DG ILO
100% (3)
Seam 4 Cargo Handling Stowage DG ILO
55 pages
Dsbda Mini Manav
No ratings yet
Dsbda Mini Manav
17 pages
Laptop Price Prediction Using Machine Learning: International Journal of Computer Science and Mobile Computing
100% (1)
Laptop Price Prediction Using Machine Learning: International Journal of Computer Science and Mobile Computing
5 pages
Final House Prediction
50% (2)
Final House Prediction
83 pages
Chandigarh University: Format For Project Report
100% (1)
Chandigarh University: Format For Project Report
4 pages
Report of Industrial Training
No ratings yet
Report of Industrial Training
22 pages
2_SampleProjectCertificates
No ratings yet
2_SampleProjectCertificates
6 pages
Internship Report Anthony and Joshil PDF
No ratings yet
Internship Report Anthony and Joshil PDF
20 pages
Mca Project Guidelines Complete
No ratings yet
Mca Project Guidelines Complete
13 pages
Mini Project Report
No ratings yet
Mini Project Report
25 pages
Module 2 PDF
No ratings yet
Module 2 PDF
83 pages
Mini Project Report On Ipl Win Probability Predictor"
No ratings yet
Mini Project Report On Ipl Win Probability Predictor"
28 pages
Campus Placement Analyzer: Using Supervised Machine Learning Algorithms
No ratings yet
Campus Placement Analyzer: Using Supervised Machine Learning Algorithms
5 pages
Industrial Training Report
No ratings yet
Industrial Training Report
24 pages
Tentative BTech - CSE 4TH Sem Syllabus 2018-19
No ratings yet
Tentative BTech - CSE 4TH Sem Syllabus 2018-19
26 pages
Python Report PDF
No ratings yet
Python Report PDF
41 pages
Karnataka PGCET MCA Syllabus PDF
No ratings yet
Karnataka PGCET MCA Syllabus PDF
2 pages
LP-III - Mini Project Report (ML)
No ratings yet
LP-III - Mini Project Report (ML)
15 pages
Alumni Tracking System: A Major Project Report ON
No ratings yet
Alumni Tracking System: A Major Project Report ON
65 pages
Mini Project
No ratings yet
Mini Project
40 pages
Week 1
100% (1)
Week 1
25 pages
MCA Sem Java Program Solution
No ratings yet
MCA Sem Java Program Solution
16 pages
CSE MINI PROJECT Report
No ratings yet
CSE MINI PROJECT Report
14 pages
Software Testing - 2024 - Assignment 2 22.01.2024
100% (1)
Software Testing - 2024 - Assignment 2 22.01.2024
6 pages
Breast Cancer Classification Using Deep Learning Final Ppt (1)
No ratings yet
Breast Cancer Classification Using Deep Learning Final Ppt (1)
19 pages
CodeAlpha Certificate
No ratings yet
CodeAlpha Certificate
1 page
25th August MCA New First Year Syllabus 2020
No ratings yet
25th August MCA New First Year Syllabus 2020
24 pages
Phase 2 Final Report Depression Detection
No ratings yet
Phase 2 Final Report Depression Detection
48 pages
DBDAL LAB - MANUAL - Final
No ratings yet
DBDAL LAB - MANUAL - Final
93 pages
Internship Report (Data Science)
No ratings yet
Internship Report (Data Science)
32 pages
Final ML Report
No ratings yet
Final ML Report
34 pages
CHANDIGARH UNIVERSITY - Final Project Report 1
100% (1)
CHANDIGARH UNIVERSITY - Final Project Report 1
12 pages
Gold Price Prediction Using Ensemble Based Supervised Machine Learning
100% (2)
Gold Price Prediction Using Ensemble Based Supervised Machine Learning
30 pages
Bangalore House Price Prediction Using The Best Machine Learning Model Submitted by Rukzana Vadakkekudy Rassak P2682221
No ratings yet
Bangalore House Price Prediction Using The Best Machine Learning Model Submitted by Rukzana Vadakkekudy Rassak P2682221
9 pages
Develop SRS As Per IEEE Standard For A Student Admission System: (30 Marks)
No ratings yet
Develop SRS As Per IEEE Standard For A Student Admission System: (30 Marks)
9 pages
DSBDA - Mini Project Report
100% (1)
DSBDA - Mini Project Report
7 pages
Java Full Stack Internship Report
No ratings yet
Java Full Stack Internship Report
2 pages
Companies Eligibility Criteria
No ratings yet
Companies Eligibility Criteria
4 pages
Practical 3 ANN
No ratings yet
Practical 3 ANN
3 pages
Visvesvaraya Technological University: City Engineering College
No ratings yet
Visvesvaraya Technological University: City Engineering College
31 pages
Predictive Analysis For Big Mart Sales Using Machine
100% (1)
Predictive Analysis For Big Mart Sales Using Machine
11 pages
1NH17CS407
No ratings yet
1NH17CS407
110 pages
Capstone Project
No ratings yet
Capstone Project
25 pages
Internship Report Format Vtu
No ratings yet
Internship Report Format Vtu
6 pages
MCA Mini Project Report Format 12-2023
No ratings yet
MCA Mini Project Report Format 12-2023
8 pages
TO SUPPLY LEFTOVER FOOD TO POOR
No ratings yet
TO SUPPLY LEFTOVER FOOD TO POOR
32 pages
ML Practical File
No ratings yet
ML Practical File
24 pages
Project Synopsis Format
No ratings yet
Project Synopsis Format
2 pages
Decision Tree Report
100% (1)
Decision Tree Report
29 pages
Anush J Internship Report
No ratings yet
Anush J Internship Report
15 pages
A Seminar Report On Machine Learning
No ratings yet
A Seminar Report On Machine Learning
38 pages
Placement Prediction Using Various Machine Learning Models and Their Efficiency Comparison
No ratings yet
Placement Prediction Using Various Machine Learning Models and Their Efficiency Comparison
5 pages
Deep Learning Based Car Damage Detection, Classification and Severity
No ratings yet
Deep Learning Based Car Damage Detection, Classification and Severity
7 pages
Project Final Report
100% (1)
Project Final Report
44 pages
Admin Module: PG Accommodation Database Management System
No ratings yet
Admin Module: PG Accommodation Database Management System
16 pages
Sales Prediction
No ratings yet
Sales Prediction
37 pages
3rd Sem Java Lab Programs
No ratings yet
3rd Sem Java Lab Programs
17 pages
1D Array
No ratings yet
1D Array
5 pages
Computer Oriented Statistical Methods R22 REGULAR JULY-24 (1)
No ratings yet
Computer Oriented Statistical Methods R22 REGULAR JULY-24 (1)
2 pages
ml project clg (2)
No ratings yet
ml project clg (2)
62 pages
2024 GKS IRTS Application Forms
No ratings yet
2024 GKS IRTS Application Forms
9 pages
Bogibeel Bridge - Wikipedia PDF
No ratings yet
Bogibeel Bridge - Wikipedia PDF
14 pages
Investiture Script
100% (1)
Investiture Script
9 pages
Chem P1 Set 3
No ratings yet
Chem P1 Set 3
14 pages
The Transpersonal William James
100% (1)
The Transpersonal William James
21 pages
Math Q4 Week 2 Lesson 5 Gathering Statistical Data
No ratings yet
Math Q4 Week 2 Lesson 5 Gathering Statistical Data
18 pages
Syllabus PDF
No ratings yet
Syllabus PDF
2 pages
Animals and The Food Chain
No ratings yet
Animals and The Food Chain
8 pages
Employment Contract Form
No ratings yet
Employment Contract Form
2 pages
VMG Myanmar Company Profile
No ratings yet
VMG Myanmar Company Profile
10 pages
Book 3 - Candlestick - Chart Pattern
No ratings yet
Book 3 - Candlestick - Chart Pattern
60 pages
Guidelines For The Husband in Interacting With His Wife
No ratings yet
Guidelines For The Husband in Interacting With His Wife
3 pages
Airtel Case Study PDF
No ratings yet
Airtel Case Study PDF
14 pages
FMEA Deck
100% (1)
FMEA Deck
20 pages
23a Inequalities - H - Question Paper
No ratings yet
23a Inequalities - H - Question Paper
9 pages
(Ebook) Welfare, Choice and Solidarity in Transition: Reforming the Health Sector in Eastern Europe by János Kornai, Karen Eggleston ISBN 9780511012310, 9780521790369, 0521790360, 0511012314 - The full ebook version is ready for instant download
100% (1)
(Ebook) Welfare, Choice and Solidarity in Transition: Reforming the Health Sector in Eastern Europe by János Kornai, Karen Eggleston ISBN 9780511012310, 9780521790369, 0521790360, 0511012314 - The full ebook version is ready for instant download
56 pages
Quiz 3.3 RW
No ratings yet
Quiz 3.3 RW
2 pages
Lean in The Supply Chain A Literature Review
No ratings yet
Lean in The Supply Chain A Literature Review
10 pages
Introduction To Product Management Decision-Making
No ratings yet
Introduction To Product Management Decision-Making
32 pages
RSL FemaleVox G4 2014-OnlineEdition 01sep2017
No ratings yet
RSL FemaleVox G4 2014-OnlineEdition 01sep2017
60 pages
The Lennon Prophecy A New Examination of the Death Clues of The Beatles First Edition Joseph Niezgoda instant download
100% (1)
The Lennon Prophecy A New Examination of the Death Clues of The Beatles First Edition Joseph Niezgoda instant download
49 pages
Lesson - 9 - Week - 9 - OPERATE COMPUTERIZED RESERVATION SYSTEM (OR)
No ratings yet
Lesson - 9 - Week - 9 - OPERATE COMPUTERIZED RESERVATION SYSTEM (OR)
5 pages
MABD Notes
No ratings yet
MABD Notes
5 pages
specific heat capacity &amp- latent heat and other type numerical
No ratings yet
specific heat capacity &amp- latent heat and other type numerical
61 pages
FYBA Introduction To Literature
No ratings yet
FYBA Introduction To Literature
4 pages
Unit 4 - Turing Machineuu
No ratings yet
Unit 4 - Turing Machineuu
50 pages
DPG Degree College 8126785160XBJCBX CXSCBHJCBANMC SACBSAUHCGSAXAHJCBAMCA CACHC NMSACSAHMAC AJCBA JCHSAKJCSANM SNCSAHCBC SACM AGCJACBMNACNACN
100% (1)
DPG Degree College 8126785160XBJCBX CXSCBHJCBANMC SACBSAUHCGSAXAHJCBAMCA CACHC NMSACSAHMAC AJCBA JCHSAKJCSANM SNCSAHCBC SACM AGCJACBMNACNACN
10 pages
[32058454_48418999]Unit 3 Fascinating parks Discover Useful Structures 教学设计 -2024-2025学年高中英语人教版（2019）选择性必修第一册
No ratings yet
[32058454_48418999]Unit 3 Fascinating parks Discover Useful Structures 教学设计 -2024-2025学年高中英语人教版（2019）选择性必修第一册
4 pages

Uploaded by

Uploaded by

A PROJECT REPORT

In Partial Fulfillment of the Requirement for the Award of

BACHELOR’S DEGREE IN COMPUTER

SOMYA RAJ SINHA 1606309

SCHOOL OF COMPUTER ENGINEERING

SOMYA RAJ SINHA 1606309

Date: 05 /07 /19

SOMYA RAJ SINHA

School of Computer Engineering, KIIT, BBSR

School of Computer Engineering, KIIT, BBSR

School of Computer Engineering, KIIT, BBSR

School of Computer Engineering, KIIT, BBSR Page 1

School of Computer Engineering, KIIT, BBSR Page 4

School of Computer Engineering, KIIT, BBSR Page 5

Figure 1 : Rough System Architecture

3.5 Hardware Requirement

3.6 Software Requirement

3.6.1 TECHNICAL SPECIFICATION

Python3: Python is an interpreted high-level programming language for

Jupyter Notebook: The Jupyter Notebook is an open-source web application

School of Computer Engineering, KIIT, BBSR Page 7

School of Computer Engineering, KIIT, BBSR Page 8

T01 Accuracy Check on test data 76% 90%

School of Computer Engineering, KIIT, BBSR Page 9

6.2 DATA CLEANING :

6.2.1 Removing null values.

6.3 DATA PRE PROCESSING :

6.3.1 Handling categorical values.

6.4 DATA VISUALIZATION :

6.5 APPLYING LINEAR REGRESSION :

School of Computer Engineering, KIIT, BBSR Page 10

7.1 DataSet and Features

7.1.1 HOUSE AGE

7.1.2 HOUSE TYPE

7.1.4 Ground Living Area

7.2 OUTPUT LABEL

7.2.1 Sale Price

School of Computer Engineering, KIIT, BBSR

7.4 DATA VISUALIZATION AND FEATURE ENGINEERING

7.4.1 Plotting the graph of Sales vs Sales Price

7.4.3 Plotting the graph of House Age vs Sales Price

7.4.4 Plotting the graph of HouseType vs Sales Price

8.1 Graphs for Data Visualization

School of Computer Engineering, KIIT, BBSR Page 18

8.4 MODEL AND PREDICTION

9.2 Future Scope

 Balsamiq.cloud : wireframe used as image

NAME OF STUDENT Somya Raj Sinha

PROJECT TITLE Housing price prediction

NAME OF STUDENT Deepesh Rathore

PROJECT TITLE Housing price prediction

NAME OF STUDENT Swati Lall

PROJECT TITLE Housing price prediction

9. CONTRIBUTION FOR Problem statement,motivation for the same and Matplotlib.

NAME OF STUDENT Tushar

PROJECT TITLE Housing price prediction

11. CONTRIBUTION Applied Linear regression model.

12. CONTRIBUTION FOR Applicability of the data,conclusion of the prediction and

NAME OF STUDENT Nidhi Agrawal

PROJECT TITLE Housing price prediction

14. CONTRIBUTION Data cleaning and pre - processing.

You might also like