
Vishwaniketan’s

Institute of Management Entrepreneurship & Engineering Technology


[ViMEET]
Department of Computer Science & Engineering (AI & ML)

Data Warehousing & Mining Lab

LIST OF EXPERIMENTS

Sr. No.   Name of the Experiment

01   Construct Data Warehouse in SQL Server 2008
02   Construct Star Schema and Snowflake Schema
03   Implementation of OLAP cube using Analysis Services project in SQL Server 2008
04   Implement Classification via decision tree using WEKA tool
05   Implementation of Naive Bayesian Classifier in Java/Python
06   Implementation of K-Means Clustering algorithm in Java/Python
07   Implement Hierarchical Clustering using WEKA tool
08   Perform Association Rule Mining using WEKA tool
09   Implementation of PageRank algorithm
10   Implementation of HITS algorithm



EXPERIMENT NO. 01

AIM:- Construct data warehouse in SQL Server 2008

Problem Statement: Create a warehouse in SQL Server 2008 and import various databases from external sources such as Access/Excel/Text files by using the Data Transformation Services (DTS) tool.

Theory and Concept:


The Data Transformation Services (DTS) Import/Export Wizard offers the simplest
method of building a DTS package, interactively guiding you through the process of copying
and transferring data. Following are the basic steps for creating a package with the DTS
Import/Export Wizard.

Data Warehouse-
A data warehouse is a relational database that is designed for query and analysis
rather than for transaction processing. It usually contains historical data derived from
transaction data, but it can include data from other sources. It separates analysis workload
from transaction workload and enables an organization to consolidate data from several
sources.

In addition to a relational database, a data warehouse environment includes an extraction,
transportation, transformation, and loading (ETL) solution, an online analytical processing
(OLAP) engine, client analysis tools, and other applications that manage the process of
gathering data and delivering it to business users.
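
As an illustration of the ETL idea (separate from the DTS wizard used in this experiment), the short Python sketch below extracts rows from a text/CSV export, applies a small transformation, and loads them into a SQL Server table with pandas and SQLAlchemy. The file name, connection string, ODBC driver, and table name are all assumptions for the example.

import pandas as pd
from sqlalchemy import create_engine

# Extract: read a text/CSV export (pd.read_excel would handle an Excel source similarly)
sales = pd.read_csv("sales_extract.csv")                      # hypothetical source file

# Transform: light clean-up before loading
sales.columns = [c.strip().lower() for c in sales.columns]    # normalize column names
sales = sales.dropna(subset=["order_id"])                     # assumed key column

# Load: append the rows into a warehouse table in SQL Server
# (the connection string and ODBC driver name depend on the local setup)
engine = create_engine(
    "mssql+pyodbc://user:password@localhost/SalesDW"
    "?driver=ODBC+Driver+17+for+SQL+Server"
)
sales.to_sql("FactSales", engine, if_exists="append", index=False)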

1]Subject Oriented:

Data warehouses are designed to help you analyze data. For example, to learn more about your
company's sales data, you can build a warehouse that concentrates on sales. Using this
warehouse, you can answer questions like "Who was our best customer for this item last year?"
This ability to define a data warehouse by subject matter, sales in this case, makes the data
warehouse subject oriented.

2]Integrated:

Integration is closely related to subject orientation. Data warehouses must put data from
disparate sources into a consistent format. They must resolve such problems as naming conflicts
and inconsistencies among units of measure. When they achieve this, they are said to be
integrated.



3]Nonvolatile:

Nonvolatile means that, once entered into the warehouse, data should not change. This is logical
because the purpose of a warehouse is to enable you to analyze what has occurred.

4]Time Variant:

In order to discover trends in business, analysts need large amounts of data. This is very much
in contrast to online transaction processing (OLTP) systems, where performance requirements
demand that historical data be moved to an archive. A data warehouse's focus on change over
time is what is meant by the term time variant.

Output:

EXPERIMENT NO. 02

Title: Implementation of Star Schema and Snowflake Schema

Problem Statement:

Mumbai University wants to design a star schema to record grades for courses completed by
students. There are four dimension tables, namely Course Section, Student, Professor, and
Lecture, with attributes as follows:
Coursesection = Course_id, Sec_No, Course_Name, Units, Room_id, Room_capacity
Professor = Prof_id, Prof_Name, Dept_id, Dept_Name
Student = Stud_id, Stud_Name, Address, Con_No
Lecture = Sem_Id, Year, Class
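
A possible implementation of this star schema is sketched below in Python using the built-in sqlite3 module, purely for illustration (the DDL for SQL Server 2008 would be analogous). The fact table name Fact_Grade and its Grade_Point measure are assumptions, since the problem statement names only the dimension tables.

import sqlite3

conn = sqlite3.connect("grade_dw.db")
conn.executescript("""
-- dimension tables, attributes as given in the problem statement
CREATE TABLE CourseSection (Course_id INTEGER, Sec_No INTEGER, Course_Name TEXT,
                            Units INTEGER, Room_id INTEGER, Room_capacity INTEGER,
                            PRIMARY KEY (Course_id, Sec_No));
CREATE TABLE Professor     (Prof_id INTEGER PRIMARY KEY, Prof_Name TEXT,
                            Dept_id INTEGER, Dept_Name TEXT);
CREATE TABLE Student       (Stud_id INTEGER PRIMARY KEY, Stud_Name TEXT,
                            Address TEXT, Con_No TEXT);
CREATE TABLE Lecture       (Sem_Id INTEGER PRIMARY KEY, Year INTEGER, Class TEXT);

-- assumed fact table: foreign keys to the four dimensions plus the grade measure
CREATE TABLE Fact_Grade (
    Course_id INTEGER, Sec_No INTEGER,
    Prof_id   INTEGER REFERENCES Professor(Prof_id),
    Stud_id   INTEGER REFERENCES Student(Stud_id),
    Sem_Id    INTEGER REFERENCES Lecture(Sem_Id),
    Grade_Point REAL,
    FOREIGN KEY (Course_id, Sec_No) REFERENCES CourseSection(Course_id, Sec_No)
);
""")
conn.commit()

Each row of Fact_Grade records one grade and is joined to the four dimension tables through its foreign keys, which is exactly the star arrangement described below.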

Theory and Concept:

A schema is a logical description of the entire database. It includes the name and description of
records of all record types, including all associated data items and aggregates.
A fact table works with dimension tables. The fact table holds the data to be analyzed, and the
dimension tables store data about the ways in which the data in the fact table can be analyzed.
Thus, the fact table consists of two types of columns: the foreign key columns allow joins with
the dimension tables, and the measure columns contain the data that is being analyzed.

Star Schema

• Each dimension in a star schema is represented with only one dimension table.
• This dimension table contains the set of attributes.
• The following diagram shows the sales data of a company with respect to the four
  dimensions, namely time, item, branch, and location.
• There is a fact table at the center. It contains the keys to each of the four dimensions.
• The fact table also contains the attributes, namely dollars sold and units sold.



Output:



Snowflake Schema

• Some dimension tables in the snowflake schema are normalized.
• The normalization splits up the data into additional tables.
• Unlike the star schema, the dimension tables in a snowflake schema are normalized. For
  example, the item dimension table in the star schema is normalized and split into two
  dimension tables, namely the item and supplier tables.
• Now the item dimension table contains the attributes item_key, item_name, type,
  brand, and supplier_key.
• The supplier_key is linked to the supplier dimension table. The supplier dimension table
  contains the attributes supplier_key and supplier_type; a brief DDL sketch of this split is
  given below.
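
The split described in the list above can be sketched as follows (same illustrative Python/sqlite3 style as before; the database name is an assumption):

import sqlite3

conn = sqlite3.connect("sales_snowflake.db")    # assumed database name
conn.executescript("""
-- the supplier attributes are split out of the item dimension into their own table
CREATE TABLE Supplier (
    supplier_key  INTEGER PRIMARY KEY,
    supplier_type TEXT
);

-- the item dimension now keeps only item attributes plus a foreign key to Supplier
CREATE TABLE Item (
    item_key     INTEGER PRIMARY KEY,
    item_name    TEXT,
    type         TEXT,
    brand        TEXT,
    supplier_key INTEGER REFERENCES Supplier(supplier_key)
);
""")
conn.commit()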

Output:

EXPERIMENT NO. 03

Title: Implementation of OLAP cube using analysis services project in SQL 2008.

Problem Statement: A car manufacturing company has a sales department. Consider suitable
dimension tables and a fact table, and create an OLAP cube for the car manufacturing company.

Theory and Concept:

An Online Analytical Processing (OLAP) server is based on the multidimensional data model. It
allows managers and analysts to gain insight into information through fast, consistent, and
interactive access to it. This experiment covers the types of OLAP servers, operations on OLAP,
and the differences between OLAP, statistical databases, and OLTP.

Types of OLAP Servers

We have four types of OLAP servers:

• Relational OLAP (ROLAP)
• Multidimensional OLAP (MOLAP)
• Hybrid OLAP (HOLAP)
• Specialized SQL Servers

Relational OLAP

ROLAP servers are placed between the relational back-end server and the client front-end tools.
To store and manage warehouse data, ROLAP uses a relational or extended-relational DBMS
(a small query sketch is given after the list below).

ROLAP includes the following:

• Implementation of aggregation navigation logic.
• Optimization for each DBMS back end.
• Additional tools and services.
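
Because a ROLAP server answers analytical queries directly against the relational store, a typical aggregation is simply a join-and-GROUP-BY over the fact and dimension tables. The Python sketch below runs such a query against the grade warehouse assumed in the Experiment 2 sketch (the file, table, and column names come from that assumed schema):

import sqlite3

# the warehouse file created in the Experiment 2 sketch (an assumption for this example)
conn = sqlite3.connect("grade_dw.db")
query = """
SELECT c.Course_Name, l.Year, AVG(f.Grade_Point) AS avg_grade
FROM Fact_Grade AS f
JOIN CourseSection AS c ON f.Course_id = c.Course_id AND f.Sec_No = c.Sec_No
JOIN Lecture       AS l ON f.Sem_Id = l.Sem_Id
GROUP BY c.Course_Name, l.Year;
"""
for course_name, year, avg_grade in conn.execute(query):
    print(course_name, year, avg_grade)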



Multidimensional OLAP

MOLAP uses array-based multidimensional storage engines for multidimensional views of data.
With multidimensional data stores, the storage utilization may be low if the data set is sparse.
Therefore, many MOLAP servers use two levels of data storage representation to handle dense
and sparse data sets.

Hybrid OLAP (HOLAP)

Hybrid OLAP is a combination of both ROLAP and MOLAP. It offers the higher scalability of
ROLAP and the faster computation of MOLAP. HOLAP servers can store large volumes of
detailed information in the relational store, while the aggregations are kept separately in a
MOLAP store.
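
To connect these ideas to the car-manufacturing problem statement, the sketch below approximates a small cube in Python with a pandas pivot table. The DataFrame columns (model, region, quarter and the two measures) are assumptions for the example; an Analysis Services cube in SQL Server 2008 plays the same role at full scale.

import pandas as pd

# assumed car-sales fact data: model, region and quarter are the dimensions,
# units_sold and revenue are the measures
sales = pd.DataFrame({
    "model":      ["Hatch", "Hatch", "Sedan", "Sedan", "SUV", "SUV"],
    "region":     ["West",  "North", "West",  "North", "West", "North"],
    "quarter":    ["Q1",    "Q1",    "Q2",    "Q2",    "Q1",   "Q2"],
    "units_sold": [120,     90,      75,      60,      40,     55],
    "revenue":    [6.0,     4.5,     6.8,     5.4,     5.2,    7.1],   # assumed units (lakhs)
})

# a cube-like roll-up: measures aggregated over the model x region dimensions
cube = sales.pivot_table(index="model", columns="region",
                         values=["units_sold", "revenue"],
                         aggfunc="sum", margins=True)   # margins add the 'All' totals
print(cube)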

Output:

EXPERIMENT NO. 04

Aim: Classification via decision tree using WEKA tool.

Theory and Concept:


Classification models predict categorical class labels, while prediction models predict
continuous-valued functions. For example, we can build a classification model to categorize
bank loan applications as either safe or risky, or a prediction model to predict the expenditures
in dollars of potential customers on computer equipment, given their income and occupation.

Fig 1. Data set Screenshot



Fig 2. Data Preprocess

Selecting Classifier

At the top of the Classify section is the Classifier box. This box has a text field that gives
the name of the currently selected classifier and its options. Clicking on the text box with
the left mouse button brings up a Generic Object Editor dialog box, just the same as for
filters, which you can use to configure the options of the current classifier. With a right click
(or Alt+Shift+left click) you can once again copy the setup string to the clipboard or display
the properties in a Generic Object Editor dialog box. The Choose button allows you to
choose one of the classifiers that are available in WEKA.



Fig 3. Classifier Selection Screenshot

Test Options

The result of applying the chosen classifier will be tested according to the options that are
set by clicking in the Test options box. There are four test modes:

1. Use training set. The classifier is evaluated on how well it predicts the class of
the instances it was trained on.

2. Supplied test set. The classifier is evaluated on how well it predicts the class of
a set of instances loaded from a file. Clicking the Set... button brings up a dialog
allowing you to choose the file to test on.

3. Cross-validation. The classifier is evaluated by cross-validation, using the
number of folds entered in the Folds text field.

4. Percentage split. The classifier is evaluated on how well it predicts a certain
percentage of the data, which is held out for testing. The amount of data held out
depends on the value entered in the % field (a small illustration of modes 3 and 4
is given after this list).
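
WEKA runs these test modes from the Explorer GUI; purely as an illustration of modes 3 and 4 outside WEKA, the following Python sketch uses scikit-learn's decision tree (not WEKA's J48) on the bundled Iris sample:

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
tree = DecisionTreeClassifier(random_state=0)

# test mode 3: 10-fold cross-validation (like the Folds field in WEKA)
scores = cross_val_score(tree, X, y, cv=10)
print("10-fold CV accuracy:", scores.mean())

# test mode 4: percentage split, e.g. 66% train / 34% test (like the % field)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.34, random_state=0)
print("Percentage-split accuracy:", tree.fit(X_tr, y_tr).score(X_te, y_te))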

Classifier Evaluation:

Fig 4. Classification



Fig 5. Decision Tree

Classifier Rules:

Conclusion: Thus, we have implemented classification via decision tree using the WEKA tool.



EXPERIMENT NO. 05

Aim: Implementation of Naive Bayesian Classifier in Java/Python

Theory and Concept:


Classification models predict categorical class labels, while prediction models predict
continuous-valued functions. For example, we can build a classification model to categorize
bank loan applications as either safe or risky, or a prediction model to predict the expenditures
in dollars of potential customers on computer equipment, given their income and occupation.
Naive Bayesian Classifier: a Naive Bayesian classifier applies Bayes' theorem with the
assumption that attribute values are conditionally independent of one another given the class
label, and assigns a new tuple to the class with the highest posterior probability.

Algorithmic steps for Naive Bayesian Classifier:

1. From the training data, estimate the prior probability P(C) of each class C.
2. For each attribute, estimate the conditional probability P(x_k | C) of every attribute value
   given each class.
3. For a new tuple X = (x_1, ..., x_n), compute P(C) · P(x_1 | C) · ... · P(x_n | C) for every
   class, using the assumption of conditional independence.
4. Assign X to the class that maximizes this product (the maximum posterior).

Program:
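
Since the program itself is left to the student, a minimal Python sketch of a categorical Naive Bayesian classifier is given below, following the algorithmic steps above. The toy training data and the use of Laplace smoothing are assumptions for the example.

from collections import Counter, defaultdict

def train_naive_bayes(rows, labels):
    """Steps 1 and 2: estimate priors P(C) and per-attribute value counts for each class."""
    class_totals = Counter(labels)
    priors = {c: class_totals[c] / len(labels) for c in class_totals}
    counts = defaultdict(lambda: defaultdict(Counter))   # counts[class][attribute][value]
    values = defaultdict(set)                            # distinct values seen per attribute
    for row, c in zip(rows, labels):
        for i, value in enumerate(row):
            counts[c][i][value] += 1
            values[i].add(value)
    return priors, counts, class_totals, values

def predict(model, row, alpha=1.0):
    """Steps 3 and 4: score every class with P(C) * prod P(x_k|C) and pick the maximum."""
    priors, counts, class_totals, values = model
    best_class, best_score = None, -1.0
    for c, prior in priors.items():
        score = prior
        for i, value in enumerate(row):
            # Laplace-smoothed conditional probability P(value | class) for attribute i
            score *= (counts[c][i][value] + alpha) / (class_totals[c] + alpha * len(values[i]))
        if score > best_score:
            best_class, best_score = c, score
    return best_class

# assumed toy data: (outlook, humidity) -> play?
X = [("sunny", "high"), ("sunny", "normal"), ("rain", "high"), ("overcast", "normal")]
y = ["no", "yes", "no", "yes"]
model = train_naive_bayes(X, y)
print(predict(model, ("sunny", "high")))   # prints "no" for this toy data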

Output:

Conclusion: Thus, we have implemented the Naive Bayesian classifier.
