DWM Lab Manual
DWM Lab Manual
LIST OF EXPERIMENTS
Problem Statement : Create a warehouse in SQL Server 2008 & import various databases
from external sources such as Access /Excel /Text file by using Data Transformation Services
(DTS) tool.
Data Warehouse-
A data warehouse is a relational database that is designed for query and analysis
rather than for transaction processing. It usually contains historical data derived from
transaction data, but it can include data from other sources. It separates analysis workload
from transaction workload and enables an organization to consolidate data from several
sources.
1]Subject Oriented:
Data warehouses are designed to help you analyze data. For example, to learn more about your
company's sales data, you can build a warehouse that concentrates on sales. Using this
warehouse, you can answer questions like "Who was our best customer for this item last year?"
This ability to define a data warehouse by subject matter, sales in this case, makes the data
warehouse subject oriented.
2]Integrated:
Integration is closely related to subject orientation. Data warehouses must put data from
disparate sources into a consistent format. They must resolve such problems as naming conflicts
and inconsistencies among units of measure. When they achieve this, they are said to be
integrated.
Nonvolatile means that, once entered into the warehouse, data should not change. This is logical
because the purpose of a warehouse is to enable you to analyze what has occurred.
4]Time Variant:
In order to discover trends in business, analysts need large amounts of data. This is very much
in contrast to online transaction processing (OLTP) systems, where performance requirements
demand that historical data be moved to an archive. A data warehouse's focus on change over
time is what is meant by the term time variant.
Output:
Problem Statement:
The Mumbai university wants to design star schema to record grade for course completed by
the student. There are four dimensional table namely Course section, student, professor, lecture
with the attributes as follows
Coursesection=Course_id,Sec_No,Course_Name,Units,Room_id,Room_capacity
Professor=Prof_id,Prof_Name,Dept_id,Dept_NameStudent=Stud_id,Stud_Name,Address,Co
n_No Lecture=Sem_Id,year,Class
Schema is a logical description of the entire database. It includes the name and description of
records of all record types including all associated data-items and aggregates.
A fact table works with dimension tables. A fact table holds the data to be analyzed, and a
dimension table stores data about the ways in which the data in the fact table can be analyzed.
Thus, the fact table consists of two types of columns. The foreign keys column allows joins
with dimension tables, and the measures columns contain the data that is being analyzed.
Star Schema
Output:
Title: Implementation of OLAP cube using analysis services project in SQL 2008.
Problem Statement: All car manufacturing company have sales department. Consider
dimension tables and fact table. Create OLAP for car manufacturing company.
Online Analytical Processing Server (OLAP) is based on the multidimensional data model. It
allows managers, and analysts to get an insight of the information through fast, consistent, and
interactive access to information. This chapter covers the types of OLAP, operations on OLAP,
difference between OLAP, and statistical databases and OLTP
Relational OLAP
OLAP servers are placed between relational back-end server and client front-end tools.
To store and manage warehouse data, ROLAP uses relational or extended-relational
DBMS.
Hybrid OLAP is a combination of both ROLAP and MOLAP. It offers higher scalability of
ROLAP and faster computation of MOLAP. HOLAP servers allows to store the large data
volumes of detailed information. The aggregations are stored separately in MOLAP store.
Output:
Selecting Classifier
At the top of the classify section is the Classifier box. This box has a text field that gives
the name of the currently selected classifier, and its options. Clicking on the text box with
the left mouse button brings up a Generic Object Editor dialog box, just the same as for
filters, that you can use to configure the options of the current classifier. With a right click
(orAlt+Shift+leftclick) you can once again copy the setup string to the clipboard or display
the properties in a Generic Object Editor dialog box. The Choose button allows you to
choose one of the classifiers that are available in WEKA.
Test Options
The result of applying the chosen classifier will be tested according to the options that are
set by clicking in the Test options box. There are fourtest modes:
1. Use training set. The classifier is evaluated on how well it predicts the class of
the instances it was trained on.
2. Supplied test set. The classifier is evaluated on how well it predicts the class of
asset if instances loaded from a file. Clicking the Set...button bring supadialog
allowing you to choose the file to test on.
Classifier Evaluation:
Fig 4. Classification
Classifier Rules:
Program:
Output: