Data Mining & Data Warehousing
DEFINITION
The Data Mining process is the extraction of
valid and previously unknown information.
OR
Data mining, the extraction of hidden
predictive information from large databases,
is a powerful new technology with great
potential to help companies focus on the
most important information in their data
warehouses.
Why Do We Need Data Mining?
To handle the bulk of data held by enterprises
and thereby increase margins.
To turn incomprehensible data into usable
information.
It is a combination of ideas from statistics,
given period.
Whether dropping a sales line would yield more profit.
Uses techniques such as regression and
correlation.
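As a minimal sketch of the two techniques named above, the following computes a Pearson correlation and a least-squares regression line in plain Python. The advertising and sales figures, and the function names, are hypothetical and chosen only for illustration.

```python
from math import sqrt


def pearson_correlation(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def linear_regression(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: advertising spend versus sales over five periods.
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [12.0, 15.0, 15.0, 19.0, 20.0]

r = pearson_correlation(ad_spend, sales)
slope, intercept = linear_regression(ad_spend, sales)
print(f"correlation={r:.3f}, slope={slope:.2f}, intercept={intercept:.2f}")
```

A correlation close to 1 suggests the two series move together, and the fitted slope estimates how much sales change per unit of spend. Neither result establishes cause on its own.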
Identification
Data patterns used to identify the existence of
an item, an event or an activity.
Intruders trying to break the computer
summarization, classification, regression,
association, clustering.
Choosing the mining algorithms.
Data mining: search for patterns of interest.
Pattern evaluation and knowledge presentation.
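The last three steps above can be sketched in a few lines of Python. The sketch assumes a simple association task: counting which item pairs occur together, keeping those above a minimum support, and formatting them for presentation. The transactions, the function names and the 40% threshold are hypothetical, and pair counting stands in for whichever mining algorithm is actually chosen.

```python
from collections import Counter
from itertools import combinations

# Hypothetical, already selected and cleaned market-basket transactions.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "butter"],
    ["bread", "milk", "eggs"],
]


def mine_pairs(baskets):
    """Data mining step: count how often each item pair occurs together."""
    counts = Counter()
    for basket in baskets:
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts


def evaluate(counts, n_baskets, min_support):
    """Pattern evaluation step: keep pairs whose support meets the threshold."""
    return {
        pair: count / n_baskets
        for pair, count in counts.items()
        if count / n_baskets >= min_support
    }


def present(patterns):
    """Knowledge presentation step: readable lines, strongest pattern first."""
    ordered = sorted(patterns.items(), key=lambda kv: (-kv[1], kv[0]))
    return [f"{a} + {b}: support {support:.0%}" for (a, b), support in ordered]


pair_counts = mine_pairs(transactions)
patterns = evaluate(pair_counts, len(transactions), min_support=0.4)
for line in present(patterns):
    print(line)
```

Raising `min_support` keeps only the more frequent patterns. Lowering it yields more candidates that then need closer evaluation.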
[Figure: Process of Data Mining. Labels: databases, data integration, data warehouse, selected data, transformed data, extracted data, assimilation, pattern evaluation.]
What product promotions have the biggest impact on revenue?
What impact will new products/services have on revenue & margins?
Which customers are most likely to go to the competition?
WHAT IS A DATA WAREHOUSE?
DATA COLLECTED FROM ONE OR MANY
SYSTEMS THAT EXIST WITHIN AND OUTSIDE
THE ORGANIZATION. THE DATA IS
STRUCTURED IN SUCH A WAY AS TO REDUCE
THE AMOUNT OF TIME THAT IT TAKES TO
PRODUCE RELIABLE INFORMATION.
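A minimal sketch of the idea, assuming two hypothetical source systems (a store system and a web shop) that record sales in different layouts. Each record is converted to one common structure and loaded into a single table, here an in-memory SQLite database standing in for the warehouse. All table, field and function names are illustrative.

```python
import sqlite3
from datetime import datetime

# Hypothetical extracts from two source systems with different layouts.
store_sales = [
    ("2023-01-05", "widget", 3, 9.99),   # day, product, quantity, unit price
    ("2023-01-06", "gadget", 1, 24.50),
]
web_orders = [
    {"date": "05/01/2023", "sku": "WIDGET", "units": 2, "price_cents": 999},
    {"date": "07/01/2023", "sku": "GADGET", "units": 4, "price_cents": 2450},
]


def from_store(row):
    """Convert a store-system row to the common warehouse layout."""
    day, product, quantity, unit_price = row
    return (day, product, quantity, round(quantity * unit_price * 100), "store")


def from_web(order):
    """Convert a web order to the common layout (ISO date, lower-case product)."""
    day = datetime.strptime(order["date"], "%d/%m/%Y").strftime("%Y-%m-%d")
    revenue_cents = order["units"] * order["price_cents"]
    return (day, order["sku"].lower(), order["units"], revenue_cents, "web")


warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE sales "
    "(day TEXT, product TEXT, quantity INTEGER, revenue_cents INTEGER, source TEXT)"
)
rows = [from_store(r) for r in store_sales] + [from_web(o) for o in web_orders]
warehouse.executemany("INSERT INTO sales VALUES (?, ?, ?, ?, ?)", rows)

# With one consistent structure, a cross-system question is a single query.
revenue_by_product = dict(
    warehouse.execute("SELECT product, SUM(revenue_cents) FROM sales GROUP BY product")
)
print(revenue_by_product)
```

Storing revenue as integer cents avoids floating-point drift when totals are summed. A real warehouse would add many more cleaning and validation rules than this sketch shows.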
WHY DO WE NEED DATA
WAREHOUSING?
It has both hardware and software
components, which facilitate better
decision-making in large companies.
To provide a consistent common source for
corporate information.
To store large volumes of historical detail