Data warehousing and OLAP technology.ppt

12
An overview of Data Warehousing and OLAP Technology Surajit Chaudhuri Umeshwar Dayal Microsoft Reserch, Redmond HP Labs, Palo Alto Presented  by:- Krishma Dutta

Transcript of Data warehousing and OLAP technology.ppt

Page 1: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 1/12

An overview of Data Warehousingand OLAP Technology

Surajit Chaudhuri Umeshwar DayalMicrosoft Reserch, Redmond HP Labs, Palo Alto

Presented  by:- Krishma Dutta

Page 2: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 2/12

Outline

Introduction

Need of Data Warehousing and OLAP

Architecture of Data Warehousing

Front-Back End Tools

Database Design Methodology

Conclusion

Page 3: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 3/12

Data Warehousing- An Introduction

Defined in many different ways:

In simplest terms Data Warehouse can be defined

as collection of Data marts

A data warehouse is a “subject-oriented, integrated,time-variant, and nonvolatile collection of data in

support of management’s decision-making

 process.”— W. H. Inmon

A data warehousing is a collection of decision

support technologies, aimed at enabling the

knowledge worker to make better decisions

Page 4: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 4/12

Need of Data Warehousing and OLAP

DataWarehousing

OLAP

Decision support requires historical

data which operational Databases do

not typically maintain

Decision support requires

consolidation (aggregation,

summarization) of data

Unacceptable Performance while

execution of complex OLAP queries

Multidimensional data model is not

supported by DBMS

Page 5: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 5/12

Tiered Architecture

Extract

Transform

Load

Refresh

Data Sources

Operational

Databases

External

Sources 

Data

Warehouse

Data Marts 

Data Storage

Tier1:

Data Warehouse

Server 

Serve

OLAP Engine

OLAP Server 

Tier2:

OLAP

Server 

Analysis

Query/Reports

Data mining

Front-End Tools

Tier3:

Clients

Page 6: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 6/12

Front-Back End Tools

Front End Tools Rollup (Drill-up)

Drill-down (Roll-down)

Slice and dice

Back End Tools Data Cleaning

Load

Refresh

Bob Jamie Brit Todd

Household

Automobile

Kitchen

Office

Montreal

Toronto

Page 7: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 7/12

Conceptual Model

Total annual salesof TV in U.S.A.

Date

   C  o  u  n   t  r

  y sum

 sum

TV

PVR PC

1 2 3 4

U.S.A

Canada

Mexico

 sum

 

ALL

Page 8: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 8/12

Database Design Methodology

DatabaseDesigns

Star Schema Snowflake Schema

A fact tablein the

middle

connected to

a set of dimension

tables

A refinement of star 

schema where

hierarchy is

normalized into a setof smaller dimension

tables, forming a shape

similar to snowflake

Page 9: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 9/12

Star Schema

Star Schema

location_keystreet

city

province

country

location

Time_key

Item_key

Branch_key

Location_key

Units_sold

Dollars_sold

Avg_sales   M  e  a  s  u  r  e  s

B_key

B_name

B_type

Branch

I_key

I_name

I_brand

I_type

I_supplier_type

item

T_keyT_day

T_day_week 

T_month

T_quarter

T_year

Time

Sales Fact Table

Page 10: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 10/12

Snowflake Schema

Snowflake Schema

location_keystreet

city

Location

time_key

item_key

 branch_key

location_key

units_sold

dollars_sold

avg_sales   M  e  a  s  u  r  e  s

B_key

B_name

B_type

Branch

I_key

I_nameI_brand

I_type

I_supplier_type

Item

T_key

T_day

T_day_week 

T_month

T_quarter

T_year

Time

Sales Fact Table

S_keyS_type

Supplier

C_key

C_city

C_province

C_country

City

Page 11: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 11/12

Summary

Data warehouse

A subject-oriented, integrated, time-variant, and nonvolatile

collection of data in support of management’s decision-making

 process

Architecture of Date warehouseConsisting of Warehouse servers, front end and back end tools

OLAP operations: drilling, rolling, slicing, dicing and pivoting

Multi dimensional model of Data warehouse

Data cube

Star Schema

Snowflake Schema

Page 12: Data warehousing and OLAP technology.ppt

7/27/2019 Data warehousing and OLAP technology.ppt

http://slidepdf.com/reader/full/data-warehousing-and-olap-technologyppt 12/12Thank You