Defining Data Warehouse Concepts and Terminology Chapter 3.

30
Defining Data Warehouse Concepts and Terminology Chapter 3
  • date post

    15-Jan-2016
  • Category

    Documents

  • view

    219
  • download

    0

Transcript of Defining Data Warehouse Concepts and Terminology Chapter 3.

Page 1: Defining Data Warehouse Concepts and Terminology Chapter 3.

Defining Data Warehouse Concepts and Terminology

Chapter 3

Page 2: Defining Data Warehouse Concepts and Terminology Chapter 3.

Definition of a Data Warehouse

“ An enterprise structured repository of subject-oriented, time-variant, historical data used for information retrieval and decision support. The data warehouse stores atomic and summary data.”

Oracle Data Warehouse Method

Page 3: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Warehouse Properties

DataWarehouse

Integrated

Time VariantNon Volatile

SubjectOriented

Page 4: Defining Data Warehouse Concepts and Terminology Chapter 3.

Subject-Oriented

Data is categorized and stored by business subjectrather than by application

EquityPlans Shares Customer

financialinformation

Savings

Insurance

Loans

OLTP Applications Data Warehouse Subject

Page 5: Defining Data Warehouse Concepts and Terminology Chapter 3.

Integrated

OLTP Applications

SavingsSavings

Currentaccounts

Currentaccounts

LoansLoans

Data Warehouse

Data on a given subject is defined and stored once.

Customer

Page 6: Defining Data Warehouse Concepts and Terminology Chapter 3.

Time-Variant

Data is stored as a series of snapshots, each representing a period of time

Time DataJan-97 JanuaryFeb-97 FebruaryMar-97 March

Page 7: Defining Data Warehouse Concepts and Terminology Chapter 3.

Nonvolatile

Typically data in the data warehouse is not updated or delelted.

Insert UpdateDelete

Read Read

Operational Warehouse

Load

Page 8: Defining Data Warehouse Concepts and Terminology Chapter 3.

Changing Data

Warehouse Database

First time load

Refresh

Refresh

Refresh

Operational Database

Page 9: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Warehouse Versus OLTP

PropertyResponseTime

Operations

Nature of Data

Data Organization

Size

Data Source

Activities

OperationalSub seconds to seconds

DML

30-60 days

Applications

Small to large

Operational, Internal

Processes

Data Warehouse

Seconds to hours

Snapshots over time

Subject, time

Large to very large

Operational, Internal,External

Analysis

Primarily read only

Page 10: Defining Data Warehouse Concepts and Terminology Chapter 3.

Usage Curves

Operational system is predictableData warehouse - Variable - Random

Page 11: Defining Data Warehouse Concepts and Terminology Chapter 3.

User Expectations

Control expectationsSet achievable targets for query

responseSet SLAsEducateGrowth and use is exponential

Page 12: Defining Data Warehouse Concepts and Terminology Chapter 3.

Enterprisewide Warehouse

Large scale implementationScope the entire businessData from all subject areasDeveloped incrementallySingle source of enterprisewide dataSingle distribution point to

dependent data marts

Page 13: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Warehouses Versus Data Marts

Property Data Warehouse Data MartScope Enterprise DepartmentSubject Multiple Single-subject, LOBData Source Many FewSize(typical) 100 GB to>1 TB <100 GBImplementation time Months to years Months

DataWarehouse

DataWarehouse

DataMartDataMart

Page 14: Defining Data Warehouse Concepts and Terminology Chapter 3.

Dependent Data Mart

MarketingSales

FinanceHuman Resources

MarketingSales

FinanceHuman Resources

MarketingMarketing

MarketingMarketing

MarketingMarketing

External Data

DataWarehouse

OperationalSystems

Flat Files

Data Marts

Page 15: Defining Data Warehouse Concepts and Terminology Chapter 3.

Independent Data Mart

OperationalSystems

External Data

Sale or Marketing

Flat Files

Page 16: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Warehouse TerminologyOperational data store (ODS) Stores tactical data from production

systems that are subject-oriented and integrated to address operational needs

Metadata

MetadataMetadata

Page 17: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Warehouse Terminology

DataIntegration

Enterprise data warehouse

Business areawarehouse

Source data

Architecture

Page 18: Defining Data Warehouse Concepts and Terminology Chapter 3.

Methodolgy

Ensures a successful data warehouseEncourages incremental developmentProvides a staged approach to an

enterprisewide warehouse - Safe - Manageable - Proven - Recommended

Page 19: Defining Data Warehouse Concepts and Terminology Chapter 3.

Modeling

Warehouses differ from operational structures: - Analytical requirements - Subject orientationData must map to subject oriented information: - Identify business subjects - Define relationships between subjects - Name the attributes of each subjectModeling is iterativeModeling tools are available

Page 20: Defining Data Warehouse Concepts and Terminology Chapter 3.

Extraction, Transformation, and Transportation

Purchase specialist tools, or develop programsExtraction-- select data using different methodsTransformation--validate, clean, integrate, and

time stamp dataTransportation--move data into the warehouse

OLTP Databases Staging File Warehouse Database

Page 21: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Management

Efficient database server and management tools for all aspects of data management

Imperatives - Productive - Flexible - Robust - EfficientHardware, operating system and network

management

Page 22: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Access and Reporting

Tools that retrieve data for business analysis Imperatives - Ease of use - Intuitive - Metadata - Training More than one tool may be required

WarehouseDatabase

Simple Queries

Forecasting

Drill-down

Page 23: Defining Data Warehouse Concepts and Terminology Chapter 3.

Oracle Warehouse Components

Relational / Multidimensional

Text, image Spatial

Web Audio video

Externaldata

Operational data

Relational tools

OLAP tools

Applications/Web

Any DataAny Source Any Access

Page 24: Defining Data Warehouse Concepts and Terminology Chapter 3.

Oracle Data Mart Suite

Data ModelingOracle Data Mart Designer

OLTPEngines

OLTPDatabases

DataExtraction

Oracle Data MartBuilder

Ware-housingEngines

Data MartDatabase

SQL*Plus

DataManagement

Oracle EnterpriseManager

Data Access& Analysis

Discoverer &Oracle Reports

Page 25: Defining Data Warehouse Concepts and Terminology Chapter 3.

Data Mart Implementation with the Oracle Data Mart Suite

Oracle Enterprise ServerOracle Enterprise ManagerOracle Data Mart BuilderOracle Data Mart DesignerOracle DiscovererOracle Web Application ServerOracle Reports

Page 26: Defining Data Warehouse Concepts and Terminology Chapter 3.

Oracle Warehouse Builder Architecture

Sources

ExtractionFacilities• Loader• Remotes SQL• Gateways - OLE-DB/ODBC - Mainframe - Specialized• ERP Data - SAP - Peoplesoft - Oracle

PL/SQL, JavaTransforms

TransformDriver

PL/SQL, JavaWrapper

ExternalFunctions

TargetTablesFilter

Transform

Oracle 8i

Page 27: Defining Data Warehouse Concepts and Terminology Chapter 3.

Oracle Business Intelligence Tools

Current Tactical Strategic

IS developsuser’s Views Business users Analysis

Oracle Reports Oracle Discover Oracle Express

Page 28: Defining Data Warehouse Concepts and Terminology Chapter 3.

The Tool for Each Task

Tool

OracleReports

OracleDiscover

OracleExpress

Productionreporting

Ad hocquery and analysis

Advancedanalysis

Question

What were sales byregion last quarter?

What is driving the increase in NorthAmerican sales?

Given the rapid increasein Web sales, what willtotal sales be for the restof the year?

Task

Page 29: Defining Data Warehouse Concepts and Terminology Chapter 3.

Oracle Warehouse Services

OracleEducation

OracleConsulting

Oracle Support Services

Customers

Page 30: Defining Data Warehouse Concepts and Terminology Chapter 3.

SummaryThis lesson covered the following topics:Identifying a common, broadly accepted definition

of the data warehouseDistinguishing the differences between OLTP

systems and analytical systemsDefining some of the common data warehouse

terminologyIdentifying some of the elements and processes in

a data warehouseIdentifying and positioning the Oracle Warehouse

vision, products, and services