The Importance of Data Analysis in Producing a Robust Physical Data Model

55
The Importance of Data Analysis in Producing a Robust Physical Data Model By Declan Chellar

description

This slideshow is an introduction to the importance of data analysis in producing a robust physical data model and the hierarchy of the levels of analysis. The intended audience is software requirements analysts.

Transcript of The Importance of Data Analysis in Producing a Robust Physical Data Model

Page 1: The Importance of Data Analysis in Producing a Robust Physical Data Model

The Importance of Data Analysis in Producing a Robust Physical Data Model

By Declan Chellar

Page 2: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

When “data modelling” is mentioned on

projects…

Page 3: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Physical Data Model

When “data modelling” is mentioned on

projects…

…too many people only think of the

physical data model, DB tables, etc.

Page 4: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Physical Data Model

When “data modelling” is mentioned on

projects…

…too many people only think of the

physical data model, DB tables, etc.

But what about the data analysis that leads to a robust

physical data model?

Page 5: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Conceptual Data Model• A fundamental part of the

information architecture

Page 6: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Conceptual Data Model• A fundamental part of the

information architecture

• Identifies the business entities and shows the relationships between them

Page 7: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Conceptual Data Model• A fundamental part of the

information architecture

• Identifies the business entities and shows the relationships between them

• An essential complement to the business architecture

Page 8: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Conceptual Data Model• A fundamental part of the

information architecture

• Identifies the business entities and shows the relationships between them

• An essential complement to the business architecture

• Ought to be in place before any related software project even starts!

Page 9: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Conceptual Data Model• A fundamental part of the

information architecture

• Identifies the business entities and shows the relationships between them

• An essential complement to the business architecture

• Ought to be in place before any related software project even starts

• In reality, often missing altogether

Page 10: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Conceptual Data Model• A fundamental part of the

information architecture

Page 11: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Conceptual Data Model• A fundamental part of the

information architecture

• Expands upon the CDM by identifying the attributes of each entity and the keys to each relationship

Page 12: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Conceptual Data Model• A fundamental part of the

information architecture

• Expands upon the CDM by identifying the attributes of each entity and the keys to each relationship

• Normalised to reduce redundancy

Page 13: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Conceptual Data Model• A fundamental part of the

information architecture

• Expands upon the CDM by identifying the attributes of each entity and the keys to each relationship

• Normalised to reduce redundancy

• Ideally in place before any related software project even starts

Page 14: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Conceptual Data Model• A fundamental part of the

information architecture

• Expands upon the CDM by identifying the attributes of each entity and the keys to each relationship

• Normalised to reduce redundancy

• Ideally in place before any related software project even starts

• In reality, often missing altogether

Page 15: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Conceptual Data Model• A fundamental part of the

information architecture

Page 16: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Conceptual Data Model• A fundamental part of the

information architecture

• Provides the essential business definitions for each attribute identified on the LDM

Page 17: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Conceptual Data Model• A fundamental part of the

information architecture

• Provides the essential business definitions for each attribute identified on the LDM

• Traceable back to the LDM

Page 18: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Conceptual Data Model• A fundamental part of the

information architecture

• Provides the essential business definitions for each attribute identified on the LDM

• Traceable back to the LDM

• Ideally in place before any related software project even starts

Page 19: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Conceptual Data Model• A fundamental part of the

information architecture

• Provides the essential business definitions for each attribute identified on the LDM

• Traceable back to the LDM

• Ideally in place before any related software project even starts

• Can be enhanced iteratively throughout requirements gathering

Page 20: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Conceptual Data Model• A fundamental part of the

information architecture

• Provides the essential business definitions for each attribute identified on the LDM

• Traceable back to the LDM

• Ideally in place before any related software project even starts

• Can be enhanced iteratively throughout requirements gathering

• In reality, often missing altogether

Page 21: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Conceptual Data Model• A fundamental part of the

requirements definition

Page 22: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Conceptual Data Model• A fundamental part of the

requirements definition

• For any process step, identifies the relevant attributes

Page 23: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Conceptual Data Model• A fundamental part of the

requirements definition

• For any process step, identifies the relevant attributes

• Traceable to/from the Data Dictionary

Page 24: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Conceptual Data Model• A fundamental part of the

requirements definition

• For any process step, identifies the relevant attributes

• Traceable to/from the Data Dictionary

• Its elaboration can feed details back into the Data Dictionary

Page 25: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Conceptual Data Model• A fundamental part of the

requirements definition

• For any process step, identifies the relevant attributes

• Traceable to/from the Data Dictionary

• Its elaboration can feed details back into the Data Dictionary

• In reality, often contains details that should reside in the Data Dictionary, leading to redundancy

Page 26: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model• A fundamental part of the

requirements definition

Page 27: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model• A fundamental part of the

requirements definition

• Documents the required behaviour of each screen and the relevant data to be displayed or captured (not to be confused with screen design/layout)

Page 28: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model• A fundamental part of the

requirements definition

• Documents the required behaviour of each screen and the relevant data to be displayed or captured (not to be confused with screen design/layout)

• Its elaboration can feed details back into the Data Dictionary

Page 29: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model• A fundamental part of the

requirements definition

• Documents the required behaviour of each screen and the relevant data to be displayed or captured (not to be confused with screen design/layout)

• Its elaboration can feed details back into the Data Dictionary

• Should be documented after the relevant logical process steps

Page 30: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model• A fundamental part of the

requirements definition

• Documents the required behaviour of each screen and the relevant data to be displayed or captured (not to be confused with screen design/layout)

• Its elaboration can feed details back into the Data Dictionary

• Should be documented after the relevant logical process steps

• In reality, often contains details that should reside in the Data Dictionary, leading to redundancy

Page 31: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data ModelRobust data analysis

provides the basis for good physical data modelling

Page 32: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data ModelRobust data analysis

provides the basis for good physical data modelling

Otherwise, the data architects might have to

reverse-engineer the data needs of the business

based on screen designs

Page 33: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model

Physical Data Model

Page 34: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Logical Data Model(normalised)

Data Dictionary

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model

Physical Data Model

This takes a little longer, but results in a robust, adaptable and durable

physical data model

Page 35: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Physical Data Model

Page 36: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Physical Data Model

This is sub-optimal and is likely to result in an

inefficient database that will under-perform as it

grows larger

Page 37: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Process Steps(data in/out at the functional level)

Screen Specification(fields, etc., required for screens)

Physical Data Model

This is sub-optimal and is likely to result in an

inefficient database that will under-perform as it

grows larger

Unfortunately, this approach is quite common

Page 38: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Screen Specification(fields, etc., required for screens)

Physical Data Model

Page 39: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Screen Specification(fields, etc., required for screens)

Physical Data Model

This worst-case-scenario will definitely lead to an

under-performing database within as little as six months

Page 40: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Screen Specification(fields, etc., required for screens)

Physical Data Model

This worst-case-scenario will definitely lead to an

under-performing database within as little as six months

Unfortunately, this approach is not uncommon

Page 41: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

Of course, physical data models often come “out of

the box” in the case of BPM or ERP systems

Page 42: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

However, “out-of-the-box” does not mean “magic“ and the PDM does not

automatically fit the data needs of the business

Page 43: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

“Out of the box”Physical Data Model

The PDM must be tailored to suit the

specific needs of the business

Page 44: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

“Out of the box”Physical Data Model

Logical Data Model(normalised)

Conceptual Data Model

Data Dictionary

Page 45: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

“Out of the box”Physical Data Model

Logical Data Model(normalised)

Conceptual Data Model

Data Dictionary

Otherwise, there will be a significant gap between

the PDM and the business needs it should support

Page 46: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

“Out of the box”Physical Data Model

Logical Data Model(normalised)

Conceptual Data Model

Data Dictionary

Otherwise, there will be a significant gap between

the PDM and the business needs it should support

Page 47: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

“Out of the box”Physical

Data Model

Logical Data Model(normalised)

Conceptual Data Model

Data Dictionary

Once in production, this gap eventually becomes a chasm

Page 48: The Importance of Data Analysis in Producing a Robust Physical Data Model

Hierarchy of Data Analysis

And, financially, that chasm can feel like a

bottomless pit

Page 49: The Importance of Data Analysis in Producing a Robust Physical Data Model

REMEMBER!

Screen Specification(fields, etc., required for screens)

Physical Data Model€

€$$ $

£ £

£

Page 50: The Importance of Data Analysis in Producing a Robust Physical Data Model

REMEMBER!

Functional Specification(data required for process steps)

Screen Specification(fields, etc., required for screens)

Physical Data Model€ €

$

£

Page 51: The Importance of Data Analysis in Producing a Robust Physical Data Model

REMEMBER!

Logical Data Model(normalised)

Data Dictionary

Functional Specification(data required for process steps)

Screen Specification(fields, etc., required for screens)

Conceptual Data Model

Physical Data Model

$

£

Page 52: The Importance of Data Analysis in Producing a Robust Physical Data Model

REMEMBER

“Out of the box”Physical

Data Model

Logical Data Model(normalised)

Conceptual Data Model

Data Dictionary€€

€$$ $

£ £

£

Page 53: The Importance of Data Analysis in Producing a Robust Physical Data Model

REMEMBER!

“Out of the box”Physical Data Model

Logical Data Model(normalised)

Conceptual Data Model

Data Dictionary

$

£

Page 54: The Importance of Data Analysis in Producing a Robust Physical Data Model

With thanks to the following for reviewing the draft slideshow:

• Simon Duckett, Enterprise Data Architect• Jonathan Hunsley, Senior Business Architect

Page 55: The Importance of Data Analysis in Producing a Robust Physical Data Model

www.chellar.com/blog