DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 ·...
Transcript of DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 ·...
![Page 1: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/1.jpg)
Data Warehousing in the Real World
Kent Graziano, Snowflake Computing(Virtual) Keith Hoyle, McKesson Specialty Health
![Page 2: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/2.jpg)
Agenda
l Biosl Back storyl Standard DV Architecturel Evolution to Gepettol How we use MD5 Hashesl Planned Schema Architecturel Final Schama Architecturel Advantages & Challenges
![Page 3: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/3.jpg)
My Bio
› Senior Technical Evangelist, Snowflake Computing› Oracle ACE Director (BI/DW)› Certified Data Vault Master and DV 2.0 Practitioner (CDVP2)› Data Modeling, Data Architecture and Data Warehouse Specialist› 30+ years in IT› 25+ years of Oracle-related work› 20+ years of data warehousing experience
› Former-Member: Boulder BI Brain Trust (http://www.boulderbibraintrust.org/)
› Author & Co-Author of a bunch of books› Blogger: The Data Warrior› Past-President of Oracle Development Tools User Group and Rocky Mountain Oracle User Group
![Page 4: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/4.jpg)
Snowflake Computing is…● …a Silicon Valley innovator
● …built a new SQL data warehouse in the cloud
● …with broad customer adoption
The Snowflake Elastic Data Warehouse is …● …All-new, SQL compliant
● No legacy code● …Designed for the elastic cloud
● …Delivered as a service● Nothing to manage
![Page 5: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/5.jpg)
Bio (Keith)
l Sr. Manager, Enterprise Data Architecture (McKesson Specialty Health)
l 25+ years in ITl 8+ years in Genetic Engineering / Biochemistry in Pharmaceutical industry
l Completed multiple successful EDW efforts with large companies (Dell, HP, AMD, Aflac, Amgen, Glaxo-SmithKline, etc.)
l Consulted through large firms catering to big pharma / biotech / medical industry
![Page 6: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/6.jpg)
Back story
l Client: McKesson Specialty Health (formerly US Oncolology)● Division of McKesson (Fortune 500 - #5)
l Building a new Electronic Health Records (EHR) system● IKnowMed Generation 2 (G2)
l Existing DW on G1 – not good, not flexible● Pure Kimball – transient stage area with quasi-star schema model
● Can’t handle multiple sources● Already issues loading and meeting SLA
![Page 7: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/7.jpg)
Back story
l Want to build new DW● Flexible, scalable, etc.
l And want to use agile approachl Sounds like Data Vault?● Contracted Kent to help● Hired Keith to be the internal lead
![Page 8: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/8.jpg)
Standard Data Vault Model
l Hub: List of UNIQUE business keys.l Link: List of UNIQUEl Satellite: Historical descriptive data.
Email ID
Sat
Sat
Sat
Link Bank ID
Sat
Sat
Sat
PassengerID
Sat
Sat
Sat
F(x)
Email Information Bank Transactions
Airline Reservations
Sat
Link
Records a history of the interaction
** Dashed Line is a possible New Relationship
Hub
Satellite
![Page 9: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/9.jpg)
Back story
l Management convinced that DV was too hard, too many layers, would take too long● Politics!
l So starting point – Type 2 style persistent stage area● Start loading ASAP● Never lose any changes● Good!
![Page 10: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/10.jpg)
Type 2 Stage Table
![Page 11: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/11.jpg)
Evolution of Gepetto
l Initial “marts” were just views off the stage tables● Joins in Business Objects● Worked fine for 1 source (G2)
l But what happens when you add another source?● Explosion of mappings from stage to presentation● Mapping logic in ETL or complex views
l Need: a persistent integration layer● Based on natural business keys!● But don’t say “data vault”
![Page 12: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/12.jpg)
Persistent Integration Layer AdvantageNo PIL (M x N)
With a PIL (M + N)
Stage 1
PersistentIntegrationLayer
Stage 2
Stage 3
Stage 4
Stage 5
Stage 6
Dim 1
Dim 2
Dim 3
Fact 1
Fact 2
Fact 3
Stage 1
Stage 2
Stage 3
Stage 4
Stage 5
Stage 6
Dim 1
Dim 2
Dim 3
Fact 1
Fact 2
Fact 3
![Page 13: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/13.jpg)
INTRODUCING GEPETTO!
![Page 14: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/14.jpg)
Development Approach
l While source data was being staged:● Ran JAD sessions determining business information model for integration
● Standardization routines developed● Full featured, configurabe Calendar Dimension● Standardized plumbing columns and CDC logic● Consistent means of MD5 hashing
● Persistent integration layer developed● Prototyped merging data from multiple sources into comformed hybrid SCD-1 / 2 dimensions
● Devised highly-normalized ‘Gepetto’ variant of DV 2.0 / Anchor methods
![Page 15: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/15.jpg)
Gepetto Architecture
l Very “vault-like”l Keys are all MD5 based hash typesl 3 layers● Stage● Integration● Presentation
l Integration● Domains (business key driven like Hubs)● Relaters (basically Links)● Key Map table – joins D & R to stage tables
● Stage tables act like Satellites
![Page 16: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/16.jpg)
Domain and Key Map Tables
![Page 17: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/17.jpg)
Relater and Key Map Table
![Page 18: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/18.jpg)
MD5 Keys
l Concatenate source data fields and hash to create MD5 keys
l Concatenation Rules● Joins are performed against these keys so standards and consistency are vital
● Use a delimiter when concatenating● Convert numbers and dates / times to string● Consider trimming / upper casing values in BUS_KEYS
l MD5 Key Types● PRIM_KEY (STG):
● All source fields (in table order) + LOAD_DTS● Uniquely ID’s all records with DW● Can serve as an SCD-2 key in virtual Dim’s/ Facts
![Page 19: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/19.jpg)
MD5 Keys
● CDC_KEY (STG / INTG):● Source field(s) (in table order) used by SOR to ID data rows uniquely for change data capture purposes
● Same as MD5Key in DV 2.0● CDC_ATTR (STG):
● All non-CDC_KEY source field (in table order) to track changed for change data capture purposes
● Same as MD5DIFF in DV 2.0● NAT_KEY (STG):
● Source field(s) (in table order) from a single SOR table used to logically ID data rows uniquely
● Table “natural” key is not always a true business key
![Page 20: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/20.jpg)
MD5 Keys
● [D_XXX_KEY / R_XXX_KEY] (INTG):● Hash of real business key columns● Source field(s) (in table order) used to logically ID data rows uniquely ● Joins may be required because of the nature of the stage tables
● Same as HUB and LINK keys in DV 2.0● Can serve as an Type 1 SCD key in virtual Dim’s/ Facts● That is another talk!
![Page 21: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/21.jpg)
What does it look like?
l Encode using standard MD5 hash function● rawtohex(sys.utl_raw.cast_to_raw(dbms_obfuscation_toolkit.md5 (input_string => ...)
l Need to minimize chance of duplicates● 12||3||45 and 1||2||345 hash to same value● Need a separator between each● Also handles case of null values● Example: Col1||’^’||Col2||’^’||Col3
© Data Warrior LLC
![Page 22: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/22.jpg)
Other considerations
l To generate most consistent string: standardize!l Convert data typesl If 'NUMBER', 'NVARCHAR2', 'NVARCHAR', 'NCHAR‘● THEN 'TO_CHAR(' || column_name || ')‘
l If 'RAW‘● THEN 'ENC_BASE64(' || column_name || ')‘
l If 'DATE‘● THEN 'TO_CHAR(' || column_name || ', ''YYYY-MM-DD'')‘
l If LIKE 'TIME%‘● THEN 'TO_CHAR(' || column_name || ', ''YYYY-MM-DD HH24:MI:SS'')'
© Data Warrior LLC
![Page 23: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/23.jpg)
Final Input String
(UPPER(TRIM(T1.GENERICNAME))||'^'||UPPER(TRIM(
TO_CHAR(T1.MED_STRNG_AMT)))||'^'||UPPER(TRIM(T1.UOM_CD))||'^'||UPPER(TRIM(T1.MED_FORM_NM))||'^')
© Data Warrior LLC
![Page 24: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/24.jpg)
So what?
l MD5 hash is consistent cross-platform l Changes multi-column compares to a single column
l All compares take the same time during load process
l Can use with any DW architecture that requires change detections
l Virtually no limit● Think Big Data/Hadoop/NoSQL
l Can generate the input string automatically● But that is another talk!
© Data Warrior LLC
![Page 25: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/25.jpg)
ARCHITECTURE OVERVIEW
![Page 26: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/26.jpg)
Persistent Integration LayerStage
Integrate
PresentationKDW_ORG
…PRIM_KEY CDC_KEYG2_PRACTICE
…PRIM_KEY CDC_KEY
DATA_XFRM<SRC System, Table, Field, Value fields>,<TGT: System, Table, Field, Value fields>
CDC_KEY field inSTG also go into theCDC_KEY in INTG.Joins to other STGtable(s) to completeR_x_KEY andD_x_KEY fields inINTG.
R_VSTR_VST_KEY
D_PAT_REC_KEYD_PRVDR_KEYD_LOC_GRP_KEYD_LOC_KEYD_CLNDR_KEY
KDW_PAT_VISIT<Patient Record ID
fields><Provider ID fields><Practice ID fields><Location ID fields><Visit Date fields>
…PRIM_KEY CDC_KEY
DIM_PAT_RECSCD2_PAT_REC_KEYSCD1_PAT_REC_KEY
D_PRSN_KEY…
DIM_PRVDRSCD2_PRVDR_KEYSCD1_PRVDR_KEY
…
DIM_PRCTC_HIERSCD2_PRCTC_HIER_KEYSCD1_PRCTC_HIER_KEY
D_LOC_KEY…
D_PAT_RECD_PAT_REC_K
EY…D_LOC
D_LOC_KEY…D_PRVDR
D_PRVDR_KEY…
KM_LOC_GRPD_LOC_GRP_KEY
CDC_KEY
LYNX_PRCTCPM_PRCTC_KEY
…PRIM_KEY CDC_KEY
1) Logical views can be used to initially vettreports, aggregations, etc. where possible(i.e. most dimensions, primitive facts, someaggregate facts, etc.)2) Materialized views can be used to vettthe scaling of the solution3) ETL processes will be used toproductional-ize the vetted solution4) STG data is transformed using joins tothe DATA_XFRM table in INTG5) Data is scrubbed with standard SQLfunctionalities. (i.e. initcap, trim, removespecial characters, etc.)
D_LOC_GRPD_LOC_GRP_K
EY…
KM_VSTR_VST_KEYCDC_KEY
FACT_VSTSCD2_VST_KE
YSCD1_VST_KE
YD_PAT_REC_K
EYD_PRVDR_KEYD_PRCTC_KEYD_LOC_KEYD_CLNDR_KEY
![Page 27: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/27.jpg)
COMNStage
<Full copies of source data structures with additional plumbing fields to facilitate capturing subsequent data changes over time>
COMNPresentatio
n
Original Schema Architecture
Source(s)of Record
ReportingMSH EDW
COMN Integration
<Enterprise business key model with key mapping pointers to COMN_STG data >
JIT Transformation<Virtual v. Physical>
G2
MU
HI
KDW
CI SAS Routines
EDW V1
FDW / PMS
KDW Lite
Lynx
SFDC BOBJ
Δ CDC
Insert1Xonly
ΣΣ
ΣΣ
ΣΣ
ΣΣ
ΣΣ
StarSchema(s)
DataMarts
Web
TBLU
![Page 28: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/28.jpg)
HI Stage
COMNStage
FIN Stage FINPresentation
HI Presentation
COMNPresentation
Hoped for Schema Architecture (Parallel)Source(s)of Record
BOBJ / BI / ReportingMSH EDW
COMN Validation
COMN Integration
FIN
HI
CLIN
G2
MU
HI
KDW
CI SAS Routines
EDW V1
FDW / PMS
KDW Lite
Lynx
SFDC
MKTG
![Page 29: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/29.jpg)
HI Stage
COMNStage
FIN Stage FINPresentation
HI Presentation
COMNPresentation
Actual Schema Architecture
Source(s)of Record
BOBJ / BI / ReportingMSH EDW
COMN Validation (DQ)
COMN Integration
FIN
HI
CLIN
G2
MU
HI
KDW
CI SAS Routines
EDW V1
FDW / PMS
KDW Lite
Lynx
SFDC
MKTG
![Page 30: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/30.jpg)
Domain with Associated Stage Table
![Page 31: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/31.jpg)
Challenges
l Must have a solid enterprise logical model● With real business keys!
l Mapping disparate sources to the Integration layer is hard!● Must understand the semantic meaning of the source columns
● Must know the enterprise model to see where it fits● Must know how to handle bad and missing business key data● Means you must have good business rules too!
l Dimensional modelers have a hard time with doing these mappings.● Using views in Presentation layer mitigates this by displaying in star manner to BI layer
![Page 32: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/32.jpg)
Advantages
l Can start staging immediately● With history!
l Clear line of sight to source● Unambiguous audit trail
l Can adapt, recovering from incorrect business rules● Stage data is in original source format, with history
![Page 33: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/33.jpg)
Cowpath Highway
Old Way vs New Way
Which way will you follow?
![Page 34: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/34.jpg)
Available onAmazon.com
http://www.amazon.com/Better-Data-Modeling-
Introduction-Engineering-
ebook/dp/B018BREV1C/
SHAMELESS PLUG:
![Page 35: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/35.jpg)
Super Charge Your Data Warehouse
Available on Amazon.comSoft Cover or Kindle Format
Now also available in PDF at LearnDataVault.com
![Page 36: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/36.jpg)
New DV 2.0 Book (includes more details on MD5)
Available on Amazon:http://www.amazon.com/Building-Scalable-Data-Warehouse-Vault/dp/0128025107/
![Page 37: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/37.jpg)
![Page 39: DataWarehousingintheRealWorld - Cloud Object Storage | Store & Retrieve Data … · 2016-10-26 · DataWarehousingintheRealWorld Kent$Graziano,$Snowflake$Computing (Virtual)Keith$Hoyle,$McKesson$Specialty$Health](https://reader030.fdocuments.in/reader030/viewer/2022041017/5ec99249db40ba3c1866603a/html5/thumbnails/39.jpg)
Contact Information
Keith HoyleSr. Mgr., Enterprise Data Architecture
McKesson Specialty [email protected] my blog at
http://khoyle001.wordpress.com