Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS...

17
17 June 2019 © MARKLOGIC CORPORATION Establishing Real-World Data Integration Best Practices CHRIS ATKINSON Solutions Architect MarkLogic T.J. TANG Software Architect AbbVie

Transcript of Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS...

Page 1: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

17 June 2019© MARKLOGIC CORPORATION

Establishing Real-World Data Integration Best Practices

CHRIS ATKINSONSolutions Architect

MarkLogic

T.J. TANGSoftware Architect

AbbVie

Page 2: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

01 IntroductionAbbVie and T.J. Tang ─ Who are they?

02 First StepsTaking those first big decisions

03 EvolutionHow and why the platform changed

04 Establishing a COELearning from their own COE development

The AbbVie Journey to a COE

05 Help and Materials Useful resources when establishing your own COE

Page 3: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Key FactsAbbVie is an American publicly traded biopharmaceutical company founded in 2013. It originated as a spin-off of Abbott Laboratories.

8,800+ R&D Workers

175+ Countries

26M+ Patents

#1 Selling Drug World

Wide

Software Innovators

$4.8B R&D 2017

Page 4: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

T.J. Tang – AbbVie MarkLogic COE

T.J. TangSoftware Engineer20 years experience

ArchitectEvaluates solutions and opportunities

ManagerHeads up the COE effort

The COETeam7 People

PurposeData integration best practices

Business DemandReduce costsAccelerate R&D

Page 5: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

First Steps

Initial ImplementationModelled data to several smaller documents

CPF for processing

External content enrichment Raw Text Document

Source Metadata

Metadata from Named

Entity Recognition

Related Triples

Instance Data

Page 6: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Early Architecture Changes

Early Changes

Envelope pattern

Modelled data into larger documents

COE Takeaways

MarkLogic Experts often made available

Meetups for 1:1 discussions

Use SEs, SAs

Source Metadata

Metadata from Named Entity Recognition

Related Triples

Instance Data

Enveloped

Page 7: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Data Hub Implementation

COE Standards

V2 and later standards

Custom changes need migrating

From & to state needs mapping

COE Takeaways

Talk directly to PMs

SAs for roadmap

Request enhancements to MarkLogic

Page 8: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Evolution of the COE

The COE is Born

A natural evolution

Advice and expertise evolved

Servicing other LOBs

Share & Shepherd

Project Growth

Onboarded Consultant Expert

Advice from MarkLogic

Initial Solution for R&D Business

COE Takeaways

Experts are needed

Consider their role

Page 9: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

COE Day-to-Day Activities

Educate

Qualify

Validate

COE Takeaways

Create standardized materials

Customize to need

Needs to be on-demand

Page 10: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Recent Innovations

NiFi Adoption

Less developer centric

Good performance

Some early challenges

COE Takeaway

One of a few options (e.g. MuleSoft)

Data Hub is innovating fast

MarkLogic relationship important

Request help/advice

Page 11: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Next Steps

Machine Learning

Avoids offloading data

Processing close to data

Page 12: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Establishing Your Own COE / Standards

Technical PatternsCode Examples

Data Integration GuidesArchitecture

Value

Avoiding Pitfalls

Internal ExpertsAn essential component

Expert HelpMarkLogic Experts

Solution Architects

Solution Engineers

Page 13: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Data Integration Guides

3.1 Modelling Entities in the Data Hub

3.3 Tracking Provenance & Lineage

2.2 Ingesting Different Data Types

2.1 Ingesting Multi-Model Data

4.5 Designing for Compliance Queries

4.4 Data Lifecycle Management

3.2 Orchestrating Data Integration

1. Agile Data Management

2. Ingestion

3. Curation

4. Delivery

1.0 Agile Data Management with

MarkLogic

4.3 Optimizing Delivery with Entities

Document

Topic

4.1 Designing for SQL Analytics

4.2 Designing for Operational Workloads

Page 14: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Integration Guide Example

Page 15: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Technical Patterns

Page 16: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Questions?

Page 17: Establishing Real-World Data Integration Best Practices · Data Integration Best Practices CHRIS ATKINSON Solutions Architect. MarkLogic. T.J. TANG ... those first big decisions 03

Thank you