Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly...

28
Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing Teratec Forum 2017 François Andry PhD, Senior Director

Transcript of Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly...

Page 1: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing

Teratec Forum 2017

François Andry PhD, Senior Director

Page 2: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Touching lives around the globe

1,000,000 patients monitored in their

homes every day

in emerging markets around the world now have access to Philips diagnostic imaging

190 million patients tracked with our patient monitors last year

100+ years of listening deeply to customers

to understand what really matters

100,000+ professionals are supported

with education

+970 million people 10 petabytes of data managed for health care providers

Present in

100 countries with 450+ products and services

Page 3: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Solutions for the Health continuum

Healthy Living Prevention Diagnosis Treatment

Recovery

Wellness

Aging in place Personal health

Hospital to home

Sleep

Healthcare operations

Digital Health Platform – enabling solutions

Acute care Imaging diagnostics & intervention

Page 4: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Philips Propositions

Healthy Living Home Care Prevention Diagnosis Treatment Recovery

Customer Services

Health & Wellness

Patient Care & Monitoring

Imaging Systems

Personal Care

Domestic appliances

Home Healthcare

Genome Informatics

Page 5: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for
Page 6: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Genetic heterogeneity

One size fits all

Page 7: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Genetic heterogeneity

One size fits all

Genomically-enabled personalized medicine

Page 8: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Satellite View Airplane View Street View

Page 9: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Satellite View Airplane View Street View

Imaging Systems Digital Pathology Genome Informatics

Page 10: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Satellite View Airplane View Street View

Imaging Systems Digital Pathology Genome Informatics

Intergalactic View

House View

Single Cell Population Health

0 500 1000 1500 2000 2500

0.0

0.2

0.4

0.6

0.8

1.0

Days

Su

rviv

al P

rob

ab

ilit

y

Page 11: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Bacteria / Tumor prepared samples

DNA Sequencing Technology

Molecular Epidemiology identifying infection

spread

Phylogenetic tree of an outbreak

Transmission route

Personalized Therapy selection for cancer

patients

Genomic fingerprint: mutations, fusions

Actionable information

Genomics for Infectious Disease

Genomics for Oncology HSDP Core Genomics Platform

Page 12: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for
Page 13: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Alignment Index Sort Merge

Index Remove duplicates Variants Annotate

EXOME pipeline

Pair

@SRR001666.1 071112_SLXA-EAS1_s_7:5:1:817:345 length=36 GGGTGATGGCCGCTGCCGATGGCGTCAAATCCCACC +SRR001666.1 071112_SLXA-EAS1_s_7:5:1:817:345 length=36 IIIIIIIIIIIIIIIIIIIIIIIIIIIIII9IG9IC

~10GB

~25,000SNPs & indels

Exon1 Exon1 Exon3

Page 14: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Unaligned file

• Protocol

• Sequencing performed

Aligned file

• Tools:

Variant file

• Tools:

MS6_S13_L001_R1_001_fp.fastq

truseq_amplicon_cancer_panel_afp1

2015-11-11,11:22:00

trimmomatic (0.33); fastQC (0.11.3); bwa (0.7.12-r1039); Samtools (1.2); GenomeAnalysisTK

MS6_S13_L001.nb.refined.bam

MS6_S13_L001_filtered.vcf

GenomeAnalysisTK.jar -T UnifiedGenotyper-L truseq_amplicon_cancer_panel_afp1.bed

Page 15: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Asynchronous processing API

Page 16: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Gateway API

Page 17: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Pipeline Definition

Page 18: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Mission Execution

Page 19: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for
Page 20: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for
Page 21: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for
Page 22: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Ultrasound & Modelization

Page 23: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Ultrasound & Modelization

nuchal translucency

Page 24: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Guided Therapy

Cardiac Roadmapping

Emboguide

Instrument tracking

Page 25: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Anatomical Awareness

Page 26: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Anatomical Awareness

Head

Chest

Abdomen

Pelvis

L.kidney

Liver Spleen

R.kidney

Spine

Page 27: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Organ Segmentation and Modelization

Vessel segmentation

Liver segmentation

PV loop and flow modelling

Page 28: Data Science Platform and Highly Scalable Cloud-based ... · Data Science Platform and Highly Scalable Cloud-based Framework for HealthTech Data Processing ... of data managed for

Thank you for listening!