The experience of Andalusia in eHealth and big data · • Genomic big data for personalized...

19
The experience of Andalusia in eHealth and big data --- Big data in health - IMI’s HARMONY project, European Parliament, Brussels, 19 June 2018 Joaquín Dopazo Área de Bioinformática, Fundación Progreso y Salud, Nodo de Genómica Funcional, (INB-ELIXIR-es), Bioinformática de ER (BiER-CIBERER), CDCA, Hospital Virgen del Rocío, Sevilla, Spain http://www.clinbioinfosspa.es http://www. babelomics.org @xdopazo, @ClinicalBioinfo

Transcript of The experience of Andalusia in eHealth and big data · • Genomic big data for personalized...

Page 1: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

The experience of Andalusia in eHealth

and big data---

Big data in health - IMI’s HARMONY project,

European Parliament, Brussels, 19 June 2018

Joaquín Dopazo Área de Bioinformática,

Fundación Progreso y Salud,

Nodo de Genómica Funcional, (INB-ELIXIR-es),

Bioinformática de ER (BiER-CIBERER),

CDCA, Hospital Virgen del Rocío, Sevilla, Spain

http://www.clinbioinfosspa.es

http://www. babelomics.org

@xdopazo, @ClinicalBioinfo

Page 2: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Summary

• Genomic big data for personalized medicine

• Bioinformatics

• Sustainability

• Scalability

• Generation of knowledge: prospective healthcare

• Data integration

• GDPR compliance

Page 3: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

The clinical bioinformatics area

Bioinformatics

Rare diseases

Cancer

Common diseases

Infectious diseases

Microbiome

Pharmacogenomics

The Bioinformatics Area, created in June 2016 in the Fundación Progreso y Salud, has as main goal supporting the Program of Personalized Medicine of the Andalusian Community by facilitating the use of genomic data for precision diagnostic and treatment recommendation, implementing a prospective health care functionality in the public health system.

http://www.clinbioinfosspa.es/

Page 4: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Summary

• Genomic big data for personalized medicine

• Bioinformatics

• Sustainability

• Scalability

• Generation of knowledge: prospective healthcare

• Data integration

• GDPR compliance

Page 5: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Data analysis and the sustainability of

the cycle of knowledge generation

http://www.gbpa.es/

GCGTATAG

CACGGGTA

TCTGTATTA

TGGTGGAT

ATCAGCGG

ATTGCGATT

GGCAGAGC

GGCAAAGT

GCGTATAG

CACGGGTA

TCTGTATTA

TGGTGGAT

ATCAGCGG

ATTGCGATT

GGCAGAGC

GGCAAAGT

GCGTATAG

CACGGGTA

TCTGTATTA

TGGTGGAT

ATCAGCGG

ATTGCGATT

GGCAGAGC

GGCAAAGT

GCGTATAG

CACGGGTA

TCTGTATTA

TGGTGGAT

ATCAGCGG

ATTGCGATT

GGCAGAGC

GGCAAAGT

Raw files

(FastQ)

DB

Analysis

Pipeline

Storage

Knowledge DB

Gene 1 ksdhkahcka

Gene 2 jckacsksda

Gene 3 lkkxkccj<jdc

Gene 4 ksfdjvjvlsdkvjd

Gene 5 kckcksñdksd

Gene 6 ldkdkcksdcldl

Gene x kcdlkclkldsklk

Gene Y jcdksdkcdks

Prioritization

reportDialog with experts in the

disease + validations

Samples

GCGTATAG

CACGGGTA

TCTGTATTA

TGGTGGAT

ATCAGCGG

GCGTATAG

CACGGGTA

TCTGTATTA

TGGTGGAT

ATCAGCGG

VCF BAMProcessed files

Bottleneck

Page 6: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Sustainability requires tools for end users, which involves hiding the complexity of the analysis

?

eHRD

Decision support

G

Laboratory

Corporative analysis request

system

1

2

3

45

68

7

• A solution for the management of genomic data must be integrated the same way that other analyses of the health system.

• Genomic data are stored in the system, linked to clinical data the same way that other data are (medical image, digital pathology –under implementation-, etc.) for further potential clinical studies

Medical image

Digital pathology

Page 7: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Our approach: hiding the complexity

?

eHR

K

NO

YES

D

I: patient`s Información

C: Informed Consent

G: patient`s Genome

D: high precision Diagnosis

Knowledge

K

Clinical research

D

Knowledge

Diagnosis / therapy

G

I

Sequencing Unit

Bioinformatics Area

1

2

3

45

67

8

Corporative analysis request

system

C

Page 8: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Personalized Medicine in cancer

Biomarker 1 Therapy 1

Current use of biomarkers

Therapy 1

Therapy 2

Therapy 3

Enhanced use of biomarkers

Patient genomic data analysis allows one-step association of biomarkers with therapies and

enables the detection of new actionable biomarkers, or clinical trials compatible with patients saving time and cost and increasing

treatment successProspective healthcare

Therapy 2

Genomicbiomarkers

Other therapiesNew therapies

Clinical trial

Result

+

Biomarker 2

Therapy 3Biomarker 3

1st line 2nd line 3rd line …..

Page 9: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Front end: Personalized Medicine Module (MMP)

Sample selectionVariant prioritization

Selection of

variants for

the report

Report generation

(sent to the eHR)

Page 10: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Summary

• Genomic big data for personalized medicine

• Bioinformatics

• Sustainability

• Scalability

• Generation of knowledge: prospective healthcare

• Data integration

• GDPR compliance

Page 11: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Currently, the

fastest and more

powerful genomic

database engine in

the world.

Used in the GEL

for genomic data

management

Backend: OpenCGA, a scalable storage and genomic data management platform

Extensive capabilities to query across genotype and phenotype relationships

https://github.com/opencb/opencga

In collaboration with

Genomics England

(GEL)

Page 12: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Summary

• Genomic big data for personalized medicine

• Bioinformatics

• Sustainability

• Scalability

• Generation of knowledge: prospective healthcare

• Data integration

• GDPR compliance

Page 13: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Prospective healthcare is facilitated by a

model that integrates genomic data and

universal EHR

Genome Clinic

….

Study1 ….. Studyn

MMP

• Revolutionary concept: the whole health system becomes an enormous potential prospective clinical study

• Clinical data dynamically associated to genomic data

• Possibility of many clinical studies by reanalyzing genomic data under diverse perspectives (with no extra investment)

• Growing genomic DB with increasing study possibilities

Page 14: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Summary

• Genomic big data for personalized medicine

• Bioinformatics

• Sustainability

• Scalability

• Generation of knowledge: prospective healthcare

• Data integration

• GDPR compliance

Page 15: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Future vision involves big data integration:Genomic data are especially relevant but not the only useful big

data

Genome Clinic

….

Study1 ….. Studyn

• Other big data are being collected (medical image, digital pathology, wearable devices, etc.)

• Clinical data dynamicallyassociated to different big data

• The whole health system becomes a enormous potential prospective clinical study

• Immense possibility for data reusability

• Growing genomic DB with increasing study possibilities

Digital pathology Medical image ….

MMP

Page 16: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Summary

• Genomic big data for personalized medicine

• Bioinformatics

• Sustainability

• Scalability

• Generation of knowledge

• Data integration

• GDPR compliance

Page 17: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

GDPR complianceThe system has been designed in a way that is compliant with EU and Spanish General Data Protection Regulation

• Clinicians requesting for a genomic diagnostic have access to eHR and get the result of the test.

• Geneticists have access to eHR and can query the genomic data (but never extract them)

• IT have access to de-identified genomic data and no to eHR.

Page 18: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Genomic and clinical data within the health system enable Personalized Medicine

• Database of patients with prospective clinical information. Patients sequenced:• Will have different responses to treatments in the future• Can have other diseases in the future• Dynamic diagnostic of undiagnosed patients as knowledge databases update• Dynamic assignment of treatments for patients without therapeutic options

as knowledge databases update• Preventive medicine:

• Dynamic discovery of pharmacogenomics relevant variants in sequenced individuals

• Dynamic discovery of new risk variants in sequenced individuals• Dynamic discovery of reproductive risk variants

• Health system as a prospective genomic study for clinical knowledge generation:• Prospective discovery of new biomarkers of response to drugs, therapies,

prognostic, etc. • The pool of disease or risk variants is limited and could be surveyed soon

Page 19: The experience of Andalusia in eHealth and big data · • Genomic big data for personalized medicine • Bioinformatics • Sustainability • Scalability • Generation of knowledge:

Clinical Bioinformatics AreaFundación Progreso y Salud, Sevilla, Spain, and…

...the INB-ELIXIR-ES, National Institute of Bioinformaticsand the BiER (CIBERER Network of Centers for Research in Rare Diseases)

@xdopazo

@ClinicalBioinfo

Follow us on

twitter

htt

ps:

//w

ww

.slid

esh

are

.net

/xd

op

azo

/