Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

26
Secure the data – share the knowledge Open Research Cambridge Fiona Nielsen, December 16, 2013 DNAdigest.org

description

On December 16th 2013, Fiona Nielsen from DNAdigest gave a presentation at the Open Research meetup in Cambridge. The meetup organiser had invited DNAdigest to participate in a discussion on genomics and data sharing. Keren introduced the evening with a video explaining what a genome is and what it means to have your genome sequenced. Fiona gave a general presentation on the field of genetics data sharing including topics such as data sharing for research, patient consent and direct-to-consumer genetic testing. The audience was an approximately 50/50 division of researchers vs other professions which gave an insightful discussion on the advantages of genetic research and the potential risks of data sharing and the high hopes of the impact of genetics on the future of medicine. The Open Research meetup is a meetup group initiated by members of the Open Knowledge Foundation. You can read about their upcoming events on their OKFN web page: http://science.okfn.org/community/local-groups/cambridge-open-research/ Read more about DNAdigest online: http://DNAdigest.org and follow the Twitter feed @DNAdigest

Transcript of Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Page 1: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Secure the data – share the knowledge

Open Research Cambridge

Fiona Nielsen, December 16, 2013 DNAdigest.org

Page 2: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Take home messages

• DNA sequencing = exciting new opportunities + new challenges

• Your genome is your data (like your EHR)

• Options for sharing today are limited mostly to all-or-nothing

• Take the opportunity to voice your opinion

My aim with this talk: to give you a little view of how you can share genetic data today, and give

you an idea of the challenges involved.

I will end the presentation with a brief introduction to the DNAdigest project, and then open the

floor for questions and discussion.

Page 3: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Data is donated to research

Individuals are offered to opt-in their consent for their data to be used for research to aid development of

diagnostics and treatments for genetic diseases

Genetic data is needed for research into inborn illnesses, heriditary diseases, rare diseases and cancer.

Page 4: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Genome research today

the patient

the researcher

the sample

the data

DNAdigest.org

Page 5: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Direct-to-consumer genetic testing

You can order your own personal genotyping kit online for only $99 from http://23andme.com

You can assess your own carrier status for known disease genes before you get pregnant, for example https://www.counsyl.com/

You can obtain non-invasive pre-natal testing by detecting foetus DNA in the mothers blood, example http://www.verinata.com/

You can have your whole genome sequenced for about $7,000 http://www.dnadtc.com/products.aspx

Page 6: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Example: 23andme

Manuel Corpas used direct-to-consumer testing for himself and his family

Page 7: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

The “Corpasome”

Family genome and analysis published open access online

• Deceased

• 1M 23andMe

v3 SNP chip

• Age: 75

• 1M 23andMe v3 SNP chip

• 15,823,554 HiSeq Exome

PE Reads

• Age: 79

• 1M 23andMe v3 SNP chip

• 15,190,489 HiSeq Exome

PE Reads

• Age: 51

• 1M 23andMe v3 SNP chip

• 14,123,580 HiSeq Exome PE

Reads

• Age: 36

• 0.5M 23andMe v2 SNP chip

• 32,116,828 HiSeq Exome SE Reads

• Metagenomics

Page 8: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

The “Corpasome”

Family genome and analysis published open access online

open access online as a free resource for research

Page 9: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

But what about privacy? There is large variation between individuals,

and we are all unique

This means that your genome sequence can identify you - or your heritage

Similarly, your medical record may contain information that is unique to you

DNAdigest.org

3,000,000,000bp ~ 3billion basepairs

in the human genome

Page 10: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Consent for research

The head of the research project will create a custom consent form:

- Purpose

- How, when and who

DNAdigest.org

Page 13: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Consent is obtained in the interest of the patient

However,

Contact with patient may be lost after data collection

The institution acts as the custodian of the collected data,

Page 14: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

and data is locked up

Institutions do not freely share data because revealing entire datasets breaches confidentiality

Data access is restricted if available at all If available, access requires specific application per dataset per project

Page 15: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Results are published

When sufficient data is collected for a project, an analysis is made and a paper may be published to report the results.

Results from confidential data are reported in anonymized form

Anonymization = removal of identifiers

• Name, birthdate, NHS number, town of birth

Page 16: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Published information

Level of data detail varies per project and per publication.

Rare disease research usually includes family pedigrees and detailed description of the disease symptoms per individual

GWAS studies usually include only aggregated statistics

Page 17: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Problem for data sharing

Trade-off: details are necessary for data re-use!

Restricted access

repositories

• Advantage: access to

complete datasets of

genetics and medical

data

• Disadvantage:

cumbersome,

timeconsuming and

slow processing of

application for access

• Disadvantage:

difficult to discover the

data you need

Completely public data

• Advantage: Easy

access

• Disadvantage: if

medical data is

removed = no value for

research

• Disadvantage: no

guarantee of privacy

• Example: The Personal

Genomes Project

Page 18: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Limitations of current mechanisms

• Not easy to discover data

• Not easy to apply for access to data

• Not easy to deal with bulk datasets

As a consequence: • Researchers do not cross-check their results

• Data is not re-used for analysis

• Researchers duplicate existing work

• Results are published based on small sample sizes

DNAdigest.org

Page 19: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

What if?

• Every individuals would be custodian of their own data?

• What if there would be different ways of sharing data?

• What if you could share just part of your data?

• What if the consent form included options for the level of sharing of the data?

• What if you gave patients the option to share their data with no restrictions?

• What if you could share data in aggregate statistics?

• What if you could share your data today and change your mind tomorrow?

DNAdigest.org

Page 20: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

New approaches

• Crowdsourcing of genetic testing results: #freethedata for breast cancer genes BRCA1 and BRCA2

• Share your 23andme data with OpenSNP

• Control your EHR with PatientsKnowBest

• DNAdigest: Allow sharing of aggregated data to enable discovery and faster access for research

DNAdigest.org

Page 21: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Our mission

To create a self sustainable platform that supports the widest possible sharing and access of genomic data in

accordance with patient consent.

Page 22: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

DNAdigest is designing an ethical data sharing platform

Allowing hypothesis centered queries, returning anonymised aggregated data by patented mechanism

Page 23: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

DEMO

Results are delivered as

anonymised aggregated statistics

Page 24: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Further reading

• What to consider before undergoing a DNA test article in the Wall Street Journal

• Manuel Corpas blog: http://manuelcorpas.com

• Interview about the DNAdigest project on Open-Steps.org

• Genetic Privacy Network (launched Dec 2013) resources about the risks and legal issues for US residents

• Anonymization and re-identification: Routes for breaching and protecting genetic privacy by Erlich and Narayanan

DNAdigest.org

Page 25: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Take home messages

• DNA sequencing = exciting new opportunities + new challenges

• Your genome is your data (like your EHR)

• Options for sharing today are limited mostly to all-or-nothing

• Take the opportunity to voice your opinion

• It is question time!

Page 26: Open research cambridge December 16 2013 - presentation by Fiona Nielsen DNAdigest

Thanks for listening

And thanks to OpenResCam and Panton Arms for hosting!

DNAdigest is a not-for-profit organisation, founded for the purpose of enabling faster and easier access and sharing of genomic data for research.

Please visit us at DNAdigest.org

and on twitter @dnadigest

Thank you! DNAdigest.org