Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced...

32
Final Review. Final Review. Translational Translational bioinformatics and bioinformatics and medical informatics. medical informatics. Unit 29 Unit 29 BIOL221T BIOL221T : Advanced : Advanced Bioinformatics for Bioinformatics for Biotechnology Biotechnology Irene Gabashvili, PhD
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    222
  • download

    0

Transcript of Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced...

Page 1: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Final Review. Final Review. Translational Translational bioinformatics and medical bioinformatics and medical

informatics.informatics.Unit 29Unit 29

BIOL221TBIOL221T: Advanced : Advanced Bioinformatics for Bioinformatics for

BiotechnologyBiotechnologyIrene Gabashvili, PhD

Page 2: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Projects: 20 points maxProjects: 20 points max

Originality - 7 Originality - 7 Structure - 6Structure - 6 Scope - 7Scope - 7

Penalty points: paper not Penalty points: paper not submitted on time – 1 point off submitted on time – 1 point off for each day starting May 4 for each day starting May 4 (Official deadline was April 30, (Official deadline was April 30, 4 day grace period)4 day grace period)

Page 3: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

ProblemSet 4ProblemSet 4

Questions from topics since PS3: Questions from topics since PS3: proteomics, metabolomics, protein proteomics, metabolomics, protein predictive methods (seqs & predictive methods (seqs & structure) - 15 points maxstructure) - 15 points max

Exam: computers off, 40 questions, Exam: computers off, 40 questions, 2 hr limit 2 hr limit 20 points max 20 points max

Exam Results added to 4 best Exam Results added to 4 best problem sets (60 points max) and problem sets (60 points max) and project (20 pts max), 100 max.project (20 pts max), 100 max.

Page 4: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Translational Translational Bioinformatics & Bioinformatics &

BioMedical InformaticsBioMedical Informatics Translating science into health Translating science into health gainsgains

The use of information sciences to The use of information sciences to improve health care, biomedical & improve health care, biomedical & clinical researchclinical research

Latest meeting:Latest meeting:

http://www.amia.org/meetings/stb08/http://www.amia.org/meetings/stb08/ Disease informatics. Information Disease informatics. Information management, Semantic web, data management, Semantic web, data integration and mining toolsintegration and mining tools

Page 5: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Biomedical/Health Informaticians are:

A) Knowledge Trackers & Sorters, ‘Info-magicians’B) Decision-support tacticiansC) Complex Adaptive Systems & Process Designers D) Specialized GeneralistsE) Intelligent, Altruistic RealistsF) Agents of ChangeG) All of the above, plus other good things?

(Answer: G -- Bet you didn’t guess.)

Don Detmer

Page 6: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

InformaticsInformatics

BioinformaticsBioinformaticsReally Really biomolecular biomolecular informaticsinformatics

Medical informaticsMedical informatics Really Really clinical clinical informaticsinformatics

Biomedical informaticsBiomedical informaticsCovers both and moreCovers both and more

Page 7: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Biomedical informatics:Biomedical informatics: Public health (population) informaticsPublic health (population) informatics

CDC, Health Information ManagementCDC, Health Information Management Consumer Health informaticsConsumer Health informatics Clinical informaticsClinical informatics Nursing informaticsNursing informatics Imaging informaticsImaging informatics Dental informaticsDental informatics Clinical Research informaticsClinical Research informatics Veterinary informaticsVeterinary informatics Pharmacy informaticsPharmacy informatics BioinformaticsBioinformatics

Page 8: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Informatics in PerspectiveInformatics in Perspective

Basic Research

BiologicalFoundations

Health Care Systems

Applied Research

Medical Informatics Methods, Techniques, and Theories

Imaging Informatic

s

Clinical InformaticsBioinformatics

Public Health Informatics

Molecular andCellularProcesses

Tissues andOrgans

Individuals(Patients)

PopulationsAnd Society

Page 9: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

You might be a public You might be a public health professional if you health professional if you

are….are…. looking to control the most basic of human looking to control the most basic of human functions, e.g., lobbying the Federal functions, e.g., lobbying the Federal Trade Commission to investigate snack-food Trade Commission to investigate snack-food and soft-drink marketing or promoting a and soft-drink marketing or promoting a “twinkie tax." “twinkie tax."

worrying about eating, smoking, HIV/AIDS, worrying about eating, smoking, HIV/AIDS, bioterrorism, health literacy and hand bioterrorism, health literacy and hand washing all in one day.washing all in one day.

spending hours per day trying to define spending hours per day trying to define yourself, your work, and explaining your yourself, your work, and explaining your work to others.work to others.

Page 10: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Efforts to Implement Health Information Technology in UK & USA

U.S. U.K.Initial year of national IT effort

2006 2002

Expected year of complete implementation2016 2014

Estimate of total investment (as of 2005)*$125M $11.5B

Total investment per capita (as of 2005)** $0.43 $192.79

* In U.S. dollars. Exchange rates as of September 2005: $1 U.S. = $1.31 AUS; $1.19 CAN; $0.80 EURO; $6.21 NOR; $0.54 U.K.

** In U.S. dollars. Per capita is based on 2003 population numbers from the Organization for Economic Cooperation and Development (OECD).

‘Source: Adapted from G. F. Anderson et al, “Health Care Spending and Use of Information Technology in OECD Countries,”

Health Affairs, May/June 2006 25(3):819–31.

Page 11: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

 

- Sir Cyril Chantler

Medicine used to be simple, ineffective, & relatively safe. 

Now it is complex, effective, & potentially dangerous.

Page 12: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

 

- Will Rogers

The future just isn’t what it used to be.

Page 13: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

“… not what it used to be.”

• Demographics– Aging & Chronic Illness

• Global Diseases/Awareness/Globalization

• Knowledge Explosion– Genomics, Proteomics & Epigenetics– Data v. Intelligence (best evidence)

• Social Dynamics – Consumerism

• Sustainability - $2 trillion/year & rising

• Technology

Page 14: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Information Big Bang

Page 15: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Medical InformaticsMedical Informatics

Expert SystemsExpert Systems Decision SupportDecision Support Information Filtering / Information Filtering / AggregationAggregation

Medical Records (HL7)Medical Records (HL7) Medical imaging (DICOM)Medical imaging (DICOM)

Page 16: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Medical informatics: Medical informatics: Controlled TerminologyControlled Terminology

A finite, enumerated set of terms A finite, enumerated set of terms intended to convey information intended to convey information unambiguouslyunambiguously Diagnostic ProceduresDiagnostic Procedures Therapeutic ProceduresTherapeutic Procedures MedicationsMedications DiagnosesDiagnoses FindingsFindings OrganismsOrganisms AnatomyAnatomy

Page 17: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

What’s out thereWhat’s out there

ICD9-CM & ICD-10 ICD9-CM & ICD-10 (International (International Classification of Diseases, the standard for Classification of Diseases, the standard for coding the diagnosis in MR) coding the diagnosis in MR)

SNOMED - SNOMED - Systematized Nomenclature of Systematized Nomenclature of MedicineMedicine

NHS Clinical Terms NHS Clinical Terms ((formerlyformerly READ READ Clinical Clinical Classification)Classification)

Nursing terminologiesNursing terminologies LOINC: http://loinc.org/LOINC: http://loinc.org/ MeSH, MedPixMeSH, MedPix UMLSUMLS

Page 18: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Classifying Disease Classifying Disease based on Genomicsbased on Genomics

Correlation of 11k gene ortholog Correlation of 11k gene ortholog families v. 75 diseasesfamilies v. 75 diseases

1) Breast Cancer similar to Endocrine 1) Breast Cancer similar to Endocrine diseasedisease

2) Multiple Sclerosis close to Muscular 2) Multiple Sclerosis close to Muscular Dystrophy & Myocardial InfarctionDystrophy & Myocardial Infarction

3) Colon Polyps close to CA Colon3) Colon Polyps close to CA Colon4) SNOMED better than ICD4) SNOMED better than ICD

Page 19: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Genomics & EpigeneticsGenomics & Epigenetics

Page 20: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

FINAL ReviewFINAL Review

Advanced Search in EntrezAdvanced Search in Entrez Boolean logicBoolean logic Terms & FieldsTerms & Fields

Definitions & key concepts of Definitions & key concepts of bioinformaticsbioinformatics

Types of data and formatsTypes of data and formats Database management: key conceptsDatabase management: key concepts Programming languages used for R&D in Programming languages used for R&D in the biological sciences; frequent the biological sciences; frequent taskstasks

Page 21: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Entrez Map ViewerEntrez Map Viewer OMIMOMIM dbSNP, type of variation, haplotypesdbSNP, type of variation, haplotypes Sequence databases, formats, Sequence databases, formats, symbols, codessymbols, codes

Sequence analysis toolsSequence analysis tools PharmacogenomicsPharmacogenomics Sequence Alignments: methods, Sequence Alignments: methods, software, algorithmssoftware, algorithms

Similarity, homologySimilarity, homology Scoring matricesScoring matrices

Page 22: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Types and elements of genomic maps, Types and elements of genomic maps, markersmarkers

Gene finding – what can be searched Gene finding – what can be searched and found? Intrinsic & extrinsic and found? Intrinsic & extrinsic methods. Models, measures of methods. Models, measures of accuracyaccuracy

Genome Organization Genome Organization (introns, repeats, UTRs)(introns, repeats, UTRs)

Sensitivity, Specificity, Sensitivity, Specificity, Correlation, ScoreCorrelation, Score

RNA informatics – what can be RNA informatics – what can be predicted & why? Types of RNA genespredicted & why? Types of RNA genes

Dot plots, ROC curvesDot plots, ROC curves

Page 23: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

MSA, tools, approaches, MSA, tools, approaches, applicationsapplications

Phylogenetics conceptsPhylogenetics concepts UPGMA, NJ, FM, ME ||MP, MLUPGMA, NJ, FM, ME ||MP, ML Bootstrap (scramble MSA)Bootstrap (scramble MSA) Hamming & Levenshtein distancesHamming & Levenshtein distances

"=" Match; "o" Substitution; "+" Insertion; "-" Deletion

Page 24: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Maximum parsimony predicts the evolutionary tree or trees that minimize the number of steps required to generate the observed

variation in the sequences from common ancestral sequences

-- Distance methods are based on genetic distances between sequence pairs in an MSA (e.g. NJ)-- Maximum likelihood (ML) methods are especially useful when there is considerable variation among the sequences in MSA to be analyzed. The ML method is similar to the MP method.

Page 25: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

-omics technologies, large scale -omics technologies, large scale sequencing, hybridization techniquessequencing, hybridization techniques

Top-down and bottom-up approaches Top-down and bottom-up approaches for network reconstructionfor network reconstruction

Levels of abstraction in Levels of abstraction in bioinformatics (central dogma, bioinformatics (central dogma, motifs, metabolic pathways, protein motifs, metabolic pathways, protein sequence motifs)sequence motifs)

Types and elements of graphs, Types and elements of graphs, characteristics of biological characteristics of biological networks (small world, hubs – networks (small world, hubs – conservation, interaction with other conservation, interaction with other hubs)hubs)

Page 26: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Bioinformatics tools to design Bioinformatics tools to design Primers, Probes & cloning strategiesPrimers, Probes & cloning strategies

Tools to annotate probes, map array Tools to annotate probes, map array datadata

Types of arrays; types of probes; Types of arrays; types of probes; sequencing platforms (sequencing platforms (oligo,spotted oligo,spotted

cDNA,TaqMAn,BeadChips,Exon,Tiling,SAGE…cDNA,TaqMAn,BeadChips,Exon,Tiling,SAGE…)) Microarray experiment databasesMicroarray experiment databases Tools to perform statistical analysis Tools to perform statistical analysis of microarray dataof microarray data

Major statistics concepts (PCA, k-Major statistics concepts (PCA, k-means & 7 hierarchical clustering, t-means & 7 hierarchical clustering, t-tests, ANOVA, p-value) tests, ANOVA, p-value)

1 question in today’s PS4!1 question in today’s PS4!

Page 27: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

-omics & omes (definitions, -omics & omes (definitions, experimental techniques, software experimental techniques, software tools)tools)

2D PAGE vs Mass Spec, protein 2D PAGE vs Mass Spec, protein arrays: principles & typical arrays: principles & typical results; software, applicationsresults; software, applications

De novo and sequence tagging De novo and sequence tagging algorithmsalgorithms

Metabolomics: exp. techniques and Metabolomics: exp. techniques and data processing (and pre-data processing (and pre-processing) approachesprocessing) approaches

Supervised and unsupervised methodsSupervised and unsupervised methods

Page 28: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Examples of Protein Examples of Protein FeaturesFeatures

Composition FeaturesComposition Features Mass, pI, Absorptivity, RgMass, pI, Absorptivity, Rg

Sequence FeaturesSequence Features Active sites, Binding Sites, Active sites, Binding Sites, Targeting, Location, Property Targeting, Location, Property Profiles, 2Profiles, 2oo structure elements structure elements

Structure FeaturesStructure Features Super-Secondary Structure, Global Super-Secondary Structure, Global Fold, Volume Fold, Volume http://www.expasy.org/tools/

Page 29: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.

Bioinformatics Tools & Bioinformatics Tools & ServersServers

Protein structure databasesProtein structure databases Protein structure predictionProtein structure prediction Protein structure validationProtein structure validation Protein structure visualizationProtein structure visualization

Homology vs Threading vs Ab Homology vs Threading vs Ab initio predictioninitio prediction

Page 30: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.
Page 31: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.
Page 32: Final Review. Translational bioinformatics and medical informatics. Unit 29 BIOL221T: Advanced Bioinformatics for Biotechnology Irene Gabashvili, PhD.