1 VT. 2 Medical Ontology Barry Smith .

90
1 VT

Transcript of 1 VT. 2 Medical Ontology Barry Smith .

Page 1: 1 VT. 2 Medical Ontology Barry Smith .

1

VT

Page 2: 1 VT. 2 Medical Ontology Barry Smith .

2

Medical Ontology

Barry Smith

http://ifomis.de

Page 3: 1 VT. 2 Medical Ontology Barry Smith .

3

IFOMIS

Institute for Formal Ontology and Medical Information Science

Faculty of Medicine

University of Leipzig

Page 4: 1 VT. 2 Medical Ontology Barry Smith .

4

Reference Ontology

An ontology is a theory of a domain of entities in the world

Ontology is outside the computer

seeks maximal expressiveness and adequacy to reality

and sacrifices computational tractability for the sake of representational adequacy

Page 5: 1 VT. 2 Medical Ontology Barry Smith .

5

Reference Ontology

a theory of the tertium quid

– called reality –

needed to hand-callibrate database/terminology systems

Page 6: 1 VT. 2 Medical Ontology Barry Smith .

6

Methodology

Get ontology right first

(realism; descriptive adequacy; rather powerful logic);

solve tractability problems later

Page 7: 1 VT. 2 Medical Ontology Barry Smith .

7

The Reference Ontology Community

IFOMIS (Leipzig) Laboratories for Applied Ontology (Trento/Rome,

Turin)Foundational Ontology Project (Leeds)Ontology Works (Baltimore)Ontek Corporation (Buffalo/Leeds)Language and Computing (L&C)

(Belgium/Philadelphia)

Page 8: 1 VT. 2 Medical Ontology Barry Smith .

8

Domains of Current Work

IFOMIS Leipzig: Medicine, Bioinformatics

Laboratories for Applied Ontology

Trento/Rome: Ontology of Cognition/Language

Turin: Law

Foundational Ontology Project: Space, Physics

Ontology Works: Genetics, Molecular Biology

Ontek Corporation: Biological Systematics

Language and Computing: Natural Language Understanding

Page 9: 1 VT. 2 Medical Ontology Barry Smith .

9

Ontologie als Zweig der Philosophie

die Wissenschaft von den Arten und Strukturen von Objekten, Qualitäten, Prozessen, Ereignissen, Funktionen und Relationen in allen Bereichen der Wirklichkeit

Page 10: 1 VT. 2 Medical Ontology Barry Smith .

10

Aristotle

Der erste Ontologe

Page 11: 1 VT. 2 Medical Ontology Barry Smith .

11

Eine biologische Ontologie

Page 12: 1 VT. 2 Medical Ontology Barry Smith .

12

Linné

1763: Genera Morborum

(Nosologie

oder

Ontologie der Krankheitsarten)

Page 13: 1 VT. 2 Medical Ontology Barry Smith .

13

Q: Warum “Ontologie” in der medizinischen Informatik?

A: Das Turm von Babel-Problem der Informationssysteme

Page 14: 1 VT. 2 Medical Ontology Barry Smith .

14

Turm von Babel

Jedes Informationssystem basiert auf einer eigenen Terminologie

Wie können wir die Inkompatibilitäten lösen, die entstehen, wenn Daten aus verschiedenen Quellen kombiniert werden?

Vgl. Wie können wir Anatomie und Physiologie integrieren?

Page 15: 1 VT. 2 Medical Ontology Barry Smith .

15

Wie lösen Medizinstudenten dieses Problem?

Vielfach erst durch die Begegnung mit dem Patienten Der Patient und die in ihm ablaufenden Prozesse dienen als Kristallisationspunkt für eine sinnvolle Ordnung sonst isoliert stehender (gelernter) Fakten.

(Aus Wissen-dass wird Wissen-wie)

Page 16: 1 VT. 2 Medical Ontology Barry Smith .

16

Dem Computer fehlt praktisches Wissen

Wie können in Medizininformations-systemen isolierte Datenartefakte zu konsistentem und anwendbarem Wissen integriert werden?

Page 17: 1 VT. 2 Medical Ontology Barry Smith .

17

Ursprünglicher Traum der Ontologie in der Informatik

Eine einzige allumfassende Taxonomie aller Gegenstandsarten, die als zentrales integrierendes Kategoriensystem für alle Informationssysteme dient.

Dieser Traum ist ausgeträumt ...

Page 18: 1 VT. 2 Medical Ontology Barry Smith .

18

Gegenwärtige Lösungen

Standardisierte Terminologien

UMLS

SNOMED

ICD-10

Gene Ontology

Digital Anatomist

usw.

Page 19: 1 VT. 2 Medical Ontology Barry Smith .

19

Standardisierte Terminologien

sollen Zugriff auf biomedizinische Literatur und Faktendatenbanken erleichtern

Beispielsweise um Verbindungen zwischen spezifischen Genen und spezifischen Körperreaktionen auffindbar zu machen

Eine neue Art medizinischer Forschung soll dadurch ermöglicht werden

Page 20: 1 VT. 2 Medical Ontology Barry Smith .

20

Database and terminology standardization

is desparately needed in medical and bioinformatics

to enable the huge amounts of existing data to be fused together automatically

Page 21: 1 VT. 2 Medical Ontology Barry Smith .

21

To reap the benefits of standardization

we need to make ONE SYSTEM out of many different terminologies

But how?

Through government edict? (Scandinavia)

Through efforts of international standards bodies (ISO, CEN …)?

Through UMLS Metathesaurus?

Page 22: 1 VT. 2 Medical Ontology Barry Smith .

22

Zentrale Schaltstelle

UMLS

Universal Medical Language System

National Library of Medicine

Bethesda, MD

Page 23: 1 VT. 2 Medical Ontology Barry Smith .

23

UMLS Metathesaurus

eine riesige Kombination verschiedener maschinenlesbarer Quellterminologien

800,000 Begriffe

10 Mio. Beziehungen

Page 24: 1 VT. 2 Medical Ontology Barry Smith .

24

Beispiele für Quell-Terminologien

SNOMED-RT

Systematized Nomenclature of Medicine

MeSH

Medical Subject Headings

Page 25: 1 VT. 2 Medical Ontology Barry Smith .

25

is_a trees

hormone

peptide hormone digestive hormone

adrenocorticotropin glycopeptide hormone

follicle-stimulating hormone

Page 26: 1 VT. 2 Medical Ontology Barry Smith .

26

is_a = ist ein / ist von der Art

Diabetes Melletus is_a Disease

Page 27: 1 VT. 2 Medical Ontology Barry Smith .

27

Bad Coding

deriving from over-simplification

and from failure to pay attention to ontological principles

Z.B. SNOMED

both_testes is_a testis

(beide_Hoden ist_ein Hoden)

Page 28: 1 VT. 2 Medical Ontology Barry Smith .

28

Terminological Incompatibilities

Page 29: 1 VT. 2 Medical Ontology Barry Smith .

29

Representation of Blood in SNOMED

Blood is_a Tissue

Page 30: 1 VT. 2 Medical Ontology Barry Smith .

30

Representation of Blood in MeSH

Blood is_a Bodily Fluid

Barry Smith
Page 31: 1 VT. 2 Medical Ontology Barry Smith .

31

Bad CodingIncompatibilities

Context-Dependence

Standardized Terminologies must be used properly

Page 32: 1 VT. 2 Medical Ontology Barry Smith .

32

people are lazy and idiosyncratic

Sie machen SchreibfehlerJeder pflegt seine eigene Terminologie, die sich mehr oder weniger von der anderer Akteure unterscheidet Sie verwenden verschiedene natürlich-sprachliche Darstellungen der gleichen medizinischen Phänomena

Page 33: 1 VT. 2 Medical Ontology Barry Smith .

33

The codes are not formulated on the basis of clear principles

Therefore inconsistent

Unintuitive

Difficult to train people to use them

Application often depends on context-dependent knowledge

Page 34: 1 VT. 2 Medical Ontology Barry Smith .

34

The IFOMIS Contribution

help to improve standardizations through constructive criticism based on robust ontological principles

Page 35: 1 VT. 2 Medical Ontology Barry Smith .

35

UMLS Metathesaurus

eine riesige Kombination verschiedener maschinenlesbarer Quellterminologien

UMLS Semantic NetworkSemantic Network

bestehend aus 134 Semantic TypesSemantic Types

soll Ordnung in diesem Wust schaffen

Page 36: 1 VT. 2 Medical Ontology Barry Smith .

36

UMLS Semantic Network

entity event

physical conceptual entity entity

Page 37: 1 VT. 2 Medical Ontology Barry Smith .

37

conceptual entity

Organism Attribute

Finding

Idea or Concept

Occupation or Discipline

Organization

Group

Group Attribute

Intellectual Product

Language

Page 38: 1 VT. 2 Medical Ontology Barry Smith .

38

conceptual entity

Organism Attribute

Finding

Idea or Concept

Occupation or Discipline

Organization

Group

Group Attribute

Intellectual Product

Language

Page 39: 1 VT. 2 Medical Ontology Barry Smith .

39

Idea or ConceptFunctional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

Page 40: 1 VT. 2 Medical Ontology Barry Smith .

40

INNSBRUCK

is an Idea or Concept

Page 41: 1 VT. 2 Medical Ontology Barry Smith .

41

Idea or ConceptFunctional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

Page 42: 1 VT. 2 Medical Ontology Barry Smith .

42

Confusion of Ontology and Epistemology

Physical Entity

Chemical Entity

Chemical Chemical

Viewed Viewed

Structurally Functionally

Page 43: 1 VT. 2 Medical Ontology Barry Smith .

43

Confusion of Ontology and Epistemology

the hydraulic equation:

BP = CO*PVR

arterial blood pressure is directly proportional to the product of blood flow (cardiac output, CO) and peripheral vascular resistance (PVR).

Cardiac Output in UMLS = A Finding

Page 44: 1 VT. 2 Medical Ontology Barry Smith .

44

UMLS-Semantic Types:

blood pressure is an Organism Function,

cardiac output is a Laboratory or Test Result or Diagnostic Procedure

BP = CO*PVR thus asserts that

blood pressure is proportional either to a laboratory or test result or to a diagnostic procedure

Page 45: 1 VT. 2 Medical Ontology Barry Smith .

45

The goal

Formulate clear principles for building ontologies

Reconstitute the UMLS Semantic Types on the basis of these principles

Page 46: 1 VT. 2 Medical Ontology Barry Smith .

46

Zusammenarbeit mit der National Library of Medicine

Revision der UMLS Semantic Types und der Gene Ontology

Page 47: 1 VT. 2 Medical Ontology Barry Smith .

47

GO: the Gene Ontology

3 large telephone directories of standardized designations for gene functions and products

organized into hierarchies via is_a and part_of

Page 48: 1 VT. 2 Medical Ontology Barry Smith .

48

GO

can in practice be used only by trained biologists (with know how)

whether a GO-term truly stands in the is_a relation depends e.g. on the type of organism involved

glycosome is part-of cytoplasm only for Kinetoplastidae

Computers have no counterpart of such context-dependent know-how

Page 49: 1 VT. 2 Medical Ontology Barry Smith .

49

GO divided into three disjoint term hierarchies

the cellular component ontology,

e.g. flagellum, chromosome, cell

the molecular function ontology,

e.g. ice nucleation, binding, protein stabilization

the biological process ontology,

e.g. glycolysis, death

Page 50: 1 VT. 2 Medical Ontology Barry Smith .

50

Definition of Molecular Function

“the action characteristic of a gene product.”

On March 2003 all nodes in the Molecular Function ontology (except the root) had ‘activity’ added to their names

-- confusion of function with functioning

(how deal with dormant/suppressed functions?)

Page 51: 1 VT. 2 Medical Ontology Barry Smith .

51

Definition of Biological Process

“A phenomenon marked by changes that lead to a particular result, mediated by one or more gene products”

Page 52: 1 VT. 2 Medical Ontology Barry Smith .

52

How are the 3 ontologies related?

Function = “the action characteristic of a gene product.”

Process = “phenomenon marked by changes that lead to a particular result, mediated by one or more gene products”

No part-whole relations across ontologies?

Page 53: 1 VT. 2 Medical Ontology Barry Smith .

53

The GO isa relation

in its intended meaning indicates a necessary relationship.

That is, when we say “eukaryotic cell isa cell”, we mean that every eukaryotic cell is a cell.

Confusion of necessarily, universally, and permanently

(No time in GO)

Page 54: 1 VT. 2 Medical Ontology Barry Smith .

54

part_of

The Relation part-of: The intended meaning of part-of as explained in the GO Usage Guide is: “can be a part of, not is always a part of”

Page 55: 1 VT. 2 Medical Ontology Barry Smith .

55

Uses of part_of

– membrane part-of cell, intended to mean “a membrane is a part-of any cell”

– flagellum part-of cell, intended to mean “a flagellum is part-of some cells”

– replication fork part-of cell cycle, intended to mean: “a replication fork is part-of the nucleoplasm only during certain times of the cell cycle”

– regulation of sleep part-of sleep, should be corrected to: “regulation of sleep is co-located with and is causally involved with the sleep process”.

Page 56: 1 VT. 2 Medical Ontology Barry Smith .

56

Need to find ways to deal with time in medical informatics

Functions vs. Realizations of Functions

Function is still there even when not being realized

need to be clear about the distinction between continuants and occurrents

Page 57: 1 VT. 2 Medical Ontology Barry Smith .

57

SNAP and SPAN

Page 58: 1 VT. 2 Medical Ontology Barry Smith .

58

SNAP and SPAN

Substances and processesContinuants and occurrents

In preparing an inventory of realitywe keep track of these two different categories of entities in two different ways

Page 59: 1 VT. 2 Medical Ontology Barry Smith .

59

Substances and processes exist in time in different ways

substance

t i m

e

process

Page 60: 1 VT. 2 Medical Ontology Barry Smith .

60

Need for different perspectives

Not one ontology, but a multiplicity of complementary ontologies

Cf. Quantum mechanics: particle vs. wave ontologies

Page 61: 1 VT. 2 Medical Ontology Barry Smith .

61

SNAPshot Video (SPAN)ontology ontology

substance

t i m

e

process

Page 62: 1 VT. 2 Medical Ontology Barry Smith .

62

SNAP and SPAN

stocks and flows

commodities and services

product and process

anatomy and physiology

synchrony and diachrony

Page 63: 1 VT. 2 Medical Ontology Barry Smith .

63

SNAP and SPAN

SNAP entities

- have continuous existence in time

- preserve their identity through change

- exist in toto if they exist at all

SPAN entities

- have temporal parts

- unfold themselves phase by phase

- exist only in their phases/stages

Page 64: 1 VT. 2 Medical Ontology Barry Smith .

64

SNAP: Entities existing in toto at a time

Page 65: 1 VT. 2 Medical Ontology Barry Smith .

65

Three kinds of SNAP entities

• Substances

• Dependent SNAP entities (qualities, functions, roles, powers …)

• Spatial regions, Contexts, Niches

Page 66: 1 VT. 2 Medical Ontology Barry Smith .

66

FunctionsThe function of the

heart is to pump blood

Page 67: 1 VT. 2 Medical Ontology Barry Smith .

67

Page 68: 1 VT. 2 Medical Ontology Barry Smith .

68

SNAP

Fiat part of substanceExtremity (hand, arm)

Bodily System

Page 69: 1 VT. 2 Medical Ontology Barry Smith .

69

SPAN: Entities extended in time

SPANEntity extended in time

Portion of Spacetime

Fiat part of process *First phase of a clinical trial

Spacetime worm of 3 + Tdimensions

occupied by life of organism

Temporal interval *projection of organism’s life

onto temporal dimension

Aggregate of processes *Clinical trial

Process[±Relational]

Circulation of blood,secretion of hormones,course of disease, life

Processual Entity[Exists in space and time, unfolds

in time phase by phase]

Temporal boundary ofprocess *

onset of disease, death

Page 70: 1 VT. 2 Medical Ontology Barry Smith .

70

SPAN: Entities extended in time

Page 71: 1 VT. 2 Medical Ontology Barry Smith .

71

SPAN: Entities extended in time

FunctioningThe heart’s pumping

of blood

Page 72: 1 VT. 2 Medical Ontology Barry Smith .

72

Granularity

spatial region substance

parts of substances are always substances

Page 73: 1 VT. 2 Medical Ontology Barry Smith .

73

Granularity

spatial region substance

parts of spatial regions are always spatial regions

Page 74: 1 VT. 2 Medical Ontology Barry Smith .

74

Granularity

process

parts of processes are always processes

Page 75: 1 VT. 2 Medical Ontology Barry Smith .

75

MORAL

Relations crossing the SNAP/SPAN border are never part-relations

Page 76: 1 VT. 2 Medical Ontology Barry Smith .

76

Relations crossing the SNAP/SPAN border are never part-relations

John’s lifesubstance John

physiological processes

sustaining in existence

Page 77: 1 VT. 2 Medical Ontology Barry Smith .

77

DIGESTIVE SYSTEM

Page 78: 1 VT. 2 Medical Ontology Barry Smith .

78

RESPIRATORY SYSTEM

Page 79: 1 VT. 2 Medical Ontology Barry Smith .

79

URINARYSYSTEM

Page 80: 1 VT. 2 Medical Ontology Barry Smith .

80

IMMUNE SYSTEM

Page 81: 1 VT. 2 Medical Ontology Barry Smith .

81

CIRCULATORY SYSTEM

Page 82: 1 VT. 2 Medical Ontology Barry Smith .

82

CIRCULATORY SYSTEM (Principal Organs)

Page 83: 1 VT. 2 Medical Ontology Barry Smith .

83

The autonomous part of the nervous system (regulatory links to other systems)

Page 84: 1 VT. 2 Medical Ontology Barry Smith .

84

ENDOCRINESYSTEM

Page 85: 1 VT. 2 Medical Ontology Barry Smith .

85

Bodily Systems are Component Parts of Bodies

respiratorydigestive skeletal circulatorymusculatory immune

Page 86: 1 VT. 2 Medical Ontology Barry Smith .

86

Page 87: 1 VT. 2 Medical Ontology Barry Smith .

87

A system for keeping your jewels safe

Page 88: 1 VT. 2 Medical Ontology Barry Smith .

88

Bodily Systems interconnect

Page 89: 1 VT. 2 Medical Ontology Barry Smith .

89

Systems are SNAP entitiesThey are dependent continuantsWe can take photographs of them

Page 90: 1 VT. 2 Medical Ontology Barry Smith .

90

The Monarchic System of Government