1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

104
1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo http://ontologist.com

Transcript of 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

Page 1: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

1

The Unbearable Lightness of Biomedical Informatics

Barry Smith

Saarbrücken/Buffalo

http://ontologist.com

Page 2: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

2

if Medical WordNet* is the solution

what is the problem?

*Coling Proceedings, Vol. 1, pp. 371-380

Page 3: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

3

Page 4: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

4

Cerebellar tumor

Page 5: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

5

DNA

Protein

Organelle

Cell

Tissue

Organ

Organism

10-5 m

10-1 m

10-9 m

Page 6: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

6

The quantity-quality divide30,000 genes in human200,000 proteins100s of cell types100,000s of disease types 1,000,000s of biochemical pathways (including

disease pathways)

… legacy of Human Genome Project… and of attempts to institute the electronic

health record

Page 7: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

7

DNA

Protein

Organelle

Cell

Tissue

Organ

Organism

10-5 m

10-1 m

10-9 m

Page 8: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

8

FUNCTIONAL GENOMICS

proteomics,

reactomics,

metabonomics,

toxicopharmacogenomics

phenomics,

behaviouromics,

Page 9: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

9

DNA

Protein

Organelle

Cell

Tissue

Organ

Organism

10-5 m

10-1 m

10-9 m

The method of annotations

Page 10: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

10

DNA

Protein

Organelle

Cell

Tissue

Organ

Organism

10-5 m

10-1 m

10-9 m

The method of indexing

Page 11: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

11

The Gene Ontology

menopause

sensitivity to blue light

heptolysis

Page 12: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

12

Page 13: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

13

How overcome incompatibilities between different scientific index

terms?

immunology

genetics

cell biology

Page 14: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

14

One answer (statistical) computational linguistics

Pattern recognition based on string searches

Page 15: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

15

String searches need constraints

we can’t leave it to luck to overcome terminological incompatibilities

Page 16: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

16

Remember –different disciplines are using different terminologies to refer to the same

objects, processes, features in reality

immunology

genetics

cell biology

Page 17: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

17

An alternative answer:

“Ontology”

Page 18: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

18

Ontology, roughly:

Overcome terminological incompatibilities by creating a standardized framework into which diverse vocabularies can be mapped

Page 19: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

19

Kinds of Ontologies

Terms

General Logic

Thesauri

formalTaxonomies

Frames(OKBC)

Data Models(UML, STEP)

Description Logics

(DAML+OIL)

Principled, informal

hierarchies

ad hoc Hierarchies

(Yahoo!)structured Glossaries

XML DTDs

Data Dictionaries

(EDI)

‘ordinary’Glossaries

XML Schema

DB Schema

Glossaries & Data Dictionaries

MetaData,XML Schemas, & Data Models

Formal Ontologies & Inference

Thesauri, Taxonomies

Michael Gruninger

Page 20: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

20

Kinds of Ontologies

A shared vocabulary plus a specification of its intended meaning

meaning specifiedexplicitly in a logically rigorous way

Two extremes

Page 21: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

21

Kinds of Ontologies

Terms

General Logic

Thesauri

formalTaxonomies

Frames(OKBC)

Data Models(UML, STEP)

Description Logics

(DAML+OIL)

Principled, informal

hierarchies

ad hoc Hierarchies

(Yahoo!)structured Glossaries

XML DTDs

Data Dictionaries

(EDI)

‘ordinary’Glossaries

XML Schema

DB Schema

Glossaries & Data Dictionaries

MetaData,XML Schemas, & Data Models

Formal Ontologies & Inference

Thesauri, Taxonomies

Page 22: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

22

Kinds of Ontologies

A shared vocabulary plus a specification of its intended meaning

meaning specifiedexplicitly in a logically rigorous way

Too expensive

Page 23: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

23

Kinds of Ontologies

A shared vocabulary plus a specification of its intended meaning

Meaning specified informally via natural

language

Two extremes

Page 24: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

24

Work on biomedical ontologies grew out of work on medical thesauri and nomenclatures

Page 25: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

25

Kinds of Ontologies

Terms

General Logic

Thesauri

formalTaxonomies

Frames(OKBC)

Data Models(UML, STEP)

Description Logics

(DAML+OIL)

Principled, informal

hierarchies

ad hoc Hierarchies

(Yahoo!)structured Glossaries

XML DTDs

Data Dictionaries

(EDI)

‘ordinary’Glossaries

XML Schema

DB Schema

Glossaries & Data Dictionaries

MetaData,XML Schemas, & Data Models

Formal Ontologies & Inference

Thesauri, Taxonomies

Page 26: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

26

Fruit

Orange

Vegetable

similarTo

ApfelsinesynonymWith

NarrowerTerm

Graph with labels edges (similarTo, Narrower, synonymWith)Fixed set of edge labels (a.k.a. relations)

Goble & Shadbolt

Page 27: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

27

Unified Medical Language System (UMLS)

UMLS Metathesaurus:

1 million biomedical concepts

2.8 million concept names

from more than 100 controlled vocabularies and classifications

built by US National Library of Medicine

Page 28: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

28

UMLS Source Vocabularies

MeSH – Medical Subject Headings

ICD International Classification of Diseases

GO – Gene Ontology

FMA – Foundational Model of Anatomy

Page 29: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

29

To reap the benefits of standardization

we need to make ONE SYSTEM out of many different terminologies

= UMLS “Semantic Network”nearest thing to an “ontology” in the UMLS

Page 30: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

30

UMLS SN

Alexa McCray, “An Upper Level Ontology for the Biomedical Domain”, Comparative and Functional Genomics, 4 (2003), 80-84.

Page 31: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

31

UMLS SN

134 Semantic Types

54 types of edges (relations)

yielding a graph containing more than 6,000 edges

Page 32: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

32Fragment of UMLS SN

Page 33: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

33

Page 34: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

34

Page 35: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

35

UMLS SN Top Level

entity event

physical conceptual object entity

organism

Page 36: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

36

conceptual entity

Organism Attribute

Finding

Idea or Concept

Occupation or Discipline

Organization

Group

Group Attribute

Intellectual Product

Language

Page 37: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

37

conceptual entity

Organism Attribute

Finding

Idea or Concept

Occupation or Discipline

Organization

Group

Group Attribute

Intellectual Product

Language

Page 38: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

38

Idea or ConceptFunctional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

Page 39: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

39

Idea or ConceptFunctional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

Page 40: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

40

Idea or ConceptFunctional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

Page 41: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

41

Idea or ConceptFunctional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

Page 42: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

42

Lake Geneva

is an Idea or Concept

Page 43: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

43

Idea or ConceptFunctional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

Page 44: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

44

UMLS

Fingers is_a Body Location or Region

Hand is_a Body Part, Organ, or Organ Component

hand part_of body

BUT NOT

fingers part_of hand

Page 45: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

45

Problem: Running together of concepts and entities in reality

bioinformatics à la UMLS SN( like many “knowledge engineering” disciplines )

floats free from reality in a conceptual world

of its own creation

Page 46: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

46

Blood Pressure OntologyThe hydraulic equation:

BP = CO*PVR

arterial blood pressure (BP) is directly proportional to the product of blood flow (cardiac output, CO) and peripheral vascular resistance (PVR).

Page 47: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

47

UMLS SN

blood pressure is an Organism Function

cardiac output is a Laboratory or Test Result or Diagnostic Procedure

Page 48: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

48

BP = CO*PVR thus asserts that

blood pressure is proportional either to a laboratory or test result or to a diagnostic procedure

Page 49: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

49

Problem: Confusion of reality with our (ways of gaining) knowledge

about reality

Page 50: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

50

UMLS Semantic Network

entity

physical conceptual object entity

Page 51: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

51

Physical Object

Substance

Food Chemical Body

Page 52: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

52

Chemical

Chemical Chemical

Viewed Viewed

Structurally Functionally

Page 53: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

53

Problem: Confusion of objects with our ways of referring to

objects

Page 54: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

54

Chemical

Chemical ChemicalViewed Viewed

Structurally Functionally

Inorganic Organic Enzyme Biomedical or Chemical Chemical Dental Material

Page 55: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

55

This multiple inheritance leads to errors in coding

Gene Ontology will eliminate multiple inheritance

Page 56: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

56

UMLS Semantic Network

entity

physical conceptual object entity

organism

is_a

Page 57: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

57

UMLS SN

is_a =def.

If one item ‘is_a’ another item then the first item is more specific in meaning than the second item. (Italics added)

Page 58: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

58

fish is_a vertebrate

copulation is_a biological process

both testes is_a testis

Nazi is_a Nazism

plant parts is_a plant

Page 59: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

59

Page 60: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

60

What are the nodes in this graph?

Almost all nodes are linked to other nodes by a multiplicity of different types of edges

Compare: swimming is healthy

swimming has 8 letters

Page 61: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

61

Semantic Network Definition:

Concept =def. An abstract concept, such as a

social, religious, or philosophical concept

UMLS Definition:

Concept =def. A class of synonymous terms

Page 62: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

62

Page 63: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

63

How can concepts figure as relata of these relations?

part_of = def. Composes, with one or more other physical units, some larger whole

causes =def. Brings about a condition or an effect.

contains =def. Holds or is the receptacle for fluids or other substances.

Page 64: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

64

How can a set of synonymous terms serve as a receptacle for fluids or other substances?

How can sets of synonymous terms stand in relations such as affects or causes?

Page 65: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

65

connected_to =def. Directly attached to another physical unit as tendons are

connected to muscles.

How can a concept be directly attached to another physical unit?

Page 66: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

66

What are the relata which are linked by the edges in the SN

graph?

Page 67: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

67

To answer this question

we need to distinguish clearly between concepts and classes:

concepts are creatures of cognition

classes are invariants (types, kinds, universals) out there in reality

Page 68: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

68

If ontologies are about meanings / concepts

it becomes impossible to deal coherently with those relations between entities in reality which involve appeal to both classes and their instances.

Page 69: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

69

Illustration re: part_of

heart part_of human

human heart part_of human

testis part_of human

human testis part_of human

Page 70: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

70

For instances:part_of = instance-level parthood

(for example between Mary and her heart)

For classesA part_of B =def. given any instance a of A

there is some instance b of B such that a part_of b

This is an assertion about As.

Page 71: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

71

a adjacent_to b

(instance-level adjacency, for example between Mary’s head and Mary’s neck)

For classes:

A adjacent_to B =def. given any instance a of A there is some instance b of B which is such that a adjacent_to b

Page 72: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

72

A adjacent_to B

as an assertion about classes

is never an assertion about As exclusively

Page 73: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

73

A adjacent_to B =def.

given any instance a of A there is some instance b of B which is such that a adjacent_to b

and

given any instance b of B there is some instance a of A which is such that a adjacent_to b

Page 74: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

74

Almost all of the 54 types of edges in SN are dealt with

incoherently

part_of HAS INVERSE has_part

nucleus part_of cell

cell has_part nucleus

Page 75: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

75

Page 76: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

76

Acquired Abnormality affects FishExperimental Model of Disease affects

FungusFood causes Experimental Model of

DiseaseBacterium causes Experimental Model of

DiseaseBiomedical or Dental Material causes

Mental or Behavioral Dysfunction Manufactured Object causes Disease or

SyndromeVitamin causes Injury or Poisoning

Page 77: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

77

How to do better?

Page 78: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

78

How to do better?How to create a network of biomedically relevant terms/classes, with coherently defined relations between them, to which expert terms of the UMLS can be assigned in a maximally intelligible way?

Page 79: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

79

What linguistic framework

is shared in common by immunologists, geneticists and cell biologists,

by phenobehavioromists and by toxicopharmacogenomists?

Page 80: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

80

Answer:

the natural language they all use to talk about biological (biomedical) phenomena

Page 81: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

81

BioWordNetjoint work with

Christiane Fellbaum

(see paper in Proceedings)

Page 82: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

82

BioWordNet

use WordNet’s biomedical vocabulary, to create a better alternative to UMLS SN

Page 83: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

83

Strengths of WordNet 2.0

Open source

Very broad coverage

Is-a / part-of architecture

Tool for automatic sense disambiguation

Page 84: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

84

Weaknesses of WordNet 2.0Problems with relationsMixes up expert and non-expert vocabularyErrorsGapsNoise

all prevent WordNet’s being used in scientific context as substitute for UMLS SN

Page 85: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

85

Fix WordNet’s relations by using the methodology outlined above

already applied to:

Foundational Model of Anatomy

Gene Ontology

Open Biological Ontologies

Page 86: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

86

Institute for Formal Ontology and Medical Information Science

Saarbrücken

http://ifomis.org

Page 87: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

87

WordNet mixes up expert and non-expert vocabulary,

both current and medieval:

suppuration#2 {pus, purulence, suppuration, ichor, sanies, festering}

Page 88: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

88

WordNet contains biomedically relevant errors

snore-sleep

WordNet: if someone snores, then he necessarily also sleeps

snoring = the respiratory induced vibration of glottal tissues

associated not only with sleep but also with relaxation or obesity

Page 89: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

89

WordNet has too much noise for purposes of scientific applications

Page 90: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

90

13 senses for feel is a verbexperience – She felt resentfulfind – I feel that he doesn't like mefeel – She felt small and insignificant; feel – We felt the effects of inflationfeel – The sheets feel softgrope –He felt for his walletfinger – Feel this soft cloth! explore – He felt his way around the dark room)feel – It feels nice to be home againfeel – He felt the girl in the movie theater)

Page 91: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

91

Medical senses of ‘feel’

palpate – examine a body part by palpation:

The runner felt her pulse.

sense – perceive by a physical sensation, e.g. coming from the skin or muscles:

He felt his flesh crawl

feel – seem with respect to a given sensation:

My cold is gone – I feel fine today

Page 92: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

92

WordNet has gaps even in its coverage of biomedical natural

language

Page 93: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

93

WordNet seness of ‘regulation’1. regulation (ordinance, rule)2. rule, regulation -- (a principle that customarily governs behavior; "short haircuts were the regulation")3. regulation -- (the state of being controlled or governed)4. regulation -- (the ability of an early embryo to continue normal development after its structure has been somehow damaged)5. regulation, regularization, regularisation -- (the act of bringing to uniformity)6. regulation, regulating -- (the act of controlling according to rule; "fiscal regulations are in the hands of politicians")

Page 94: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

94

Biological sense of ‘regulation’:

A process that modulates the frequency, rate or extent of behavior

(Gene Ontology)

Page 95: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

95

WordNet senses of ‘inhibition’1. inhibition, suppression -- ((psychology) the conscious exclusion of unacceptable thoughts or desires)2. inhibition -- (the quality of being inhibited)3. inhibition -- the process whereby nerves can retard or prevent the functioning of an organ or part; "the inhibition of the heart by the vagus nerve")4. prohibition, inhibition, forbiddance -- (the action of prohibiting or forbidding)

Page 96: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

96

Biological senses of ‘inhibition’ much broader

inhibition = negative regulation

enzymes can be inhibited

reactions can be inhibited

… and not only by nerves

Page 97: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

97

WordNet senses of ‘binding’1. binding -- (the capacity to attract and hold something)2. binding -- (a strip sewn over or along an edge for reinforcement or decoration)3. dressing, bandaging -- (the act of applying a bandage)4. binding, book binding; "the book had a leather binding")

Page 98: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

98

biological sense of ‘binding’

interacting selectively with

(Gene Ontology)

Page 99: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

99

Remove errors, noise and gaps in a two-stage process

1.select biomedically relevant natural-language terms from WordNet 2.0 extended by standard biomedical information sources

2.validate these terms and the relations between them

Page 100: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

100

Validationeach arc in BWN is converted into a natural-

language sentence

e.g. ‘mumps is an inflammation’

via controlled human subjects experiments:

are accredited

1. as intelligible by non-experts

2. as true by experts

Page 101: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

101

we use logical methods to ensure a coherent treatment of BWN’s

upper-level classes and relations

and thereby also bring logical rigor in a practical fashion to the

whole of the UMLS Metathesaurus

Page 102: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

102

Bring ontological rigour to BWN

Terms

General Logic

Thesauri

formalTaxonomies

Frames(OKBC)

Data Models(UML, STEP)

Description Logics

(DAML+OIL)

Principled, informal

hierarchies

ad hoc Hierarchies

(Yahoo!)structured Glossaries

XML DTDs

Data Dictionaries

(EDI)

‘ordinary’Glossaries

XML Schema

DB Schema

Glossaries & Data Dictionaries

MetaData,XML Schemas, & Data Models

Formal Ontologies & Inference

Thesauri, Taxonomies

Page 103: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

103

The long-term goal

BWN should serve as scaffolding/indexing system for the much larger and denser net of expert biomedical terminology which is the UMLS Metathesaurus

Page 104: 1 The Unbearable Lightness of Biomedical Informatics Barry Smith Saarbrücken/Buffalo .

104The End