1 Ontology as Master Discipline of Information Science Barry Smith .

106
1 Ontology as Master Discipline of Information Science Barry Smith http://ifomis.de

Transcript of 1 Ontology as Master Discipline of Information Science Barry Smith .

Page 1: 1 Ontology as Master Discipline of Information Science Barry Smith .

1

Ontology as Master Discipline of Information Science

Barry Smith

http://ifomis.de

Page 2: 1 Ontology as Master Discipline of Information Science Barry Smith .

2

Ontology as Master Discipline of Information Science

Barry Smith

http://ifomis.de

Real

Page 3: 1 Ontology as Master Discipline of Information Science Barry Smith .

3

Real Ontology is a branch of philosophy

the science of what is

the science of the kinds and structures of objects, properties, events, processes and relations in reality

Page 4: 1 Ontology as Master Discipline of Information Science Barry Smith .

4

Real ontology seeks to provide a definitive and exhaustive classification of entities in all spheres of being.

Page 5: 1 Ontology as Master Discipline of Information Science Barry Smith .

5

It seeks to answer questions like this:

What categories of entities are needed for a complete description and explanation of all the goings-on in the universe?

Page 6: 1 Ontology as Master Discipline of Information Science Barry Smith .

6

Ontology is in many respects comparable to the theories produced by science

… but it is radically more general than these

Page 7: 1 Ontology as Master Discipline of Information Science Barry Smith .

7

It can be regarded as a kind of generalized chemistry or zoology (Aristotle’s ontology grew out of biological classification)

(Russell: Logic is a zoology of facts)

Page 8: 1 Ontology as Master Discipline of Information Science Barry Smith .

8

Aristotle

First ontologist

Page 9: 1 Ontology as Master Discipline of Information Science Barry Smith .

9

First ontology

(from Porphyry’s Commentary on Aristotle’s Categories)

Page 10: 1 Ontology as Master Discipline of Information Science Barry Smith .

10

Linnaean Ontology

Page 11: 1 Ontology as Master Discipline of Information Science Barry Smith .

11

Ontologies are

(very roughly)

taxonomical trees

Page 12: 1 Ontology as Master Discipline of Information Science Barry Smith .

12

Ontology is distinguished from the special sciences

it seeks to study all of the various types of entities existing at all levels of granularity

Page 13: 1 Ontology as Master Discipline of Information Science Barry Smith .

13

and to establish how they hang together to form a single whole (‘reality’ or ‘being’)

Page 14: 1 Ontology as Master Discipline of Information Science Barry Smith .

14

Unity achieved

via a good theory of relations

and via taxonomies

at different levels of granularity (atomic, molecular, cellular, organismic, etc.)

Page 15: 1 Ontology as Master Discipline of Information Science Barry Smith .

15

Sources for ontological theorizing:

thought experiments

the study of ancient texts

development of formal theories

the results of natural science

now also: working with computers

Page 16: 1 Ontology as Master Discipline of Information Science Barry Smith .

16

The existence of computers

and of large databases

allows us to express old philosophical problems in a new light

Page 17: 1 Ontology as Master Discipline of Information Science Barry Smith .

17

The problem of the unity of science

The logical positivist solution to this problem addressed a world in which sciences are identified with

printed textsWhat if sciences are identified with

Large Databases ?

Page 18: 1 Ontology as Master Discipline of Information Science Barry Smith .

18

Each family of databases

has its own idiosyncratic terms and concepts

by means of which it represents the information it receives

How to resolve the incompatibilities which result when databases need to be merged?

Page 19: 1 Ontology as Master Discipline of Information Science Barry Smith .

19

The Database Tower of Babel Problem

Page 20: 1 Ontology as Master Discipline of Information Science Barry Smith .

20

Page 21: 1 Ontology as Master Discipline of Information Science Barry Smith .

21

The term ‘ontology’came to be used by information scientists to describe the construction of standardized taxonomies designed to make databases mutually compatible

and thus to make data transportable from one environment to another

Page 22: 1 Ontology as Master Discipline of Information Science Barry Smith .

22

An ‘ontology’is a dictionary of terms formulated in a canonical syntax and with commonly accepted definitions and axioms

designed to yield a shared framework

for use by different information systems communities.

Page 23: 1 Ontology as Master Discipline of Information Science Barry Smith .

23

An ontology

is a concise and unambiguous description of the principal, relevant entities of an application domain and of their potential relations to each other

Page 24: 1 Ontology as Master Discipline of Information Science Barry Smith .

24

SO FAR

SO GOOD

Page 25: 1 Ontology as Master Discipline of Information Science Barry Smith .

25

But how was this idea in fact realized?

How did information systems engineers proceed to build ontologies?

By looking at the world, surely

Well, No

They built ontologies by looking at what people think about the world

Page 26: 1 Ontology as Master Discipline of Information Science Barry Smith .

26

Page 27: 1 Ontology as Master Discipline of Information Science Barry Smith .

27

Quine

Page 28: 1 Ontology as Master Discipline of Information Science Barry Smith .

28

For Quineans

Ontology studies, not reality,

but scientific theories

From ontology

… to ontological commitment

Page 29: 1 Ontology as Master Discipline of Information Science Barry Smith .

29

Quine:

each natural science has its own preferred repertoire of types of objects to the existence of which it is committed

Page 30: 1 Ontology as Master Discipline of Information Science Barry Smith .

30

Quineanism:

ontology is the study of the ontological commitments or presuppositions embodied in the different natural sciences

Page 31: 1 Ontology as Master Discipline of Information Science Barry Smith .

31

Quine:

only natural sciences can be taken ontologically seriously

The way to do ontology is exclusively through the investigation of scientific theories

Page 32: 1 Ontology as Master Discipline of Information Science Barry Smith .

32

Thus it is reasonable to identify ontology

– the search for answers to the question: what exists? –

with the study of the ontological commitments of natural scientists

All natural sciences are compatible with each other

Page 33: 1 Ontology as Master Discipline of Information Science Barry Smith .

33

PROBLEM

The Quinean view of ontology becomes strikingly less defensible

when the ontological commitments of various non-scientists are allowed into the mix

Page 34: 1 Ontology as Master Discipline of Information Science Barry Smith .

34

How, ontologically, are we to treat the commitments of

astrologists,

clairvoyants,

believers in voodoo?

Page 35: 1 Ontology as Master Discipline of Information Science Barry Smith .

35

How, ontologically, are we to treat the commitments of

patients who believe that their illness is caused by evil spirits or magic spells?

Page 36: 1 Ontology as Master Discipline of Information Science Barry Smith .

36

Growth of Quinean ontology outside philosophy:

Psychologists and cognitive anthropologists have sought to elicit the ontological commitments

(‘ontologies’, in the plural)

of different cultures and groups.

Page 37: 1 Ontology as Master Discipline of Information Science Barry Smith .

37

This is not ontology

Not all the things that people believe in are genuine objects of ontological investigation

Only what exists is a genuine object of ontological investigation

Page 38: 1 Ontology as Master Discipline of Information Science Barry Smith .

38

Page 39: 1 Ontology as Master Discipline of Information Science Barry Smith .

39

Why, then,

do information systems ontologists study peoples’ beliefs, thoughts, concepts

rather than the objects themselves?

Page 40: 1 Ontology as Master Discipline of Information Science Barry Smith .

40

Arguments for Ontology as Conceptual Modeling

Ontology is hard.

Life is short.

Let’s do conceptual modeling instead

Page 41: 1 Ontology as Master Discipline of Information Science Barry Smith .

41

programming real ontology into computers is hard

therefore:

we will simplify ontology

and not care about reality at all

Page 42: 1 Ontology as Master Discipline of Information Science Barry Smith .

42

Painting the Emperor´s Palace is

h a r d

Page 43: 1 Ontology as Master Discipline of Information Science Barry Smith .

43

therefore

we will not try to paint the Palace at all

... we will be satisfied instead with a grainy snapshot of some other building

Page 44: 1 Ontology as Master Discipline of Information Science Barry Smith .

44

Page 45: 1 Ontology as Master Discipline of Information Science Barry Smith .

45

Ontological engineers

neglect the standard of truth to reality

in favor of other, putatively more practical, standards:

above all programmability

Page 46: 1 Ontology as Master Discipline of Information Science Barry Smith .

46

They turn to substitutes:

to models,

to conceptualizations

because these are easier to handle

(… they move from messy noumenal reality to neatly packaged “phenomena” …)

Page 47: 1 Ontology as Master Discipline of Information Science Barry Smith .

47

For an information system ontology

there is no reality other than the one created through the system itself,

so that the system is, by definition, correct

Page 48: 1 Ontology as Master Discipline of Information Science Barry Smith .

48

Only those objects exist which are represented in the system

(constructivism)

Page 49: 1 Ontology as Master Discipline of Information Science Barry Smith .

49

Tom Gruber (1995):

‘For AI systems what “exists” is

what can be represented’

Page 50: 1 Ontology as Master Discipline of Information Science Barry Smith .

50

Ontological engineering

concerns itself with conceptualizations

It does not care whether these are true of some independently existing reality.

Page 51: 1 Ontology as Master Discipline of Information Science Barry Smith .

51

In the world of information systems

there are many surrogate world models

and thus many ontologies

Page 52: 1 Ontology as Master Discipline of Information Science Barry Smith .

52

… and all ontologies,

are equalboth good and bad,

Page 53: 1 Ontology as Master Discipline of Information Science Barry Smith .

53

ATTEMPTS TO SOLVE THETOWER OF BABEL PROBLEM

VIA ONTOLOGIES AS“CONCEPTUAL MODELS”

HAVEFAILED

Page 54: 1 Ontology as Master Discipline of Information Science Barry Smith .

54

HALF WAYTHROUGH

Page 55: 1 Ontology as Master Discipline of Information Science Barry Smith .

55

Can Real Ontology do Better?

Test Domain:

Medical Terminology

Page 56: 1 Ontology as Master Discipline of Information Science Barry Smith .

56

Example 1: UMLS

Universal Medical Language System

Taxonomy system maintained by National Library of Medicine in Washington DC

134 semantic types

800,000 concepts

10 million inter-concept relationships

Page 57: 1 Ontology as Master Discipline of Information Science Barry Smith .

57

Example 2: SNOMED

Systematized Nomenclature of Medicine

Taxonomy system maintained by the College of American Pathologists

121,000 concepts

340,000 relationships

Page 58: 1 Ontology as Master Discipline of Information Science Barry Smith .

58

SNOMED

designed to foster interoperability

to serve as a

“common reference point for comparison and aggregation of data throughout the entire healthcare process”

Page 59: 1 Ontology as Master Discipline of Information Science Barry Smith .

59

Problems with UMLS and SNOMED

Each is a fusion of several source vocabularies

They were fused without an ontological system being established first

They contain circularities, taxonomic gaps, unnatural ad hoc determinations

… several billion dollars still being wasted in the making of retrospective fixes

Page 60: 1 Ontology as Master Discipline of Information Science Barry Smith .

60

Example 3: GALEN

Two levels:

ontologically powerful model of clinical information inside the computer

plus

a range of terminological services for clinical tasks involving different coding systems, including natural language

Page 61: 1 Ontology as Master Discipline of Information Science Barry Smith .

61

Problems with GALEN

Ontology is ramshackle and has been subject to repeated fixes

Its unnaturalness makes coding slow and expensive

Coding thus far limited in extent

(surgical processes)

Page 62: 1 Ontology as Master Discipline of Information Science Barry Smith .

62

Blood

Page 63: 1 Ontology as Master Discipline of Information Science Barry Smith .

63

Representation of Blood in UMLS

Blood

Tissue

EntityPhysical Object

Anatomical StructureFully Formed Anatomical Structure

An aggregation of similarly specialized cells and the associated intercellular substance.

Tissues are relatively non-localized in comparison to body parts, organs or organ components

Body SubstanceBody Fluid Soft Tissue

Blood as tissue

Page 64: 1 Ontology as Master Discipline of Information Science Barry Smith .

64

Representation of Blood in SNOMED

Blood

Liquid Substance

Substance categorized by physical state

Body fluid

Body Substance

Substance

Blood as fluid

Page 65: 1 Ontology as Master Discipline of Information Science Barry Smith .

65

Representation of Blood in GALEN

Blood

SoftTissue

DomainCategoryPhenomenon

Blood as SoftTissue with two states:LiquidBlood and CoagulatedBlood

SubstanceTissue

GeneralisedSubstance SubstanceorPhysicalStructure

Page 66: 1 Ontology as Master Discipline of Information Science Barry Smith .

66

Plus attempts at Patients’ Ontology

based on WordNet = online lexical reference system for the English language

Page 67: 1 Ontology as Master Discipline of Information Science Barry Smith .

67

Representation of Blood in WordNet

Blood

Humor

the four fluids in the body whose balance was believed to determine our emotional and physical state

along with phlegm, yellow and black bile

EntityPhysical Object

SubstanceBody Substance

Body Fluid

Blood as humor

Page 68: 1 Ontology as Master Discipline of Information Science Barry Smith .

68

So what is the ontology of blood?

Page 69: 1 Ontology as Master Discipline of Information Science Barry Smith .

69

We cannot solve this problem just by looking at concepts

Page 70: 1 Ontology as Master Discipline of Information Science Barry Smith .

70

concept systems may be simply incommensurable

Page 71: 1 Ontology as Master Discipline of Information Science Barry Smith .

71

the problem can only be solved

by taking the world itself into account

Page 72: 1 Ontology as Master Discipline of Information Science Barry Smith .

72

This implies a view of ontology

not as a theory of concepts

but as a theory of reality

But how is this possible?

How can we get beyond our concepts?

answer: ontology must be maximally opportunistic

Page 73: 1 Ontology as Master Discipline of Information Science Barry Smith .

73

Maximally opportunistic

means:

look at concepts and beliefs critically

and always in the context of a wider view which includes independent ways to access the objects themselves

at different levels of granularity

Page 74: 1 Ontology as Master Discipline of Information Science Barry Smith .

74

Ontology must be maximally opportunistic

This means:

don’t just look at beliefs

look at the objects themselves

from every possible direction,

formal and informal

scientific and non-scientific …

Page 75: 1 Ontology as Master Discipline of Information Science Barry Smith .

75

Maximally opportunistic

means:

look at the same objects at different levels of granularity:

Page 76: 1 Ontology as Master Discipline of Information Science Barry Smith .

76

“skritek”

objects are in the worldnot all concepts correspond to objects

not all concepts are relevant to ontology

concepts are in the head

Page 77: 1 Ontology as Master Discipline of Information Science Barry Smith .

77

problem of ‘merging’ ontologies

“skritek”

“blaznivy”

Page 78: 1 Ontology as Master Discipline of Information Science Barry Smith .

78

How to solve the Tower of Babel Problem?

How to fuse these mutually incompatible ‘conceptual models’ of blood ?

By drawing on the results of philosophical work in ontology carried out over the last 2000 years

Page 79: 1 Ontology as Master Discipline of Information Science Barry Smith .

79

First Step:

A good medical domain ontology

presupposes a good formal or top-level ontology

Page 80: 1 Ontology as Master Discipline of Information Science Barry Smith .

80

Formal

part

hole

connected (spatially, causally)

substance

system

state

Page 81: 1 Ontology as Master Discipline of Information Science Barry Smith .

81

Material

organism

tissue

symptom

circulatory system

organ

is the nose an organ?

is the circulatory system an organ

Page 82: 1 Ontology as Master Discipline of Information Science Barry Smith .

82

Second step: select out the good conceptualizations

these have a reasonable chance of being integrated together into a single ontological system

• based on tested principles

• robust

• conform to natural science

Page 83: 1 Ontology as Master Discipline of Information Science Barry Smith .

83

IFOMIS

Institute for Formal Ontology and Medical Information Science

University of Leipzig

Page 84: 1 Ontology as Master Discipline of Information Science Barry Smith .

84

PARTNERS

Ontology Group, LADSEB-CNR, Padua/Trento

Institute of Cognitive Sciences and Technologies (ISTC-CNR), Rome

ONTEK Corporation

Language and Computing, Belgium www.landc.be

Page 85: 1 Ontology as Master Discipline of Information Science Barry Smith .

85

Strategy: A Network of Domain Ontologies

Material (Regional) Ontologies

Basic Formal Ontology (BFO)

Page 86: 1 Ontology as Master Discipline of Information Science Barry Smith .

86

A Network of Domain Ontologies

BFO

Page 87: 1 Ontology as Master Discipline of Information Science Barry Smith .

87

A Network of Domain Ontologies

B(Chem)O

BFO

Page 88: 1 Ontology as Master Discipline of Information Science Barry Smith .

88

A Network of Domain Ontologies

B(Med)O

B(Chem)O

BFO

Page 89: 1 Ontology as Master Discipline of Information Science Barry Smith .

89

A Network of Domain Ontologies

B(Cell)O

B(Med)O

B(Chem)O

BFO

Page 90: 1 Ontology as Master Discipline of Information Science Barry Smith .

90

A Network of Domain Ontologies

B(Gen)O B(Cell)O

B(Med)O

B(Chem)O

BFO

Page 91: 1 Ontology as Master Discipline of Information Science Barry Smith .

91

A Network of Domain Ontologies

B(Epidem)O

B(Gen)O B(Cell)O

B(Med)O

B(Chem)O

BFO

Page 92: 1 Ontology as Master Discipline of Information Science Barry Smith .

92

Ontology

like cartography

must work with maps at different scales and with maps picking out different dimensions of invariants

Page 93: 1 Ontology as Master Discipline of Information Science Barry Smith .

93

Page 94: 1 Ontology as Master Discipline of Information Science Barry Smith .

94

Thus ontology needs

Page 95: 1 Ontology as Master Discipline of Information Science Barry Smith .

95

 

Ontological ZoomingOntological Zooming

Page 96: 1 Ontology as Master Discipline of Information Science Barry Smith .

96

Universe/Periodic Table

animal

bird

canary

ostrich

fishontology of

biological species

ontology of DNA space

Page 97: 1 Ontology as Master Discipline of Information Science Barry Smith .

97

Universe/Periodic Table

animal

bird

canary

ostrich

fish

both are transparent partitions of one and the same reality

Page 98: 1 Ontology as Master Discipline of Information Science Barry Smith .

98

There are many compatible map-like partitions

many maps at different scales,

all transparent to the reality beyond

the mistake arises when one supposes

that only one of these partitions is a true map of what exists

Page 99: 1 Ontology as Master Discipline of Information Science Barry Smith .

99

Medical ontologies

at different levels of granularity:

cell ontology

drug ontology *

protein ontology

gene ontology *

anatomical ontology *

epidemiological ontology

Page 100: 1 Ontology as Master Discipline of Information Science Barry Smith .

100

Medical ontologies

disease ontology

therapy ontology

pathology ontology *

and also

physician’s ontology

patient’s ontology

Page 101: 1 Ontology as Master Discipline of Information Science Barry Smith .

101

Medical ontologies

and even

hospital management (billing) ontology *

* = already exists (but in a variety of mutually incompatible forms)

Page 102: 1 Ontology as Master Discipline of Information Science Barry Smith .

102

Partitions should be cuts through reality

a good medical ontology should NOT be compatible with the conceptualization of disease as:

caused by evil spirits and demons and cured by skritek

Page 103: 1 Ontology as Master Discipline of Information Science Barry Smith .

103

IFOMIS TestsUniform top-level ontology for medicine

applicable at distinct granularities

Test-case development of partial medical domain ontologies applied to:

• Standardization of clinical trial protocols

• Clinical trial Merkmal-dictionary

• Processing of unstructured patient records (www.landc.be)

Page 104: 1 Ontology as Master Discipline of Information Science Barry Smith .

104

Uniform top-level ontology for medicine

ONE YEAR

Applicable at distinct granularities (e.g. gene ontology)

FOUR YEARS

Standardization of clinical trial protocol

TWO YEARS

Clinical trial Merkmal-dictionary

TWO YEARS

Processing of unstructured patient records (www.landc.be)

THREE YEARS

Page 105: 1 Ontology as Master Discipline of Information Science Barry Smith .

105

Uniform top-level ontology for medicine

NO COMPETITOR

applicable at distinct granularities

NO COMPETITOR

Standardization of clinical trial protocol

NO SERIOUS COMPETITOR

Clinical trial Merkmal-dictionary

NO COMPETITOR

Processing of unstructured patient records

MANY COMPETITORS, BUT GOOD MEASURES OF EFFECTIVENESS

Page 106: 1 Ontology as Master Discipline of Information Science Barry Smith .

106

The End