FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive...

27
FAO of the UN Library and Documentatio n Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies: Creating an ontology on Food Safety, Animal and Plant Health (OFsAPH) Boris Lauser Tanja Wildemann,Allison Poulos Frehiwot Fisseha, Johannes Keizer, Stephen Katz DC 2002: Florence 16 th October 2002

Transcript of FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive...

Page 1: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence

October 02

A Comprehensive Framework for Building Multilingual Domain Ontologies:

Creating an ontology on Food Safety, Animal and Plant Health

(OFsAPH)

Boris LauserTanja Wildemann,Allison Poulos

Frehiwot Fisseha, Johannes Keizer, Stephen Katz

DC 2002: Florence 16th October 2002

Page 2: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Agenda• Introduction:

– Motivation– Ontologies and modeling approach

• Framework for ontology creation• Application of framework:

– Creation of the Food Safety Ontology prototype

• Outlook:– Current project status– Application scenario

• Questions

Introduction

Framework

Application

Outlook

Discussion

Page 3: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Motivation

• Rapid growth in electronically available information

• Badly performing search tools

• Pages often inconsistently indexed

BUT

Introduction

Framework

Application

Outlook

Discussion

Page 4: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Example: Full text search

Who established the Agreement of Agriculture

Search

……Agreement…

……

…Agriculture……

Internationalorganization

standards

WTOWHO

FAO

Agreement ofAgriculture

establish

...

Background knowledge

….…

WTO established the Agreement of Agriculture

in ….…

Specified search

Introduction

Framework

Application

Outlook

Discussion

Page 5: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Example: Indexed Search

Who established the Agreement of Agriculture

Search

Internationalorganization

standards

WTOWHO

FAO

Agreement ofAgriculture

establish

...

Background knowledge

AgriculturalAgreement

Synonym

….…

WTO established the Agreement of Agriculture

in ….…

DocumentIndexed with“AgriculturalAgreement”

High Chance

Of retrieval

Low Chance

Of retrieval

Introduction

Framework

Application

Outlook

Discussion

Page 6: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Ontology as form of background knowledge

• Goal: To create an explicit, formal specification of a shared conceptualization of a domain of interest

Ontology

Introduction

Framework

Application

Outlook

Discussion

Page 7: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Ontology: conceptual model

Concept

label

synonym

synonym

synonym

stem

description

Concept

relationship

Introduction

Framework

Application

Outlook

Discussion

Page 8: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Ontology: RDFS modelIntroduction

Framework

Application

Outlook

Discussion

Page 9: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Introduction

Framework

Application

Outlook

Discussion

Agenda• Introduction:

– Motivation– Ontologies and modeling approach

• Framework for ontology creation• Application of framework:

– Creation of the Food Safety Ontology prototype

• Outlook:– Current project status– Application scenario

• Questions

Page 10: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

The framework

A comprehensive frameworkfor building a domain ontology

Focus:Acquisition and Development stepIn the lifecycle of ontology creation

Introduction

Framework

Application

Outlook

Discussion

Page 11: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

The framework: processes

• Ontology acquisition (2 paths)– Creating core ontology from scratch– Automatic extraction of ontological knowledge

from base vocabulary and domain specific text sources

• Merging into one ontology• Refinement and Extension• Evaluation and Assessment

Introduction

Framework

Application

Outlook

Discussion

Page 12: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Coreontology

Manualcreation

FocusedWeb crawling

List of domainstart web

pages

List offrequent

terms

List of domainSpecific

documents

Term BT t1 NT t2 RT t3Term USE t3…

Thesaurus

RDFS ontologymodel

convert Ontology pruning and learning

algorithm

Domaincorpus

Genericcorpus

Prunedontology

List of frequent

terms

Tool supported

steps

Mergingof

ontologies

Refinementand

Extension

Evaluation

Selectionof documents

Manual creationof core ontology

1st acquisitionapproach

2nd acquisitionapproach

Text To Onto

The Framework: overview

Introduction

Framework

Application

Outlook

Discussion

Page 13: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Introduction

Framework

Application

Outlook

Discussion

Agenda• Introduction:

– Motivation– Ontologies and modeling approach

• Framework for ontology creation• Application of framework:

– Creation of the Food Safety Ontology prototype

• Outlook:– Current project status– Application scenario

• Questions

Page 14: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Creation of the core ontology

67 concepts91 relationships

Information Resources:•Brainstorming•Codex Alimentarius•SPS Agreement

Core Ontology

Ontology Editor(SOEP)

3 subject specialists

Introduction

Framework

Application

Outlook

Discussion

Page 15: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

1st Acquisition Approach:

Focused Crawling

Focused Web Crawling

68 concepts91 relationships

Core Ontology

List of extracted main sites:http://www.foodsafety.gov/ Gateway to Government Food Safety Information

http://vm.cfsan.fda.gov/ Center for Food Safety & Applied Nutrition

http://www.inspection.gc.ca/ Canadian Food Inspection Agency

http://www.extension.iastate.edu/foodsafety/ Iowa State University - Food Safety Project

http://www.foodsafety.iastate.edu Iowa State University - Food Safety Consortium

http://www.fsis.usda.gov/ United States Department of Agriculture, Food Safety and Inspection Service

http://www.nal.usda.gov/foodborne/index.html Foodborne Ilness Education Information Center

http://www.euro.who.int/foodsafety World Health Organization – Regional Office for Europe Food Safety Programme

List of 257 food Safety domainweb pages

Grouping into Main sites

Introduction

Framework

Application

Outlook

Discussion

Page 16: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Selection of Documents• Domain Set: Manual selection

– 11 documents• Codex Alimentarius: Description, Code of Ethics, Food Hygiene, Food Import and Export• Report of consultation on risk assessment of microbiological hazards in foods• Ensuring food quality and safety, Protecting food quality and safety

• Domain Set: Focused Crawler Output– 5 documents extracted:

• http://vm.cfsan.fda.gov/; http://www.inspection.gc.ca/; http://www.foodsafety.iastate.edu; http://www.extension.iastate.edu/foodsafety/; http://www.euro.who.int/foodsafety

• Generic documents: Manual Selection– 8 documents

• www.nytimes.com• Several documents of the animal feed domain

Introduction

Framework

Application

Outlook

Discussion

Page 17: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

2nd Acquisition Approach:

Thesaurus Pruning

Food SafetyDocuments

GenericDocuments

Rice BT … NT … RT … RT … RT … …

AGROVOC27365 keywords

Automatic Pruning

Extracted ontological structure:# of concepts: 504taxonomic depth: 5

5 evaluation runs

1632 frequent terms

Introduction

Framework

Application

Outlook

Discussion

Page 18: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Merging of Ontologies and Refinement

1632 Terms from pruning process 12 new concepts

extracted

Ontologicalstructureextracted from AGROVOC

23 new conceptsWith hierarchicalrelationships extracted

67 concepts91 relationships

Core Ontology

Assemblystep

92 new relationshipscreated

Food Safety OntologyPrototype

102 concepts183 relationships

Introduction

Framework

Application

Outlook

Discussion

Page 19: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Final Prototype

Food Safety OntologyPrototype

102 concepts183 relationships

1.79 relationshipsconcept

Core Ontology67 concepts

91 relationships

relationshipsconcept1.36

Introduction

Framework

Application

Outlook

Discussion

Page 20: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Evaluation: Food Safety ontology web browser

• Open to users and subject specialists for evaluation

• http://localhost:8080/faoportal/dispatcher

Introduction

Framework

Application

Outlook

Discussion

Page 21: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

AgendaIntroduction

Framework

Application

Outlook

Discussion

• Introduction:– Motivation– Ontologies and modeling approach

• Framework for ontology creation• Application of framework:

– Creation of the Food Safety Ontology prototype

• Outlook:– Current project status– Application scenario

• Questions

Page 22: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Current project statusOntology creation: 2nd application of framework

Introduction

Framework

Application

Outlook

Discussion

Food Safety OntologyPrototype

102 concepts183 relationships

Text To Onto ~100 domain

Specificdocuments

AGROVOC

Revised OntologyPruner

List offrequent

terms

Pruned Agrovoc: ~3000 concepts

Ontology Editor(OIModeler)

Merging &Refinement

1st acquisitionapproach

2nd acquisitionapproach

Page 23: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Usage Scenario

Search:

Risk assessment

Biosecurity Portal:

OntologyEnabled Search

Application

Ontology based search extension

Risk characterization

Hazard characterization

Hazard identification

Exposure assessment

Risk assessment

Risk management

Risk communication Risk analysis

Is aStepIn theprocess

Is aStepIn theprocess

Extended Search

Mark the terms below, which you might want to include in your search:

Interactswith

Risk assessment Risk characterization Risk analysisSearch:

Ontology

Doc baseSearchresults

Introduction

Framework

Application

Outlook

Discussion

Page 24: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Current project statusApplication scenario: 2 use cases

Use Case 1: Indexing the subject of a document

Use Case 2: Searching information on the portal

Risk;…Subject

Title

OFsAPH

Risk;…Search…

Page 25: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

Current project statusApplication: Ontology Browser for the

Ontology on Food Safety,Animal and Plant Health

http://localhost:8080/faoportal/dispatcher

Page 26: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

http://kaon.semanticweb.org

Enabler: KAON Tool Suite

Open Source!

Java based!

Highly portable!

Page 27: FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies:

FAO of the UN

Library and Documentation

Systems Division

DC 2002Florence, Italy

October 02

AgendaIntroduction

Framework

Application

Outlook

Discussion

• Introduction:– Motivation– Ontologies and modeling approach

• Framework for ontology creation• Application of framework:

– Creation of the Food Safety Ontology prototype

• Outlook:– Current project status– Application scenario

• Questions