Topic Maps: What Works and What Doesn’t? 31 October 2007 A304 - 2:45-3:30 PM PDT Presented by Jay...

Post on 14-Dec-2015

216 views 2 download

Tags:

Transcript of Topic Maps: What Works and What Doesn’t? 31 October 2007 A304 - 2:45-3:30 PM PDT Presented by Jay...

Topic Maps: What Works and What Doesn’t?

31 October 2007

A304 - 2:45-3:30 PM PDT

Presented by Jay Ven Eman, Ph.D., CEO

Access Innovations, Inc. / Data Harmony

505.998.0800 / www.accessinn.com / www.dataharmony.com

j_ven_eman@accessinn.com

Copyright 2007 Access Innovations, Inc.

New Technologies Meta data W3C

OWL SKOS

Topic Maps

Copyright 2007 Access Innovations, Inc.

Meta data What is it in this context? How does it work in a semantic

environment?

Copyright 2007 Access Innovations, Inc.

“Is MLB a sport, entertainment, or business?”

Copyright 2007 Access Innovations, Inc.

Semantic Web?“Is MLB a sport, entertainment, or business?”

About

October 31, 2007

Professional baseball

Entertainment

Business

By Smith

Story Arial

Summary In brief ...

1.98

Copyright 2007 Access Innovations, Inc.

1.98? Price? Price of what?

Newspaper? Stadium seat? Article?

$, , Ÿ, £? Wholesale? Retail? Sale? How?

?

Copyright 2007 Access Innovations, Inc.

“Meaning” starts with a knowledge organization system (KOS)

Uncontrolled list Name authority file Synonym set/ring Controlled vocabulary Taxonomy Thesaurus

Not complex - $

Highly complex - $$$$

LOTS OF OVERLAP!

Topic MapOntologySKOS

Copyright 2007 Access Innovations, Inc.

Meta Data - the “Meaning Markers” Data about data Information about information Included Added

Copyright 2007 Access Innovations, Inc.

Data about ‘stuff’ - like what? Author name Date of creation Language used in the creation Title of the creation Subject of the creation Keywords...

Copyright 2007 Access Innovations, Inc.

Narrowing the focus Keywords (AKA subject headings, index

terms, identifiers, etc.) are one type of meta data.

Copyright 2007 Access Innovations, Inc.

For example... A bibliographic database record usually

includes information such as author, title, language, date of creation, and subject area.

So does a traditional library card catalog

Copyright 2007 Access Innovations, Inc.

But did you think about… The legend on a street map? The yellow pages in a telephone book? The aisle signs in a supermarket?

Copyright 2007 Access Innovations, Inc.

Meaning of meta data Meta data is information

that ‘points’ to a explanation or a resolution

Meta data makes statements about an information resource or object

Copyright 2007 Access Innovations, Inc.

Sidebar - meta data or metadata? ‘Metadata’ is “a word coined by Jack E.

Myers to represent current and future lines of products implementing the concepts of his MetaModel, and also to designate his company, The Metadata Company, that would develop and market those products.”

Copyright 2007 Access Innovations, Inc.

Metadata

A term not used prior to 1969 Used first in 1973 Registered U.S. Trademark (in 1986),

owned by Jack Myers Metadata granted incontestable status

in 1991 Designed to be a term with no particular

meaning

Meta Data

“Is MLB a sport, entertainment, or business?”<TI> </TI>

<ST>

<ST>

<ST>

</ST>

</ST>

</ST>

<DOC Date=10/31/07>

</DOC>

Professional baseball

Entertainment

Business

<Byline> Smith </Byline>

<Text> There was a time ...</Text>

<AB> In brief ... </AB>

Included Added

Object

Copyright 2007 Access Innovations, Inc.

Meta data as indexing language

List of words Synonyms Taxonomy Thesaurus

INCREASING COMPLEXITY / RICHNESS

Ambiguity control Ambiguity control Ambiguity cont’l Synonym control Synonym control Synonym cont’l

Hierarchical rel’s Hierarchical rel’s Associative rel’s

Copyright 2007 Access Innovations, Inc.

Aka subject term, heading, node, category, descriptor, class

Taxonomy / thesaurus Main Term (MT) Top Term (TT) Broader Terms (BT) Narrower Terms (NT) Related Terms (RT)

See also (SA) Scope Note (SN) History (H) NonPreferred Term (NP)

Used for (UF), See (S)

TAXONOMY

THESAURUS

Term record

Various views

Copyright 2007 Access Innovations, Inc.

New Frontiers from the World Wide Web

Consortium:OWL & SKOS

Term record

Various views

The old frontier?

Copyright 2007 Access Innovations, Inc.

Taxonomy, Thesaurus, & Ontology

Taxonomies and thesauri are not ontologies They are entities Ontology – science of describing kinds of

entities “an explicit and formal specification of a

conceptualization”

Copyright 2007 Access Innovations, Inc.

Ontology

From philosophy – the science of

describing Kinds of entities in the world

How they are related

Copyright 2007 Access Innovations, Inc.

OWL Web Ontology Language

W3C Recommendation 10 February 2004

http://www.w3.org/TR/2004/Rec-owl-guide-20040210/

http://www.w3.org/TR/2004/Rec-owl-ref-20040210/

http://www.w3.org/TR/2004/Rec-webont-req-20040210/

Copyright 2007 Access Innovations, Inc.

Copyright 2007 Access Innovations, Inc.

Taxonomic classification

Kingdom: Animalia

Phylum: Chordata

Class: Aves

Order: Strigiformes

Families: Strigidae

Tytonidae

Copyright 2007 Access Innovations, Inc.Spotted Owl

Copyright 2007 Access Innovations, Inc.

Web Ontology language - OWL

OWL output Provides semantic meaning to these kinds of

entities

Web resource

Accessible to automated processes

Copyright 2007 Access Innovations, Inc.

OWL “…is intended to provide a language that

can be used to describe the classes and relations between them

that are inherent in Web documents and applications.”

Copyright 2007 Access Innovations, Inc.

OWL Formalize a domain by defining

Classes Properties of those classes

Define individuals Assert properties about them

Reason about these Classes and Individuals

Copyright 2007 Access Innovations, Inc.

OWL Ontology May include

1. Classes2. Properties3. Instances

Capture semantics Multiple, distributed, related ontology schema Normative OWL exchange syntax RDF/XML

Resource Description Framework/Extensible Markup Language

Topic

SKOS

Copyright 2007 Access Innovations, Inc.

Structure of controlled vocabularies

List of words Synonyms Taxonomy Thesaurus

INCREASING COMPLEXITY / RICHNESS

Ambiguity control Ambiguity control Ambiguity cont’l Synonym control Synonym control Synonym cont’l

Hierarchical rel’s Hierarchical rel’s Associative rel’s

Copyright 2007 Access Innovations, Inc.

Hierarchical View

Term

Copyright 2007 Access Innovations, Inc.

<TermInfo> <T>Agrotechnology</T> <BT>Biotechnology</BT> <NT>Animal management technologies</NT> <NT>Controlled environment agriculture</NT> <NT>Genetically modified crops</NT> </TermInfo> Source: www.DataHarmony.com

Taxonomy term record

Copyright 2007 Access Innovations, Inc.

<TermInfo> <T>Agrotechnology</T> <BT>Biotechnology</BT> <NT>Animal management technologies</NT> <NT>Controlled environment agriculture</NT> <NT>Genetically modified crops</NT> <RT>Agricultural science</RT> <RT>Food technology</RT> <UF>Plant engineering</UF> <Scope></Scope> <Editorial_Note></Editorial_Note> <Facet></Facet> <History></History> </TermInfo> Source: www.DataHarmony.com

Thesaurus term record

Copyright 2007 Access Innovations, Inc.

<PreferredTerm rdf:ID="T131"><rdfs:label xml:lang="en">Agrotechnology</rdfs:label><BroaderTerm rdf:resource="#T603" newsindexer:alpha="Biotechnology"/><NarrowerTerm rdf:resource="#T252" newsindexer:alpha="Animal

management technologies"/><NarrowerTerm rdf:resource="#T1221" newsindexer:alpha="Controlled

environment agriculture"/>

<NarrowerTerm rdf:resource="#T2166" newsindexer:alpha="Geneticallymodified crops"/>

<Related_Term rdf:resource="#T127" newsindexer:alpha="Agriculturalscience"/>

<Related_Term rdf:resource="#T2020" newsindexer:alpha="Food technology"/>

<Non-Preferred_Term rdf:resource="#T3898" newsindexer:alpha="Plantengineering"/>

</PreferredTerm> Source: www.DataHarmony.com

OWL term record

Copyright 2007 Access Innovations, Inc.

SKOS Simple Knowledge Organization System SKOS Core Guide

W3C Working Draft 2 November 2005 http://www.w3.org/TR/2005/WD-swbp-skos-core-guide-

20051102/

SKOS Core Vocabulary Specification W3C Working Draft 2 November 2005 http://www.w3.org/TR/2005/WD-swbp-skos-core-spec-

20051102/

Copyright 2007 Access Innovations, Inc.

SKOS May include

1. Classes (RDFS)2. Properties (RDF)3. Instances??

Express structure and content of concept schemes Multiple, distributed, related SKOS schemes Normative SKOS exchange syntax RDF/XML

Resource Description Framework/Extensible Markup Language

OWL

Copyright 2007 Access Innovations, Inc.

SKOS Specifically for “concept schemes”

Thesauri Classification schemes Subject headings lists Taxonomies Terminologies Glossaries And other types of controlled vocabularies

Copyright 2007 Access Innovations, Inc.

SKOS Models concept schemes

A set of concepts OPTIONALLY includes statements about

semantic relationships between concepts Directionality implied - interpretations -

(‘skos:Concept’ and properties) Not people, organizations, places, etc.

Copyright 2007 Access Innovations, Inc.

Source:

Copyright 2007 Access Innovations, Inc.

DH SKOS Output<skos:Concept rdf:about="#T1">

<skos:prefLabel>Agriculture</skos:prefLabel><skos:altLabel>Agribusiness</skos:altLabel><skos:altLabel>Agronomy</skos:altLabel><skos:altLabel>Farming</skos:altLabel><status>Accepted</status>

</skos:Concept>

Copyright 2007 Access Innovations, Inc.

DH SKOS Output<skos:Concept rdf:about="#T2">

<skos:prefLabel>American music</skos:prefLabel><skos:broader rdf:resource="#T66" local:alpha="Music styles"/><skos:related rdf:resource="#T27" local:alpha="Country and western music"/><skos:related rdf:resource="#T51" local:alpha="Jazz music"/><skos:related rdf:resource="#T99" local:alpha="Rhythm and blues music"/><skos:related rdf:resource="#T101" local:alpha="Rock music"/><status>Accepted</status>

</skos:Concept>

Copyright 2007 Access Innovations, Inc.

DH SKOS Output<skos:Concept rdf:about="#T3">

<skos:prefLabel>Architecture</skos:prefLabel><skos:broader rdf:resource="#T113" local:alpha="Visual and performing arts"/><skos:scopeNote>Refers to the art and practice of designing and building structures</skos:scopeNote><status>Accepted</status>

</skos:Concept><skos:Concept rdf:about="#T4">

<skos:prefLabel>Band music</skos:prefLabel><skos:broader rdf:resource="#T49" local:alpha="Instrumental music"/><skos:related rdf:resource="#T5" local:alpha="Bands (Music)"/><status>Accepted</status>

</skos:Concept>

A Brief Discussion of Topic Maps

Copyright 2007 Access Innovations, Inc.

Statements about what?

Baseball

Amateur baseball

Little league

Professional baseball

Sports

MLB

“Is MLB a sport, entertainment, or business?”

Topic Maps ISO standard - ISO 13250:2002 For merging back-of-the-book indexes Collection of structured markup Describing KOS Associating KOS with information

resources (objects) Separation of KOS from objects

Topic Maps Three main concepts

1. Names of things2. Occurrences of the named things3. Associations between names

Three additional constructs1. Identity2. Facet3. Scope

OWL

Topic with occurrence

“Is MLB a sport, entertainment, or business?”

Professional baseball

http://www.newindexer.com/mlb.htm/

descriptor-for

Topic map layer

Information resources layer

Topics, associations, occurrences

Professional baseball

Baseball

Sports

member-of

member-of

MLB

use-for http://www.newindexer.com/mlb.htm/

doc-type

Amateur baseball

Little leaguemember-of

descriptor-for

Professional athletes

related-to

Smith

author-of

member-of

http://www.swaa.org

article

Problems with Semantic Web Complexity Lack of tools Lack of skills Limited resources Gaming the system The syllogism trap

KOS biases Lack of agreement Lack of interest Good enough Topic Maps vs. OWL

Lack of agreement “Symbionese Liberation Army credited with

offing an SUV” About - ‘revolutionaries’ or ‘freedom fighters’ About - ‘revolutions’ or ‘freedom movements’

“Symbionese Liberation Army accused of firebombing SUV” About - ‘terrorists’ or ‘anarchists’ About - ‘terrorism’ or ‘anarchy’

The syllogism trap Humans are mortal Greeks are human Therefore, Greeks are mortal

New Mexicans speak Spanish The author lives in New Mexico Therefore, ...

Source: Clay Shirky, “The Semantic Web, Syllogism, and Worldview”www.shirky.com/writings/semantic_syllogism.html/ andDave McComb, presentation at DAMA-I, May 2005 www.wilshireconferences.com

The syllogism humor trap I am a nobody Nobody is perfect Therefore, I am perfect

Bonus:I don't approve of political jokes.

I've seen too many of them get elected.

Topic Maps vs. OWL TMCL Topic maps XTM, HyTM, LTM ISO

OWL RDF Schema RDF RDF/XML, N3 SOAP, WSDL W3C

Copyright 2007 Access Innovations, Inc.

Full-text search and applied indexing languages Full-text search engines - getting better?? Thesauri applied using machine

automated indexing - easier, faster, cheaper

Taxonomic navigation Faceted navigation Table of contents drilldown - taxonomy views

Query disambiguation

Copyright 2007 Access Innovations, Inc.

Full-text search and applied indexing languages Long history Many richly developed thesauri with legs Tools that work Large body of professionals Almost as rich

Tools that work!

Hierarchical View

Term Record

Almost as rich

ANSI/NISO Z39.19-200x

Clearer disambiguation?

Mercury

Planets

Roman god

Metallic element

Temperature

Automobile

TypeOf

BrandOf

IsA

IsA

IsA

Clearer disambiguation? Thesaurus statement

Mercury (planet) mercury (metal) Mercury (automobile) Mercury (mythical being) mercury (temperature)

Clearer disambiguation?

OWL statement<PreferredTerm rdf:ID="T3195">

<rdfs:label xml:lang="en">Mercury (Planets)</rdfs:label>

<BroaderTerm rdf:resource="#T3896" newsindexer:alpha="Planets"/>

</PreferredTerm>

Thesaurus to SKOS Thesaurus label

Main Term (MT) Top Term (TT) Broader Terms (BT) Narrower Terms (NT) Narrower Term Instance Related Terms (RT)

See also (SA) NonPreferred Term (NP)

Used for (UF), See (S) Scope Note (SN) History (H)

SKOS Label

<skos:Concept rdf:about=”numeric"> <skos:hasTopConcept

rdf:resource=”numeric" local:alpha=”TopTerm"/>

<skos:broader rdf:resource=”numeric" local:alpha=”BroaderTerm"/>

<skos:Narrower rdf:resource=”numeric" local:alpha=”NarrowerTerm"/>

<skos:related rdf:resource=”numeric" local:alpha=”RelatedTerm"/>

<skos:altLabel>NonpreferredTerm</skos:altLabel>

<rdf:Property rdf:ID=”ScopeNote"> <rdf:Property rdf:ID=”History">

Thesaurus to Ontology (OWL) Thesaurus Label

Main Term (MT) Top Term (TT) Broader Terms (BT) Narrower Terms (NT) Narrower Term Instance Related Terms (RT)

See also (SA) NonPreferred Term (NP)

Used for (UF), See (S) Scope Note (SN) History (H)

OWL Label

<PreferredTerm rdf:ID=”numeric"> <TopTerm rdf:ID=“numeric”> <BroaderTerm rdf:resource=”numeric"

newsindexer:alpha=”BroaderTerm"/> <NarrowerTerm rdf:resource=”numeric"

newsindexer:alpha=”NarrowerTerm"/> <Related_Term rdf:resource=“numeric"

newsindexer:alpha=”RelatedTerm"/> <Non-Preferred_Term

rdf:resource=”numeric" newsindexer:alpha=”Non-preferredTerm"/>

<owl:DatatypeProperty rdf:ID="Scope_Note">

<owl:DatatypeProperty rdf:ID=”History">

Copyright 2007 Access Innovations, Inc.

Objectives for search & navigation ASIS&T -- virtual library

Subject matter ASRT -- internal information control

Organization chart Naval Postgrad -- Homeland security degree

Curriculum outline SLA -- Web content

Public Web navigation

Naval Postgraduate School’s Homeland Security Taxonomy

Naval Postgraduate School’s Homeland Security Taxonomy

SLA website and thesaurus

SLA search

Copyright 2007 Access Innovations, Inc.

Myth of topic maps And OWL, SKOS Not a myth They do work Limited adoption Narrow, tightly defined niches

Topic Maps: What Works and What Doesn’t?

31 October 2007

A304 - 2:45-3:30 PM PDT

Presented by Jay Ven Eman, Ph.D., CEO

Access Innovations, Inc. / Data Harmony

505.998.0800 / www.accessinn.com / www.dataharmony.com

j_ven_eman@accessinn.com

Thank you. Questions?

Copyright 2007 Access Innovations, Inc.

Activity in the field Ontologies

http://www.w3.org/2001/sw/WebOnt/impls SKOS

http://www.w3.org/TR/swbp-skos-core-guide/#secref

Topic Maps http://www.topicmaps.org/

Copyright 2007 Access Innovations, Inc.

Resources www.accessinn.com www.dataharmony.com www.iso.org www.ontopia.com

Lars Marius Garshol, “Metadata? Thesaurui? Taxonomies? Topic Maps!”

Steve Pepper, “The TAO of Topic Maps” www.topicmaps.org

Copyright 2007 Access Innovations, Inc.

Resources Cory Doctorow, “Metacrap: Putting the Torch to Seven Straw-men

of the Meta-utopia,” http://www.well.com/~doctorow/metacrap.htm Russell Glass, “Is Anyone Going to Tag all of this Stuff?,”

http://zoominfo.blogs.com/soughtafter/2005/03/semantic_web_is.html Clay Shirky, “The Semantic Web, Syllogism, and Worldview,”

www.shirky.com/writings/semantic_sllogism.html Pete Norvig, “Semantic Web Ontologies: What Works and What

Doesn’t,” www.alwayson-network.com/comments.php?id=P7480_0_3_0_C