The Semantic Web Presented by Zhimin Chen

The Semantic Web

Presented by Zhimin Chen

HTML Is Human Readable, Not Machine Understandable

[Figure: a course schedule web page]

Consequence: agents can’t effectively process information on the web automatically

XML May Help, But Can Only Help

XML fragment for the course schedule:

    …
    <course-offered>
      <catalog> 92809 </catalog>
      <course>
        <number> 500 </number>
        <session> 201 </session>
        <name> Algorithm Design </name>
      </course>
      <room> cisr 104 </room>
      <instructor> Evans </instructor>
    </course-offered>

XML May Help, But Can Only Help (Cont’d)

Tags in XML carry no semantics:

    <course>
      <ID> 500 </ID>
      <Session> 201 </Session>
      <Name> Algorithm Design </Name>
    </course>

is no more meaningful than

    <H1>
      <H2> 500 </H2>
      <H3> 201 </H3>
      <H4> Algorithm Design </H4>
    </H1>
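A minimal sketch of the point above, using Python's standard `xml.etree.ElementTree`: once tag names are erased, the course fragment and the heading fragment have exactly the same structure and text, so a program that does not already know what the tags mean has nothing to tell them apart. The helper name `shape` is an assumption made for this illustration.

```python
import xml.etree.ElementTree as ET

COURSE = ("<course><ID> 500 </ID><Session> 201 </Session>"
          "<Name> Algorithm Design </Name></course>")
HEADINGS = ("<H1><H2> 500 </H2><H3> 201 </H3>"
            "<H4> Algorithm Design </H4></H1>")

def shape(element):
    """Return the nesting structure and text of an element, ignoring tag names."""
    text = (element.text or "").strip()
    return (text, [shape(child) for child in element])

course_shape = shape(ET.fromstring(COURSE))
headings_shape = shape(ET.fromstring(HEADINGS))

# Apart from the tag names themselves, the two fragments are indistinguishable.
same_shape = course_shape == headings_shape
print(same_shape)
```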

Expressing Meanings Of Tags

- Semantic network: a graph composed of
  - Two kinds of nodes
    - Taxonomic categories or properties (labeled by relation constants)
    - Objects in the domain (labeled by object constants)
  - Three kinds of arcs
    - IS-A arc
    - Set membership arc
    - Function arc
- Meanings of tags are expressed as taxonomic concepts or properties
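One way to picture such a network is as three sets of arcs over node labels. The sketch below is only an illustration: the class name `SemanticNetwork`, the method `categories_of`, and the category `Person` are assumptions, not part of the talk.

```python
from dataclasses import dataclass, field

@dataclass
class SemanticNetwork:
    is_a: set = field(default_factory=set)       # (subcategory, category) arcs
    member_of: set = field(default_factory=set)  # (object, category) arcs
    function: set = field(default_factory=set)   # (object, property, value) arcs

    def categories_of(self, obj):
        """All categories of an object: direct memberships plus IS-A ancestors."""
        found = {cat for (o, cat) in self.member_of if o == obj}
        frontier = list(found)
        while frontier:
            current = frontier.pop()
            for sub, sup in self.is_a:
                if sub == current and sup not in found:
                    found.add(sup)
                    frontier.append(sup)
        return found

net = SemanticNetwork()
net.is_a.add(("Employee", "Person"))              # "Person" is a made-up category
net.member_of.add(("Cook", "Employee"))
net.function.add(("Cook", "works-for", "Bob-Stacy"))

cook_categories = net.categories_of("Cook")
print(sorted(cook_categories))
```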

Example Of Semantic Network

[Tim Berners-Lee, James Hendler and Ora Lassila]

Use Case – Precise Search

- Current search engines
  - Keyword based
  - Single page only
- Semantic search
  - Assembles knowledge spanning many pages
- Example scenario: locating a person
  - Her last name is "Cook"
  - She works for a company on your client list
  - She has a son attending your alma mater, Avondale University
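The scenario can be sketched as a join over facts collected from several pages. Everything concrete below is invented for illustration: the page names, the identifiers `person1`, `person2`, `child1`, `child2`, the company `Acme`, and the property names. The son is modelled simply as a child, ignoring gender.

```python
# Hypothetical facts, each a (subject, property, value) triple, grouped by source page.
pages = {
    "people-page": {("person1", "last-name", "Cook"),
                    ("person2", "last-name", "Cook")},
    "company-page": {("person1", "works-for", "Bob-Stacy"),
                     ("person2", "works-for", "Acme")},
    "family-page": {("person1", "parent-of", "child1"),
                    ("person2", "parent-of", "child2")},
    "university-page": {("child1", "attends", "Avondale University")},
}
client_list = {"Bob-Stacy"}

def find_people(pages, clients, last_name, university):
    """Combine facts from all pages and keep people who satisfy every condition."""
    facts = set().union(*pages.values())
    named = {s for (s, p, v) in facts if p == "last-name" and v == last_name}
    employed = {s for (s, p, v) in facts if p == "works-for" and v in clients}
    students = {s for (s, p, v) in facts if p == "attends" and v == university}
    parents = {s for (s, p, v) in facts if p == "parent-of" and v in students}
    return named & employed & parents

matches = find_people(pages, client_list, "Cook", "Avondale University")
print(matches)
```

No single page contains the answer; it only appears once the facts are combined, which a keyword search over individual pages cannot do.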

Architecture Of The Semantic Web

[Tim Berners-Lee]

Where the standardization progress stands

Outline of the talk

- RDF and RDF Schema
- DAML+OIL and OWL
- Description Logic

RDF

- An RDF statement is a triple <subject, property, value>
- Reification is a statement about a statement, i.e., the subject is itself a statement
- Each subject is identified by a URI
- Semantics are represented as a set of triples and serialized as XML

RDF Example

[Figure: RDF graph with nodes http://www.bob-stacy.com/cook and http://www.bob-stacy.com]

    <rdf:Description rdf:about="http://www.bob-stacy.com/cook">
      <works-for rdf:resource="http://www.bob-stacy.com" />
    </rdf:Description>

    <rdf:Description rdf:about="http://www.bob-stacy.com/cook">
      <lives-in>Johannesburg</lives-in>
    </rdf:Description>
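To see the triples behind this serialization, the descriptions can be read back with Python's standard XML parser. This is a rough sketch rather than a full RDF parser: the wrapping `rdf:RDF` element, the vocabulary namespace `http://example.org/vocab#`, and the function name `extract_triples` are assumptions added so the fragment is well-formed XML, and the company URI is written in full.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
VOCAB_NS = "http://example.org/vocab#"  # hypothetical namespace for works-for / lives-in

DOCUMENT = f"""
<rdf:RDF xmlns:rdf="{RDF_NS}" xmlns="{VOCAB_NS}">
  <rdf:Description rdf:about="http://www.bob-stacy.com/cook">
    <works-for rdf:resource="http://www.bob-stacy.com" />
  </rdf:Description>
  <rdf:Description rdf:about="http://www.bob-stacy.com/cook">
    <lives-in>Johannesburg</lives-in>
  </rdf:Description>
</rdf:RDF>
"""

def extract_triples(xml_text):
    """Collect (subject, property, value) triples from rdf:Description elements."""
    triples = []
    root = ET.fromstring(xml_text)
    for description in root.findall(f"{{{RDF_NS}}}Description"):
        subject = description.get(f"{{{RDF_NS}}}about")
        for prop in description:
            name = prop.tag.split("}", 1)[1]  # drop the "{namespace}" prefix
            resource = prop.get(f"{{{RDF_NS}}}resource")
            value = resource if resource is not None else (prop.text or "").strip()
            triples.append((subject, name, value))
    return triples

triples = extract_triples(DOCUMENT)
for triple in triples:
    print(triple)
```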

RDF Schema

- RDF Schema provides a way to define the meanings of tags
- A tag is a class

    Employee rdf:type rdfs:Class

    <rdf:Description rdf:ID="Employee">
      <rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/>
    </rdf:Description>

- A tag is a property

    Works-for rdf:type rdf:Property
    Works-for rdfs:domain Employee
    Works-for rdfs:range Company
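The domain and range declarations are what let a machine draw conclusions from a plain statement. A minimal sketch, assuming a dictionary-based schema and a helper named `infer_types` (both invented here): from "Cook Works-for Bob-Stacy" it follows that Cook is an Employee and Bob-Stacy is a Company.

```python
RDF_TYPE = "rdf:type"

# Schema from the slide: Works-for has domain Employee and range Company.
schema = {
    "Works-for": {"domain": "Employee", "range": "Company"},
}

def infer_types(triples, schema):
    """Apply the domain and range rules once to every triple with a known property."""
    inferred = set()
    for subject, prop, value in triples:
        constraints = schema.get(prop)
        if constraints is None:
            continue  # nothing is declared for this property
        inferred.add((subject, RDF_TYPE, constraints["domain"]))
        inferred.add((value, RDF_TYPE, constraints["range"]))
    return inferred

facts = {("Cook", "Works-for", "Bob-Stacy")}
new_facts = infer_types(facts, schema)
print(sorted(new_facts))
```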

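RDF Schema (Cont’d)

XML as a shorthand for RDF descriptions of a semantic network

[Figure: semantic network. Employee and Company have type Class; Works-for has type Property, with domain Employee and range Company; Mrs. Cook has type Employee, Bob-Stacy has type Company, and Mrs. Cook Works-for Bob-Stacy.]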

RDF Schema (Cont’d)

    <Company rdf:ID="Bob-Stacy" />
    <Employee rdf:ID="Cook">
      <Works-for rdf:resource="#Bob-Stacy" />
    </Employee>

RDF Schema (Cont’d)

- Other modeling mechanisms
  - rdfs:subClassOf
  - rdfs:subPropertyOf
  - RDF containers and collections
  - Reification (rdf:type is rdf:Statement)
- DAML+OIL provides more

Outline of the talk

- RDF and RDF Schema
- DAML+OIL and OWL
- Description Logic

DAML+OIL

- DAML+OIL adds richer expressive mechanisms to RDF Schema
  - Constraints on properties
  - Boolean combinations of classes
  - Equivalence and disjointness
  - Properties of properties

Constraints On Property

- A class is defined as all objects satisfying constraints on a property
- Universal constraint

    E.g.: all objects working only for companies

    <daml:Restriction>
      <daml:onProperty rdf:resource="#works-for" />
      <daml:toClass rdf:resource="#Company" />
    </daml:Restriction>

- Existential constraint (hasClass and hasValue)

Constraints On Property (Cont’d)

- Cardinality constraint (minCardinality, maxCardinality, and cardinality for an exact count)

    E.g.: all objects having more than 1 child

    <daml:Restriction>
      <daml:onProperty rdf:resource="#parentOf" />
      <daml:minCardinality> 2 </daml:minCardinality>
    </daml:Restriction>
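The three kinds of constraint can be compared by checking them against a small, fully known set of facts. This is a closed-world sketch for intuition only (DAML+OIL itself does not assume that all facts are known); the function names and the individuals `Lee`, `Sam`, `Alex`, and `Kim` are made up.

```python
def values(obj, prop, facts):
    """All v such that (obj, prop, v) is among the known facts."""
    return {v for (s, p, v) in facts if s == obj and p == prop}

def satisfies_to_class(obj, prop, cls, facts, members):
    """Universal constraint (toClass): every value of prop belongs to cls."""
    return all(v in members[cls] for v in values(obj, prop, facts))

def satisfies_has_class(obj, prop, cls, facts, members):
    """Existential constraint (hasClass): at least one value of prop belongs to cls."""
    return any(v in members[cls] for v in values(obj, prop, facts))

def satisfies_min_cardinality(obj, prop, n, facts):
    """Cardinality constraint (minCardinality): prop has at least n distinct values."""
    return len(values(obj, prop, facts)) >= n

facts = {
    ("Cook", "works-for", "Bob-Stacy"),
    ("Cook", "parentOf", "Sam"),
    ("Cook", "parentOf", "Alex"),
    ("Lee", "parentOf", "Kim"),
}
members = {"Company": {"Bob-Stacy"}}

cook_universal = satisfies_to_class("Cook", "works-for", "Company", facts, members)
lee_universal = satisfies_to_class("Lee", "works-for", "Company", facts, members)
lee_existential = satisfies_has_class("Lee", "works-for", "Company", facts, members)
cook_two_children = satisfies_min_cardinality("Cook", "parentOf", 2, facts)
```

Note that `Lee`, who works for nobody, satisfies the universal constraint vacuously but not the existential one.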

Boolean Combination of Classes

- Intersection, union, complement

    E.g.: all objects working for Bob-Stacy and having more than 1 child

    <daml:Class rdf:ID="BobStacyParentWorker">
      <daml:intersectionOf rdf:parseType="daml:collection">
        <daml:Restriction>
          <daml:onProperty rdf:resource="#works-for" />
          <daml:hasValue rdf:resource="#Bob-Stacy" />
        </daml:Restriction>
        <daml:Restriction>
          <daml:onProperty rdf:resource="#parentOf" />
          <daml:minCardinality> 2 </daml:minCardinality>
        </daml:Restriction>
      </daml:intersectionOf>
    </daml:Class>

Equivalence and Disjointness

- Equivalence
  - sameClassAs
  - samePropertyAs
  - sameIndividualAs
- Disjointness
  - disjointWith
  - differentIndividualFrom

Property of property

- inverseOf
- TransitiveProperty
- UniqueProperty and UnambiguousProperty
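Treating a property as a set of (subject, value) pairs makes these characteristics easy to state. The sketch below is illustrative only; the function names and the individuals `Sam`, `Alex`, and `Ann` are assumptions.

```python
def inverse(pairs):
    """inverseOf: swap subject and value in every pair."""
    return {(b, a) for (a, b) in pairs}

def transitive_closure(pairs):
    """TransitiveProperty: add (a, c) whenever (a, b) and (b, c) are present."""
    closure = set(pairs)
    while True:
        extra = {(a, d) for (a, b) in closure for (c, d) in closure if b == c}
        if extra <= closure:
            return closure
        closure |= extra

def is_unique(pairs):
    """UniqueProperty: each subject has at most one value."""
    subjects = [a for (a, _) in pairs]
    return len(subjects) == len(set(subjects))

def is_unambiguous(pairs):
    """UnambiguousProperty: each value has at most one subject."""
    return is_unique(inverse(pairs))

parent_of = {("Cook", "Sam"), ("Cook", "Alex"), ("Sam", "Ann")}
child_of = inverse(parent_of)
ancestor_of = transitive_closure(parent_of)
```

Here `parent_of` is not unique (Cook has two children) but is unambiguous (each child has one recorded parent).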

OWL

- OWL DL is derived from DAML+OIL
  - Adds, removes, or renames some language constructs
  - Version management
  - Knowledge modularization (import)
- OWL Full allows a class to be an individual (undecidable)

Outline of the talk

- RDF and RDF Schema
- DAML+OIL and OWL
- Description Logic

ALC

- Concept descriptions are formed using the constructs:
  - A (atomic concept)
  - C ⊓ D (daml:intersectionOf)
  - C ⊔ D (daml:unionOf)
  - ¬C (daml:complementOf)
  - ∀R.C (daml:toClass)
  - ∃R.C (daml:hasClass)
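The standard set-theoretic reading of these constructs can be sketched as an evaluator over a finite interpretation. The tuple encoding of concepts (`"not"`, `"and"`, `"or"`, `"all"`, `"some"`), the function name `extension`, and the sample individuals are all assumptions made for this illustration.

```python
def extension(concept, domain, atoms, roles):
    """Return the set of individuals in `domain` that belong to `concept`."""
    if isinstance(concept, str):                       # atomic concept A
        return set(atoms.get(concept, ()))
    op = concept[0]
    if op == "not":                                    # ¬C: complement within the domain
        return set(domain) - extension(concept[1], domain, atoms, roles)
    if op == "and":                                    # C ⊓ D: intersection
        return (extension(concept[1], domain, atoms, roles)
                & extension(concept[2], domain, atoms, roles))
    if op == "or":                                     # C ⊔ D: union
        return (extension(concept[1], domain, atoms, roles)
                | extension(concept[2], domain, atoms, roles))
    if op in ("all", "some"):
        role, filler = concept[1], concept[2]
        filler_ext = extension(filler, domain, atoms, roles)
        pairs = roles.get(role, ())
        result = set()
        for x in domain:
            successors = {y for (a, y) in pairs if a == x}
            if op == "all" and successors <= filler_ext:   # ∀R.C: every successor in C
                result.add(x)
            if op == "some" and successors & filler_ext:   # ∃R.C: some successor in C
                result.add(x)
        return result
    raise ValueError(f"unknown constructor: {op!r}")

domain = {"cook", "bob_stacy", "sam"}
atoms = {"Company": {"bob_stacy"}}
roles = {"works-for": {("cook", "bob_stacy")}}

some_company = extension(("some", "works-for", "Company"), domain, atoms, roles)
all_company = extension(("all", "works-for", "Company"), domain, atoms, roles)
not_company = extension(("not", "Company"), domain, atoms, roles)
```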

SHIQ

- ALC plus
  - Transitive role R* (daml:TransitiveProperty)
  - Concept hierarchy and role hierarchy (subClass and subProperty)
  - Inverse role R⁻ (daml:inverseOf)
  - Qualified number restriction ≥ n R.C, etc. (daml:minCardinalityQ, etc.)

Basic Inference Problem

- Concept subsumption: C ⊑_T D
- Reduction to unsatisfiability
    C ⊑_T D ⟺ there exists no model I of T s.t. (C ⊓ ¬D)^I is not empty
- Tableau algorithm to find such a model

Tableau

- D is a concept
- sub(D) is the closure of concept subexpressions in D’s definition
- S is a set of individuals
- L : S → 2^sub(D) maps each individual to a subset of sub(D)
- E : R → 2^(S×S) maps each role to a set of pairs of individuals
- There is some s in S s.t. D is in L(s)

Tableau For ALC Concepts

T = <S, L, E> is a tableau for ALC concept D if the following hold for all s, t in S:

- L(s) does not contain both C and ¬C
- If C1 ⊓ C2 ∈ L(s), then C1 ∈ L(s) and C2 ∈ L(s)
- If C1 ⊔ C2 ∈ L(s), then C1 ∈ L(s) or C2 ∈ L(s)
- If ∀R.C ∈ L(s) and <s,t> ∈ E(R), then C ∈ L(t)
- If ∃R.C ∈ L(s), then there is some t s.t. <s,t> ∈ E(R) and C ∈ L(t)

Further constraints for other concept constructs can be added for more expressive DLs.

Tableau Algorithm For ALC

- Completion tree
  - Node x is labeled with a set L(x) ⊆ sub(D)
  - Edge <x,y> is labeled with a set L(<x,y>) of roles occurring in D
- Tree expansion rules
  - ⊓-rule
      Condition: C1 ⊓ C2 ∈ L(x) and {C1, C2} ⊈ L(x)
      Action: L(x) ← {C1, C2} ∪ L(x)
  - ⊔-rule
      Condition: C1 ⊔ C2 ∈ L(x) and {C1, C2} ∩ L(x) = ∅
      Action: for some C ∈ {C1, C2}, L(x) ← {C} ∪ L(x)

Tableau Algorithm For ALC (Cont’d)

- ∀-rule
    Condition: ∀R.C ∈ L(x) and there is an R-successor y of x s.t. C ∉ L(y)
    Action: L(y) ← {C} ∪ L(y)
- ∃-rule
    Condition: ∃R.C ∈ L(x) and x has no R-successor y s.t. C ∈ L(y), and no other rule is applicable to any of its ancestors
    Action: create an R-successor y for x with L(<x,y>) = {R} and L(y) = {C}

Tableau Algorithm For ALC (Cont’d)

- A node x has a clash if {C, ¬C} ⊆ L(x)
- The algorithm starts with a node x labeled with L(x) = {D}
- Expansion rules are applied until
  - A clash happens, or
  - No rules can be further applied to the tree
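The rules above can be sketched as a short recursive satisfiability test for ALC concepts without axioms. This is a simplified illustration, not the presenter's implementation: it assumes concepts are first rewritten so that negation applies only to atomic concepts, tries the ⊔-rule alternatives by backtracking, and checks each ∃-successor independently instead of building an explicit tree. The tuple encoding and all function names are assumptions.

```python
def is_atomic(c):
    return isinstance(c, str)

def nnf(c):
    """Push negation inward until it applies only to atomic concepts."""
    if is_atomic(c):
        return c
    op = c[0]
    if op in ("and", "or"):
        return (op, nnf(c[1]), nnf(c[2]))
    if op in ("all", "some"):
        return (op, c[1], nnf(c[2]))
    inner = c[1]                                   # here op == "not"
    if is_atomic(inner):
        return c
    if inner[0] == "not":
        return nnf(inner[1])
    if inner[0] in ("and", "or"):
        dual = "or" if inner[0] == "and" else "and"
        return (dual, nnf(("not", inner[1])), nnf(("not", inner[2])))
    dual = "some" if inner[0] == "all" else "all"
    return (dual, inner[1], nnf(("not", inner[2])))

def satisfiable(label):
    """Return True if the set of negation-normal-form concepts has a clash-free expansion."""
    label = set(label)
    changed = True
    while changed:                                 # ⊓-rule: add both conjuncts
        changed = False
        for c in list(label):
            if not is_atomic(c) and c[0] == "and" and not {c[1], c[2]} <= label:
                label |= {c[1], c[2]}
                changed = True
    for c in label:                                # clash: A and ¬A in the same label
        if is_atomic(c) and ("not", c) in label:
            return False
    for c in label:                                # ⊔-rule: try each disjunct in turn
        if not is_atomic(c) and c[0] == "or" and not ({c[1], c[2]} & label):
            return any(satisfiable(label | {choice}) for choice in (c[1], c[2]))
    for c in label:                                # ∃-rule, then ∀-rule on the new successor
        if not is_atomic(c) and c[0] == "some":
            successor = {c[2]} | {d[2] for d in label
                                  if not is_atomic(d) and d[0] == "all" and d[1] == c[1]}
            if not satisfiable(successor):
                return False
    return True

def subsumed_by(c, d):
    """C ⊑ D holds exactly when C ⊓ ¬D is unsatisfiable."""
    return not satisfiable({nnf(("and", c, ("not", d)))})

# (∃R.A) ⊓ (∃R.B) ⊑ ∃R.(A ⊓ B) does not hold: the two successors can differ.
example_holds = subsumed_by(("and", ("some", "R", "A"), ("some", "R", "B")),
                            ("some", "R", ("and", "A", "B")))
print(example_holds)
```

In the opposite direction the check succeeds, for example `subsumed_by(("some", "R", ("and", "A", "B")), ("some", "R", "A"))`, because the single successor would have to contain both A and ¬A.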

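Tableau Example

Check (∃R.A) ⊓ (∃R.B) ⊑ ∃R.(A ⊓ B)

D = (∃R.A) ⊓ (∃R.B) ⊓ ¬(∃R.(A ⊓ B))

[Figure: completion tree]

- x: {(∃R.A) ⊓ (∃R.B) ⊓ ¬(∃R.(A ⊓ B))} → {(∃R.A), (∃R.B), (∀R.(¬A ⊔ ¬B))}
- y (R-successor of x): {A} → {A, (¬A ⊔ ¬B)} → {A, ¬A} clash, or {A, ¬B}
- z (R-successor of x): {B} → {B, ¬A ⊔ ¬B} → {B, ¬A}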

Transitive Role and Blocking

- ∀+-rule
    Condition: ∀R.C ∈ L(x) where R is a transitive role, and there is an R-successor y of x s.t. ∀R.C ∉ L(y)
    Action: L(y) ← {∀R.C} ∪ L(y)
- May lead to an infinite loop
- Subset blocking: if L(y) is a subset of an ancestor’s label L(x), then block the expansion
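A small sketch of why blocking is needed: with a transitive role, each new successor can inherit the same existential again, so a chain of successors never ends unless expansion stops once a label is contained in an ancestor's label. The tuple encoding of concepts and the helper names `successor_label`, `is_blocked`, and `expand_chain` are assumptions; the helper only handles labels that contain an existential over the given role.

```python
def successor_label(label, role):
    """Label of a new successor for the first ∃role.C found in `label`,
    after the ∀-rule and the ∀+-rule (role assumed transitive) have been applied."""
    existential = next(c for c in label
                       if isinstance(c, tuple) and c[0] == "some" and c[1] == role)
    new_label = {existential[2]}
    for c in label:
        if isinstance(c, tuple) and c[0] == "all" and c[1] == role:
            new_label.add(c[2])   # ∀-rule: the filler moves to the successor
            new_label.add(c)      # ∀+-rule: the ∀ itself moves along a transitive role
    return frozenset(new_label)

def is_blocked(label, ancestor_labels):
    """Subset blocking: the label is contained in some ancestor's label."""
    return any(label <= ancestor for ancestor in ancestor_labels)

def expand_chain(root_label, role, limit=50):
    """Follow successors until one is blocked or `limit` nodes have been created."""
    path = [frozenset(root_label)]
    while len(path) < limit:
        child = successor_label(path[-1], role)
        if is_blocked(child, path):
            return path, child
        path.append(child)
    return path, None

root = frozenset({"C", ("some", "R", "C"), ("all", "R", ("some", "R", "C"))})
path, blocked_child = expand_chain(root, "R")
print(len(path), blocked_child == root)
```

Without the `is_blocked` check the loop would run until `limit`, since every successor ends up with the same label as the root.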

Blocking Example

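[Figure: completion tree for a transitive role R]

- x: L(x) = {C, ∃R.C, ∀R.(∃R.C)}
- y (R-successor of x): L(y) = {C} → L(y) = {C, ∃R.C}
- y is blocked, since L(y) ⊆ L(x) = {C, ∃R.C, ∀R.(∃R.C)}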

More Expressive DL

- For additional concept constructs (inverse role, quantifier, etc.), add more expansion rules and blocking rules

Optimizations

- Backjumping
    L(x) = {C1 ⊔ D1, …, Cn ⊔ Dn, ∃R.(A ⊓ B), ∀R.(¬A)}

[Figure: search tree illustrating backjumping and pruning] [Horrocks-Sattler-Tobies]

Optimizations (Cont’d)

- Absorption
  - Reasoning w.r.t. an axiom C ⊑ D needs to add (¬C ⊔ D) to every node
  - For CN ⊑ D (¬CN ⊑ D), only need to add D to the nodes that contain CN (¬CN)
  - Transforming an axiom into this form
    - CN ⊓ C ⊑ D ⟹ CN ⊑ ¬C ⊔ D
    - CN ⊑ C, CN ⊑ D ⟹ CN ⊑ C ⊓ D
    - Similar rules for the ¬CN cases
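The saving can be sketched by comparing what each treatment adds to a node label. The functions below are illustrative only: the names, the tuple encoding, and the sample axiom with concepts `Parent` and `Busy` are invented, and the disjunction produced for a general axiom is left as written rather than normalized.

```python
def apply_general_axioms(label, axioms):
    """Without absorption: every node receives ¬C ⊔ D for each axiom C ⊑ D."""
    return set(label) | {("or", ("not", c), d) for (c, d) in axioms}

def absorb(axiom):
    """Rewrite CN ⊓ C ⊑ D as CN ⊑ ¬C ⊔ D when CN is an atomic name, else return None."""
    lhs, rhs = axiom
    if isinstance(lhs, tuple) and lhs[0] == "and" and isinstance(lhs[1], str):
        return lhs[1], ("or", ("not", lhs[2]), rhs)
    return None

def apply_absorbed_axioms(label, absorbed):
    """With absorption: the consequence is added only to nodes that contain CN."""
    result = set(label)
    changed = True
    while changed:
        changed = False
        for name, consequence in absorbed.items():
            if name in result and consequence not in result:
                result.add(consequence)
                changed = True
    return result

axiom = (("and", "Employee", "Parent"), "Busy")    # hypothetical axiom
absorbed = dict([absorb(axiom)])

company_node = apply_absorbed_axioms({"Company"}, absorbed)
employee_node = apply_absorbed_axioms({"Employee"}, absorbed)
general_node = apply_general_axioms({"Company"}, [axiom])
```

The node labelled only with `Company` is left untouched by the absorbed axiom, whereas the general treatment gives it a new disjunction to branch on.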

Optimizations (Cont’d)

- Cache
  - Cache the satisfiability of L(x) for node x
  - Cache partial tableaux of concepts to check obvious satisfiability (e.g., merge the tableaux of C and ¬D to check the satisfiability of C ⊓ ¬D, and thus C ⋢ D)

Optimizations (Cont’d)

- Lazy expansion of concepts
- Semantic branching search
    C1, …, Cn, ¬C1 ⊔ … ⊔ ¬Cn ⊔ D ⟹ D
- Heuristic guided search
  - Oldest-first: select the disjunctions dependent on the least recent branching point

Summary

- The Semantic Web tries to make the WWW machine accessible
- OWL is the current standard for defining vocabularies, and a large part of OWL is DL
- Challenges (DB-related)
  - Scalability (techniques for reasoning with individuals in DL are unlikely to scale up)
  - Query
  - Ontology design and integration

References

W3C standards:

- Resource Description Framework (RDF) Model and Syntax Specification. Ora Lassila, Ralph R. Swick, eds. W3C Recommendation, 22 February 1999. http://www.w3.org/TR/1999/REC-rdf-syntax-19990222/
- RDF Vocabulary Description Language 1.0: RDF Schema. Dan Brickley, R.V. Guha, eds. W3C Working Draft. http://www.w3.org/TR/rdf-schema/
- RDF Primer. Frank Manola, Eric Miller, eds. W3C Working Draft. http://www.w3.org/TR/rdf-primer/
- DAML+OIL (March 2001) Reference Description. Dan Connolly, Frank van Harmelen, Ian Horrocks, Deborah L. McGuinness, Peter F. Patel-Schneider, and Lynn Andrea Stein. W3C Note, 18 December 2001. http://www.w3.org/TR/daml+oil-reference
- OWL Web Ontology Language Reference. Mike Dean, Guus Schreiber, eds.; Frank van Harmelen, Jim Hendler, Ian Horrocks, Deborah L. McGuinness, Peter F. Patel-Schneider, Lynn Andrea Stein. W3C Working Draft. http://www.w3.org/TR/owl-ref/

References (Cont’d)

Description logic:

- Basic Description Logics. In: Description Logic Handbook, edited by F. Baader, D. Calvanese, D.L. McGuinness, D. Nardi, P.F. Patel-Schneider. Cambridge University Press, 2002, pages 47-100. http://www.cs.man.ac.uk/~franconi/dl/course/dlhb/dlhb-02.pdf
- Practical Reasoning for Very Expressive Description Logics. I. Horrocks, U. Sattler, and S. Tobies. Logic Journal of the IGPL, Volume 8, Issue 3, May 2000. http://www3.oup.co.uk/igpl/Volume_08/Issue_03/pdf/horrocks1.pdf