The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University &...

43
The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com

Transcript of The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University &...

Page 1: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

The Ontological Semantic Perspective

on the Semantic Web

Victor Raskin, Purdue University & hakia.comChristian F. Hempelmann, hakia.com

Page 2: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Introduction

• The Semantic Web is a good and obvious idea.

• Its successful implementation, however, depends on two major components, – the adequacy of the formalism used to represent

the content, and, most importantly,– the methods of rendering texts into that formalism.

• The paper will focus on the general and current issues with both components.

Page 3: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Goal of Presentation

• The Semantic Web, as conceived and especially as practiced, has not and cannot work

• However, the purpose is not to knock it further--it is collapsing on its own

• Good work done under its guise (cf. SDI)• In order to work, it should co-opt OntoSem• And then nobody will need the Semantic Web

(stone soup)

Page 4: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Introduction

• Bar Hillel: mathematical logicians favor the manipulation of the logical format to define rules of inference and similar issues over the adequacy of the format.

• Semantic Web (and now, apparently, Google) relies on manual tagging of web pages with OWL or something like it by individual website owners: a Mao-like dream and a fatal error

Page 5: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Structure of Presentation 1

• Semantic Web as a Manifesto (cf. The Communist Manifesto—on second thoughts, don’t!)– Nature– Principles– Reasons

• Formalism:– Form and Content (very very hard)– Linking is good--linking what?– Ontology, tags, and other luxuries

Page 6: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Structure of Presentation 2

• Translation of Text into OWL, RDF, etc.– Tag away!– Know thy meaning– Oh, we don’t know how to do it ourselves but it is really

simple, and no, no professional skills required

• Why not NLP– Sorry! Not NLP, MP– Fear of semantics– OntoSem is Semantic Web– Semantic Web is the stone of stone soup

Page 7: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

What is the Semantic Web?

• Making the content of the Web searchable, at least partially, on the basis of its semantic content, not simply on the basis of matching strings and metasyntactic tags.

• Great vision!• So were alchemy and astrology--

the devil is in the details!

Page 8: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Principles of the Semantic Web?

• Generality:—Berners-Lee (1998a) describes this as follows: “When looking at a possible formulation of a universal Web of semantic assertions, the principle of minimalist design requires that it be based on a common model of great generality. Only when the common model is general can any prospective application be mapped onto the model”.

Page 9: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Principles of the Semantic Web?

• Simplicity and low cost—according to Hendler (2001), “[a] crucial aspect of creating the semantic web is to make it possible for a number of different users to create machine-readable content without being logic experts. In fact, ideally, most of the users shouldn’t even need to know that web semantics exists. Lowering the cost of mark-up isn’t enough—for many users it needs to be free. That is, semantic mark-up should be a by-product of normal computer use. Much like current web content, a small number of tool creators and web ontology designers will have to know the details, but most users will not even know ontologies exist.” (Hold this thought!)

Page 10: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Reasons for the Semantic Web?

• “traditional” artificial intelligence has not led to the development of realistic-scale practical applications;

• the knowledge representation area, while generating useful ideas, has failed to translate them into a coherent, large-scale action;

• prior work on world modeling and reconciliation among different formal models can be useful but still does not measure up to the standards of the emerging Semantic Web;

• first-order predicate calculus (FOPC) and higher-order logic, the traditional reasoning techniques have been duly criticized for expressing things that were at the same time undecidable and not effectively computable, rigid.

Page 11: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Reasons for the Semantic Web?

• Actually, NLP has failed also—because – it has been dominated by meaning-avoidance, street-lamp-

based techniques– It has not attracted, prepared or encouraged qualified

computational linguist participation– Computer scientists, engineers, and statisticians ignorantly

confuse knowing a language with knowing about language--and are proud of it

• But Sir Tim is blissfully unaware of that because it is not on his mental map

• What you don’t know can help you (at least in attaining knighthood)

Page 12: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Formalism over Content

concept-x|concept-x(concept-y)• Happy?• Neat formalism, but the content is hidden: very

important—for most semantic web developers this is content

• http://youtube.com/watch?v=6gmP4nk0EOE nothing here is about meaning

• Semantic Web is not! (Sir Tim apologized for the misnomer: Data Web)

Page 13: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Linking is Good!

• Linking what?• Tagged character strings• Yes, it’s marginally better than linking character

strings but not good enough: we still do not know what the labeled content is—it is still just character strings, and the maximum we can know is that some substrings recur, and how is that different from keywords

• This is shallow semantics, aka no semantics• Semantic web is about “more labels than XML”• More non-semantics does not equal semantics!

Page 14: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Viva OWL!

• Ontologies will save the day! (by carping diem?)• OWL contains ontologies (oops!)• OWL tells us how to build ontologies (oops!)• Okay, okay: OWL tells us how to formalize ontologies

after we build them• “If I had ham I would make ham and eggs—

if I had eggs, that is.”• So who teaches us how to build ontologies?• Oh, Sir Tim, of course! Listen to this:

Page 15: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Tim Almighty

• There are a mixed feelings about the passion for tagging which typifies the Web 2.0 wave. On the one hand, there is excitement about the fact that users are, as a large number, adding re-usable information to the information space, allowing sites such as del.icio.us and flickr to sort, cluster and query masses of otherwise amorphous photos and web content. On the other hand, there is the sinking feeling that tags are headed the same way as keywords of Information Retrieval in the 1980s: initial hope, and then being stranded between the unbearable constraints of a controlled vocabulary and the hopeless ambiguity of uncontrolled user-generated keywords. Tom Gruber, writer of books on ontology who runs a Web 2.0 site himself. gave a talk at ISWC 2006 which touched on bringing the gap, and taking the passion to organize and express, and using it to make re-usable data.

Page 16: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Tim Almighty

• There is currently a tension in the tagging world as to whether tags are regarded as global in meaning, or whether there meaning really depends on the tagger. In del.icio.us, one can query for thinks tagged with a certain word by a certain person. (I heard of one online community which was considering making a system to allow one formally to state when one has committed to use a given tag in the same way as another person, or growing mesh of people. That would be a very interesting feature, as it would allow a useful definition to gain growing acceptance, to progressively move from being a private idea to being a group global standard.)

Page 17: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Tim Almighty

• Meanwhile, other sites get users to provide semantic web data with well-defined global ontologies. The locations of people, events and photos, relationships between people, authorship of publications, things and people an image depicts, and so on, is done using well-defined identifiers (under the covers) for everything involved, including the relationships and properties. The resulting data is extremely re-usable. The problem is that it isn't as quick as tagging with a single word off the top of one's head.

Page 18: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Ontology Building

• So, now we know how to build ontologies?• Well… In fact, there are different evolved areas of ontology research:

– Re-enlightened metaphysics = philosophical ontologies

– Formal ontologies

– Engineering ontologies

– Computational ontologies

– Controlled-vocabulary type ontologies

• Additionally, there are:– Rules, formal and of thumb, for building ontologies

– Methodologies

– Acquisition toolboxes

– Uniformity and continuity concerns (cf. CYC, R.I.P)

• Some serious ontologists, incidentally, easily jumped on the Semantic Web bandwagon to get funding and have gotten off when the funding became scarce

• Smart guys--just as their parents in SDI!

Page 19: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Trying to Build Ontologies from OWL Sites?

• Turns out to be very simple:– Identify objects—that would be nouns– Identify attributes—that would be adjectives– Identify processes—that would be verbs?

(not many go there)

• Turns out to be very wrong as well: Syntax/Morphology do not correspond to meaning much as the non-semantic NLP of the 1980s discovered to its perish

• John is easy to please vs. John is eager to please

Page 20: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Goals of Next Section

• Present functional nouns as evidence of syntactic-semantic discrepancy

• Introduce Ontological Semantics as a comprehensive machine-tractable representation of near-human understanding

Page 21: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Nouns in OntoSem

• Total number of noun senses: 74,286

• Nouns as objects (80.51%)

• Nouns as events (18.29%)

• Nouns as properties (1.20%)

Page 22: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

For Graph Lovers

• Distribution of noun senses

0

10000

20000

30000

40000

50000

60000

70000

number

event

object

property

Page 23: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

noun as OBJECT

bean (sem-struc(BEAN))

BEANis-a LEGUME

is-a VEGETABLE-FOODSTUFFis-a PLANT-FOODSTUFF

is-a FOODSTUFFis-a FOOD

is-a INGESTIBLEis-a INANIMATE

is-a PHYSICAL-OBJECTis-a OBJECT

59,806 noun senses

Page 24: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

noun as PROPERTY

viscosity (sem-struc(VISCOSITY))

VISCOSITYis-a PHYSICAL-PROPERTY

is-a PHYSICAL-OBJECT-ATTRIBUTEis-a LITERAL-OBJECT-ATTRIBUTE

is-a LITERAL-ATTRIBUTEis-a ATTRIBUTE

is-a PROPERTY

893 noun senses

Page 25: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

noun as EVENT

tempest (sem-struc(THUNDERSTORM(intensity(value >0.7))))

THUNDERSTORM

is-a STORM

is-a NATURAL-HAZARD

is-a DISASTER-EVENT

is-a PHYSICAL-EVENT

is-a EVENT

13,587 noun senses

Page 26: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

BANKRUPTCYis-a financial-eventagent owe.agent

owe.beneficiaryprecondition approach-bankruptcyhas-event-as-part

(IF modality.pay.value = 0THEN bankrupt-chapter-7ELSE bankrupt-chapter-11)

APPROACH-BANKRUPTCYis-a financial-eventagent corporation-ahas-event-as-part

...

nouns are complex EVENTS

...(IF

ANDowe

agent corporation-abeneficiary human-a

employed-by corporation-alending-institution-acorporation-b

theme moneypay

agent corporation-abeneficiary human-a

lending-institution-acorporation-b

theme moneyTHEN bankruptcy

agent corporation-abeneficiary human-a

lending-institution-acorporation-b)

Page 27: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

User does the Work:Aim

• complex task• cheap (free) labor

– enthusiastic?– unstrained– lightly supervised– coerced

Page 28: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

User does the Tagging: Case Studies

• CYC– originally largely unsupervised and unsalvageable

output – waning interest [when curiosity fails, why do it?]

• digg.com– large number of unsophisticated users vs.

noteworthy, relevant web content

Page 29: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

User does the Work: Case Studies

• Mao– peasants vs. blast furnaces

• Volkssturm– modern warfare vs. untrained mass armies

• Linguists asking the “native speaker” about meaning of their language– knowing a language vs. knowing about language

Page 30: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Users cannot Identify Meaning

• What is the meaning of John and Mary are husband and wife?

• They met• They liked each other• They dated• They got engaged• They got married• They live together

Page 31: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Users cannot Identify Meaning

• What is the meaning of John and Mary are husband and wife?

• They have sex• They live together• They may have children• They have joint accounts• They socialize together• All of the above

Page 32: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Users will not Tag Reliably, Easily, Uniformly, or

Happily• Users

o can determine all the material that needs to be tagged;

o are familiar with the tag inventory and understand what the tags mean;

o can determine the appropriate tag or tags for each element that must be tagged; and

o can perform consistently over time and with other taggers.

• Not!!!

Page 33: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Users will not Tag Simply

<?xml version="1.0" ?> <!DOCTYPE rdf:RDF (View Source for full doctype...)> - <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"

xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/"> - <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/title"> <rdfs:label xml:lang="en-US">Title</rdfs:label> <rdfs:comment xml:lang="en-US">A name given to the resource.</rdfs:comment> <dc:description xml:lang="en-US">Typically, a Title will be a name by which the resource is formally known.</dc:description> <rdfs:isDefinedBy rdf:resource="http://purl.org/dc/elements/1.1/" /> <dcterms:issued>1999-07-02</dcterms:issued> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/contributor"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/creator"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/publisher"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/subject"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/description"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/date"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/type"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/format"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/identifier"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/language"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/relation"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/source"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/coverage"> </rdf:Property> + <rdf:Property rdf:about="http://purl.org/dc/elements/1.1/rights"> </rdf:Property> </rdf:RDF>

Page 34: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

User Will Not Tag Simply

• The prospective tagger must: o first find this pageo locate the tag in question o understand the semantics of the fillers of the

“comment” and “description” propertieso and then learn to assign the tag “title” to the

appropriate elements of any web page that he or she is writing.

• You must be kidding!

Page 35: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Users won’t do It

• Want simplicity, generality, uniformity, low cost, and ease?

• Sure, automate!• Go where you can find it—not where the

street light is and you can continue to use your favorite methods: playing with yourself… oops, sorry, with formalisms

• Go to meaning processing system—like OntoSem

Page 36: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

OntoSem Resources

• the 6,724-concept ontology,• a 47,025-entry English lexicon with 77,156 senses,• a 19,352-entry onomasticon and a total of 24,328

senses,• a text meaning representation (TMR) language,• an ontological parser transforming text into TMRs,

and• a fact repository, containing the growing number of

implemented TMRs.

Page 37: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

OntoSem Resources(for Graph Lovers)

resources

ontology lexicon

OntoParser

text or data

full TMR

OntoSem

Page 38: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Ontology Top Level

All (= empty root concept)

Objects

Events

Properties

Page 39: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

ALL

Objects

Events

Properties

Events

Mental events

Social events

Physical events

Ontology Event Branch

Page 40: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Properties

Case roles

Agent

Theme

Beneficiary

Instrument

Purpose

Location

Source

Destination

Path

Ontology Property Branch,Case Role Subbranch

Page 41: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Ontological Conceptgo

is-a motion-eventagent animal instrumentbody-part, vehiclesource locationdestination locationstart-time temporal-unitend-time temporal unit

Lexical Entry

drive-V1

[all but semantic information omitted]

sem-struc

go

agent human& adult

instrument car

Concept and Lexicon Entry

Page 42: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

Mary drove from Boston to New York on Wednesday

GO

agent Mary

instrument car

source Boston

destination New York

start-time Wednesday

end-time Wednesday

OntoSem is Event-Biased

Page 43: The Ontological Semantic Perspective on the Semantic Web Victor Raskin, Purdue University & hakia.com Christian F. Hempelmann, hakia.com.

• Ignorance of linguistics, for which the linguists are also responsible

• Fear of semantics, for which the linguists are also responsible

• Bad history of NLP• Objective difficulties of studying meaning (= mind)• So let us do the easy and pleasant stuff!• Forget this talk even happened and carry on with your fun

and games• Thanks—and apologies!

Why That was not in Sir Tim’s Vision?