Granularity in linked open data

23
Granularity in Library Linked Open Data Gordon Dunsire Keynote presentation to Code4Lib 2013, 12-14 Feb 2013, Chicago, USA

description

Discusses the types of granularity found in linked open data.

Transcript of Granularity in linked open data

Page 1: Granularity in linked open data

Granularity in Library Linked Open Data

Gordon DunsireKeynote presentation to Code4Lib 2013,

12-14 Feb 2013, Chicago, USA

Page 2: Granularity in linked open data

Overview

Page 3: Granularity in linked open data

Fractals

Self-similar at all levels of granularity

Cannot determine level: all levels are equal!

Page 4: Granularity in linked open data

Multi-faceted granularity

What is described by a bibliographic record?Or a single statement?

What is the level of description?How complete is it?

How detailed is the schema used?How dumb?

Semantic constraints?Unconstrained?

AAA! OWA! Rumsfeld and the white light!

Page 5: Granularity in linked open data

This resource has intended audience JuvenileTriple:

has Granularity?

Subject Predicate Object

Coarse-grained systems consist of fewer, larger components than fine-grained

systems [Wikipedia]

Resource Description Framework – Linked data

Page 6: Granularity in linked open data

Subject: what is the statement about?

Journal title

Issue

Article

JournalsLibrary collection

ParagraphWord

Consortium collection

Graphics

Subjects AccessDigital collection

RDF map

Section PageMarkup

RDF/XMLURI Node

Component

Super-Aggregate

Sub-Component

Aggregate

Focus

coarser

finer

Journal index

Festschrift

Resource Work

Page 7: Granularity in linked open data

Predicate: what is the aspect described?

Access to resource

Access to content

Suitability rating

Membership category

AudienceAudience of audio-visual material

Audience and usage Component

Super-Aggregate

Sub-Component

Aggregate

Focus

coarser

finer

Page 8: Granularity in linked open data

isbd: International Standard Bibliographic Description

dct: Dublin Core terms

schema: Schema.org

rda: Resource Description and Access

m21: marc21rdf.info

frbrer: Functional Requirements for Bibliographic Records, entity-relationship model

unc: unconstrained version

Possible Audience map (partial)

rdfs:subPropertyOf

unc:“has note on

use or audience”

isbd:“has note on

use or audience”

unc:“Intended audience”

rda:“Intended audience”

m21:“Target

audience” frbrer:“has intended

audience”

dct:“audience”

rdfs:subPropertyOf

  

  

m21:“Target

audience of …”

rdfs:subPropertyOf

schema:“audience”

Page 9: Granularity in linked open data

What is the aspect described?

Manifestation record

Title and s.o.r

Title statement

Resource record

Title wordFirst word of title

Title of manifestationComponent

Super-Aggregate

Sub-Component

Aggregate

Focus

coarser

finer

Page 10: Granularity in linked open data

dct:“Title”

dc:“Title” rdfs:

“Literal”

sP

sP

sPr

rdagrp1:“Title proper

(Manifestation)”

rdafrbr:“Manifestation”

rdaopen:“Title proper”

rdaopen:“Title”

rdagrp1:“Title

(Manifestation)”

d

dsP

sP

sP

sP

isbd:“has title proper”

isbd:“has title”

isbd:“Resource”

dd

sP

sP

eP

Possible Title semantic map(partial)

sP: rdfs:subPropertyOfd: rdfs:domain

r: rdfs:range

Page 11: Granularity in linked open data

Semantic reasoning: the sub-property ladder

isbd:“has title proper”

dct:title

rdfs:subPropertyOf

Semantic rule:If property1 sub-property of property2;Then data triple: Resource property1 “string”Implies data triple: Resource property2 “string”

isbd:”Resource” “Physics”

isbd:“has title proper”

Resource “Physics”

dct:“has title”

machineentailment

coarser

finer

dumb-up

Page 12: Granularity in linked open data

“For children aged 7-”ex:3

rda:”Intended audience (Work)”

“For ages 5-9”ex:2

isbd:”has note on use or audience”

“Primary school”ex:1

frbrer:”has intended audience”

“Juvenile”

ex:4

m21:”Target audience” m21terms:

commonaud#j

skos:prefLabel

Data triples from multiple schema

Page 13: Granularity in linked open data

“For ages 5-9”ex:2unc:”has note on use or audience”

Data triples entailed from sub-property map

“Primary school”ex:1unc:”has note on use or audience”

“For children aged 7-”ex:3unc:”has note on use or audience”

“Juvenile”ex:4unc:”has note on use or audience”

Page 14: Granularity in linked open data

ex:1 frbrer:”Work””is a”

ex:3 rda:”Work””is a”

ex:2 isbd:”Resource””is a”

Data triples entailed from property domains

Page 15: Granularity in linked open data

What is the aspect described?

CreatorAuthor

Screenwriter

Children’s cartoon screenwriterAnimation screenwriter Component

Super-Aggregate

Sub-Component

Aggregate

Focus

coarser

finer

Page 16: Granularity in linked open data

drda:”Work”

rdaroles:”Creator”

[rda:”Agent”]rdaroles:”Author (Work)”

rdaroles:”Screenwriter (Work)”

d

d

r

r

r

s

s

dct:”Creator”

dc:”Creator”

dct:”Agent”r

s

marcrel:”Author”

marcrel:”Authorof screenplay, etc.”

dc:”Contributor”

s

lcsh:”Screenwriters”

?

?

?

s: rdfs:subPropertyOfd: rdfs:domain

r: rdfs:range

?

Page 17: Granularity in linked open data

Machine-generated granularity

Full-text indexing: down to word level

A very large multilingual ontology with 5.5 millions of concepts • A wide-coverage "encyclopedic dictionary" • Obtained from the automatic integration of WordNet and Wikipedia • Enriched with automatic translations of its concepts • Connected to the Linguistic Linked Open Data cloud!

Page 18: Granularity in linked open data
Page 19: Granularity in linked open data

User-generated granularity

“OK for my kids (7 and 9)”

“Too childish for me (age 14)”

“Ideal for the child of ambitious parents”

“This sucks – for kids only”

“Great! Has cool stuff”

Page 20: Granularity in linked open data

KISS

Keep it simple, stupid

Keep it simple and stupid?

The data model is very simple: triples!

The (meta)data content is complex

The Mandelbrot Set:“an example of a complex structure arising from

the application of simple rules” - Wikipedia

Resource discovery is complex

Page 21: Granularity in linked open data

AAA

Anyone can say anything about any thing

Someone will say something about every thing

In every conceivable way

Linguistically

Page 22: Granularity in linked open data

OWA

Will all the gaps get filled?

“There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are also unknown unknowns. There are things we don't know we don't know.”- Donald Rumsfeld

Open World Assumption: the absence of a statement is not a statement of non-existence

Page 23: Granularity in linked open data

!