Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital...

52
Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting March 17, 2014

Transcript of Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital...

Page 1: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Unleashing Expressivity

Linked Data for Digital Collections Managers

Cory LampertHead, Digital Collections

Mountain West Digital Library Hubs MeetingMarch 17, 2014

Page 2: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Agenda

• Why? Primer on LD for Digital Library Managers • Opening our minds to a post-”record” world• Empowering others through practical exploration– Technologies– Transforming metadata into linked data

• Let’s see it!• Next steps

Page 3: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Linked Data Overview

• My collections are already visible through Google; so who cares• This is a topic for catalogers• It’s too technical / complicated / boring

Actually ... • Linked data is the future of the Web• Data will no longer be in trapped in silos imposed by systems, collections, or

records • Exposed open data presents new opportunities for users

Page 4: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

What is Linked Data?

• Linked Data refers to a set of best practices for publishing and interlinking data on the Web

• Data needs to be machine-readable

• Linked data (Web of Data) is an expansion of the Web we know (Web of documents)

Page 5: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Current Practice

• Data (or metadata) encapsulated in records• Records contained in collections• Very few links are created within and/or across collections• Links have to be manually created• Existing links do not specify the nature of the relationships

among recordsThis structure hides potential links within and across collections

Page 6: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

What we can do with linked data

• Free data from silos• Expose relationships• Powerful, seamless, interlinking of our data• Users interact or query data in new ways• Search results would be more precise• Data can be easily repurposed

Page 7: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Why?Our data needs an upgrade.

http://5stardata.info/

Page 8: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

The Linked Data Cloud

Page 9: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

A Post-”record” world• Despite limitations, we have already invested lots of resources in

the metadata! It is valuable.

• Triples represent the next evolution in powerful, flexible, standards-based interoperable metadata.

• Each metadata field may produce one or several statements

• One metadata record can produce many, many, triples

Page 10: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

How can we create linked data?

• Our metadata records are deconstructed in triples (statements) that are machine-readable

• Triples are expressed as: Subject – Predicate - Object For example: This book – has creator – Tom Heath

This book – has title – Linked Data: Evolving the…” • Subjects, predicates and most objects should have unique identifiers

(URIs) creating data that can be used in Web architecture (HTTP)• These statements are expressed using the Resource Description

Framework (RDF)• Linked data can be queried using SPARQL

Page 11: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Example of a metadata record

Page 12: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Expressing metadata as triples

• <this thing> <has creator> <Las Vegas News Bureau>• < this thing > <has genre> <Photographic print>• < this thing> <depicts> <Frank Sinatra>• < this thing> <depicts> <Jack Entratter>-------------------------------------------------------------------• <Frank Sinatra> <has profession> <entertainer>• <Jack Entratter> <has profession> <theatrical producer>

Page 13: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Graphical Representation

Page 14: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Examples of records

Showgirls Menus

Dreaming theSkyline

Page 15: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

title

Page 16: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

How can I transform textual triples into machine-readable?

• We need a data model• Europeana Data Model gives us a framework to help

organize, structure, and define which predicates we are going to use

• Adopting an existing model is preferable to creating your own (interoperability)

Page 17: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

title

Page 18: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Triples with URIs & EDM model predicates

(Local URI)

Page 19: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Machine-readable triple@prefix dc: <http://purl.org/dc/elements/1.1/> . @prefix edm: <http://www.europeana.eu/schemas/edm/> . @prefix foaf: <http://xmlns.com/foaf/0.1/> .

<http://digloc7.library.unlv.edu:8890/ProvidedCHO/sho000071> dc:creator http://digcol7.library.unlv.edu:8890/Agent/Las-Vegas-News-Bureau .

<http://digloc7.library.unlv.edu:8890/ProvidedCHO/sho000071> foaf:depicts <http://id.loc.gov/authorities/names/n50026395> .

<http://digloc7.library.unlv.edu:8890/ProvidedCHO/sho000071> edm:hasType http://id.loc.gov/vocabulary/graphicMaterials/tgm007779 .

Page 20: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

“I’m a digital collections manager”…

• What is known? – lots of THEORY and lots of TECHNICAL information

• What is happening? – a move toward PRACTICE and APPLICATION in libraries by non-programmers

• Is there a “recipe” yet? - No. But, our staff CAN do significant work to prepare for linked data and to understand linked data principles, even if it isn’t realistic to run a parallel process.

Page 21: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

UNLV Linked Data ProjectGoals: • Study the feasibility of developing a common process that

would allow the conversion of our collection records into linked data preserving their original expressivity and richness

• Publish data from our collections in the Linked Data Cloud to improve discoverability and connections with other related data sets on the Web

Page 22: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Actions Technologies

Prepare dataExport data

Import dataPublish

Open Refine

Mulgara /Virtuoso

CONTENTdm

Import dataClean dataReconcileGenerate triplesExport RDF

Page 23: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Export Data and Prepare

• Increase consistency across collections: – metadata element labels– use of CV, – share local CVs– etc.

• Map metadata elements with predicates from data model• Export data as spreadsheet

Page 24: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

OpenRefine

• Open Source

• It is a server – can communicate with other datasets via http

• Install Open Refine and its RDF extension

Screenshots to cover some functions we have used so far

Page 25: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

OpenRefine first screen

Page 26: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 27: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Facet

Page 28: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 29: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 30: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Split multi-value cells

Page 31: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 32: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 33: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Reconciliation

Page 34: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Specifying Reconcilation Service

Page 35: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Activating Reconcilation

Page 36: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 37: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Creating a Skeleton

Page 38: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 39: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 40: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Exporting RDF Files

Page 41: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Actions Technologies

Prepare dataExport data

Import dataPublishQuery

Open Refine

Mulgara /Virtuoso

CONTENTdm

Import dataClean dataReconcileGenerate triplesExport RDF

Page 42: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Mulgara Triple Store: Import

Page 43: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 44: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Simple SPARQL query

Select *

Where {?s ?p ?o} limit 100

Page 45: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 46: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Yeah, but what does it look like – to humans?

• Pivot Viewer and Virtuoso: http://www.microsoft.com/silverlight/pivotviewer/

• RelFinder: www.visualdataweb.org/relfinder.php

• Gelphi: Linked Jazz: http://linkedjazz.org/network/

Page 47: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Query an interface

Page 48: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 49: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 50: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.
Page 51: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Next Steps• Understand workflow for transformation of digital

collections into linked data (parallel structure)• Publish data; understand skills needed for best practices• Increase linkage with other datasets• Explore interfaces and advocate for our users; to access and

display our data and related data from other datasets• Collaborate and partner with others

Page 52: Unleashing Expressivity Linked Data for Digital Collections Managers Cory Lampert Head, Digital Collections Mountain West Digital Library Hubs Meeting.

Thank You!

Questions?