Robust Links - a proposed solution to reference rot in scholarly communication

Robust Links

@mart1nkle1n

IIPC WAC, 06/16/2017, London, UK

Robust Links - A Proposed Solution to Reference Rot in Scholarly Communication

Martin Klein @mart1nkle1n

Herbert Van de Sompel @hvdsomp

Research Library

Los Alamos National Laboratory

http://robustlinks.mementoweb.org/

Problem

• Links break

• Referenced content changes

The author's intention with such links is not reproducible!

Can we:

1. make links more robust?

2. make them actionable for humans and machines?

EPA, 12/2016:

https://web.archive.org/web/20161228184110/https://www.epa.gov/climatechange

EPA, today:

https://www.epa.gov/sites/production/files/signpost/cc.html

Robust Links

1. Create a snapshot of referenced resources in a public web archive

Common Practice – Archived URI

URI of archived snapshot (URI-M):

https://web.archive.org/web/20071011083729/http://islandheritage.org/faq.html

Capture datetime: 20071011083729, embedded in the URI-M

But what if the URI-M is:

http://archive.is/MTMKu

or

https://www.mummify.it/XbmcMfE3
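The contrast on this slide can be sketched in a few lines of Python. This is a minimal illustration, not part of the talk: the function name `parse_wayback_uri_m` and the regular expression are hypothetical, and the sketch assumes the 14-digit Wayback datetime is expressed in UTC. It recovers the capture datetime and original URI from a Wayback-style URI-M and returns `None` for opaque URI-Ms such as the archive.is and mummify.it examples, which carry neither piece of information.

```python
import re
from datetime import datetime, timezone

# Assumed layout of a Wayback-style URI-M:
#   https://web.archive.org/web/<14-digit datetime>/<original URI>
WAYBACK_PATTERN = re.compile(r"^https?://web\.archive\.org/web/(\d{14})/(.+)$")


def parse_wayback_uri_m(uri_m):
    """Return (capture_datetime, original_uri), or None if the URI-M is opaque."""
    match = WAYBACK_PATTERN.match(uri_m)
    if match is None:
        return None
    stamp, original_uri = match.groups()
    # Assumption: the embedded datetime is UTC.
    captured = datetime.strptime(stamp, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)
    return captured, original_uri


for uri_m in (
    "https://web.archive.org/web/20071011083729/http://islandheritage.org/faq.html",
    "http://archive.is/MTMKu",
    "https://www.mummify.it/XbmcMfE3",
):
    print(uri_m, "->", parse_wayback_uri_m(uri_m))
```

Only the first URI-M yields a result; for the other two, the original URI and capture datetime would have to be conveyed some other way.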

Robust Links

1. Create a snapshot of referenced resources in a publicly available web archive

2. Decorate links with:

• URI of archived snapshot

• datetime of archiving

• resource’s original URI

Benefits:

• Can visit live version of referenced resource

• Original URI allows finding captures in all web archives

• Capture datetime allows finding an appropriate capture in all web archives

• Uniform, machine-actionable
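One way to picture how the original URI and capture datetime make captures findable across archives is Memento datetime negotiation (RFC 7089): a client asks a TimeGate for the original URI and states the desired datetime in an `Accept-Datetime` header. The sketch below only builds such a request and sends nothing. The Time Travel aggregator endpoint in `TIMEGATE_PREFIX` and the function name `timegate_request` are assumptions for illustration and do not come from the slides.

```python
from datetime import datetime, timezone
from email.utils import format_datetime

# Assumption: TimeGate endpoint of the Memento Time Travel aggregator.
TIMEGATE_PREFIX = "http://timetravel.mementoweb.org/timegate/"


def timegate_request(original_uri, version_date):
    """Build (url, headers) for a Memento datetime negotiation request.

    version_date is a YYYY-MM-DD string; midnight UTC is assumed.
    """
    when = datetime.strptime(version_date, "%Y-%m-%d").replace(tzinfo=timezone.utc)
    # Accept-Datetime uses the RFC 1123 date format, always in GMT.
    headers = {"Accept-Datetime": format_datetime(when, usegmt=True)}
    return TIMEGATE_PREFIX + original_uri, headers


url, headers = timegate_request("http://netpreserve.org/wac2017/", "2017-06-13")
print(url)
print(headers)
```

A client would issue a GET or HEAD request with these headers and follow the redirect to the capture closest to the requested datetime, regardless of which archive holds it.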

Link Decoration

<a href="http://netpreserve.org/wac2017/">IIPC Web Archiving Week</a>

http://robustlinks.mementoweb.org/spec

Link Decoration

<a href="http://netpreserve.org/wac2017/"
   data-versionurl="http://archive.is/1y5w3"
   data-versiondate="2017-06-13">IIPC Web Archiving Week</a>

http://robustlinks.mementoweb.org/spec
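A decorated link of this form could be produced programmatically. The following is a minimal sketch, assuming the three pieces of information (original URI, snapshot URI, archiving date) are already known. The helper name `decorate_link` is hypothetical; only the `data-versionurl` and `data-versiondate` attribute names are taken from the slide.

```python
from html import escape


def decorate_link(text, original_uri, snapshot_uri, version_date):
    """Return an <a> element that links to the original URI and carries
    the snapshot URI and archiving date as data- attributes."""
    return '<a href="{}" data-versionurl="{}" data-versiondate="{}">{}</a>'.format(
        escape(original_uri, quote=True),
        escape(snapshot_uri, quote=True),
        escape(version_date, quote=True),
        escape(text),
    )


print(decorate_link(
    "IIPC Web Archiving Week",
    "http://netpreserve.org/wac2017/",
    "http://archive.is/1y5w3",
    "2017-06-13",
))
```

Escaping the attribute values keeps the markup well-formed when a URI contains characters such as `&` or `"`.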

Link Decoration

<a href="http://archive.is/1y5w3"
   data-originalurl="http://netpreserve.org/wac2017/"
   data-versiondate="2017-06-13">IIPC Web Archiving Week</a>

http://robustlinks.mementoweb.org/spec
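Because both decoration styles carry the same three pieces of information, a machine can normalize them into one record. The sketch below uses Python's standard `html.parser` to illustrate this. The class and function names are hypothetical, and it handles only the two attribute combinations shown on these slides, not everything the specification may allow.

```python
from html.parser import HTMLParser


class RobustLinkParser(HTMLParser):
    """Collect robust links as dicts with original, snapshot, and date."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        a = dict(attrs)
        if "href" not in a or "data-versiondate" not in a:
            return
        if "data-versionurl" in a:
            # href points at the original resource
            original, snapshot = a["href"], a["data-versionurl"]
        elif "data-originalurl" in a:
            # href points at the archived snapshot
            original, snapshot = a["data-originalurl"], a["href"]
        else:
            return
        self.links.append(
            {"original": original, "snapshot": snapshot, "date": a["data-versiondate"]}
        )


def extract_robust_links(html_text):
    parser = RobustLinkParser()
    parser.feed(html_text)
    parser.close()
    return parser.links


sample = (
    '<a href="http://netpreserve.org/wac2017/" '
    'data-versionurl="http://archive.is/1y5w3" '
    'data-versiondate="2017-06-13">IIPC Web Archiving Week</a> '
    '<a href="http://archive.is/1y5w3" '
    'data-originalurl="http://netpreserve.org/wac2017/" '
    'data-versiondate="2017-06-13">IIPC Web Archiving Week</a>'
)
for link in extract_robust_links(sample):
    print(link)
```

Both anchors in the sample normalize to the same record, which is what makes the decoration uniform for downstream tools.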

Link Decoration in Action

http://dx.doi.org/10.1045/november2015-vandesompel
