Easy as ABC: A triumph of re-useable metadata
Julia Hickie, Mark Raadgever
Trove Support


https://plot.ly/~wragge/6/trove-newspaper-articles-by-state/

1955


1. A web crawling bot to pick up records

2. Transformers to change the records

3. A loader to dump them in Trove
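The three parts listed above can be pictured as a small pipeline. The following is only a minimal sketch under stated assumptions: the function names (`crawl`, `strip_title`, `load`, `run_pipeline`) and the record fields are hypothetical, a dictionary stands in for the website, and a list stands in for Trove. It is not the harvester's actual code.

```python
from typing import Callable, Iterable

# Hypothetical record shape: a flat mapping of field name to value.
Record = dict[str, str]
Transformer = Callable[[Record], Record]


def crawl(pages: dict[str, str]) -> Iterable[Record]:
    """Step 1: pick up one raw record per page (a dict stands in for the web)."""
    for url, title in pages.items():
        yield {"url": url, "title": title}


def strip_title(record: Record) -> Record:
    """Step 2 (example transformer): tidy whitespace in the title."""
    return {**record, "title": record["title"].strip()}


def load(records: Iterable[Record], store: list[Record]) -> int:
    """Step 3: append records to a list standing in for the target system."""
    count = 0
    for record in records:
        store.append(record)
        count += 1
    return count


def run_pipeline(
    pages: dict[str, str],
    transformers: list[Transformer],
    store: list[Record],
) -> int:
    """Chain crawl -> transformers (in order) -> load; return records loaded."""
    records: Iterable[Record] = crawl(pages)
    for transform in transformers:
        records = map(transform, records)
    return load(records, store)
```

Keeping the transformers as a plain list means a new source can reuse the crawler and loader and swap in only the transformation steps it needs.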

Dragline loading a dump truck at German Creek brown coal open mine, Queensland, 1985. Sievers, Wolfgang. http://nla.gov.au/nla.pic-vn4801485

Tonka. Brian Auer. https://flic.kr/p/4qKzQE

Mi colección de Transformers (17/Dic/2007). Gustavo Vargas. https://flic.kr/p/4ee2Nh


Why didn’t they become a Trove contributor?

1. No resources, no money, no capability for technical change

2. Can’t meet a metadata standard (that no longer exists)


http://www.abc.net.au/radionational/feed/2887252/podcast.xml
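A podcast feed like the one above can be read with a few lines of code. This is a rough sketch that assumes the feed is ordinary RSS 2.0 with `item`, `title`, `link` and `pubDate` elements. The sample feed is invented for illustration and is not taken from the ABC, and `read_items` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Invented sample data in the usual RSS 2.0 shape (not real ABC content).
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example programme</title>
    <item>
      <title>Episode one</title>
      <link>http://example.org/episode-one</link>
      <pubDate>Mon, 01 Jul 2013 17:30:00 +1000</pubDate>
    </item>
    <item>
      <title>Episode two</title>
      <link>http://example.org/episode-two</link>
      <pubDate>Mon, 08 Jul 2013 17:30:00 +1000</pubDate>
    </item>
  </channel>
</rss>"""


def read_items(feed_xml: str) -> list[dict[str, str]]:
    """Return one dict per <item>, with empty strings for missing fields."""
    root = ET.fromstring(feed_xml)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "date": item.findtext("pubDate", default=""),
        })
    return items


if __name__ == "__main__":
    for entry in read_items(SAMPLE_FEED):
        print(entry["date"], entry["title"], entry["link"])
```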


http://www.abc.net.au/radionational/programs/healthreport/


http://www.abc.net.au/radionational/programs/healthreport/past-programs/index=2013


Diagram: Radio National Website, PHP Script, NLA Harvester, Trove (data exchanged as HTML and XML)

• Regular expressions
• XSLT stylesheets
• Java modules
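To illustrate the HTML-to-XML step with regular expressions, here is a small sketch. It is written in Python rather than PHP, and everything specific in it is an assumption made for the example: the page markup, the `record`, `identifier`, `title` and `date` element names, and the `html_to_record` helper. The real script and the real record schema are not shown in the slides.

```python
import re
import xml.etree.ElementTree as ET

# Invented page markup; a real programme page will look different.
SAMPLE_HTML = """
<html><head><title>Episode one - Example programme</title></head>
<body><h1>Episode one</h1>
<span class="date">2013-07-01</span></body></html>
"""

# Patterns are tied to the invented markup above.
TITLE_RE = re.compile(r"<h1>(.*?)</h1>", re.DOTALL)
DATE_RE = re.compile(r'<span class="date">(\d{4}-\d{2}-\d{2})</span>')


def html_to_record(html: str, url: str) -> str:
    """Pull title and date out of the HTML and wrap them in a small XML record."""
    title = TITLE_RE.search(html)
    date = DATE_RE.search(html)
    record = ET.Element("record")
    ET.SubElement(record, "identifier").text = url
    ET.SubElement(record, "title").text = title.group(1).strip() if title else ""
    ET.SubElement(record, "date").text = date.group(1) if date else ""
    return ET.tostring(record, encoding="unicode")


if __name__ == "__main__":
    print(html_to_record(SAMPLE_HTML, "http://example.org/episode-one"))
```

Regular expressions of this kind break easily when the page layout changes, which is one reason to keep them in a separate, replaceable step.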


http://trove.nla.gov.au/version/209294503


WHY?


Michael Neubert, From wheels to bikes - http://wheelbike.blogspot.com.au/2012/02/starting-search-for-bikes-in-trove.html


Chart: ABC Clickthroughs July 2013-September 2014 (monthly clickthroughs, 2013-07 to 2014-09, scale 0 to 1600; labels: Content added, Promotion)


Why Radio National


What else?

• Standardised records allow analysis of content
• Digital historians can use the API to investigate trends
• E.g. Tim Sherratt’s In a Word
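As a rough illustration of the kind of trend analysis that standardised records make possible, the sketch below counts how many records mention a word in each year. It does not call the Trove API. The records are invented, and the `title` and `date` field names and the `mentions_per_year` helper are assumptions made for the example.

```python
from collections import Counter

# Invented records; a real analysis would start from harvested data.
SAMPLE_RECORDS = [
    {"title": "Climate and health", "date": "2012-03-05"},
    {"title": "The future of health funding", "date": "2013-07-01"},
    {"title": "Coal and the economy", "date": "2013-08-12"},
    {"title": "Mental health in schools", "date": "2013-11-20"},
]


def mentions_per_year(records: list[dict[str, str]], word: str) -> dict[str, int]:
    """Count records whose title contains `word`, grouped by year (YYYY)."""
    needle = word.lower()
    counts: Counter[str] = Counter()
    for record in records:
        if needle in record["title"].lower():
            # Dates are assumed to be ISO formatted, so the year is the first 4 chars.
            counts[record["date"][:4]] += 1
    return dict(sorted(counts.items()))


if __name__ == "__main__":
    print(mentions_per_year(SAMPLE_RECORDS, "health"))
```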


http://inaword.dhistory.org


https://github.com/wragge/radio-national-data


Lessons Learned

• Adaptation of existing functions
  – Sitemap harvesting
  – RSS harvesting
  – XSLT Transformation

• Development of generic rather than specialised tools

• Staff learning opportunities – we became better at using core technology
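Sitemap harvesting, one of the functions listed above, can be sketched briefly. The example assumes a standard sitemap in the sitemaps.org 0.9 namespace with `loc` and `lastmod` elements. The sample sitemap is invented, and `urls_modified_since` is a hypothetical helper rather than part of the NLA Harvester.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Invented sample sitemap for illustration.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://example.org/programs/a</loc><lastmod>2013-07-01</lastmod></url>
  <url><loc>http://example.org/programs/b</loc><lastmod>2014-02-15</lastmod></url>
</urlset>"""


def urls_modified_since(sitemap_xml: str, since: str) -> list[str]:
    """Return page URLs whose <lastmod> is on or after `since` (YYYY-MM-DD)."""
    root = ET.fromstring(sitemap_xml)
    urls = []
    for url in root.findall("sm:url", SITEMAP_NS):
        loc = url.findtext("sm:loc", default="", namespaces=SITEMAP_NS)
        lastmod = url.findtext("sm:lastmod", default="", namespaces=SITEMAP_NS)
        # ISO dates compare correctly as plain strings.
        if lastmod >= since:
            urls.append(loc)
    return urls


if __name__ == "__main__":
    print(urls_modified_since(SAMPLE_SITEMAP, "2014-01-01"))
```

Filtering on `lastmod` lets a repeat harvest visit only the pages that changed since the previous run.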


Future

• Re-examine contributors previously unable to meet technical requirements

• Encourage re-use of the dataset – including adding it to library catalogues as well as scholarly analysis

• Think beyond conventional data


Questions?