Cloud-sourcing Research Collections: a model for strategic change

23
Constance Malpas Program Officer OCLC Research Cloud-sourcing Research Collections: a model for strategic change ARL Fall Forum Achieving Strategic Change in Libraries 15 October 2010

description

Cloud-sourcing Research Collections: a model for strategic change. ARL Fall Forum Achieving Strategic Change in Libraries 15 October 2010. Constance Malpas Program Officer OCLC Research. Cloud Library Project (2009/2010). - PowerPoint PPT Presentation

Transcript of Cloud-sourcing Research Collections: a model for strategic change

Page 1: Cloud-sourcing Research Collections:   a model for strategic change

Constance MalpasProgram OfficerOCLC Research

Cloud-sourcing Research Collections: a model for strategic change

ARL Fall ForumAchieving

Strategic Change in Libraries

15 October 2010

Page 2: Cloud-sourcing Research Collections:   a model for strategic change

Cloud Library Project (2009/2010)

Objective: Assess feasibility of ‘externalizing’ library repository functions to network service providers

Scope: Legacy monographic collections in mass-digitized corpus

Case study: • Motivated client: NYU• Shared print repository: ReCAP• Shared digital repository: HathiTrust

Method: Analyze holdings overlap for generic ARL service population; assess opportunity for surrogate service provision and relegation of local holdings; quantify potential space savings and cost avoidance

Page 3: Cloud-sourcing Research Collections:   a model for strategic change

25 years+70M vols.

0101010101010101010101010101010101010101010101010101010101010101010101010101010101010101010

9 months +3M vols.

Our Starting Point: June 2009

Will this intersection create new operational efficiencies? For which libraries? Under what conditions? How soon and with what impact?

HathiTrust

Library off-site storage

Page 4: Cloud-sourcing Research Collections:   a model for strategic change

Context: a switching point in library operations

Network service provision entails a “trade-off between cost and control” Nicholas Carr, The Big Switch (2008)

“turnouts, Knoxville, Tennessee” by Steve Minor

In libraries …• shift to licensed provisioning,

shared management infrastructure; gain in operational efficiencies

• print still managed as local asset; increasing attention to long-term costs of redundant inventory, burdensome operations

Page 5: Cloud-sourcing Research Collections:   a model for strategic change

Attention Switch: from Print to Electronic

Academic Library Expenditures on Purchased and Licensed Content

0%10%20%30%40%50%60%70%80%90%

19982000

20022004

20062008

20142020

Print books and journalsE-journals and e-books

Projected change

Derived from US Dept of Education, NCES, Academic Libraries Survey, 1998-2008

You are here

Page 6: Cloud-sourcing Research Collections:   a model for strategic change

Format Transition is a Driver

$- $5,000,000 $10,000,000 $15,000,000 $20,000,000 $25,000,000 $30,000,000 $35,000,000 $40,000,000 0

10

20

30

40

50

60

70

80

90

100

Library Materials Expenditures (2007-2008)

Lic

ense

d Co

nten

t as

% o

f Li

brar

y M

ater

ials

$

Derived from ARL Annual Statistics, 2007-2008

50% of ARL materials budget

Growing number of institutions with e-centric acquisitions, service model

Shrinking pool of libraries with mission and resources to sustain print preservation as ‘core’ operation

Page 7: Cloud-sourcing Research Collections:   a model for strategic change

Medium Discounted Life Cycle

Cost (per unit)

Total Life Cycle Cost (per unit)

Purchase Cost

(per unit)

Total Cost / Purchase

Cost (per unit)

Monographs $ 119.56 $ 343.03 $ 47.78 718%Current serials

$ 634.91 801.87 590.97 134

Microforms $ 0.27 0.45 0.11 256Govt. Docs $ 14.13 55.40 0.00 311MSS & Archives

$ 20.26 126.79 4.46 1130

Maps $ 26.78 73.82 11.05 247Graphic materials

$ 1.65 2.91 0.06 216216

Sound recordings

$ 22.64 24.77 6.80 219

Video & Film $ 128.95 107.50 15.70 307Computer files

$ 0.17 0.07 0.01 331

Potential life-cyclecost savings of

(119.56-47.78)*500,000 titles=$35,890,000

S. Lawrence et al. (2001) Based on 1999 ARL Data

“monographs are overwhelmingly the largest source or driver of library costs . . . If research libraries want to control their costs, they must work to control and reduce the life cycle costs of maintaining their monograph collections” Lawrence , Connaway & Brigham (2001)

Page 8: Cloud-sourcing Research Collections:   a model for strategic change

Inertia: a hidden cost driver?

Cost of management decreases as collections move off-site; the sooner they leave, the greater the savings

Source: P. Courant and M. Nielson (CLIR, 2010)

If 13% of on-campus collection circulates, more than 80% of the expenditure on locally managed collections delivers ‘symbolic’ value

Page 9: Cloud-sourcing Research Collections:   a model for strategic change

Enter the Elephant . . .

Jun-09Jul-0

9

Aug-09

Sep-09

Oct-09

Nov-09

Dec-09

Jan-10

Feb-10

Mar-10

Apr-10

May-10

Jun-100

1,000,000

2,000,000

3,000,000

4,000,000

5,000,000

6,000,000

7,000,000

Growth of HathiTrust Digital LibraryJune 2009 - June 2010

Full View Titles Limited View Titles Total Volumes

Equal in scope to e.g. University of

Minnesota

Equal in size to median ARL

library

Page 10: Cloud-sourcing Research Collections:   a model for strategic change

0 20 40 60 80 100 1200%

10%

20%

30%

40%

50%

60%

Rank in 2008 ARL Investment Index

% o

f Ti

tles

in L

ocal

Col

lect

ion

A global change in the library environment

June 2010Median duplication: 31%

June 2009Median duplication: 19%

Academic print book collection already substantially duplicated in mass digitized book corpus

Page 11: Cloud-sourcing Research Collections:   a model for strategic change

Mass Digitized Books in Shared Repositories

Sep-09 Oct-09 Nov-09 Dec-09 Jan-10 Feb-10 Mar-10 Apr-10 May-10 Jun-100

500,000

1,000,000

1,500,000

2,000,000

2,500,000

3,000,000

3,500,000

Mass digitized books in Hathi digital repository Mass digitized books in shared print repositories

Uni

que

Titl

es

~75% of mass digitized corpus is ‘backed up’ in one or more shared print repositories

~3.6M titles

~2.5M

Page 12: Cloud-sourcing Research Collections:   a model for strategic change

Sep-09 Oct-09 Nov-09 Dec-09 Jan-10 Feb-10 Mar-10 Apr-10 May-10 Jun-100%

10%

20%

30%

40%

50%

60%

70%

80%

% o

f M

ass

Dig

itize

d Co

rpus

Dup

licat

edShared Print Service Provision: Capacity Varies

Library of Congress

UC NRLF/SRLF

ReCAP

CRL

Union of 5 major shared print collections

Page 13: Cloud-sourcing Research Collections:   a model for strategic change

Prediction

Within the next 5-10 years, focus of shared print archiving and service provision will shift to monographic collections

• large scale service hubs will provide low-cost print management on a subscription basis;

• reducing local expenditure on print operations, releasing space for new uses and facilitating a redirection of library resources;

• enabling rationalization of aggregate print collection and renovation of library service portfolio

Mass digitization of retrospective print collections will drive this transition

Page 14: Cloud-sourcing Research Collections:   a model for strategic change

A ‘Green’ Strategy: Cooperative Management

Data current as of June 2010

Shared print provision could enable at least 32K linear feet of space recovery …

Page 15: Cloud-sourcing Research Collections:   a model for strategic change

A Strategy to Transform Library Operations

Data current as of June 2010

Represents +30K linear feet of

shelving and at least $440K in

annual cost avoidance*

* Assuming relegation for titles held by >99 libraries; one volume per title * $.86

Page 16: Cloud-sourcing Research Collections:   a model for strategic change

An Achievable Strategy -- if latent shared print capacity is activated

Data current as of June 2010

95% of target30K linear ft $420K

Page 17: Cloud-sourcing Research Collections:   a model for strategic change

Implications: Shared Print

A small number of repositories may suffice for ‘global’ shared print provision of low-use monographs

Generic service offer is needed to achieve economies of scale, build network; uniform T&C

Fuller disclosure of storage collections is needed to judge capacity of current infrastructure, identify potential hubs

Service hubs will need to shape inventory to market needs; more widely duplicated, moderately used titles

If extant providers aren’t motivated to change service model, a new organization may be needed

Page 18: Cloud-sourcing Research Collections:   a model for strategic change

Recommendations: Where to Start

• If your institution has significant holdings in storage swap your symbol so that aggregate preservation resource is addressable and carrying capacity can be assessed

• Use the mass digitized book corpus as driver for de-duplication and storage transfers; strengthen preservation infrastructure where it is most needed

• Retain on-site only those titles for which demand and local value exceeds the (significant) economic and opportunity costs of local management; est. 13% circ rate does not justify current expenditure pattern

Page 19: Cloud-sourcing Research Collections:   a model for strategic change

Recommendations: What to Stop

• Storage transfers that don’t meet a known preservation need; local space pressures (alone) shouldn’t dictate what moves first or farthest

• Preservation strategies that presume local autonomy; the scholarly record is a shared asset and its preservation is a collective responsibility

• Enhancing bibliographic records for digitized content, beyond the addition of standard identifiers; let network visibility and full-text search hasten the migration of inventory from stacks to storage

Page 20: Cloud-sourcing Research Collections:   a model for strategic change

Proposal: Pilot a Strategic Print Reserve

Largest shared storage collections use mass-digitized titles held in common to characterize generic service offer and common price point for a ‘national print reserve’

Compare:• Availability in print (restricted collections, NOS,

loss rate) • Delivery timetable (including home-delivery

option)• Repository characteristics (environmental

conditions, etc.)• Transaction costs (establish baseline, look for

efficiencies)

subscription cost based on N titles * ($.86 * x) / no. participants in region holding print version; service

level sufficient to enable reduction in inventory

Page 21: Cloud-sourcing Research Collections:   a model for strategic change

An Ideal Time for Experimentation …

Source: Gartner's Hype Cycle Special Report for 2010

2-5 years to refine the model

Page 22: Cloud-sourcing Research Collections:   a model for strategic change

Acknowledgments

Cloud Library Project staff:Michael Stoller, Bob Wolven, Matthew Sheehy (NYU & ReCAP)Kat Hagedorn, Jeremy York (HathiTrust)Roy Tennant, Bruce Washburn, Jenny Toves (OCLC Research)

Sponsors:Carol Mandel, Jim Michalko, Jim Neal, John Wilkin

Funder:Andrew W. Mellon Foundation

Page 23: Cloud-sourcing Research Collections:   a model for strategic change