High Water Raises All Boats: Leveraging Partnerships on Campus to Build a Repository

Mary Molinaro

University of Kentucky Libraries

http://www.flickr.com/photos/gnuckx/5040689842/

More Than a “Library” Issue

Worked on institutional repository (IR) issues since 2003

Opening Doors to Open Scholarship (white paper, 2007)

http://uknowledge.uky.edu/libraries_reports/1/

Advocated for campus-wide support

Repository Model (2007)

We have to do something!

Digital Commons and bepress

Hosted solution

Indexed by Google and Google Scholar

OAI compliant

Persistent URL

Creation of peer-reviewed journals

Content can be delivered back to us

What are we putting into UKnowledge?

Electronic Theses and Dissertations

Articles

Books

Presentations

University press materials

Concurrently…

Kentucky Digital Library Migration

DLXS to what?

Has to accommodate multiple kinds of data

Not simply a CMS

Use all open source tools

Blacklight as the discovery layer

Settled on a Micro-Services based repository

Why Micro-Services?

Nimble

Flexible

Modular

Easily replaced

Supports our overall strategy

https://confluence.ucop.edu/display/Curation/Home

At the same time a storm is brewing…

New data management requirements from federal granting agencies (NSF, NIH, etc.)

Various solutions popping up around campus

http://www.flickr.com/photos/torek/3675252198/

Partners: Specialized Expertise

Library: digital preservation, metadata

Information Technology: hardware, infrastructure, cloud storage solutions

Research: proposal development, agency requirements

Educating partners

Working from a position of strength

Leverage our expertise and experience

Confidence in the approach

Benchmark Survey/Comparison

University of Maryland - College Park

University of Michigan - Ann Arbor

University of Minnesota - Twin Cities

University of North Carolina at Chapel Hill

University of Pittsburgh - Pittsburgh Campus

University of Virginia

University of Washington - Seattle Campus

University of Wisconsin - Madison

University of Florida

North Texas University

Georgia Institute of Technology

Ohio State University

Pennsylvania State University

Rutgers University - New Brunswick

Texas A & M University

The University of Texas at Austin

University of California - Berkeley

University of California - Davis

University of California - Los Angeles

University of California - San Diego

University of Illinois at Urbana-Champaign

University of Illinois

Options

Contract with a vendor

Leverage DSpace

Bring up a Fedora repository

Look for a new approach

Enter Micro-Services! Again.

Nimble

Flexible

Modular

Easily replaced

Supports our overall strategy

http://www.cdlib.org/services/uc3/curation/

Timing is Everything

NSF & NIH Research Grant Associated Data

Electronic Records

E-Print

UKnowledge

Digital Media

Digital Library Content

Other Research Data

Micro-Services to address back-end repository functions

Hydra to address front-end repository functions

Common Metadata Store

Hybrid Cloud Storage – File & Object Based

Our basic strategy

Hydra Front-End - Think front-end website

Archivematica - workflow engine

CDL Micro-Services for the back-end services: ingestion, storage, fixity, identity, preservation, indexing
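Of the back-end services listed, fixity is the easiest to illustrate in a few lines. The sketch below is not the CDL Micro-Services code or API; it only shows the general idea of recording a checksum at ingest and re-checking it later. The function names (`sha256_of`, `record_fixity`, `check_fixity`) and the choice of SHA-256 are assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def record_fixity(paths):
    """Build a manifest mapping each file path to its digest at ingest time."""
    return {str(p): sha256_of(p) for p in paths}


def check_fixity(manifest):
    """Return the paths that are missing or whose digest no longer matches."""
    failures = []
    for name, expected in manifest.items():
        path = Path(name)
        if not path.is_file() or sha256_of(path) != expected:
            failures.append(name)
    return failures
```

A repository would typically store the manifest alongside the object and run the check on a schedule; an empty list from `check_fixity` means every file still matches its recorded digest.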

Scope - Initially

Service research data management needs, providing meaningful & robust services to researchers; specifically, NSF & NIH requirements

Workflow

Metadata Management

Federation - Security & Identity

Curation

Storage - Lowest Cost

Scope - Future

Provide Enterprise Repository Services for much of UK from a data management & research perspective

Infrastructure Considerations

Overlap with High Performance Computing & Cloud storage space

Virtualized Infrastructure

Partner Roles

Library: technical lead, metadata schema, data management plans

Information Technology: storage solutions (spinning disk, tape, cloud), virtualized infrastructure, security and authorization

Research: policies, governance, proposal development with researchers

Next steps

Policies and governance

Meetings with research stakeholders

Funding models

Data management tool

Personnel (Tech Leads)

Eric Weig

Jason Conley

Resources

Hydra - https://wiki.duraspace.org/display/hydra/The+Hydra+Project

Microservices - https://confluence.ucop.edu/display/Curation/Home

Blacklight - http://www.lib.virginia.edu/digital/resndev/blacklight.html

Archivematica - http://archivematica.org/wiki/index.php?title=Main_Page

DMPTool - https://dmp.cdlib.org/

Sustainable preservation strategies are not built all at once, nor are they static. Sustainable preservation is a series of timely actions taken to anticipate the dynamic nature of digital information.

- Sustainable Economics for a Digital Planet: Ensuring Long-Term Access to Digital Information. Blue Ribbon Task Force on Sustainable Preservation and Access. February 2010. http://brtf.sdsc.edu/