Understanding Distribution of and Value/Impact Generation in DP Infrastructures

download Understanding Distribution of  and Value/Impact Generation in DP Infrastructures

If you can't read please download the document

description

OAIS Extensions within the Archive-Centric Information Lifecycle Matthias Hemmje – FTK APARSEN- EGI-Community-Forum Training on Data Preservation, May , 22 .05.2014. - PowerPoint PPT Presentation

Transcript of Understanding Distribution of and Value/Impact Generation in DP Infrastructures

Folie 1

OAIS Extensions within the Archive-Centric Information Lifecycle Matthias Hemmje FTK APARSEN-EGI-Community-Forum Training on Data Preservation, May , 22.05.2014Co-ordinated byaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Understanding Distribution of and Value/Impact Generation in DP InfrastructuresRevisiting Societal Vision and Technological Barriers in the Macro EnvironmentSupporting Technology Trends & Drivers in the Macro EnvironmentHow does OAIS enable Valorzation, i.e., transfersal Impact Generation?OAIS Extensions within the Archive-Centric Information LifecycleExemplar Valorization ScenariosSuccessful Reference Innovationaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Revisiting Societal Vision and Technological Barriers in theEconomic Macro Environment

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Without a collective memory, we are nothing, and can achieve nothing. It defines our identity and we use it continuously for education, work and leisure

The Internet is the most powerful new tool we have had for storing and sharing information since the Gutenberg press, so lets use it to make the material in Europes libraries and archives accessible to allViviane Reding

European cooperation is an obvious necessity in this field: it is about ensuring preservation and access to our common cultural heritage for the future generationsJan FigelWhat is the Societal Impact Vision behind the VCOE?Collective Memory European Commission (2006)aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6What is the Societal Impact Vision behind the VCOE? (II)Collective Memory & E-Infrastructures EC Impact is ROIIt is worth highlighting the intentions of EC to secure the value of investments already made into e-Infrastructures This will, amongst other measures, be achieved by means of funding DP and PI deployments These are intended to act as insurance policies for scientific production. In consequence, transversal outreach, take-up and re-use into Scientific Communities, Public Infrastructures (e.g. Memory Institutions), and Industrial Innovation is aimed at as major impact potentialsaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6The concept of a (Digital) Collective Memory has been used to describe the convergence of libraries, museums, archives and collections of all kinds including those of private citizens.

Especially in Europe the connection of Collective Memory with its Cultural Heritage has been recognized and played an important role in interdisciplinary research in FP7.

Technical ChallengesAppraisal, Assembly, Packaging, Ingest Archival and Storage, Administration and PreservationClassification, Indexing, Information Retrieval, and AccessUser Interfaces and PresentationContent Adaptation, Dissemination and Re-useWhat barriers to overcome in enabling this Impact Vision?Collective Memory ICT Contextaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6How does OAIS enable Valorization, i.e., transfersal Impact Generation?

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

Where is the business value and impact generated?Collective Memory Is the Impact Vision just an OAIS?aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6 Looking at OAIS we havea very high level conceptual model of the DP domain Currently support for the explicit management ofStatic DP processes Static, homogeneous information object, media and information package typesSo far no support for the explicit management of production cost vs. re-use value, i.e., no management of Return on Investment (ROI)So far no support for managing producer consumer (business) relationships!Properties of OAIS regarding valorization?Collective Memory Archival & DP Return On Investmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Supporting Technology Trends and Driversin the Macro Environment

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6tTraditional Information and PublicationNext-generation Knowledge Products & ServicesOnline Digital Libraries WWW, Internet Technology,Document formats for online publicationTodayService-oriented architectures,Grid&Cloud infrastructures, semantic technologies, digital library and collaboration servicesEnabling TechnologiesCWEsGlobal & EfficientAccessVirtual DLse-InfrastructuresCollectionAccessPreservationSem. WebsOAIS Extension opportunities supporting valorization?Technology Trend (I): Web 2.0 and Semanticsaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

IdeaWorldwide usage of resources (computational / storage, services)Born from the scientific requirements on huge amounts of storage and computational resources VisionConsume DP resources from the internet as easy as storage and processing from grids and clouds AdvantagesDynamic allocation of resourcesCross-organizational resource sharingResource owner still have full controlSecurity infrastructure OAIS Extension opportunities supporting valorization?Technology Trend (II): Resource Virtualizationaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6ServiceBrokerServiceProviderServiceRequestorFindBindPublishServiceDescriptionServiceDescriptionServiceOAIS Extension opportunities supporting valorization?Technology Trend (III): Service-oriented Architecturesaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Distributed, i.e., Virtualized approaches to Digital Preservation Infrastructure supportHigh Volume, High Throughput, Resource on DemandDistributed Curation of Dynamic and Volatile Digital ContentKeeping track of Evolving Meaning in Production, Archival and Usage Context of Digital ContentSafeguarding Integrity, Authenticity and Accessibility over timeDistribution models enabling distributed, i.e., service-oriented approaches to Digital PreservationOAIS Extension Opportunities regarding valorization?Summary of Opportunities in Virtualized DP Infrastructuresaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6OAIS Extensions within the Archive-Centric Information Lifecycle

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Distribution, Processes and Integration of Archive SystemsCreationAssemblyArchivalReuseAdoptionIngestAccessPost-AccessPre-IngestProduction of new digital objectsUse according to original purposeArchival often right after creation, in parallel to useAppraisal of objects relevant for archivalCompilation and enrichment of objects to preserveCreation of Submission Information Package (SIP) Life-time of objects inside archiveOften, perpetual activityEnable use by designated communityManagement of Archival Information Packages (AIP) Receiving and Examination of Dissemination Information Packages (DIP)Adaption and integration of digital objects into working environmentRecontextualize digital objects and accompanying information for prospective reuseExploitation of digital object by consumerOften re-purposing of digital objectsPotential outcomeCreation of new digital objectsRevision of digital objectsExtension or update of metadataAnnotationConceptual Background Archive-centric Information LifecycleExtending OAIS with pre-ingest and post-access valorizationaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Exemplar Valorization Scenarios in Sciencewith the Memory Institutions SegmentBooklike Publication Scenario

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

Exemplar Valorization Scenarios in Science with Memory Institutions: Indexing and Cataloguing of Publicationsaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-618Ingest, archival and use of book-like publicationsMetadata validation and conversionGeneration of additional metadata and URNCataloguing publicationGeneration of Information PackagesArchival and retrieval of archived objectsData created for book-like publicationsBibliographic MetadataTechnical MetadataStructural MetadataAuthority FilesURNComment and AnalysisMETS file containing metadataArchival (AIP) and Dissemination (DIP) Information PackagesProducers, preserver and user of book-like publicationsPublisher, UniversitiesDeposit libraryEnd-User (Researcher) StakeholdersWorkflowsContentOperationalizationArchival of digital scientific publicationsExemplar Valorization Scenarios in Sciencewith Memory Institutions Depot Libraries Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6ChallengesOrganisation of context capturing within the regular course of business activitiesFocus on interactions of stakeholders before and during the ingest processesSpanning the whole process from production to archivingPreservation of persistent links between different manifestations of scientific publications

19Digitization Scenario

Brief Introduction to theExemplar Valorization Scenariosin Sciencewith Memory Institutions Digitization Centre Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

Exemplar Valorization Scenarios in Science withMemory Institutions: Manual Digitization of Sensitive Booksaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

Exemplar Valorization Scenarios in Science withMemory Institutions: Automated Digitization of Sensitive Booksaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

Exemplar Valorization ScenariosMemory Institutions: Digitization Centres Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6DataHigh-quality TIFFsKarge volumesOptional fulltext in TEIUpcoming standard: JPEG 2000MetadataMets file includingDescriptive metadataStructural metadataScholarsDigitization CenterLibrary

Archival of Digitization materialIntegration into Digitization workflowStakeholdersWorkflowsContentOperationalizationSemi-automaticSelectionProductionQuality ControlMetadata Capture / IndexingProviding accessPreserving (incl. format migration)Exemplar Valorization Scenariosfor Memory Institutions Digitization Centres Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-624Exemplar Valorization Scenarios withIndustry Scientific Publishing & Conference Organizers (PCOs)scientific Publishing Scenario

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

Exemplar Valorization Scenarios withPublishers: Scientific Conferences and PCO Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6[Introduction with real-life situation] Prof. Frank Schneider, president of the German Association for Psychiatry and Psychotherapy (DGPPN), giving his introductory talk during the opening ceremony of the 2009 congress of the associationCongress span 4 days, location International Congress Center ICC Berlin, 8162 visitors most of which scientists and physicians, 630 eventsFor first time, English tracks in programmeBy now, DGPPN annual conference is the major scientific congress in the field of mental disorders and neuroscience in Europe15% growth compared to previous annual conference edition, doubled size in past 5 years => importance of congresses in science and practice26

Exemplar Valorization ScenariosPublishers: Scientific Conference and PCO Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6

Audio/ Video and Multimedia Contente.g. talk capture with presentation slide synchronizationSo-far Unpublished Contente.g. posters and presentation slidesAnnotationse.g. reviews and opinionsExemplar Valorization ScenariosPublishers: Scientific Conference & PCO Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-628

Net Publications andElectronic PublicationsE.g. abstract bookUnbundling and Smaller UnitsE.g. single abstracts and articlesInterlinkingE.g. citations, alternative representations, similar works

Cross Media PublishingE.g. net publication, ebook and interactive programme on the webExemplar Valorization ScenariosPublishers: Scientific Conference & PCO Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-629 & ResearchInstitute/ UniversityResearcher ConferenceScientific AssociationCongress OrganizerPresenterParticipantsPublishingPublisherScientific AssociationResearcherPresentationSchedulingReviewArchivalDiscourse Web SiteScientific AssociationCongress OrganizerResearcher PreservationDepot LibraryLegal DepositDistributionProductionArchivalProvisionArchivalDistributionProductionArchivalPreservationStorageIngestDisseminationDiscoursePresentations, Poster, Audio/ Video Captures, Discussions, Reviews, ClassificationsAbstract BookContribution Correlations, Opinions, Web SitePostproduction PracticeAbstracts, Papers, Supplements, DataExemplar Valorization ScenariosPublishers: Summary Scientific Conference Segmentaparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6Level 2 GOME Satellite instrument data

PARSE.Insight and SCIDIP-ESSuccessful Reference, i.e., Pilot Innovation ProjectsDomain: Earth Sciences, Exemplar Roadmapping & Innovation

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-6This is a mosaic composed by the data retrieved over 14 orbits per day with interpolation of data values to fill the gap between two quasi-adjacent geographical areas covered by the satellite ground track. On that date the white zone over Asia indicates absence of useful data.Format of file can be identified through a combination of file extension, physical location of file (as Indicated by Postgres database) and file name which indicates format. Textual description of the formats Is available on MST support pages Raw data IQ (In phase and quadrature) data - non standard binary format files format (textual description of format available)Spectral - Raw data which has undergone first stage processing to spectral non standard binary file format. (textual description of format available)Processed productVersion 2 processed data product Radial data: Nasa Ames format (product specific textual description available) Cartesian data: Nasa Ames format (product specific textual description available)Version 1: processed data productRadial data: Nasa Ames format (product specific textual description available)Cartesian data: Nasa Ames format (product specific textual description available)Version 0: processed data productTime averaged radial data: non standard ASCII format (textual description available)Unaveraged radial data: non standard ASCII format (textual description available)Time averaged wind data: non standard ASCII format (textual description available)Unaveraged wind data: non standard ASCII format (textual description available)Time averaged power data: non standard ASCII format (textual description available)Quick Look plots: graphic file png format generated from cartesian products using GNU plotUnder development With version 3 binary file of cartesian and radial product will be produced in the NetCDF format in tandem with a Nasa Ames ASCII versions.Once Version 3 processing (Python producing NetCDF radial and Cartesian Product) is released a reprocessing of the archived Spectral file will be undertaken. In order to produce a consistent series of Cartesian and radial product. As they will be of higher quality and more easily supported.After a period of time the BADC would wish to dispose of the old higher level product but would like to maintain the ability to recreate it.

Prof. Dr.-Ing. Matthias L. HemmjeAPARSEN Stream Leader SustainabilityMember of the Executive Board [email protected]

FTK Research Institute for Telecommunication and Cooperation e.V.Martin-Schmeier-Weg 4D-44227 DortmundGermanyThanks for your attentionAny Questions?

aparsen.eu #APARSEN Co-funded by the European Union under FP7-ICT-2009-632