Project @ a glance Project @ a glance Digital Libraries Powered by the Grid i i Start date:...
-
date post
18-Dec-2015 -
Category
Documents
-
view
217 -
download
2
Transcript of Project @ a glance Project @ a glance Digital Libraries Powered by the Grid i i Start date:...
Project@ a glance
Project@ a glance
Digital Libraries Powered by the Grid
ii
Start date: September 1, 2004.
Duration: 36 months.
Effort: 1077 Personmonths.
Cost: €9,546,561.
European Commission Contribution: €6,300,000.
Administrative and Financial Coordinator: ERCIM.
Technical and Scientific Coordinator: CNR-ISTI.
Laying the foundations for next generation of collaboration
and knowledge management environments: deploying
Digital Libraries on-demand over Grid-enabled
infrastructures.
DILIGENT aims to establish a standards-based test-bed which integrates state-of-the-art Digital Library and Grid technologies to support a wide range of research and industrial applications. With DILIGENT, members of Virtual Organisations can create on-demand, transient Digital Libraries based on shared computing, storage, content, and application resources. A Knowledge Management Environment based on DILIGENT guarantees secure, coordinated, dynamic, and cost-effective collaboration and sharing of knowledge.
Furthermore, DILIGENT aims to:
open up Grid technology to a broader range of research and industrial communities;
promote the diffusion of Digital Libraries technology through a cost-effective operational model;
promote cross-fertilization between the Digital Libraries and Grid domains which will foster synergies and advances in both areas.
ObjectivesObjectives
TechnologyTechnology
The DILIGENT infrastructure is comprised by a set of interacting services which provide:
support for the creation and operation of on-demand, transient Digital Libraries;
typical Digital Library functionality, such as search, annotation, personalisation, and content visualisation;
access to third-party application and information sources;
functionality to handle shared content and application resources.
Furthermore, DILIGENT can integrate pre-existing infrastructure resources (nodes, services, etc) supplied by collaborating parties with its own ones and also exploit those provided by the EGEE Grid infrastructure.
As a result, the DILIGENT infrastructure includes hosting nodes, content resources, and application resources. Its component services can use any gLite-compliant infrastructure to carry out computational and storage intensive tasks.
DILIGENT project emphasises on
technical development in order to
deliver a usable platform.
Design and development tasks
form most of the project's activities
and are linked to the various
components of the DILIGENT
platform. Testing and integration
activities glue the components
together into a robust, stable
system that can be deployed,
reused or extended at will.
As a proof of concept, DILIGENT
supports two user communities
operating within disjoint
application domains:
ImpECt, focusing on the
analysis, study and prevention
in environmental research
through the management of
services and information for
science-oriented applications
in Earth Observation;
ARTE, targeting management
of rich-media archives and
enabling sharing and
dissemination of information /
knowledge produced by
research in humanities, social
sciences and communication.
Finally, a number of tasks promote
the sustainability of the project
and the maximisation of its impact
to the constituent technologies.
The Enabling Grids for E-sciencE (EGEE) project is funded by the European Commission and aims to build on recent advances in grid technology and develop a service grid infrastructure which is available to scientists 24 hours-a-day
A DΙgital Library Infrastructureon Grid EΝabled Technology
Platform Overview
The DILIGENT architecture is distributed across a stack of logical
layers built on top of EGEE middleware to achieve modularity
and independence from hosting system.
The DILIGENT platform spreads over five service groups and three logical layers. The Collective Layer lays the foundations for the system by enhancing existing Grid collective services in order to support the complex service interactions required by the Digital Library Layer.
The DILIGENT platform spreads over five service groups and three logical layers. The Collective Layer lays the foundations for the system by enhancing existing Grid collective services in order to support the complex service interactions required by the Digital Library Layer.
Service layersService layers
Digital Library Creation and ManagementDigital Library Creation and Management
The Information Service supports the discovery and real-time monitoring of DILIGENT resources. Single services and entire Digital Libraries can thus tune resource usage and balance workload.
The Keeper Service acts as the ‘orchestrator‘ of the services which form a Digital Library whilst assuring the Quality of Service required by the Digital Library's definition criteria. To do so, it monitors the status of services and resources during the lifetime of the Digital Library and accordingly re-designs its topology at runtime.
The Broker and Matchmaker Service facilitates the efficient usage of the Grid infrastructure through an optimal distribution of services and resources across Grid nodes. The service exploits and extends underlying grid middleware capabilities so as to match DILIGENT hosted Digital Libraries needs.
The Virtual Organisation Support Service is dedicated to the management of Virtual Organizations. A Virtual Organisation (VO) is a dynamic pool of distributed resources shared by a dynamic set of users from one or more organizations in a trusted way. In DILIGENT, Digital Libraries make use of the VO mechanism to glue together users and resources in the trusted environment of the Digital Library.
The Virtual Digital Library Generator Service allows users/communities to create their own Digital Libraries. It permits users to specify a set of criteria which define the characteristics expected from the new Digital Library; then, it identifies the services and information sources which are required to satisfy the criteria.
The Information Service supports the discovery and real-time monitoring of DILIGENT resources. Single services and entire Digital Libraries can thus tune resource usage and balance workload.
The Keeper Service acts as the ‘orchestrator‘ of the services which form a Digital Library whilst assuring the Quality of Service required by the Digital Library's definition criteria. To do so, it monitors the status of services and resources during the lifetime of the Digital Library and accordingly re-designs its topology at runtime.
The Broker and Matchmaker Service facilitates the efficient usage of the Grid infrastructure through an optimal distribution of services and resources across Grid nodes. The service exploits and extends underlying grid middleware capabilities so as to match DILIGENT hosted Digital Libraries needs.
The Virtual Organisation Support Service is dedicated to the management of Virtual Organizations. A Virtual Organisation (VO) is a dynamic pool of distributed resources shared by a dynamic set of users from one or more organizations in a trusted way. In DILIGENT, Digital Libraries make use of the VO mechanism to glue together users and resources in the trusted environment of the Digital Library.
The Virtual Digital Library Generator Service allows users/communities to create their own Digital Libraries. It permits users to specify a set of criteria which define the characteristics expected from the new Digital Library; then, it identifies the services and information sources which are required to satisfy the criteria.
The Digital Library Layer is the domain-specific functional layer of DILIGENT, for it supports the storage, handling, and retrieval of multi-type and mixed-media content.
Finally, the Application-Specific Layer gathers general-purpose application tools as well as APIs and SDKs for third parties to migrate their data or functional components to the DILIGENT platform. Furthermore it builds a number of tools of common interest to domain-specific application developers.
The Digital Library Layer is the domain-specific functional layer of DILIGENT, for it supports the storage, handling, and retrieval of multi-type and mixed-media content.
Finally, the Application-Specific Layer gathers general-purpose application tools as well as APIs and SDKs for third parties to migrate their data or functional components to the DILIGENT platform. Furthermore it builds a number of tools of common interest to domain-specific application developers.
DILIGENT adopts gLite as the underlying Grid middleware.
gLite services in use are:Job Management (WMS/CE/WN) in: Feature Extraction Service; Content Security Service; Process Execution and Reliability Service.Data Management (SE/LFC/FTS) in: Content Management Service; Keeper Service.Information System (RGMA) in: Information Service (DIS).Security (VOMS) in: Dynamic VO Support Service.
DILIGENT will eventually join EGEE production infrastructure.
DILIGENT adopts gLite as the underlying Grid middleware.
gLite services in use are:Job Management (WMS/CE/WN) in: Feature Extraction Service; Content Security Service; Process Execution and Reliability Service.Data Management (SE/LFC/FTS) in: Content Management Service; Keeper Service.Information System (RGMA) in: Information Service (DIS).Security (VOMS) in: Dynamic VO Support Service.
DILIGENT will eventually join EGEE production infrastructure.
EGEE in DILIGENTEGEE in DILIGENT
is the next generation middleware for grid computing, born from the collaborative efforts of several academic and industrial research centres funded by the EU as part of the EGEE Project. gLite provides a framework for building grid applications tapping into the power of distributed computing and storage resources across the Internet.
is the next generation middleware for grid computing, born from the collaborative efforts of several academic and industrial research centres funded by the EU as part of the EGEE Project. gLite provides a framework for building grid applications tapping into the power of distributed computing and storage resources across the Internet.
GT4 components in use are:Java WS Core (as hosting environment for
all DILIGENT services);Aggregator Framework (in Information
service);Credential Management (in Dynamic VO
Support);WS AuthN and AuthZ (in Dynamic VO
Support).
GT4 components in use are:Java WS Core (as hosting environment for
all DILIGENT services);Aggregator Framework (in Information
service);Credential Management (in Dynamic VO
Support);WS AuthN and AuthZ (in Dynamic VO
Support).
A Digital Library Infrastructure on Grid Enabled TechnologyA Digital Library Infrastructure on Grid Enabled Technology
Architecture, Services, Applications and Middleware in DILIGENT
Created by G.Kakaletris @ di.UoA (2006)Created by G.Kakaletris @ di.UoA (2006)
Process ManagementProcess Management
The Process Design & Verification Service provides the user with graphical tools for the design and manipulation of process definitions. Furthermore, before a compound service (i.e. a workflow of service invocations) may be deployed, the provided verification functionality validates its compliance with certain rules, so that the process can be considered safe for execution.
The Process Execution & Reliability Service is responsible for all actions pertaining to the actual execution of compound services, i.e. for finding and allocating resources, starting, monitoring or aborting processes and handling process execution failures in accordance with transactional policies/guarantees.
Within the Process Optimisation Service, user and system processes are being optimised prior to their execution by the actual services under a rich, flexible cost-estimation policy which takes into consideration a multitude of parameters and values.
The Process Design & Verification Service provides the user with graphical tools for the design and manipulation of process definitions. Furthermore, before a compound service (i.e. a workflow of service invocations) may be deployed, the provided verification functionality validates its compliance with certain rules, so that the process can be considered safe for execution.
The Process Execution & Reliability Service is responsible for all actions pertaining to the actual execution of compound services, i.e. for finding and allocating resources, starting, monitoring or aborting processes and handling process execution failures in accordance with transactional policies/guarantees.
Within the Process Optimisation Service, user and system processes are being optimised prior to their execution by the actual services under a rich, flexible cost-estimation policy which takes into consideration a multitude of parameters and values.
Content and Metadata ManagementContent and Metadata Management
Indexing and SearchingIndexing and Searching
The Content Management Service supports transparent access to both DILIGENT and external rich-media or plain content. It handles the creation of new objects and updates of existing ones via various techniques such as polling and notifications, while providing support for URI maintenance, replication and partitioning.
The Metadata Management Service manages metadata for the description and enrichment of the digital objects handled by the Content Management service.
The Metadata Broker Service provides the mechanisms for achieving metadata interoperability.
The Content Security Service secures the storage and dissemination of content, safeguarding both Intellectual Property Rights and content integrity through the application of techniques such as signing, watermarking etc.
The Annotation Management Service offers integrated management of annotations for digital objects available on the DILIGENT platform, primarily multimedia content objects. Its main goal is to support collaboration activities between distinguished users of a DILIGENT Digital Library.
The Content Management Service supports transparent access to both DILIGENT and external rich-media or plain content. It handles the creation of new objects and updates of existing ones via various techniques such as polling and notifications, while providing support for URI maintenance, replication and partitioning.
The Metadata Management Service manages metadata for the description and enrichment of the digital objects handled by the Content Management service.
The Metadata Broker Service provides the mechanisms for achieving metadata interoperability.
The Content Security Service secures the storage and dissemination of content, safeguarding both Intellectual Property Rights and content integrity through the application of techniques such as signing, watermarking etc.
The Annotation Management Service offers integrated management of annotations for digital objects available on the DILIGENT platform, primarily multimedia content objects. Its main goal is to support collaboration activities between distinguished users of a DILIGENT Digital Library.
The Feature Extraction Service provides automated metadata generation for image, video, audio, and text content objects. The metadata can then be used by the Search Service in order to look for objects by content or similarity.
The Index Service assists the search service by enabling efficient content-based retrieval. It leverages state-of-the-art search engines (such as FAST’s Enterprise Search Platform) to deliver advanced text searching, and it extends search functionality to other media objects by indexing results from feature extraction as well as other metadata.
The Content Source Description and Selection Service optimises the distribution of content-based queries across search engines available on the DILIGENT platform. Using up-to-date descriptions of the content targeted by each engine, the service automatically selects the engines which are most likely to produce relevant results for a given query.
The Data Fusion Service supports the real-time distribution of content-based queries across search engines available on the DILIGENT platform. Using statistics about the engines and their per-query outputs, the service normalises and merges autonomously produced result rankings.
The Search Service offers the entry point to the functionality of the entire search engine. It is a complex service which orchestrates the operation of various DILIGENT services under the user's request for information. Searches over structured, semi-structured and unstructured data are supported through potentially complex processing and transformations.
Through the Personalisation Service, queries are pre-processed and customised with user-specific data so as to tailor the results to the specific needs and expectations of the user. Personalisation criteria include cost, accuracy, content sources, formatting, etc.
The Feature Extraction Service provides automated metadata generation for image, video, audio, and text content objects. The metadata can then be used by the Search Service in order to look for objects by content or similarity.
The Index Service assists the search service by enabling efficient content-based retrieval. It leverages state-of-the-art search engines (such as FAST’s Enterprise Search Platform) to deliver advanced text searching, and it extends search functionality to other media objects by indexing results from feature extraction as well as other metadata.
The Content Source Description and Selection Service optimises the distribution of content-based queries across search engines available on the DILIGENT platform. Using up-to-date descriptions of the content targeted by each engine, the service automatically selects the engines which are most likely to produce relevant results for a given query.
The Data Fusion Service supports the real-time distribution of content-based queries across search engines available on the DILIGENT platform. Using statistics about the engines and their per-query outputs, the service normalises and merges autonomously produced result rankings.
The Search Service offers the entry point to the functionality of the entire search engine. It is a complex service which orchestrates the operation of various DILIGENT services under the user's request for information. Searches over structured, semi-structured and unstructured data are supported through potentially complex processing and transformations.
Through the Personalisation Service, queries are pre-processed and customised with user-specific data so as to tailor the results to the specific needs and expectations of the user. Personalisation criteria include cost, accuracy, content sources, formatting, etc.
Application Specific Services
Application Specific Services
DILIGENT offers a rich user interface to support the administration and management of the infrastructure. It also offers end-user interface allowing the full exploitation of platform’s potential on the fields of information hosting and retrieval.
Interfaces rely on Portlet technology and can thus be plugged into any standards-compliant Portal engine. Ultimately, users and administrators manage and use the DILIGENT platform under the intuitive interface of a personalisable web portal.
Besides mechanisms to support end-user interaction with the system, DILIGENT provides also basic components to visualise the main content types handled in the two application scenarios which form part of the project’s initial workplan.
DILIGENT offers a rich user interface to support the administration and management of the infrastructure. It also offers end-user interface allowing the full exploitation of platform’s potential on the fields of information hosting and retrieval.
Interfaces rely on Portlet technology and can thus be plugged into any standards-compliant Portal engine. Ultimately, users and administrators manage and use the DILIGENT platform under the intuitive interface of a personalisable web portal.
Besides mechanisms to support end-user interaction with the system, DILIGENT provides also basic components to visualise the main content types handled in the two application scenarios which form part of the project’s initial workplan.
The Globus Toolkit is an open source software toolkit used for building Grid systems and applications. It is being developed by the Globus Alliance and several other parties all over the world.
The toolkit includes software for security, information infrastructure, resource management, data management, communication, fault detection, and portability. It is packaged as a set of components that can be used either independently or together to develop applications.
The Globus Toolkit is an open source software toolkit used for building Grid systems and applications. It is being developed by the Globus Alliance and several other parties all over the world.
The toolkit includes software for security, information infrastructure, resource management, data management, communication, fault detection, and portability. It is packaged as a set of components that can be used either independently or together to develop applications.
WSRF and WSN specifications in use are:
WS-ResourceProperties (WSRF-RP);WS-ResourceLifetime (WSRF-RL);WS-BaseFaults (WSRF-BF);WS-ServiceGroup (WSRF-SG);WS-BaseNotification (WSN);WS-Topics (WSN).
DILIGENT services are WSRF compliant.
WSRF and WSN specifications in use are:
WS-ResourceProperties (WSRF-RP);WS-ResourceLifetime (WSRF-RL);WS-BaseFaults (WSRF-BF);WS-ServiceGroup (WSRF-SG);WS-BaseNotification (WSN);WS-Topics (WSN).
DILIGENT services are WSRF compliant.
Globus Toolkit in DILIGENTGlobus Toolkit in DILIGENT
Features @ a glanceFeatures @ a glance
DILIGENT is partially funded by the European Union under the Sixth Framework Program, Information Society Technologies priority (2002-2006)
The European Union is established in accordance with the Treaty on European Union (Maastricht). There are currently 25 Member States of the Union. It is based on the European Communities and the member states cooperation in the fields of Common Foreign and Security Policy and Justice and Home Affairs. The five main institutions of the European Union are the European Parliament, the Council of Ministers, the European Commission, the Court of Justice and the Court of Auditors. (http://europa.eu.int/)
Disclaimer: This publication has been produced with the assistance of the European Union. The contents of this publication is the sole responsibility of DILIGENT consortium and can in no way be taken to reflect the views of the European Union. Material presented in here by no means binds the DILIGENT consortium to future technological decisions and products.
Images sources: ARTE archives, EU and ESA public image galleries. Styling: UoA, ENG. Production: UoA.
** Work teams of marked partners moved to other organisations inside the project.
On-demand Digital Library creation, hosting and management;
Security integration through virtual organisation and virtual community support;
Content security handling (access and watermarking policies);
Semantic content management over a gLite superimposed storage management layer;
Metadata management and indexing;
Annotation management and visualisation;
Dynamic Digital Library resources reallocation;
Shared resource usage optimisation;
Complex process visual design, verification, optimisation and execution;
Advanced feature extraction through pluggable components and services;
Text, image, sound, video and multimedia content processing;
Information visualisation;
Information retrieval out of structured, semi-structured and unstructured data;
Support for application-specific extensions;
Provision of pluggable management and access web components;
Service Oriented Architecture paradigm based design;
Built on WSRF compliant service elements;
EGEE gLite middleware compliancy;
Implementation level homogeneity.
European Research Consortium for Informatics and Mathematics – France
Istituto di Scienza e Tecnologie dell'Informazione "A. Faedo“– Italy
National & Kapodistrian University of Athens – Greece
Swiss Federal Institute of Technology, Zurich** – Switzerland
Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. – Germany
University of Health Informatics and Technology Tyrol ** – Austria
University of Basel - Switzerland
European Organization for Nuclear Research – Switzerland
Engineering Ingegneria Informatica SpA Rome – Italy
University of Strathclyde – United Kingdom
Fast Search & Transfer ASA – Norway
European Space Agency – Italy
Scuola Normale Superiore – Italy
4D SOFT Software Development Ltd. – Hungary
RAI Radio Televisione Italiana – Italy
@@
http://www.diligentproject.org/
mailto:[email protected]
Contact Person:Donatella [email protected] di Scienza e Tecnologie dell'Informazione "A. Faedo" - CNRVia Moruzzi, 1 , 56124, Pisa - ItalyTel: +39 050 3152902Fax: +39 050 3153464
DILIGENT design and implementation is built around the vision to provide a robust stable system capable of supporting production level Digital Libraries.
In order to make this possible procedures followed by industrial / commercial software development lines have been adopted and internal homogeneity has become a primary rule since the beginning of project, leading to:
full adoption of the Service Oriented Architecture as dictated by the OGSA paradigm;
adoption of JAVA as the sole development language and of WS-Core as the WSRF enabling toolkit;
application of common design patterns for families of problems;
development of horizontal mechanisms to support common tasks throughout the system;
provision of common terminology;
compliance with commonly accepted coding and documentation guidelines;
consistency in reuse of technologies and external components.
Despite the internal homogeneity, the Service Oriented nature of the system allows its full or partial reuse and extensibility by a diversity of heterogeneous systems that fit the industrial interfacing standards adopted by DILIGENT.
The frameworkThe framework