The Danish National Research Database Which approach: Look at the environment first and the software...


The Danish National Research Database

• Which approach: Look at the environment first and the software afterwards or vice versa?

• Look at the environment – a hybrid environment

• A hybrid environment calls for a hybrid solution

The solution:

• Distributed searching (Web-Z gateway)

• Hybrid model for data collection

Considerations

• Larger institutions/suppliers often have CRIS systems of their own

• Smaller institutions/suppliers often do not

• The architecture must be integrated with other national initiatives

• As far as possible, the data feed should reuse the reporting that is already done at the institutions

• Do NOT force present suppliers to change routines unless they want to!

Data collection and searching

Data collection:

• Metadata creator/cataloguing module

• Load routines like the present ones

• Harvesting service (Open Archives Initiative)
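A minimal sketch of what the harvesting service could do, assuming the request and response conventions of OAI-PMH 2.0 (which is later than these slides) and Dublin Core metadata. The base URL, the sample response and the function names are illustrative assumptions, not part of the project. The sketch builds a ListRecords request and parses one page of results without touching the network:

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces used by OAI-PMH 2.0 responses carrying unqualified Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL for a (hypothetical) local OAI server."""
    if resumption_token:
        # A resumption token is an exclusive argument: no metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)

def parse_list_records(xml_text):
    """Return (records, resumption_token) from one ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": rec.findtext("oai:header/oai:identifier", namespaces=NS),
            "title": rec.findtext(".//dc:title", namespaces=NS),
        })
    token = root.findtext(".//oai:resumptionToken", namespaces=NS)
    return records, (token or None)

# Made-up response standing in for what a local OAI server might return.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample report</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>
"""

records, token = parse_list_records(SAMPLE_RESPONSE)
next_url = list_records_url("http://example.org/oai", resumption_token=token)
```

A real harvester would fetch each URL, load the parsed records into the central database, and repeat until no resumption token is returned.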

Searching:

• Access to the present database as well as other databases using Z39.50

• The Danish National Research Database becomes a virtual database – a portal
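The portal idea above can be sketched as a fan-out search. This is only an illustration: the Python standard library has no Z39.50 client, so the targets below are in-memory stand-ins for Z39.50 servers, and the title-based de-duplication is an assumed example of what the gateway could add, not a described feature.

```python
from concurrent.futures import ThreadPoolExecutor

def make_target(name, records):
    """Create a stand-in for a Z39.50 target: a function that searches titles."""
    def search(term):
        needle = term.lower()
        return [dict(r, source=name) for r in records if needle in r["title"].lower()]
    return search

def distributed_search(targets, term):
    """Send the same query to every target in parallel and merge the hits."""
    with ThreadPoolExecutor(max_workers=len(targets)) as pool:
        # pool.map keeps the order of the targets, so merging is deterministic.
        result_sets = list(pool.map(lambda target: target(term), targets))
    merged, seen = [], set()
    for hits in result_sets:
        for hit in hits:
            key = hit["title"].lower()  # naive duplicate detection on title
            if key not in seen:
                seen.add(key)
                merged.append(hit)
    return merged

# Hypothetical targets: the present central database and one local database.
central = make_target("central", [{"title": "Wind energy report"},
                                  {"title": "Fusion research"}])
local = make_target("local", [{"title": "Wind Energy Report"},
                              {"title": "Wind turbine noise"}])

hits = distributed_search([central, local], "wind")
```

No records are copied to the portal in this model; each target answers from its own database, which is what makes the national database "virtual".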

The Danish National Research Database – System Architecture (2001-04-24)

[Figure: system architecture diagram in four layers: users, the national system, local systems and data feed. Components shown: the Danish National Research Database as a Z39.50 gateway with possible value adding, reached by users over HTTP; the central database (RDB) with a Z39.50 server, catalogue module, import and OAI harvesting; local databases with catalogue modules, export, OAI servers and Z39.50 servers; other Z-gateways, local or (inter)national; legal deposit OAI harvesting. Connections are labelled Z39.50, HTTP, or HTTP or FTP. A legend distinguishes parts of the national system from parts of local systems.]

Data feed variants a to e in the figure:

• a. Cataloguing in the local system, with export to the national system on a regular basis

• b. Cataloguing directly in the national system

• c. Cataloguing in a local system with an OAI server; data are collected by the national system's OAI harvester robot

• d. Cataloguing in a local installation of the national system; data are made available using a Z39.50 server (and/or OAI server)

• e. Cataloguing in the local system; data are made available using the institution's own system, including a Z39.50 server
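One way to read the five data feed variants is as a small configuration table. The sketch below is an interpretation, not the project's data model: the class names, the field names and the split into "central" and "remote" access are assumptions. In particular, variant d is classed as remote although it may also expose an OAI server.

```python
from dataclasses import dataclass
from enum import Enum

class Access(Enum):
    CENTRAL = "records are held in the central database"
    REMOTE = "records stay local and are searched via Z39.50"

@dataclass(frozen=True)
class FeedVariant:
    cataloguing: str   # where the records are created
    transfer: str      # how the national system gets at them
    access: Access     # assumed classification, see the note above

FEED_VARIANTS = {
    "a": FeedVariant("local system", "regular export and import", Access.CENTRAL),
    "b": FeedVariant("national system", "none needed", Access.CENTRAL),
    "c": FeedVariant("local system with OAI server", "OAI harvesting", Access.CENTRAL),
    "d": FeedVariant("local installation of the national system",
                     "Z39.50 server (and/or OAI server)", Access.REMOTE),
    "e": FeedVariant("institution's own system", "own Z39.50 server", Access.REMOTE),
}

def variants_with(access):
    """Return the variant labels that use the given kind of access."""
    return sorted(k for k, v in FEED_VARIANTS.items() if v.access is access)

central_variants = variants_with(Access.CENTRAL)
remote_variants = variants_with(Access.REMOTE)
```

Under this reading, variants a to c need a load or harvest step on the national side, while d and e only require that the gateway knows the address of the local server.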

Key elements in the architecture

• Data collection – metadata creator

• Data collection – harvester (OAI)

• Data collection – present load routine

• RDB and search engine

• Distributed searching – Z39.50 protocol

Organization:

• Denmark's Electronic Research Library

• Project management: Risoe National Laboratory

• The development project (2000 – 2002?): a collaboration between the Technical University of Denmark and Risoe National Laboratory

ERGO – if it is still there…

• Data delivery could mean offering access, not necessarily physical delivery

• Data collection by means of upload/harvesting – or no collection at all (distributed searching)

• ERGO web site becomes a web-Z gateway - a portal

• ERGO nodes thus act as targets for harvesting or Z39.50 searching
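A web-Z gateway has to turn a web request into a Z39.50 query. The sketch below shows one possible translation step, assuming the Prefix Query Format (PQF) notation used by the YAZ toolkit and the Bib-1 use attributes for title (4), author (1003) and any (1016). The form field names and function names are hypothetical, and nothing in the slides says which toolkit or query notation the gateway uses.

```python
from urllib.parse import parse_qs

# Bib-1 "use" attributes for a few common access points.
BIB1_USE = {"title": 4, "author": 1003, "any": 1016}

def to_pqf(fields):
    """Translate {field: value} pairs into a PQF query joined with @and."""
    clauses = []
    for name, value in fields.items():
        if not value:
            continue  # ignore empty form fields
        if name not in BIB1_USE:
            raise ValueError(f"unsupported search field: {name}")
        if '"' in value:
            # Keep the sketch simple: reject quotes instead of escaping them.
            raise ValueError("double quotes are not supported in this sketch")
        clauses.append(f'@attr 1={BIB1_USE[name]} "{value}"')
    if not clauses:
        raise ValueError("empty query")
    query = clauses[0]
    for clause in clauses[1:]:
        # PQF is prefix notation: the operator comes before its two operands.
        query = f"@and {query} {clause}"
    return query

def query_string_to_pqf(query_string):
    """Translate the query string of an HTTP request into PQF."""
    parsed = parse_qs(query_string)
    return to_pqf({name: values[0] for name, values in parsed.items()})

pqf = query_string_to_pqf("title=wind+energy&author=Hansen")
```

The resulting string would then be handed to whatever Z39.50 client the gateway uses and sent to each target node.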

ERGO elements

• Z39.50 gateway (web-Z gateway)

• Search engine(s)

• Metadata creator

• Harvester (Open Archives Initiative?)

• Solutions based primarily on open source/freeware. Software configured, maintained and distributed by the CORDIS team

Links

• Denmark's Electronic Research Library: www.deflink.dk (presently being redesigned)

• The DEF web-Z gateway: www.defkat.dk

• The Danish National Research Database: www.forskningsdatabase.dk

• Open Archives Initiative: www.openarchives.org