China Scientific Data Sharing ProjectChina Scientific Data Sharing Project
International Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, BeijingInternational Workshop on Strategies for Preservation of and Open Access to Scientific Data, June 22-24, 2004, Beijing
Xian-En ZHANGXian-En ZHANG
Working Group, China-Scientific Data Sharing ProjectWorking Group, China-Scientific Data Sharing ProjectBasic Research Department, Ministry of Science & TechnologyBasic Research Department, Ministry of Science & Technology
•General Considerations and Objectives General Considerations and Objectives
•Framework and ArchitectureFramework and Architecture
•Major TasksMajor Tasks
•Program Work PlanProgram Work Plan
•Current Status and ProgressCurrent Status and Progress
•China-SDSPChina-SDSP
• China-SDSPChina-SDSP should be developed under comprehensive should be developed under comprehensive planning on the national level. planning on the national level.
• It should collect and re-organize all possible data from It should collect and re-organize all possible data from government agencies, institutes, programs, and government agencies, institutes, programs, and individual investigators while making full use of individual investigators while making full use of international scientific data resources through international scientific data resources through cooperation.cooperation.
• China-SDSP should make all these data accessible to all China-SDSP should make all these data accessible to all interested users at an affordable cost, or free if possible.interested users at an affordable cost, or free if possible.
• • China-SDSP is to form a multi-tiled, distributed scientific China-SDSP is to form a multi-tiled, distributed scientific
data sharing system that bridges the gaps between data sharing system that bridges the gaps between different agencies, institutes, and geographical regions.different agencies, institutes, and geographical regions.
2020 Goals:2020 Goals:
• To form a scientific data management and sharing To form a scientific data management and sharing system that is more user-friendly; system that is more user-friendly;
• To develop a set of supportive laws, policies, and To develop a set of supportive laws, policies, and standards; standards;
• To form a professional service group by establishing To form a professional service group by establishing a career reward mechanism. a career reward mechanism.
• Eighty percent of scientific data funded by the Eighty percent of scientific data funded by the government will be made available to general public.government will be made available to general public.
Framework and ArchitectureFramework and Architecture
1. Logical Framework of CSDSP1. Logical Framework of CSDSPCSDSP is a three-tiled system: master databases, scientific data CSDSP is a three-tiled system: master databases, scientific data centers or networks, and Gateway Web sitecenters or networks, and Gateway Web site
2. Scope of Data Sharing Supported by China-SDSP2. Scope of Data Sharing Supported by China-SDSPChina-SDSP also functions as a catalyst. Its original purpose is to China-SDSP also functions as a catalyst. Its original purpose is to integrate publicly funded data resources, but its long-term goal is to integrate publicly funded data resources, but its long-term goal is to leverage all possible data resources from government to the private leverage all possible data resources from government to the private sectors, and make them available to the general public.sectors, and make them available to the general public.
3. Service Architecture of China-SDSP3. Service Architecture of China-SDSPChina-SDSP may provide services in various ways: facilitating the China-SDSP may provide services in various ways: facilitating the consistent management of distributed databases; providing a content consistent management of distributed databases; providing a content service and data service, as well as other services mentioned.service and data service, as well as other services mentioned.
Gateway to China Scientific Data Sharing Program
Natural Scienceand Environment
Agriculture
Populationand Health
Basic and Frontier Sciences
Engineering andTechnology
Regional Development
Meteorological Scientific Data Center
Rural Development Sci Data Center
Agricultural Scientific Data Center
Basic Medicine Scientific Data Center
Rural Development Sci Data Center
Population Control Sci Data Center
Earth System Scientific Data Center
Space Environment Sci Data Center
………………………………………………
………………………………………………
About 300 Master Databases
In 40 Data Cent
ers
Disciplines Disciplines Data Center / NetworksData Center / Networks Master Database Master Database
Data Users
Architecture and Framework of China SDSP
Scientific Data SharingScientific Data Sharing
Submission from Submission from Agencies and Agencies and
InstitutesInstitutes
Exchange Data Exchange Data with other with other CountriesCountries
Submission from Submission from Major National Major National
ProgramProgram
Data Data DisseminationDissemination
Data Data Integration / Integration / SubmissionSubmission
Data Data GeneratorGenerator
Scientific Research & Scientific Research & Technology Development Technology Development SectorSector
Observation, MonitoringObservation, MonitoringSurvey and EvaluationSurvey and EvaluationStatistics SectorStatistics Sector
Scope of Scientific Data Sharing ProjectScope of Scientific Data Sharing Project
Extented Service
SubmittingArchivingUpdating
Classes of Scientific Data Service
DATA Management
CONTENTService
DATA Service
SearchingBrowering
SearchingBroweringDownloading
Data MiningSubject ServingForum…. …….
Fig 3. Service Functionality of China Scientific Data Sharing ProgramFig 3. Service Functionality of China Scientific Data Sharing Program
1. Architectural Development of Data Management and Sharing System
2. Resource Development for Scientific Data
3. Standardization
4. Law and Policy
Major Tasks of China SDSPMajor Tasks of China SDSP
1 Gateway Site1 Gateway Site40 Data Centers /Networks40 Data Centers /Networks300 Master Databases300 Master Databases
Architectural Development of Data Management and Sharing System
Resource Development for Scientific Data
The major tasks are to re-edify existing data resourcesThe major tasks are to re-edify existing data resources ;;safeguard endangered scientific data and records; devsafeguard endangered scientific data and records; develop the master database for large research programs elop the master database for large research programs funded by the government; introduce international data funded by the government; introduce international data resources based on their scientific values, quality, and resources based on their scientific values, quality, and usabilityusability ;; integrate multi-source data; and conduct vaintegrate multi-source data; and conduct value-added research.lue-added research.
Standardization is the prerequisite for scientific data sharing in the Standardization is the prerequisite for scientific data sharing in the digital era. digital era.
There are two kinds of standards: platform technical standards and There are two kinds of standards: platform technical standards and data sharing standards. The former is based on data platforms, and data sharing standards. The former is based on data platforms, and the latter is based on the scientific data sharing framework. the latter is based on the scientific data sharing framework.
The basic and common data sharing standard will be considered first. The basic and common data sharing standard will be considered first. The data standard in major application areas will also be on the list of The data standard in major application areas will also be on the list of priorities. priorities.
Standardization
Specifically, the following should be conducted first:Specifically, the following should be conducted first:
Policy: Establishment and implementation ofPolicy: Establishment and implementation of • Implementation Guidelines of Scientific Data Sharing Program,Implementation Guidelines of Scientific Data Sharing Program,• Data Submission Guidelines of Major Science and Technology Program Data Submission Guidelines of Major Science and Technology Program Funded by GovernmentFunded by Government• Guideline of Scientific and Technological Data Classification for Data Guideline of Scientific and Technological Data Classification for Data Sharing Sharing
• Management Guidelines of China Scientific Data Sharing Program Management Guidelines of China Scientific Data Sharing Program • Performance Evaluation (Merit Appraisal) of Scientific Data SharingPerformance Evaluation (Merit Appraisal) of Scientific Data Sharing
Law: Legislation and Amendment ofLaw: Legislation and Amendment of• Science and Technology Advancement ActScience and Technology Advancement Act• Copy Right Act Copy Right Act • National Security ActNational Security Act
OthersOthers• Be proactively involved in the on-going legislation of “Policy on Access Be proactively involved in the on-going legislation of “Policy on Access toto
Government Information”.Government Information”.• Promote the issuing of “Policy on National Scientific and Technological Promote the issuing of “Policy on National Scientific and Technological Resources Sharing”.Resources Sharing”.
Law and Policy
Experimental period: 2001-2005• Overall planning and design;• Legislation planning: start research on law and policy framework;• Making and issuing relevant policy and regulation;• Technology and standards;• Establishing data centers (networks) and kicking off the data
sharing pilot project;• Identifying the optical mechanism for existing data consolidation
and sharing;• Launching of program gateway: select 25 data centers for data
sharing pilot project, select other candidate centers for further development;
• Sum up experiences from various aspects of the experimental period, and prepare a feasibility report to facilitate the overall implementation of public good data sharing in next period.
Working PlanWorking Plan
Overall Implementation Period: 2006-2010
• Continue the establishment of data sharing technology, policy and law;
• Extend the program coverage of scientific data centers or networks and make them operational;
• Gradually improve technology and standards • Enforce the cooperation among data centers in different research
area;• Enhance the capacity to develop high-level data product and
quality;
Working PlanWorking Plan
After each yearly performance evaluation of the 25 pilot After each yearly performance evaluation of the 25 pilot data centers or networks, the qualified ones will be data centers or networks, the qualified ones will be included in the “National Scientific Data Master Network” included in the “National Scientific Data Master Network” and will start regular operation; the amount invested in and will start regular operation; the amount invested in each center depends on their merits and performance. each center depends on their merits and performance. Another 15-20 data centers will be built, including 200 new Another 15-20 data centers will be built, including 200 new master databases.master databases.
By 2010, a mechanism is going to be established, By 2010, a mechanism is going to be established, through which data are submitted from various through which data are submitted from various governmental agencies and programs and delivered to governmental agencies and programs and delivered to potential users efficiently.potential users efficiently.
Current Status and Progress of China SDSPCurrent Status and Progress of China SDSP
• General Planning and Design (Draft) General Planning and Design (Draft) FinishedFinished
• Pilot Projects for Data SharingPilot Projects for Data Sharing• Law, Policy, and StandardLaw, Policy, and Standard
In June 2003In June 2003 ,, a Coordinating Group and a Scientific a Coordinating Group and a Scientific Group were established for scientific data sharing. The mGroup were established for scientific data sharing. The main task of these groups was to develop the “Planning of ain task of these groups was to develop the “Planning of China Scientific Data Sharing Program” (China-SDSP) by China Scientific Data Sharing Program” (China-SDSP) by May 2004.May 2004.
There are six major components to China-SDSP: curreThere are six major components to China-SDSP: current status and major national requirement; overall considernt status and major national requirement; overall considerations; principle and objectives; strategic arrangement anations; principle and objectives; strategic arrangement and tasks; implementation and measurements; supporting cd tasks; implementation and measurements; supporting conditions and facilitiesonditions and facilities 。。
General Planning and Design (Draft) Finished
In 2001, the meterological data sharing project was launched, which hIn 2001, the meterological data sharing project was launched, which heralded the start of the scientific data sharing program in China.eralded the start of the scientific data sharing program in China.
By the end of 2002By the end of 2002 , , another 5 data centers and 3 networks had joinanother 5 data centers and 3 networks had joined the pilot project:ed the pilot project:
Pilot Projects for Data Sharing
1.1.Survey data centerSurvey data center2.2.Hydrolgy and Water Resources data centerHydrolgy and Water Resources data center3.3.Seismetic data centerSeismetic data center4.4.Forestry data centerForestry data center5.5.Agriculture data centerAgriculture data center6.6.Earth System Science data center networkEarth System Science data center network7.7.Modern Agricultural Technology and Rural Development networkModern Agricultural Technology and Rural Development network8.8.Sustainable Development networkSustainable Development network
Law, Policy, and StandardLaw, Policy, and Standard
In terms of policy-making, a working group for data sharing has been established and investigated the current status and trend of data policy both home and abroad, compiled relevant materials and information ;
Established the ”Guidelines of Data Submission from Major National Programs” and its interpretation ; began researching the framework of relevant law and policy; and finished the conceptual design for data classification for sharing.
In general, China Scientific Data Sharing Program is still in In general, China Scientific Data Sharing Program is still in the phase of overall planning, accumulating the experiences the phase of overall planning, accumulating the experiences of technology and policy making, as well as overseeing pilot of technology and policy making, as well as overseeing pilot data sharing projects. data sharing projects.
AcknowledgementAcknowledgement
Many who involved the projectMany who involved the project
SUN ShuSUN ShuHUANG DingchengHUANG DingchengSUN JiuLinSUN JiuLinLUI ChuangLUI ChuangXIAO YunXIAO YunYIN LingYIN LingCHEN JunCHEN Jun
TENG MianzhenTENG MianzhenZHOU Wenneng ZHOU Wenneng
Paul Uhlir JDPaul Uhlir JDPeter Weiss JDPeter Weiss JD
Top Related