AlessiaBardiand Paolo Manghi, Institute of Information Science ... - Open...
Transcript of AlessiaBardiand Paolo Manghi, Institute of Information Science ... - Open...
@openaire_eu
OpenAIREFostering the social and technical links that enable Open
Science in Europe and beyond
Alessia Bardi and Paolo Manghi, Institute of Information Science and Technologies –CNRKaterina Iatropoulou, ATHENA,
Iryna Kuchma and Gwen Franck, EIFLPedro Príncipe, University of Minho
Národný workshop OpenAIRE/National workshop OpenAIRE
Who we are?
A 50 partners’ partnership In 24x7 operation since 2010
• Institutional, national and international perspectives on OA policies & e-Infrastructures
Open Access experts
• Building efficient e-Infra technologies
• State of the art technologies (big data, linked data)
Info & Computer Science experts
• Legal &policy recommendations
Legal experts
• Best practices for data
• Linking to data infrastructures
Data communities
33 expert nodes all over Europe to helping with: Open Science training & support, OA policy alignment, Technical assistance...
Human Network
Digital Network
Integrated Scientific Information System with access to: 22,036,305
publications; 571,373 datasets; 2,711 data providers, 700Κ
publications linked to projects from 12 funders
Two faceted e-Infrastructure
RESEARCHERS &
RESEARCH
COMMUNITIES
DATA PROVIDERS FUNDERS &
RESEARCH
ADMINISTRATORS
3rd party SERVICE PROVIDERS
Services for all
Dashboards for data providers, funders and researcher communities.
Services at all levels of e-Infrastructure. Services that cover all research life-cycle.
OPEN ACCESSOpenAIRE implements the
EC requirements& SUPPORTS THE OPEN DATA PILOT
https://ec.europa.eu/digital-single-market/en/news/openaire-helps-projects-and-researchers-comply-open-research-data-policy
OpenAIRE’s e-infrastructure Commons
7
Publications repositories
Research Data repositories
CRIS systems
Registries(e.g. projects)
OAJournals
SoftwareRepositories
Validation
Cleaning De-duplication
EnrichmentBy inference
Funders, research admins, research communities• Research impact
• Research trends
• Open Access trends
Content providers• Repository validation
• Repository notification broker
• Repository analytics and usage stats
Researchers• Claim publications, datasets, software
• Deposit publications, datasets, software
• Search & browse: interlinked publications, datasets, projects
• Open Access & DMP Helpdesk
• End-User feedback
Content Providers
Info Space Services
End-User Services
Project initiative
FunderFunding
Result
Publicatio
nData Software
Organizatio
n
GUIDE
LINES
TERMS
OF USE
LET'S HIGHLIGHT SOMESERVICES & TOOLS
SHARE, DEPOSIT AND PUBLISH IN OA
PROJECT FUNDING IN THE PUBLICATION OR DATASET METADATA RECORD
Acknowledge
11
Acknowledge project funding: e.g. ZENODO
12
API
REPOSITORIES DEPOSIT WORKFLOW: Searching by the name, acronym or the project id number… Select the project and accept
OpenAIRE Funders Projects List
API
DISCOVERY/ACCESSSERVICE
Public (all)
14
Provides search and browsing capabilities over a catalogue of
Europe’s (+) interlinked research artefacts (literature, research data,
software)
LINK YOUR RESEARCH RESULTSGATHER OUTPUTS, VIEW PROGRESS &
REPORT
LINK RESEARCH RESULTS TOOLhttps://www.openaire.eu/participate/claim
Link publication or datasets to projets.Identify the project, select publications or datasets and set the access rights.
Link datasets and projects
Visible in the
Participants
portal
PROJECTS PUBLICATIONS LISThttps://www.openaire.eu/search/find/projects
OpenAIRE portal includes an App Box to generate a project publication list.Communicate your project results.
29
30
All project publications in
HTML or CSV
One click away to EC
project reporting systems
31
PROJECTPUBLICATIONSAND DATASETS
Automatically
OpenAIRE services and tools for Open Research Data in H2020 - IDCC 2017 Workshop
33
EC's participant portal (reporting)
EC's participant portal
MONITOR
37
INTEROPERABILITY:GUIDELINES & VALIDATOR
Data providers
39
Common standards/best practices for data providers (Guidelines for
literature, data repositories, aggregators, OA journals, CRIS
systems).
Validator: web service or standalone
1 2 3Literature
Repositories(and journal platforms)
Dublin Core (DRIVER)
Data
Repositories(and archives/data centres)
Datacite
CRIS systems
CERIF-XML
Guidelines for Data Providers
40
NOTIFICATION BROKER
Repositories
41
(Meta)data and links exchange among different data providers.
OpenAIRE
DATA PROVIDERS DASHBOARDDEMO
OpenAIRE Data Provider Dashboard
43
REGISTRATION
&
VALIDATION
::
current validator
ENRICHMENT
&
ADDITION
::
broker service
USAGE STATISTICS
&
METRICS
::
stats service
NOTIFICATIONS
&
UPDATES
::
manage datasource
Data source registration
Data source validation
Repository metrics: OpenAIRE perspective
Repository metrics: Local perspective
ALIGN OA POLICIES. SYNC INFRASTRUCTURES.
Project funding information in OpenAIRE: overview
50
Align and support OA
policies
Sync infrastructures
and Support national e-
infrastructures
Guidelines for metadata funding info
Monitor mandate
compliance
Projects list for repo
softwares
Text mining and inference
Statistics and reporting
Analytics and trends
51
Agreement between OpenAIRE & Funders
52
1
Data descriptionOpenAIRE requires only a very limited set of metadata fields from funders. No
personal or private details are required. The mandatory and optional data fields are:
• PROJECT IDENTIFIER (MANDATORY)
• PROJECT TITLE or ACRONYM (MANDATORY)
• FUNDER NAME (MANDATORY) – e.g. Wellcome Trust, EC
• START DATE (MANDATORY),
• END DATE (MANDATORY)
• FUNDING STREAM(S) (OPTIONAL) – funding categories for more detailed
statistics
• PARTICIPANT ORGANIZATION(S) (OPTIONAL) – i.e., project partners
53
2
With this information, OpenAIRE can offer funders…• A unique view of the scientific outputs that derive from their funding.
• OpenAIRE enables advanced monitoring (including of compliance with
Open Access policies), reporting and analysis of research impact and
research trends.
• Funders can assess the impact of their funding by viewing advanced
statistics on research outputs (publications and data-sets) and the
funding programme/stream/project from which they derive (including
co-funded research results and research trends).
54
Features OpenAIRE can provide to funders
• filter publications/data by funder and browse by specific funding streams
• search via project title, acronym or grant agreement and view specific
statistics of the project: publications/data over time, OA status, where
they were published/deposited, etc.
• view overall funder/funding stream statistics (facets over time, data
source, institution, etc.)
• correlate author/institution output with funding information
• visualize clusters of publications/data or funding based on their
interlinking (national or ERA-wide level).
Using the OpenAIRE portal, funders can
55
Co-funded publications
56
Monitor and reporting: statistics service, portal info
57
Ongoing changes – Organization Page:Better funding and project information access (page and app box)
58
LEARN ON POLICIES AND HOW TO COMPLY
HELPDESK
Ask a question
FAQs
RESOURCES
OA H2020 guide
Copyright Issues
H2020 factsheets
TRAINING
Webinars
Workshops
DATA PROVISION
3rd party providers
61
OAI-PMH, REST APIs, LOD
62
PROMOTE & INFORM
OpenAIRE services and tools for Open Research Data in H2020 - IDCC 2017 Workshop 63
General Information
64
ANONYMIZATIONData anonymization made easy
65
anonymization
Original data anonymous data
Data anonymizationData anonymization
• Removal of direct identifiers, e.g., Names, SSN etc.
• Removal of infrequent combinations of quasi-
identifiers, e.g., unique combinations of birth dates
and zipcodes.
• Infrequent combinations are removed through
generalization, e.g., birth date 14/01/1977 becomes
**/**/1977.
66
Amnesia• Scalable anonymization tool
• It offers several versions of k-anonymity
• It allows the user to select and customize possible solutions
• It offers graphical tools that allow the user to analyze the
anonymized dataset
• Web service or stand alone
• Integrated within research deposition workflows in
repositories/Zenodo
67
This is where you type in the event 68
http://ec.europa.eu/research/openscience/index.cfm
http://ec.europa.eu/research/openscience/pdf/os_skills_wgreport_final.pdf#view=fit&pagemode=none
http://ec.europa.eu/research/openscience/pdf/os_rewards_wgreport_final.pdf#view=fit&pagemode=none
http://ec.europa.eu/research/openscience/index.cfm?pg=open-science-policy-platform
This is where you type in the event 72
http://ec.europa.eu/research/openscience/index.cfm?pg=home§ion=monitor
Open Science publishingSupporting reproducibility and transparent evaluation
Research
data
Research
methods
e-infra
Tools & Services
Research
data
Scientific process
Research literature:
Articles, docs, white papers
Publishing 01101010
01100001
11010010
01101010
01100001
11010010
Research
Communication
Infrastructure
Repeat/Reproduce/Reuse
and Evaluation
What does Open
Science
Publishing mean?
Open Science publishingSupporting reproducibility and transparent evaluation
Research
data
Research
methods
e-infra
Tools & Services
Research
data
Scientific process
Research literature:
Articles, docs, white papers
Publishing 01101010
01100001
11010010
01101010
01100001
11010010
Publication
Repository
01101010
01100001
11010010
Data
Repository
Software
Repository
01101010
01100001
11010010
01101010
01100001
11010010
Package
RepositoryEnabling
Reproducibility
cita
tion
citation
Enabling
Transparent
evaluation
Other products:
methods, workflows,
protocols
Open Science publishing: enabling factors
Publishing of all kinds of research artefacts
Publishing packages of artefacts
Publishing an up-to-date record of research artefacts metadata and links
Enabling transparent
evaluation
Enabling reproducibility
Open Science Publishing: barriers
Repositories lack support to Open Science publishing
No support for integration of repositories for software, methods, or packages
Minimal or no support for links between artefacts in different repositories
No support for keeping repositories with up-to-date links between artefacts
Research communities lack culture of Open Science publishing
Lack of e-infrastructure and tools for Open Science: e.g. repository limits above, exchange formats, workflows
Difficulties to self-organize and sustain research communication solutions: e.g. identify the problems, see the benefits, devise solutions, applying economy of scale
Open Science as-a-Service (OSaaS)
Catch-All-Notification
BrokerSoftware
Packages
Articles DataProjects
Research Community
Dashboard
Harvesting
Search-Browse-Monitor-
Research Impact
Subscription & Notification
ArticlesData
Researchers
Content Providers
Articles
Data
ProjectsMethods
Software
Open Science as-a-Service
Research Community Dashboard
78
OpenAIRE OSaaS: methods and packages
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
fund
fund
Harvest Harmonize
De-duplicate Inference
01101010
01100001
11010010
Repositories of publications, datasets, projects, methods, packages
• Metadata description for methods and packages: citation and
reproducibility (e.g. Research Objects, Rmap)
• Interoperability guidelines for exchanging packages of interlinked
artefacts: enabling exchange of information across research
communication infrastructure
Dashboard for Research Communities
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
fund
Harvest Harmonize
De-duplicate Inference
Repositories of publications, datasets, projects, methods, packages
Research Community Service
Research
Community
Operator
01101010
01100001
11010010
fund
Request
Dashboard
for Community
Dashboard for Research Communities
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
fund
Harvest Harmonize
Deduplicate Inference
Repositories of publications, datasets, projects, methods, packages
Research Community Service
Research
Community
Operator
Researchers
01101010
01100001
11010010
• Deposit (DOI)
• Claim• Manage users
• Configure stats
• Configure inference
fund
Dashboard for Research Communities
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
fund
Harvest Harmonize
Deduplicate Inference
Repositories of publications, datasets, projects, methods, packages
Research Community Service
Research
Community
Operator
Researchers
01101010
01100001
11010010
• Deposit (DOI)
• Claim
• Stats: research impact & OA
• Manage users
• Configure stats
• Configure inference
fund
Dashboard for Research Communities
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
01101010
01100001
11010010
fund
Harvest Harmonize
Deduplicate Inference
Repositories of publications, datasets, projects, methods, packages
Research Community Service
Research
Community
Operator
Researchers
01101010
01100001
11010010
• Deposit (DOI)
• Claim
• Stats: research impact & OA
• Manage users
• Configure stats
• Configure inference
fund
Research Communities and Open Science benefits
• Can continue their publishing practices, but, if needed they have support for deposition of any artefact
Common repository for publishing (deposition) of datasets, methods, and packages
• Community information space to share, discovery, and reuse (reproduce) scientific results
Collaborative curation of a community-specific research communication domain
• Scientific reward strategies can be developed
Research impact and statistics
www.openaire.eu
@openaire_eu
facebook.com/groups/openaire
linkedin.com/groups/OpenAIRE-3893548