Legal interoperability in text and data mining: In the framework of open research infrastructures,...
Transcript of Legal interoperability in text and data mining: In the framework of open research infrastructures,...
Presentation’s Subtitle
#openminted_eu
In the framework of open research
infrastructures
Legal interoperability
in text and data mining
Stelios Piperidis
Athena Research & Innovation Centre
Openaire workshop @ RDA, Barcelona,
4 April 2017
Sharing
Discoverability
Processability
Interoperability
Openaire workshop @ RDA, Barcelona, 4 April 2017
2
GOAL: Operationalisation of e-infrastruct
ures
In the world of language technology & text mining
OpenMinted framework & focus
Openaire workshop @ RDA, Barcelona, 4 April 2017
3
OpenMinted sets out to create an open, service-oriented e-Infrastructure for Text and Data Mining (TDM) of scientific and scholarly content.
…
Content/Corpora Services/tools Annotated
corpora
Legal concerns
TDM activities on resources • text corpora
• knowledge resources,
• web services/workflows
• Copyright/SGDB protection vs. TDM
exception
• Licensing proliferation and interoperability
• Legal metadata: making it all human-
readable & machine readable
Openaire workshop @ RDA, Barcelona, 4 April 2017
4
The e-Infrastructure era
Openaire workshop @ RDA, Barcelona, 4 April 2017
COREpublishers
contentLINGUISTIC
ANNOTATIONENTITY
EXTRACTION
ENTITY
RELATION
EXTRACTION
web services
Ontologies Lexica/models
"ancillary" resources
Scientific pubs =
Research data
Linguistic
annotation
Entity relation
ExtractionEntity
Extraction
OpenAIRE
Legal framework
Openaire workshop @ RDA, Barcelona, 4 April 2017
COPYRIGHT EXCEPTION: in EU, only in UK
license (one or more)
formal statements/ categories
free-text stmts
terms of use / service
contractual agreements
I have read and accept the terms of use
I have read and accept the terms of useI have read and acce
pt the terms of use
I have read and accept the terms of use
I have read and accept the terms of use
I have read and accept the terms of use
no license!
Scientific pubs =
Research data
Linguistic
annotation
Entity relation
ExtractionEntity
Extraction
In OpenMinted …
Openaire workshop @ RDA, Barcelona, 4 April 2017
7
Scientific pubs =
Research data
Linguistic
annotation
Entity relation
ExtractionEntity
Extraction
ANNOTAT
ED
DATASET
DERIVED
KNOWLED
GE
I have read and acc
ept the terms of use
Compute "mashed up"
summary of licenses
and Tos
Compute recommended
Licenses for Annotation
s/ Derived Knowledge
Ontologies Lexica/models
Interoperability: multi-layer approach
Openaire workshop @ RDA, Barcelona, 4 April 2017
8
Scientific pubs =
Research data
Linguistic
annotation
Entity relation
ExtractionEntity
Extraction
ANNOTATE
D
DATASET
DERIVED
KNOWLEDG
E
1st layer
2nd layer
at the level of
licensing conditions
SCIENTIFIC
DATA
PROCESSING TOOLS/SERVICES
compatibility matrix
Openaire workshop @ RDA, Barcelona, 4 April 2017
LICENCE A LICENCE B LICENCE C
Attribution Attribution Retain notice
Non-
commercial use
.. ..
Share Alike
By "open access" to this literature, we mean its free availability on
the public internet, permitting any users .. (Budapest Open Access
Initiative)
BY
Implementation & Future steps
Human-readable summary??
open ACCESS =? FREE TO MINE!
(ideally yes)
HARMONISED VOCABULARY –
rigidness of semantics
machine readable MACHINE ACTIONS!
twitter.com/openminted_eu
facebook.com/openminted
bit.do/openmintedlinkedin
vimeo.com/openminted
bit.do/openmintedplus
THANK YOU!Stelios piperidis
twitter.com/openminted_eu
facebook.com/openminted
bit.do/openmintedlinkedin
vimeo.com/openminted
bit.do/openmintedplus10