NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils...
Transcript of NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils...
![Page 1: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/1.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
Usage data: Workshop Objectivesfrom the perspective of theDINI - DFG - JISC projects
Frank ScholzeNils Windisch
JISC Usage Statistics Workshop
Humboldt University Berlin, Erwin Schrödinger Center, 7.-8. July 2008
![Page 2: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/2.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
• open-access.net
• OA Repository Network
• OA Statistics
• OA Citations
• DRIVER
2
• JISC usage statistics review !
Cluster of projects
![Page 3: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/3.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
3
DINI certificate on usage statistics
• Minimum standards– Every document repository must keep consistent access
statistics (web server log files)– Web server log files must be anonymized for long-term
archiving purposes– Criteria used to collect or filter the data must be
documented
• Recommendations– Access to documents by automated agents, robots or
similar is filtered out (and documented)– Web server log files are edited according to the Counter
Code of Practice– Access statistic is attached to the document as dynamic
metadata and visible to the end-user
![Page 4: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/4.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
What do we count?
• Practical definition of meaningful items– Files?– Publications (journal articles etc.)?
• Identification of meaningful items– Checksums– Persistent identifiers– Distributed heterogeneous publication
network
4
![Page 5: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/5.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
How do we count?
• Practical and pragmatic definition of usage– Access– Click spans– Definition of non-human access– Pseudonymization– Deleting or tagging– Sessions
5
![Page 6: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/6.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
6
How do we aggregate?
• Technically– processing on which level– OpenUrl ContextObjects– SUSHI
• Organisationally– DRIVER– OA Repository Network– …
• Co-operation
![Page 7: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/7.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
What do we report?
• Access over time
• Sources of aggregation
• Standards for processing
Transparency on the what and the how of counting
7
Johan will tell us more
![Page 8: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/8.jpg)
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DEU
TSC
HE
INIT
IATI
VE F
ÜR
NET
ZWER
KIN
FOR
MA
TIO
N E
.V.
DataMining
Filtering
Metrics
Services
Aggregatedlogs
Log DB
OpenURLContextObjects
LogRepository
Link Resolver
LogRepository
Link Resolver
LogRepository
Log harvester(Service Provider)
COCOCO
COCOCO
COCOCO
Aggregated Usage Data
Log DBWebserver
-Log
Aggregated Usage Data
Rewritemodule
Normalise (optional) -> Robots, psydonymization
OpenURLContextObjects
or SUSHI
Normalise
Infrastructure for aggregating usage data
e.g.
e.g.
Based on: Bollen and Van de Sompel, OAI4, Geneva
![Page 9: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/9.jpg)
Open Access Statisticsrealize what others had in mind…
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
![Page 10: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/10.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
Project
• Funded by: DFG (German Research Foundation)
• 18 months 2008-07-01 – 2009-12-31
• Partner: Berlin (CMS) Göttingen (SUB), Saarbrücken (SUUB), Stuttgart (UB)
![Page 11: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/11.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
Context
• One of three projects:
• Open Access Network of Repositories (OA-N)
• Open Access Citation (DOARC, Distributed Open-Access Reference Citation services)
• Open Access Statistics (OA-S)
![Page 12: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/12.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
Objectives
• Aggregate and normalize usage data locally
• Act as data provider
• Collect data at service provider level
• Process data to provide added values services
![Page 13: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/13.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
![Page 14: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/14.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
![Page 15: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/15.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
![Page 16: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/16.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
Deal with usage date
• Different sources
• License server (HAN-Server)
• Link resolver (SFX)
• Repository software (DSpace, OPUS, e-doc, etc.)
![Page 17: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/17.jpg)
Repository software (DSpace)JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
![Page 18: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/18.jpg)
License server (HAN)JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
![Page 19: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/19.jpg)
Link resolver (SFX)JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
![Page 20: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/20.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
Map usage data
• Use established formats and rule sets
• COUNTER
• IFABC
• LogEc
![Page 21: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/21.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
from A to B
• SUSHI vs. OAI-PMH + OpenURL CO
• project objectives include evaluation of each approach
• SUSHI/OAI-OMH just a transport container/vehicle
• (Usage) data as XML payload
![Page 22: NETZWERKINFORMATION E.V. Usage data: …JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch (windisch@sub.uni-goettingen.de) Context • One of three projects: •](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa26c506faf207535164d82/html5/thumbnails/22.jpg)
JISC Usage Statistics Workshop, Berlin 2008-07-07/08, Nils K. Windisch ([email protected])
Context
• What about the JISC Usage Statistics Project and Workshop?
• Provide inside information
• Build on experience and expert opinions
• Re-use existing technologies