Recommendations Improve the Search Experience. Innovations in Activity Data workshop, 04 July 2011. Richard Nurse. http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/jm3/2779185414/sizes/m/in/photostream/


Page 1: Document

Recommendations Improve the Search Experience

Innovations in Activity Data workshop, 04 July 2011

Richard Nurse

http://www.open.ac.uk/blogs/rise

http://www.flickr.com/photos/jm3/2779185414/sizes/m/in/photostream/

Page 2: Document

http://www.open.ac.uk/blogs/rise

Can you use search data to

make recommendations?

Are recommendations

useful for Discovery systems?

http://www.flickr.com/photos/mag3737/1419690363/sizes/m/in/photostream/

Recommendations Improve the Search Experience?

Page 3: Document

http://www.open.ac.uk/blogs/rise

JISC funded project

February – July 2011

One of eight projects [list at http://bit.ly/gwCmNS]

http://www.flickr.com/photos/mag3737/3069729100/sizes/m/in/photostream/

JISC Activity Data Programme

Page 4: Document

http://www.open.ac.uk/blogs/rise

Usage data

Attention data

http://www.flickr.com/photos/mag3737/2326898219/sizes/m/in/photostream/

What is Activity Data?

Page 5: Document

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/zerimski/5215633183/sizes/z/in/photostream/

"Every day I wake up and ask, 'how can I flow data better, manage data better, analyse data better?'"

Rollin Ford, the CIO of Wal-Mart

So what’s the point of this activity data stuff then?

Page 6: Document

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resources

http://www.flickr.com/photos/xq311z/2468769929/sizes/m/in/photostream/

Library activity data

Page 7: Document

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/neilwykes/134162792/sizes/z/in/photostream/

OU environment

Page 8: Document

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resources


http://www.flickr.com/photos/stefz/7913287/sizes/m/in/photostream/

Library activity data

Page 9: Document

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/julietteculver/4731004168/in/photostream

OU environment

Page 10: Document

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resources


http://www.flickr.com/photos/cassidy/352549326/sizes/m/in/photostream/

Library activity data

Page 11: Document

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resources

http://www.flickr.com/photos/lexnger/116314355/sizes/m/in/photostream

Library activity data

Page 12: Document

http://www.open.ac.uk/blogs/rise

Ebsco Discovery Solution

SFX knowledge base and OpenURL link resolver

EZProxy remote user authentication

Athens DA authentication built into local (SAMS) login system

http://www.flickr.com/photos/nataliesap/3553982299/sizes/m/in/photostream/

OU systems environment

Page 13: Document

Scope of the project

Page 14: Document

So what about collecting more data?

http://www.open.ac.uk/blogs/rise http://library.open.ac.uk/rise

Page 15: Document

So what about collecting more data?

http://www.open.ac.uk/blogs/rise

Page 16: Document

http://www.open.ac.uk/blogs/rise

Page 17: Document

http://www.open.ac.uk/blogs/rise

Page 19: Document

http://www.open.ac.uk/blogs/rise

Page 20: Document

E-journals E-journal articles E-books

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/smallritual/5393527886/sizes/m/in/photostream/

What resources are involved?

Page 21: Document

http://www.open.ac.uk/blogs/rise

EZProxy

SFX

EDS

VLE

website

http://www.flickr.com/photos/cdevers/2665335157/sizes/m/in/photostream/

bookmarklet

What data is RISE using?

Page 22: Document

http://www.open.ac.uk/blogs/rise

• Remote host
• Date/Time
• Oucu
• Request
• Status
• Size of response
• Referrer
• User agent
• Session

http://www.flickr.com/photos/vincentgallegos/5123100365/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 23: Document

http://www.open.ac.uk/blogs/rise

"0"|||"137.108.143.168"|||20110115235421|||"nn1234"|||"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1"|||302|||0|||"http://library.open.ac.uk/"|||"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13"|||"t3ShtizgtrS7tU5"

http://www.flickr.com/photos/smohundro/2449517861/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 24: Document

http://www.open.ac.uk/blogs/rise

"0"|||"137.108.143.168"|||20110115235421|||"nn1234"|||"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1"|||302|||0|||"http://library.open.ac.uk/"|||"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13"|||"t3ShtizgtrS7tU5"

date and time

http://www.flickr.com/photos/adactio/225402453/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 25: Document

http://www.open.ac.uk/blogs/rise

"0"|||"137.108.143.168"|||20110115235421|||"nn1234"|||"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1"|||302|||0|||"http://library.open.ac.uk/"|||"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13"|||"t3ShtizgtrS7tU5"

User name

http://www.flickr.com/photos/dlytle/422738735/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 26: Document

http://www.open.ac.uk/blogs/rise

"0"|||"137.108.143.168"|||20110115235421|||"nn1234"|||"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1"|||302|||0|||"http://library.open.ac.uk/"|||"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13"|||"t3ShtizgtrS7tU5"

Request

http://www.flickr.com/photos/spoinknet/35410171/sizes/m/in/photostream/

So what is in the EZProxy logs?
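
To illustrate how a log line like the one on these slides could be broken into the listed fields, here is a minimal Python sketch. It assumes the `|||` delimiter and straight double quotes around text fields; the field names, the `parse_line` and `search_term` helpers, and the reading of the `bquery` parameter as the user's search term are illustrative assumptions, not RISE project code. The sample line starts with a `"0"` column that the field list does not explain, so it is kept under a placeholder name.

```python
from datetime import datetime
from urllib.parse import urlparse, parse_qs

# Field names follow the slide's field list; "unknown" is a placeholder
# for the unexplained leading column.
FIELDS = ["unknown", "remote_host", "timestamp", "oucu", "request",
          "status", "size", "referrer", "user_agent", "session"]

SAMPLE = (
    '"0"|||"137.108.143.168"|||20110115235421|||"nn1234"|||'
    '"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5'
    '&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive'
    '&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip'
    '&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1"|||'
    '302|||0|||"http://library.open.ac.uk/"|||'
    '"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) '
    'Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13"|||'
    '"t3ShtizgtrS7tU5"'
)


def parse_line(line):
    """Split one log line on '|||' and convert the typed fields."""
    values = [value.strip().strip('"') for value in line.split("|||")]
    record = dict(zip(FIELDS, values))
    record["timestamp"] = datetime.strptime(record["timestamp"], "%Y%m%d%H%M%S")
    record["status"] = int(record["status"])
    record["size"] = int(record["size"])
    return record


def search_term(record):
    """Return the decoded 'bquery' value from the request URL, if present."""
    parts = record["request"].split(" ")
    if len(parts) != 3:
        return None
    params = parse_qs(urlparse(parts[1]).query)
    terms = params.get("bquery")
    return terms[0] if terms else None


record = parse_line(SAMPLE)
print(record["oucu"], record["timestamp"], search_term(record))
```

Splitting on the delimiter rather than using a regular expression keeps the sketch short, but it would break if any field ever contained `|||` itself.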

Page 27: Document

http://www.open.ac.uk/blogs/rise

RISE database

http://www.flickr.com/photos/vanderwal/279135451/sizes/m/in/photostream/

Page 28: Document

http://www.open.ac.uk/blogs/rise

RISE database

Remote host | Date/Time | Oucu | request | status | size of response | referrer | user agent | session

user type | course code(s)

EZProxy

CIRCE

http://www.flickr.com/photos/dw/4950924376/sizes/m/in/photostream/
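
The slide suggests that the RISE database combines the EZProxy log fields with a user type and course code(s) drawn from CIRCE. The Python sketch below shows one way such a join could look. The slides do not describe how CIRCE is queried, so the `CIRCE_LOOKUP` dictionary, its contents, and the `enrich` helper are hypothetical stand-ins.

```python
# Hypothetical stand-in for a CIRCE lookup keyed by OUCU; the values are invented.
CIRCE_LOOKUP = {
    "nn1234": {"user_type": "student", "course_codes": ["A123"]},
}


def enrich(log_record, lookup=CIRCE_LOOKUP):
    """Return a copy of a parsed log record with user type and course codes added."""
    extra = lookup.get(log_record["oucu"], {})
    enriched = dict(log_record)
    enriched["user_type"] = extra.get("user_type")
    enriched["course_codes"] = list(extra.get("course_codes", []))
    return enriched


row = enrich({"oucu": "nn1234", "request": "GET http://example.invalid/ HTTP/1.1"})
print(row["user_type"], row["course_codes"])
```

Users missing from the lookup keep their log fields and get an empty course list, so they can still feed recommendations that do not depend on course.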

Page 29: Document

http://www.open.ac.uk/blogs/rise

RISE database

http://www.flickr.com/photos/vanderwal/279135451/sizes/m/in/photostream/

Page 30: Document

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/clankennedy/3022286303/sizes/m/in/photostream/

ISSNs DOI

Article information

Subject terms

But what isn’t there?

Page 31: Document

People on course ‘A’ viewed resource ‘B’

People who looked at resource ‘C’ also looked at resource ‘D’

What can the data tell us?

http://www.open.ac.uk/blogs/rise

Course recommendation

Relationship recommendation

Which are the most popular resources, subjects

http://www.flickr.com/photos/vblibrary/5190554053/sizes/m/in/photostream/

Page 32: Document

So what about collecting more data?

http://www.open.ac.uk/blogs/rise

Page 33: Document

http://www.open.ac.uk/blogs/rise

RISE database

http://www.flickr.com/photos/vblibrary/5052421946/sizes/m/in/photostream/

Page 34: Document

Remote host | Date/Time | Oucu | request | status | size of response | referrer | user agent | session

user type | course code(s)

Searches in RISE

EZProxy

CIRCE

RISE

So how do you improve your data?

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/suttonhoo22/5004250051/sizes/m/in/photostream/

Page 35: Document

People on course ‘A’ viewed resource ‘B’

People who looked at resource ‘C’ also looked at resource ‘D’

People who searched for subject ‘E’ looked at resource ‘F’

What can the data tell us?

http://www.open.ac.uk/blogs/rise

Course recommendation

Search recommendation

Relationship recommendation

http://www.flickr.com/photos/vblibrary/4581698063/sizes/m/in/photostream/

People are looking at resources on this subject

Subject data

This resource is being used by people studying this course

Resource management
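
The three recommendation types on this slide can each be read as a simple count over access records. The Python sketch below illustrates that reading only. The record layout `(user, course, search_term, resource)`, the sample rows, and the function names are assumptions; the slides do not show how RISE computes its recommendations.

```python
from collections import Counter

# Invented sample rows: (user, course, search_term, resource)
ACCESSES = [
    ("u1", "A123", "panthers", "res_B"),
    ("u2", "A123", "panthers", "res_B"),
    ("u2", "A123", "civil rights", "res_D"),
    ("u3", "B456", "panthers", "res_F"),
    ("u1", "A123", "civil rights", "res_D"),
]


def course_recs(accesses, course):
    """Course recommendation: resources viewed by people on a course."""
    counts = Counter(res for _, c, _, res in accesses if c == course)
    return [res for res, _ in counts.most_common()]


def search_recs(accesses, term):
    """Search recommendation: resources viewed after a given search term."""
    counts = Counter(res for _, _, t, res in accesses if t == term)
    return [res for res, _ in counts.most_common()]


def relationship_recs(accesses, resource):
    """Relationship recommendation: other resources viewed by the same users."""
    users = {u for u, _, _, res in accesses if res == resource}
    counts = Counter(res for u, _, _, res in accesses
                     if u in users and res != resource)
    return [res for res, _ in counts.most_common()]


print(search_recs(ACCESSES, "panthers"))
print(relationship_recs(ACCESSES, "res_B"))
```

Ordering by raw view count is the simplest choice; a production system would probably also weight by recency, as one of the interviewed students suggests later in the deck.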

Page 36: Document

So how do you get a recommendation?

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/will-lion/2442088335/sizes/m/in/photostream/

Page 37: Document

So how do you get a recommendation?

http://www.open.ac.uk/blogs/rise

User A Course A123

Resource B RV=14

Views

User C Course A123

Resource B RV=15

Recommended +1

Resource B RV=16

Views +1

User C Course A123

Resource B RV=17

Rate Useful +1

User C Course A123

Resource B RV=14

Rate Not Useful -2

Resource B RV=15

Views +1
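
One reading of this flattened diagram is that each resource carries a running RV score: a view adds 1, a view that follows a recommendation adds 1, a "useful" rating adds 1, and a "not useful" rating subtracts 2. The Python sketch below encodes that reading only. The event names and the `apply_events` helper are hypothetical, the slide does not spell out what RV stands for, and the diagram's original layout may support a different sequence.

```python
# Weights as read from the slide; the event names are illustrative.
WEIGHTS = {
    "view": 1,
    "recommended_view": 1,
    "rate_useful": 1,
    "rate_not_useful": -2,
}


def apply_events(rv, events):
    """Apply a sequence of events to a resource's RV score and return the result."""
    for event in events:
        rv += WEIGHTS[event]
    return rv


# Resource B starts at RV=14; User A views it, then User C follows the
# recommendation and rates it one way or the other.
liked = apply_events(14, ["view", "recommended_view", "rate_useful"])
disliked = apply_events(14, ["view", "recommended_view", "rate_not_useful"])
print(liked, disliked)
```

Under this reading a single "not useful" rating cancels two views, which is consistent with the RV values of 17 and 14 shown on the slide.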

Page 38: Document

http://www.open.ac.uk/blogs/rise

Interface usage

Page views

Browser 7,462

Gadget 855

Page 39: Document

Added a privacy policy to RISE, EDS and SFX interfaces

Provided an opt-out feature

http://www.open.ac.uk/blogs/rise

Privacy and opt-out URL

http://library.open.ac.uk/rise/?page=privacy

Data Protection and privacy

Page 40: Document

Release data

openly

E-Resource accesses

Search terms

Course subjects

http://www.open.ac.uk/blogs/rise

Open Data

http://www.flickr.com/photos/narisa/2720873442/sizes/m/in/photostream/

Page 41: Document

http://www.open.ac.uk/blogs/rise

Anonymization

Ensure compliance with Data Protection requirements

Get agreement to release data

Page 42: Document

Remove the user name

Remove all records for courses with fewer than x students

Replace the course code with a generic subject

http://www.open.ac.uk/blogs/rise

Anonymization
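
The three anonymization steps on this slide can be sketched in a few lines of Python. This is an illustration under stated assumptions: the record layout, the `COURSE_SUBJECT` mapping, and the `anonymize` helper are hypothetical, and the threshold `min_students` stands in for the slide's unspecified x. Course size is approximated here by the number of distinct users seen in the data, whereas a real release would presumably use enrolment figures.

```python
from collections import defaultdict

# Hypothetical mapping from course code to a generic subject label.
COURSE_SUBJECT = {"A123": "Arts", "M456": "Mathematics"}


def anonymize(records, min_students, course_subject=COURSE_SUBJECT):
    """Drop user names, suppress small courses, and generalise course codes."""
    # Count distinct users per course before the user names are removed.
    students = defaultdict(set)
    for rec in records:
        students[rec["course"]].add(rec["oucu"])

    released = []
    for rec in records:
        if len(students[rec["course"]]) < min_students:
            continue  # course is below the threshold, so omit its records
        out = {k: v for k, v in rec.items() if k not in ("oucu", "course")}
        out["subject"] = course_subject.get(rec["course"], "Unknown")
        released.append(out)
    return released


sample = [
    {"oucu": "aa1", "course": "A123", "resource": "res_B"},
    {"oucu": "bb2", "course": "A123", "resource": "res_D"},
    {"oucu": "cc3", "course": "M456", "resource": "res_F"},
]
print(anonymize(sample, min_students=2))
```

Counting before stripping matters: once the user name is gone, there is no way to tell whether a small course's records came from one person or several.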

Page 43: Document

Data formats and standards

XML

KE Usage Statistics standard

OpenURL

CSV

MOSAIC

Linked Data

http://www.flickr.com/photos/mararie/4121128381/sizes/m/in/photostream/

Page 44: Document

http://www.open.ac.uk/blogs/rise

“That recommender systems can enhance the student experience in new generation e-resource discovery services”

http://www.flickr.com/photos/mag3737/3318791086/sizes/m/in/photostream/

Hypothesis

Page 45: Document

http://www.open.ac.uk/blogs/rise

Review of web analytics

Face to Face interviews

Online Survey

http://www.flickr.com/photos/8lettersuk/148685757/sizes/m/in/photostream/

Evaluation

Page 46: Document

http://www.open.ac.uk/blogs/rise

Survey results 1

Not useful 17%

Quite useful 17%

Very useful 30%

Not used 4%

Not sure 9%

Not applicable 22%

These resources may be related to others you've viewed recently

Page 47: Document

http://www.open.ac.uk/blogs/rise

Survey results 2

Not useful 22%

Slightly useful 9%

Quite useful 9%

Very useful 17%

Not used 9%

Not applicable 35%

People on your course(s) viewed

Page 48: Document

http://www.open.ac.uk/blogs/rise

Survey results 3

Not useful 17%

Quite useful 13%

Very useful 30%

Not used 17%

Not applicable 22%

People using similar search terms often viewed

Page 49: Document

http://www.open.ac.uk/blogs/rise

Survey results 4

Not relevant 35%

Slightly relevant 13%

Quite relevant 30%

Very relevant 17%

Not used 4%

How relevant were the recommendations?

Page 50: Document

http://www.open.ac.uk/blogs/rise

Face to Face evaluation

http://www.flickr.com/photos/ryanhealy/3729881896/sizes/m/in/photostream/

Undergraduates

Like ratings and reviews from other students

‘other people’s experiences valuable’

Which module studied?

How high a mark?

Postgraduates

Citation as a recommendation

Wary of provenance

Feed to module website

Want synonyms

Trust repository

Page 51: Document

http://www.open.ac.uk/blogs/rise

Should we have a recommender system?

“I think it would be a very good useful feature. It would be definitely very very useful” postgraduate Maths student

“So it would be interesting to see what other people are looking at. Yes, I would definitely use that because my limited knowledge of the library might mean that other people were using slightly different ways of searching and getting different results.” undergraduate English Literature student

I have just had a go, it was good with suggested papers that I had already found (which shows potential in my view) through Google.

http://www.flickr.com/photos/earlg/337743409/sizes/m/in/photostream/

Page 52: Document

http://www.open.ac.uk/blogs/rise

Should we have a recommender system?

“I'm afraid my first reaction is to be a bit sceptical - it presumably doesn't tell you if fellow students found the information/article useful or relevant to what they were looking for. I would hate to waste time following unproductive links laid down by others who might be failing students or think that any "lazy" students might develop poor practice by relying on what others had looked at. It sounds like a good idea but I think caution needs to be exercised.”

http://www.flickr.com/photos/rob-sinclair/2189457309/sizes/m/in/photostream/

Page 53: Document

http://www.open.ac.uk/blogs/rise

Why they prefer course-related recommendations

“I can’t be bothered with knowing what everybody else is interested in. I take a really operational view you know, I’m on here, I want to get the references for this particular piece of work, and those are the people that are most likely to be doing a similar thing that I can use.” H800 student

“I suppose if I wasn’t so sure on an assignment it would perhaps be quite useful to see what other people were looking at to know if I was thinking along the right lines.” - Undergrad literature student

Page 54: Document

http://www.open.ac.uk/blogs/rise

Suggestions for improvement

“Maybe include a date. It would be interesting to know when a resource was last looked at” Postgraduate political philosophy student

“If somebody used similar search but three years ago, is that going to carry the same weight?” Postgraduate maths student

Include course drop-down choice. “I would be looking at that and saying ‘which of my courses does it refer to?’”

Page 55: Document

http://www.open.ac.uk/blogs/rise

Rating the recommendations

8 out of 10 so far would be happy to rate the recommendations

Most people understood why we were asking them to rate it.

Page 56: Document

http://www.open.ac.uk/blogs/rise

Recommendations usage

Search 40%

Course 36%

Relationship 24%

Page 57: Document

http://www.open.ac.uk/blogs/rise

Recommendations usage

[Chart: "People using similar search terms often viewed"; axis labels 1 to 4 and 0 to 1000; plotted values not recoverable]

http://www.flickr.com/photos/antrover/5810373016/sizes/m/in/photostream/

Page 58: Document

http://www.open.ac.uk/blogs/rise

Recommendations usage

[Chart: "People on your course(s) viewed"; axis labels 1 to 4 and 0 to 700; plotted values not recoverable]

http://www.flickr.com/photos/exitfestival/5835349579/sizes/m/in/photostream/

Page 59: Document

http://www.open.ac.uk/blogs/rise

Recommendations usage

[Chart: "These resources may be related to others you've viewed recently"; axis labels 1 to 4 and 0 to 500; plotted values not recoverable]

http://www.flickr.com/photos/oldton_tim/479685673/sizes/m/in/photostream/

Page 60: Document

http://www.open.ac.uk/blogs/rise

Recommendations usage

[Chart: series Relationship, Course and Search; axis labels 1 to 4 and 0 to 2000; plotted values not recoverable]

http://www.flickr.com/photos/smin/613285324/sizes/m/in/photostream/

Page 61: Document

•EZProxy data

•Use other data sources

•Search terms

•Need more data!

•Users like recommendations ‘in principle’

•Recommendations provenance

•Interest in the search tools

•Quality of recommendations isn’t high

•Limited use of it so far

http://www.open.ac.uk/blogs/rise

Interim findings

Page 62: Document

Release of code via Google Code

Release of data

Complete evaluation work

Final blog posts and write ups

Dissemination

Still to do

Page 63: Document

Recommendations Improve the Search Experience

Innovations in Activity Data workshop, 04 July 2011

Richard Nurse

http://www.open.ac.uk/blogs/rise

http://www.flickr.com/photos/jm3/2779185414/sizes/m/in/photostream/