Developing Recommendation Techniques for Scholarly Papers

Developing Recommendation Techniques for Scholarly Papers Kazunari Sugiyama National University of Singapore


Transcript of Developing Recommendation Techniques for Scholarly Papers

Page 1: Developing Recommendation Techniques for Scholarly Papers

Developing Recommendation Techniques for Scholarly Papers

Kazunari Sugiyama

National University of Singapore

Page 2: Developing Recommendation Techniques for Scholarly Papers

Previous Research Topics

• Web Information Retrieval (@NAIST)
    • How to Characterize Web Pages
    • User Adaptive Information Retrieval

• Disambiguation (@TITECH)
    • Personal Name Disambiguation in Web Search Results
    • Word Sense Disambiguation in Japanese Texts

Page 3: Developing Recommendation Techniques for Scholarly Papers

Scholarly Paper Recommendation (@NUS)

• Junior researchers: only one recently published paper, without citations

• Senior researchers: multiple published papers, with citation papers

Page 4: Developing Recommendation Techniques for Scholarly Papers

User Profile Construction (Junior Researchers)

[Figure labels: Weighting scheme; Cosine similarity]

Page 5: Developing Recommendation Techniques for Scholarly Papers

User Profile Construction (Senior Researchers)

[Figure labels: Weighting scheme; Cosine similarity; Forgetting factor]
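The slide names a forgetting factor but the transcript does not preserve its form. The following is a minimal sketch, assuming an exponential decay `exp(-gamma * d)` where `d` is the age in years of a past paper relative to the newest one. The decay form, the parameter `gamma`, and the function names are illustrative assumptions, not taken from the slides.

```python
import math

def forgetting_weight(years_ago, gamma=0.2):
    """Assumed decay: 1.0 for the newest paper, smaller for older ones."""
    return math.exp(-gamma * years_ago)

def build_senior_profile(papers, gamma=0.2):
    """papers: list of (publication_year, {term: weight}) pairs.

    Returns a {term: weight} profile in which older papers count less.
    """
    newest = max(year for year, _ in papers)
    profile = {}
    for year, vec in papers:
        w = forgetting_weight(newest - year, gamma)
        for term, value in vec.items():
            profile[term] = profile.get(term, 0.0) + w * value
    return profile
```

With this form, a small `gamma` keeps older papers nearly as influential as recent ones, while a large `gamma` makes the profile depend almost entirely on the latest work.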

Page 6: Developing Recommendation Techniques for Scholarly Papers

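Feature Vector Construction for Candidate Papers

• Basically, TF-IDF

• Also use information about citation and reference papers

[Figure: candidate paper $p_{rec}$ with a citation paper $p_{c_1 \to p_{rec}}$ and a reference paper $p_{rec \to ref_1}$ from its References; labels: Weighting scheme, Cosine similarity]

$F^{p_{rec}} = f^{p_{rec}} + W^{p_{c_1 \to p_{rec}}} f^{p_{c_1 \to p_{rec}}} + W^{p_{rec \to ref_1}} f^{p_{rec \to ref_1}}$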
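The construction on this slide can be sketched as follows. The sketch assumes that each weight W is the cosine similarity between the candidate paper's own vector f and the vector of the citation or reference paper, which is one reading of the "Weighting scheme / Cosine similarity" labels. The function names and the sparse-dictionary representation are illustrative assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

def feature_vector(f_rec, neighbours):
    """Combine a paper's own vector with its citation and reference papers.

    f_rec: {term: weight} vector of the candidate paper (e.g. TF-IDF).
    neighbours: vectors of its citation and reference papers.
    Each neighbour is added after scaling by its cosine similarity to f_rec.
    """
    combined = dict(f_rec)
    for f_nb in neighbours:
        w = cosine(f_rec, f_nb)
        for term, value in f_nb.items():
            combined[term] = combined.get(term, 0.0) + w * value
    return combined
```

A neighbour that shares no terms with the candidate paper gets weight 0 and contributes nothing, so unrelated citations do not dilute the vector.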

Page 7: Developing Recommendation Techniques for Scholarly Papers

Is Pruning of Citation and Reference Papers Effective?

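[Figure: paper $p_i$ with the reference papers from its References and its citation papers, each labeled with its similarity to $p_i$]

    Reference paper        sim
    $p_{i \to ref_1}$      0.18
    $p_{i \to ref_2}$      0.58
    $p_{i \to ref_3}$      0.22
    $p_{i \to ref_4}$      0.36
    $p_{i \to ref_l}$      0.45

    Citation paper         sim
    $p_{c_1 \to p_i}$      0.32
    $p_{c_2 \to p_i}$      0.27
    $p_{c_3 \to p_i}$      0.42
    $p_{c_4 \to p_i}$      0.25
    $p_{c_k \to p_i}$      0.13

Threshold: 0.3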

Page 8: Developing Recommendation Techniques for Scholarly Papers

Is Pruning of Citation and Reference Papers Effective?

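[Figure: paper $p_i$ with the reference papers from its References and its citation papers, each labeled with its similarity to $p_i$; a weight is shown only for papers above the threshold; labels: Weighting scheme, Cosine similarity]

    Reference paper        sim     Weight shown
    $p_{i \to ref_1}$      0.18    -
    $p_{i \to ref_2}$      0.58    $W^{p_{i \to ref_2}}$
    $p_{i \to ref_3}$      0.22    -
    $p_{i \to ref_4}$      0.36    $W^{p_{i \to ref_4}}$
    $p_{i \to ref_l}$      0.45    $W^{p_{i \to ref_l}}$

    Citation paper         sim     Weight shown
    $p_{c_1 \to p_i}$      0.33    $W^{p_{c_1 \to p_i}}$
    $p_{c_2 \to p_i}$      0.27    -
    $p_{c_3 \to p_i}$      0.42    $W^{p_{c_3 \to p_i}}$
    $p_{c_4 \to p_i}$      0.25    -
    $p_{c_k \to p_i}$      0.13    -

Threshold: 0.3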
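The pruning step on this slide can be sketched with the similarity values shown above. The sketch assumes a strict "greater than" comparison against the threshold; the slide does not say how ties are handled, and no value here equals 0.3. The dictionary keys are shorthand labels for the papers, not identifiers from the slides.

```python
def prune(similarities, threshold=0.3):
    """Keep only the papers whose similarity to p_i exceeds the threshold."""
    return {name: sim for name, sim in similarities.items() if sim > threshold}

# Similarity values as shown on the slide.
references = {"ref_1": 0.18, "ref_2": 0.58, "ref_3": 0.22, "ref_4": 0.36, "ref_l": 0.45}
citations = {"c_1": 0.33, "c_2": 0.27, "c_3": 0.42, "c_4": 0.25, "c_k": 0.13}

kept_refs = prune(references)
kept_cits = prune(citations)

print(sorted(kept_refs))  # ['ref_2', 'ref_4', 'ref_l']
print(sorted(kept_cits))  # ['c_1', 'c_3']
```

The surviving papers match the ones for which the slide shows a weight W: reference papers 2, 4, and l, and citation papers 1 and 3.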

Page 9: Developing Recommendation Techniques for Scholarly Papers

Experiments

Experimental Data

• Researchers
    • 15 junior researchers
    • 13 senior researchers
    NLP and IR researchers who have publication lists in DBLP

• Candidate Papers to Recommend
    • ACL Anthology Reference Corpus
    Includes information about citation and reference papers

Page 10: Developing Recommendation Techniques for Scholarly Papers

Junior Researchers

The most recent paper, with pruning of its reference papers

[NDCG@5]

Pruning is effective!
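The results slides report NDCG@5. As a reminder of what that metric measures, here is a minimal sketch of one common formulation, in which each relevance score is discounted by log2(rank + 1) and the total is normalized by the score of the ideal ordering. The transcript does not say which variant was used, so the discount and the choice to compute the ideal ordering from the same judged list are assumptions.

```python
import math

def dcg_at_k(relevances, k=5):
    """Discounted cumulative gain over the top k positions."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances[:k], start=1))

def ndcg_at_k(relevances, k=5):
    """DCG@k divided by the DCG@k of the best possible ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0
```

Here `relevances` lists the relevance judgments of the recommended papers in ranked order, so a ranking that places every relevant paper first scores 1.0.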

Page 11: Developing Recommendation Techniques for Scholarly Papers

Senior Researchers

Past published papers, with forgetting factor

[NDCG@5]

When … and d are small, the forgetting factor (FF) is effective!

Page 12: Developing Recommendation Techniques for Scholarly Papers

Extensions

• Characterize the target paper using potential papers

• Serendipitous recommendation

Page 13: Developing Recommendation Techniques for Scholarly Papers

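Characterize the target paper using potential papers

[Figure: target paper $p_{tgt}$ with citation papers $p_{c_1 \to p_{tgt}}$ through $p_{c_k \to p_{tgt}}$; year labels (‘05), (‘06), (‘07), (‘09); one node is marked "Potential paper that should cite the target paper"]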

Page 14: Developing Recommendation Techniques for Scholarly Papers

Finding potential papers with collaborative filtering

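[Figure: matrix with rows $p_1, p_2, p_3, \dots, p_{tgt}, \dots, p_{N-1}, p_N$ and columns $pc_1, pc_2, pc_3, \dots, pc_i, \dots, pc_{N-1}, pc_N$; only some cells are filled, and the column of each value is not preserved in the transcript]

    Row           Values shown
    $p_1$         0.212  0.735  0.687
    $p_2$         0.656  0.328  0.436
    $p_3$         0.764  0.527
    $p_{tgt}$     0.581  0.330
    $p_{N-1}$     0.248
    $p_N$         0.654  0.525

Further values shown without a recoverable row: 0.536, 0.472, 0.368, 0.211

$p_i$ (i = 1, …, N): all papers in the dataset

$pc_j$ (j = 1, …, N): papers as citation papers in the dataset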
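The slide does not spell out the collaborative filtering procedure, so the following is only a sketch of one standard approach, under stated assumptions. It treats the matrix as rows of papers against columns of citation papers and estimates each empty cell of the target row as a similarity-weighted average of that column in the other rows. The row similarity (cosine over shared columns), the function names, and the toy values in the usage note are all illustrative and are not taken from the slides.

```python
import math

def row_similarity(a, b):
    """Cosine similarity over the columns that both rows have filled in."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[c] * b[c] for c in shared)
    norm_a = math.sqrt(sum(a[c] ** 2 for c in shared))
    norm_b = math.sqrt(sum(b[c] ** 2 for c in shared))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def predict_missing(matrix, target):
    """Estimate the empty cells of matrix[target] from the other rows.

    matrix: {row_name: {column_name: value}} with missing cells omitted.
    Returns {column_name: predicted_value} for columns absent from the target row.
    """
    columns = {c for row in matrix.values() for c in row}
    predictions = {}
    for col in columns - set(matrix[target]):
        num = den = 0.0
        for name, row in matrix.items():
            if name == target or col not in row:
                continue
            s = row_similarity(matrix[target], row)
            num += s * row[col]
            den += abs(s)
        if den > 0:
            predictions[col] = num / den
    return predictions
```

For example, with the made-up matrix `{"p1": {"pc1": 0.2, "pc2": 0.8}, "p2": {"pc1": 0.6, "pc2": 0.4}, "ptgt": {"pc1": 0.5}}`, calling `predict_missing(matrix, "ptgt")` fills in a score for `pc2`. Columns with the highest predicted scores would then be the candidates for potential papers.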

Page 15: Developing Recommendation Techniques for Scholarly Papers

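Characterize the target paper using potential papers

[Figure: target paper $p_{tgt}$ with citation papers $p_{c_1 \to p_{tgt}}$, $p_{c_3 \to p_{tgt}}$, $p_{c_k \to p_{tgt}}$, and $p_{c_N \to p_{tgt}}$; year labels (‘05), (‘06), (‘07), (‘09); one node is marked "Potential paper that should cite the target paper"]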

Page 16: Developing Recommendation Techniques for Scholarly Papers

Serendipitous Recommendation

[Figure: User 1, User 2, User 3, …, User n; labels: "User profile generated from history of contents" and "User profile for serendipitous recommendation"]

Users shown with similarity and weight, in slide order:

• User 4 (Sim: 0.16), Weight: 1/(0.16+1)
• User 10 (Sim: 0.26), Weight: 1/(0.26+1)
• User 5 (Sim: 0.21), Weight: 1/(0.21+1)
• User 1 (Sim: 0.32), Weight: 1/(0.32+1)
• User 1 (Sim: 0.14), Weight: 1/(0.14+1)
• User 7 (Sim: 0.25), Weight: 1/(0.25+1)
• User 6 (Sim: 0.07), Weight: 1/(0.07+1)
• User 2 (Sim: 0.12), Weight: 1/(0.12+1)
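The weight shown on this slide, 1/(Sim + 1), gives less similar users a larger weight. A minimal sketch of how such weights might be applied follows. The weight formula is from the slide, while combining the other users' profiles as a weighted sum, along with the function names, is an assumption.

```python
def serendipity_weight(sim):
    """Weight from the slide: 1 / (sim + 1), so dissimilar users weigh more."""
    return 1.0 / (sim + 1.0)

def serendipitous_profile(neighbours):
    """neighbours: list of (similarity, {term: weight}) pairs for other users.

    Returns a {term: weight} profile as the weighted sum of their profiles.
    """
    profile = {}
    for sim, vec in neighbours:
        w = serendipity_weight(sim)
        for term, value in vec.items():
            profile[term] = profile.get(term, 0.0) + w * value
    return profile

# Similarities for User 4 and User 1, as shown on the slide.
print(round(serendipity_weight(0.16), 3))  # 0.862
print(round(serendipity_weight(0.32), 3))  # 0.758
```

Because the weight falls only from 1.0 to 0.5 as similarity rises from 0 to 1, similar users are de-emphasized rather than excluded.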