Clustering User Preferences Using W-Kmeans
-
Upload
hitesh-shetty -
Category
Documents
-
view
219 -
download
0
Transcript of Clustering User Preferences Using W-Kmeans
-
7/27/2019 Clustering User Preferences Using W-Kmeans
1/8
&OXVWHULQJXVHUSUHIHUHQFHVXVLQJ:NPHDQV
&KULVWRV%RXUDV3URIHVVRU
&RPSXWHU(QJLQHHULQJDQG,QIRUPDWLFV'HSDUWPHQW
8QLYHUVLW\RI3DWUDVDQG&RPSXWHU7HFKQRORJ\,QVWLWXWH
DQG3UHVV'LRSKDQWXV1.D]DQW]DNL
3DQHSLVWLPLRXSROL
3DWUDV*UHHFH
ERXUDV#FWLJU
9DVVLOLV7VRJNDV06F
&RPSXWHU(QJLQHHULQJDQG,QIRUPDWLFV'HSDUWPHQW
8QLYHUVLW\RI3DWUDV*UHHFH3DQHSLVWLPLRXSROL
3DWUDV*UHHFH
WVRJNDV#FHLGXSDWUDVJU
$EVWUDFW $OWKRXJK FRPPRQO\ RQO\ GRFXPHQW FOXVWHULQJ LV
VXJJHVWHG E\ :HE PLQLQJ WHFKQLTXHV IRU UHFRPPHQGDWLRQ
V\VWHPV RQH RI WKH YDULRXV WDVNV RI SHUVRQDOL]HG
UHFRPPHQGDWLRQLVFDWHJRUL]DWLRQRI:HEXVHUV,QWKLVSDSHU
DPHWKRG IRU FOXVWHULQJ QDYLJDWLRQ SDWWHUQV RI :HE XVHUV LV
SURSRVHG :H DGDSW WKH :RUG1HWHQDEOHG :NPHDQV
DOJRULWKP DQ HQKDQFHPHQW RI VWDQGDUG NPHDQV DOJRULWKP
ZKLFKXVHVWKHH[WHUQDONQRZOHGJHIURP:RUG1HWK\SHUQ\PV
DQGWKDWKDVEHHQSUHYLRXVO\XVHGIRUGRFXPHQWFOXVWHULQJWR
XVHUSURILOHFOXVWHULQJE\ DQDO\]LQJ WKHXVHUVKLVWRULFDOGDWD
:H DOVR LQYHVWLJDWH WKH HIIHFWV WKLV DSSURDFK KDV RQ WKH
UHFRPPHQGDWLRQHQJLQHE\HYDOXDWLQJWKHRYHUDOOSHUIRUPDQFH
LW KDV LQ WHUPV RI SUHFLVLRQ UHFDOO RQ RXU RQOLQH
UHFRPPHQGDWLRQV\VWHP
8VHU FOXVWHULQJ VHVVLRQ LGHQWLILFDWLRQ UHFRPPHQGDWLRQ
V\VWHPSHUVRQDOL]DWLRQNPHDQV:NPHDQV
,
,1752'8&7,212EMHFW FOXVWHULQJ UHIHUV WR WKH SURFHVV RI SDUWLWLRQLQJ D
FROOHFWLRQ RI REMHFWV LQWR VHYHUDO VXEFROOHFWLRQV EDVHG RQWKHLU VLPLODULW\ RI FRQWHQWV )RUWKH FDVH RI XVHUFOXVWHULQJHDFKVXEFROOHFWLRQLVFDOOHGDXVHUFOXVWHUDQGLQFOXGHVXVHUVWKDWKDYHUHYHDOHGVLPLODUDSSHDOVLQWKHLUVHOHFWLRQVRIWH[WDUWLFOHV ZKLOH EURZVLQJ WKURXJK D GRFXPHQW FROOHFWLRQ&OXVWHULQJ KDV EHHQ SURYHQ WR EH D XVHIXO WHFKQLTXH IRULQIRUPDWLRQ UHWULHYDO E\ GLVFRYHULQJ LQWHUHVWLQJ LQIRUPDWLRQNHUQHOVDQGGLVWULEXWLRQVLQWKHXQGHUO\LQJGDWD,QJHQHUDOLWKHOSV FRQVWUXFWLQJ PHDQLQJIXO SDUWLWLRQV RI ODUJH VHWV RIREMHFWV EDVHG RQ YDULRXV PHWKRGRORJLHV DQG KHXULVWLFV ,WSOD\V D FUXFLDO UROH LQ RUJDQL]LQJ ODUJH FROOHFWLRQV )RUH[DPSOHLWFDQEHXVHGDWRVWUXFWXUHTXHU\UHVXOWVEIRUP
WKH EDVLV IRU IXUWKHU SURFHVVLQJ RI WKH RUJDQL]HG WRSLFDOJURXSV XVLQJ RWKHU LQIRUPDWLRQ UHWULHYDO WHFKQLTXHV VXFK DVVXPPDUL]DWLRQ RU F ZLWKLQ WKH VFRSH RI UHFRPPHQGDWLRQV\VWHPVE\DIIHFWLQJWKHLUSHUIRUPDQFHDVIDUDVVXJJHVWLRQVPDGHWRZDUGVWKHHQGXVHUVDUHFRQFHUQHG
:HEPLQLQJIRFXVHVRQILQGLQJQDWXUDOJURXSLQJVRI:HEUHVRXUFHV RU :HE XVHUV :H FRXOG URXJKO\ GLYLGH :HE0LQLQJLQWRWKUHHEDVLFFDWHJRULHV>@)LUVWO\:HEFRQWHQWPLQLQJ ZKHUH LQIRUPDWLRQ LVH[WUDFWHG IURP WKHFRQWHQWRISDJHV DQG OLQNV LH QRW IURP WKH XVHUV WKHPVHOYHV6HFRQGO\ :HE 6WUXFWXUH 0LQLQJ ZKHUH VWUXFWXUDO
LQIRUPDWLRQ DERXW K\SHUOLQNV DQG RUJDQL]DWLRQ SOD\V DSUHGRPLQDQW UROH $QG WKLUGO\ :HE 8VDJH 0LQLQJ ZKLFKIRFXVHV RQ H[WUDFWLQJ XVHIXO XVDJH SDWWHUQV IURP WKH XVHUV
EHKDYLRU &OXVWHULQJ RI :HE XVHUV LV D SDUWLFXODU UHVHDUFKWRSLF RI :HE 8VDJH 0LQLQJ WKDW DLPV WRZDUGV GHVFULELQJJHQHULF WUHQGV LQ XVHUV EHKDYLRUV ZLWKLQ VRPH SDUWLFXODUWLPHHJDVSHFLILFWLPHZLQGRZ
&RROH\ HW DO LQ >@ LQWURGXFHG WKH WHUP :HE 8VDJH0LQLQJDQGH[SODLQHGLWDV WKHDXWRPDWLFGLVFRYHU\RIXVHUDFFHVVSDWWHUQVIURP:HE6HUYHUV6LQFHWKHQWKHILHOGKDVEHHQ H[SORUHG ZLWKLQ WKH VFRSH RI :HE SHUVRQDOL]DWLRQ E\YDULRXVZRUNVHJ>@DQG>@,Q>@WKHDXWKRUVWDNHLQWRDFFRXQW EDVLFDOO\ WZR W\SHV RI XVDJH SDWWHUQV DQG FOXVWHUWKHPLQRUGHUWREXLOGJHQHULFQDYLJDWLRQDOSURILOHVZLWKRXWPLQGLQJWKHRUGHURIDFFHVVHV$PHWKRGWKDWXVHVDWWULEXWHRULHQWHG LQGXFWLRQ ZKHUH XVHU VHVVLRQV DUH UHSUHVHQWHG DVYHFWRUV LQ DQ QGLPHQVLRQDO (XFOLGLDQ WHUP VSDFH LVGHVFULEHGLQ>@$YLVXDOL]DWLRQDSSURDFKRIWKHXVHUFKRLFHV
KDVDOVREHHQH[SORUHGLQ>@IRUQDYLJDWLRQSDWWHUQV,Q>@WKH DXWKRUV LQWURGXFH D 6HTXHQFH $OLJQPHQW 0HWKRGRORJ\WKDW FOXVWHUV XVHUV EDVHG RQ WKHLU QDYLJDWLRQ SDWWHUQV 7KLVZRUN IRFXVHVRQ WKHRUGHU LQ ZKLFK QDYLJDWLRQ HYHQWV WDNHSODFHE\XVHUV
:HEXVDJHPLQLQJUHVXOWVWR&ROODERUDWLYHILOWHULQJ&)ZKHQ LWXVHVWKHNQRZQ SUHIHUHQFHVRI D JURXS RIXVHUV WRPDNH UHFRPPHQGDWLRQV RU SUHGLFWLRQV DERXW WKH XQNQRZQSUHIHUHQFHVIRURWKHUXVHUV &)WHFKQLTXHV XVHD GDWDEDVHRISUHIHUHQFHVIRULWHPV E\XVHUV WRSUHGLFW DGGLWLRQDOWRSLFVRUSURGXFWVD QHZXVHU PLJKWOLNH7KH\ FRPHLQ URXJKO\WKUHHFDWHJRULHVD PHPRU\EDVHGOLNHQHLJKERUEDVHGDQGLWHPEDVHGWRS1EPRGHOEDVHGOLNH%D\HVLDQEHOLHIQHWVODWHQWVHPDQWLF GLPHQVLRQDOLW\ UHGXFWLRQ 69' DQG F K\EULG
ZKLFK FRPELQH WKH DGYDQWDJHV RI ERWK FDWHJRULHV DQGLPSURYH WKH SUHGLFWLRQ SHUIRUPDQFH (DUO\ JHQHUDWLRQFROODERUDWLYHILOWHULQJV\VWHPVVXFKDV*URXS/HQV>@XVHWKH XVHU UDWLQJ GDWD WR FDOFXODWH WKH VLPLODULW\ RU ZHLJKWEHWZHHQ XVHUV RU LWHPV DQG PDNH SUHGLFWLRQV RUUHFRPPHQGDWLRQV DFFRUGLQJ WR WKRVH FDOFXODWHG VLPLODULW\YDOXHV 0HPRU\EDVHG &) PHWKRGV DUH QRWDEO\ GHSOR\HGLQWR FRPPHUFLDO V\VWHPV VXFK DV KWWSZZZDPD]RQFRPDQG%DUQHVDQG1REOHEHFDXVHWKH\DUHHDV\WRLPSOHPHQW
2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems
978-0-7695-4635-3/11 $26.00 2011 IEEE
DOI 10.1109/SITIS.2011.19
75
-
7/27/2019 Clustering User Preferences Using W-Kmeans
2/8
DQGKLJKO\ HIIHFWLYH &XVWRPL]DWLRQRI&) V\VWHPVIRUHDFKXVHUGHFUHDVHVWKHVHDUFKHIIRUWIRUXVHUV
,Q >@ WKH DXWKRUV IRFXV RQ WKH SHUVRQDOL]HGUHFRPPHQGDWLRQRI:HESDJHVWKDWDUHDGDSWHGDFFRUGLQJWRWKHDFFHVVSDWWHUQVFRQVWUXFWHGE\DQDO\]LQJXVHUQDYLJDWLRQLQIRUPDWLRQ7KH\SURYHWKDWWKHPHWKRGRORJ\RILQWHJUDWLQJXVHU FOXVWHULQJ ZLWKLQ WKH VFRSH RI D UHFRPPHQGDWLRQ
V\VWHPZKLOHPLQLQJLQWHUHVWLQJXVHUQDYLJDWLRQSDWWHUQVFDQEH EHQHILFLDO %H\RQG WKH DERYH WKHUH LV OLWWOH ZRUN ZLWKUHJDUGVWRFOXVWHULQJXVHUSUHIHUHQFHVZLWKLQWKHVFRSHRIDUHFRPPHQGDWLRQV\VWHPDQGKRZWKHDERYHFDQEHH[SORLWHGZLWKDVLJQLILFDQWHIIHFWWRWKHHIILFLHQF\RIVXFKDV\VWHP
7ZRJHQHULFFDWHJRULHVRIWKHYDULRXVFOXVWHULQJPHWKRGVH[LVW KLHUDUFKLFDO DQG SDUWLWLRQDO 7\SLFDO KLHUDUFKLFDOWHFKQLTXHVJHQHUDWHDVHULHVRISDUWLWLRQVRYHUWKHGDWDZKLFKPD\ UXQ IURP D VLQJOH FOXVWHU FRQWDLQLQJ DOO REMHFWV WR QFOXVWHUV HDFK FRQWDLQLQJ D VLQJOH REMHFW DQG DUH ZLGHO\YLVXDOL]HG WKURXJK D WUHHOLNH VWUXFWXUH 2Q WKHRWKHU KDQGSDUWLWLRQDODOJRULWKPVW\SLFDOO\GHWHUPLQHDOOFOXVWHUVDWRQFH)RU SDUWLWLRQDO WHFKQLTXHV D JOREDO FULWHULRQ LV PRVWFRPPRQO\XVHGWKHRSWLPL]DWLRQRI ZKLFKGULYHVWKHHQWLUH
SURFHVV SURGXFLQJ WKXV D VLQJOHOHYHO GLYLVLRQ RI WKH GDWD*LYHQ WKH QXPEHU RI GHVLUHG FOXVWHUV OHW N SDUWLWLRQDODOJRULWKPVILQGDOONFOXVWHUVRIWKHGDWDDWRQFHVXFKWKDWWKHVXP RI GLVWDQFHV RYHU WKH LWHPV WR WKHLU FOXVWHU FHQWHUV LVPLQLPDO 0RUHRYHU IRU D FOXVWHULQJ UHVXOW WR EH DFFXUDWHEHVLGHV WKH ORZ LQWUDFOXVWHU GLVWDQFH KLJK LQWHUFOXVWHUGLVWDQFHV LH ZHOO VHSDUDWHG FOXVWHUV LV GHVLUHG $ W\SLFDOSDUWLWLRQDODOJRULWKPLVNPHDQVZKLFKLVEDVHGRQWKHQRWLRQRIWKH FOXVWHU FHQWHU D SRLQWLQ WKH GDWD VSDFH XVXDOO\QRWH[LVWHQW LQ WKH GDWDWKHPVHOYHV ZKLFK UHSUHVHQWV D FOXVWHU7KHIDPLO\RINPHDQVSDUWLWLRQDOFOXVWHULQJDOJRULWKPV>@XVXDOO\ WULHV WR PLQLPL]H WKH DYHUDJH VTXDUHG GLVWDQFHEHWZHHQSRLQWVLQWKHVDPHFOXVWHULHLIGGGQDUHWKHQGRFXPHQWVDQGFFFNDUHWKHNFOXVWHUVFHQWURLGVNPHDQVWULHVWRPLQLPL]HWKHJOREDOFULWHULRQIXQFWLRQ
N Q
M
LM FGVLP
L
6HYHUDO LPSURYHPHQWV KDYH EHHQ SURSRVHG RYHU WKLVVLPSOHVFKHPHOLNHELVHFWLQJNPHDQV>@NPHDQV>@DQGPDQ\PRUH
:RUG1HW LV RQH RI WKH PRVW ZLGHO\ XVHG WKHVDXUL IRU(QJOLVK ,W DWWHPSWV WR PRGHO WKH OH[LFDO NQRZOHGJH RI DQDWLYH (QJOLVK VSHDNHU &RQWDLQLQJ RYHU WHUPV LWJURXSV QRXQV YHUEV DGMHFWLYHV DQG DGYHUEV LQWR VHWV RIV\QRQ\PV FDOOHG V\QVHWV 7KH V\QVHWV DUH RUJDQL]HG LQWRVHQVHVJLYLQJWKXVWKHV\QRQ\PVRIHDFKZRUGDQGDOVRLQWRK\SRQ\P K\SHUQ\P LH ,V$ DQGPHURQ\P KRORQ\PLH3DUW2IUHODWLRQVKLSVSURYLGLQJDKLHUDUFKLFDOWUHHOLNH
VWUXFWXUH IRU HDFK WHUP 7KH DSSOLFDWLRQV RI :RUG1HW WRYDULRXV ,5 WHFKQLTXHV KDYH EHHQ ZLGHO\ UHVHDUFKHGFRQFHUQLQJILQGLQJWKHVHPDQWLFVLPLODULW\RIUHWULHYHGWHUPV>@RUWKHLUDVVRFLDWLRQZLWKFOXVWHULQJWHFKQLTXHV7KHXVHRI D :RUG1HWEDVHG FOXVWHULQJ DSSURDFK IRU XVHUV KDV QRWEHHQLQYHVWLJDWHGVRIDU
,Q WKLV SDSHU ZH H[WHQG RXU LPSOHPHQWDWLRQ RI WKH:RUG1HW HQKDQFHG :NPHDQV DOJRULWKP WR WKH GRPDLQ RIFOXVWHULQJ :HE8VHUV JHQHUDWLQJ WKXV RIIOLQH XVHUFOXVWHUV
ZKLFKFDQEHXVHGDWDODWHUVWDJHE\WKHRWKHULQIRUPDWLRQUHWULHYDO WHFKQLTXHV ,QHVVHQFH ZH DUH DEOH WRGHFRGH WKHQDYLJDWLRQ SDWWHUQV RI XVHUV DQG DJJUHJDWH WKHLU SURILOHVXVLQJ WKH :NPHDQV FOXVWHULQJ DOJRULWKP 7KLV DOORZV RXUUHFRPPHQGDWLRQ V\VWHP WR VXJJHVW FRQWHQW WKDW ZLWK KLJKSUREDELOLW\ ZLOO EH LQWHUHVWLQJ WR WKH XVHUV 2XU JRDO LV WRLPSURYH WKH UHVXOWV RI RXU LQIRUPDWLRQ UHWULHYDO V\VWHP LQ
WHUPVRI SUHFLVLRQUHFDOODQGWKXVVHUYHEHWWHUILOWHUHGDQGDGHTXDWH UHVXOWV WR WKHLU XVHUV KHOSLQJ LQ HVVHQFH WKHGHFLVLRQ PDNLQJ SURFHVV 2XU UHFRPPHQGDWLRQ V\VWHP DVH[SODLQHGLQWKHQH[WVHFWLRQFRXOGEHGHVFULEHGDVD+\EULGEHWZHHQFRQWHQWEDVHGILOWHULQJDQG&)
7KHUHVWRIWKHSDSHULVVWUXFWXUHGDVIROORZVVHFWLRQ,,GHVFULEHV WKH LQIRUPDWLRQ IORZ ZLWKLQ RXU V\VWHP DQG WKHPRGLILHGFRPSRQHQWVQHHGHGIRUXVHUFOXVWHULQJ6HFWLRQ,,,SUHVHQWVWKH DOJRULWKPVXVHGZKLOHVHFWLRQ ,9GHVFULEHV WKHHYDOXDWLRQSURFHVVDQGWKHUHVXOWV6RPHFRQFOXGLQJUHPDUNVDQGSRLQWHUVIRUIXWXUHZRUNDUHJLYHQLQ6HFWLRQ9
,, )/2:2),1)250$7,21)LJ GHSLFWV WKH IORZ RI LQIRUPDWLRQ ZLWKLQ RXU
UHFRPPHQGDWLRQV\VWHP>@,QLWLDOO\DWLWVLQSXWVWDJHQHZVDUWLFOHV DUH FUDZOHG DQG IHWFKHG IURP QHZV SRUWDOV IURPDURXQG WKH :HE 7KLV LV DQ RIIOLQH SURFHGXUH DQG RQFHDUWLFOHVDVZHOODVPHWDGDWDLQIRUPDWLRQDUHIHWFKHGWKH\DUHVWRUHGLQWKHFHQWUDOL]HGGDWDEDVHIURPZKHUHWKH\DUHSLFNHGXSE\WKHSURFHGXUHVWKDWIROORZ
$ NH\ SURFHVV RI WKH V\VWHP DV D ZKROH SUREDEO\ DVLPSRUWDQWDVWKHFOXVWHULQJDOJRULWKPWKDWIROORZVLWLVWH[WSUHSURFHVVLQJRQWKHIHWFKHGDUWLFOHVFRQWHQWWKDWUHVXOWV WRWKH H[WUDFWLRQ RI WKH NH\ZRUGV HDFK DUWLFOH FRQVLVWV RI$QDO\]HGLQ>@NH\ZRUGH[WUDFWLRQKDQGOHVWKHFOHDQLQJRIDUWLFOHVWKHH[WUDFWLRQRIWKHQRXQV>@WKHVWHPPLQJDVZHOODV WKH VWRSZRUG UHPRYDO SURFHVV )ROORZLQJ LW DSSOLHVVHYHUDOKHXULVWLFVWRFRPHXSZLWKDZHLJKWLQJVFKHPHWKDWDSSURSULDWHO\ZHLJKWVWKHNH\ZRUGVRIHDFKDUWLFOHEDVHGRQLQIRUPDWLRQDERXWWKHUHVWRIWKHGRFXPHQWVLQRXUGDWDEDVH3UXQLQJRIZRUGVDSSHDULQJZLWKORZIUHTXHQF\WKURXJKRXWWKHFRUSXVZKLFKDUHXQOLNHO\WRDSSHDULQPRUHWKDQDVPDOOQXPEHU RI DUWLFOHV FRPHV QH[W .H\ZRUG H[WUDFWLRQXWLOL]LQJ WKH YHFWRU VSDFH PRGHO JHQHUDWHV WKH WHUPIUHTXHQF\YHFWRUGHVFULELQJHDFKDUWLFOHDVDEDJRIZRUGVZRUGV IUHTXHQFLHV WR WKH NH\ LQIRUPDWLRQ UHWULHYDOWHFKQLTXHVWKDWIROORZDUWLFOHFDWHJRUL]DWLRQVXPPDUL]DWLRQDQGFOXVWHULQJ2XUDLPWRZDUGVLQFUHDVLQJWKHHIILFLHQF\RIWKH XVHG FOXVWHULQJ DOJRULWKP LV WR HQKDQFH WKLV EDJ RIZRUGVZLWKWKHXVHRIDQH[WHUQDOGDWDEDVH:RUG1HW7KHDERYH FKDUDFWHULVWLFV RI RXU V\VWHP JLYH LWV FRQWHQWEDVHGQDWXUH 7KLV HQKDQFHG IHDWXUH OLVW IHHGV WKH NPHDQVFOXVWHULQJ SURFHGXUHWKDWIROORZV,QWKLVZRUNFOXVWHULQJLV
DFKLHYHG YLD UHJXODU NPHDQV XVLQJ WKH FRVLQH VLPLODULW\GLVWDQFHPHDVXUH
____FRV
ED
EDEDG
T
ZKHUH __ _E_ DUH WKH OHQJWKV RI WKH YHFWRUV EUHVSHFWLYHO\DQGWKHVLPLODULW\EHWZHHQWKHWZRGDWDSRLQWVLVYLHZHGE\PHDQVRIWKHLUDQJOHLQWKHQGLPHQVLRQDOVSDFH,WLVLPSRUWDQWWRQRWHKRZHYHUWKDWWKHFOXVWHULQJSURFHVVLV
76
-
7/27/2019 Clustering User Preferences Using W-Kmeans
3/8
LQGHSHQGHQW RI WKH UHVW RI WKH VWHSV PHDQLQJ WKDW LW FDQHDVLO\EHUHSODFHGE\DQ\RWKHUFOXVWHULQJDSSURDFKRSHUDWLQJ
RQDZRUGOHYHORIWKHLQSXWGRFXPHQWV
)LJXUH)ORZRI,QIRUPDWLRQ
)RUHDFKXVHUYLHZLQJQHZVDUWLFOHVZHNHHSWUDFNRIWKHVHOHFWHGDFWLRQVZKLFKFKDUDFWHUL]HDXVHUVHVVLRQ$VHVVLRQLV GHILQHG DV WKH OLVW RI VHOHFWHG DUWLFOHV WKDW D XVHU KDVGHFLGHGWRYLHZIRUDPLQLPXPGXUDWLRQDQGZLWKLQDOLPLWHGWLPH IUDPH ERWK RI ZKLFK DUH ILQHWXQHG DW WKHH[SHULPHQWDWLRQ VWDJH 7KH VHOHFWHG DUWLFOHV FRQWDLQHG LQ
WKRVH VHVVLRQV DUH WKHQ DJJUHJDWHG DW D NH\ZRUG OHYHOJHQHUDWLQJ D WLPHOLPLWHG XVHU SURILOH 8VHU SURILOHV IURPPXOWLSOH XVHUV DQG WLPHIUDPHV DUH WKHQ FOXVWHUHG XVLQJ WKH:NPHDQVDOJRULWKPIRUPLQJSURILOHFOXVWHUV
:NPHDQVLVD QRYHODSSURDFKWKDWH[WHQGVWKHVWDQGDUGNPHDQV DOJRULWKP XVLQJ WKH H[WHUQDO NQRZOHGJH IURP:RUG1HWK\SHUQ\PVIRUHQULFKLQJWKHEDJRIZRUGVXVHGSULRU WR WKH FOXVWHULQJ 7KH :NPHDQV DOJRULWKP HQKDQFHVWKH XVHU SURILOHV ZLWK K\SHUQ\PV GHGXFWHG IURP WKH:RUG1HWGDWDEDVHXVLQJD KH\ULVWLFPDQQHU 7KRVHSURILOHFOXVWHUV EHLQJ HVVHQWLDOO\ XVHU FOXVWHUV DUH XVHG DW WKHUHFRPPHQGDWLRQ VWDJH WR HQKDQFH WKH V\VWHPV XVDJHH[SHULHQFH E\ SURYLGLQJ EHWWHU DGDSWHG UHVXOWV WR XVHUVUHYLVLWLQJ WKH VLWH )ROORZLQJ WKH VHVVLRQ FOXVWHULQJSURFHGXUH WKH UHVXOWLQJ FOXVWHUV DUH ODEHOHG XVLQJ RXU:RUG1HW FOXVWHU ODEHOLQJ PHFKDQLVP ZKLFK KRZHYHU LVEH\RQGWKHVFRSHRIWKLVSDSHU:KHQDXVHUFRPHVEDFNKLVFOXVWHUHG SURILOH LV UHFDOOHG DQG DUWLFOHV EHORQJLQJ WR WKHFOXVWHUHG VHVVLRQV RI KLV SURILOH DUH H[WUDFWHG DQG VHQW DVYLHZLQJ UHFRPPHQGDWLRQV EDFN WR WKH XVHU 6XJJHVWHGDUWLFOHVGRQRWEHORQJWRWKHRQHVWKHXVHUKDVDOUHDG\YLVLWHGDQGDOVRDUHQRWFORVHO\UHODWHGWRDUWLFOHVWKDWWKHXVHUKDVPDUNHGQHJDWLYHO\LQWKHSDVW
7KH DSSURDFK SUHYLRXVO\ GHVFULEHG LV HVVHQWLDOO\ WKHFROODERUDWLYHILOWHULQJQDWXUHRIRXUUHFRPPHQGDWLRQV\VWHPZKLFK SUDFWLFDOO\ LQYROYHV UHODWHG XVHUV WR WKH GHFLVLRQPDNLQJSURFHVV:HH[SHFWWKDWFRPELQLQJWKLVPHWKRGZLWKRXU NH\ZRUG H[WUDFWLRQ FRQWHQWEDVHG PHFKDQLVP WKHUHFRPPHQGDWLRQVWRZDUGVXVHUVZLOODPHOLRUDWH
,,, $/*25,7+0$352$&+7KH SURSRVHG DSSURDFK FRQVLVWV RI WKUHH PDMRU
DOJRULWKPLF FRPSRQHQWV WKDW DUH XVHG IRU D WKH RIIOLQHSURFHVV RI LGHQWLI\LQJ WKH VHVVLRQV RI XVHUV QDYLJDWLQJWKURXJK WKHUHFRPPHQGDWLRQ V\VWHP E WKHRIIOLQHSURFHVVRI FOXVWHULQJ RI WKH GHWHFWHG VHVVLRQV DQG F WKH RQOLQHSURFHVVRIUHFRPPHQGLQJQHZVDUWLFOHVWRWKHXVHUVEDVHGRQWKH FOXVWHUHG SURILOHV 7KRVH FRPSRQHQWV DUH VHVVLRQLGHQWLILFDWLRQ FOXVWHULQJ RI XVHU VHVVLRQV DQGUHFRPPHQGDWLRQVWDJH
$ 6HVVLRQ,GHQWLILFDWLRQ7KH LGHQWLILFDWLRQ RI VHVVLRQV ZLWKLQ D XVHUV EURZVLQJ
KLVWRU\LVDFKLHYHGXVLQJWKHIROORZLQJDOJRULWKP
$OJRULWKPILQGBVHVVLRQV,QSXWKLVWRU\WLPHZLQGRZIRUVHVVLRQVWREHH[WUDFWHG2XWSXW6HVVLRQV>@GLVFRYHUHGVHVVLRQVDUUD\YLHZLQJBWKUHVKROGDWOHDVWVHFRQGVVHVVLRQBWKUHVKROGDWPRVWPLQXWHVSUHYLRXVBXVHU18//FXUUHQWBVHVVLRQ18//
77
-
7/27/2019 Clustering User Preferences Using W-Kmeans
4/8
ZKLOHIHWFKIURP'%XVHUYLHZHGDUWLFOHWLPHVWDPSYLHZLQJBWLPH^
LIYLHZLQJBWLPHYLHZLQJBWKUHVKROG__WLPHVWDPSKLVWRU\FRQWLQXHLIFXUUHQWBVHVVLRQXVHUQDPHXVHU^6LQFHWKLVLVVRUWHGE\XVHUQDPHZKHQDQHZXVHULVIRXQGWKLV
PHDQVDQHZVHVVLRQEHJLQVLIFXUUHQWBVHVVLRQXVHUQDPHFXUUHQWBVHVVLRQDUWLFOHV
HPSW\6HVVLRQV>@FXUUHQWBVHVVLRQ
FXUUHQWBVHVVLRQXVHUQDPHXVHUFXUUHQWBVHVVLRQXVHUBLGXVHUBLGFXUUHQWBVHVVLRQVWDUWWLPHVWDPSFXUUHQWBVHVVLRQDUWLFOHVDGGDUWLFOHBLG
`HOVH^,IWKHXVHULVWKHVDPHDVEHIRUHEXWWKHDFFHVVWLPHIRUWKLVDUWLFOHH[FHHGVWKHWLPHOLPLWDQHZVHVVLRQEHJLQV
LIWLPHVWDPSFXUUHQWBVHVVLRQVWDUW!VHVVLRQBWKUHVKROG^LIFXUUHQWBVHVVLRQXVHUQDPHFXUUHQWBVHVVLRQDUWLFOHVHPSW\6HVVLRQV>@FXUUHQWBVHVVLRQ
FXUUHQWBVHVVLRQXVHUQDPHXVHUFXUUHQWBVHVVLRQXVHUBLGXVHUBLGFXUUHQWBVHVVLRQVWDUWWLPHVWDPSFXUUHQWBVHVVLRQHQGWLPHVWDPSFXUUHQWBVHVVLRQDUWLFOHVDGGDUWLFOHBLG
`HOVH^7KHDFFHVVWLPHIRUWKLVDUWLFOHGRHVQRWH[FHHGWKHWLPHOLPLW
FXUUHQWBVHVVLRQDUWLFOHVDGGDUWLFOHBLGFXUUHQWBVHVVLRQHQGWLPHVWDPS`UHWXUQ6HVVLRQV>@
$OJRULWKP'LVFRYHULQJ6HVVLRQVLQXVHUVDFFHVVSDWKV
% &OXVWHULQJ8VHU6HVVLRQV2QFH XVHUVHVVLRQV KDYHEHHQ H[WUDFWHG ZH SURFHHG WR
WKHFRUHSURFHGXUHGHVFULEHGLQWKLVSDSHUVHVVLRQFOXVWHULQJ$V GHVFULEHG LQ $OJRULWKP IRU HDFK XVHU VHVVLRQ ZHDJJUHJDWHWKHQHZVDUWLFOHVWKDWPDNHXSWKLVVHVVLRQ$WWKHQH[WVWHSZHHQULFKWKHNH\ZRUGVWKDWEHORQJWRWKHVHVVLRQXVLQJ UHODWHG K\SHUQ\PV IURP WKH :RUG1HW GDWDEDVH
,QLWLDOO\IRUHDFKJLYHQNH\ZRUGRIWKHVHVVLRQZHJHQHUDWHLWV JUDSKV RI K\SHUQ\PV OHDGLQJ WR WKH URRW K\SHUQ\PFRPPRQO\ EHLQJ HQWLW\ IRU QRXQV )ROORZLQJ ZHFRPELQH HDFK LQGLYLGXDO K\SHUQ\P JUDSK WR DQ DJJUHJDWHGRQH $Q H[DPSOH RI WKH K\SHUQ\P JHQHUDWLRQ DQGDJJUHJDWLRQ SURFHVV LV GHSLFWHG LQ )LJ 7KHUH DUHSUDFWLFDOO\ WZR SDUDPHWHUV WKDW QHHG WR EH WDNHQ LQWRFRQVLGHUDWLRQ IRU HDFK K\SHUQ\P RI WKH DJJUHJDWHWUHHOLNHVWUXFWXUHLQRUGHUWRGHWHUPLQHLWVLPSRUWDQFHWKHGHSWKDQGWKH IUHTXHQF\RI DSSHDUDQFH ,W LV REVHUYHG WKDW WKH KLJKHULHOHVVGHHSZDONLQJIURPWKHURRWQRGHGRZQZDUGVWKHK\SHUQ\PLVLQWKHJUDSKWKHPRUHJHQHULFLWLV+RZHYHUWKHORZHUWKHK\SHUQ\PLVLQWKHJUDSKWKHOHVVFKDQFHVGRHVLWKDYHWR RFFXULQ PDQ\ JUDSK SDWKV LH LWV IUHTXHQF\ RI
DSSHDUDQFHLV ORZ,Q RXUDSSURDFKWKRVHWZR FRQWUDGLFWLQJSDUDPHWHUVDUHZHLJKWHGXVLQJ
7:
IG
H
IG:
ZKHUHGVWDQGVIRUWKHQRGHVGHSWKLQWKHJUDSKVWDUWLQJ
IURP URRW DQG PRYLQJ GRZQZDUGV I LV WKH IUHTXHQF\ RIDSSHDUDQFHRIWKHQRGHWRWKHPXOWLSOHJUDSKSDWKVDQG7:LV WKHQXPEHU RI WRWDO ZRUGV WKDWZHUH XVHGIRU JHQHUDWLQJWKH JUDSK LH WRWDO NH\ZRUGV RI WKH VHVVLRQ
78
-
7/27/2019 Clustering User Preferences Using W-Kmeans
5/8
)LJXUH$JJUHJDWHK\SHUQ\PJUDSKIRUWKUHHZRUGVSLHDSSOHRUDQJH
$OJRULWKPFOXVWHULQJBXVHUBVHVVLRQV
,QSXWVHVVLRQVQXPEHURIFOXVWHUV
2XWSXWVHVVLRQWRFOXVWHUDVVLJQPHQWV
IRUHDFKVHVVLRQV^IRUHDFKDUWLFOHDEHORQJLQJWRVVHVVLRQNZVIHWFKPRVWIUHTXHQWNZVIRUD
ZRUGQHWBHQULFKV6HH$OJRULWKP`
FOXVWHUVNPHDQVVHVVLRQV
UHWXUQFOXVWHUV$OJRULWKP&OXVWHULQJ8VHU6HVVLRQVXVLQJ:RUG1HW
$OJRULWKPZRUGQHWBHQULFK
,QSXWVHVVLRQV2XWSXWVHVVLRQZLWKHQULFKHGOLVWRINH\ZRUGV
WRWDOBK\SHQBWUHH18//
NZVIHWFKPRVWIUHTXHQWNZVIRUV
IRUHDFKNH\ZRUGNZLQNZV^KWUHHZRUGQHWBK\SHQBWUHHNZH[WUDFWWKHK\SHUQ\PWUHH
IURP:RUG1HWIRUHDFKK\SHQKLQKWUHH^
LIKQRWLQWRWDOBK\SHQBWUHHKIUHTXHQF\
WRWDOBK\SHQBWUHH!DSSHQGKHOVH
WRWDOBK\SHQBWUHH!DWK!IUHT`
`
IRUHDFKKLQWRWDOBK\SHQBWUHH^FDOFXODWHBGHSWKKZHLJKWH[SK!GHSWKAK!IUHT
NZVBLQBZQ!VL]H
`
VRUWBZHLJKWVWRWDOBK\SHQBWUHHLPSRUWDQWBK\SHQVNZV!VL]HWRSWRWDOBK\SHQBWUHH
UHWXUQNZVLPSRUWDQWBK\SHQV$OJRULWKP(QULFKLQJXVHUVHVVLRQVXVLQJ:RUG1HWK\SHUQ\PV
& 5HFRPPHQGDWLRQ6WDJH:KHQDXVHUUHWXUQVWRWKHV\VWHPKLVFOXVWHUKDVDOUHDG\
EHHQ GHWHUPLQHG EDVHG RQ WKH UHFRUGHG SDVW VHVVLRQV,W LVQRZ VDIH WR DVVXPH WKDW VHOHFWLRQV PDGH E\ RWKHU XVHUVEHORQJLQJ WR WKH VDPH XVHU FOXVWHUDUH PRUH OLNHO\WR EH RILQWHUHVWWRKLPKHUUDWKHUWKDQUDQGRPDUWLFOHV%DVHGRQWKLVVLPSOH DVVXPSWLRQ ZHDGMXVW RXUUHFRPPHQGDWLRQ VWDJH WRVXJJHVWQHZVDUWLFOHVWRWKHXVHUDVH[SODLQHGLQ$OJRULWKP,QJHQHUDOZHRQO\NHHSRIWKHPRVWIUHTXHQWO\RFFXUULQJDUWLFOHVLQWKHXVHUVFOXVWHULQRUGHUWRDYRLGRYHUORDGLQJWKHXVHUZLWKLQIRUPDWLRQ
79
-
7/27/2019 Clustering User Preferences Using W-Kmeans
6/8
$OJRULWKPFOXVWHUBEDVHGBUHFRPPHQGDWLRQ
,QSXWXVHUXFOXVWHUF
2XWSXWVXJJHVWLRQV
VXJJHVWLRQV>@18//QXPBVXJQXPEHURIVXJJHVWLRQV
VHVVLRQVUHFRYHUBXVHUBFOXVWHULQJBLQIRXF
IRUHDFKVLQVHVVLRQVIRUXVHUVWKDWEHORQJWRWKHVDPHFOXVWHU
VXJJHVWLRQVWRSBVXJJHVWLRQVVQXPBVXJVXJJHVWLRQVUHWXUQVXJJHVWLRQV
WRSBVXJJHVWLRQVILQGVWKHDUWLFOHVZLWKWKHKLJKHVWIUHTXHQF\
,QSXWVHVVLRQVWRWDOVXJJHVWLRQVQXPBVXJVXJJHVWLRQV
2XWSXWVXJJHVWLRQV>@WRSVXJJHVWLRQVIRUWKHSDUVHGVHVVLRQVIRUHDFKDUWLFOHDLQV
LIIUHTD!PLQIUHTVXJJHVWLRQV
VXJJHVWLRQV>@DUWLFOH
UHWXUQVXJJHVWLRQV
$OJRULWKP5HFRPPHQGLQJQHZVDUWLFOHVEDVHGRQXVHUFOXVWHUV
,9 (;3(5,0(17$/352&('85(
)RU WKH HYDOXDWLRQ SURFHVV RI WKH :NPHDQV DOJRULWKPZLWKLQWKHVFRSHRIXVHUFOXVWHULQJZHXVHGDVHWRIQHZV DUWLFOHV REWDLQHG IURP PDMRU QHZV SRUWDOV OLNHEEFFRP FQQFRP UHXWHUVFRP HWF RYHU D SHULRG RI PRQWKV7KHVHDUWLFOHVZHUHHYHQO\VKDUHGDPRQJWKHEDVHFDWHJRULHVWKDWRXUV\VWHPIHDWXUHVEXVLQHVVSROLWLFVKHDOWKHGXFDWLRQ VFLHQFH VSRUWV DQG HQWHUWDLQPHQW $IWHU WKHSUHSURFHVVLQJ SURFHGXUH DQG PRVW QRWDEO\ VWHPPLQJ DQGQRXQ LGHQWLILFDWLRQ ZH NHSW IRU HDFK DUWLFOH LWV OLVW RIVWHPPHG QRXQV 1RWLFH WKDW GXSOLFDWH DUWLFOHV RULJLQDWLQJIURPGLIIHUHQWVRXUFHVKDYHEHHQUHPRYHGIURPWKHGDWDVHWEDVHG RQ WKHLU WLWOH DQG PDLQ ERG\ :H DOVR XVHG WKHQDYLJDWLRQDOSDWWHUQVWKDWZHUHFRUGHGIRUWKHUHJLVWHUHG
V\VWHP XVHUV DW WKH VDPH SHULRG 7KRVH DUH WKH VHOHFWHGDUWLFOHV DV ZHOO DV WKH WLPH VSHQW RQ WKHP DV H[SODLQHG LQ$OJRULWKP
)RURXUHYDOXDWLRQPHWULFVZHXVHG&OXVWHULQJ,QGH[&,DQG)PHDVXUH,QRUGHUWR GHWHUPLQHWKHHIILFLHQF\ RIHDFKFOXVWHULQJSDVVWRJHWKHUZLWKWKHULJKWQXPEHURIFOXVWHUVIRU\RXUGDWDVHW ZHXVHG WKH HYDOXDWLYH FULWHULRQRI &OXVWHULQJ,QGH[&,GHILQHGDV
GVV &,
ZKHUHV LVWKHDYHUDJHLQWUDFOXVWHUVLPLODULW\
DQGG LVWKHDYHUDJHLQWHUFOXVWHUVLPLODULW\,QWXLWLYHO\VLQFH WKH PRVW HIILFLHQW FOXVWHUV DUH WKH RQHV FRQWDLQLQJDUWLFOHVFORVHWRHDFKRWKHUZLWKLQWKHFOXVWHUZKLOHVKDULQJDORZVLPLODULW\ZLWKDUWLFOHVEHORQJLQJWRGLIIHUHQWFOXVWHUV&, IRFXVHV RQ LQFUHDVLQJ WKH ILUVW PHDVXUH LQWUDFOXVWHU
VLPLODULW\ ZKLOH GHFUHDVLQJ WKH VHFRQG LQWHUFOXVWHUVLPLODULW\ 7KH )PHDVXUH DV GHILQHG LQ LV D ZHLJKWHGFRPELQDWLRQ RI WKH SUHFLVLRQ DQG UHFDOO PHWULFV DQG LVHPSOR\HG WR HYDOXDWH WKH DFFXUDF\ DQG HIILFLHQF\ RI RXUUHFRPPHQGDWLRQ V\VWHPZKHQ XVLQJ XVHUSURILOH FOXVWHULQJ:HGHILQHDVHWRIWDUJHWDUWLFOHVGHQRWH&WKDWWKHV\VWHPVXJJHVWV DQG DQRWKHU VHW RI DUWLFOHV GHQRWH & WKDW DUHYLVLWHG E\ WKH XVHU DIWHU WKH UHFRPPHQGDWLRQ SURFHVV
0RUHRYHU ML FFGRF LV XVHG WR GHQRWH WKH QXPEHU RI
GRFXPHQWVERWKLQWKHVXJJHVWHGDQGLQWKHYLVLWHGOLVWV
MLML
MLML
MLFFSFFU
FFSFFUFF)
ZKHUH
L
ML
MLFGRF
FFGRFFFU
DQG
L
ML
MLFGRF
FFGRFFFS
)RU RXU ILUVW H[SHULPHQW ZH FRPSDUHG :NPHDQV WR
VWDQGDUG NPHDQV ZKHQ DSSOLHG WR XVHU FOXVWHULQJ 7KHUHVXOWV GHSLFWHG LQ )LJ VKRZ WKDW :NPHDQV FOHDUO\RXWSHUIRUPVVWDQGDUGNPHDQVE\DWOHDVWDIDFWRURIWKXV
SURYLGLQJ FOXVWHUV RI XVHUV PRUH WLJKWO\ ERXQG&RQVHTXHQWO\ WKH JHQHUDWHG FOXVWHUV FDQ FDSWXUH ZLWK DEHWWHU DFFXUDF\ XVHUV ZLWK VLPLODU LQWHUHVWV ZKLOHVXFFHVVIXOO\ VHSDUDWLQJ XVHUV ZLWK FRQWUDGLFWLQJ LQWHUHVWV)URP)LJLWFDQDOVREHGHGXFWHGWKDWERWKRIWKH&,JUDSKVSHDN DW DURXQG FOXVWHUV7KLV LV D JRRG LQGLFDWLRQDERXWWKHEHVWVXLWHGDPRXQWRIFOXVWHUVDSSOLFDEOHIRURXUGDWDVHWDILQGLQJWKDWVKRZVWKDWWKH:NPHDQVDOJRULWKPJHQHUDWHVILQHJUDLQHGFOXVWHULQJUHVXOWV
80
-
7/27/2019 Clustering User Preferences Using W-Kmeans
7/8
)LJXUH&RPSDULVRQRI:NPHDQVDQGNPHDQVIRUXVHUFOXVWHULQJ
)LJXUH&RPSDULVRQRIWKHUHFRPPHQGDWLRQHQJLQHSHUIRUPDQFH
)RU RXU VHFRQG H[SHULPHQW ZH WULHG WR GHWHUPLQH WKHRYHUDOO LPSURYHPHQW RI RXU UHFRPPHQGDWLRQ HQJLQH ZKHQWDNLQJLQWRFRQVLGHUDWLRQH[LVWLQJXVHUFOXVWHUV$VH[SODLQHGLQ $OJRULWKP IRU UHWXUQLQJ XVHUV ZH PRGLILHG RXUUHFRPPHQGDWLRQ VWDJH WR VXJJHVW RI WKH WRS YLHZHGDUWLFOHV EHORQJLQJ WR WKH XVHUV FOXVWHU )ROORZLQJ ZHUHFRUGHGZKLFKRIWKHVXJJHVWHGDUWLFOHVZHUHYLHZHGE\WKHXVHU ZLWKLQ D WLPH IUDPH RI PLQXWHV 7KH SURFHVV ZDV
UHSHDWHG ZLWKRXW WKH XVHU FOXVWHULQJ HQKDQFHPHQW RI WKHUHFRPPHQGDWLRQ HQJLQH EXW ZLWK RWKHU KHXULVWLFV VXFK DVWH[WFDWHJRUL]DWLRQDQGSHUVRQDOL]DWLRQVWLOOHQDEOHG>@7KHUHVXOWVSUHVHQWHGLQ)LJVKRZWKHDYHUDJH)PHDVXUHIRUHDFKFDVHDVXVHUVLQFUHDVH
)URP)LJZHREVHUYHDQDYHUDJHLPSURYHPHQWRIZLWK UHJDUGV WR WKH )PHDVXUH ZKHQ XVHU FOXVWHULQJ LVGHSOR\HG7KHHIILFLHQF\DOVRUDSLGO\LQFUHDVHVDVPRUHXVHUV
81
-
7/27/2019 Clustering User Preferences Using W-Kmeans
8/8
DUHWDNHQLQWRFRQVLGHUDWLRQE\WKHV\VWHPVRPHWKLQJWKDWLVH[SHFWHG JLYHQ WKH SHUVRQDOL]DWLRQ IHDWXUHV RI RXUUHFRPPHQGDWLRQ HQJLQH )URP D QDWXUDO SRLQW RI YLHZ RXUH[SHULPHQWV VKRZHG WKDW WKHUHVXOWLQJ VXJJHVWLRQV PDWFKHGWKH XVHUV FKRLFHV LQ DYHUDJH RXW RI WLPHV ,Q RXURSLQLRQ WKLV SURYHV WKDW RXUDSSURDFKKDV JUHDWO\EHQHILWHGWKHUHFRPPHQGDWLRQVWDJH
)RU RXU ODVW H[SHULPHQWDWLRQ SURFHGXUH ZH WULHG WRGHWHUPLQH WKH HIILFLHQF\ RI WKH SURSRVHG PHWKRGRORJ\FRPSDUHGWR VRPH VWDWH RI WKH DUW &) PHWKRGVOLNH ODWHQWVHPDQWLF &) QHLJKERUEDVHG &) DQG GLPHQVLRQDOLW\UHGXFWLRQ WHFKQLTXHV OLNH 69' 7KH UHVXOWV RQ WKH VDPHGDWDVHW SUHVHQWHG LQ 7DEOH UHYHDOHG WKDW :NPHDQVRXWSHUIRUPHG RU ZDV DWOHDVW DVHTXDO DVWKRVH PHWKRGV LQWHUPVRI)PHDVXUHDYHUDJHRYHUYDULRXVXVHUV
7$%/(&)0(7+2'2/2*,(6&203$5,621
&)0HWKRGRORJ\ $YHUDJH)PHDVXUH
RYHUDOOXVHUV
:NPHDQV
/DWHQWVHPDQWLF&)
1HLJKERUEDVHG&)
'LPHQVLRQDOLW\UHGXFWLRQ69'
9 &21&/86,216$1')8785(:25.,QWKLVSDSHUZHSUHVHQWHGWKH:RUG1HWHQDEOHGNPHDQV
DOJRULWKP ZKLFK H[SORUHV WKH XVDJH RI ZRUG K\SHUQ\PVH[WUDFWHGIURPWKH :RUG1HWGDWDEDVHWRWKH ILHOG RISURILOHFOXVWHULQJ DV ZHOO DVLWV DSSOLFDWLRQ WRRXU UHFRPPHQGDWLRQV\VWHP :H H[DPLQHG WKH SHUIRUPDQFH RI WKLV DSSURDFKFRPSDUHG WR VWDQGDUG NPHDQV DQG GLVFRYHUHG D IROGDPHOLRUDWLRQLQWHUPVRIFOXVWHUFRKHUHQFH)XUWKHUPRUHZHIRXQGDQDYHUDJHLPSURYHPHQWRIDURXQGLQWHUPVRI)PHDVXUHIRUWKHUHVXOWLQJVXJJHVWLRQVRIRXUUHFRPPHQGDWLRQ
HQJLQHZKHQXVHGE\UHDOV\VWHPXVHUV$GGLWLRQDOO\VRPHEDVLF H[SHULPHQWDWLRQ VKRZHG WKDW :NPHDQV SHUIRUPVXVXDOO\EHWWHUFRPSDUHGWRRWKHU&)WHFKQLTXHVZKHQDSSOLHGWR RXU UHFRPPHQGDWLRQ V\VWHP :H EHOLHYH WKDW WKH DERYHIDFWV SURYH WKH VLJQLILFDQFH RI XVHU FOXVWHULQJ DQG LQSDUWLFXODU:NPHDQVWRWKHUHFRPPHQGDWLRQSURFHVV
$VIDUDVIXWXUHZRUNLVFRQFHUQHGZHDUHSODQQLQJRQLQFRUSRUDWLQJ FOXVWHU ODEHOLQJ IRU WKH JHQHUDWHG SURILOHFOXVWHUVWRWKHV\VWHPDVZHOODVDXWRPDWHWKHGHWHFWLRQRIWKHEHVWVXLWHGQXPEHURIFOXVWHUVIRU:NPHDQVWKDWLVEHVWIRUWKHXQGHUO\LQJGDWD
$&.12:/('*0(17
7KLV UHVHDUFK KDV EHHQ FRILQDQFHG E\ WKH(XURSHDQ8QLRQ(XURSHDQ6RFLDO)XQG(6)DQG*UHHN QDWLRQDO IXQGV WKURXJK WKH 2SHUDWLRQDO3URJUDP(GXFDWLRQDQG/LIHORQJ/HDUQLQJRIWKH
1DWLRQDO6WUDWHJLF5HIHUHQFH)UDPHZRUN165)5HVHDUFK )XQGLQJ 3URJUDP +HUDFOHLWXV ,,,QYHVWLQJ LQ NQRZOHGJH VRFLHW\ WKURXJK WKH(XURSHDQ6RFLDO)XQG
5()(5(1&(6>@ ' $UWKXU DQG 6 9DVVLOYLWVNLL NPHDQV WKH DGYDQWDJHV RI
FDUHIXOVHHGLQJ,Q3URFHHGLQJVRIWKHHLJKWHHQWKDQQXDO$&06,$0V\PSRVLXPRQ'LVFUHWHDOJRULWKPVSS
>@ & %RXUDV 9 3RXORSRXORV DQG 9 7VRJNDV 3H566RQDOV FRUHIXQFWLRQDOLW\ HYDOXDWLRQ (QKDQFLQJ WH[W ODEHOLQJ WKURXJK
SHUVRQDOL]HGVXPPDULHV'DWDDQG.QRZOHGJH(QJLQHHULQJ-RXUQDO(OVHYLHU6FLHQFH9RO,VVXHSS
>@ & %RXUDV DQG 9 7VRJNDV ,PSURYLQJ WH[W VXPPDUL]DWLRQ XVLQJQRXQ UHWULHYDO WHFKQLTXHV /HFWXUH 1RWHV LQ &RPSXWHU 6FLHQFH
.QRZOHGJH%DVHG,QWHOOLJHQW ,QIRUPDWLRQ DQG (QJLQHHULQJ 6\VWHPV9ROSS
>@ , & DGH] ' +HFNHUPDQ & 0 HHN 3 6P\WK DQG 6 :KLWH9LVXDOL]DWLRQRI 1DYLJDWLRQ3DWWHUQV RQ D :HE6LWH 8VLQJ 0RGHO%DVHG&OXVWHULQJ,QSURFHHGLQJVRIWKH,QWHUQDWLRQDO&RQIHUHQFHRI
.QRZOHGJH'LVFRYHU\DQG'DWD0LQLQJSS
>@ 5&RROH\ %0REDVKHU DQG -6ULYDVWDYD *URXSLQJ :HE 3DJH5HIHUHQFHVLQWR7UDQVDFWLRQVIRU0LQLQJ:RUOG:LGH:HE%URZVLQJ3DWWHUQV ,Q3URFHHGLQJV RI WKH ,((( .QRZOHGJH DQG 'DWD
(QJLQHHULQJ([FKDQJH:RUNVKRS1RYHPEHUS
>@ 5 &RROH\ % 0REDVKHU DQG- 6ULYDVWDYD 'DWDSUHSDUDWLRQ IRUPLQLQJ ZRUOG:LGH:HEEURZVLQJSDWWHUQV -RXUQDORI.QRZOHGJHDQG,QIRUPDWLRQ6\VWHPV
>@ 0 (L ULQDNL DQG 0 9D]LUJLDQQLV :HE PLQL QJ IRU :HESHUVRQDOL]DWLRQ$&0 7UDQVDFWLRQV RQ ,QWHUQHW 7HFKQRORJ\ SS
>@ @ )+:DQJDQG+06KDR(IIHFWLYHSHUVRQDOL]HGUHFRPPHQGDWLRQEDVHGRQ WLPHIUDPHGQDYLJDWLRQ FOXVWHULQJDQG DVVRFLDWLRQPLQLQJ([SHUW6\VWHPVZLWK$SSOLFDWLRQVSS
>@ /@