Invited talk at Future Networked Technologies / FIT-IT research calls opening Graz, Austria - June...

download Invited talk at Future Networked Technologies / FIT-IT research calls opening Graz, Austria - June 9th, 2010

If you can't read please download the document

Transcript of Invited talk at Future Networked Technologies / FIT-IT research calls opening Graz, Austria - June...

Trust in Recommender Systems and Social Networks

Paolo MassaSoNet @ FBK, Trento, Italyhttp://sonet.fbk.eu

Invited talk at Future Networked Technologies / FIT-IT research calls openingGraz, Austria - June 9th, 2010

Slides licenced under CreativeCommons Attribution-ShareAlike (see last slide for more info)

What I talk about

My personal visions and suggestions for collaborative research projects funded by FIT-IT on the 3 topics of the calls Trust in IT Systems

Semantic Systems and Services

Visual Computing

starting from what I research(ed) on

There will be people submitting projects, people evaluating projects: I want to tell you what I think would be more beneficial as research (and what less).At least give you food for thought ignite discussion -

Who am I

PhD in Trust Metrics in Decentralized Recommender Systems from University of Trento (2006)Since April 2008, Head of SoNet research group at Bruno Kessler Foundation (FBK), Trento, ItalyFocus is Social Networking

6 people

Bruno Kessler Foundation

~400 researchers (IT,Microelectronics,...)

Funded by Autonomous Province of Trento, North-East Italy www.fbk.eu

Not only SoNet group! ;)Data and Knowledge Management http://dkm.fbk.eu (Semantic Systems)

Web of Data http://wed.fbk.eu (Semantic Services)

Interactive interfaces http://i3.fbk.eu (Visual Computing)

Predictive Models for Biomedicine & Environment http://mpba.fbk.eu (Visual Computing)

Security and Trust http://st.fbk.eu/ (Trust in IT)

OUTLINE

Research on Trust in Recommender Systems as motivating example

Research on Social Networking at SoNet

Promising directions for Trust in IT Systems and Semantic Systems and Services

Trust

My definition of Trust: explicit rating of a user on another user about the perceived quality of the user's characteristics

Ex: I trust Mena as 0.9

Trust as social relationship and NOT about security (later I touch soft security)

Trust usage for real

Explicit trust statement I trust Mena as 0.9 (in [0,1])

E-marketplaces: Ebay.com, Epinions.com, Amazon.comSocial Network Sites: Facebook, Twitter, Flickr, ...Job sites: LinkedIn, Ryze, News sites: Slashdot.org, Kuro5hin.orgP2P networks: eDonkey, Gnutella, JXTAOpen Source Developers communities: Advogato, Affero, GithubHospitalityclub, couchsurfing: host in your house unknown people!Network of personal weblogs (blogroll)Semantic Web: FOAF (Friend-Of-A-Friend), XFNGoogle (and Yahoo!): TrustRank, PageRank

[1] P. Massa (2006). A survey of trust use in current real systems. In "Trust in E-services: Technologies, Practices and Challenges".

Trust Statement

ME

Mena

0.9

Explicit trust statement I trust Mena as 0.9 (in [0,1])

A node is a user.

A direct edge is a trust statement

Trust Network

ME

Mena

0.9

Doc

1

0

Cory

Ben

0.6

Mary

1

- weighted (0=distrust, 1=max trust)

- asymmetric

- subjective

A node is a user.

Properties of Trust:

A direct edge is a trust statement

0.2

Aggregate all trust statements Trust Network (a social network with trust as social relationship)

Trust Network

ME

Mena

0.9

Doc

1

0

Cory

Ben

0.6

Mary

1

What can you do with a Trust network?

Social Network Analysis (SNA):Computing centrality, indegrees, Powerlaws for unveiling the Topology of the network

What I did?Focusing on trust metrics

0.2

Trust Metrics

ME

Mena

0.9

Doc

1

Cory

0.2

?

?

Ben

0.6

Mary

1

Trust Metric:

Uses existing edges for predicting valuesof trust for non-existing edges.

Thanks to trust propagation, if you trust someone, then you have some degree of trust in anyone that person trusts.

0

PageRank: a trust metric?

Nodes are web pages, Edges are links (not weighted).PageRank (Google) computes the importance of every single page based on number and quality of incoming edges...So, YES: PageRank is a trust metric.

Webpage

Webpage

Webpage

Webpage

Webpage

Webpage

Imagine the web as atrust network

TM perspective: Local or Global

Global Trust Metrics:

Reputation of user is based on number and quality of incoming edges. Bill has just one predicted trust value (0.5).

PageRank (eBay, Slashdot, ). Work bad for controversial people (Bush)

Local Trust Metrics

Trust is subjective --> consider personal views (trust Bill?)

Local can be more effective if people are not standardized.

ME

Mena

1

Doc

1

Mary

Bill

1

0

How much can Bill be trusted? On average (by the community)? By Mary? And by ME?

People A trusts canbe totally different frompeople trusted by B

PageRank is Global.For web pages it worksenough well butproblems with High controversial topicsabort or jew

I tend to useReputation for this average trustTrust for the personal prediction.

Bill is Bill Gates

Global trust metricstend to standardize opinions

Global vs Local Trust Metrics

Global vs Local Trust MetricsObjective vs Subjective

Neutral Point Of View vs Your personal POV

Tyranny of the majority vs daily me/echo chambers

Trust in IT: TMs 4 Hard vs Soft Security?Soft security [2]: protects something from harm in quiet and unobtrusive ways, often invisibly and after the fact, rather than with visible barriers before the fact.

Ex: Ad-hoc, fluid, on-the-fly file permissions

[2] L. Rasmusson, S. Jansson. Simulated Social Control for Secure Internet Commerce. New Security Paradigms Workshop. ACM, 1996

Trust in Recommender Systems

How Trust Metrics are used in Recommender Systems (ex: Amazon)?

Recommend items (ex: books) appreciated by users you may trust (according to the chosen Trust Metric),

instead of recommending items appreciated by similar users (Collaborative Filtering)

Massa PhD Thesis Conclusions

Experiments on real data crawled from Epinions.com~132K users, ~14M items ratings, ~841K trust statements (positive and negative)

Trust-aware RSs smaller errors, higher coverage, especially for cold start users [3]

Local TMs very effective, especially on controversial people (>20%!) [4]

[3] P. Massa, P. Avesani (2007). Trust-aware Recommender Systems. ACM Recommender Systems Conference, Minneapolis, Minnesota, USA.[4] P. Massa, P. Avesani (2007). Trust metrics on controversial users: balancing between tyranny of the majority and echo chambers. International Journal on Semantic Web and Information Systems (IJSWIS).

Generalizing suggestions

Key points of this research on Trust-aware Recommender Systems:Open Web

Real data from real users

Share the data!

I released all the collected datasets at http://www.trustlet.orgEpinions.com, Advogato.org,

Also good for improving visibility of your research!

Nowadays data are starting to become available, mainly because of electronic trails we leave on Social Network Sites

OUTLINE

Research on Trust in Recommender Systems as motivating example

Research on Social Networking at SoNet

Promising directions for Trust in IT Systems and Semantic Systems and Services

Social Networking

Social Network Sites [5]: Facebook, Twitter,

SoNet group at FBK focuses on this

[5] boyd, d. m., & Ellison, N. B. (2007). Social network sites: Definition, history, and scholarship. Journal of Computer-Mediated Communication, 13(1), article 11.

Wikipedia as a social network

The peak of User generated content!SoNet focus is on Wikipedia as SN

Millions of users!!!

We work on the social part of Wikipedia, not the content part!

User talk page http://en.wikipedia.org/wiki/User_talk:Phauly

User talk page http://en.wikipedia.org/wiki/User_talk:Phauly

0.6

User talk page http://en.wikipedia.org/wiki/User_talk:Phauly

0.6

Shell

1

Phauly

User talk page http://en.wikipedia.org/wiki/User_talk:Phauly

0.6

Shell

1

Phauly

User talk page http://en.wikipedia.org/wiki/User_talk:Phauly

0.6

Shell

1

Phauly

Martin

1

User talk page http://en.wikipedia.org/wiki/User_talk:Phauly

0.6

Public conversations!Real data!

Shell

1

Phauly

Martin

1

ME

Mena

9

Doc

96

1

Cory

Ben

6

Mary

16

2

Directed and Weighted social network of who talks to whom on Wikipedia

Vec.wikipedia.org: just 922 nodes!

En.wikipedia.org social network has millions of nodes!

SoNet research lines on Wikipedia

Structural properties of different Wikipedia social networks (SNA)

How are talk pages used by different group of users: content analysis on Venetian Wikipedia

Comparing different ways of extracting Wikipedia social network

(future) Longitudinal: evolution in time of a Wikipedia social network

Why talk about Wikipedia SN?

Interesting! ;)

Neutral Point Of View (NPOV) tyranny of the majority?But we see edit wars, personal POVs, conflicts of interests

Trust metrics for personalizing content on wikipedia?

Soft security is how permissions are managed on Wikipedia: lessons for Trust in IT?

In few slides, I talk about Semantic services thanks to Wikipedia: DBPedia

Why did I talk about Wikipedia social network research?

It's another example of research:Open web

Real data from real people

SoNet research on Enterprise2.0

We explored Enterprise2.0, created an open source platform Taolinhttp://taolin.fbk.eu

And did research on its usage in FBK and relationships with social capital [6]

BUT we dropped this lineNO Open web: Intranet!

NO Real data from real people: few users!

[6] An Empirical Analysis on Social Capital and Enterprise 2.0 Participation in a Research Institute (2010). Asonam, Conf. on Advances in Social Networks Analysis and Mining.

By the way, I assume Wikipedia as the prototype of the organization of the futureHorizontal

Ad-hoc, rules/policies are negotiated continuously

Soft security, instead of Hard security [2]Messier but is not reality a mess?

[2] L. Rasmusson, S. Jansson. Simulated Social Control for Secure Internet Commerce. New Security Paradigms Workshop. ACM, 1996

OUTLINE

Research on Trust in Recommender Systems as motivating example

Research on Social Networking at SoNet

Promising directions for Trust in IT Systems and Semantic Systems and Services

Going back to open web and real data,Don't invent some new ways of thinking/speaking and expect people to adopt it!I propose to change all the web into this new format, then we will be able to do lots of things ...

If all the organizations would use this tool, then ...

Look for what is already available on the Web, build on it, look at it with different eyes.

Going back to open web and real data,Don't invent some new ways of thinking/speaking and expect people to adopt it!I propose to change all the web into this new format, then we will be able to do lots of things ...

If all the organizations would use this tool, then ...

Look for what is already available on the Web, build on it, look at it with different eyes.

Ex: billions of links were already there when Google founders saw them and used them for improving search engines!PageRank as 1 of 3 examples in the original definition of Web2.0 by Tim O'Reilly (2004)

Other examples of this (relevant for Semantic Systems and Services)DBPedia.orgInstead of asking people to express knowledge in RDF, automatically extract knowledge (in form of RDF triples) from Wikipedia!

Then SPARQL it (ex: People who were born in Berlin before 1900)

Another example of research:Open Web (from Web, to Web: mashable!)

Real data from real users (info was there!)

More examples

Other examples of this (relevant for Semantic Systems and Services)Microformats / RDFaBoth looked into what is there (Web) and propose small/achievable changes

Another example of research:Open web

Real data from real people

hReview example

Microformat for reviews of items (and ratings): for Trust-aware Rss!

5 out of 5 stars Crepes on Cole is awesome Reviewer: Tantek - April 18, 2005 Crepes on Cole is one of the best little creperies in San Francisco. Excellent food and service. Plenty of tables in a variety of sizes for parties large and small. Window seating makes for excellent people watching to/from the N-Judah which stops right outside. I've had many fun social gatherings here, as well as gotten plenty of work done thanks to neighborhood WiFi. Visit date: April 2005 Food eaten: Florentine crepe

Linked Data

Other examples of this (relevant for Semantic Systems and Services)LinkedData.org

Instead of asking people to express knowledge in RDF, transform available datasets! And link them!Link your research and data to what is already existent (The essence of the Web is the link!)

May 2009: 4.2 billion RDF triples, interlinked by around 142 million RDF links.

Conclusions - Suggestions

Lots of data are there and comingSocialNetworkSites have APIs for exporting data

data.gov, data.gov.uk and similar initiatives worldwide

Don't make unrealistic assumptions

Look for what is available (Web)Possibly with different glasses

Propose projects for improving the open Web, and used by real users:Wrt Trust in IT, Semantic Systems and Services

Interested in collaborating with SoNet or other FBK groups? Get in touch!Now or at http://sonet.fbk.eu

Thank you!Questions?

Licence of these slides

These slides are released underCreative CommonsAttribution-ShareAlike 2.5You are free: * to copy, distribute, display, and perform the work * to make derivative works * to make commercial use of the workUnder the following conditions:Attribution. You must attribute the work in the manner specified by the author or licensor.Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under a license identical to this one. * For any reuse or distribution, you must make clear to others the license terms of this work. * Any of these conditions can be waived if you get permission from the copyright holder.Your fair use and other rights are in no way affected by the above.More info at http://creativecommons.org/licenses/by-sa/2.5/

Click to edit the title text format

Click to edit the outline text formatSecond Outline LevelThird Outline LevelFourth Outline LevelFifth Outline LevelSixth Outline LevelSeventh Outline LevelEighth Outline LevelNinth Outline Level