Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

25
Mathematics of Incidence part 3: What’s with Abby and Doughnuts? (more about concept lattices) Benjamin J. Keller bjkeller.github.io linkedin.com/in/bjkeller v.1, 10 October 2014 Creative Commons Attribution-ShareAlike 4.0 International License a u 1 u 2 s 2 s r(a) s l

description

Third in a growing series using a simple Collaborative Filtering example to explore elements of formal concept analysis and lattice theory in analysis of incidence data. In these slides, we learn a little more about concept lattices, answer the mystery of the doughnuts for Abby, and by looking at recommendations frame a new question about dependencies.

Transcript of Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Page 1: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Mathematics of Incidencepart 3: What’s with Abby and Doughnuts?

(more about concept lattices) !Benjamin J. Kellerbjkeller.github.io linkedin.com/in/bjkeller!v.1, 10 October 2014

Creative Commons Attribution-ShareAlike 4.0 International License

a u1 u2 uk

s1s2sr(a) tsl

un(a)

� �

Page 2: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Recall: Abby and the doughnuts

bananas

doughnutsapples

eggscherries

Charles

DavidBrian

Abby

"Abby

#doughnuts

apples bananas cherries doughnuts eggs

Abby X XBrian X X X XCharles X X X X XDavid X X X

Can recommend foods to Abby based on shared “likes” with other users by composing concepts from principal filter for Abby with those from principal ideal for doughnuts

Page 3: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

CF rating distributionSTUDYING RECOMMENDATION ALGORITHMS 147

Figure 6. Hits-buffs structure of the (reordered) MovieLens dataset.

4.2. Experiments

The goal of our experiments is to investigate the effect of the hammock width w on the aver-age characteristic path lengths of the induced Gs social network graph and Gr recommendergraph for the above datasets. We use versions of the EachMovie and MovieLens datasetssanitized by removing the rating information. So, even though rating information can easilybe used by an appropriate jump function (an example is given in Aggarwal et al. (1999),we explore a purely connection-oriented jump in this study. Then, for various values ofthe hammock width w, we form the social network and recommender graphs and calculatethe degree distributions (for the largest connected component). This was used to obtain thelengths predicted by Eqs. (1) and (2) from Section 3. We also compute the average pathlength for the largest connected component of both the secondary graphs using parallelimplementations of Djikstra’s and Floyd’s algorithms. The experimental observations arecompared with the formula predictions.

4.2.1. MovieLens. Figure 7 describes the number of connected components in Gr as aresult of imposing increasingly strict hammock jump constraints. Up to about w = 17, thegraph remains in one piece and rapidly disintegrates after this threshold. The value of thistransition threshold is not surprising, since the designers of MovieLens insisted that everyparticipant rate at least κ = 20 movies. As observed from our experiment results, after thethreshold and up to w = 28, there is still only one giant component with isolated peoplenodes (figure 8, left). Specifically, the degree distributions of the MovieLens social networkgraphs for w > 17 show us that the people nodes that are not part of the giant componentdo not form any other connected components and are isolated. We say that a jump shattersa set of nodes if the vertices that are not part of the giant component do not have any edges.This aspect of the formation of a giant component is well known from random graph theory(Bollabas, 1985). Since our construction views the movies as a secondary mode, we canensure that only the strictest hammock jumps shatter the NM movie nodes. Figure 8 (right)

MovieLens incidence (Mirza et al., JIIS 2003)

Page 4: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

CF rating distributionSTUDYING RECOMMENDATION ALGORITHMS 147

Figure 6. Hits-buffs structure of the (reordered) MovieLens dataset.

4.2. Experiments

The goal of our experiments is to investigate the effect of the hammock width w on the aver-age characteristic path lengths of the induced Gs social network graph and Gr recommendergraph for the above datasets. We use versions of the EachMovie and MovieLens datasetssanitized by removing the rating information. So, even though rating information can easilybe used by an appropriate jump function (an example is given in Aggarwal et al. (1999),we explore a purely connection-oriented jump in this study. Then, for various values ofthe hammock width w, we form the social network and recommender graphs and calculatethe degree distributions (for the largest connected component). This was used to obtain thelengths predicted by Eqs. (1) and (2) from Section 3. We also compute the average pathlength for the largest connected component of both the secondary graphs using parallelimplementations of Djikstra’s and Floyd’s algorithms. The experimental observations arecompared with the formula predictions.

4.2.1. MovieLens. Figure 7 describes the number of connected components in Gr as aresult of imposing increasingly strict hammock jump constraints. Up to about w = 17, thegraph remains in one piece and rapidly disintegrates after this threshold. The value of thistransition threshold is not surprising, since the designers of MovieLens insisted that everyparticipant rate at least κ = 20 movies. As observed from our experiment results, after thethreshold and up to w = 28, there is still only one giant component with isolated peoplenodes (figure 8, left). Specifically, the degree distributions of the MovieLens social networkgraphs for w > 17 show us that the people nodes that are not part of the giant componentdo not form any other connected components and are isolated. We say that a jump shattersa set of nodes if the vertices that are not part of the giant component do not have any edges.This aspect of the formation of a giant component is well known from random graph theory(Bollabas, 1985). Since our construction views the movies as a secondary mode, we canensure that only the strictest hammock jumps shatter the NM movie nodes. Figure 8 (right)

MovieLens incidence (Mirza et al., JIIS 2003)

people who have rated more movies

movies with more ratings

(more “popular”)

Page 5: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

A question of place

• Location in the lattice tells us something about users and foods

• User higher in lattice has only experienced more popular foods (e.g., those with more “likes”)

• Food lower in lattice only experienced by the more adventurous

bananas

doughnutsapples

eggscherries

Charles

DavidBrian

Abby

More popular items

Users who have experienced more

items

Page 6: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

What happens if Abby likes cherries?

apples bananas cherries doughnuts eggs

Abby X X XBrian X X X XCharles X X X X XDavid X X X

bananas

doughnutsapples,cherries

eggs

Charles

DavidBrian

Abby

Makes cherries more popular, they move up the lattice

Page 7: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

What happens if Abby likes grapes?

apples bananas cherries doughnuts eggs grapes

Abby X X XBrian X X X XCharles X X X X X XDavid X X X

(Charles likes everything)

grapes

bananas

doughnutsapples

eggscherries

Charles

DavidBrianAbby

Abby moves down the lattice

Page 8: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Definitions: top, co-atoms, bottom, atoms

co-atom – element of lattice covered by topa � > a 2 L

atom – element of lattice that covers bottom? � b b 2 L

top of lattice is least upper bound of all elements >written

bottom of lattice is greatest lower bound of all elements?written

Page 9: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

(squirm) A concept lattice technicality

• Concept lattice is complete: it has a top and bottom

• Formally, defined as closure of empty sets

• top is concept with no attributes

• bottom is concept with no objects

• Been using closure on G (attributes shared by all objects) and M (objects with all attributes)

bananas

doughnutsapples

eggscherries

Charles

DavidBrian

Abby

�;

µ;

µM

�G

Page 10: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Being practically lazy

�G � µ; �; � µMCan show that where they only differ when there is an attribute shared by all objects or an object with all attributes

and

More importantly: �A � �G A ✓ G

µB ⌫ µM B ✓ Mfor allfor all

So ignore formal top and bottom and use

�G as top µM as bottom a � �G as a co-atom b � µM as an atom

Page 11: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Why do cherries become a co-atom?

apples bananas cherries doughnuts eggs

Abby X X XBrian X X X XCharles X X X X XDavid X X X

bananas

doughnutsapples,cherries

eggs

Charles

DavidBrian

Abby

Adding Abby likes cherries differentiates cherries from doughnuts (other foods liked by Brian and Charles)

Page 12: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Why does Abby become an atom?

apples bananas cherries doughnuts eggs grapes

Abby X X XBrian X X X XCharles X X X X X XDavid X X X

(Charles likes everything)

grapes

bananas

doughnutsapples

eggscherries

Charles

DavidBrianAbby

Adding Abby likes grapes differentiates Abby from Brian (e.g., other users who like apples and bananas)

Page 13: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Subtleties of place (CF)

• An atom represents

• at least one user liking unique food(s)

• user(s) who like more foods

• A co-atom represents

• at least one food liked by unique user(s)

• food(s) liked by more users

• Correspond to the long-tail of the user/food rating distributions

• Concepts between atoms and co-atoms closer to the fat end

• Ponder: Can we figure out where the long-tail starts?

Page 14: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

So, what is it with Abby and doughnuts?

• Just that doughnuts are the most popular food that Abby has not liked

• Because the other foods are liked by subsets of same users, they fall in the principal ideal for doughnuts

• If there were another group with different tastes that Abby shared likes with, then there would be another most popular food

bananas

doughnutsapples

eggscherries

Charles

DavidBrian

Abby

"Abby

#doughnuts

Page 15: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Let's try it…

apples bananas cherries doughnuts eggs flounder lychee

Abby X XBrian X X X XCharles X X X X X X XDavid X X XEvan X XFan X X X X X

Page 16: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

So, what is it with Abby, doughnuts and lychee?

• Based on new incidence, the ideal for both doughnuts and lychee fruit contains all of the concepts that could be used to recommend to Abby

• So, they correspond to the maximal concepts that generate this filter of composable concepts

• Ponder: do they have to be co-atoms?

lychee

flounder

Fan

Evan

bananas

doughnutsapples

eggs

cherries

Charles

DavidBrian

Abby

Page 17: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

OK, let's back up…

Page 18: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

A recommendation of item t to user a

a u1 u2 uk

s1s2sr(a) tsl

un(a)

� �

({a, u1, u2,…, uk}, {s1, s2,…, sl })

({u1, u2,…, uk}, {s1, s2,…, sl , t})

recommend t to a by composing concept from principal filter of a with a subconcept in principal ideal of t

Page 19: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Wait, let's figure out this co-atom thing

µt µs1, µs2, . . . µsls1, s2, . . . sl

Not necessary that any of orare co-atoms as shown, where areitems that a has rated, and t is the target item

si

µsi = µSi

But, for any there is a set of items

and satisfying

Si ⇢ �(µsi)

consisting only of co-atoms m 2 Si µm � �G

So, can always substitute in a set for any item that doesn’t correspond to a co-atom – but need to revisit this

Page 20: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

a u1 u2 uk

s1s2sr(a) tsl

un(a)

� �

A different perspective

µT

T = {s1, s2, . . . , sl, t}

U = {a, u1, u2, . . . , uk}

µU

Page 21: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

A different perspective

• Each recommendation corresponds to a selection of items and users

• So, can recommend items determined by selection of items liked by a and a selection of users who like those items

a u1 u2 uk

s1s2sr(a) tsl

un(a)

� �

Page 22: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

So, what is a good recommendation?

µT

T = {s1, s2, . . . , sl, t}

U = {a, u1, u2, . . . , uk}

µUA composition of with where

are sets that are “strongly interdependent”

and

Expect that it would be something like

But, we don’t really know what that means

Page 23: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

Questions of ponderousness

• What is dependency? And, what does it have to do with good recommendations?

• What happens with recommendations when don't have atoms/co-atoms?

• Can we figure out where the long-tail starts?

• Serendipity who?

Page 24: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

About me and these slides

I am Ben(jamin) Keller. I learn and, sometimes, create through explaining. I had been involved in a big (US) federally funded project that had the goal of helping biomedical scientists tell stories about their experimental observations. The project is long gone, but I’m still trying to grok how such a thing would work. Much of biological data comes in the form of observations that are distilled to something that looks like an incidence relation, which brings us to this series of presentations. My goal for the slides is to deal with the mathematics and analysis of incidence in an approachable way, but the intuitive beginnings will eventually allow us to embrace the more complex later.

Page 25: Mathematics of Incidence (part 3): What's with Abby and Doughnuts? (More about concept lattices)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0

International License.