Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of...

21
Introduction Approach Ranking Results Conclusion Venue Appropriateness Prediction for Contextual Suggestion Mohammad Alian Nejadi Ida Mele Fabio Crestani January 16, 2017 M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 1/21

Transcript of Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of...

Page 1: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Venue Appropriateness Prediction for

Contextual Suggestion

Mohammad Alian NejadiIda Mele

Fabio Crestani

January 16, 2017

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 1/21

Page 2: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Introduction

What is the track?

Contextual Suggestion Track deals with complexinformation needs which are highly dependent oncontext and user interests.

What do we have?

User contextUser history or profile

What should we do?

Rank the candidate list: Phase 1 and Phase 2

Evaluation: nDCG@5, P@5, and MRR

Fifth year

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 2/21

Page 3: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Collection

What is provided by the organizers?

An attraction IDA context (city) ID which indicates which city thisattraction is inA URL with more information about the attractionA titleA crawled collection of the URLs in the collection

What should we collect?Crawl venues from Location-based Social Networks(LBSNs):

FoursquareYelp

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 3/21

Page 4: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Collection(cont.)

Phase 1:

Virtually 600K venues crawled on Foursquare

Phase 2:

13,704 venues crawled on Foursquare13,604 venues crawled on Yelp

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 4/21

Page 5: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Approach

A combination of multimodal scores from multiple sources

Sources: Foursquare and Yelp

Types of information: categories, venue taste keywords,reviews, user context

Context appropriateness prediction

Two types of scores:

Frequency basedMachine-learning based

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 5/21

Page 6: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Frequency-based Scores

To have a better idea of the user’s taste and behavior weneed to take into account their liked/disliked categories

We have already extracted the categories andsubcategories for each place using Yelp, Foursquare

It is not clear exactly which category or subcategory isliked/disliked:

Italian - Takeaway - PizzaItalian - Pasta - Seafood - PizzaAmerican - Good for Families - Pizza

It is quite obvious that he/she likes Pizza

We calculate a frequency-based score to model users

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 6/21

Page 7: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Frequency-based Scores(cont.)

To calculate the frequency-based scores, we followedthese steps to create frequency-based profiles:

1 For each category/subcategory for a place with positiverating

2 Add the category/subcategory to positive profile (cf+)3 If the category/subcategory already exists in model, add

one to its count4 Normalize the counts5 Do the same for places with negative rating to build

negative profile (cf−)

A new venue’s categories is compared to the profile andthe scores are summed up:

Scat(u, v) =∑

ci∈C(v)

cf+(ci)− cf−(ci).

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 7/21

Page 8: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Frequency-based Scores(cont.)

Calculate the frequency-based score with following typesof information:

Foursquare Categories → SFcat

Yelp Categories → SYcat

Foursquare Venue Taste Keywords → SFkey

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 8/21

Page 9: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Machine-learning-based Scores

We assume that a user likes what others like about aplace and vice versa

Find reviews with similar rating:

Positive Profile: Other users’ reviews with rating 3 or 4corresponding to places that user gave a similar ratingNegative Profile: Other users’ reviews with rating 0 or1 corresponding to places that user gave a similar rating

Train a classifier for each user → SVM

Features: TF-IDF score of each term

Score: The value of decision function: SYrev

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 9/21

Page 10: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Contextual Appropriateness

We need to predict the appropriateness of a venue given acontext

Some are objective and easy to predict:

Is a Nightlife Spot appropriate for a Family? NoIs a Pizza Place appropriate to go with Friends? Yes

Some are very subjective:

Is a Pharmacy appropriate to go on a Business trip?Is a University appropriate to go on a Day trip?

We asked crowd workers on CrowdFlower to judge it.

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 10/21

Page 11: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Contextual Appropriateness(cont.)

We asked the crowd workers to judge if a Context isappropriate for a Category?

We did for almost all category-context pairs, 5assessments per pair

Examples:

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 11/21

Page 12: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Contextual Appropriateness(cont.)

Sample output:

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 12/21

Page 13: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Appropriateness Prediction

Given all pairs of context-category assessments, we needto decide if a venue is appropriate for a context

A trip is described with multiple contextual dimensions:Trip Type, Group Type, Trip Duration

A venue is described with multiple categories: Restaurant,Pizza Place, Pasta

Given the full description of the trip, we predict theappropriateness for each category:

For training data: we asked crowd workers to label 10%of the dataWe gave them the full description, and asked 3 workersto assess the appropriateness

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 13/21

Page 14: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Appropriateness Prediction(cont.)

Examples:

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 14/21

Page 15: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Appropriateness Prediction(cont.)

Examples:

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 15/21

Page 16: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Appropriateness Prediction(cont.)

We trained a SVM classifier using the training set

We predicted the appropriateness score for each categoryassociated with a venue

The overall appropriateness for a venue is the minimumscore (SF

cxt)

Example:

Assume the scores for a context given the categories:Restaurant: 1Asian Restaurant: 0.8Sushi: 0.1

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 16/21

Page 17: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Ranking

Our approach:We perform a linear combination on the scores:

SFcat = Frequency-based category score from Foursquare

(Phase 1 & 2)SYcat = Frequency-based category score from Yelp (Phase

2)SFkey = Frequency-based venue taste keyword score from

Foursquare (Phase 1 & 2)SYrev = Machine-learning-based review score from Yelp

(Phase 2)SFcxt = Machine-learning-based context appropriateness

score from Foursquare (Phase 2)

5-fold cross-validation

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 17/21

Page 18: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Results

We submitted 5 runs: 2 for Phase 1 and 3 for Phase 2

Phase 1:

USI1: SFcat

USI2: SFcat and SF

key

Phase 2:

USI3: Fielded Factorization Machines to combine:categories and reviewsUSI4: SF

cat , SYcat , S

Fkey , and SY

rev

USI5: SFcat , S

Ycat , S

Fkey , SY

rev , and SFcxt

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 18/21

Page 19: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Results

nDCG@5 P@5 MRR

Phase 1 USI1 0.2578 0.3934 0.6139USI2 0.2826 0.4295 0.6150Median 0.2133 0.3508 0.5041

Phase 2 USI3 0.2470 0.4259 0.6231USI4 0.3234 0.4828 0.6854USI5 0.3265 0.5069 0.6796Median 0.2562 0.3931 0.6015

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 19/21

Page 20: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Conclusion and Future Work

We presented a set of multimodal scores from multipleLBSNs

We created two datasets which can be used to predictcontextually appropriate venues

We showed how we can use those datasets to suggestappropriate venues

Explore other methods to incorporate contextualinformation in the basic model

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 20/21

Page 21: Venue Appropriateness Prediction for Contextual SuggestionSources: Foursquare and Yelp Types of information: categories, venue taste keywords, reviews, user context Context appropriateness

Introduction Approach Ranking Results Conclusion

Questions

Thank you for your attention

M. Alian Nejadi, I. Mele, F. Crestani — Venue Appropriateness Prediction for Contextual Suggestion 21/21