Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets:...

13
1 Introduction (Motivation) In 2015, a total of 728 millions of public pictures were uploaded to Flickr Such large amount of user-generated data makes multimedia indexing and retrieval a more challenging task However, it also opens new opportunities for development of novel and more efficient tools

Transcript of Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets:...

Page 1: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

1

Introduction(Motivation)

In2015,atotalof728millionsofpublicpictureswereuploadedtoFlickr

Suchlargeamountof user-generateddatamakesmultimediaindexingandretrievalamorechallengingtask

However,italsoopensnewopportunitiesfordevelopmentofnovelandmoreefficienttools

Page 2: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

2

Introduction(Motivation)User-generated multimedia contents depictindividual experiences or collective activities

WhatisanEvent?

Arealworldhappening toWho?,What?,When?andWhere?

Aneventisplannedbypeopleattendedbypeopleandrelatedmediaarealsocapturedbypeople

Personalexperiences

Collectiveactivities

Page 3: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

3

EventDetectioninImages:State-of-the-art

VisualInformation

Metadata(tags,GPSinformation

etc.)

Visual+Metadata

Page 4: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

4

BenchmarkDatasets:State-of-the-art

Currentdatasetsfor

eventdetectioninimages

lownumberofimages(e.g.,EIMM[1],Cultural

eventrecognitiondatabase[3])

limitedvarietyofevents/eventclasses(e.g.,EiMM [2]andSED2013

database[2])

Unbalancedeventclasses(e.g., EiMM [1]andSED2013[2])

1. R.Mattivi etal..Exploitationoftimeconstraintsfor(sub-)eventrecognition.InProceedingsofthe2011jointACMworkshoponModelingandrepresentingevents,pages7(12).ACM,2011..

2. T.Reuteretal..Socialeventdetectionatmediaeval2013:Challenges,datasets,andevaluation.InMediaEval Workshop,2013..3. S.Escalera etal..ChaLearn LookingatPeople2015:ApparentAgeandCulturalEventRecognitionDatasetsandResults,ICCV2015

Page 5: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

5

USED:AlargeScaleSocialEventDetectionDatasetAlargecollectionofimages

Covers14differenteventsclasses

AbalanceddatasetEqualnumberofimagesineachclass(35,000)

Event-classesinUSEDDataset

Page 6: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

6

USED:AlargeScaleSocialEventDetectionDataset

DiversityincontentsIndoorVs.outdoorGrouppicturesVs.SingleportraitImagesofkey-momentsinaneventMulti-culturalOutliersandborderlinecasesaremanuallyremoved

Somesampleimagesfromweddingclass

Page 7: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

7

USED:AlargeScaleSocialEventDetectionDataset

USED490,000 Eventrelated

imagesdepictinga widevarietyof

events

Page 8: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

8

Comparisonswithstate-of-the-artdatasets

ExistingdatasetsforEventDetectionCulturalEventDetectionDatasetEiMMSED

DatasetName #Event-classes Total Images Minimagesinaclass

Max.images inaclass

EiMM 8 (socialevents) 13219 795 2253

SED 7 82213 342 71556

CulturalEvents 50 11776 180-200(Avg.) 180-200(Avg.)

USED 14 490000 35000 35000

Comparisons ofUSEDwithotherDatasets

Page 9: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

9

ExperimentalValidationofUSED

DISCOVERINGEVENTSFROMSINGLEPICTURESUSINGACONVOLUTIONALNEURALNETWORK

Page 10: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

10

Validation/ExperimentalSetup

Fine-tuningCNN

Classification

Pre-training

ParametersofaCNN(Alexnet)pre-trainedonImageNet dataset

[NIPS2012]

Fine-tunedonnewlycollecteddatasets

Reduced overalllearningrateIncreasedlearningrateof

newlayerMomentum=.9

WeightDecay=.0005

Page 11: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

11

PreliminaryResultsDataset

USED

Event Type Accuracy EventType Accuracy

Concert 74.20% Conference 75.70%

Graduation 66.43% Exhibition 58.54%

Meeting 78.70% Fashion 65.43%

MountainTrip 67.00% Protest 74.58%

Picnic 54.42% Sports 72.24%

Sea-holiday 74.24% Theater 51.90%

Ski-holiday 48.00%

Wedding 51.00%

ResultsonUSEDdataset

DataAssemblageTrainingset=20,000imagesperclassValidationset=7000perclassTestset=7000imagesperclass

Page 12: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

12

ComparisonsofaCNNtrainedonUSEDwithBaselineApproaches

ComparisonwithRosani etal.,[IEEETMM2015]

EiMMDataset SEDDatasetOurApproach 71.54 59.42BaselineApproach 38.8 31.15

0

10

20

30

40

50

60

70

80Ac

curacy(%

)

A.Rosani,G.Baoto,F.G.B.DeNatale,“EventMask:agame-basedframeworkforEvent-saliencyidentificationinImages”,IEEETransactionsonMultimedia2015

Page 13: Introduction (Motivation) · GPS information etc.) Visual + Metadata. 4 Benchmark Datasets: State-of-the-art Current datasets for event detection in images low number of images (e.g.,

13

USED:ALarge-scaleSocialEventDetectionDataset

490,000 Event-relatedimages, 14differentevent-classes,35,000imagesper

class

ENJOY USED!