
An Introduction to Video Fingerprinting

Speaker: Wei-lun Chao

Advisor: Prof. Jian-jiun Ding

DISP Lab, Graduate Institute of Communication Engineering

National Taiwan University, Taipei, Taiwan

DISP Lab @ MD531


Abstract

Video fingerprinting is a video copy detection technique

It offers an alternative way of thinking to watermarking

Existing techniques are not yet very complicated

There is still plenty of room for improvement


Outline & Content

1. Introduction
2. Challenge
3. Basic Structure
4. Existing Techniques
5. Performance Evaluation
6. Our Works
7. Conclusion & Future Works
8. Acknowledgement
9. References


1. Introduction

What is video copy detection, and what is its goal?
- Database and copyright management
- Copy vs. similarity

Two main directions:
- Content-based video copy detection (CBCD)
- Watermarking


Watermarking

Watermarking relies on inserting a distinct pattern into the video stream

It can be simply classified into visible and invisible watermarking

A widely studied research area


CBCD

Pertinent features are extracted as “fingerprints” or “signatures” of the video

Signatures are compared to decide whether a video is a copy

“The media itself is the watermark”


Comparison

| Difference \ Technique | Watermarking | CBCD |
|---|---|---|
| Embed something | Yes | No |
| Main protecting process | Before propagation | After propagation |
| Additional memory | Yes (key) | Yes (feature database) |
| Matching step | Detection + extraction + comparison | Extraction + comparison |

| Technique | Advantages | Disadvantages |
|---|---|---|
| Watermarking | Multi-functional | Must compare the embedded pattern; vulnerable to attacks |
| CBCD | Fast search, no insertion, HVS preserved | Additional data memory, database standard |


2. Challenge

People perceive the videos as the same, while computers don't!

Distortions:
- Luminance changes
- Geometrical modification
- Compression and different formats
- Post-processing
- Malicious attacks

CBCD vs. CBVR (content-based video retrieval)


3. Basic Structure

A video fingerprinting technique should have three important properties:
- Robustness: tolerance to distortion
- Pair-wise independence: needed for identification
- Database search efficiency: needed for practical use


Diagram of CBCD system


System structure

Three parts of the system:
- Database operation (off-line)
- Query operation (on-line)
- Matching (on-line)

Functional blocks:
- Shot detection
- Key-frame extraction
- Feature extraction
- Matching & decision

The most important one: feature extraction + arrangement


Feature extraction

Difference between images and videos: time

Three dimensions in which to explore information (features):
- Color
- Spatial
- Temporal

Two kinds of descriptor:
- Global
- Local

Arrangement: features -> a signature


Matching & Decision

Three steps:
- Searching
- Voting algorithm: find the best match
- Threshold: is the best match a real match?

The local-descriptor case is more complicated to deal with at this step: temporal and spatial registration is needed for each point
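The three steps can be sketched for the simpler global-descriptor case. This is a minimal illustration, not the method of any particular paper: the function names, the L1 distance, the one-vote-per-key-frame rule, and the `min_vote_ratio` threshold are all assumptions made for the example.

```python
from collections import Counter


def l1_distance(a, b):
    """Sum of absolute differences between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def vote_for_best_match(query_keyframes, database, min_vote_ratio=0.5):
    """Return (video_id or None, vote_ratio) for a non-empty query.

    `database` maps a video id to the list of its key-frame feature vectors.
    """
    votes = Counter()
    for q in query_keyframes:
        # Searching: find the database video that holds the nearest key-frame.
        nearest = min(
            database,
            key=lambda vid: min(l1_distance(q, f) for f in database[vid]),
        )
        # Voting: each query key-frame casts one vote.
        votes[nearest] += 1
    best_id, count = votes.most_common(1)[0]
    ratio = count / len(query_keyframes)
    # Threshold: accept the best match only if enough key-frames agree.
    return (best_id if ratio >= min_vote_ratio else None), ratio


# Hypothetical toy database with 3-dimensional key-frame features.
database = {
    "video_a": [[10, 20, 30], [12, 22, 28]],
    "video_b": [[200, 5, 90], [190, 10, 80]],
}
query = [[11, 21, 29], [13, 21, 27], [198, 6, 88]]
match, ratio = vote_for_best_match(query, database)
```

Here two of the three query key-frames vote for `video_a`, so it is accepted at the default threshold and rejected if `min_vote_ratio` is raised above 2/3. The exhaustive search is only for clarity; a real system would need an index to meet the database search efficiency requirement.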


Is there any similarity?

Asymmetric relation!!!


4. Existing Techniques

Four functional blocks:
- Shot detection
- Key-frame extraction
- Feature extraction
- Matching & decision

Feature extraction:
- Global / local descriptors
- Color / spatial / temporal
- Image-based / sequence-based


4.1 Key-frame extraction

Key-frames form a compact representation of a video clip

Techniques found in the literature:
- Motion energy model
- Spatial-temporal color distribution


4.2 Global features

- Shot boundary
- Color: histograms in different color spaces, moments, dominant color
- Changes in or between frames: motion, gradient
- Ordinal features: spatial ordinal, temporal ordinal
- Transform-based


YCbCr histogram

The signature for a shot: a 125 x 8 matrix
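The slide does not say how the 125 x 8 matrix is built. One plausible reading, and it is only an assumption, is that 125 comes from quantizing Y, Cb and Cr to 5 levels each (5 x 5 x 5 = 125 joint bins), with one histogram per key-frame or temporal segment of the shot. The sketch below computes such a 125-bin histogram per frame; the conversion constants are the standard full-range JPEG/BT.601 ones, and the function names are hypothetical.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert one 8-bit RGB pixel to full-range YCbCr (JPEG/BT.601 constants)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def quantize(value, levels):
    """Map a value in [0, 256) to one of `levels` equal-width bins."""
    return max(0, min(levels - 1, int(value * levels / 256)))


def ycbcr_histogram(pixels, levels=5):
    """Normalized joint YCbCr histogram with levels**3 bins (assumed layout)."""
    hist = [0.0] * (levels ** 3)
    for r, g, b in pixels:
        qy, qcb, qcr = (quantize(c, levels) for c in rgb_to_ycbcr(r, g, b))
        hist[(qy * levels + qcb) * levels + qcr] += 1
    count = len(pixels)
    return [h / count for h in hist]


# Hypothetical data: two tiny "key-frames", each a flat list of RGB pixels.
black_frame = [(0, 0, 0)] * 4
mixed_frame = [(0, 0, 0), (0, 0, 0), (255, 255, 255), (255, 255, 255)]
signature = [ycbcr_histogram(frame) for frame in (black_frame, mixed_frame)]
```

Stacking the per-frame histograms gives one 125-dimensional row per frame; with 8 frames or segments per shot this would match the matrix size quoted on the slide, up to a transpose.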


Motion direction histogram

Example: 15 x 15 blocks per frame

i = 0, …, 4

Centroids of gradient orientation

The signature for each frame: M x N CGO scalars, one per block
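A minimal sketch of the idea, assuming the usual definition of the centroid as the gradient-magnitude-weighted mean of the gradient orientation inside each block. The central-difference gradient, the block layout, and the function name `cgo_signature` are assumptions made for illustration and may differ from the original paper's details.

```python
import math


def cgo_signature(frame, m_blocks, n_blocks):
    """Return m_blocks * n_blocks centroids of gradient orientation (radians).

    `frame` is a 2-D list of luminance values.
    """
    height, width = len(frame), len(frame[0])
    weighted = [[0.0] * n_blocks for _ in range(m_blocks)]
    total = [[0.0] * n_blocks for _ in range(m_blocks)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Central differences approximate the spatial gradient.
            gx = frame[y][x + 1] - frame[y][x - 1]
            gy = frame[y + 1][x] - frame[y - 1][x]
            magnitude = math.hypot(gx, gy)
            orientation = math.atan2(gy, gx)
            bi = y * m_blocks // height
            bj = x * n_blocks // width
            weighted[bi][bj] += magnitude * orientation
            total[bi][bj] += magnitude
    # Centroid per block: sum(r * theta) / sum(r), or 0 for a flat block.
    return [
        weighted[i][j] / total[i][j] if total[i][j] > 0 else 0.0
        for i in range(m_blocks)
        for j in range(n_blocks)
    ]


# Luminance grows left to right, so every gradient points along +x (angle 0).
horizontal_ramp = [[10 * x for x in range(8)] for _ in range(8)]
signature = cgo_signature(horizontal_ramp, 2, 2)
```

For the horizontal ramp every centroid is 0; for a vertical ramp every centroid would be pi/2, since all gradients then point along +y.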


Ordinal feature

Spatial ordinal

Temporal ordinal

For each frame: $S(t) = (r_1, r_2, \ldots, r_N)$

For each video clip: a 72-bin histogram
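A minimal sketch of the spatial ordinal signature, assuming the common construction in which the frame is split into N blocks and $r_i$ is the rank of block i's mean intensity. The block grid and the function name `ordinal_signature` are illustrative assumptions.

```python
def ordinal_signature(frame, rows=3, cols=3):
    """Rank the mean intensities of rows * cols blocks; rank 1 is the darkest."""
    height, width = len(frame), len(frame[0])
    sums = [0.0] * (rows * cols)
    counts = [0] * (rows * cols)
    for y in range(height):
        for x in range(width):
            block = (y * rows // height) * cols + (x * cols // width)
            sums[block] += frame[y][x]
            counts[block] += 1
    means = [s / c for s, c in zip(sums, counts)]
    # Sort block indices by mean intensity, then record each block's rank.
    order = sorted(range(len(means)), key=lambda i: means[i])
    ranks = [0] * len(means)
    for rank, block in enumerate(order, start=1):
        ranks[block] = rank
    return ranks


# Hypothetical 4 x 4 frame, split into 2 x 2 blocks.
frame = [
    [10, 10, 50, 50],
    [10, 10, 50, 50],
    [90, 90, 30, 30],
    [90, 90, 30, 30],
]
# A global brightness and contrast change applied to every pixel.
brighter = [[2 * v + 20 for v in row] for row in frame]
original_ranks = ordinal_signature(frame, rows=2, cols=2)
brighter_ranks = ordinal_signature(brighter, rows=2, cols=2)
```

Both calls give the ranks `[1, 3, 4, 2]`: a monotonic global change alters the actual values but not their order, which is why ordinal features tolerate global color changes.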


Compact Fourier-Mellin transform

Original formula:

AFMT:

RST-invariant property:

$g(x, y) = f\big(\alpha(x\cos\beta + y\sin\beta),\ \alpha(-x\sin\beta + y\cos\beta)\big)$

Fast AFMT approximation:


PCA (principal component analysis)

Each frame: a d-dimensional vector


Comparison of global descriptors

Recall the problems we want to solve (see 2. Challenge)

General problem: post-production processing

| Technique | Advantage | Disadvantage |
|---|---|---|
| Shot boundary | Compact, only frame indices | Algorithm-dependent; only suitable for whole movies |
| Color | Easy to implement | High noise sensitivity (inter-video, not intra-video) |
| Spatial/temporal change | More information included | Computational complexity |
| Ordinal feature | Immune to global color change | Sensitive to post-processing |
| Transform-based | RST-invariant | Computational complexity, high feature dimension |


4.3 Local features

Additional work: points-of-interest extraction (e.g., Harris points, SIFT)

Methods:
- A. Joly's method
- Space-time interest points
- Video Copy Tracking (ViCopT)
- Scale-invariant feature transform (SIFT)


A. Joly Method

- Harris point detector
- Key-frame based
- The signature for each point is a 20-dimensional vector in $[0, 255]^D$, $D = 20$, computed at four nearby locations


ViCopT

Tries to find the trajectories of points of interest

Signal description

Trajectory description

Labeling: Background/Motion


Labeling:
- Label Background: motionless points that persist across frames
- Label Motion: moving points that persist across frames

Figure: points labeled Background and Motion

Asymmetric technique:
- Off-line: trajectories
- On-line: points of interest in key frames

Later steps:
- Similarity search
- Spatio-temporal registration
- Combination of labels


ViCopT framework


SIFT – Scale invariant feature transform

- Uses the SIFT point detector (128-dimensional vector per point)
- High computational complexity due to pair-wise comparison

Steps: point detection, orientation assignment, descriptor computation


Each detected point comes with a location, scale, and orientation


Descriptor: 16 regions, each with 8 orientation bins, giving a 128-dimensional vector in total


To simplify the matching step, each point's descriptor is quantized with a codebook, and the signature for each frame is a d-bin histogram

Each descriptor is quantized into a codeword based on the trained codebook

The codebook is trained with K-means clustering
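The quantization step can be sketched as follows, assuming the codebook has already been trained (for example with K-means). The 2-dimensional descriptors and the three codewords are hypothetical stand-ins for 128-dimensional SIFT descriptors and a d-word codebook.

```python
def squared_l2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_codeword(descriptor, codebook):
    """Index of the codeword closest to the descriptor."""
    return min(range(len(codebook)), key=lambda k: squared_l2(descriptor, codebook[k]))


def frame_signature(descriptors, codebook):
    """d-bin histogram counting how many descriptors fall on each codeword."""
    hist = [0] * len(codebook)
    for descriptor in descriptors:
        hist[nearest_codeword(descriptor, codebook)] += 1
    return hist


# Hypothetical trained codebook with d = 3 codewords.
codebook = [[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]]
# Hypothetical descriptors of the interest points found in one frame.
descriptors = [[1.0, 1.0], [9.0, 1.0], [8.0, -1.0], [0.5, 9.0]]
signature = frame_signature(descriptors, codebook)
```

The frame is now described by one fixed-length histogram (`[1, 2, 1]` here), so two frames can be compared with a single vector distance instead of pair-wise comparison of all their points.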


4.4 Matching idea

Matching type:
- Sequential matching
- Vector matching
- Local point matching: intersection idea

Matching method:
- L1, L2 distance
- Pattern recognition or machine learning ideas
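A minimal sketch of L1 and L2 distances and of one simple form of sequential matching: the query's sequence of frame signatures is slid along a reference sequence and the alignment with the smallest mean distance is kept. The sliding-window formulation and the function names are assumptions made for illustration.

```python
import math


def l1(a, b):
    """L1 (city-block) distance between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def l2(a, b):
    """L2 (Euclidean) distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def best_alignment(query, reference, distance):
    """Return (offset, mean distance) of the best alignment, or None if the query is longer."""
    best = None
    for offset in range(len(reference) - len(query) + 1):
        mean = sum(
            distance(q, reference[offset + i]) for i, q in enumerate(query)
        ) / len(query)
        if best is None or mean < best[1]:
            best = (offset, mean)
    return best


# Hypothetical per-frame signatures (2-dimensional for readability).
reference = [[0, 0], [1, 1], [5, 5], [6, 7], [9, 9]]
query = [[5, 5], [6, 6]]
offset, mean_distance = best_alignment(query, reference, l1)
```

The query aligns best at offset 2 with a mean L1 distance of 0.5; a threshold on that mean distance would then decide whether the alignment counts as a copy.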


4.5 Complete comparison

| Technique | Advantage | Disadvantage |
|---|---|---|
| Global descriptors (color, ordinal, …) | Simple computation, easy matching | Not robust to logo insertion, cropping, and post-production processing |
| Local descriptors (ViCopT, SIFT, …) | Robust to post-production processing, …; can be used for CBVR | Complex matching and decision (pair-wise, intersection, …) |


The more dimensions used (color, spatial, temporal), the more robust the fingerprint

Local descriptors are more robust than global descriptors, especially to logo and text insertion

Recording ordinal ranks is better than recording actual values

To deal with image rotation, it is better to record the orientation of an image using some algorithm


5. Performance Evaluation

Precision-recall curve (P-R curve)

F1 score

$\text{Recall} = N_{\text{TruePositive}}(pa) / N_{\text{AllTrue}}$

$\text{Precision} = N_{\text{TruePositive}}(pa) / N_{\text{AllPositive}}(pa)$

$F = 2 \times P \times R / (P + R)$
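A minimal sketch of how these measures could be computed and swept into a P-R curve. Reading the parameter pa as a decision threshold on the matching distance is an assumption, as are the function names and the toy data.

```python
def precision_recall_f1(detected, true_copies):
    """Precision, recall and F score for a set of detections."""
    detected, true_copies = set(detected), set(true_copies)
    true_positive = len(detected & true_copies)
    precision = true_positive / len(detected) if detected else 0.0
    recall = true_positive / len(true_copies) if true_copies else 0.0
    total = precision + recall
    f_score = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_score


def pr_curve(distances, true_copies, thresholds):
    """One (precision, recall, F) triple per threshold.

    A video is declared a copy when its distance is at most the threshold.
    """
    curve = []
    for threshold in thresholds:
        detected = [vid for vid, d in distances.items() if d <= threshold]
        curve.append(precision_recall_f1(detected, true_copies))
    return curve


# Hypothetical matching distances for four candidate videos.
distances = {"v1": 0.1, "v2": 0.4, "v3": 0.5, "v4": 0.9}
true_copies = {"v1", "v3"}
curve = pr_curve(distances, true_copies, thresholds=[0.2, 0.6, 1.0])
```

In this toy case the middle threshold gives the best F score (precision 2/3, recall 1, F = 0.8): a tighter threshold misses a true copy, and a looser one admits more false positives.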


6. Our Works

- Database: 2600 scenes, 100 of which are used for duplication. 63 duplicated versions are created from each of these (frame drop, blurring, AWGN, JPEG compression, gamma correction), giving 6300 + 2600 = 8900 videos in total
- Shot detection: software
- Key-frame detection: spatial-temporal color distribution
- Feature extraction: YCbCr histogram, ordinal, CGO, CFMT, and SIFT
- Matching & detection: L1, L2 distance
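To make the database construction concrete, the sketch below reproduces the video count and shows two of the listed distortions in their simplest form. The gamma value, the frame-drop pattern, and the function names are assumptions for illustration; the slides do not give the actual parameters used.

```python
def gamma_correct(frame, gamma):
    """Apply out = 255 * (in / 255) ** gamma to every 8-bit pixel."""
    return [[round(255 * (v / 255) ** gamma) for v in row] for row in frame]


def drop_frames(frames, keep_every):
    """Simulate frame drop by keeping one frame out of every `keep_every`."""
    return frames[::keep_every]


# Database size stated on the slide: 2600 originals plus 63 versions of 100 scenes.
originals, duplicated_scenes, versions_each = 2600, 100, 63
total_videos = originals + duplicated_scenes * versions_each

# Hypothetical one-row frame and a ten-frame clip identified by frame index.
brightened = gamma_correct([[0, 64, 255]], gamma=0.5)
shortened = drop_frames(list(range(10)), keep_every=3)
```

With gamma 0.5 the mid-tone 64 is lifted to 128 while 0 and 255 stay fixed, which is the kind of global luminance change the fingerprints are expected to tolerate.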


Results of our work


7. Conclusion & Future works

Is there any practical case of CBCD?
- The YouTube website

The importance of feature selection, compared with CBIR (content-based image retrieval)

Another comparison:

| Technique | Characteristics |
|---|---|
| Watermarking | Vulnerable to malicious attacks, for example inserting fake patterns; requires some knowledge of cryptography and coding |
| Video fingerprinting | Several methods can be applied to the same video; can draw on many image and video processing ideas |


Future works

- New features
- Higher speed
- Putting the techniques into practice
- Dealing with more complicated cases

Nevertheless, there are practical concerns!


8. Acknowledgement

I'd like to thank my advisor, Prof. Pierre Moulin, and group members Julien Dubois and Ryan Rogowski at UIUC!


9. References

[1] A. Hampapur, K. Hyun, and R. M. Bolle. Comparison of sequence matching techniques for video copy detection. Volume 4676, pages 194-201. SPIE, 2001.

[2] J. Law-To, O. Buisson, V. Gouet-Brunet, and N. Boujemaa. Video copy detection on the Internet: the challenges of copyright and multiplicity. In ICME '07: Proceedings of the IEEE International Conference on Multimedia and Expo, pages 2082-2085, 2007.

[3] S. Lee and C. D. Yoo. Video fingerprinting based on centroids of gradient orientations. In ICASSP '06: Proceedings of the 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 401-404, Washington, DC, USA, 2006. IEEE Computer Society.

[4] J. Law-To, L. Chen, A. Joly, I. Laptev, O. Buisson, V. Gouet-Brunet, N. Boujemaa, and F. Stentiford. Video copy detection: a comparative study. In CIVR '07: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pages 371-378, New York, NY, USA, 2007. ACM.

[5] A. Sarkar, P. Ghosh, E. Moxley, and B. S. Manjunath. Video fingerprinting: features for duplicate and similar video detection and query-based video retrieval. In SPIE Multimedia Content Access: Algorithms and Systems II, Jan. 2008.