MPEG-7 Report
8/7/2019 MPEG-7 Report
Seminar Report
MPEG-7
ABSTRACT
MPEG-7 is a multimedia content description standard. The description is associated
with the content itself, to allow fast and efficient searching for material that is of interest to
the user. MPEG-7 is formally called the Multimedia Content Description Interface. Unlike MPEG-1,
MPEG-2 and MPEG-4, it is therefore not a standard that deals with the actual encoding of moving
pictures and audio. It uses XML to store metadata, and descriptions can be attached to timecode
in order to tag particular events or to synchronize lyrics to a song.
The objectives of MPEG-7 are to provide fast and efficient searching, filtering and content
identification; to describe the main aspects of the content (low-level characteristics,
structure, models, collections, etc.); to index a wide range of applications; to cover
audiovisual information such as audio, voice, video, images, graphs and 3D models; to describe
how objects are combined in a scene; and to keep the description independent of the
information itself.
Kunal Kishore
School of Engineering (it),CUSAT
ACKNOWLEDGEMENT
We consider it a great privilege to express our heartfelt gratitude to the many respected
personalities who guided, inspired and helped us in the successful completion of this seminar and
presentation. We express our gratitude to DR. SANTOSH KUMAR M.B., Senior Lecturer of
the department, for providing us with adequate facilities, ways and means by which we were
able to complete this presentation. We express our sincere gratitude for his constant support
and valuable suggestions, without which the successful completion of this presentation would
not have been possible. We express our immense pleasure and thankfulness to all the teachers
and staff of the Department of Information Technology, CUSAT for their cooperation and
support. Last but not least, we thank all others, and especially our classmates and our
family members, who in one way or another helped us in the successful completion of this
work.
Kunal Kishore
INDEX
1. Introduction
2. What Are the MPEG Standards?
3. What is MPEG-7?
4. Technical Overview
4.i. Structure
4.ii. Descriptors
4.iii. Description Schemes
4.iv. DDL
4.v. MPEG-7 Parts
5. Application Domains
6. Limitations
7. Conclusion
8. References
1. INTRODUCTION
How many times have you seen science fiction movies such as 2001: A Space Odyssey and
thought, "Wow, we're so far away from having any of the fancy gadgets depicted in these
movies!" In 2001, Hal, the talking computer, intelligently navigates and retrieves information
or runs complex operations instigated by spoken input. Or how about using an image-based
query, say an image of the motorbike used by Arnold Schwarzenegger in the movie T2, to
find images of similar-looking motorbikes? Dreams or reality?
MPEG-7 is a multimedia content description standard that addresses how humans expect to
interact with computer systems, since it develops rich descriptions that reflect those
expectations.
Accessing audio and video used to be a simple matter - simple because of the
simplicity of the access mechanisms and because of the poverty of the sources. Today an
immeasurable amount of audiovisual information is becoming available in digital form, in
digital archives, on the World Wide Web, in broadcast data streams and in personal and
professional databases, and this amount is only growing. The value of information often
depends on how easily it can be found, retrieved, accessed, filtered and managed.
Broadband is being offered with increasing audio and video quality and speed of access. The
trend is clear: in the next few years, users will be confronted with such a large amount of
content provided by multiple sources that efficient and accurate access to this almost infinite
amount of content seems unimaginable today. In spite of the fact that users have increasing
access to these resources, identifying and managing them efficiently is becoming more
difficult because of the sheer volume. The question of identifying and managing content is
not restricted to database retrieval applications such as digital libraries, but extends to
areas like broadcast channel selection, multimedia editing, and multimedia directory services.
This challenging situation demands a timely solution to the problem. MPEG-7 is the answer
to this need.
2. What Are the MPEG Standards?
The Moving Picture Experts Group (MPEG) is a working group of the Geneva-based
ISO/IEC standards organizations (International Organization for Standardization/International
Electrotechnical Commission) in charge of the development of international standards for
compression, decompression, processing, and coded representation of moving pictures, audio,
and a combination of the two. MPEG-7, then, is an ISO/IEC standard being developed by
MPEG, the committee that also developed the Emmy Award-winning standards known as
MPEG-1 and MPEG-2, and the 1999 MPEG-4 standard.
MPEG-1: For the storage and retrieval of moving pictures and audio on storage media.
MPEG-2: For digital television; it is the timely response to the satellite broadcasting
and cable television industries in their transition from analog to digital formats.
MPEG-4: Codes content as objects and enables those objects to be manipulated
individually or collectively in an audiovisual scene.
MPEG-1, -2, and -4 make content available. MPEG-7 lets you find the content you
need.
Besides these standards, MPEG is currently also working on MPEG-21, a Technical
Report about a Multimedia Framework.
The MPEG-1 and MPEG-2 standards have enabled the production of widely adopted
commercial products, such as Video CD, MP3, digital audio broadcasting (DAB), DVD,
digital television (DVB and ATSC), and many video-on-demand trials and commercial
services. MPEG-4 is the first real multimedia representation standard, allowing interactivity
and a combination of natural and synthetic material coded in the form of objects. MPEG-4
provides the standardized technological elements enabling the integration of the production,
distribution and content access paradigms of the fields of interactive multimedia, mobile
multimedia, interactive graphics and enhanced digital television.
3. What is MPEG-7?
The MPEG-7 standard, formally named "Multimedia Content Description Interface",
provides a rich set of standardized tools to describe multimedia content. It is the standard that
describes multimedia content so users can search, browse, and retrieve that content more
efficiently and effectively than they could using today's mainly text-based search engines.
Both human users and automatic systems that process audiovisual information are within the
scope of MPEG-7. It is a standard for describing the features of multimedia content.
MPEG-7 is an ISO/IEC standard developed by MPEG (Moving Picture Experts Group), the
committee that also developed the successful standards known as MPEG-1 (1992) and
MPEG-2 (1994), and the MPEG-4 standard (Version 1 in 1998, and Version 2 in 1999), as
mentioned above.
MPEG-7 offers a comprehensive set of audiovisual Description Tools to create descriptions
(i.e., a set of instantiated Description Schemes and their corresponding Descriptors at the
user's will), which will form the basis for applications enabling the needed effective and
efficient access (search, filtering and browsing) to multimedia content. This is a challenging
task given the broad spectrum of requirements and targeted multimedia applications, and the
large number of audiovisual features of importance in this context.
MPEG-7 does not standardize or evaluate applications, although applications have been used
during its development for understanding the requirements and evaluating technology.
It must be made clear that the requirements are derived from analyzing a wide range of
potential applications that could use MPEG-7 tools. MPEG-7 is not aimed at any one
application in particular; rather, the elements that MPEG-7 standardizes support as broad a
range of applications as possible.
MPEG-7 uniquely provides comprehensive standardised multimedia description tools for
content. Descriptions for the catalogue level (e.g. title), the semantic level (who, what, when,
where) and the structural level (spatio-temporal region, color histogram, timbre, texture) will
provide tools for creative developers to generate new waves of multimedia applications.
Standardized MPEG-7 description tools, then, are a key enabler of the following application
domains:
Search Engines, Digital Libraries, Broadcast Networks, Entertainment and News
Distributors, Streaming Businesses.
Dynamic start-up companies, searching for cutting edge technologies.
Governmental, Educational, Law, Medical & Remedial Services, and Non-profit
organizations looking for digital media solutions. For example, the U.S. Library of
Congress receives over 10,000 multimedia items each week, and is committed to a)
the long term preservation of these multimedia items in digital format, and b) making
much of their collection accessible to U.S. citizens in digital format.
XML, Metadata, Modeling/Simulation, & Surveillance Industries
AI Practitioners, Content Creators and Providers.
4. TECHNICAL OVERVIEW OF MPEG-7
4.i. STRUCTURE
Before explaining the structure of the MPEG-7 standard, we first mention what the MPEG-7
standard includes: Descriptors, Description Schemes, a Description Definition Language,
and system tools.
Data is multimedia information that will be described using MPEG-7, regardless of
storage, coding, display, transmission medium or technology. Furthermore, a feature is a
distinctive characteristic of data that signifies something to somebody. Based on these
definitions, these are the four major MPEG-7 building blocks:
Descriptor: A representation of a feature. A Descriptor defines the syntax and
semantics of the feature representation.
Description Scheme: The structure and semantics of the relationships between its
components, which may be both Descriptors and Description Schemes.
Description Definition Language (DDL): A language that allows the creation of
new Description Schemes and possibly Descriptors. It also allows the extension and
modification of existing Description Schemes. MPEG-7 adopted XML Schema
Language as the MPEG-7 DDL. However, the DDL requires some specific extensions
to XML Schema Language to satisfy all the requirements of MPEG-7. These
extensions are currently being discussed through liaison activities between MPEG and
W3C, the group standardizing XML.
System Tools: Tools to support multiplexing of descriptions, synchronization of
descriptions with content, delivery mechanisms, coded representation for efficient
storage and transmission, and the management and protection of intellectual property
in MPEG-7 descriptions.
The Scope of MPEG-7
4.ii. DESCRIPTORS
A Descriptor (D) defines the syntax and the semantics of one representation of a particular
feature of audiovisual content. A feature is a distinctive characteristic of the data which is of
significance to a user.
For example, the color of an image is a feature. Possible Descriptors corresponding to
the color feature are: color histogram, RGB vector or a string. A Descriptor value is an
instantiation of a Descriptor for a given data set. For example, RGB = (255, 255, 255),
colorstring = "red".
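The distinction between a Descriptor and a Descriptor value can be sketched in a few lines of Python. This is only an illustration of the definitions above, not MPEG-7 syntax: the class names `Descriptor` and `DescriptorValue` and the validity rules are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Descriptor:
    """One representation of a feature: a name plus a rule for valid values."""
    name: str
    feature: str
    is_valid: Callable[[Any], bool]

    def instantiate(self, value: Any) -> "DescriptorValue":
        # A Descriptor value is an instantiation of the Descriptor for given data.
        if not self.is_valid(value):
            raise ValueError(f"{value!r} is not a valid {self.name} value")
        return DescriptorValue(self, value)


@dataclass(frozen=True)
class DescriptorValue:
    descriptor: Descriptor
    value: Any


def _is_rgb(v: Any) -> bool:
    # Hypothetical rule: a triple of integers in the range 0..255.
    return (isinstance(v, tuple) and len(v) == 3
            and all(isinstance(c, int) and 0 <= c <= 255 for c in v))


# Two different Descriptors for the same feature (color).
RGB_VECTOR = Descriptor("RGB", "color", _is_rgb)
COLOR_STRING = Descriptor("colorstring", "color",
                          lambda v: isinstance(v, str) and v != "")

white = RGB_VECTOR.instantiate((255, 255, 255))
red = COLOR_STRING.instantiate("red")
```

Both `white` and `red` describe the same feature, but through different Descriptors, which is the point of the RGB and color-string examples in the text.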
Table 2 lists some of the current descriptors which have been incorporated into the
eXperimentation Model (XM) or are undergoing core experiments (CEs). They have been
subdivided into Visual and Audio descriptors.
Type    Feature            Description
Visual  Basic Structure    Grid layout; Time series; 2D-3D multiple view; Spatial 2D coordinates; Temporal interpolation
Visual  Color              Color space; Dominant color; Color quantization; Scalable color; Color layout; Color-structure; GoF/GoP color
Visual  Texture            Spatial image intensity distribution; Homogeneous texture
Visual  Shape              Object bounding box; Region-based shape; Contour-based shape; 3D shape descriptor
Visual  Motion             Camera motion; Object motion trajectory; Parametric object motion; Motion activity; Motion trajectory features
Audio   Speech Annotation  Lattice of words and phonemes plus metadata
Audio   Timbre             Ratio of even to odd harmonics; Harmonic attack coherence
Audio   Melody             Melodic contour and rhythm
Table 2. Overview of Current Descriptors
4.ii.(a) MPEG-7 Visual
MPEG-7 Visual Description Tools included in the standard consist of basic structures and
Descriptors that cover the following basic visual features: Color, Texture, Shape, Motion,
Localization, and Face recognition. Each category consists of elementary and sophisticated
Descriptors.
Basic Structure
o Grid Layout: The grid layout splits the image into a set of equally sized
rectangular regions, so that each region can be described separately. Each
region of the grid can be described in terms of other Descriptors such as
color or texture. Furthermore, the descriptor allows subDescriptors to be
assigned to all rectangular areas, or to an arbitrary subset of rectangular
regions.
o Time Series: This descriptor defines a temporal series of Descriptors in a video
segment and provides image-to-video-frame matching and video-frames-to-video-frames
matching functionalities. Two types of TimeSeries are available:
RegularTimeSeries and IrregularTimeSeries.
o 2D-3D Multiple View: The 2D/3D Descriptor specifies a structure which
combines 2D Descriptors representing a visual feature of a 3D object seen from
different view angles. The descriptor forms a complete 3D view-based
representation of the object.
o Spatial 2D Coordinates: This descriptor defines a 2D spatial coordinate system
and a unit to be used by reference in other Ds/DSs when relevant. The coordinate
system is defined by a mapping between an image and the coordinate system. One
of the advantages of using this descriptor is that MPEG-7 descriptions need not be
modified even if the image size is changed or a part of the image is clipped. In this
case, only the description of the mapping from the original image to the edited
image is required.
o Temporal Interpolation: The TemporalInterpolation D describes a temporal
interpolation using connected polynomials. This can be used to
approximate multi-dimensional variable values that change with time,
such as an object position in a video. The description size of the temporal
interpolation is usually much smaller than that of describing all values.
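The idea behind temporal interpolation can be illustrated with a minimal Python sketch. It assumes the simplest case of connected polynomials, namely first-order (linear) segments between key points; the function name `interpolate` and the sample trajectory are hypothetical and not taken from the standard.

```python
from bisect import bisect_right
from typing import Sequence, Tuple

Point = Tuple[float, ...]


def interpolate(key_times: Sequence[float],
                key_values: Sequence[Point],
                t: float) -> Point:
    """Evaluate connected first-order polynomials (linear segments) at time t."""
    if len(key_times) != len(key_values) or len(key_times) < 2:
        raise ValueError("need at least two key points")
    if not key_times[0] <= t <= key_times[-1]:
        raise ValueError("t lies outside the described interval")
    # Locate the segment [t0, t1] that contains t.
    i = min(bisect_right(key_times, t), len(key_times) - 1)
    t0, t1 = key_times[i - 1], key_times[i]
    w = (t - t0) / (t1 - t0)
    return tuple(a + w * (b - a)
                 for a, b in zip(key_values[i - 1], key_values[i]))


# Hypothetical object trajectory: three key points stand in for the
# object's (x, y) position at every frame between t = 0 and t = 20.
times = [0.0, 10.0, 20.0]
positions = [(0.0, 0.0), (100.0, 50.0), (100.0, 150.0)]

midway = interpolate(times, positions, 5.0)    # (50.0, 25.0)
later = interpolate(times, positions, 15.0)    # (100.0, 100.0)
```

Storing three key points instead of one position per frame is what makes the description compact; higher-order polynomials would follow the same pattern with more coefficients per segment.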
Color Descriptors: There are seven Color Descriptors: Color Space, Color Quantization,
Dominant Color, Scalable Color, Color Layout, Color-Structure, and GoF/GoP Color.
o Color Space: This descriptor specifies the color space that is to be used in other
color-based descriptions. In the current description, the following color spaces are
supported: (a) R,G,B (b) Y,Cr,Cb (c) H,S,V (d) HMMD (e) Linear transformation
matrix with reference to R, G, B (f) Monochrome
o Color Quantization: This descriptor defines a uniform quantization of a color
space. The number of bins which the quantizer produces is configurable, so that
great flexibility is provided for a wide range of applications. For a meaningful
application in the context of MPEG-7, this descriptor has to be combined with
dominant color descriptors, e.g. to express the meaning of the values of dominant
colors.
o Dominant Color: Color quantization is used to extract a small number of
representative colors in each region/image. The percentage of each quantized color
in the region is calculated correspondingly. A spatial coherency on the entire
descriptor is also defined, and is used in similarity retrieval.
o Scalable Color: The Scalable Color Descriptor is a color histogram in HSV
color space, which is encoded by a Haar transform. Its binary representation is
scalable in terms of bin numbers and bit representation accuracy over a broad
range of data rates. The Scalable Color Descriptor is useful for image-to-image
matching and retrieval based on the color feature. Retrieval accuracy increases with
the number of bits used in the representation.
o Color Layout: This descriptor effectively represents the spatial distribution of
color of visual signals in a very compact form. This compactness allows visual
signal matching functionality with high retrieval efficiency at very small
computational cost. It provides image-to-image matching as well as ultra
high-speed sequence-to-sequence matching, which requires many repetitions of
similarity calculations. It also provides a very friendly user interface using
hand-written sketch queries, since this descriptor captures the layout information
of the color feature.
o Color Structure Descriptor: The Color Structure descriptor is a color feature
descriptor that captures both color content (similar to a color histogram) and
information about the structure of this content. Its main functionality is
image-to-image matching and its intended use is for still-image retrieval, where an
image may consist of either a single rectangular frame or arbitrarily shaped, possibly
disconnected, regions.
o GoF/GoP Color: The Group of Frames/Group of Pictures color descriptor extends
the ScalableColor descriptor that is defined for a still image to the color description
of a video segment or a collection of still images. Two additional bits define
how the color histogram was calculated before the Haar transform is applied
to it: by average, median or intersection. The Average Histogram, which refers to
averaging the counter value of each bin across all frames or pictures, is equivalent
to computing the aggregate color histogram of all frames and pictures with proper
normalization. The Median Histogram refers to computing the median of the
counter value of each bin across all frames or pictures.
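The three aggregation modes of the GoF/GoP Color descriptor can be sketched in Python as follows. The sketch covers only the per-bin aggregation step and omits the Haar transform. The text does not spell out the intersection mode, so it is taken here as the per-bin minimum, which is an assumption; the function name and the toy histograms are also hypothetical.

```python
from statistics import median
from typing import List, Sequence


def aggregate_histograms(histograms: Sequence[Sequence[float]],
                         mode: str) -> List[float]:
    """Combine per-frame color histograms into one histogram, bin by bin."""
    if not histograms:
        raise ValueError("at least one histogram is required")
    n_bins = len(histograms[0])
    if any(len(h) != n_bins for h in histograms):
        raise ValueError("all histograms must have the same number of bins")
    # Each column holds the counter values of one bin across all frames.
    columns = list(zip(*histograms))
    if mode == "average":
        return [sum(c) / len(c) for c in columns]
    if mode == "median":
        return [median(c) for c in columns]
    if mode == "intersection":
        # Assumption: intersection keeps the smallest count seen in each bin.
        return [min(c) for c in columns]
    raise ValueError(f"unknown mode: {mode}")


# Three frames, three color bins each (toy numbers).
frames = [
    [4, 0, 2],
    [2, 2, 2],
    [0, 4, 8],
]

avg = aggregate_histograms(frames, "average")        # [2.0, 2.0, 4.0]
med = aggregate_histograms(frames, "median")         # [2, 2, 2]
common = aggregate_histograms(frames, "intersection")  # [0, 0, 2]
```

In this toy example the median ignores the outlying count of 8 in the last bin, while the average does not, which shows why the calculation mode is recorded alongside the histogram.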
Texture
o Spatial image intensity distribution
o Homogeneous texture
Shape
o Object bounding box
o Region-based shape
o Contour-based shape
o 3D shape descriptor
Motion
o Camera motion
o Object motion trajectory
o Parametric object motion
o Motion activity
o Motion trajectory features
4.ii.(b) MPEG-7 Audio
MPEG-7 Audio standardizes the description tools for describing audio
content. Most Audio description tools are based on audio features that let
us measure similarity in sounds (such as music and speech). Therefore,
we can use these MPEG-7 Audio descriptors and description schemes to
search and filter audio content based on several audio features like
spectrum, harmony, timbre, and melody. Other Audio description tools let
us describe spoken content and create a classification of sounds.
We can classify the MPEG-7 Audio description tools into generic and
high-level description tools. The generic Audio description tools include a group
of low-level descriptors for audio features, named the MPEG-7 Audio
Framework (see Figure A), that let us describe an audio signal's spectral,
parametric, and temporal features. The high-level group provides
description tools for sound recognition and indexing, spoken content, and
query-by-humming applications, among other things.
Speech Annotation
o Lattice of words and phonemes plus metadata
Timbre
o Ratio of even to odd harmonics
o Harmonic attack coherence
Melody
o Melodic contour and rhythm
4.iii. DESCRIPTION SCHEMES
A Description Scheme (DS) specifies the structure and semantics of the relationships between
its components, which may be both Descriptors and Description Schemes.
The following concepts are used within the DS group to describe audiovisual content:
Syntactic structure - the physical and logical structure of audiovisual content, e.g.,
structures based on temporal segments and/or spatial regions.
Semantic structure - breakdown based on semantic meaning, e.g., structures based on
temporal events and/or spatial objects.
Syntactic-semantic links - the associations between syntactic elements and semantic
elements.
The Generic Audiovisual DS represents the integration of all of the DS proposals and
submissions within a single DS. At the top level it consists of:
A collection of Syntactic structure DSs, i.e., physical features such as segments,
regions, color, texture, and motion are described here;
A collection of Semantic structure DSs, i.e., semantic features such as objects, actors
or events, e.g., "goal", "advertisement", "Madonna";
Syntactic-semantic links DSs - which relate the syntactic elements to the semantic
elements;
Summary DS - this is used to enable browsing at different levels of granularity;
MetaInfo DS - this contains descriptors carrying author or publisher-generated
information, e.g., ContentDS, CreditsDS, CreationPurposeDS, RightsDS,
PublicationDS;
MediaInfo DS - this contains descriptors related to the storage media, e.g., file format,
system, medium, colour, sound, length, duration, compression format;
Model DS - this provides a way to describe the classification methods for audiovisual
data or the correspondence between the current audiovisual content and other content
through different models.
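The recursive definition of a Description Scheme (its components may be Descriptors or further Description Schemes) maps naturally onto a tree. The following Python sketch shows that structure only; the class names `D` and `DS` and the sample component names are hypothetical, chosen to mirror the list above.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Union


@dataclass
class D:
    """A Descriptor instance: a named feature value."""
    name: str
    value: object


@dataclass
class DS:
    """A Description Scheme: a named group of Descriptors and nested schemes."""
    name: str
    components: List[Union["D", "DS"]] = field(default_factory=list)

    def descriptors(self) -> Iterator[D]:
        """Yield every Descriptor reachable from this scheme, depth first."""
        for component in self.components:
            if isinstance(component, DS):
                yield from component.descriptors()
            else:
                yield component


# A toy description with a syntactic branch and a media-information branch.
generic_av = DS("GenericAudiovisual", [
    DS("SyntacticStructure", [
        DS("Segment", [D("DominantColor", (255, 0, 0))]),
    ]),
    DS("MediaInfo", [D("FileFormat", "GIF")]),
])

names = [d.name for d in generic_av.descriptors()]
```

Walking the tree with `descriptors()` returns the leaf values regardless of how deeply the schemes are nested, which is how an application would gather the features it wants to index.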
The figure below illustrates the structure and content of the Generic Audiovisual DS.
Figure: The Generic Audiovisual Description Scheme
4.iv. DESCRIPTION DEFINITION LANGUAGE
The Description Definition Language (DDL) is the language that allows the creation of new
Description Schemes and Descriptors. It also allows the extension and modification of
existing Description Schemes. The DDL specifies the constraints that a valid MPEG-7
description should respect.
The DDL has to be able to express spatial, temporal, structural, and conceptual relationships
between the elements of a DS, and between DSs. It must provide a rich model for links and
references between one or more descriptions and the data that they describe. It also has to be
capable of validating descriptor data types, both primitive (integer, text, date, time) and
composite (histograms, enumerated types). In addition, it must be platform and application
independent and human- and machine-readable.
MPEG-7 descriptions can take two forms, textual (XML) and binary:
4.iv.(a) XML: The general consensus within MPEG-7 is that the DDL should be based on XML
syntax. XML defines the structure of a document and embeds it directly within the document
through the use of markup. XML is similar to HTML, but with user-defined tags.
An XML schema can be seen as an extended DTD, because it goes beyond the current
functionalities of DTDs and allows data type definitions. The MPEG-7 DDL uses the W3C XML
Schema language as a base language and adds MPEG-7-specific mechanisms, such as array and
matrix data types, on top of it.
A DDL parser is used to check the validity of a description against these rules. The parser is
first initialised with the schema; it then parses a description and reports any errors.
XML schema parsers that can be used to validate description schemes include XSV, XML
Spy and Xerces.
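The two-step behaviour described above (initialise the parser with the rules, then parse a description and report errors) can be mimicked with a small Python sketch. This is not an XML Schema validator and does not use the MPEG-7 schema: the `DescriptionChecker` class, the rule table and all element names are hypothetical stand-ins for what a real validator such as Xerces would do.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Dict, List, Tuple


def parse_rgb(text: str) -> Tuple[int, ...]:
    """Composite type: three integers in 0..255 separated by whitespace."""
    parts = [int(p) for p in text.split()]
    if len(parts) != 3 or any(not 0 <= p <= 255 for p in parts):
        raise ValueError("expected three values in 0..255")
    return tuple(parts)


class DescriptionChecker:
    def __init__(self, rules: Dict[str, Callable[[str], object]]):
        # Step 1: initialise the checker with the "schema":
        # a mapping from element name to a type converter.
        self.rules = rules

    def check(self, xml_text: str) -> List[str]:
        # Step 2: parse a description and collect every violation.
        try:
            root = ET.fromstring(xml_text)
        except ET.ParseError as exc:
            return [f"not well-formed: {exc}"]
        errors = []
        for elem in root.iter():
            convert = self.rules.get(elem.tag)
            if convert is None:
                errors.append(f"unknown element <{elem.tag}>")
            elif len(elem) == 0:
                # Leaf element: its text must match the declared data type.
                try:
                    convert((elem.text or "").strip())
                except ValueError:
                    errors.append(f"bad value in <{elem.tag}>: {elem.text!r}")
        return errors


# Hypothetical rule set with primitive (str, int) and composite (RGB) types.
checker = DescriptionChecker({
    "Description": str,
    "Title": str,
    "DurationSeconds": int,
    "DominantColor": parse_rgb,
})

valid = ("<Description><Title>Logo</Title>"
         "<DurationSeconds>12</DurationSeconds>"
         "<DominantColor>255 0 0</DominantColor></Description>")
invalid = ("<Description><Title>Logo</Title>"
           "<DurationSeconds>twelve</DurationSeconds>"
           "<Author>MPEG</Author></Description>")

valid_errors = checker.check(valid)
invalid_errors = checker.check(invalid)
```

A real DDL parser derives such rules from the schema document itself rather than from a hand-written table, but the control flow is the same.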
4.iv.(b) BINARY FORMAT FOR MPEG-7
MPEG-7 textual descriptions (XML files) tend to become large and thus inefficient. That is why
MPEG-7 defines a binary format, BiM, that enables the streaming and compression of
descriptions. In fact, BiM can be used to compress any XML document. The document
structure is highly compressed, by 98% on average.
Example: MPEG-7 Description (in textual format) of the MPEG logo.
http://www.tilab.org/mpeg/mpeg_logo-anim_l.gif
The animated MPEG Logo
Author
MPEG
http://www.tilab.com/mpeg/
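To make the size argument concrete, the following Python sketch wraps the field values of the example in XML markup and measures how much of the resulting text is markup rather than content. The element names (`Mpeg7`, `CreationInformation`, `Title`, `Creator`, `Role`, `Name`, `MediaUri`) and the nesting are assumptions made for illustration; they are not checked against the MPEG-7 schema and do not reproduce the original listing.

```python
import xml.etree.ElementTree as ET


def build_description(title: str, role: str, name: str, media_uri: str) -> str:
    """Build a small textual description; element names are illustrative only."""
    root = ET.Element("Mpeg7")
    creation = ET.SubElement(root, "CreationInformation")
    ET.SubElement(creation, "Title").text = title
    creator = ET.SubElement(creation, "Creator")
    ET.SubElement(creator, "Role").text = role
    ET.SubElement(creator, "Name").text = name
    ET.SubElement(root, "MediaUri").text = media_uri
    return ET.tostring(root, encoding="unicode")


def payload_chars(xml_text: str) -> int:
    """Count the characters that carry content rather than markup."""
    root = ET.fromstring(xml_text)
    return sum(len(e.text) for e in root.iter() if e.text)


xml_text = build_description(
    "The animated MPEG Logo",
    "Author",
    "MPEG",
    "http://www.tilab.org/mpeg/mpeg_logo-anim_l.gif",
)

total = len(xml_text)
payload = payload_chars(xml_text)
markup = total - payload  # characters spent on tags alone
```

Even in this tiny example a large share of the text is tags, and that structural part is what a binary encoding such as BiM is designed to shrink.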
4.v. MPEG-7 PARTS
The MPEG-7 standard consists of several parts. This lets us use the various clusters of
technology alone, according to MPEG's toolbox approach to standardization. It also keeps
the editing of the standard manageable. Here are the MPEG-7 standard parts:
MPEG-7 Systems - the tools needed to prepare MPEG-7 descriptions for efficient
transport and storage (through binarization), the terminal architecture, and the means
to allow synchronization between content and descriptions.
MPEG-7 Description Definition Language - the language for defining the syntax of
the MPEG-7 Description Tools and for defining new Description Schemes.
MPEG-7 Visual - the Description Tools dealing with (only) Visual descriptions.
MPEG-7 Audio - the Description Tools dealing with (only) Audio descriptions.
MPEG-7 Multimedia Description Schemes - the Description Tools dealing with
generic features and multimedia descriptions.
MPEG-7 Reference Software - a software implementation of relevant parts of the
MPEG-7 Standard with normative status.
MPEG-7 Conformance Testing - guidelines and procedures for testing conformance
of MPEG-7 implementations.
MPEG-7 Extraction and Use of Descriptions - informative material (in the form of a
Technical Report) about the extraction and use of some of the Description Tools.
MPEG-7 Profiles and Levels - provides guidelines and standard profiles.
MPEG-7 Schema Definition - specifies the schema using the Description Definition
Language.
5. MPEG-7 Application Domains
The elements that MPEG-7 standardizes will support a broad range of applications (for
example, multimedia digital libraries, broadcast media selection, multimedia editing, home
entertainment devices, etc.). MPEG-7 will also make the web as searchable for multimedia
content as it is searchable for text today. This would apply especially to large content
archives, which are being made accessible to the public, as well as to multimedia catalogues
enabling people to identify content for purchase. The information used for content retrieval
may also be used by agents, for the selection and filtering of broadcast "push" material or
for personalized advertising. Additionally, MPEG-7 descriptions will allow fast and
cost-effective usage of the underlying data, by enabling semi-automatic multimedia presentation
and editing. All domains making use of multimedia will benefit from MPEG-7, including:
Digital libraries and education (image catalogues, musical dictionaries, bio-medical
imaging catalogues),
Multimedia editing (personalised electronic news services, media authoring),
Cultural services (history museums, art galleries, etc.),
Multimedia directory services (e.g. yellow pages, tourist information,
geographical information systems),
Broadcast media selection (radio channel, TV channel, etc.),
Journalism (e.g. searching speeches of a certain politician using his name, his
voice or his face),
E-Commerce (personalised advertising, on-line catalogues, directories of
e-shops, etc.),
Surveillance (traffic control, surface transportation, non-destructive testing in
hostile environments, etc.),
Investigation services (human characteristics recognition, forensics),
Home Entertainment (systems for the management of personal multimedia
collections, including manipulation of content, e.g. home video editing,
searching a game, karaoke, etc.),
Social (e.g. dating services).
5.i. Typical applications enabled by MPEG-7 technology include:
Audio: Search for songs by humming or whistling a tune or, using an
excerpt of Pavarotti's voice, get a list of Pavarotti's records and video clips in which
Pavarotti sings or simply makes an appearance. Or, play a few notes on a keyboard
and retrieve a list of musical pieces similar to the required tune, or images matching
the notes in a certain way, e.g. in terms of emotions.
Graphics: Sketch a few lines on a screen and get a set of images containing similar
graphics, logos, and ideograms.
Image: Define objects, including color patches or textures, and get examples from
which you select items to compose your image. Or check if your company logo was
advertised on a TV channel as contracted.
Visual: Allow mobile phone access to video clips of goals scored in a soccer game,
or automatically search and retrieve any unusual movements from surveillance
videos.
Multimedia: On a given set of multimedia objects, describe movements and relations
between objects and so search for animations fulfilling the described temporal and
spatial relations. Or, describe actions and get a list of scenarios containing such
actions.
5.ii. Example of an MPEG-7 Application
The following application is an example of the type of problem that MPEG-7 can solve.
There are many more applications being developed around the world.
5.ii.a. Content Retrieval using Image as the Query
The figure shows possible ways to search for visual content using the inherent structural
features of an image. In this example, four image features are detailed. (1) The color
histogram feature of an image allows you to search for images that have the same colors. Note
that the position of the colors is not important; rather, the amount of similar color in the image is
important. (2) Spatial Color Distribution allows you to search for images where the location
of the same color is important. You can see that the added object in the right-bottom flag does
not affect this type of search. (3) With the Spatial Edge Distribution you can additionally
search for images that have a similar edge or contour profile. Note that color
does not make a difference to this type of search. (4) Finally, you can see an example of
searching by Object Shape. Here, the color and edge profiles are not important.
Figure: Searching by image features (1. Color histogram, 2. Spatial color distribution, 3. Spatial edge distribution, 4. Object shape)
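The difference between the first two search modes can be shown with a toy Python sketch. Images are reduced to small grids of color labels; the histogram comparison uses histogram intersection, and a plain position-by-position comparison stands in for the spatial color distribution. Both measures, the function names and the sample "flags" are simplifying assumptions and not the matching procedures defined for the MPEG-7 descriptors.

```python
from collections import Counter
from typing import Dict, Sequence

Image = Sequence[Sequence[str]]


def color_histogram(img: Image) -> Dict[str, float]:
    """Fraction of pixels per color; positions are deliberately discarded."""
    counts = Counter(pixel for row in img for pixel in row)
    total = sum(counts.values())
    return {color: n / total for color, n in counts.items()}


def histogram_similarity(a: Image, b: Image) -> float:
    """Histogram intersection: 1.0 means identical color proportions."""
    ha, hb = color_histogram(a), color_histogram(b)
    return sum(min(ha.get(c, 0.0), hb.get(c, 0.0)) for c in set(ha) | set(hb))


def spatial_similarity(a: Image, b: Image) -> float:
    """Fraction of positions where both images show the same color."""
    pairs = [(pa, pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb)]
    return sum(pa == pb for pa, pb in pairs) / len(pairs)


# Two toy flags with the same colors in swapped positions.
flag = [["red", "red"],
        ["white", "white"]]
flipped = [["white", "white"],
           ["red", "red"]]

same_colors = histogram_similarity(flag, flipped)  # 1.0
same_layout = spatial_similarity(flag, flipped)    # 0.0
```

A histogram query would return the flipped flag as a perfect match, while a spatial query would reject it, which is the distinction drawn between searches (1) and (2).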
5.ii.(b) It is the responsibility of the search engine and filter agent to match the query data to
the MPEG-7 description.
A few query examples are:
o Play a few notes on a keyboard and retrieve a list of musical pieces similar to the
required tune, or images matching the notes in a certain way, e.g. in terms of
emotions.
o Draw a few lines on a screen and find a set of images containing similar graphics,
logos, and ideograms.
o Define objects, including color patches or textures and retrieve examples among
which you select the interesting objects to compose your design.
o On a given set of multimedia objects, describe movements and relations between
objects and so search for animations fulfilling the described temporal and spatial
relations.
o Describe actions and get a list of scenarios containing such actions.
o Using an excerpt of Pavarotti's voice, obtain a list of Pavarotti's records, video
clips where Pavarotti is singing and photographic material portraying Pavarotti.
6. LIMITATIONS
MPEG-7 does not standardize the (automatic) extraction of AV descriptions/features.
It also does not specify the search engine (or any other program) that can make use of the
description. This is left to the creativity and innovation of search engine companies,
for example to manipulate and massage the MPEG-7-described content into search
indices that can be used by their browsing and retrieval tools.
7. CONCLUSION
MPEG-7 provides a seamless path towards increasingly intelligent content management
systems.
We live in the age of convergence, from the level of production through to distribution and
consumption. The technical hardware and communication infrastructure is evolving and will
soon reach the point where computing and communications will become embedded in
everyday objects and environments. Media will also then become ubiquitous. Ubiquitous
media will create a huge demand for new content, and meeting this demand must involve
fundamental changes to all stages of media production, management and delivery.
Media archives will become vast and interconnected pools of content, too large to be
managed manually. Customization of content within programs, e.g. substitution of structural
elements (characters, music, voices) according to viewer desires, and content scaling for PDAs
and cell phones, will be not only possible, but easy and pleasant. MPEG-7 will enable the
creation of tools (through its structured combination of low-level features and high-level
meta-data) for coping with this "outbreak" of generic content.
MPEG-7 is intended to describe audiovisual information regardless of storage, coding,
display, transmission, medium, or technology. It will address a wide variety of media types
including: still pictures, graphics, 3D models, audio, speech, video, and combinations of
these.
And it will address both retrieval from digital archives (pull applications) as well as filtering
of streamed audiovisual broadcasts on the Internet (push applications). It will operate in both
real-time and non real-time environments.
8. REFERENCES
MPEG-7 (Technical) Overview, W: Jeju, Korea, March 2002
MPEG-7 Awareness event (http://www.mpeg-industry.com)
http://www.mpeg.chiariglion.org (MPEG Home Page)
http://www.mpeg-alliance.com
http://www.w3.org/1999/05/06-xmlschema-1
http://www.dlib.org/dlib/september99/hunter/09hunter.html