MPEG-7 Report

download MPEG-7 Report

of 26

Transcript of MPEG-7 Report

  • 8/7/2019 MPEG-7 Report

    1/26

    Seminar Report

    MPEG-7

    ABSTRACT

    MPEG-7 is a multimedia content description standard. This description will be associated

    with the content itself, to allow fast and efficient searching for material that is of interest to

    the user. MPEG-7 is formally called Multimedia Content Description Interface. Thus, it is not

    a standard which deals with the actual encoding of moving pictures and audio, like MPEG-1,

    MPEG-2 and MPEG-4. It uses XML to store metadata, and can be attached to timecode in

    order to tag particular events, or synchronize lyrics to a song.

    MPEG-7 objectivesis to provide a fast and efficient searching, filtering and content

    identification method. Describe main issues about the content (low-level characteristics,

    structure, models, collections, etc.). Index a big range of applications. Audiovisual

    information that MPEG-7 deals is : Audio, voice, video, images, graphs and 3D models

    Inform about how objects are combined in a scene. Independence between description and

    the information itself.

    Kunal Kishore

    School of Engineering (it),CUSAT

    1

  • 8/7/2019 MPEG-7 Report

    2/26

    Seminar Report

    MPEG-7

    ACKNOWLEDGEMENT

    We consider it as a great privilege to express our heartfelt gratitude to many respected

    personalities who guided, inspired and helped us in successful completion of this seminar and

    presentation. We also express our gratitude to DR. SANTOSH KUMAR M.B., Senior Lecturer of

    the department for providing us with adequate facilities, ways and means by which we were

    able to complete this presentation. We express our sincere gratitude for his constant support

    and valuable suggestions without which the successful completion of this presentation wouldnot have been possible. We express our immense pleasure and thankfulness to all the teachers

    and staff of the Department of Information Technology, CUSAT for their cooperation and

    support. Last but not the least, we thank all others, and especially our classmates and our

    family members who in one way or another helped us in the successful completion of this

    work.

    Kunal Kishore

    Kunal Kishore

    School of Engineering (it),CUSAT

    2

  • 8/7/2019 MPEG-7 Report

    3/26

    Seminar Report

    MPEG-7

    INDEX

    Topic Page No.

    1.Introduction 5

    2.What is MPEG-7 Standered? 63.What is MPEG-7? 7

    4. Technical Overview 9

    4.i. Structure 9

    4.ii. Descriptor 11

    4.iii. Descriptor Scheme 17

    4.iv. DDL 19

    4.v. MPEG-7 parts 225. Application Domain 23

    6. Limitation 25

    7. Conclusion 26

    8. Refrences 27

    Kunal Kishore

    School of Engineering (it),CUSAT

    3

  • 8/7/2019 MPEG-7 Report

    4/26

    Seminar Report

    MPEG-7

    1. INTRODUCTION

    How many times have you seen science fiction movies such as 2001: A Space Odyssey and

    thought, Wow, were so far away from having any of the fancy gadgets depicted in these

    movies! In 2001, Hal, the talking computer intelligently navigates and retrieves information

    or runs complex operations instigated by spoken input. Or how about using an image-based

    query, say an image of the motorbike used by Arnold Schwartzenegger in the movie T2, to

    find images of similarly looking motorbikes. Dreams or reality?

    MPEG-7 is a multimedia content description standard, (), that addresses how humans and

    computer systems expect to interact with computer systems, since it develops richdescriptions that reflect those expectations

    Now A days accessing audio and video used to be a simple matter - simple because of the

    simplicity of the access mechanisms and because of the poverty of the sources. An

    incommensurable amount of audiovisual information is becoming available in digital form, in

    digital archives, on the World Wide Web, in broadcast data streams and in personal and

    professional databases, and this amount is only growing. The value of information often

    depends on how easy it can be found, retrieved, accessed, filtered and managed.

    Broadband is being offered with increasing audio and video quality and speed of access. The

    trend is clear: in the next few years, users will be confronted with such a large number of

    contents provided by multiple sources that efficient and accurate access to this almost infinite

    amount of content seems unimaginable todayasy it can be found, retrieved, accessed, filtered

    and managed. In spite of the fact that users have increasing access to these resources,

    identifying and managing them efficiently is becoming more difficult, because of the sheer

    volume. The question of identifying and managing content is not just restricted to database

    retrieval applications such as digital libraries, but extends to areas like broadcast channel

    selection, multimedia editing, and multimedia directory services.

    This challenging situation demands a timely solution to the problem . MPEG-7 is the answer

    to this need.

    Kunal Kishore

    School of Engineering (it),CUSAT

    4

  • 8/7/2019 MPEG-7 Report

    5/26

    Seminar Report

    MPEG-7

    2.What Are the MPEG Standards?

    The Moving Picture Coding Experts Group (MPEG) is a working group of the Geneva-based

    ISO/IEC standards organization, (International Standards Organization/International Electro-

    technical Committee) in charge of the development of international standards for

    compression, decompression, processing, and coded representation of moving pictures, audio,

    and a combination of the two. MPEG-7 then is an ISO/IEC standard being developed by

    MPEG, the committee that also developed the Emmy Award-winning standards known as

    MPEG-1 and MPEG-2, and the 1999 MPEG-4 standard.

    MPEG-1: For the storage and retrieval of moving pictures and audio on storage

    media.

    MPEG-2: For digital television, its the timely response for the satellite broadcasting

    and cable television industries in their transition from analog to digital formats.

    MPEG-4: Codes content as objects and enables those objects to be manipulated

    individually or collectively on an audiovisual scene.

    MPEG-1, -2, and -4 make content available. MPEG-7 lets you to find the content you

    need.

    Besides these standards, MPEG is currently also working in MPEG-21 a Technical

    Report about Multimedia Framework.

    The MPEG-1 and MPEG-2 standards have enabled the production of widely adopted

    commercial products, such as Video CD, MP3, digital audio broadcasting (DAB), DVD,

    digital television (DVB and ATSC), and many video-on-demand trials and commercialservices. MPEG-4 is the first real multimedia representation standard, allowing interactivity

    and a combination of natural and synthetic material coded in the form of objects. MPEG-4

    provides the standardized technological elements enabling the integration of the production,

    distribution and content access paradigms of the fields of interactive multimedia, mobile

    multimedia, interactive graphics and enhanced digital television.

    Kunal Kishore

    School of Engineering (it),CUSAT

    5

  • 8/7/2019 MPEG-7 Report

    6/26

    Seminar Report

    MPEG-7

    3. What is MPEG-7 ?

    The MPEG-7 standard, formally named "Multimedia Content Description Interface",

    provides a rich set of standardized tools to describe multimedia content. It is the standard that

    describes multimedia content so users can search, browse, and retrieve that content more

    efficiently and effectively than they could using todays mainly text-based search engines.

    Both human users and automatic systems that process audiovisual information are within the

    scope of MPEG-7. Its a standard for describing the features of multimedia content.

    MPEG-7 is an ISO/IEC standard developed by MPEG (Moving Picture Experts Group), the

    committee that also developed the successful standards known as MPEG-1 (1992) and

    MPEG-2 (1994), and the MPEG-4 standard (Version 1 in 1998, and version 2 in 1999) as

    mentioned above.

    MPEG-7 offers a comprehensive set of audiovisual Description Tools to create descriptions

    (i.e., a set of instantiated Description Schemes and their corresponding Descriptors at the

    users will), which will form the basis for applications enabling the needed effective and

    efficient access (search, filtering and browsing) to multimedia content. This is a challenging

    task given the broad spectrum of requirements and targeted multimedia applications, and the

    broad number of audiovisual features of importance in such context.

    MPEG-7 does not standardize or evaluate applications, although in its development standard

    applications have been used for understanding the requirements and evaluation of technology.

    It must be made clear that the requirements are derived from analyzing a wide range of

    potential applications that could use MPEG-7 tools. MPEG-7 is not aimed at any one

    application in particular; rather, the elements that MPEG-7 standardizes support as broad a

    range of applications as possible.

    MPEG-7 uniquely provides comprehensive standardised multimedia description tools for

    content. Descriptions for the catalogue level (e.g. title), the semantic level (who, what, when,

    where) and the structural level (spatio-temporal region, color histogram, timbre, texture) will

    provide tools for creative developers to generate new waves of multimedia applications.

    Kunal Kishore

    School of Engineering (it),CUSAT

    6

  • 8/7/2019 MPEG-7 Report

    7/26

    Seminar Report

    MPEG-7

    Standardized MPEG-7 description tools, then, are a key enabler of the following application

    domains:

    Search Engines, Digital Libraries, Broadcast Networks, Entertainment and News

    Distributors, Streaming Businesses.

    Dynamic start-up companies, searching for cutting edge technologies.

    Governmental, Educational, Law, Medical & Remedial Services, and Non-profit

    organizations looking for digital media solutions. For example, the U.S. Library of

    Congress receives over 10,000 multimedia items each week, and is committed to a)

    the long term preservation of these multimedia items in digital format, and b) making

    much of their collection accessible to U.S. citizens in digital format.

    XML, Metadata, Modeling/Simulation, & Surveillance Industries

    AI Practitioners, Content Creators and Providers.

    Kunal Kishore

    School of Engineering (it),CUSAT

    7

  • 8/7/2019 MPEG-7 Report

    8/26

    Seminar Report

    MPEG-7

    4. TECHNICAL OVERVIEW OF MPEG-7

    4.i. STRUCTURE

    Before explaning the MPEG-7 standereds structure, we will first mention MPEG-7

    standereds which include Descripters, description Schemas, a description Definition Langage,

    and system tools .

    Data is Multimedia information that will be described using MPEG_&, regardless of

    storage, coding , display, transmission medium or technology. Furthermore, a feature is a

    distinctive charectoristic of data that signifies something to somebody. Besed on these

    definitions, these are the four major MPEG-7 building blocks:

    Descriptor: A representation of feature. A Descriptor defines the syntax and

    semantics of the feature representation.

    Description Schema: The structures and symentics of the relationship between itscomponents, which may be both Descriptors and description schemes.

    Description Definition Language ( DDL ): A language that allows the creation of

    new description schemes and possibly descriptors. It also allows the extension and

    modification of existing description schemes. MPEG-7 adopted XML Schema

    Language as the MPEG-7 DDL. However, the DDL requires some specific extensions

    to XML Schema Language to satisfy all the requirements of MPEG-7. These

    extensions are currently being discussed through liaison activities between MPEG and

    W3C, the group standardizing XML.

    System Tools : Tools to support multiplexing of descriptions, synchronization of

    descriptors with content , delivery mechanism and coded representation for efficient

    storage and transmission and the management and protection of intellectual property

    in MPEG-7 descriptions.

    Kunal Kishore

    School of Engineering (it),CUSAT

    8

  • 8/7/2019 MPEG-7 Report

    9/26

    Seminar Report

    MPEG-7

    The Scope of MPEG-7

    Kunal Kishore

    School of Engineering (it),CUSAT

    9

  • 8/7/2019 MPEG-7 Report

    10/26

    Seminar Report

    MPEG-7

    4.ii. DESCRIPTORS

    A Descriptor (D) defines the syntax and the semantics of one representation of a particular

    feature of audiovisual content. A feature is a distinctive characteristic of the data which is of

    significance to a user.

    For example, the colorof an image is a feature. Possible Descriptors corresponding to

    the colorfeature are: color histogram, RGB vector or a string. A Descriptor value is an

    instantiation of a Descriptor for a given data set. For example, RGB= (255, 255, 255),

    colorstring="red".

    Table illustrates some of the current descriptors which have been incorporated into the XM

    or are undergoing core experiments (CEs). They have been subdivided into Visual and Audio

    descriptors.

    Type Feature Description

    Visual Besic Structure Grid layout

    Time series

    2D-3D Multiple View

    Multiple view

    the Spatial 2D coordinates

    Temporal interpolation

    Colour Colour space

    Dominent color

    Color Quantization

    Scalable Color

    Color Layout

    Color-Structure

    GoF/GoP Color

    Texture Spatial image intensity distributionHomogeneous texture

    Shape Object bounding box

    Region-based shape

    Contour-based shape

    3D shape descriptor

    Motion Camera motion

    Object motion trajectory

    Parametric object motion

    Motion activity

    Motion trajectory featuresKunal Kishore

    School of Engineering (it),CUSAT

    10

  • 8/7/2019 MPEG-7 Report

    11/26

    Seminar Report

    MPEG-7

    Audio Speech Annotation Lattice of words and phonemes

    plus metadata

    Timbre Ratio of even to odd harmonics

    Harmonic attack coherenceMelody Melodic contour and rhythm

    Table 2. Overview of Current Descriptors

    4.ii.(a)MPEG-7 Visual

    MPEG-7 Visual Description Tools included in the standard consist of basic structures and

    Descriptors that cover the following basic visual features: Color, Texture, Shape, Motion,

    Localization, and Face recognition. Each category consists of elementary and sophisticated

    Descriptors.

    Basic Structure

    o Grid Layout : The grid layout is a splitting of the image into a set of

    equally sized rectangular regions, so that each region can be described

    separately. Each region of the grid can be described in terms of other

    Descriptors such as color or texture. Furthermore, the descriptor allows

    to assign the subDescriptors to all rectangular areas, as well as to an

    arbitrary subset of rectangular regions.

    o Time Series: This descriptor defines a temporal series of Descriptors in a video

    segment and provides image to video-frame matching and video-frames to video-

    frames matching functionalities. Two types of TimeSeries are available:

    RegularTimeSeries and IrregularTimeSeries

    o 2D-3D Multiple View: The 2D/3D Descriptor specifies a structure which

    combines 2D Descriptors representing a visual feature of a 3D object seen from

    different view angles. The descriptor forms a complete 3D view-based

    representation of the object.

    o Spetial 2D Coordinate : This description defines a 2D spatial coordinate system

    and a unit to be used by reference in other Ds/DSs when relevant. The coordinate

    system is defined by a mapping between an image and the coordinate system. One

    Kunal Kishore

    School of Engineering (it),CUSAT

    11

  • 8/7/2019 MPEG-7 Report

    12/26

    Seminar Report

    MPEG-7

    of the advantages using this descriptor is that MPEG-7 descriptions need not to be

    modified even if the image size is changed or a part of the image is clipped. In this

    case, only the description of the mapping from the original image to the edited

    image is required.

    o Temporal Interpolation : The TemporalInterpolation D describes a temporal

    interpolation using connected polynomials. This can be used to

    approximate multi-dimensional variable values that change with time

    such as an object position in a video. The description size of the temporal

    interpolation is usually much smaller than describing all values.

    Color Discriptor :There are seven Color Descriptors: Color space, Color Quantization,

    Dominant Colors, Scalable Color, Color Layout, Color-Structure, and GoF/GoP Color.

    o Color space : The feature is the color space that is to be used in other color based

    descriptions. In the current description, the following color spaces are supported:

    a) R,G,B (b) Y,Cr,Cb (c) H,S,V (d) HMMD (e) Linear transformation

    matrix with reference to R, G, B (f) Monochrome

    o Color Quantization : This descriptor defines a uniform quantization of a color

    space. The number of bins which the quantizer produces is configurable, such that

    great flexibility is provided for a wide range of applications. For a meaningful

    application in the context of MPEG-7, this descriptor has to be combined with

    dominant color descriptors, e.g. to express the meaning of the values of dominant

    colors.

    o Dominent Color : Color quantization is used to extract a small number of

    representing colors in each region/image. The percentage of each quantized color

    in the region is calculated correspondingly. A spatial coherency on the entiredescriptor is also defined, and is used in similarity retrieval.

    o Scalable Color : The Scalable Color Descriptor is a Color Histogram in HSV

    Color Space, which is encoded by a Haar transform. Its binary representation is

    scalable in terms of bin numbers and bit representation accuracy over a broad

    range of data rates. The Scalable Color Descriptor is useful for image-to-image

    matching and retrieval based on color feature. Retrieval accuracy increases with

    the number of bits used in the representation.

    Kunal Kishore

    School of Engineering (it),CUSAT

    12

  • 8/7/2019 MPEG-7 Report

    13/26

    Seminar Report

    MPEG-7

    o Color Layout: This descriptor effectively represents the spatial distribution of

    color of visual signals in a very compact form. This compactness allows visual

    signal matching functionality with high retrieval efficiency at very small

    computational costs. It provides image-to-image matching as well as ultra high-

    speed sequence-to-sequence matching, which requires so many repetitions of

    similarity calculations. It also provides very friendly user interface using hand-

    written sketch queries since this descriptors captures the layout information of

    color feature.

    o Color Structure Discreptor: The Color structure descriptor is a color feature

    descriptor that captures both color content (similar to a color histogram) and

    information about the structure of this content. Its main functionality is image-to-

    image matching and its intended use is for still-image retrieval, where an image

    may consist of either a single rectangular frame or arbitrarily shaped, possibly

    disconnected, regions.

    o GoF/GoP Color: The Group of Frames/Group of Pictures color descriptor extends

    the ScalableColor descriptor that is defined for a still image to color description of

    a video segment or a collection of still images. Additional two bits allows to

    define how the color histogram was calculated, before the Haar transfor is applied

    to it: by average, median or intersection. The average histogram, which refers to

    averaging the counter value of each bin across all frames or pictures, is equivalent

    to computing the aggregate color histogram of all frames and pictures with proper

    normalization. The Median Histogram refers to computing the median of the

    counter value of each bin across all frames or pictures.

    Texture

    o Spatial image intensity distribution

    o Homogeneous texture

    Shape

    o Object bounding box

    o Region-based shape

    o Contour-based shape

    Kunal Kishore

    School of Engineering (it),CUSAT

    13

  • 8/7/2019 MPEG-7 Report

    14/26

    Seminar Report

    MPEG-7

    o 3D shape descriptor

    Motion

    o Camera motion

    o Object motion trajectory

    o Parametric object motion

    o Motion activity

    o Motion trajectory features

    4.ii.(a)MPEG-7 Audio

    MPEG-7 Audio, standardizes the description tools for describing audio

    content. Most Audio description tools are based on audio features that let

    us measure similarity in sounds (such as music and speech). Therefore,

    we can use these MPEG-7 Audio descriptors and description schemes to

    search and filter audio content based on several audio features like

    spectrum, harmony, timbre, and melody. Other Audio description tools let

    us describe spoken content and create a classification of sounds.

    We can classify the MPEG-7 Audio description tools into generic and high-

    level description tools. The generic Audio description tools include a group

    of low-level descriptors for audio features, named the MPEG-7 Audio

    Framework (see Figure A), that let us describe an audio signals spectral,

    parametric, and temporal features. The highlevel group provides

    description tools for sound recognition and indexing, spoken content, and

    query-by-humming applications, among other things.

    Speech Annotation

    o Lattice of words and phonemes

    plus metadata.

    Timber

    o Ratio of even to odd harmon

    Kunal Kishore

    School of Engineering (it),CUSAT

    14

  • 8/7/2019 MPEG-7 Report

    15/26

    Seminar Report

    MPEG-7

    o Harmonic attack coherence

    Melody:

    o Melodic contour and rhythm

    Kunal Kishore

    School of Engineering (it),CUSAT

    15

  • 8/7/2019 MPEG-7 Report

    16/26

    Seminar Report

    MPEG-7

    4.iii. DESCRIPTION SCHEMES

    A Description Scheme (DS) specifies the structure and semantics of the relationships between

    its components, which may be both Descriptors and Description Schemes.

    The following concepts are used within the DS group to describe audiovisual content:

    Syntactic structure - the physical and logical structure of audiovisual content, e.g.,

    structures based on temporal segments and/or spatial regions.

    Semantic structure - breakdown based on semantic meaning, e.g., structures based on

    temporal events and/or spatial objects.

    Syntactic-semantic links - the associations between syntactic elements and semantic

    elements.

    The Generic Audiovisual DS represents the integration of all of the DS proposals and

    submissions within a single DS. At the top level it consists of:

    A collection of Syntactic structure DSs, i.e., physical features such as segments,

    regions, color, texture, and motion are described here;

    A collection of Semantic structure DSs, i.e., semantic features such as objects, actors

    or events, e.g., "goal", "advertisement", "Madonna";

    Syntactic-semantic links DSs - which relate the syntactic elements to the semantic

    elements;

    Summary DS - this is used to enable browsing at different levels of granularity;

    MetaInfo DS - this contains descriptors carrying author or publisher-generated

    information, e.g., ContentDS, CreditsDS, CreationPurposeDS, RightsDS,

    PublicationDS, RightsDS;

    MediaInfo DS - this contains descriptors related to the storage media, e.g., file format,

    system, medium, colour, sound, length, duration, compression format;

    Model DS - this provides a way to describe the classification methods for audiovisual

    data or the correspondence between the current audiovisual content and other content

    through different models;

    Kunal Kishore

    School of Engineering (it),CUSAT

    16

  • 8/7/2019 MPEG-7 Report

    17/26

    Seminar Report

    MPEG-7

    Figure below illustrates the structure and content of the Generic Audiovisual DS.

    Figure : The Generic Audiovisual Description Scheme

    Kunal Kishore

    School of Engineering (it),CUSAT

    17

  • 8/7/2019 MPEG-7 Report

    18/26

    Seminar Report

    MPEG-7

    4.iv. DESCRIPTION DEFINITION LANGUAGE

    The Description Definition Language (DDL) is the language that allows the creation of new

    Description Schemes and Descriptors. It also allows the extension and modification of

    existing Description Schemes. A DDL specifies the constraints that a valid MPEG-7

    descripction should respect.

    The DDL has to be able to express spatial, temporal, structural, and conceptual relationships

    between the elements of a DS, and between DSs. It must provide a rich model for links and

    references between one or more descriptions and the data that it describes. It also has to be

    capable of validating descriptor data types, both primitive (integer, text, date, time) and

    composite (histograms, enumerated types). In addition, it must be platform and application

    independent and human- and machine-readable.

    There are two form of languages normally used for DDL:

    4.iv.(a) XML : The general consensus within MPEG-7 is that it should be based on XML

    syntax. It is encoded in XML that defines documents structures and embed it directly

    withinthe document through the use of markups. XML is same as HTML but with user

    defined tags.

    An XML schema can be seen as an extended DTD, because it goes beyond the current

    function alities of DTD and allows data type definitions. The MPEG-7 DDL uses w3 XML

    schemalanguage as a base language and adds MPEG-7 specific mechanism, such as array,

    matrix data type on its top.

    A DDL parser is used to check the validity of the diacriptionwith these rules. The parser is

    first initialised with the schema and it then parse a description to report any errors.

    XML schema parser which can be used to validate the description schemes are XSV, XML

    SPY Xerces.

    4.iv.(b) BINARY FORMAT FOR MPEG-7

    Kunal Kishore

    School of Engineering (it),CUSAT

    18

  • 8/7/2019 MPEG-7 Report

    19/26

    Seminar Report

    MPEG-7

    MPEG textual description (XML Files )tend to become large and thus inefficient. That is why

    MPEG-7 defines a binary format, BiM, that enables the streaming and compression of

    descriptstion. Actually BiM can be used to compress any XML document. The document

    structure is highly compressed , 98% an average.

    Example: MPEG-7 Description (in textual format) of the MPEG logo.

    http://www.tilab.org/mpeg/mpeg_logo-anim_l.gif

    The animated MPEG LogoAuthorMPEGhttp://www.tilab.com/mpeg/

    Kunal Kishore

    School of Engineering (it),CUSAT

    19

  • 8/7/2019 MPEG-7 Report

    20/26

    Seminar Report

    MPEG-7

    4.v. MPEG-7 PARTS

    The MPEG-7 standard consists of several parts.This lets us use the various clusters of

    technology alone, according to MPEGs toolbox approach to standardization. It also keeps

    the editing of the standard manageable. Here are the MPEG-7 standard parts:

    MPEG-7 Systems the tools needed to prepare MPEG-7 descriptions for efficient

    transport and storage and the terminal architecture.It specifies the system tools to

    prepare MPEG-7 Descriptions for efficient transport and storage (through

    binarization) to allow synchronization between content and descriptions.

    MPEG-7 Description Definition Language - the language for defining the syntax of

    the MPEG-7 Description Tools and for defining new Description Schemes.

    MPEG-7 Visual the Description Tools dealing with (only) Visual descriptions.

    MPEG-7 Audio the Description Tools dealing with (only) Audio descriptions.

    MPEG-7 Multimedia Description Schemes - the Description Tools dealing with

    generic features and multimedia descriptions.

    MPEG-7 Reference Software - a software implementation of relevant parts of the

    MPEG-7 Standard with normative status.

    MPEG-7 Conformance Testing - guidelines and procedures for testing conformance

    of MPEG-7 implementations.

    MPEG-7 Extraction and use of descriptions informative material (in the form of a

    Technical Report) about the extraction and use of some of the Description Tools.

    MPEG-7 Profiles and levels - provides guidelines and standard profiles.

    MPEG-7 Schema Definition - specifies the schema using the Description Definition

    Language.

    Kunal Kishore

    School of Engineering (it),CUSAT

    20

  • 8/7/2019 MPEG-7 Report

    21/26

    Seminar Report

    MPEG-7

    5. MPEG-7 Application Domains

    The elements that MPEG-7 standardizes will support a broad a range of applications (for

    example, multimedia digital libraries, broadcast media selection, multimedia editing, home

    entertainment devices, etc.). MPEG-7 will also make the web as searchable for multimedia

    content as it is searchable for text today. This would apply especially to large content

    archives, which are being made accessible to the public, as well as to multimedia catalogues

    enabling people to identify content for purchase. The information used for content retrieval

    may also be used by agents, for the selection and filtering of broadcasted "push" material or

    for personalized advertising. Additionally, MPEG-7 descriptions will allow fast and cost-effective usage of the underlying data, by enabling semi-automatic multimedia presentation

    and editing. All domains making use of multimedia will benefit from MPEG-7 including : -

    Digital libraries, Education (image catalogue, musical dictionary, Bio-medical

    imaging catalogues)

    Multimedia editing (personalised electronic news service, media authoring)

    Cultural services (history museums, art galleries, etc.),

    Multimedia directory services (e.g. yellow pages, Tourist information,

    Geographical information systems)

    Broadcast media selection (radio channel, TV channel,)

    Journalism (e.g. searching speeches of a certain politician using his name, his

    voice or his face),

    E-Commerce (personalised advertising, on-line catalogues, directories of e-

    shops,)

    Surveillance (traffic control, surface transportation, non-destructive testing in

    hostile environments, etc.),

    Investigation services (human characteristics recognition, forensics),

    Home Entertainment (systems for the management of personal multimedia

    collections, including manipulation of content, e.g. home video editing,

    searching a game, karaoke,)

    Social (e.g. dating services),

    Kunal Kishore

    School of Engineering (it),CUSAT

    21

  • 8/7/2019 MPEG-7 Report

    22/26

    Seminar Report

    MPEG-7

    5.i. Typical applications enabled by MPEG-7 technology include:

    Audio: I want to search for songs by humming or whistling a tune or, using an

    excerpt of Pavarottis voice, get a list of Pavarottis records and video clips in which

    Pavarotti sings or simply makes an appearance. Or, play a few notes on a keyboard

    and retrieve a list of musical pieces similar to the required tune, or images matching

    the notes in a certain way, e.g. in terms of emotions.

    Graphics: Sketch a few lines on a screen and get a set of images containing similar

    graphics, logos, and ideograms.

    Image: Define objects, including color patches or textures, and get examples from

    which you select items to compose your image. Or check if your company logo was

    advertised on a TV channel as contracted.

    Visual: Allow mobile phone access to video clips of goals scored in a soccer game,

    or automatically search and retrieve any unusual movements from surveillance

    videos.

    Multimedia: On a given set of multimedia objects, describe movements and relations

    between objects and so search for animations fulfilling the described temporal and

    spatial relations. Or, describe actions and get a list of scenarios containing such

    actions.

    5.ii. Example of an MPEG-7 Application

    The following application is an examples of the type of solutions that MPEG-7 can solve.

    There are many more applications being developed around the world.

    5.ii.a. Content Retrieval using Image as the Query

    Figure shows possible ways to search for visual content using the inherent structural features

    of an image. In this example there are four image features detailed.(1) The color histogram

    feature of an image allows me to search for images that have the same color. Note, the

    position of the colors is not important but rather the amount of similar color in the image is

    Kunal Kishore

    School of Engineering (it),CUSAT

    22

  • 8/7/2019 MPEG-7 Report

    23/26

    Seminar Report

    MPEG-7

    important. (2) Spatial Color Distribution allows me to search for images where the location

    of the same color is important. You can see that the added object in the right-bottom flag does

    not affect this type of search. (3) in the Spatial Edge Distribution you can additionally

    search for images that have a similar edge or contour profile as search technique. Note, color

    does not make a difference to this type of search. (4) you can see an example of searching by

    Object Shape . Here, the color and edge profiles are not important.

    3. Spatial edge

    distribution

    2. Spatial color

    distribution

    4. Object shape

    1. Color histogram

    5.ii.(b) It is the responsibility of the search engine and filter agent to match the query data to

    the MPEG-7 description.

    A few query examples are:

    o Play a few notes on a keyboard and retrieve a list of musical pieces similar to the

    required tune, or images matching the notes in a certain way, e.g. in terms of

    emotions.

    o Draw a few lines on a screen and find a set of images containing similar graphics,

    logos, ideograms,.

    o Define objects, including color patches or textures and retrieve examples among

    which you select the interesting objects to compose your design.

    o On a given set of multimedia objects, describe movements and relations between

    objects and so search for animations fulfilling the described temporal and spatial

    relations.

    o Describe actions and get a list of scenarios containing such actions.

    o Using an excerpt of Pavarottis voice, obtaining a list of Pavarottis records, video

    clips where Pavarotti is singing and photographic material portraying Pavarotti.Kunal Kishore

    School of Engineering (it),CUSAT

    23

  • 8/7/2019 MPEG-7 Report

    24/26

    Seminar Report

    MPEG-7

    6. LIMITATIONS

    MPEG-7 will not standardize the (automatic) extraction of AV descriptions/features.

    It doesnt specify the search engine (or any other program) that can make use of the

    description. It will be left to the creativity and innovation of search engine companies.

    Example, to manipulate and massage the MPEG-7-described content into search

    indices that can be used by their browser and retrieval tools.

    Kunal Kishore

    School of Engineering (it),CUSAT

    24

  • 8/7/2019 MPEG-7 Report

    25/26

    Seminar Report

    MPEG-7

    7. CONCLUSSION

    MPEG-7 provides a seamless path towards increasingly intelligent content management

    systems.

    We live in the age of convergence, from the level of production through to distribution and

    consumption. The technical hardware and communication infrastructure is evolving and will

    soon reach the point where computing and communications will become embedded in

    everyday objects and environments. Media will also then become ubiquitous. Ubiquitous

    media will create a huge demand for new content, and meeting this demand must involve

    fundamental changes to all stages of media production, management and delivery.

    Media archives will become vast and interconnected pools of content, too large to be

    managed manually. Customization of content within programs, e.g. substitution of structural

    elements (characters, music, voices) according to viewer desires, content scaling for PDA,

    cell phones, will be not only possible, but easy and pleasant. MPEG-7 will enable the

    creation of tools, (through its structured combination of low level features and high-level

    meta-data), for coping with this "outbreak" of generic content.

    MPEG-7 is intended to describe audiovisual information regardless of storage, coding,

    display, transmission, medium, or technology. It will address a wide variety of media types

    including: still pictures, graphics, 3D models, audio, speech, video, and combinations of

    these.

    And it will address both retrieval from digital archives (pull applications) as well as filtering

    of streamed audiovisual broadcasts on the Internet (push applications). It will operate in both

    real-time and non real-time environments.

    Kunal Kishore

    School of Engineering (it),CUSAT

    25

  • 8/7/2019 MPEG-7 Report

    26/26

    Seminar Report

    MPEG-7

    8. REFRENCES

    MPEG-7 (Technical) Overview, W: Jeju, Korea, March, 2002

    MPEG-7 Awareness event(http://www.mpeg-industry.com)

    http://www.mpeg.chiariglion.org (A MPEG Home Page )

    http://www.mpeg-alliance.com

    http://www.w3.org/1999/05/06-xmlschema-1

    http://www.dlib.org/dlib/september99/hunter/09hunter.html

    Kunal Kishore

    S h l f E i i (it) CUSAT

    26

    http://www.mpeg-industry.com/http://www.mpeg.chiariglion.org/http://www.mpeg.chiariglion.org/http://www.mpeg-alliance.com/http://www.mpeg-alliance.com/http://www.w3.org/1999/05/06-xmlschema-1http://www.w3.org/1999/05/06-xmlschema-1http://www.dlib.org/dlib/september99/hunter/09hunter.htmlhttp://www.dlib.org/dlib/september99/hunter/09hunter.htmlhttp://www.mpeg-industry.com/http://www.mpeg.chiariglion.org/http://www.mpeg-alliance.com/http://www.w3.org/1999/05/06-xmlschema-1http://www.dlib.org/dlib/september99/hunter/09hunter.html