Text of 15938-5 FCD Information Technology – Multimedia Content ...

981
INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE NORMALISATION ISO/IEC JTC 1/SC 29/WG 11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC 1/SC 29/WG 11/N3966 March 2001, Singapore Source: Multimedia Description Schemes (MDS) Group Title: Text of 15938-5 FCD Information Technology – Multimedia Content Description Interface – Part 5 Multimedia Description Schemes Status: Approved Editors : Peter van Beek, Ana B. Benitez, Joerg Heuer, Jose Martinez, Philippe Salembier, Yoshiaki Shibata, John R. Smith, Toby Walker

Transcript of Text of 15938-5 FCD Information Technology – Multimedia Content ...

Text of 15938-5 FCD Information Technology Multimedia Content Description Interface Part 5 Multimedia Description Schemes

ISO/IEC 2000 All rights reserved

INTERNATIONAL ORGANIZATION FOR STANDARDIZATION

ORGANISATION INTERNATIONALE NORMALISATION

ISO/IEC JTC 1/SC 29/WG 11

CODING OF MOVING PICTURES AND AUDIO

ISO/IEC JTC 1/SC 29/WG 11/N3966

March 2001, SingaporeSource:Multimedia Description Schemes (MDS) Group

Title:Text of 15938-5 FCD Information Technology Multimedia Content Description Interface Part 5 Multimedia Description Schemes

Status:Approved

Editors:Peter van Beek, Ana B. Benitez, Joerg Heuer, Jose Martinez, Philippe Salembier, Yoshiaki Shibata, John R. Smith, Toby Walker

COMMITTEE DRAFT

SET DDOrganization " ISO/IEC 2000 All rights reserved" ISO/IEC 2000 All rights reserved

SET LibEnteteISO "ISO/IECCD159385" ISO/IECCD159385

SET LIBLANG " 2" 2

SET LIBTypeTitreISO " 63" 63

SET DDTITLE4 "Part5: Multimedia Description Schemes" Part5: Multimedia Description Schemes

SET DDTITLE3 "Information Technology Multimedia Content Description Interface" Information Technology Multimedia Content Description Interface

SET DDTITLE2 "lment introductif lment central Partie5: Titre de la partie" lment introductif lment central Partie5: Titre de la partie

SET DDTITLE1 "Information Technology Multimedia Content Description Interface Part5: Multimedia Description Schemes" Information Technology Multimedia Content Description Interface Part5: Multimedia Description Schemes

SET DDDocLanguage "E" E

SET DDWorkDocDate "2000-11-17" 2000-11-17

SET DDDocStage "(30) Commitee" (30) Commitee

SET DDOrganization3 "ISO/IEC" ISO/IEC

SET DDOrganization1 "ISO/IECJ" ISO/IECJ

SET DDBASEYEAR ""

SET DDAmno ""

SET DDDocSubType ""

SET DDDocType "International Standard" International Standard

SET DDWorkDocNo "3705" 3705

SET DDRefNoPart "ISO/IEC15938" ISO/IEC15938

SET DDRefGen "ISO/IEC159385" ISO/IEC159385

SET DDRefNum "ISO/IEC/(30) Commitee15938-5" ISO/IEC/(30) Commitee15938-5

SET DDSCSecr ""

SET DDSecr ""

SET DDSCTitle ""

SET DDTCTitle "JTC" JTC

SET DDWGNum "11" 11

SET DDSCNum "29" 29

SET DDTCNum "1" 1

SET libH2NAME "Ttulo 2" Ttulo 2

SET libH1NAME "Ttulo 1" Ttulo 1

SET LibDesc ""

SET LibDescD ""

SET LibDescE ""

SET LibDescF ""

SET NATSubVer ""

SET CENSubVer ""

SET ISOSubVer "a" a

SET LIBVerMSDN "STD Version 1.0" STD Version 1.0

SET LIBStageCode "30" 30

SET LibRpl ""

SET LibICS ""

SET LIBFIL " 4" 4

SET LIBEnFileName "C:\TEMP\ISO-IEC 15938-5 (E).doc" C:\TEMP\ISO-IEC 15938-5 (E).doc

SET LIBFrFileName ""

SET LIBDeFileName ""

SET LIBNatFileName ""

SET LIBFileOld ""

SET LIBTypeTitreCEN ""

SET LIBTypeTitreNAT ""

SET LibEnteteCEN ""

SET LibEnteteNAT ""

SET LIBASynchroVF ""

SET LIBASynchroVE ""

SET LIBASynchroVD ""

SET DDPubYear ""

SET DDEditionNo "" ISO/IECJTC1/SC29N3966

Date:2001-03-12

ISO/IECCD159385ISO/IECJTC1/SC29/WG11Secretariat: REF DDSecr \* CHARFORMAT Information Technology Multimedia Content Description Interface Part5: Multimedia Description SchemesDocument type:International StandardDocument subtype: REF DDDocSubType \* CHARFORMAT Document stage:(30) CommiteeDocument language:EC:\homes\jrsmith\Standards\MPEG-7\Meetings\Mtg 56 Singapore\Out\FCD\W3966a.docSTD Version 1.0lment introductif lment central Partie5: Titre de la partieError! AutoText entry not defined.Copyright notice

This ISO document is a working draft or committee draft and is copyright-protected by ISO. While the reproduction of working drafts or committee drafts in any form for use by participants in the ISO standards development process is permitted without prior permission from ISO, neither this document nor any extract from it may be reproduced, stored or transmitted in any form for any other purpose without prior written permission from ISO.

Requests for permission to reproduce this document for the purpose of selling it should be addressed as shown below or to ISOs member body in the country of the requester:

[Indicate :

the full address

telephone number

fax number

telex number

and electronic mail address

as appropriate, of the Copyright Manager of the ISO member body responsible for the secretariat of the TC or SC within the framework of which the draft has been prepared]

Reproduction for sales purposes may be subject to royalty payments or a licensing agreement.

Violators may be prosecuted.

Contents

xviiForeword

Introductionxix1Scope11.1Organization of the document11.2Overview of Multimedia Description Schemes12Normative references33Terms, definitions, symbols, abbreviated terms43.1Conventions43.1.1Datatypes, Descriptors and Description Schemes43.1.2Naming convention43.1.3Documentation convention43.2Wrapper of the schema53.3Abbreviations53.4Basic terminology64Schema tools74.1Base types84.1.1Mpeg7RootType84.2Root element124.2.1Mpeg7 Root Element124.3Top-level types154.3.1BasicDescription174.3.2ContentDescription184.3.3ContentManagement224.4Multimedia content entities244.4.1MultimediaContent DS244.5Packages294.5.1Package DS304.6Description Metadata324.6.1DescriptionMetadata DS325Basic datatypes355.1Integer datatypes355.1.1Unsigned datatypes355.2Real datatypes405.2.1zeroToOne datatype405.2.2minusOneToOne datatype415.2.3nonNegativeReal datatype415.3Vectors and matrices425.3.1Representing Arrays and Matrices Values (Informative)425.3.2vector datatypes435.3.3minusOneToOneVector datatype445.3.4Matrix datatypes445.3.5Diagonal matrix datatypes475.3.6MinusOneToOneMatrix datatype495.4Probability datatypes505.4.1probabilityVector datatype505.4.2ProbabilityMatrix datatype505.5String Datatypes525.5.1mimeType Datatype525.5.2countryCode Datatype535.5.3regionCode Datatype545.5.4currencyCode Datatype555.5.5characterSetCode Datatype566Link to the media and localization576.1References to Ds and DSs576.1.1Reference Datatype576.2Unique Identifier596.2.1UniqueID Datatype596.3Time description tools616.3.1Time Datatype626.3.2TimePoint Datatype636.3.3Duration Datatype646.3.4IncrDuration Datatype656.3.5RelTimePoint Datatype656.3.6RelIncrTimePoint Datatype666.3.7TimeHeader Datatype676.3.8Time Datatype Examples (informative)686.3.9MediaTime Datatype686.3.10MediaTimePoint Datatype696.3.11MediaDuration Datatype716.3.12MediaIncrDuration Datatype716.3.13MediaRelTimePoint Datatype726.3.14MediaRelIncrTimePoint Datatype736.3.15MediaTimeHeader Datatype746.3.16MediaTime Datatype Examples (informative)746.4Media Locators776.4.1MediaLocator Datatype776.4.2InlineMedia Datatype786.4.3TemporalSegmentLocator Datatype796.4.4ImageLocator Datatype806.4.5AudioVisualSegmentLocator Datatype817Basic Tools837.1Language Identification837.1.1xml:lang attribute847.1.2xsd:language datatype857.2Textual annotation857.2.1Textual datatype857.2.2TextAnnotation Datatype867.2.3StructuredAnnotation Datatype887.2.4KeywordAnnotation Datatype897.2.5DependencyStructure Datatype907.3Classification Schemes and Terms977.3.1Classification Scheme DS987.3.2TermDefinition DS1037.3.3TermUse datatype1057.3.4ControlledTermUse datatype1077.3.5termReference Datatype1097.3.6ClassificationSchemeAlias Datatype1107.3.7GraphicalClassificationSchemeType DS1117.3.8GraphicalTerm D1127.4Description of agents1157.4.1Agent DS1157.4.2Person DS1167.4.3PersonGroup DS1177.4.4Organization DS1207.4.5PersonName Datatype1237.4.6Electronic Address datatype1267.5Description of places1277.5.1Place DS1277.6Graph1307.6.1Graph DS1317.6.2RelationBase DS1337.6.3Relation DS1347.6.4relationProperty Datatype1367.7Ordering Tools1387.7.1OrderingKey DS1397.8Affective Description1417.8.1Affective DS1417.9Phonetic Description1457.9.1PhoneticTranscriptionLexicon DS1458Media description tools1478.1.1MediaInformation DS1488.1.2MediaIdentification D1498.1.3MediaProfile DS1508.1.4MediaFormat D1518.1.5MediaTranscodingHints D1588.1.6Media Quality D1608.1.7MediaInstance DS1638.1.8MediaInformation DS examples (informative)1639Creation and production description tools1669.1CreationInformation tools1669.1.1CreationInformation DS1669.1.2Creation DS1679.1.3Classification DS1719.1.4RelatedMaterial DS1769.1.5CreationInformation DS examples (informative)17810Usage description tools18010.1UsageInformation tools18010.1.1UsageInformation DS18010.1.2Rights D18110.1.3Financial D18210.1.4Availability DS18310.1.5UsageRecord DS18510.1.6UsageInformation DS examples (informative)18611Structure of the content18811.1Segment entity description tools19111.1.1Segment DS19311.1.2StillRegion DS19611.1.3ImageText DS19911.1.4Mosaic DS20111.1.5StillRegion3D DS20511.1.6VideoSegment DS20611.1.7MovingRegion DS21011.1.8VideoText DS21311.1.9InkSegment DS21511.1.10AudioSegment DS22211.1.11AudioVisualSegment DS22311.1.12AudioVisualRegion DS22611.1.13MultimediaSegment DS22811.1.14Edited video segment description tools22911.2Segment attribute description tools24511.2.1SpatialMask D24611.2.2TemporalMask D24611.2.3SpatioTemporalMask D24711.2.4SceneGraphMask D24811.2.5MediaSpaceMask D24911.2.6OrderedGroupDataSetMask D25011.2.7MatchingHint D25211.2.8PointOfView D25411.2.9InkMediaInfo DS25711.2.10HandWritingRecogInfo DS26011.2.11HandWritingRecogResult DS26211.3Segment decomposition tools26411.3.1Basic segment decomposition tools26511.3.2Still region decomposition tools26911.3.33D still region decomposition tools27011.3.4Video segment decomposition tools27111.3.5Moving region decomposition tools27311.3.6Ink segment decomposition tools27611.3.7Audio segment decomposition tools27711.3.8Audio-visual segment decomposition tools27811.3.9Audio-visual region decomposition tools28111.3.10Multimedia segment decomposition tools28411.3.11Analytic edited video segment decomposition tools28511.4Segment relation description tools28611.4.1SegmentRelationBase DS28811.4.2SegmentRelation DS28911.4.3Temporal segment relation description tools28911.4.4Spatial segment relation description tools29511.4.5Spatio-temporal segment relation description tools30311.4.6Other segment relation description tools30511.4.7SegmentRelations DS examples (informative)30812Semantics of the content31412.1Abstraction Model31612.1.1Formal Abstractions31612.1.2Media Abstractions31612.1.3Concepts31612.1.4Properties31612.2Semantic entity description tools31612.2.1SemanticBase DS31812.3SemanticContainer DS32112.3.1SemanticContainer DS syntax32112.3.2SemanticContainer DS semantics32212.4Semantic DS32212.4.1Semantic DS syntax32212.4.2Semantic DS semantics32212.4.3Object DS32312.4.4AgentObject DS32512.4.5Event DS32612.4.6Concept DS32812.4.7SemanticState DS32912.4.8SemanticPlace DS33712.4.9SemanticTime DS34012.4.10Semantic DS examples (informative)34412.5Semantic attribute description tools35212.5.1AbstractionLevel datatype35212.5.1Extent datatype37112.5.2Position datatype37212.6Semantic relation description tools37212.6.1SemanticRelationBase DS37412.6.2SemanticRelation DS37512.6.3Semantic entity relations37612.6.4Semantic relation - semantic entity relations40912.6.5Segment - semantic entity relations41112.6.6Analytic model - Semantic entity relations41312.6.7Semantic relation examples (informative)41513Content navigation and access41713.1Summarization41713.1.1Summarization DS41913.1.2Summary DS42013.1.3HierarchicalSummary DS42113.1.4SummaryThemeList DS42413.1.5HighlightSummary DS42513.1.6HighlightSegment DS42813.1.7SequentialSummary DS43313.1.8FrameProperty D43513.1.9SoundProperty D43813.1.10TextProperty D44213.2Views, partitions and decompositions44513.2.1Partition datatype44613.2.2Filter datatype45213.2.3Filtering datatype45513.2.4View DS45813.2.5SpaceView DS46013.2.6FrequencyView DS46113.2.7SpaceFrequencyView DS46313.2.8ResolutionView DS46513.2.9SpaceResolutionView DS46713.2.10ViewDecomposition DS46813.2.11ViewSet DS47013.2.12SpaceTree DS47213.2.13FrequencyTree DS47413.2.14SpaceFrequencyGraph DS47513.2.15VideoViewGraph DS47813.2.16MultiResolutionPyramid DS48113.3Description of variation of the content48313.3.1VariationSet DS48514Organization of the content49014.1Collections49014.1.1Collection DS49114.1.2ContentCollection DS49214.1.3SegmentCollection DS49514.1.4DescriptorCollection DS49614.1.5ConceptCollection DS49714.1.6Mixed Collections49814.1.7StructuredCollection DS50014.2Models50214.2.1Model DS50214.3Probability models50314.3.1ProbabilityModel DS50314.3.2ProbabilityDistribution DS50414.3.3DiscreteDistribution DS50914.3.4ContinuousDistribution DS51414.3.5FiniteStateModel DS51914.4Analytic model52414.4.1AnalyticModel DS52414.4.2CollectionModel DS52714.4.3DescriptorModel DS52914.4.4ProbabilityModelClass DS53014.5Cluster models53214.5.1ClusterModel DS53314.6Classification models53414.6.1ClassificationModel DS53514.6.2ClusterClassificationModel DS53614.6.3ProbabilityClassificationModel DS53815User interaction54015.1User preferences54115.1.1Basic user preference tools54115.1.2UserPreferences DS54315.1.3UserIdentifier datatype54415.1.4UsagePreferences DS54515.1.5FilteringAndSearchPreferences DS54715.1.6CreationPreferences DS55115.1.7ClassificationPreferences DS55415.1.8SourcePreferences DS55715.1.9BrowsingPreferences DS56115.1.10SummaryPreferences DS56315.2Usage History56615.2.1UsageHistory DS56715.2.2UserActionHistory DS56815.2.3UserActionList DS57015.2.4UserAction DS57116Bibliography57617Annex A: Classification Schemes57817.1Classification Schemes for Media Definition57817.1.1Classification Schemes for MediaIdentification DS57817.1.2Classification Schemes for MediaFormat D58117.1.3Classification Schemes for MediaQuality D associated59417.2Classification Schemes for Creation and Production Information59817.2.1Classification Schemes for Creation Datatype59817.2.2Classification Schemes for Classification DS60317.2.3Classification Schemes for RelatedMaterial DS63917.3Classification Schemes for Usage Definition64517.3.1Classification Schemes for Availability DS64517.4Classification Schemes for User Interaction64517.4.1Classification Scheme for Usage History DS64517.5Classification Schemes for VideoEditing65017.6Classification Scheme for AnalyticTransition DS65017.6.1Classification Scheme for EvolutionType65017.7Classification Schemes for Affective65117.7.1Classification Scheme for Affective D65118Annex B: xPath Datatype definition based on W3C XPATH specification (informative)65319Annex C: MDS schema definition (informative)65520Annex D: Example description (informative)737

List of Figures

1Figure 1: Overview of the MDSs

8Figure 2: Organization of BaseTypes of Mpeg7RootType, DSType, DType, and HeaderType.

12Figure 3: Valid descriptions begin with the Mpeg7 root element.

17Figure 4: The type organization hierarchy for top-level types. The Mpeg7 element uses top-level types for ContentDescription and ContentManagement.

30Figure 5: Illustrates the use of Packages organizing and conveying the structure and types of description information to aid users to overcome unfamiliarity during browsing.

30Figure 6: Illustrates the use of Packages for signaling between a multimedia database and search engine.

61Figure 7: Overview of Time Datatype entities.

91Figure 8: Dependency structure among the governors, heads, and dependents.

97Figure 9: DependencyStructure example.

102Figure 10: The "Escort2_4:Genre" Genre Classification Scheme.

147Figure 11. Model for content entity, profile and instance

188Figure 12: Model of the tools for describing the structure of multimedia content. (figure modified)

191Figure 13: Illustration of the description of a video segment using tools for describing the structure of multimedia content. (modify figure)

193Figure 14: Overview of the tools for describing segments of multimedia content (modify figure).

196Figure 15: Examples of the spatial and temporal connectivity of segments: a) and b) segments composed of one single connected component; c) and d) segments composed of three connected components.

198Figure 16: Example of a tree hierarchy of still regions.

201Figure 17: Image that contains one scene image text ("Cherie, je suis la") and two superimposed image texts ("En direct du state DE-KUIP Rotterdam Jean-Francois DEVELEY" and "Nous allons assister a un match fantastique") in French.

204Figure 18: Spatial reference system in terms of the relation of the mosaic and the first frame of the video segment.

204Figure 19: Temporal reference system in relation to the first frame of the video segment.

205Figure 20: Mosaic for a video segment over 183 frames, affine model.

215Figure 21: Functional overview of the InkSegment DS.

220Figure 22: Example of electronic ink document.

228Figure 23: Documentary about eagles.

234Figure 24: The central image is a frame of a composite clip. The clips shown on right are mapped on the supports or editing areas shown on the left side.

234Figure 25: In this figure, the pictures represent frames of a same shot. Small pictures belong to composition or internal transitions. Bigger pictures correspond to composition shots or intra-composition shots.

238Figure 26: The first shot contains two different effects. First, there is a composition transition of type "cut", where an incrustation appears in a subpart of the frame. The second is an internal transition: the incrustation is not affected by the effect. A transition effect of type cut introduces the second shot.

249Figure 27: Example of a scene graph mask.

250Figure 28: Example of a media space mask.

252Figure 29: Example of an ordered group mask.

265Figure 30: Overview of the tools for describing decompositions of segments.

268Figure 31: Examples of temporal segment decompositions: a) and b) segment decompositions with neither gaps nor overlaps; c) and d) segment decompositions with gaps or overlaps.

269Figure 32: Valid types of segment decompositions among the specialized Segment DSs ("time" = temporal decomposition; "space" = spatial decomposition; "spaceTime" = spatio-temporal decomposition; and "media" = media source decomposition). Each Segment DS represents itself and the types derived from it. The mosaic is a special case of still region, which can not be the result of any segment decomposition.

288Figure 33: Overview of the tools for describing relations among segments.

309Figure 34: Nature documentary.

310Figure 35: Key frame of a video shot capturing a goal in a soccer game.

310Figure 36: Syntactic relations between "Goal-sg" segment, "Kick-sg" segment, "Not-Catch-sg" segment, and "Goal-rg" region for Figure 35.

311Figure 37: Syntactic relations between "Kick-sg" segment, "Not-Catch-sg" segment, "Goal-rg" region, "Forward-rg" region, "Ball-rg" region, and "Goalkeeper-rg" region for Figure 35.

312Figure 38: An image with associated 2D-String (rabbit < cloud < car = airplane, rabbit = car < airplane < cloud).

315Figure 39: Model of the tools for describing the semantics of multimedia content.

318Figure 40: Illustration of the description of a narrative world using tools for describing the semantics of multimedia content.

333Figure 41: Description of the object ripe banana using membership functions, SemanticState DSs, Property elements, SemanticRelationBase DSs, and AnalyticalModel DSs.

356Figure 42: Relation between description of media and media abstraction description.

357Figure 43: Relation between media abstraction description and formal abstraction description.

361Figure 44: Possible formal abstract description of a soccer game.

374Figure 45: Overview of the tools for describing semantic relations.

418Figure 46: Illustration of HierarchicalSummary DS containing two summaries. In this example, each HighlightSegment refers to a key-videoclip and/or a key-audioclip.

419Figure 47: Illustration of SequentialSummary DS.

429Figure 48: Illustration of HighlightSegment elements containing a KeyAVClip, KeyVideoClip, KeyAudioClip, KeyFrame and KeySound.

431Figure 49: An example of a key-frame hierarchy.

435Figure 50: Temporal properties of summary video frame and their relation to source and summary content.

440Figure 51: Example of an audio summary with a single source.

441Figure 52: Example of an audio summary with multiple sources.

446Figure 53: Illustration of key concepts of Space and Frequency View, Partitions and Decompositions including Views, ViewSets and View Decompositions

447Figure 54: Illustration of the different Partitions of time axis (a), temporal-frequency axis (b), and time / temporal-frequency plane (c).

452Figure 55: Illustration of the Filter tools - FilterType is an abstract type from which Filter1DType, Filter2DType and FilterSeparableType are derived. FilteringType specifies the filtering of image, video and audio signals by specifying a Filter and the Padding, Shifting and Cropping parameters.

458Figure 56: Illustration of the different types of Space and FrequencyViews of a Source image: SpaceView, ResolutionView, SpaceResolutionView, FrequencyView and SpaceFrequencyView.

483Figure 57: Illustration of key concepts of Variation DS including VariationSet, Variation, variationFidelity, variationPriority and VariationRelationship.

488Figure 58: Illustration of different variations of a source multimedia program. The eight variation programs have different modalities (video, image, text, audio) and fidelities with respect to the source program.

491Figure 59: Overview of Collection tools - Content Collection, SegmentCollections, Descriptor Collection, Concept Collection, and Mixed Collection are specialized types of Collections. StructuredCollection further describes the relationships among Collections.

500Figure 60: The StructuredCollection DS describes collections in terms of the relationships (i.e., RAB, RBC, RAC) among collections.

502Figure 61: Overview of Model tools

540Figure 62: Personalized filtering, search and browsing of multimedia content.

541Figure 63: A generic usage model for user preference and content descriptions.

567Figure 64: Context diagram for usage history descriptions and their applications.

List of Tables

2Table 1: List of Tools for Content Description and Management

7Table 2: List of Schema Tools

35Table 3: List of Basic Datatype

57Table 4: List of Linking Tools

83Table 5: List of Basic Tools

98Table 6: Relation between Terms

147Table 7: Media Information Tools

166Table 8: Creation and Production Tools

180Table 9: Usage Information Tools

188Table 10: Tools for the description of the structural aspects of the content.

192Table 11: Tools for the description of segments of multimedia content.

246Table 12: Tools for the description of segment attributes.

264Table 13: Tools for the description of decompositions of segments.

287Table 14: Tools for the description of relations among segments.

293Table 15: Formal definitions of the normative binary temporal relations between segments with informative examples.

293Table 16: Inverse relations of the temporal interval relations.

295Table 17: Formal definitions of the normative n-ary temporal relations between two or more segments with informative examples.

299Table 18: Formal definitions of the normative directional spatial relations between two segments with informative examples.

300Table 19: Inverse relations of the directional spatial relations.

302Table 20: Formal definition of the normative topological spatial relations between two segments with informative examples.

303Table 21: Inverse relations of topological spatial relations.

305Table 22: Formal definitions of the normative n-ary spatio-temporal relations between two or more segments with informative examples.

308Table 23: Definitions and inverse relations of the normative other segment relations between two segments with informative examples.

314Table 24: List of tools for the description of the semantic aspects of the content

317Table 25: List of semantic entities

352Table 26: Abstraction attributes.

374Table 27: Semantic relations.

378Table 28: Semantic Definitions and inverse relations of the normative n-ary relations between two or more semantic entities with informative examples and descriptions.

382Table 29: Definitions and inverse relations of the normative relations between two semantic entities with informative examples and descriptions.

383Table 30: Definitions and inverse relations of the normative relations between one narrative world and one semantic entity with informative examples and descriptions.

385Table 31: Definitions and inverse relations of the normative relations between two narrative worlds with informative examples and descriptions.

387Table 32: Definitions and inverse relations of the normative relations between two objects with informative examples and descriptions.

391Table 33: Definitions and inverse relations of the normative relations between one object and one event with informative examples and descriptions.

394Table 34: Definitions and inverse relations of the normative relations between two events with informative examples and descriptions.

395Table 35: Definitions and inverse relations of the normative relations between one concept and one semantic entity with informative examples and descriptions.

397Table 36: Definitions and inverse relations of the normative relations between one semantic state and one semantic entity with informative examples and descriptions.

398Table 37: Definitions and inverse relations of the normative relations between two semantic states with informative examples and descriptions.

400Table 38: Definitions and inverse relations of the normative relations between one semantic place and one semantic entity with informative examples and descriptions.

405Table 39: Definitions and inverse relations of the normative relations between two semantic places with informative examples and descriptions.

406Table 40: Definitions and inverse relations of the normative relations between one semantic time and one semantic entity with informative examples and descriptions.

409Table 41: Definitions and inverse relations of the normative relations between two semantic times with informative examples and descriptions.

411Table 42: Definitions and inverse relations of the normative relations between one semantic relation and one semantic state with informative examples and descriptions.

413Table 43: Definitions and inverse relations of the normative relations between one segment and one semantic entity with informative examples and descriptions.

415Table 44: Definitions and inverse relations of the normative relations between one analytic model and one semantic entity with informative examples and descriptions.

417Table 45: List of content navigation and access tools

417Table 46: List of summarization tools.

446Table 47: List of view, partition and decomposition tools.

447Table 48: List of partition tools.

458Table 49: List of view tools.

469Table 50: List of view decomposition tools.

490Table 51: List of content organization tools.

540Table 52: User interaction tools.

Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.

International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part3.

In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IECJTC1. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75% of the national bodies casting a vote.

Attention is drawn to the possibility that some of the elements of this part of ISO/IEC15938 may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.

International Standard ISO/IEC159385 was prepared by Joint Technical Committee ISO/IECJTC1, JTC, Subcommittee SC29, .

This second/third/... edition cancels and replaces the first/second/... edition (), [clause(s) / subclause(s) / table(s) / figure(s) / annex(es)] of which [has / have] been technically revised.

ISO/IEC15938 consists of the following parts, under the general title Information Technology Multimedia Content Description Interface:

Part 1:Systems

Part 2: Description Definition Language

Part 3: Visual

Part 4: Audio

Part5: Multimedia Description Schemes

Part 6: Reference Software

Part 7: Conformance

Introduction

The MPEG-7 standard also known as "Multimedia Content Description Interface" aims at providing standardized core technologies allowing description of multimedia content in multimedia environments. This is a challenging task given by a broad spectrum of requirements and targeted multimedia applications, and a broad number of audio-visual features of importance in such context. In order to achieve this broad goal, MPEG-7 standardizes:

Datatypes that are description elements not specific to the multimedia domain that corresponds to reusable basic types or structures employed by multiple Descriptors and Description Schemes.

Descriptors (D) to represent Features. Descriptors define the syntax and the semantics of each feature representation. A Feature is a distinctive characteristic of the data, which signifies something to somebody. It is possible to have several descriptors representing a single feature, i.e. to address different relevant requirements. A Descriptor does not participate in many-to-one relationships with other description elements.

Description Schemes (DS) to specify the structure and semantics of the relationships between their components, which may be both Ds and DSs. A Description Scheme shall have descriptive information and may participate in many-to-one relationships with other description elements.

A Description Definition Language (DDL) to allow the creation of new DSs and, possibly, Ds and to allows the extension and modification of existing DSs.

Systems tools to support multiplexing of descriptions or description and content, synchronization issues, transmission mechanisms, file format, etc.

The standard is subdivided into seven parts:

1. Systems: Architecture of the standard, tools that are needed to prepare MPEG-7 Descriptions for efficient transport and storage, and to allow synchronization between content and descriptions. Also tools related to managing and protecting intellectual property.

2. Description Definition Language: Language for specifying DSs and Ds and for defining new DSs and Ds.

3. Visual: Visual description tools (Ds and DSs).

4. Audio: Audio description tools (Ds and DSs).

5. Multimedia Description Schemes: Description tools (Ds and DSs) that are generic, i.e. neither purely visual nor purely audio.

6. Reference Software: Software implementation of relevant parts of the MPEG-7 Standard.

7. Conformance: Guidelines and procedures for testing conformance of MPEG-7 implementations.

This document contains the description tools of the Multimedia Description Schemes (MDSs) part of the standard. This document addresses the normative aspects of the MDSs and also illustrates some non-normative examples of Ds and DSs. The syntax of the Ds and DSs is defined using the DDL FCD (ISO/IEC 15938-2).Information Technology Multimedia Content Description Interface Part5: Multimedia Description Schemes1 Scope1.1 Organization of the document

This document describes the MDS description tools under consideration in part 5 of the MPEG-7 standard (15938-5). In the sequel, each description tool is described by the following subclauses:

Syntax: Normative DDL specification of the Ds or DSs.

Binary Syntax: Normative binary representation of the Ds or DSs in case a specific binary representation has been designed. If no specific binary representation has been designed, the generic algorithm defined in the first part of the standard (ISO/IEC 15938-1) is assumed to be used.

Semantic: Normative definition of the semantics of all the components of the corresponding D or DS.

Informative examples: Optionally, an informative subclause giving examples of description.

1.2 Overview of Multimedia Description Schemes

The description tools, Ds and DSs, described in this document are mainly structured on the basis of the functionality they provide. An overview of the structure is described in Figure 1.

Basic

datatypes

Links & media

localization

Basic

Tools

Models

Basic elements

Navigation &

Access

Content management

Content description

Col

lections

Summaries

Variations

Content organization

Creation &

Production

Media

Usage

Semantic

aspects

Structural

aspects

User

interaction

User

Preferences

Schema

Tools

User

History

Views

Views

Figure 1: Overview of the MDSs

At the lower level of Figure 1, basic elements can be found. They deal with schema tools (root element, top-level element and packages), basic datatypes, mathematical structures, linking and media localization tools as well as basic DSs, which are found as elementary components of more complex DSs. These description tools are defined in clauses 4 (Schema Tools), 5 (Basic datatypes), 6 (Link to the media and Localization), and 7 (Basic elements).

Based on this lower level, content description & management elements can be defined. These description tools describe the content of a single multimedia document from several viewpoints. Currently five viewpoints are defined: Creation & Production, Media, Usage, Structural aspects and Semantic aspects. The first three description tools address primarily information related to the management of the content (content management) whereas the two last ones are mainly devoted to the description of perceivable information (content description). The following table defines more precisely the functionality of each set of description tools:

Set of description toolsFunctionality

Media (Clause 8)Description of the storage media: typical features include the storage format, the encoding of the multimedia content, the identification of the media. Note that several instances of storage media for the same multimedia content can be described.

Creation & Production (Clause 9)Meta information describing the creation and production of the content: typical features include title, creator, classification, purpose of the creation, etc. This information is most of the time author generated since it cannot be extracted from the content.

Usage (Clause 10)Meta information related to the usage of the content: typical features involve rights holders, access right, publication, and financial information. This information may very likely be subject to change during the lifetime of the multimedia content.

Structural aspects (Clause 11)Description of the multimedia content from the viewpoint of its structure: the description is structured around segments that represent physical spatial, temporal or spatio-temporal components of the multimedia content. Each segment may be described by signal-based features (color, texture, shape, motion, and audio features) and some elementary semantic information.

Semantic aspects (Clause 12)Description of the multimedia content from the viewpoint of its semantic and conceptual notions. It relies on the notions of objects, events, abstract notions and their relationship.

Table 1: List of Tools for Content Description and Management

These five sets of description tools are presented here as separate entities. As will be seen in the sequel, they are interrelated and may be partially included in each other. For example, Media, Usage or Creation & Production elements can be attached to individual segments involved in the structural description of the content. Depending on the application, some areas of the content description will have to be emphasized and other may be minimized or discarded.

Beside the direct description of the content provided by the five sets of description tools described in the previous table, tools are also defined for navigation and access (clause 13): Browsing is supported by the summary description tools and information about possible variations of the content is also given. Variations of the multimedia content can replace the original, if necessary, to adapt different multimedia presentations to the capabilities of the client terminals, network conditions or user preferences.

Another set of tools (Content organization, clause 14) addresses the organization of the content by classification, by the definition of collections of multimedia documents and by modeling. Finally, the last set of tools specified in User Interaction (clause 15) describes user's preferences pertaining to consumption of multimedia material.

2 Normative references

The following ITU-T Recommendations and International Standards contain provisions, which, through reference in this text, constitute provisions of ISO/IEC 15938. At the time of publication, the editions indicated were valid. All Recommendations and Standards are subject to revision, and parties to agreements based on ISO/IEC 15938 are encouraged to investigate the possibility of applying the most recent editions of the standards indicated below. Members of ISO and IEC maintain registers of currently valid International Standards. The Telecommunication Standardization Bureau maintains a list of currently valid ITU-T Recommendations.

ISO 8601: Data elements and interchange formats -- Information interchange -- Representation of dates and times.

ISO 639: Code for the representation of names of languages.

ISO 3166-1: Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes

ISO 3166-2: Codes for the representation of names of countries and their subdivisions -- Part 2: Country subdivision code.

Note (informative): The current list of valid ISO3166-1 country and ISO3166-2 region codes is maintained by the official maintenance authority Deutsches Institut fr Normung. Information on the current list of valid region and country codes can be found at http://www.din.de/gremien/nas/nabd/iso3166ma/index.html. ISO 4217: Codes for the representation of currencies and funds.

Note (informative): The current list of valid ISO4217 currency code is maintained by the official maintenance authority British Standards Institution (http://www.bsi-global.com/iso4217currency).

XML: Extensible Markup Language, W3C Recommendation 6 October 2000, http://www.w3.org/TR/2000/REC-xml-20001006 XML Schema: W3C Candidate Recommendation 24 October 2000,

Primer:

http://www.w3.org/TR/2000/CR-xmlschema-0-20001024/ Structures:http://www.w3.org/TR/2000/CR-xmlschema-1-20001024/

Datatypes:http://www.w3.org/TR/2000/CR-xmlschema-2-20001024/ xPath: XML Path Language, W3C Recommendation 16 November 1999, http://www.w3.org/TR/1999/REC-xpath-19991116 RFC2045 Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies.

RFC 2046 Multipurpose Internet Mail Extensions (MIME) Part Two: Media Types.

RFC 2048 Multipurpose Internet Mail Extensions (MIME) Part Four: Registration Procedures.

MIMETYPES. The current list of registered mimetypes, as defined in RFC2046, RFC2048, is maintained by IANA (Internet Assigned Numbers Authority). It is available from ftp://ftp.isi.edu/in-notes/iana/assignments/media-types/media-types/ CHARSETS. The current list of registered character set codes, as defined in RFC2045 and RFC2048 is maintained by IANA (Internet Assigned Numbers Authority). It is available from ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets.

3 Terms, definitions, symbols, abbreviated terms

3.1 Conventions

3.1.1 Datatypes, Descriptors and Description Schemes

This part of ISO/IEC 15938 specifies datatypes, Descriptors and Description Schemes.

Datatype - a description elements that is not specific to the multimedia domain and that corresponds to a reusable basic type or structure employed by multiple Descriptors and Description Schemes.

Descriptor (D) - a description element that represents an multimedia feature, or an attribute or group of attributes of a multimedia entity. A Descriptor does not participate in many-to-one relationships with other description elements

Description Scheme (DS) - a description element that represents entities or relationships in the multimedia domain. A Description Scheme has descriptive information and may participate in many-to-one relationships with other description elements.

3.1.2 Naming convention

In order to specify datatypes, Descriptors and Description Schemes, this part of ISO/IEC 15938 uses constructs provided by the language specified in ISO/IEC 15938-2, such as "element", "attribute", "simpleType" and "complexType". The names associated to these constructs are created on the basis of the following conventions:

If the name is composed of various word, the first letter of each word is capitalized. The rule for the capitalization of the first word depends on the type of construct and is described below.

Element naming: the first letter of the first word is capitalized (e.g. TimePoint element of TimeType).

Attribute naming: the first letter of the first word is not capitalized (e.g. timeUnit attribute of IncrDurationType).

complexType naming: the first letter of the first word is capitalized, the suffix "Type" is used at the end of the name (e.g. PersonType).

simpleType naming: the first letter of the first word is not capitalized, the suffix "Type" may be used at the end of the name (e.g. timePointType).

Note that the "Type" suffix is not used when referencing a complexType or simpleType in the definition of a datatype, Descriptor or Description Scheme. For instance, the text refers to the "Time datatype" (instead of "TimeType datatype"), to the "MediaLocator D" (instead of "MediaLocatorType D") and to the "Person DS" (instead of PersonType DS).

3.1.3 Documentation convention

The syntax of each datatype, Descriptor and Description Scheme is specified using the constructs provided by ISO/IEC 15938-2, and is shown in this document using a specific font and background:

The semantics of each datatype, Descriptor and Description Scheme is specified using a table format, where each row contains the name and a definition of a type, element or attribute:

NameDefinition

ExampleTypeSpecifies an ...

element1Describes the

attribute1Describes the

Non-normative examples are included in separate subclauses, and are shown in this document using a separate font and background:

example element content

Moreover, the schema defined in this document follows a type-centric approach. As a result, almost no elements (in the XML schema sense) are defined. Most of the description tools are specified only by defining a complexType or a simpleType. In order to create a description, it has to be assumed that an element of a given type (complexType or simpleType) has been declared somewhere in the schema, for example as a member of another complexType or simpleType.

The examples in the informative subclauses assume that the following declaration has been made:

Therefore, the example shown above is a valid description.

3.2 Wrapper of the schema

The Syntax defined in this document assumes the following Schema Wrapper.

3.3 Abbreviations

For the purposes of this International Standard, the terms and definitions given in the following apply:

APS:

Advanced Photo System

AV:

Audio-visual

CIF:

Common Intermediate Format

CS:

Classification Scheme

D:

Descriptor

Ds:

Descriptors

DCT:

Discrete Cosine Transform

DDL:

Description Definition Language

DS:

Description Scheme

DSs:

Description Schemes

IANA:

Internet Assigned Numbers Authority

IPMP:

Intellectual Property Management Protocol

JPEG:

Joint Photographic Experts Group

MDS:

Multimedia Description Scheme

MPEG:

Moving Picture Experts Group

MPEG-7:ISO/IEC 15938

MP3:

MPEG1/2 layer 3 (audio coding)

QCIF:

Quarter Common Intermediate Format

SMPTE: Society of Motion Picture and Television Engineers

TZ:

Time Zone

TZD:

Time Zone Difference

URI:

Uniform Resource Identifier (IETF Standard is RFC 2396)

URL:

Uniform Resource Locator (IETF Standard is RFC 2396)

XM:

eXperimentation Model

XML:

Extensible Markup Language

3.4 Basic terminology

Audio-visual: Refers to content consisting of both audio and video.

Feature: Property of multimedia content that signifies something to a human observer, such as "color" or texture.

Multimedia:Refers to content comprising one or modalities or content types, such as images, audio, video, 3D models, electronic ink, and so forth.

4 Schema tools

This clause specifies the organization of the base type hierarchy of Descriptors and Description Schemes, and specifies the MPEG-7 root and top-level elements that shall be used for forming descriptions that are valid according to ISO/IEC 15938-5. Two types of schema valid descriptions are distinguished: complete, stand-alone description documents, and instances that carry partial or incremental information for an application, which are called description units. This clause also specifies the different task-oriented top-level types, which are used in conjunction with the MPEG-7 root element to form different types of complete descriptions. The clause also specifies different multimedia content entity description tools that shall be used in conjunction with the specific top-level type for content description to form complete descriptions of different types of multimedia content, such as images, video, audio, mixed multimedia, collections, and so forth. Finally, this clause specifies a Package tool that shall be used to describe an organization or packaging of the Descriptors and Description Schemes for an application as well as a Description Metadata tool that shall be used to describe metadata about the description itself.The tools specified in this clause include:

ToolFunctionality

Base TypesForms the base type hierarchy for description tools Descriptors, Description Schemes, and Header.

Root ElementDescribes the initial wrapper or root element of schema valid instance documents and description units.

Top-level TypesForm the content models of top-level elements that follow the root element for descriptions.

Multimedia Content EntitiesDescribes different types of multimedia content such as images, video, audio, mixed multimedia, collections, and so forth.

PackagesDescribes an organization or packaging of the Descriptors and Description Schemes for an application.

Description MetadataDescribes metadata about descriptions.

Table 2: List of Schema Tools

The concepts of root elements, top-level elements, global elements, global types, and other concepts if this clause are defined as follows:

Schema valid description: a description that is syntactically valid according to the schema defined in ISO/IEC159385.

Description document: a full description that is complete in the sense of providing a complete, stand-alone description of multimedia content for an application.

Description unit: a schema valid unit of description that carries partial or incremental information for an application. The description units may be used in specific applications where the client incrementally requests pieces of a complete description from the server. Description units may also be used in the context of streaming delivery of descriptions.

Root element: the initial element of a schema valid instance document or description unit. A root element can be thought of as a wrapper of the description.

Top-level type: Serves as a wrapper of tools for a particular description task.

Top-level element: an element that immediately follows the root element, which determines whether the description provides a schema valid instance document or description unit.

Global element: an element of the schema that is declared at the global scope. In the schema defined in ISO/IEC159385, only the root element is declared globally.

Global type: a content model or type defined immediately proceeding the element in the schema defined in ISO/IEC159385.

Package: an organization of Descriptors and Description Schemes that packages the tools for an application.

4.1 Base types

This subclause specifies the base types of the description tools. The base types are organized into the type hierarchy shown in Figure 2.

HeaderType

Summ

aryType

VisualDSType

AudioDSType

VisualDType

AudioDType

DType

Mpeg7Type

DSType

id

Mpeg7UnitType

SegmentType

HeaderType

HeaderType

SummaryType

SummaryType

:

VisualDSType

VisualD

ST

ype

AudioDSType

AudioDSType

VisualDType

VisualDType

AudioDType

AudioDType

DType

DType

Mpeg7Type

Mpeg7RootType

DSType

id

DSType

id

Mpeg7UnitType

Mpeg7

SegmentType

SegmentTyp

e

id

Figure 2: Organization of BaseTypes of Mpeg7RootType, DSType, DType, and HeaderType.

4.1.1 Mpeg7RootType

The following base types are defined in this subclause:

Mpeg7RootType: provides the base abstract type of the type hierarchy. The Description Schemes, Descriptors and Header derive from Mpeg7RootType. The Mpeg7 root element contains an element of type Mpeg7RootType.

DType: extends Mpeg7RootType to provide a base abstract type for visual and audio Descriptors. All MPEG-7 Descriptors shall be based on DType.

VisualDType: extends DType to provide a base abstract type for the visual Descriptors. All visual Descriptors that are to be used in describing visual segments shall derive from VisualDType.

AudioDType: extends DType to provide a base abstract type for the audio Descriptors. All audio Descriptors that are to be used in describing audio segments shall derive from AudioDType.

DSType: extends Mpeg7RootType to provide a base abstract type for the visual and audio Description Schemes. All Description Schemes shall derive from DSType.

VisualDSType: extends DSType to provide a base abstract type for the visual Description Schemes. All visual Description Schemes that are to be used in describing visual segments shall derive from VisualDSType.

AudioDSType: extends DSType to provide a base abstract type for the audio Description Schemes. All audio Description Schemes that are to be used in describing audio segments shall derive from AudioDSType.

HeaderType: extends Mpeg7RootType to provide a base abstract type for describing headers in descriptions. The header is intended to collect information that is common to the components (called "body part") of the description and can be instantiated within any DSType elements.

4.1.1.1 Mpeg7RootType syntax

4.1.1.2 Mpeg7Type semantics

Semantics of the Mpeg7RootType:

NameDefinition

Mpeg7RootTypeProvides the topmost base type of the type hierarchy from which all Descriptors and Description Schemes are derived (abstract). The Description Schemes, Descriptors and Header derive from Mpeg7RootType. The Mpeg7 root element contains an element of type Mpeg7RootType as one of the top-level elements.

Semantics of the DType:

NameDefinition

DTypeDescribes a feature of multimedia content that signify something to a human observer. Dtype extends Mpeg7RootType to provide a base abstract type for the visual and audio Descriptors. All Descriptors are derived from DType.

Semantics of the DSType:

NameDefinition

DSTypeDescribes concepts related to multimedia content. A Description Scheme specifies the structure and semantics of the relationships between its components, which may be both Descriptors and Description Schemes. DSType extends Mpeg7RootType to provide a base abstract type for the visual and audio Description Schemes. All Description Schemes are derived from DSType and can contain an id attribute and arbitrary number of Headers.

HeaderDescribes the Header of the DSType (optional). Header describes information that is common to multiple components of a description and is referenced from other components of the description (body part) using a reference tool.

idIdentifies the id of the DSType (optional). The id attribute shall be used for referencing and relations in order to connect descriptions.

Semantics of the VisualDType:

NameDefinition

VisualDTypeDescribes the visual features of visual segments. VisualDType extends DType to provide a base abstract type for the visual Descriptors. All visual Descriptors that are to be used in describing visual segments are derived from VisualDType.

Semantics of the AudioDType:

NameDefinition

AudioDTypeDescribes the audio features of audio segments. AudioDType extends DType to provide a base abstract type for the audio Descriptors. All audio Descriptors that are to be used in describing audio segments are derived from AudioDType.

Semantics of the VisualDSType:

NameDefinition

VisualDSTypeDescribes visual segments. VisualDSType extends DSType to provide a base type for all visual Description Schemes. All visual Description Schemes that are to be used in describing visual segments are derived from VisualDSType.

Semantics of the AudioDSType:

NameDefinition

AudioDSTypeDescribes audio segments. AudioDSType extends DSType to provide a base type for all audio Description Schemes. All audio Description Schemes that are to be used in describing audio segments are derived from AudioDSType.

Semantics of the HeaderType:

NameDefinition

HeaderTypeDescribes information that is common to the body part of the description. The Header is used to collect information that is re-used in multiple components of the description. A referencing tool is needed to associate an instance of HeaderType with other components of the descriptions (body part).

idIdentifies the id of the HeaderType (optional). The id attribute shall be used for referencing and relations in order to connect Header descriptions.

4.2 Root element

This subclause specifies the root element. The Mpeg7 root element shall be used in forming any description document in MPEG-7. The top-level elements of the Mpeg7 root element are used to distinguish the type of description as shown in Figure 3. For example, a schema-valid description built using top-level elements (usually corresponding to a stand-alone description of multimedia content) can be written with an Mpeg7 root element contaning ContentManagement, ContentDescription, or both element. On the other hand, a schema valid description unit that can carry an arbitrary descriptor or description scheme instance can be represented as a DescriptionUnit.

Mpeg7

via

DescriptionUnit

DominantColor

VideoSegment

Place

Classification

Description

Image

via

ContentManagement

via

ContentDescription

Complete Description

Description Unit

Figure 3: Valid descriptions begin with the Mpeg7 root element.

The Mpeg7 root element is the main element enclosing an entire description. In this sense, the root serves as a wrapper for an description. Mpeg7 can enclose to different kinds of descriptions defines as follows:

Complete Description: A complete description of multimedia content using the top-level elements (the Content Management and the Content Description). For example, the full description of an image is a complete description. The definition of "complete" will depend on the application using the description.

Description Unit: An description consisting of an arbitrary descriptor or description scheme instance. Unlike a complete description which will usually contains a "semantically-complete" description, a description unit can be used to send a partial descripton as required by an application such as a description of a place, a shape and texture descriptor and so on. A description unit is enclosed by the top-level element called DescriptionUnit.

4.2.1 Mpeg7 Root Element

Mpeg7 provides a root element of any types of description, which serves as an initial element of a description.

4.2.1.1 Mpeg7 Root Element syntax

4.2.1.2 Mpeg7 Root Element semantics

Semantics of the Mpeg7Type:

NameDefinition

Mpeg7TypeServes as a type of the root element of MPEG-7.

DescriptionMetadataDescribes the metadata for the descriptions contained within this root element (optional). The description metadata propagates (applies) to the descriptions contained within this root element, unless new description metadata is specified for these descriptions.

xml:langIdentifies the language of the content of description (optional). If not specified, the language of the description is not specified.

typeIdentifies the type of the description. This attribute can take one of the following values:

descriptionUnit the description is a schema valid description unit.

complete the description is a schema valid complete description.

If not specified, descriptionUnit is the default value.

Semantics of the Mpeg7 root element:

NameDefinition

Mpeg7Serves as the root element of the description.

DescriptionUnitDescribes an instance of one of the MPEG-7 tools.

ContentDescriptionDescribes multimedia data or abstraction of multimedia content (optional). The ContentDescription corresponds to the abstract base type of the different content models for describing the different types and abstractions of multimedia content.

ContentManagementDescribes the management of multimedia data (optional). The ContentManagement corresponds to the abstract base type of the different content models for describing the management of multimedia content.

4.2.1.3 Mpeg7 Root Element examples (informative)

The following example shows the use of Mpeg7 for creating a description unit containing an instance of a DominantColor color descriptor.

1.0

myUnitDescription