MPEG Augmented Reality Tutorial


      Web3D Conference, August 4-5, Los Angeles, CA

                Marius Preda, MPEG 3DG Chair
                  Institut Mines TELECOM



http://www.slideshare.net/MariusPreda/mpeg-augmented-reality-tutorial
MPEG Augmented Reality Tutorial
Topics of the day


What is MPEG?


MPEG offer in the Augmented Reality field


MPEG-A Part 14 Augmented Reality Reference Model


MPEG-A Part 13 Augmented Reality Application Format
What is MPEG?
A suite of ~130 ISO/IEC standards
Coding/compression of elementary media:
     – Audio (MPEG-1, 2 and 4)
     – Video (MPEG-1, 2 and 4)
     – 2D/3D graphics (MPEG-4)
Storage and Transport
     – MPEG-2 Transport
     – File Format (MPEG-4)
     – Dynamic Adaptive Streaming over HTTP (DASH)
Hybrid (natural & synthetic) scene description, user interaction (MPEG-4)
Metadata (MPEG-7)
Media management and protection (MPEG-21)
Sensors and actuators, virtual worlds (MPEG-V)
Advanced User interaction (MPEG-U)
Media-oriented middleware (MPEG-M)

More ISO/IEC standards under development for
   – 3D Video, 3D Audio
   – Coding and Delivery in Heterogeneous Environments
   – …
What is MPEG?
Involvement, approach, deployment
A standardization activity continuing for 24 years
      – Supported by several hundred companies/organisations from ~25 countries
      – ~500 experts participating in quarterly meetings
      – More than 2300 active contributors
      – Many thousands of experts working in companies
A proven way of organizing work to deliver useful, widely used standards
      – Developing standards by integrating individual technologies
      – Well defined procedures
      – Subgroups with clear objectives
      – Ad hoc groups continuing coordinated work between meetings
MPEG standards are widely referenced by industry
      – 3GPP, ARIB, ATSC, DVB, DVD-Forum, BDA, ETSI, SCTE, TIA, DLNA, DECE, OIPF…
Billions of software and hardware devices built on MPEG technologies
      – MP3 players, cameras, mobile handsets, PCs, DVD/Blu-ray players, STBs, TVs, …
MPEG technologies related to AR
1992/4: MPEG-1/2 (AV content)
1997: VRML
1998: MPEG-4 v.1
     – Part 11 - BIFS:
          – Binarisation of VRML
          – Extensions for streaming
          – Extensions for server command
          – Extensions for 2D graphics
          – Real time augmentation with audio & video
     – Part 2 - Visual:
          – 3D Mesh compression
          – Face animation
1999: MPEG-4 v.2
     – Part 2 - Visual:
          – Body animation
First form of broadcast signal augmentation
MPEG technologies related to AR
2003: MPEG-4 Part 16 - AFX
     – A rich set of 3D graphics tools
     – Compression of geometry, appearance, animation
2004: MPEG-4 Part 16
     – X3D Interactive Profile
2005: MPEG-4 AFX 2nd Edition
     – Animation by morphing
     – Multi-texturing
2007: MPEG-4 AFX 3rd Edition
     – WSS for terrain and cities
     – Frame based animation
2009: MPEG-4 Part 25
     – Compression of third-party XML (X3D, COLLADA)
2011: MPEG-4 AFX 4th Edition
     – Scalable complexity mesh coding
A rich set of 3D Graphics representation and compression tools
MPEG technologies related to AR
2011: MPEG-V - Media Context and Control, 1st Edition
     – Sensors and actuators
     – Interoperability between Virtual Worlds
2012: MPEG-U – Advanced User Interface
201x: MPEG-V 2nd Edition
     – GPS
     – Biosensors
     – 3D Camera
201x: MPEG-H
     – 3D Video: compression of video + depth
     – 3D Audio
201x: CDVS
     – Feature-point based descriptors for image recognition
A rich set of sensors and actuators
Main features of MPEG AR technologies

 All AR-related data is available from MPEG standards
 Real time composition of synthetic and natural objects
 Access to
      – Remotely/locally stored BIFS/compressed 2D/3D mesh objects
      – Streamed real-time BIFS/compressed 2D/3D mesh objects
 Inherent object scalability (e.g. for streaming)
 User interaction & server generated scene changes
 Physical context
      – Captured by a broad range of standard sensors
      – Affected by a broad range of standard actuators
MPEG vision on AR, the MPEG AR Browser
 Point to a URL – no need to download new applications for each context.
 The browser
     – Retrieves scenario from the internet
     – Starts video acquisition
     – Tracks objects
     – Recognizes objects from visual signatures
     – Recovers camera pose
     – Gets streamed 3D graphics
     – Composes new scenes
     – Gets inputs from various sensors
     – Offers optimal AR experience by constantly adapting interaction possibilities
         and objects from a remote server.
 Industry
     – Maximize number of customers through MPEG-compliant authoring tools and
         browsers
     – No need to develop a new application for each use case and device platform
MPEG vision on AR


[Figure: an Authoring Tool produces content with MPEG-4/MPEG-7/MPEG-21/MPEG-U/MPEG-V; after compression, the content is downloaded to an MPEG Player.]
Architecture

[Figure: the AR Player at the centre, connected to the User, the AR file or stream, Local Sensors & Actuators (Local Real World Environment), Remote Sensors & Actuators (Remote Real World Environment), Media Servers and Service Servers.]
MPEG ongoing work on AR

 ISO/IEC 23000-14 Augmented Reality Reference Model
      – WD stage, collaborating with SC24/WG9, ARStandards, OGC, Khronos,
         Web3D
 ISO/IEC 23000-13 Augmented Reality Application Format
      – CD stage, based on MPEG standards
Augmented Reality Reference Model
WD2.0 content
Viewpoints
     – Community Objectives
          – Enterprise Viewpoint
     – Abstract/Design
          – Information Viewpoint
          – Computational Viewpoint
     – Implementation/Development
          – Engineering Viewpoint
          – Technology Viewpoint
Glossary
Use cases
     – Guide
     – Create
     – Play
Augmented Reality Reference Model
 Enterprise viewpoint: global architecture and actors


[Figure: the global architecture (AR Document, AR Player, User, Local / Remote Context, Media Servers, Service Servers) annotated with the actors involved at each element.]
Actors
     – AR Tools Creator (ARTC)
     – AR Experience Creator (AREC)
     – Assets Creator (AC)
     – Assets Aggregator (AA)
     – Telecommunication Operator (TO)
     – Device Manufacturer (DM)
     – Middleware/Component Provider (MCP)
     – Online Middleware/Component Provider (OMCP)
     – AR Service Provider (ARSP)
     – End-User (EU)
Augmented Reality Reference Model
Information viewpoint
[Figure: information handled around the AR Player, which connects the AR Document, the Local/Remote Context, the User, Media Servers and Service Servers.]
Device Context
     – Device capabilities
Location of Device
     – Location
     – Orientation
Scene/Real World
     – Raw image
     – Sensed data
     – Virtual Camera view
     – Detected features
     – Area of Interest/Anchors
Presentation
     – Augmentation
     – Registration
     – Styling/complexity
     – Spatial Filtering, e.g. range
Spatial Models
     – Coordinate Ref. Sys.
     – (Geo)location
     – Projections
     – Coordinate conversion
Tracking objects
     – Markers
     – Marker-less
User Input
     – Query
     – Manipulation of Presentation
     – Topics of interest
     – Preferences
Digital Assets
     – Presentation data
     – Trigger/Event rules
     – Accuracy based
Augmented Reality Reference Model
Computational viewpoint


[Figure: the AR Player with five numbered interfaces: 1 to the AR Document, 2 to the Local / Remote Context, 3 to the Media Servers, 4 to the Service Servers, 5 to the User.]
Augmented Reality Reference Model
Computational viewpoint


[Figure: the AR Player with five numbered interfaces: 1 to the AR Document, 2 to the Local / Remote Context, 3 to the User, 4 to the Media Servers, 5 to the Service Servers.]
Augmented Reality Reference Model
Engineering viewpoint
[Figure: the Local/Remote Context provides Camera, Mic, Accelerometer, Compass, GPS, … to the AR Player. The AR Player contains a Rendering Engine, a Display (A/V/H), an Application Engine, … and connects to the AR Document, the User, Media Servers and Service Servers.]
Augmented Reality Reference Model
Glossary
Augmented Reality Reference Model
Use cases
Augmented Reality Reference Model
How to contribute?

 Use Trac!
 http://wg11.sc29.org/trac/augmentedreality/
MPEG-A Part 13 ARAF
3 components: scene, sensors/actuators, media
 A set of scene graph nodes/protos as defined in MPEG-4 Part 11
     –   Existing nodes
          – Audio, image, video, graphics, programming, communication, user
              interactivity, animation
     –   New standard PROTOs
          – Map, MapMarker, MapOverlay, ReferenceSignal,
              ReferenceSignalLocation, CameraCalibration, AugmentationRegion

 Connection to sensors as defined in MPEG-V
     –   Orientation, Position, Angular Velocity, Acceleration, GPS, Geomagnetic,
         Altitude
     –   Local camera sensor

 Compressed media
MPEG-A Part 13 ARAF
 Scene: 63 XML Elements
Node / PROTO element names in MPEG-4 BIFS / XMT, by category and sub-category
Elementary media
     – Audio: AudioSource, Sound, Sound2D
     – Image and video: ImageTexture, MovieTexture
     – Textual information: FontStyle, Text
     – Graphics: Appearance, Color, LineProperties, LinearGradient, Material, Material2D, Rectangle, Shape, SBVCAnimationV2, SBBone, SBSegment, SBSkinnedModel, MorphShape, Coordinate, TextureCoordinate, Normal, IndexedFaceSet, IndexedLineSet
Programming
     – Script
User interactivity
     – InputSensor, SphereSensor, TimeSensor, TouchSensor, MediaSensor, PlaneSensor
Scene related information (spatial and temporal relationships)
     – AugmentationRegion, Background, Background2D, CameraCalibration, Group, Inline, Layer2D, Layer3D, Layout, NavigationInfo, OrderedGroup, ReferenceSignal, ReferenceSignalLocation, Switch, Transform, Transform2D, Viewpoint, Viewport, Form
Dynamic and animated scene
     – OrientationInterpolator, ScalarInterpolator, CoordinateInterpolator, ColorInterpolator, PositionInterpolator, Valuator
Communication and compression
     – BitWrapper, MediaControl
     – Maps: Map, MapOverlay, MapMarker
Terminal
     – TermCap
MPEG-A Part 13 ARAF
 Scene: the distance between ARAF and X3D is 32 (XML Elements)
MPEG-A Part 13 ARAF
Scene:: Reference Signal
[Figure: marker tracking. A marker is matched against a reference image, and a 3D graphic (label "Name: Park Chu-Young, Position: FW, Team: Arsenal FC") is synchronized with the movement of the marker image.]
MPEG-A Part 13 ARAF
  Scene:: Reference Signal



<ProtoDeclare name="ReferenceSignal" locations="org:mpeg:referencesignal">
  <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="referenceResources" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
  <field name="detectionHints" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="onInputDetected" type="Integer" vrml97Hint="eventOut"/>
  <field name="onError" type="Integer" vrml97Hint="eventOut"/>
</ProtoDeclare>
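As a minimal sketch of how an authoring tool might inspect this interface, the Python below parses the declaration with the standard-library xml.etree.ElementTree and groups the fields by their vrml97Hint. The attribute name `type` is an assumption about the XMT syntax, and `fields_by_hint` is a hypothetical helper, not part of any MPEG tool.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# ReferenceSignal interface as shown on the slide.
# Assumption: the field type is carried by an attribute named "type".
PROTO_XML = """
<ProtoDeclare name="ReferenceSignal" locations="org:mpeg:referencesignal">
  <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="referenceResources" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
  <field name="detectionHints" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="onInputDetected" type="Integer" vrml97Hint="eventOut"/>
  <field name="onError" type="Integer" vrml97Hint="eventOut"/>
</ProtoDeclare>
"""


def fields_by_hint(proto_xml):
    """Return {vrml97Hint: [(field name, field type), ...]} for one ProtoDeclare."""
    root = ET.fromstring(proto_xml.strip())
    grouped = defaultdict(list)
    for field in root.findall("field"):
        grouped[field.get("vrml97Hint")].append((field.get("name"), field.get("type")))
    return dict(grouped)


if __name__ == "__main__":
    # Print the exposed fields and the output events separately.
    for hint, fields in fields_by_hint(PROTO_XML).items():
        print(hint, fields)
```

Grouping by hint separates the state a scene author can set (exposedField) from the events the node emits (eventOut).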
MPEG-A Part 13 ARAF
  Scene:: Reference Signal Location


<ProtoDeclare name="ReferenceSignalLocation" locations="org:mpeg:referencesignallocation">
  <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="referenceResources" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
  <field name="detectionHints" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>

  <field name="translation" type="Vector3Array" vrml97Hint="exposedField" vector3ArrayValue=""/>
  <field name="rotation" type="Rotations" vrml97Hint="exposedField" rotationArrayValue=""/>

  <field name="onInputDetected" type="Integer" vrml97Hint="eventOut"/>
  <field name="onTranslationChanged" type="Integer" vrml97Hint="eventOut"/>
  <field name="onRotationChanged" type="Integer" vrml97Hint="eventOut"/>
  <field name="onError" type="Integer" vrml97Hint="eventOut"/>
</ProtoDeclare>
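The translation and rotation fields carry the pose of each detected reference. The sketch below shows one way a player could apply such a pose to a point of the augmentation. It assumes that rotation values follow the VRML/BIFS axis-angle convention (x y z angle, in radians) and that rotation is applied before translation, as in a VRML Transform with default center and scale. The slides do not state either point, and `apply_pose` is a hypothetical helper.

```python
import math


def rotate_axis_angle(point, axis, angle):
    """Rotate a 3D point about an axis by angle (radians) using Rodrigues' formula."""
    ax, ay, az = axis
    norm = math.sqrt(ax * ax + ay * ay + az * az)
    if norm == 0.0:
        return tuple(point)  # degenerate axis: treat as no rotation
    kx, ky, kz = ax / norm, ay / norm, az / norm
    px, py, pz = point
    c, s = math.cos(angle), math.sin(angle)
    dot = kx * px + ky * py + kz * pz
    # Cross product k x p
    cx = ky * pz - kz * py
    cy = kz * px - kx * pz
    cz = kx * py - ky * px
    return (
        px * c + cx * s + kx * dot * (1 - c),
        py * c + cy * s + ky * dot * (1 - c),
        pz * c + cz * s + kz * dot * (1 - c),
    )


def apply_pose(point, translation, rotation):
    """Rotate the point by (x, y, z, angle), then add the translation."""
    x, y, z, angle = rotation
    rx, ry, rz = rotate_axis_angle(point, (x, y, z), angle)
    tx, ty, tz = translation
    return (rx + tx, ry + ty, rz + tz)


if __name__ == "__main__":
    # A quarter turn about the z axis, then a shift of 5 units along z.
    print(apply_pose((1.0, 0.0, 0.0), (0.0, 0.0, 5.0), (0.0, 0.0, 1.0, math.pi / 2)))
```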
MPEG-A Part 13 ARAF
Scene:: Augmentation Region
[Figure: a Broadcaster's content containing an Augmentation Region, with AR service provider A and AR service provider B, and User A and User B as recipients.]
MPEG-A Part 13 ARAF
  Scene:: Augmentation Region


<ProtoDeclare name="AugmentationRegion" locations="org:mpeg:augmentationregion">
  <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="2DRegion" type="Vector2Array" vrml97Hint="exposedField" vector2ArrayValue=""/>
  <field name="arProvider" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
  <field name="translation" type="Vector3Array" vrml97Hint="exposedField" vector3ArrayValue=""/>
  <field name="rotation" type="Rotations" vrml97Hint="exposedField" rotationArrayValue=""/>
  <field name="onTranslationChanged" type="Integer" vrml97Hint="eventOut"/>
  <field name="onRotationChanged" type="Integer" vrml97Hint="eventOut"/>
  <field name="onARProviderChanged" type="Boolean" vrml97Hint="eventOut"/>
  <field name="onError" type="Integer" vrml97Hint="eventOut"/>
</ProtoDeclare>
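One thing a player may need is a check that an augmentation stays inside the declared region. The sketch below assumes that the 2DRegion values are the vertices of a polygon in a common 2D coordinate system, which the slides do not state. It uses a standard ray-casting test, and `point_in_region` is a hypothetical helper.

```python
def point_in_region(point, region):
    """Ray-casting test: True if point lies inside the polygon whose vertices are in region."""
    x, y = point
    inside = False
    n = len(region)
    for i in range(n):
        x1, y1 = region[i]
        x2, y2 = region[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point can be crossed.
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge meets that horizontal line
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


if __name__ == "__main__":
    # Hypothetical region: the unit square.
    square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
    print(point_in_region((0.5, 0.5), square))
    print(point_in_region((1.5, 0.5), square))
```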
MPEG-A Part 13 ARAF
  Scene:: Map, MapMarkers and Overlay

<ProtoDeclare name="Map" protoID="1" locations="org:mpeg:map">
  <field name="addChildren" type="Nodes" vrml97Hint="eventIn"/>
  <field name="removeChildren" type="Nodes" vrml97Hint="eventIn"/>
  <field name="addOverlays" type="Nodes" vrml97Hint="eventIn"/>
  <field name="removeOverlays" type="Nodes" vrml97Hint="eventIn"/>
  <field name="translate" type="Vector2" vrml97Hint="eventIn"/>
  <field name="zoom_in" type="Boolean" vrml97Hint="eventIn"/>
  <field name="zoom_out" type="Boolean" vrml97Hint="eventIn"/>
  <field name="gpscenter_changed" type="Vector2" vrml97Hint="eventOut"/>
  <field name="children" type="Nodes" vrml97Hint="exposedField">
    <nodes></nodes>
  </field>
  <field name="overlays" type="Nodes" vrml97Hint="exposedField">
    <nodes></nodes>
  </field>
  <field name="gpsCenter" type="Vector2" vrml97Hint="exposedField" vector2Value="0 0"/>
  <field name="mode" type="Strings" vrml97Hint="exposedField" stringArrayValue="ROADMAP"/>
  <field name="provider" type="Strings" vrml97Hint="exposedField" stringArrayValue="ANY"/>
  <field name="size" type="Vector2" vrml97Hint="exposedField" vector2Value="0 0"/>
  <field name="mapWidth" type="Float" vrml97Hint="exposedField" floatValue="0"/>
  <field name="zoomLevel" type="Integer" vrml97Hint="exposedField" integerValue="0"/>
</ProtoDeclare>
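A minimal sketch of how a player might hold the Map node's state and react to its input events. The semantics are assumptions, since the slides give only the field names: zoom_in and zoom_out are taken to step zoomLevel by one, and translate is taken to shift gpsCenter by an offset in the same units. `MapState` and its `max_zoom` bound are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class MapState:
    """Some exposed fields of the Map PROTO, with the defaults shown on the slide."""
    gps_center: tuple = (0.0, 0.0)
    mode: str = "ROADMAP"
    provider: str = "ANY"
    zoom_level: int = 0
    max_zoom: int = 20  # hypothetical upper bound, not in the slides
    events: list = field(default_factory=list)  # emitted eventOuts, kept for inspection

    def zoom_in(self, value: bool) -> None:
        # Assumed semantics: a True event raises the zoom level by one step.
        if value and self.zoom_level < self.max_zoom:
            self.zoom_level += 1

    def zoom_out(self, value: bool) -> None:
        # Assumed semantics: a True event lowers the zoom level, never below 0.
        if value and self.zoom_level > 0:
            self.zoom_level -= 1

    def translate(self, offset: tuple) -> None:
        # Assumed semantics: shift the centre and report it through gpscenter_changed.
        dx, dy = offset
        self.gps_center = (self.gps_center[0] + dx, self.gps_center[1] + dy)
        self.events.append(("gpscenter_changed", self.gps_center))


if __name__ == "__main__":
    m = MapState()
    m.zoom_in(True)
    m.translate((1.5, -2.0))
    print(m.zoom_level, m.gps_center, m.events)
```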
MPEG-A Part 13 ARAF
  Scene:: Map, MapMarkers and Overlay

<ProtoDeclare name="MapOverlay" locations="org:mpeg:mapoverlay">
  <field name="addChildren" type="Nodes" vrml97Hint="eventIn"/>
  <field name="removeChildren" type="Nodes" vrml97Hint="eventIn"/>
  <field name="children" type="Nodes" vrml97Hint="exposedField">
    <nodes></nodes>
  </field>
  <field name="keywords" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
</ProtoDeclare>

<ProtoDeclare name="MapMarker" locations="org:mpeg:mapmarker">
  <field name="addChildren" type="Nodes" vrml97Hint="eventIn"/>
  <field name="removeChildren" type="Nodes" vrml97Hint="eventIn"/>
  <field name="gpsPosition" type="Vector2" vrml97Hint="exposedField" vector2Value="0 0"/>
  <field name="children" type="Nodes" vrml97Hint="exposedField">
    <nodes></nodes>
  </field>
  <field name="keywords" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
</ProtoDeclare>
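Markers carry a gpsPosition and keywords, which is enough to illustrate the "spatial filtering, e.g. range" idea from the information viewpoint. The sketch below selects markers by keyword and great-circle distance. It assumes that gpsPosition is (latitude, longitude) in degrees, which the slides do not state. The haversine formula is a standard technique used here for illustration only, and `Marker` and `select_markers` are hypothetical names.

```python
import math
from dataclasses import dataclass

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in metres


@dataclass
class Marker:
    name: str
    gps_position: tuple  # assumed (latitude, longitude) in degrees
    keywords: tuple = ()


def haversine_m(a, b):
    """Great-circle distance in metres between two (lat, lon) pairs given in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def select_markers(markers, center, radius_m, keyword=None):
    """Keep markers within radius_m of center, optionally requiring a keyword."""
    return [m for m in markers
            if haversine_m(center, m.gps_position) <= radius_m
            and (keyword is None or keyword in m.keywords)]


if __name__ == "__main__":
    # Hypothetical markers for illustration.
    markers = [
        Marker("tower", (48.8584, 2.2945), ("monument",)),
        Marker("museum", (48.8606, 2.3376), ("museum",)),
        Marker("far", (40.0, -74.0), ("monument",)),
    ]
    near = select_markers(markers, (48.8584, 2.2945), 5000.0, keyword="monument")
    print([m.name for m in near])
```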
MPEG-A Part 13 ARAF
Sensors/Actuators

Figure (MPEG-V sensors): MPEG-V Sensor 1..3 -> InputSensor 1..3 (MPEG-4 Scene, inside the MPEG-4 Player) -> Compositor -> Screen.
Scene mapping of captured data: Acceleration Sensor, Orientation Sensor, Angular Velocity, Global Position Sensor, Altitude Sensor.

Figure (camera): Camera (hw://camera/back) -> Camera Input Stream -> RAW Decoder -> Compositor (inside the MPEG-4 Player) -> Screen.
Compositor mapping of captured data: Camera Sensor.
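
The data flow in the first figure can be sketched as a small routing table: each sensor sample arrives through an input sensor and is copied to the scene field mapped to it. The class names, the `route` helper, and field names such as `accelerationSensor.value` are hypothetical and are not taken from the ARAF specification; the sketch only illustrates the mapping of captured data into the scene.

```python
from dataclasses import dataclass, field


@dataclass
class Scene:
    """Toy stand-in for an MPEG-4 scene: a flat dictionary of field values."""
    fields: dict = field(default_factory=dict)


@dataclass
class InputSensor:
    """Hypothetical input sensor: forwards samples from one device to one scene field."""
    source: str  # sensor type delivering the samples
    target: str  # scene field that receives the value

    def push(self, scene, sample):
        scene.fields[self.target] = sample


def route(scene, sensors, samples):
    """Deliver each (source, sample) pair to the InputSensor registered for it."""
    by_source = {s.source: s for s in sensors}
    for source, sample in samples:
        sensor = by_source.get(source)
        if sensor is not None:  # samples from unmapped devices are ignored
            sensor.push(scene, sample)


scene = Scene()
sensors = [
    InputSensor("Acceleration", "accelerationSensor.value"),
    InputSensor("Orientation", "orientationSensor.value"),
]
route(scene, sensors, [
    ("Acceleration", (0.0, 9.8, 0.0)),
    ("Orientation", (0.0, 90.0, 0.0)),
    ("Altitude", 35.0),  # no InputSensor mapped, so this sample is dropped
])
print(scene.fields)
```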
MPEG-A Part 13 ARAF
Sensors/Actuators:: MPEG-V

Figure (MPEG-V architecture; numbers in parentheses as labelled in the figure):
  Virtual World: Sensed Information (5), VW Object Characteristics (4), Sensory Effects (3)
  Engine, R→V Adaptation: converts Sensed Info from RW to VW Object Char/Sensed Info applied to VW
  Engine, V→R Adaptation: converts Sensory Effects from VW into Device Cmds applied to RW
  Real World (Sensor Device): Sensed Information (5), Sensor Device Capability (2)
  User: Sensor Adaptation Preferences (2), Sensory Effects Preferences (2)
  Real World (Sensory Device): Device Commands (5), Sensory Device Capability (2)
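
The two adaptation directions can be illustrated with a minimal sketch. The capability ranges, the preference values, and the function names are all assumptions made for illustration; MPEG-V defines its own description formats for capabilities, preferences, and commands, which are not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class Capability:
    """Hypothetical device capability: the value range a device supports."""
    min_value: float
    max_value: float


def clamp(value, cap):
    """Limit a value to the range declared by the capability."""
    return max(cap.min_value, min(cap.max_value, value))


def real_to_virtual(sensed, sensor_cap, scale=1.0):
    """R->V: normalise a sensed value to [0, 1] using the sensor capability,
    then apply a user preference, assumed here to be a plain multiplier."""
    span = sensor_cap.max_value - sensor_cap.min_value
    normalised = (clamp(sensed, sensor_cap) - sensor_cap.min_value) / span
    return min(1.0, normalised * scale)


def virtual_to_real(effect_intensity, device_cap, max_preferred=1.0):
    """V->R: turn an effect intensity in [0, 1] into a device command,
    capped by the user preference and mapped onto the device range."""
    intensity = max(0.0, min(effect_intensity, max_preferred))
    span = device_cap.max_value - device_cap.min_value
    return device_cap.min_value + intensity * span


light_sensor = Capability(0.0, 1000.0)  # illustrative range for a light sensor
fan = Capability(0.0, 3000.0)           # illustrative range for a wind actuator

print(real_to_virtual(250.0, light_sensor))          # normalised sensed value
print(virtual_to_real(0.8, fan, max_preferred=0.5))  # command capped by preference
```

Keeping the capability and preference inputs separate mirrors the figure, where device capabilities come from the real-world devices and preferences come from the user.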
MPEG-A Part 13 ARAF
 Sensors/Actuators:: MPEG-V types

Sensors                   Sensors (continued)                 Actuators
Light                     Global position                     Light
Ambient noise             Altitude                            Flash
Temperature               Bend                                Heating
Humidity                  Gas                                 Cooling
Distance                  Dust                                Wind
Atmospheric pressure      Body height                         Vibration
Position                  Body weight                         Sprayer
Velocity                  Body temperature                    Scent
Acceleration              Body fat                            Fog
Orientation               Blood type                          Color correction
Angular velocity          Blood pressure                      Initialize color correction parameter
Angular acceleration      Blood sugar                         Rigid body motion
Force                     Blood oxygen                        Tactile
Torque                    Heart rate                          Kinesthetic
Pressure                  Electrograph                        Global position command
Motion                    EEG, ECG, EMG, EOG, GSR
Intelligent camera type   Weather
Multi Interaction point   Facial expression
Gaze tracking             Facial morphology
Wind                      Facial expression characteristics
                          Geomagnetic
MPEG-A Part 13 ARAF
Compression

Media         Compression tool name             Reference standard
Image         JPEG                              ISO/IEC 10918
              JPEG2000                          ISO/IEC 15444
Video         Visual                            ISO/IEC 14496-2
              Advanced Video Coding             ISO/IEC 14496-10
Audio         MP3                               ISO/IEC 11172-3
              Advanced Audio Coding             ISO/IEC 14496-3
3D Graphics   Scalable Complexity Mesh Coding   ISO/IEC 14496-16
              Bone-based Animation              ISO/IEC 14496-16
Scenes        BIFS                              ISO/IEC 14496-11
MPEG-A Part 13 ARAF
Exercises

 AR Quiz: http://youtu.be/la-Oez0aaHE
 Augmented Book: http://youtu.be/LXZUbAFPP-Y
MPEG-A Part 13 ARAF
AR Quiz setting: preparing the media

 images, videos, audio, 2D/3D assets
 GPS location
MPEG-A Part 13 ARAF
AR Quiz XML inspection
   http://tiny.cc/MPEGARQuiz
MPEG-A Part 13 ARAF
AR Quiz Authoring Tool

 www.MyMultimediaWorld.com, then go to Create / Augmented Reality
MPEG-A Part 13 ARAF
Augmented Book setting

 images, audio
MPEG-A Part 13 ARAF
Augmented Book XML inspection
  http://tiny.cc/MPEGAugBook
MPEG-A Part 13 ARAF
Augmented Book Authoring Tool

 www.MyMultimediaWorld.com, then go to Create / Augmented Books
MPEG-A Part 13 ARAF
Next Steps

 Support for metadata at scene and object level
 Support for usage rights at scene and object level
 Collisions between real and virtual objects, partial rendering
ARAF distance to X3D

On Scene Graph
        –   32 elements
             – including 2D graphics, humanoid animation, generic
                input, media control, and pure AR protos
On Sensors/Actuators
        –   6 elements
On Compression
        –   MPEG-4 Part 25 already compresses X3D
Conclusions
 • Joint development of AR Reference Model
    – The community at large is invited to react/contribute
      so that the model becomes a reference
    – http://wg11.sc29.org/trac/augmentedreality
 • MPEG promoted a first version of an integrated and
   consistent solution for representing content in AR
   applications and services
    – Continue synchronized/harmonized development of
      technical specifications with X3D, COLLADA, OGC
      content models

MPEG Augmented Reality Tutorial

  • 1. MPEG Augmented Reality Tutorial Web3D Conference, August 4-5, Los Angeles, CA Marius Preda, MPEG 3DG Chair Institut Mines TELECOM http://www.slideshare.net/MariusPreda/mpeg-augmented-reality-tutorial
  • 2. MPEG Augmented Reality Tutorial Topics of the day What is MPEG? MPEG offer in the Augmented Reality field MPEG-A Part 14 Augmented Reality Reference Model MPEG-A Part 13 Augmented Reality Application Format
  • 3. MPEG Augmented Reality Tutorial Topics of the day What is MPEG? MPEG offer in the Augmented Reality field MPEG-A Part 14 Augmented Reality Reference Model MPEG-A Part 13 Augmented Reality Application Format
  • 4. What is MPEG? A suite of ~130 ISO/IEC standards Coding/compression of elementary media: – Audio (MPEG-1, 2 and 4) – Video (MPEG-1, 2 and 4) – 2D/3D graphics (MPEG-4) Storage and Transport – MPEG-2 Transport – File Format (MPEG-4) – Dynamic Adaptive Streaming over HTTP (DASH) Hybrid (natural & synthetic) scene description, user interaction (MPEG-4) Metadata (MPEG-7) Media management and protection (MPEG-21) Sensors and actuators, virtual worlds (MPEG-V) Advanced User interaction (MPEG-U) Media-oriented middleware (MPEG-M) More ISO/IEC standards under development for – 3D Video, 3D Audio – Coding and Delivery in Heterogeneous Environments – …
  • 5. What is MPEG? Involvement, approach, deployment A standardization activity continuing for 24 years – Supported by several hundreds companies/organisations from ~25 countries – ~500 experts participating in quarterly meetings – More than 2300 active contributors – Many thousands experts working in companies A proven manner to organize the work to deliver useful and used standards – Developing standards by integrating individual technologies – Well defined procedures – Subgroups with clear objectives – Ad hoc groups continuing coordinated work between meetings MPEG standards are widely referenced by industry – 3GPP, ARIB, ATSC, DVB, DVD-Forum, BDA, EITSI, SCTE, TIA, DLNA, DECE, OIPF… Billions of software and hardware devices built on MPEG technologies – MP3 players, cameras, mobile handsets, PCs, DVD/Blue-Ray players, STBs, TVs, …
  • 6. MPEG Augmented Reality Tutorial Topics of the day What is MPEG? MPEG offer in the Augmented Reality field MPEG-A Part 14 Augmented Reality Reference Model MPEG-A Part 13 Augmented Reality Application Format
  • 7. MPEG technologies related to AR 1992/4 1997 1998 1999 MPEG-1/2 (AV content) VRML MPEG-4 v.1 • Part 11 - BIFS: -Binarisation of VRML -Extensions for streaming -Extensions for server command -Extensions for 2D graphics - Real time augmentation with audio & video • Part 2 - Visual: - 3D Mesh compression MPEG-4 v.2 - Face animation • Part 2 – Visual - Body animation First form of broadcast signal augmentation
  • 8. MPEG technologies related to AR 2003 2005 2007 2011 MPEG-4 • Part 16 - AFX: - A rich set of 3D MPEG-4 graphics tools •AFX 2nd Edition: - Compression of - Animation by geometry, morphing appearance, - Multi-texturing animation MPEG-4 • AFX 3rd Edition - WSS for terrain and cities A rich set of 3D Graphics - Frame based MPEG-4 representation and animation • AFX 4th Edition compression tools - Scalable complexity mesh coding
  • 9. MPEG technologies related to AR 2003 2004 2005 2007 2009 2011 MPEG-4 • Part 16 - AFX: - A rich set of 3D MPEG-4 graphics tools •AFX 2nd Edition: - Compression of - Animation by geometry, morphing appearance, - Multi-texturing animation MPEG-4 • AFX 3rd Edition - WSS for terrain and cities MPEG-4 MPEG-4 - Frame based MPEG-4 • Part 16 • Part 25 animation • AFX 4th Edition - X3D - Compression of - Scalable complexity Interactive third-party XML mesh coding Profile (X3D, COLLADA)
  • 10. MPEG technologies related to AR 2011 2012 201x 201x 201x MPEG-V - Media Context and Control MPEG-U – • 1st Edition Advanced - Sensors and User Interface MPEG-V actuators • 2nd Edition: - Interoperability - GPS between Virtual - Biosensors Worlds - 3D Camera MPEG-H - 3D Video • Compression A rich set of sensors and CDVS of video + actuators • Feature-point based depth - 3D Audio descriptors for image recognition
  • 11. Main features of MPEG AR technologies All AR-related data is available from MPEG standards Real time composition of synthetic and natural objects Access to – Remotely/locally stored BIFS/compressed 2D/3D mesh objects – Streamed real-time BIFS/compressed 2D/3D mesh objects Inherent object scalability (e.g. for streaming) User interaction & server generated scene changes Physical context – Captured by a broad range of standard sensors – Affected by a broad range of standard actuators
  • 12. MPEG vision on AR, the MPEG AR Browser Point to a URL – no need to download new applications for each context. The browser – Retrieves scenario from the internet – Starts video acquisition – Tracks objects – Recognizes objects from visual signatures – Recovers camera pose – Gets streamed 3D graphics – Composes new scenes – Gets inputs from various sensors – Offers optimal AR experience by constantly adapting interaction possibilities and objects from a remote server. Industry – Maximize number of customers through MPEG-compliant authoring tools and browsers – No need to develop a new application for each use case and device platform
  • 13. MPEG vision on AR (diagram). Labels: Authoring Tool, Produce, Compression (MPEG-4/MPEG-7/MPEG-21/MPEG-U/MPEG-V), Download, MPEG Player
  • 14. Architecture (diagram). Components: User; AR Player; AR file or stream; Local Sensors & Actuators with the Local Real World Environment; Remote Sensors & Actuators with the Remote Real World Environment; Media Servers; Service Servers
  • 15. MPEG ongoing work on AR
      ISO/IEC 23000-14 Augmented Reality Reference Model
        - WD stage, collaborating with SC24/WG9, ARStandards, OGC, Khronos, Web3D
      ISO/IEC 23000-13 Augmented Reality Application Format
        - CD stage, based on MPEG standards
  • 16. MPEG Augmented Reality Tutorial Topics of the day What is MPEG? MPEG offer in the Augmented Reality field MPEG-A Part 14 Augmented Reality Reference Model MPEG-A Part 13 Augmented Reality Application Format
  • 17. Augmented Reality Reference Model WD2.0 content
      Viewpoints
        - Community objectives: Enterprise Viewpoint
        - Abstract/Design: Information Viewpoint, Computational Viewpoint
        - Implementation/Development: Engineering Viewpoint, Technology Viewpoint
      Glossary
      Use cases
        - Guide
        - Create
        - Play
  • 18. Augmented Reality Reference Model Enterprise viewpoint: global architecture and actors
      Diagram components: Local/Remote Context, User, AR Player, AR Document, Media Servers, Service Servers
      Actors
        - Telecommunication Operator (TO)
        - End-User (EU)
        - Device Manufacturer (DM)
        - Middleware/Component Provider (MCP)
        - AR Tools Creator (ARTC)
        - AR Experience Creator (AREC)
        - Online Middleware/Component Provider (OMCP)
        - Assets Creator (AC)
        - AR Service Provider (ARSP)
        - Assets Aggregator (AA)
  • 19. Augmented Reality Reference Model Information viewpoint
      Diagram components: Local/Remote Context, User, AR Player, AR Document, Media Servers, Service Servers
      Scene/Real World
        - Raw image
        - Sensed data
        - Virtual Camera view
        - Detected features
        - Area of Interest/Anchors
      Device Context
        - Device capabilities
        - Location of Device: Location, Orientation
      Presentation
        - Augmentation
        - Registration
        - Styling/complexity
      Spatial Models
        - Coordinate Ref. Sys.
        - (Geo)location
        - Projections
        - Coordinate conversion
        - Spatial Filtering, e.g. range
      User Input
        - Query
        - Manipulation of Presentation
        - Topics of interest
        - Preferences
      Tracking objects
        - Markers
        - Marker-less
      Digital Assets
        - Presentation data
        - Trigger/Event rules
        - Accuracy based
  • 20. Augmented Reality Reference Model Computational viewpoint (diagram): interactions numbered 1 to 5 among Local/Remote Context, User, AR Player, AR Document, Media Servers and Service Servers
  • 21. Augmented Reality Reference Model Computational viewpoint (diagram, continued): interactions numbered 1 to 5 among Local/Remote Context, User, AR Player, AR Document, Media Servers and Service Servers
  • 22. Augmented Reality Reference Model Engineering viewpoint
      Local/Remote Context: Camera, Mic, Accelerometer, Compass, GPS, …
      AR Player: Application Engine, Rendering Engine, Display (A/V/H), …
      Also shown: User, AR Document, Media Servers, Service Servers
  • 23. Augmented Reality Reference Model Glossary
  • 24. Augmented Reality Reference Model Use cases
  • 25. Augmented Reality Reference Model How to contribute? Use Trac! http://wg11.sc29.org/trac/augmentedreality/
  • 26. MPEG Augmented Reality Tutorial Topics of the day What is MPEG? MPEG offer in the Augmented Reality field MPEG-A Part 14 Augmented Reality Reference Model MPEG-A Part 13 Augmented Reality Application Format
  • 27. MPEG-A Part 13 ARAF
      3 components: scene, sensors/actuators, media
      A set of scene graph nodes/protos as defined in MPEG-4 Part 11
        - Existing nodes: audio, image, video, graphics, programming, communication, user interactivity, animation
        - New standard PROTOs: Map, MapMarker, MapOverlay, ReferenceSignal, ReferenceSignalLocation, CameraCalibration, AugmentationRegion
      Connection to sensors as defined in MPEG-V
        - Orientation, Position, Angular Velocity, Acceleration, GPS, Geomagnetic, Altitude
        - Local camera sensor
      Compressed media
  • 28. MPEG-A Part 13 ARAF Scene: 63 XML Elements
      Category / Sub-category: Node, Protos / Elements name in MPEG-4 BIFS / XMT
      Elementary media / Audio: AudioSource, Sound, Sound2D
      Elementary media / Image and video: ImageTexture, MovieTexture
      Elementary media / Textual information: FontStyle, Text
      Elementary media / Graphics: Appearance, Color, LineProperties, LinearGradient, Material, Material2D, Rectangle, Shape, SBVCAnimationV2, SBBone, SBSegment, SBSkinnedModel, MorphShape, Coordinate, TextureCoordinate, Normal, IndexedFaceSet, IndexedLineSet
      Programming: Script
      User interactivity: InputSensor, SphereSensor, TimeSensor, TouchSensor, MediaSensor, PlaneSensor
      Scene related information (spatial and temporal relationships): AugmentationRegion, Background, Background2D, CameraCalibration, Group, Inline, Layer2D, Layer3D, Layout, NavigationInfo, OrderedGroup, ReferenceSignal, ReferenceSignalLocation, Switch, Transform, Transform2D, Viewpoint, Viewport, Form
      Dynamic and animated scene: OrientationInterpolator, ScalarInterpolator, CoordinateInterpolator, ColorInterpolator, PositionInterpolator, Valuator
      Communication and compression: BitWrapper, MediaControl
      Maps: Map, MapOverlay, MapMarker
      Terminal: TermCap
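The count of 63 on this slide can be cross-checked by tallying the element names per sub-category. The sketch below is illustrative only: the grouping is read from the flattened slide table, and the names `ARAF_ELEMENTS` and `count_elements` are hypothetical helpers, not part of any MPEG specification.

```python
# Element names as they appear on the slide, grouped by sub-category.
# Assumption: the grouping is reconstructed from the slide's two-column table.
ARAF_ELEMENTS = {
    "Audio": ["AudioSource", "Sound", "Sound2D"],
    "Image and video": ["ImageTexture", "MovieTexture"],
    "Textual information": ["FontStyle", "Text"],
    "Graphics": [
        "Appearance", "Color", "LineProperties", "LinearGradient", "Material",
        "Material2D", "Rectangle", "Shape", "SBVCAnimationV2", "SBBone",
        "SBSegment", "SBSkinnedModel", "MorphShape", "Coordinate",
        "TextureCoordinate", "Normal", "IndexedFaceSet", "IndexedLineSet",
    ],
    "Programming": ["Script"],
    "User interactivity": [
        "InputSensor", "SphereSensor", "TimeSensor", "TouchSensor",
        "MediaSensor", "PlaneSensor",
    ],
    "Scene related information": [
        "AugmentationRegion", "Background", "Background2D", "CameraCalibration",
        "Group", "Inline", "Layer2D", "Layer3D", "Layout", "NavigationInfo",
        "OrderedGroup", "ReferenceSignal", "ReferenceSignalLocation", "Switch",
        "Transform", "Transform2D", "Viewpoint", "Viewport", "Form",
    ],
    "Dynamic and animated scene": [
        "OrientationInterpolator", "ScalarInterpolator", "CoordinateInterpolator",
        "ColorInterpolator", "PositionInterpolator", "Valuator",
    ],
    "Communication and compression": ["BitWrapper", "MediaControl"],
    "Maps": ["Map", "MapOverlay", "MapMarker"],
    "Terminal": ["TermCap"],
}


def count_elements(table):
    """Return the total number of element names across all sub-categories."""
    return sum(len(names) for names in table.values())


total = count_elements(ARAF_ELEMENTS)
print(total)
```

With the grouping above, the total agrees with the 63 elements stated in the slide title.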
  • 29. MPEG-A Part 13 ARAF Scene: the distance between ARAF and X3D is 32 XML elements (same element table as slide 28)
  • 30. MPEG-A Part 13 ARAF Scene:: Reference Signal (figure: marker tracking). A marker reference image is tracked, and a 3D graphic (label "Name: Park Chu-Young, Position: FW, Team: Arsenal FC") is synchronized with the movement of the marker image
  • 31. MPEG-A Part 13 ARAF Scene:: Reference Signal
      <ProtoDeclare name="ReferenceSignal" locations="org:mpeg:referencesignal">
        <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="referenceResources" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
        <field name="detectionHints" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="onInputDetected" type="Integer" vrml97Hint="eventOut"/>
        <field name="onError" type="Integer" vrml97Hint="eventOut"/>
      </ProtoDeclare>
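To inspect a proto declaration like the one on this slide, a generic XML parser is enough. The following is a minimal sketch, not an ARAF player: it assumes the field type is carried by an attribute named `type` (the slide transcript shows only `="Strings"`, so the attribute name is an assumption), and the helper `list_fields` is hypothetical.

```python
import xml.etree.ElementTree as ET

# ReferenceSignal declaration transcribed from the slide.
# Assumption: the attribute holding the field type is called "type".
PROTO_XML = """
<ProtoDeclare name="ReferenceSignal" locations="org:mpeg:referencesignal">
  <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="referenceResources" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
  <field name="detectionHints" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
  <field name="onInputDetected" type="Integer" vrml97Hint="eventOut"/>
  <field name="onError" type="Integer" vrml97Hint="eventOut"/>
</ProtoDeclare>
"""


def list_fields(xml_text):
    """Return (name, type, vrml97Hint) for each <field> directly under the proto."""
    root = ET.fromstring(xml_text.strip())
    return [
        (f.get("name"), f.get("type"), f.get("vrml97Hint"))
        for f in root.findall("field")
    ]


fields = list_fields(PROTO_XML)
# Fields the proto can emit (events out) versus fields an author can set.
event_outs = [name for name, _type, hint in fields if hint == "eventOut"]
exposed = [name for name, _type, hint in fields if hint == "exposedField"]
print(event_outs)
print(exposed)
```

Separating `eventOut` fields from `exposedField` fields shows at a glance what a scene author configures (source, reference resources, hints) and what the node reports back (detection and error events).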
  • 32. MPEG-A Part 13 ARAF Scene:: Reference Signal Location
      <ProtoDeclare name="ReferenceSignalLocation" locations="org:mpeg:referencesignallocation">
        <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="referenceResources" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
        <field name="detectionHints" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="translation" type="Vector3Array" vrml97Hint="exposedField" vector3ArrayValue=""/>
        <field name="rotation" type="Rotations" vrml97Hint="exposedField" rotationArrayValue=""/>
        <field name="onInputDetected" type="Integer" vrml97Hint="eventOut"/>
        <field name="onTranslationChanged" type="Integer" vrml97Hint="eventOut"/>
        <field name="onRotationChanged" type="Integer" vrml97Hint="eventOut"/>
        <field name="onError" type="Integer" vrml97Hint="eventOut"/>
      </ProtoDeclare>
  • 33. MPEG-A Part 13 ARAF Scene:: Augmentation Region (diagram). Labels: Broadcaster, Augmentation Region, AR service provider A, AR service provider B, User A, User B
  • 34. MPEG-A Part 13 ARAF Scene:: Augmentation Region
      <ProtoDeclare name="AugmentationRegion" locations="org:mpeg:augmentationregion">
        <field name="source" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="2DRegion" type="Vector2Array" vrml97Hint="exposedField" vector2ArrayValue=""/>
        <field name="arProvider" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
        <field name="enabled" type="Boolean" vrml97Hint="exposedField" booleanValue="false"/>
        <field name="translation" type="Vector3Array" vrml97Hint="exposedField" vector3ArrayValue=""/>
        <field name="rotation" type="Rotations" vrml97Hint="exposedField" rotationArrayValue=""/>
        <field name="onTranslationChanged" type="Integer" vrml97Hint="eventOut"/>
        <field name="onRotationChanged" type="Integer" vrml97Hint="eventOut"/>
        <field name="onARProviderChanged" type="Boolean" vrml97Hint="eventOut"/>
        <field name="onError" type="Integer" vrml97Hint="eventOut"/>
      </ProtoDeclare>
  • 35. MPEG-A Part 13 ARAF Scene:: Map, MapMarkers and Overlay
  • 36. MPEG-A Part 13 ARAF Scene:: Map, MapMarkers and Overlay
      <ProtoDeclare name="Map" protoID="1" locations="org:mpeg:map">
        <field name="addChildren" type="Nodes" vrml97Hint="eventIn"/>
        <field name="removeChildren" type="Nodes" vrml97Hint="eventIn"/>
        <field name="addOverlays" type="Nodes" vrml97Hint="eventIn"/>
        <field name="removeOverlays" type="Nodes" vrml97Hint="eventIn"/>
        <field name="translate" type="Vector2" vrml97Hint="eventIn"/>
        <field name="zoom_in" type="Boolean" vrml97Hint="eventIn"/>
        <field name="zoom_out" type="Boolean" vrml97Hint="eventIn"/>
        <field name="gpscenter_changed" type="Vector2" vrml97Hint="eventOut"/>
        <field name="children" type="Nodes" vrml97Hint="exposedField">
          <nodes></nodes>
        </field>
        <field name="overlays" type="Nodes" vrml97Hint="exposedField">
          <nodes></nodes>
        </field>
        <field name="gpsCenter" type="Vector2" vrml97Hint="exposedField" vector2Value="0 0"/>
        <field name="mode" type="Strings" vrml97Hint="exposedField" stringArrayValue="ROADMAP"/>
        <field name="provider" type="Strings" vrml97Hint="exposedField" stringArrayValue="ANY"/>
        <field name="size" type="Vector2" vrml97Hint="exposedField" vector2Value="0 0"/>
        <field name="mapWidth" type="Float" vrml97Hint="exposedField" floatValue="0"/>
        <field name="zoomLevel" type="Integer" vrml97Hint="exposedField" integerValue="0"/>
      </ProtoDeclare>
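The Map proto exposes `gpsCenter` and `zoomLevel`, but the slides do not say how a map provider turns them into pixels. As a rough illustration only, the sketch below assumes the Web Mercator tiling scheme with 256-pixel tiles that many online map providers use; the function name, the tile size, and the (latitude, longitude) argument order are all assumptions and are not taken from ARAF.

```python
import math

TILE_SIZE = 256  # assumption: 256-pixel tiles, as in common web map services


def gps_to_world_pixel(lat_deg, lon_deg, zoom_level):
    """Project a GPS position to world pixel coordinates (Web Mercator).

    At zoom level z the whole world is a square of TILE_SIZE * 2**z pixels,
    with x growing eastwards and y growing southwards.
    """
    scale = TILE_SIZE * (2 ** zoom_level)
    x = (lon_deg + 180.0) / 360.0 * scale
    lat_rad = math.radians(lat_deg)
    # asinh(tan(lat)) is the Mercator "stretched" latitude.
    y = (1.0 - math.asinh(math.tan(lat_rad)) / math.pi) / 2.0 * scale
    return x, y


# The default gpsCenter "0 0" at zoomLevel 0 lands in the middle of the single tile.
center_x, center_y = gps_to_world_pixel(0.0, 0.0, 0)
print(center_x, center_y)
```

Under this assumption, each increment of `zoomLevel` doubles the pixel size of the world, which is why `zoom_in` and `zoom_out` can be modelled as simple Boolean events.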
  • 37. MPEG-A Part 13 ARAF Scene:: Map, MapMarkers and Overlay
      <ProtoDeclare name="MapOverlay" locations="org:mpeg:mapoverlay">
        <field name="addChildren" type="Nodes" vrml97Hint="eventIn"/>
        <field name="removeChildren" type="Nodes" vrml97Hint="eventIn"/>
        <field name="children" type="Nodes" vrml97Hint="exposedField">
          <nodes></nodes>
        </field>
        <field name="keywords" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
      </ProtoDeclare>
      <ProtoDeclare name="MapMarker" locations="org:mpeg:mapmarker">
        <field name="addChildren" type="Nodes" vrml97Hint="eventIn"/>
        <field name="removeChildren" type="Nodes" vrml97Hint="eventIn"/>
        <field name="gpsPosition" type="Vector2" vrml97Hint="exposedField" vector2Value="0 0"/>
        <field name="children" type="Nodes" vrml97Hint="exposedField">
          <nodes></nodes>
        </field>
        <field name="keywords" type="Strings" vrml97Hint="exposedField" stringArrayValue=""/>
      </ProtoDeclare>
  • 38. MPEG-A Part 13 ARAF Sensors/Actuators (diagrams)
      MPEG-V sensors: data captured by MPEG-V Sensors 1 to 3 (Acceleration Sensor, Orientation Sensor, Angular Velocity, Global Position Sensor, Altitude Sensor) is mapped to InputSensor 1 to 3 in the MPEG-4 Scene of the MPEG-4 Player, then passed to the Compositor and the Screen
      Camera Sensor: the Camera Input (hw://camera/back) delivers a Camera Stream whose captured data goes through the RAW Decoder and the Compositor of the MPEG-4 Player to the Screen
  • 39. MPEG-A Part 13 ARAF Sensors/Actuators:: MPEG-V (architecture diagram; parenthesized numbers as shown on the slide)
      Virtual World: Sensed Information (5), VW Object Characteristics (4), Sensory Effects (3)
      Engine
        - R→V Adaptation: converts Sensed Info from RW to VW Object Char/Sensed Info applied to VW
        - V→R Adaptation: converts Sensory Effects from VW into Device Cmds applied to RW
      Between Engine and Real World: Sensed Information (5), Device Commands (5), Sensor Device Capability (2), Sensor Adaptation Preferences (2), Sensory Effects Preferences (2), Sensory Device Capability (2)
      Real World (Sensor Device), User, Real World (Sensory Device)
  • 40. MPEG-A Part 13 ARAF Sensors/Actuators:: MPEG-V
  • 41. MPEG-A Part 13 ARAF Sensors/Actuators:: MPEG-V types
      Sensors
        - Light, Ambient noise, Temperature, Humidity, Distance, Atmospheric pressure, Position, Velocity, Acceleration, Orientation, Angular velocity, Angular acceleration, Force, Torque, Pressure, Motion, Intelligent camera type, Multi Interaction point, Gaze tracking, Wind
        - Global position, Altitude, Bend, Gas, Dust, Body height, Body weight, Body temperature, Body fat, Blood type, Blood pressure, Blood sugar, Blood oxygen, Heart rate, Electrograph (EEG, ECG, EMG, EOG, GSR), Weather, Facial expression, Facial morphology, Facial expression characteristics, Geomagnetic
      Actuators
        - Light, Flash, Heating, Cooling, Wind, Vibration, Sprayer, Scent, Fog, Color correction, Initialize color correction parameter, Rigid body motion, Tactile, Kinesthetic, Global position command
  • 42. MPEG-A Part 13 ARAF Compression
      Media        Compression tool name            Reference standard
      Image        JPEG                             ISO/IEC 10918
                   JPEG2000                         ISO/IEC 15444
      Video        Visual                           ISO/IEC 14496-2
                   Advanced Video Coding            ISO/IEC 14496-10
      Audio        MP3                              ISO/IEC 11172-3
                   Advanced Audio Coding            ISO/IEC 14496-3
      3D Graphics  Scalable Complexity Mesh Coding  ISO/IEC 14496-16
                   Bone-based Animation             ISO/IEC 14496-16
      Scenes       BIFS                             ISO/IEC 14496-11
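The compression table on this slide can be read as a simple lookup from media type to the tools and standards ARAF relies on. The sketch below encodes exactly the rows shown on the slide; the names `ARAF_COMPRESSION` and `standards_for` are hypothetical and only serve the illustration.

```python
# Rows from the slide: media type -> list of (tool name, reference standard).
ARAF_COMPRESSION = {
    "Image": [("JPEG", "ISO/IEC 10918"), ("JPEG2000", "ISO/IEC 15444")],
    "Video": [
        ("Visual", "ISO/IEC 14496-2"),
        ("Advanced Video Coding", "ISO/IEC 14496-10"),
    ],
    "Audio": [
        ("MP3", "ISO/IEC 11172-3"),
        ("Advanced Audio Coding", "ISO/IEC 14496-3"),
    ],
    "3D Graphics": [
        ("Scalable Complexity Mesh Coding", "ISO/IEC 14496-16"),
        ("Bone-based Animation", "ISO/IEC 14496-16"),
    ],
    "Scenes": [("BIFS", "ISO/IEC 14496-11")],
}


def standards_for(media_type):
    """Return the sorted, de-duplicated reference standards for a media type.

    Unknown media types yield an empty list rather than raising.
    """
    return sorted({std for _tool, std in ARAF_COMPRESSION.get(media_type, [])})


print(standards_for("3D Graphics"))
print(standards_for("Image"))
```

Both 3D graphics tools point to the same part of MPEG-4, so the lookup collapses them into a single reference.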
  • 43. MPEG-A Part 13 ARAF Exercises AR Quiz Augmented Book
  • 44. MPEG-A Part 13 ARAF Exercises AR Quiz Augmented Book http://youtu.be/la-Oez0aaHE http://youtu.be/LXZUbAFPP-Y
  • 45. MPEG-A Part 13 ARAF AR Quiz setting: preparing the media (images, video, audio, 2D/3D assets) and the GPS location
  • 46. MPEG-A Part 13 ARAF AR Quiz XML inspection http://tiny.cc/MPEGARQuiz
  • 47. MPEG-A Part 13 ARAF AR Quiz Authoring Tool www.MyMultimediaWorld.com go to Create / Augmented Reality
  • 48. MPEG-A Part 13 ARAF Augmented Book setting: images and audio
  • 49. MPEG-A Part 13 ARAF Augmented Book XML inspection http://tiny.cc/MPEGAugBook
  • 50. MPEG-A Part 13 ARAF Augmented Book Authoring Tool www.MyMultimediaWorld.com go to Create / Augmented Books
  • 51. MPEG-A Part 13 ARAF Next Steps
      Support for metadata at scene and object level
      Support for usage rights at scene and object level
      Collisions between real and virtual objects, partial rendering
  • 52. ARAF distance to X3D
      On Scene Graph
        - 32 elements, including 2D graphics, humanoid animation, generic input, media control, and pure AR protos
      On Sensors/Actuators
        - 6 elements
      On Compression
        - MPEG-4 Part 25 already compresses X3D
  • 53. Conclusions
      Joint development of the AR Reference Model
        - The community at large is invited to react and contribute so that the model becomes a reference
        - http://wg11.sc29.org/trac/augmentedreality
      MPEG promoted a first version of an integrated and consistent solution for representing content in AR applications and services
        - Continue synchronized/harmonized development of technical specifications with X3D, COLLADA, OGC content models