1. MPEG-H AUDIO ALLIANCE
The Next Generation System
For Interactive And Immersive Sound
STEREO IMMERSIVE
2. Introducing MPEG-H Audio:
The Next-Generation System for Interactive and Immersive Sound
With consumers viewing – and hearing
– content on more devices and in more
places than ever before, broadcasters
are increasingly pressured to meet
demands for new and better experiences
in a cost-effective manner. Consequently,
broadcasters are assessing the
capabilities of existing infrastructures
and determining how new developments
in audio and video technology will affect
their ability to deliver enhanced services
to a broad array of end-user technologies
– from high-end home theaters, to tablets
and smart phones.
The new MPEG-H Audio standard allows
broadcasters to meet these challenges
by elevating the sound quality of their
offerings beyond 5.1 Surround Sound
while incorporating new groundbreaking
interactive features in a rapid and cost-
effective manner.
3. A COLLABORATIVE APPROACH
TO BRINGING MPEG-H AUDIO TO THE WORLD’S DEVICES
Three industry leaders are collaborating to make open standards-based MPEG-H Audio a reality. Each one contributes key technical elements
as well as long-standing expertise within their market segments.
Fraunhofer IIS is the principal developer of
MPEG audio standards, providing half
the world’s TV surround sound, with its
technology already used in more than 7
billion devices today.
Technicolor is a co-developer of MP3 and is
the leading provider of production services
for content creators and distributors all over
the world.
Qualcomm is a global supplier of chipsets
and software for mobile devices delivered
around the globe.
The combined experience and broad-market presence represented by these three companies has not only been critical in developing MPEG-H
Audio, but also provides the foundation for successfully bringing exciting new audio features to market. As evidence of the promise held by
MPEG-H Audio, Qualcomm Technologies is already incorporating support for the standard into its roadmap for future mobile chipsets. This is an
important step toward widespread distribution of new audio functionality across a range of consumer devices.
The MPEG-H Audio alliance is working together to address the requirements of broadcasters in forums such as the Advanced Television Systems
Committee (ATSC), as that body considers choices for the ATSC 3.0 next-generation broadcast standard. By developing MPEG-H Audio as
an open standard, the consortium has demonstrated a strong commitment to ensuring that next-generation interactive and immersive audio
capabilities can be leveraged and commercialized throughout the industry.
NEW BREAKTHROUGH CAPABILITIES FOR HOME AND MOBILE AUDIO
MPEG-H Audio lays the foundation for broadcasters to deliver a more personalized, interactive, and immersive audio experience for end-users
by offering:
»» Interactive “sound mixing” through
object coding, which allows viewers
to customize the levels of different
sound elements – boosting selected
commentary or creating a “home team”
mix for sports broadcasts, for example.
»» Rich 3-D sound with the ability to
capitalize on additional front- and rear-
height speaker channels. This enhances
today’s surround sound broadcasts and
creates a truly realistic audio experience.
»» Higher Order Ambisonics (HOA),
to provide a fully immersive sound
experience that is ideal for live
broadcasts and performances, such as
sporting events.
New interactive menu features will make MPEG-H Audio-based offerings a key service for customers interested in more control over their
listening experience, whether at home or on mobile devices. MPEG-H Audio content will also automatically optimize audio playback across
different speaker configurations or headsets, allowing consumers to enjoy the best sound quality possible – no matter where they are or what
device they use.
4. MPEG-H AUDIO OFFERS AN EASY, STAGED IMPLEMENTATION PROCESS
The alliance has developed a roadmap for MPEG-H Audio deployment that allows broadcasters to add new functionality at the rate and pace of
their choosing, while preserving existing investments in technology and processes. The standard is backward-compatible with the systems and
practices currently used for AC-3 or HE-AAC surround sound broadcasting. The four-staged approach allows broadcasters to:
1Transmit today’s surround
sound using current metadata,
loudness profiles, and practices
– but with a 50% reduction in
the bit rate (when compared to
AC-3) due to improved coding
efficiency.
2Add interactive objects so that
viewers can adjust elements of the
audio – such as dialogue or sound
effects – to their individual tastes.
Viewers can also select alternate
languages or sound tracks to
receive a sound experience
tailored to their preferences.
3Add 3-D sound with height
channels, including the option to
use HOA.
4Add dynamic objects (audio
objects panned to track video
action).
MPEG-H Audio decoders will offer immediate support for all four stages. Broadcasters may introduce selected features whenever they choose.
As a result, MPEG-H Audio encoding is poised to become the end-to-end solution of choice for broadcasters interested in offering exciting audio
experiences that provide consumers with new levels of choice and control while offering unparalleled sound quality, performance and flexibility.
Cable/Satellite OperatorNetwork OperationsRemote Truck
Audio
Console
Audio
Console
PCM Audio
Panning Data
Playout
Server
Master
Control
Switcher
SDI
SDI
Adaptive Streaming Segments
Sat or
IP Link
Playout
Server
SDI
SDISDI
Transport
Stream
Ingest/QC
Sat or IP Link
Commercial
Server
Transport
Stream
File Transfer
OTA
Transport
Stream
Monitor
Mode
Local Affiliate
(U.S. Broadcast Networks)
Sat or
IP Link
Studio Post-Production
Speakers or
Headphones
RF
Over the Air
Signal
HDMI
HDMI
Adaptive Streaming Segments
SDI
File Transfer
Social Media Site
Social Media
Server
Mobile User-Generated Content
Internal
Storage
File Transfer
Consumer’s Smartphone
Adaptive Streaming
Segments
3D
Soundbar
MPEG-H
Core Audio
Decoder
MPEG-H
Dynamic
Range and
Loudness
Control
MPEG-H
Downmix/
Rendering
Headphones
or Device
SpeakersListening Mode/
Condiitons
Consumer’s Tablet or Smartphone
MPEG-H
Core Audio
Decoder
MPEG-H
Binaural
Processing
MPEG-H
Downmix/
Rendering
Built-In
Speakers
TV Set
MPEG-H
Monitoring
Unit
Panning Data
PCM Audio
Speakers
Integrated
Loudness
Speakers
Speakers
Tablet or
Smartphone
Settings
(Loudness,
Channels,
Configuration)
Video
Settings
(Loudness,
Channels,
Configuration)
Control Data
Video
Control Data
Typical use cases shown as examples, deployment in broadcaster’s
network will vary.
All cases may use MPEG-H Channels, Objects, or Higher-Order
Ambisonics components in any combination.
SDI connections could be replaced by IP transport as TV plants evolve.
Tablet or
Smartphone
Video
Internal
Microphones
MPEG-H
Encoder Smart TV AVR
MPEG-H
Monitoring
Unit
MPEG-H
Monitoring
Unit
SDI Video,
Embedded PCM
Dynamic Control Data
(Stage 4 only)
Existing
Equipment
New
Equipment
Upgraded for
MPEG-H Audio
Contribution link may
use PCM or MPEG-H
bitstream depending
on bandwidth available
Monitor
Mode
Integrated
Loudness
Monitor
Mode
Integrated
Loudness
Consumer’s Living RoomCasual Listening Around The HomeAway from Home
PCM Audio
MPEG-H
Control Data
Encoder
Emission
Encoder
Distribution
IRD
Contribution
IRD
File for
Broadcast
Internet
Encoder
Distribution
Encoder Transport
Stream
Processing
Cbl/Sat
Headend
Set-Top
BoxMPEG-H
Dynamic
Range and
Loudness
Control
MPEG-H
Binaural
Processing
Contribution
Encoder
MPEG-H
Control Data
Encoder
Master
Control
Switcher
Modulator/
Transmitter
Internet/CDN
5. The Kitchen. MPEG-H Audio’s flexible
rendering downmixes the program and
tailors the loudness and dynamic range
for optimum reproduction on the internal
speakers of Heather’s TV.
The Bedroom. MPEG-H Audio’s audio
objects allow Jim to turn down the
announcer and hear his favorite race
driver talk to the pit crew, while virtual
surround technology allows him to hear
the race in 3D sound.
The Office. Alfonso works in a
surprisingly quiet cube farm, but wears
premium headphones to concentrate.
MPEG-H Audio’s rendering can be set
to provide simulated 3D sound with full
dynamic range on his PC player.
Accessible. Ann’s vision faded away
years ago, but she can still enjoy TV with
her friends by receiving MPEG-H Audio’s
description channel on her tablet.
At the game, Juila wants to capture
the excitement of being there with her
friends. Her mobile phone’s built-in
MPEG-H Audio sound field encoding
provides encompassing sound to
accompany the video.
In the remote truck, Bill is able to
deliver an enhanced experience to
viewers just by sending submixes to
the MPEG-H Audio encoder instead
of a composite mix. MPEG-H Audio’s
monitoring and loudness tools prove
the stereo mix is great and the program
loudness is right on target.
On the mixing stage. John mixes
drama and film for broadcast using
familiar Dolby Atmos®
tools in his
console, with MPEG-H Audio translating
the settings for rendering in the
consumer’s home.
Home Theater. Frank has installed
height speakers in his man cave for
the ultimate 3D sound quality. MPEG-H
Audio’s immersive sound offers TV and
new media programing with sound to
match the latest cinema and Blu-ray
releases.
At the airport. Susan only has earbuds,
but she can still hear the dialogue of
her favorite show thanks to MPEG-H
Audio’s advanced dynamic range control
technology.
The Living Room. MPEG-H Audio’s
immersive sound appears so real, it’s
like being there, while its Audio Objects
let Fred adjust the elements of the sound
to his liking.
MPEG-H AUDIO
One stream offers
an optimized listening experience on every device
TRADITIONAL CHANNEL-BASED AUDIO:MONO,
STEREO, 5.1, 7.1, 7.1 + 4H
RENDERING TO ANY LOUDSPEAKER
ARRANGEMENT OR TODAY’S SOUNDBARS
ENHANCED VIRTUAL IMMERSIVE SOUND
ON TABLET SPEAKERS OR HEADPHONES
FUTURE “3D SOUNDBARS”
FOR OUT-OF-THE-BOX IMMERSIVE SOUND
HIGHER-ORDER AMBISONICS SOUND FIELD CAPTURE
AUDIO OBJECTS FOR PERSONAL SOUND MIX,
ALTERNATE LANGUAGES, “HEAR YOUR HOME TEAM”