Home

A/85:2013

image

Contents

1. Society of Motion Picture and Television Engineers White Plains N Y SMPTE SMPTE 302M 2007 Television Mapping of AES3 Data into MPEG 2 Transport Stream Society of Motion Picture and Television Engineers White Plains NY 2007 SMPTE SMPTE 2020 1 2008 Format of Audio Metadata and Description of the Asynchronous Serial Bitstream Transport SMPTE 2020 2 2008 Vertical Ancillary Data Mapping of Audio Metadata Method A SMPTE 2020 3 2008 Vertical Ancillary Data Mapping of Audio Metadata Method B Society of Motion Picture and Television Engineers White Plains NY 2008 ITU Method for the subjective assessment of intermediate quality level of coding systems Recommendation ITU R BS 1534 1 International Telecommunications Union Geneva Switzerland Soulodre G A Lavoie M C and Norcross S G The Subjective Loudness of Typical Program Material 115th Convention of the Audio Engineering Society 2003 Soulodre G A Evaluation of Objective Loudness Meters 116th Convention of the Audio Engineering Society Berlin Preprint 6161 10 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 ITU Multichannel stereophonic sound system with and without accompanying picture Recommendation ITU R BS 775 2 International Telecommunications U
2. Use this tone to align the output meters of the equipment in use for 20 dB FS on digital meters Use a meter with adequate resolution It is best if the playback device and the console input channel and output meters can all be set to precisely unity gain On analog meters this level is meant to be set to 0 VU ANSI C16 5 26 IEC 60268 17 27 to 4 on the scale of 7 of the BBC peak meter IEC 60268 10 IIa Hb 28 although note this meter displays 4 dB step so that the maximum calibrated value corresponds to 8 dB FS this problem is ameliorated by the fact that the PPM has an attack time of 80 percent reading in 10 ms much slower than a true peak meter and to corresponding values for other variant meters See Section 10 5 1 for more information on this test signal Step 2 Skip to Step 3 if no sound level meter is available Perform acoustical level alignment of the system under calibration using the following test Signal 2 500 Hz 2 kHz band limited pink noise at 20 dB FS recorded on the left channel of this stereo file Import it into a digital audio workstation and copy it to each of the Left Right Center Left Surround and Right Surround channels one at a time in turn Maintain the unity gain structure established with the sine wave tone See Section 10 5 3 for more information on this test signal http www atsc org refs a85 MidRngPinkNoise_ 20dB wav Ignore console or other electrical meters as they w
3. RF Mode gain values that can be stored in user defined presets or profiles Additional processing can be performed in the metadata domain bringing the results much closer to traditional processing but with the benefit of being reversible These technologies operate in tandem with an AC 3 encoder measuring audio and calculating the gain values then passing them to the encoder for insertion into the AC 3 bit stream The application is similar to the original Line and RF features described above but uses different profiles to satisfy different dynamic range processing goals These alternative profiles allow for an adjustable ratio of permanent DRC processing versus metadata DRC processing At one end of adjustment permanent multiband audio processing is applied to the audio after it is first pre conditioned by applied metadata to minimize the degree of permanent processing that might be required The resulting audio is AC 3 encoded with a fixed dialnorm value and one of the traditional DRC profiles can be selected At the other end of the adjustable ratio audio that has been pre conditioned by applied metadata is then analyzed and DRC gain words for Line Mode and RF Mode are generated per operator selection Traditional settings such as attack release threshold AGC range gate and freeze can be adjusted very much like a permanent audio processor but the results are instead included in the AC 3 bit stream along with the encoded audio This all
4. The listener then has the option of applying the gain words or not depending on their listening requirements One listener may choose not to apply the gain words and will hear the original dynamics of the program while another may choose to listen to the reduced dynamic range In either case the loudness of the dialog or anchor point of all programs will be normalized regardless of whether the DRC data is applied or not The choice of compression profile is up to the program producer as this allows them to make an artistic choice of how the program dynamics are reduced The AC 3 system can provide independent profiles for two modes Line mode and RF mode as explained later 51 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 Decoder Applications STB TV AVR Decode Down mix 2 to 6 Line AC3 Bitstream amp or Speaker Outputs Figure F 2 AC 3 decoder requirements Line Mode Wherever decoders are used there are two main modes required The first shown in Figure F2 is as a source of line level signals used to provide the audio for a home theatre or other hi fi application In this case usually only a light compression is required if any is needed at all Decoder Applications STB TV AVR RF Down mix Modulator Ch 3 or 4 Decode DeMux AC3 Bitstream Figure F 3 AC 3 decoder requirements RF Mode The second mode Figure F 3 is needed to drive an
5. certain basic principles Control Low Frequency Room Modes Using Irregular Dimensions and Effective Low Frequency Absorption Every room has modes determined by the geometry that cause sound to resonate at certain frequencies Making walls non parallel does not prevent this In small rooms used for control monitoring the resonant lowest frequency is within the audible range and the peaks of resonance are widely spaced Left uncontrolled this results in large irregularities in the frequency response which varies from channel to channel no matter how good the loudspeakers Simple equalization is of limited usefulness in resolving this issue because the peaks of resonance are very narrow Mixing under such conditions is a challenge as individual notes in a music mix can jump out as sounding far too loud while adjacent notes do not In rooms with few large objects choose the three dimensions to be unequal avoiding simple ratios between dimensions by at least 5 percent Always provide low frequency absorption to dampen the room response at resonant frequencies Low frequencies can only be absorbed by high quality materials thicker than 2 at a minimum or 2 material covering a deeper air cavity Carpet on floors or walls is of virtually no benefit Suspended ceilings with high performance absorptive tile with a 4 layer of backing absorbing material can provide effective low frequency absorption to dampen room modes in one plane Deep corner
6. 4 System Deployment Dolby E Metadata Over Serial Link Dolby E Metadata in VANC Metadata and Codecs File based Metadata Production Technique Live Production Technique Non Real Time Production Monitoring stuatedaeds 18 19 19 20 20 20 20 21 21 21 21 22 22 22 22 23 23 23 23 23 23 24 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 7 5 5 Semi Agile Metadata 24 7 5 6 Impact of Metadata Loss on Content 24 7 5 7 Fixed Agile Hybrid 24 7 5 8 Advantages of Agile Metadata 24 8 METHODS TO EFFECTIVELY CONTROL PROGRAM TO INTERSTITIAL LOUDNESS 045 24 8 1 Effective Solutions 25 8 1 1 For Operators Using a Fixed dialnorm System See Section 7 3 25 8 1 2 For Operators Using an Agile dialnorm System See Section 7 5 25 8 2 Adverse Conditions 25 8 3 Summary Recommendations 25 8 4 TV Station and MVPD Local Ad Insertion 26 9 DYNAMIC RANGE MANAGEMENT cccseecceseseecessseeeeesseeeeeseneeeeeseneseesseeeseseneeesasaeeseesseeeaseneeess 27 9 1 AC 3 Dynamic Range Control System Reversible 27 9 1 1 Line Mode and RF Mode DRC 28 9 1 2 Monitoring 28 9 1 3 Relation to dialnorm 29 9 1 4 Professional Encoding 29 9 1 5 AC 3 DRC Choosing None 29 9 2 Dynamic Range Processing With Metadata Interface Reversible 30 9 3 Dynamic Range Processing Without Metadata Interface Irreversible 31 9 4 Consumer Experience 32 10 AUDIO MONITORING SETUP s
7. In the case of TV station or MVPD insertion of local commercials or segments the operator should ensure that the Dialog Level of the local insertion matches the dialnorm setting of the inserted audio stream If the network originator s feed is decoded to baseband the loudness of the decoded audio needs to be measured and the value of the re encoder s AC 3 dialnorm value set to match the measured loudness for the next stage of encoding In this case either the operator modifies the network originator s loudness to match the target value of the operator s system or the originator s loudness value as measured is used to set the dialnorm value in the next stage of AC 3 encoding At this re encoding stage it is critical that the other audio metadata parameters are set appropriately and consistently 26 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 9 DYNAMIC RANGE MANAGEMENT The DTV audio system is capable of delivering very wide dynamic range the range between the softest and loudest sounds Content producers often take advantage of dynamic range as one of the methods to convey artistic intent However there could be a conflict between the desire of the content producer to deliver content with wide dynamic range and the audience who cannot or choose not to enjoy the wider dynamic range This could be caused by the inability of the viewer s equipment to reproduce the desired range of sounds or the lack
8. It modifies the characteristics of the original sound altering it from what the program provider intended to fit within the limitations of the analog system The AC 3 audio system defined in the ATSC digital television standard uses metadata or data about the data to control loudness and other audio parameters more effectively without permanently altering the dynamic range of the content The content provider or DTV operator encodes metadata along with the audio content The Dialog Normalization dialnorm metadata parameter causes receivers to adjust the loudness and if properly used can enable content to have a uniform loudness without user knowledge or intervention It achieves results similar to a viewer using a remote control to set a comfortable volume between disparate TV programs commercials and channel changing transitions The dialnorm and other metadata parameters are integral to the AC 3 audio bit stream ATSC document A 53 Part 5 1 mandates the carriage of dialnorm and correctly set dialnorm values This document is designed to enable the industry to establish and maintain proficiency in loudness measurement production monitoring metadata usage and contemporary dynamic range practices This document provides technical recommendations and information concerning e Loudness measurement using the BS 1770 recommendation ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 Target loudness for content e
9. Pub L 111 311 47 U S C 621 2011 31 AES Engineering Brief Number 38 The Effect of Downmixing on Measured Loudness Scott G Norcross and Michel C Lavoie Audio Engineering Society New York N Y 2011 32 Riedmiller J S Lyman and C Robinson Intelligent Program Loudness Measurement and Control What Satisfies Listeners AES Pre print 115th Convention October 2003 Audio Engineering Society New York NY 33 Schroeder Manfred The Schroeder Frequency Revisited JASA vol 99 no 5 pp 3240 3241 3 DEFINITION OF TERMS With respect to definition of terms abbreviations and units the practice of the Institute of Electrical and Electronics Engineers IEEE as outlined in the Institute s published standards 2 are used Where an abbreviation is not covered by IEEE practice or industry practice differs from IEEE practice the abbreviation in question will be described in Section 3 3 of this document 3 1 Compliance Notation This section defines compliance terms for use by this document vital This word indicates a course of action to be followed strictly no deviation is permitted should This word indicates that a certain course of action is preferred but not necessarily required should not This phrase means a certain possibility or course of action is undesirable but not prohibited As an additional aid to readers critical recommendations in this document are noted by the
10. RF modulator In this case the program will have to be downmixed and the loudness boosted to provide a reasonable match to the other analog off air signals used by the TV set The system will also have to provide more dynamic range reduction than in the previous case due to the boosted program signal level matching the smaller dynamic range of typical existing processed TV programs and possibly because of the listening conditions F 2 AC 3 DRC METADATA PLACEMENT IN BITSTREAMS The AC 3 system supplies gain instructions for both types of dynamic range reduction and also provides limiting in case the level build up inherent in downmixing is enough to produce clipping 52 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 The gain words for the so called Line mode shown in Figure F 2 are called dynrng words The gain words for the so called RF mode shown in Figure F 3 are called compr words The AC 3 bitstream carries both dynrng and compr gain words as shown in Figure F 4 as well as the limiting gain words when required AC 3 decoders can apply either RF or Line mode dynamic range control depending on the listener s preference though some CE manufacturers may limit the choice in specific devices ae dynrng values carried in each of the 6 Audio Blocks Figure F 4 AC 3 DRC metadata placement in bitstream All the dynamic range control words including any limiting gain instructions ar
11. absorption located at either wall wall or wall ceiling boundaries can also be effective Distribute High Quality Absorption and Diffusive Materials Randomly and Avoid Hard Parallel Surfaces Too much room reverberation makes it difficult to hear the details of a mix In addition ringing effects where a series of mid to high frequencies resonate occur if two hard surfaces are parallel especially when other room surfaces are absorptive To control both effects distribute 2 high quality absorption randomly on walls and ceilings to eliminate parallel surfaces where opposite areas are reflective Minimize the use of wall carpet and other thin absorption which are only useful at very high frequencies Where surfaces cannot be made absorptive as in the case of windows tilting the inner glazing layer can eliminate the parallel surface Diffusive elements are also useful in producing a space with reasonable but controlled reverberation Position Loudspeakers and Absorption to Prevent Discrete Reflections A flat hard surface behind the mix position creates comb filter effects from the main monitor loudspeakers The nature of the comb filter changes in frequency as the mix engineer moves forward and back in the room A similar effect at higher frequencies occurs when the loudspeaker direct sound reflects off a large console face The effect on the mix engineer can mask real comb filter problems in the mix Discrete rear wall reflection
12. achieve a desired audience effect This is sometimes done at the end of program segments going into a commercial break e An extreme variation outside of the comfort zone see Annex E may cause a listener to adjust the volume to compensate for the large temporary change in loudness When a scheduled commercial or promo plays going into or out of breaks the listener may need to readjust the volume yet again to achieve an acceptable setting for the short form content This has proven to be an annoyance to the audience 8 3 Summary Recommendations M Recommendations to lessen the negative impact of loudness variation during program to interstitial transitions include 1 Ensure proper targeted average loudness of content in a fixed metadata system or 25 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 2 Ensure proper dialnorm authoring matching the measured content loudness in an agile metadata system 3 Create awareness with the content supplier concerning the potential overuse of dynamic range and the possibility for listener complaints concerning transitions with large loudness variation a Document this condition in any content delivery specifications an operator requires of a supplier i Describe negative impact to the audience created by mixing outside of the tolerance of the listener Comfort Zone see Annex E when going into and coming out of breaks ii The Operator may rely on the Comfort Zone see Ann
13. consistency was achieved by skilled operators However as audio bandwidth increased and operator intervention decreased additional techniques had to be developed to maintain control and minimize audible side effects of the AGC process A possible artifact of wideband AGC is that the level of one part of the audio spectrum can be affected by another unrelated part of the audio spectrum that has more power This can be illustrated by imagining an audio signal comprised of a low frequency bass drum and a higher frequency flute A simple wideband AGC would vary the control signal and thus the level of all frequencies in response to the higher energy of the bass drum This level change might be appropriate for the bass drum but would cause inappropriate fluctuation in the level of the flute signal an effect commonly referred to as pumping or breathing The development of multiband processing systems significantly improves this issue since the input signal is divided into two or more frequency bands and a separate AGC is applied to each of the bands Multiband systems allow more control with less interaction between different parts of the audio spectrum and can also change the spectral balance of the applied signal Additional techniques have evolved to minimize the side effects such as pumping overload due to pre emphasis and increasing background noise These include multi stage processing look ahead processing clipping and gating Multi
14. downwards by one bit in amplitude and the dither added so that the peak level of the signal is precisely 20 dB FS including dither 10 5 3 Band Limited Pink Noise http www atsc org refs a85 MidRngPinkNoise_ 20dB wav Band limited pink noise is preferred over full band noise for several reasons At low frequencies below the Schroeder frequency in rooms standing waves have a strong influence on level and including such frequencies in the band of measurement adds uncertainty Also low frequency energy content in a stochastic signal leads to greater level variations vs time At higher frequencies uncertainty in microphone response vs angle room response curve and room absorption lead to less accuracy On the other hand narrow band noise or tones are too greatly influenced by loudspeaker response and room acoustics Thus a band of two octaves centered on 1 kHz has been determined as the most useful for the main channels while one centered on 40 Hz has been determined as the most useful for the low frequency effects channel See 33 37 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex A 12 March 2013 Annex A Program Loudness A 1 INTRODUCTION This Annex provides background to the BS 1770 loudness and true peak measurement algorithms A detailed description of the measurement algorithms can be found in 3 A 2 PROGRAM LOUDNESS The perceived loudness of an audio signal can be considered in a number
15. integrated or average loudness of the content s full program mix over its entire duration should be measured and reported as the Dialog Level 5 2 2 Long Form Content Measurement During Real Time Production Live Event The intent of loudness measurements made during a live event is to guide the mixer to produce the content at an average predetermined or Target Loudness The principle of measuring the loudness It is acceptable to employ automated Anchor Element detection algorithms that isolate and measure the loudness of dialog Such loudness measurements may directly use formula 2 from BS 1770 and are expected to give similar results to other Anchor Element based methods employing other formulas in BS 1770 16 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 of the Anchor Element of the content also applies to live productions but is done in real time as the production progresses M Some BS 1770 loudness meters offer a sliding window or moving average measurement mode that indicates the loudness of a prior segment typically 3 to 10 sec of the program sound Other implementations may use some kind of graphical representation of BS 1770 based loudness to complement a simple loudness figure Such modes should be used to help guide the mixer to the Target Loudness Using these measurements as a guide to confirm loudness is particularly useful when mixing in noisy locations or when a consistent monitor level cann
16. of an environment suitable to the enjoyment of the wide dynamic range Thus the goals of preserving the original dynamic range of the content and satisfying viewers can often be at odds A goal of the AC 3 system is to provide content producers with the greatest freedom and flexibility in the choice of DRC when producing content The AC 3 system conveys these DRC options to the viewer where DRC system will interact with the viewer s input in a known and repeatable fashion There are several methods for controlling dynamic range Some methods apply prior to audio encoding some apply after decoding and some span both domains 1 One approach is traditional compression and or limiting where gain control is applied to the audio prior to encoding 2 Another approach is to use the AC 3 coding system which generates gain control words during encoding but does not apply the gain control to the audio until after decoding allowing a user to optionally choose how much dynamic range they desire The primary difference between the two approaches is the AC 3 approach is reversible and the other approach is not A hybrid of the two methods is also possible allowing for some permanent and some reversible processing to be combined in a balance determined by the broadcaster 9 1 AC 3 Dynamic Range Control System Reversible To allow the content to be enjoyed by the most viewers in the widest variety of listening environments a mechanis
17. of ways Short term changes in loudness are a reflection of the continuously changing nature and dynamics of the audio content Examples include moment to moment changes within a program that convey changes in mood or context Short term loudness changes can also occur at the transition from one program to another Some of these changes can be inherent given that each program may have been created for different purposes They may also occur if the programs were created following different production practices A listener s impression of the loudness of a piece of content a program an ad etc is not formed by the loud or soft portions of the program but by the element s that form the anchor point of the entire mix The loud and soft portions create the dynamics of the program the Anchor Element determines its overall or average subjective loudness No matter what the dynamic range of the content matching the loudness of the Anchor Elements by applying simple gain offsets to bring each piece of content to the same overall loudness will allow listeners to enjoy the stream of content without having to readjust their volume controls In 2001 the Radiocommunication sector of the International Telecommunication Union ITU R initiated a study to identify an objective measure of the perceived loudness of typical broadcast program material The study resulted in the adoption of a method to calculate the long term integrated or average loudness of an audi
18. org refs a85 Speech_left_Ch 20dB wav 48 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex E 12 March 2013 Annex E Loudness Ranges The Comfort Zone is the range of loudness within which a listener will accept loudness changes within and between content items A subjective experiment was undertaken to determine this range and the other loudness tolerance points see 32 The results of the experiment are also shown in Figure E 1 Annoyingly Loud Turn Volume Down Louder but Acceptable Comfort Zone Softer but Acceptable Turn Volume Up 19 1 Annoyingly Soft Relative Loudness in dB of the Listening levels investigated with 95 confidence intervals Figure E 1 Critical loudness levels The experiment mimicked the transitions between long and short form content within a channel and changes between channels Subjects were placed in a typical listening or viewing situation and asked to switch between samples of long and short form content They were presented with a total of five different paired monophonic Reference and Test items reproduced by a single loudspeaker in front of them Subjects were instructed to adjust the master playback level until the Reference item was reproduced at what they considered to be a comfortable volume the zero point in the diagram The experimenter then asked them to set the Test volume control volume is a more familiar term than loudness to m
19. recommended setup of sound systems for digital television including the alignment of control room monitor systems to a reference sound pressure level corresponding to a particular electrical level and the equalization of such sound systems Monitoring conditions for a variety of spaces used for television production and postproduction is shown in Table 10 1 This section is not intended to replace SMPTE RP 200 6 for cinema sound monitoring in larger spaces This section documents the recommended original setup conditions for monitoring as well as measurements and adjustments that may be undertaken from time to time for quality control of monitoring for spaces used for television production and quality control This section is intended for planners design engineers installation engineers maintenance engineers and users A Quick Reference Guide for setting monitor levels is provided in Annex D Note The test signals given in this document are from the TMH Test Disc Series distributed by The Hollywood Edge and are copyrighted Free use of the copyright is granted to television and video studios networks and post production houses for alignment purposes All other rights reserved The TMH Test Disc Series makes available many other test signals for electrical and acoustical testing 10 1 Background As noted in Section 1 1 consumers do not expect large changes in audio loudness from program to interstitials and from channel to chann
20. results for all three datasets r 0 977 41 Figure A 5 Continuous signal peak level versus sample peak 42 Figure A 6 Basic structure of the ITU R BS 1770 true peak estimation algorithm 43 Figure E 1 Critical loudness levels 49 Figure F 1 AC 3 DRC basic concept 51 Figure F 2 AC 3 decoder requirements Line Mode 52 Figure F 3 AC 3 decoder requirements RF Mode 52 Figure F 4 AC 3 DRC metadata placement in bitstream 53 Figure F 5 Calculation of AC 3 gain words by the encoder Part 1 54 Figure F 6 Calculation of AC 3 gain words by the encoder Part 2 54 Figure F 7 Gain smoothing 56 Figure F 8 DRC loudness weighting curve 57 Figure F 9 DRC profiles input output characteristics 58 Table 10 1 Categories of Audio Control Rooms used in Television Production 33 Table 10 2 Reference Sound Pressure Level 36 Table D 1 Reference Sound Pressure Level 47 Table G 1 Critical Control Metadata 60 Table G 2 Basic Control Metadata Encoder 60 Table G 3 Basic Control Metadata Decoder 60 Table G 4 Informational Metadata 61 Table I 1 Reference Sound Pressure Level 68 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 ATSC Recommended Practice Techniques for Establishing and Maintaining Audio Loudness for Digital Television A 85 2013 1 SCOPE This ATSC Recommended Practice RP provides guidance to broadcasters and creators of audio for high definition HD or standard definition SD television content It recomm
21. stage processing involves placing two or more AGC sections of differing speeds in series allowing peaks to be controlled separately from control of average level Look ahead processing separates detection and adjustment portions of the AGC by a delay so that gain changes are applied at precisely the instant required Clipping is a process where the signal peaks are truncated and may result in audible distortion Gating prevents very low level signals and noise from being unnecessarily increased by the AGC and can be used along with thresholds of other stages to create a no processing null zone Taken together all of these techniques go far beyond the original goals of simply protecting against under or over modulation and in many cases have been used to produce a sound that is very different from the original The goal of modern digital television dynamic range control systems is different than its analog predecessors The elimination of the need for pre emphasis dramatically reduces the degree of processing necessary Newer devices may also employ more sophisticated detection schemes based upon or compatible with BS 1770 3 If the desire is to simply manage loudness with minimal impact on program content experience has shown that multiple stages of light processing can be very effective while still maintaining much of the program integrity Additionally program 31 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 to inter
22. the AC 3 encoder Use a BS 1770 meter to verify the Dialog Level of the audio content H 9 AFFILIATE DIALNORM SETTING See Section 7 3 5 of the RP An operator affiliate station MVPD etc receiving content that is delivered at a fixed loudness where there is no gain adjustment or processing after the receiver should set the value of dialnorm in the operator s AC 3 encoder to match the network originator s specified Dialog Level If a fixed gain or loss is applied in the signal chain the AC 3 encoder dialnorm value needs to be offset accordingly from the originator s Dialog Level If loudness processing is applied to the originator s audio the processor s Target Loudness value should match the operator s AC 3 encoder s dialnorm value See Section 9 3 for additional background on audio processing Key Idea Set the AC 3 encoder s dialnorm value to the originator s Dialog Level as adjusted H 10 TV STATION OR MVPD CONTENT INSERTION In the case of TV station or MVPD insertion of local commercials or segments the operator should ensure that the Dialog Level of the local insertion matches the dialnorm setting of the inserted audio stream Key Idea Ensure that the Dialog Level of inserted content matches the dialnorm setting of the inserted audio stream If the network originator s feed is decoded to baseband the loudness of the decoded audio is to be measured and the value of the re encoder s AC 3 d
23. to 20 uN m VU volume unit 5 PPM peak program meter 3 4 Terms The following terms are used within this document Agile Metadata Audio metadata values including dialnorm which can change at content boundaries Anchor Element The perceptual loudness reference point or element around which other elements are balanced in producing the final mix of the content or that a reasonable viewer would focus on when setting the volume control BS 1770 ITU R BS 1770 3 This specifies an algorithm that provides a numerical value indicative of the perceived loudness of the content that is measured Loudness meters and measurement tools which have implemented the BS 1770 algorithm will report loudness in units of LKFS Comfort Zone the Comfort Zone is a range 2 4dB 5 4dB of the change to audio loudness that was found to be acceptable to a sample of listeners per 32 The 0 dB point on the Comfort Zone scale is the average Target Loudness value or dialnorm of the channel Content Material or essence used for distribution by an operator Dialog Level The loudness in LKFS units of the Anchor Element dialnorm An AC 3 metadata parameter defined in A 52 4 that is carried in the AC 3 bit stream This is used to indicate how far the average Dialog Level is below 0 LKFS Valid values are 1 to 31 Loudness values outside this range cannot be expressed by dialnorm The value of dialnorm is numerically equal t
24. via serial links This approach may require a dedicated serial layer that remains carefully time aligned to the audio and video signals 7 5 1 2 Dolby E Metadata in VANC Dolby E metadata can also be embedded within the VANC of standard or high definition serial digital systems and extracted downstream using the SMPTE 2020 8 standard This approach may require multiplexers and de multiplexers and requires support by video storage encoding processing and distribution equipment with the ability to pass the VANC signal intact Some storage devices have limited or no VANC capability 7 5 1 3 Metadata and Codecs Certain systems used for backhaul distribution and storage applications also have the ability to carry Dolby E metadata These systems include the Dolby E compressed bitstream itself and other proprietary formats which require specialized audio encoders and decoders They also may require equipment that can offset video timing to compensate for the encoding and decoding latency introduced Most professional digital video equipment can be configured to pass these encoded signals through standard digital audio channels that comply with SMPTE 337 23 7 5 1 4 File based Metadata There are a large number of file based techniques for storing Dolby E metadata some of them standardized and some of them proprietary These are outside the scope of this document 7 5 2 Production Technique Live In live production using an agile metadat
25. within production distribution and transmission environments Section 8 Describes methods to effectively control program to interstitial loudness Section 9 Examines key issues relating to Dynamic Range Control DRC Section 10 Specifies the setup of sound systems for digital television including the alignment of control room monitor systems to a reference sound pressure level Annex A Program Loudness provides background on the BS 1770 loudness and true peak measurement algorithms Annex B Room Acoustics and Loudspeaker Placement discusses basic principles of control room monitoring Annex C Room Correction examines issues relating to the interaction of sound from loudspeakers and the room Annex D Quick Reference Guide for setting the acoustical sound level reference for TV monitoring rooms Annex E Loudness Ranges examines the range of loudness within which a listener will accept loudness changes within and between content items Annex F AC 3 Dynamic Range Control Details Annex G AC 3 Metadata Parameters Annex H Quick Reference Guide for station and MVPD engineers for loudness management by stations and MVPD operators ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 Annex I Quick Reference Guide for audio mixers creating content commercials and programs Annex J Requirements for Establishing and Maintaining Audio Loudness of Commercial Advertisi
26. AINTAINING AUDIO LOUDNESS OF COMMERCIAL ADVERTISING IN DIGITAL TELEVISION cccssseeecesseeeeesseeeeessseeesesseeeeenseneeeeesees 71 J Introduction and Scope 71 J 2 Loudness Management 71 J 3 dialnorm for AC 3 Based Systems 71 J 4 Commercial Advertising Loudness 71 J 5 Commercial Advertising at the Point of Insertion 71 ANNEX K REQUIREMENTS FOR ESTABLISHING AND MAINTAINING AUDIO LOUDNESS OF COMMERCIAL ADVERTISING IN DIGITAL TELEVISION WHEN USING NON AG 3 AUDIO CODECS re a a raea araea e s e rA raana iepa ame aa Ea Aaii aaa amr r aAa Ao raana Amane Aaaa Ena 72 K 1 Introduction and Scope 72 ATSC A 85 2013 Establishing and Maintaining Audio Loudness K 2 K 3 K 4 K 5 Loudness Management Loudness for Non AC 3 Channels Commercial Advertising Loudness Measurement Commercial Advertising at the Point of Insertion 12 March 2013 72 72 72 72 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 Index of Figures and Tables Figure 5 1 ITU R BS 1770 loudness algorithm 15 Figure 7 1 Fixed metadata concept 20 Figure 7 2 The preset metadata concept 21 Figure 7 3 The agile metadata concept 22 Figure 7 4 Metadata insertion options 24 Figure 10 1 Operational room electro acoustic response curve 34 Figure A 1 Block diagram of multichannel loudness algorithm 39 Figure A 2 The RLB weighting curve 40 Figure A 3 Pre filter response used to account for the acoustics effects of the head 40 Figure A 4 Combined
27. ARAMETERS ccccesccssseeeeeeeeeeeeeeeeenseeeeeeeeseneesseeeseeeeseeeeeeeeeseaee 60 ANNEX H QUICK REFERENCE GUIDE FOR STATION AND MVPD ENGINEERS AUDIO LOUDNESS MANAGEMEND wesicccicoscsccecececgsscsthceecoscecacvssutcvedtecenccdestiesucdecarednsugetectveceeativnanteecocessetevenctes 62 H 1 Introduction 62 H 2 Scope 62 H 3 Definitions 62 H 4 Loudness Management 63 H 5 FCC Requirement 63 H 6 Measurement of Content as Delivered 63 H 6 1 Long Form Content 63 H 6 2 Short Form Content 63 H 6 3 Newscasts or Other Live Programming 64 H 6 4 File Based Content 64 H 7 Target Loudness to Facilitate Program Exchange 64 H 8 Methods to Effectively Control Program to Interstitial Loudness 64 H 9 Affiliate dialnorm Setting 65 H 10 TV Station or MVPD Content Insertion 65 H 11 AC 3 Dynamic Range Control DRC 66 ANNEX I QUICK REFERENCE GUIDE FOR AUDIO MIXERS AND EDITORS CREATING CONTENT COMMERCIALS PROMOTIONAL MATERIALS AND PROGRAMMING WITH AUDIO LOUDNESS MANAGEMENT wissccssesetcesenssecvsnsccsevsstcvesccnescvsnconterncctersctnccvenstceseraccsrisnnscsrersetesnsentercvssesteds 67 I 1 Introduction 67 I 2 Scope 67 I 3 Definitions 67 1 4 Monitoring Environment 68 I 5 BS 1770 Level Monitoring 68 1 6 Content Loudness During Mixing 68 1 7 Target Loudness for Content Without Metadata 69 I 8 FCC Requirement 69 I 9 Measurement of Post Produced Content 69 1 9 1 Long Form Content 69 1 9 2 Short Form Content 69 ANNEX J REQUIREMENTS FOR ESTABLISHING AND M
28. ATSC ADVANCED TELEVISION SYSTEMS COMMITTEE ATSC Recommended Practice Techniques for Establishing and Maintaining Audio Loudness for Digital Television A 85 2013 Doc A 85 2013 12 March 2013 Advanced Television Systems Committee 1776 K Street N W Washington D C 20006 202 872 9160 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 The Advanced Television Systems Committee Inc is an international non profit organization developing voluntary standards for digital television The ATSC member organizations represent the broadcast broadcast equipment motion picture consumer electronics computer cable satellite and semiconductor industries Specifically ATSC is working to coordinate television standards among different communications media focusing on digital television interactive systems and broadband multimedia communications ATSC is also developing digital television implementation strategies and presenting educational seminars on the ATSC standards ATSC was formed in 1982 by the member organizations of the Joint Committee on InterSociety Coordination JCIC the Electronic Industries Association EIA the Institute of Electrical and Electronic Engineers IEEE the National Association of Broadcasters NAB the National Cable and Telecommunications Association NCTA and the Society of Motion Picture and Television Engineers SMPTE Currently there are approximately 150 members
29. E a e Ag SD A cg dG a e ecg e aL ATE On I ACRE LENE a a e a a aa N CSN Rees rcs oes ie Parte Ran as Pega ite as eve wall Sania a Sek Meee cara I o eee Seamer So a E af pe Mey Sb a I atl EN eh ed z Pe nc at AEE EE e oy cal E T A hs coh Sl sol 4 i T T i i Cc m EEEE E ic 2 S84 SOG a 2 Sh Se SiS Se oe l I I I l S kca ee hen ey ee a a l l l l l l TE l l l l l l l l l l l l N H E S N EE E E A N E O N oath O EO D EE So N EEE Eaa EEEE EEEE EEE EE EE E ee fe Na ee Tn ee es te Si Se ee A ao as ae ie One ol ne OT en lt penne RoG n n AA A AA ASAA ARARA e o gt oa ee e 18 ok miata oe sea he mn es Aes oo ete Je me E em 8 a Som l l l l 1 l Pot p So RS ce So Sal Se ale ge oT l l l l l om Bs ee TS MLN a ae Le eR eA at a es l l l l l l l l l l l l l l l l 2 L L L L 1 L oO Ke oO 1O oO 1O O Ke on ye y N N c 1 1 i i 40 Frequency Hz the head Establishing and Maintaining Audio Loudness Annex A ATSC A 85 2013 gp 19497 Saejay gP 18487 aAa Figure A 3 Pre filter response used to account for the acoustics effects of The performance of the algorithm can be illustrated by comparing the subjective impressions of loudness against measured values This is shown in Figure A 4 where the measured loudness for each of the 240 audio sequences from the three subjective tests is plotted against the subjective loudness Perfect agreement would result in the points falling on the diagonal line shown in t
30. Il above I ll Mix rooms Ill Edit rooms sometimes used for mixing IV Booths vans lt 1 500 76 V Headphones Use 2 cc Coupler and set 400 Hz level to 78 dB Per SMPTE RP 200 6 Note that Table D 1 reproduces Table 10 2 for the reader s convenience First test tone sine wave http www atsc org refs a85 440Hz_left_Ch 20dB wav Second test tone band limited pink noise http www atsc org refs a85 MidRngPinkNoise_ 20dB wav 47 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex D 12 March 2013 D 2 REALLY QUICK REFERENCE GUIDE FOR MONITOR SETUP If you can t use the band limited pink noise to calibrate the monitor levels the speech sample referred to in Section 10 4 may be used to set the monitor level appropriately Make sure that the signal path from the device playing back the speech sample to the input to the monitor system is set to unity gain Edit the mono source file into the single channel for mono playback into left and right at 3 dB each for stereo playback and into the center channel of 5 1 channel and other multichannel systems Play back the speech sample and adjust the master monitor gain to put the speech at your most comfortable listening level Since the loudness of the speech sample is 24 LKFS programs mixed so that the anchor program material such as dialogue matches this level will have approximately the same loudness http www atsc
31. MENT A correct monitoring environment is critical to satisfactory mix results Please read the background discussion in Section 10 1 as well as Section 10 2 for a full understanding of this topic It is essential that the correct monitor level be used This level varies with room volume See Section 10 4 of the RP for additional information and Table I 1 below Table I 1 Reference Sound Pressure Level Categories Room Volume in Cubic Feet SPL in dB re 20 pN m2 gt 20 000 85 10 000 lt 19 999 82 5 000 lt 9 999 80 1 500 lt 4 999 78 lt 1 499 76 Depends on room usage For editing purposes may be controlled by the editor for use with the material at hand For final program mixing follow the recommendations for categories I II above lt 1 500 76 Use 2 cc coupler and set 440 Hz level to 74 dB Per SMPTE RP 200 6 Note that Table I 1 reproduces Table 10 2 for the reader s convenience Consult Annex D for a synopsis of mixing room setup and calibration Key Idea Goal is to correctly setup your listening environment once and make sure you are always listening at this level when creating content This is true even if you must use headphones to monitor 1 5 BS 1770 LEVEL MONITORING BS 1770 provides a new measurement technique for monitoring audio levels The use of measurement tools which support the BS 1770 measurement methods by all involved in audio production will assist the in
32. S standard method for digital audio engineering Measurement of digital audio equipment Audio Engineering Society New York N Y SMPTE RDD 6 2008 Description and Guide to the Use of the Dolby E Audio Metadata Serial Bitstream Society of Motion Picture and Television Engineers White Plains N Y SMPTE SMPTE 337 2008 Format for Non PCM Audio and Data in an AES3 Serial Digital Audio Interface Society of Motion Picture and Television Engineers White Plains N Y ATSC A 79 Conversion of ATSC Signals for Distribution to NTSC Viewers Advanced Television Systems Committee Washington D C 12 December 2008 CEA CEA CEB11 A NTSC ATSC Loudness Matching Consumer Electronics Association Arlington VA September 2009 ANSI ANSI C16 5 1942 Standard Volume Indicator American National Standards Institute Washington D C IEC IEC 60268 17 Sound system equipment Part 17 Standard volume indicators International Electrotechnical Commission Geneva 1990 IEC IEC 60268 10 lia IIb 1991 Sound system equipment Part 10 Peak programme level meters International Electrotechnical Commission Geneva 1991 ANSI ANSI S1 4 1983 R 2001 with Amd S1 4A 1995 Specification for Sound Level Meters American National Standards Institute Washington D C 11 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 30 Commercial Advertisement Loudness Mitigation CALM Act
33. Standard curve Po rmp oO Tm Figure F 9 DRC profiles input output characteristics The Null zones of Film Standard and Speech are not symmetric around the dialnorm value because with typical program material and the degree of dynamic range reduction desired the relatively rapid boost zone attack combined with a slower decay time tends to leave the program in the asymmetric null zone See Figure F 9 F 2 3 DRC Encoder Parameters for Setting Metadata This illustrates some of the front panel menu choices provided by the Dolby 569 AC 3 Encoder There is a PC based remote control program for the Dolby 569 that presents all the parameters on a single screen The Dolby 570 metadata authoring and previewing device has similar front panel menus and a much easier to use remote control program as well Main Setup Menu gt Audio Service gt Channel Mode gt Choose one of 1 0 2 0 3 2 etc Main Setup Menu gt Audio Service gt Dialog Level gt Set to dB to 31 dB Main Setup Menu gt BSI Parameters gt Center Downmix level gt Choose one of 3 0 dB 4 5 dB 6 0 dB Main Setup Menu gt BSI Parameters gt Surround Downmix level gt Choose one of 3 0 dB 4 5 dB 6 0 dB Main Setup Menu gt Dynamic Range gt Line Mode gt Choose one of Speech Music Light Music Standard Film Light Film Standard Disabled Main Setup Menu gt Dynamic Range gt RF Mode gt Choose one of Speech Music Light Music S
34. The audio data after loudness normalization is mapped onto the selected DRC profile to generate a static gain value It is also used to select one of four time constants depending on whether the input loudness is larger attack or smaller decay than the smoothed loudness and by how much If the input loudness is larger than the smoothed loudness and the amount is greater than the attack threshold then the fast attack time constant is selected This is generally a very fast time constant designed to provide very quick convergence of the compressor for very loud events If the input loudness is larger than the smoothed loudness but not by more than the attack threshold then the slow attack time constant is selected This is generally a moderately fast time constant designed to provide reasonably quick convergence of the dynamic range controller for moderately loud events If the input loudness is smaller than the smoothed loudness but not by more than the decay threshold then the slow decay time constant is selected This is generally a very slow time constant designed to provide slow and therefore inaudible release of the compressor during soft passages If the input loudness is smaller than the smoothed loudness and the amount is greater than the decay threshold then the fast decay time constant is selected This is generally a moderate time constant designed to provide a faster release at the end of very loud events The other p
35. a approach the television production mix engineer selects a specific but arbitrary loudness target for each program with considerations for dynamic range headroom and the type and mood of the program This parameter establishes the loudness anchor for layering of music and effects for the soundtrack Depending on the deliverable specification the Target Loudness is either carried in the encoded signal by the value of the dialnorm parameter or communicated to the distributor for subsequent encoding 7 5 3 Production Technique Non Real Time With post produced content using the agile metadata approach the final mix loudness is determined either during program production or after it is complete Depending on the deliverable specification the Target Loudness is either carried in the encoded signal by the value of the dialnorm parameter or communicated to the distributor for subsequent encoding Insertion in the deliverable can be accomplished through a dubbing process or by making use of the pre read feature available in some video tape recorders See Figure 7 4 e g Linear Acoustic e squared format and the Ericsson Quad Phase Aligned format 23 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 Dub Insertion Pre Read Insertion Playback VANC Record E VANC VTR Embedder VTR Embedder Metadata Metadata Source Source Figure 7 4 Metadata insertion options 7 5 4 Production Monitoring The soundtrack
36. a input The dialnorm setting changes appropriately on boundaries of the content See Figure 7 3 ATSC Process Distribution Storage Distribution and Switching Source 1 Process Storage 5 Distribution AC 3 and Encoder Source n Switching 5 Metadata Consumer AC 3 Audio Decoder Meta data Values Figure 7 3 The agile metadata concept 7 5 1 System Deployment When the agile metadata approach is used by a network operator it will need to be employed throughout the plant of every broadcast station or MVPD head end that receives content from the network This requires deployment of complex encoding and decoding equipment at all input output monitoring and processing points in the distribution chain from the metadata origin through to all AC 3 encoders It is essential that the agile metadata reach the AC 3 encoder Several approaches for agile metadata delivery and storage are available and can be used separately or in combination the AC 3 metadata as discussed in Annex G is a consumer subset of Dolby E metadata as described in SMPTE RDD 6 22 This may be transported over serial data links as vertical ancillary VANC data or as data carried in compressed bitstreams It may also be stored in file based systems 22 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 7 5 1 1 Dolby E Metadata Over Serial Link Dolby E metadata in its baseband form may be carried
37. acent group by half so the effect when the sample values are added is a crossfade between groups which smoothes out any steps between them The compr words apply to an entire sync frame but the sync frames also use the overlap add process smoothing out the compr gain steps as well F 2 2 The DRV Loudness Weighting and Input Output Curves The DRC weighting curve emphasizes low frequencies more than the B weighting curve which is normally used to weight moderate level i e close to TV listening levels sounds See Figure F 8 56 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 DRC loudness weighting IEC B Weighting IEC A Weighting 10 10 10 10 10 Hz Figure F 8 DRC loudness weighting curve The total loudness of a multichannel source is computed as the sum of the weighted RMS power levels in each channel Both the weighting and loudness summation predate the ITU loudness measurement which can be seen as a refinement of this method 57 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 Input Loudness 90 80 70 60 50 40 30 20 10 0 dBFS Pit Y speech music light WJOU eIp Ol 1s00g ule film standard TITTPTTTT TTT TTT ptr rrr bh SB hae ee i et de a ed Sat ate ee eee eee ee a a gt a ap Sie th nen Hf Sn hn oO ok jus Notes 3 The Speech curve has been moved up a bit to separate it from the Film 5
38. ach with different dialnorm values and engaged via a General Purpose Interface GPI or other control interface e Agile metadata The AC 3 encoder is configured to receive external metadata An upstream agile dialnorm metadata system may be used to deliver changing dialnorm values to the encoder corresponding to the changing loudness at the content boundaries When managed properly all three methods provide a compliant and acceptable end result for the consumer The majority of the discussion in this section of this RP focuses on the dialnorm parameter Readers are encouraged to refer to Annex G and to research information on how the remaining metadata parameters may impact coding 19 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 It is also possible for the operator to apply a hybrid approach choosing one of the methods for loudness management and a different method for the remainder of the metadata e g maintaining a fixed dialnorm value but switching channel mode as required 7 3 Using Fixed dialnorm Metadata The concept of fixed dialnorm is simply to fix the AC 3 encoder dialnorm setting to a single value within a network or broadcast system and to bring the loudness of the encoder audio input signal into conformance with this setting The operator can choose any dialnorm value from 31 to 1 however compliance with A 53 requires the operator to employ a value equal to the loudness of the e
39. an be significant at higher frequencies or with signals containing sharp transients Inconsistencies in sample peak readings can result when the samples do not fall on the same locations in the audio signal In the diagram below a small phase shift between the sampling clock and the audio signal can result in different sample peak readings Peak level of audio signal Reported sample peak level Figure A 5 Continuous signal peak level versus sample peak Annex 2 of BS 1770 describes an algorithm for estimating the true peak level of a linear PCM audio signal A simplified description of the algorithm is shown in Figure A6 42 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex A 12 March 2013 Absolute Max Output 1 b enn oo LSJ 2222222222222 or Optional Absolute Max Output 2 Figure A 6 Basic structure of the BS 1770 true peak estimation algorithm Incoming signals are over sampled by four times to obtain a more accurate representation of the audio signal This translates to 192 kHz over sampling for signals originally sampled at 48 kHz the usual studio sampling rate 43 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex B 12 March 2013 Annex B Room Acoustics and Loudspeaker Placement While a full discussion of control room acoustics and loudspeaker placement are beyond the scope of this RP the quality of control room monitoring can be dramatically improved by following
40. and Maintaining Audio Loudness Table of Contents Background and Introduction Organization Informative References Compliance Notation Treatment of Syntactic Elements Acronyms and Abbreviation Terms Overview of the BS 1770 Loudness Measurement Method Overview of Making Loudness Measurements 5 2 1 5 2 2 5 2 3 5 2 4 5 2 5 5 2 6 5 2 7 Long Form Content Measurement During Production and Post Production Long Form Content Measurement During Real Time Production Live Event Measurement of Finished Long Form Content Measurement of Short Form Content Downmix Loudness Measurement and Monitoring File Based Measurement Special Considerations for Measurement of Content Downstream 12 March 2013 6 TARGET LOUDNESS AND TRUE PEAK LEVELS FOR CONTENT DELIVERY OR EXCHANGE 18 7 METADATA MANAGEMENT CONSIDERATIONS IMPACTING AUDIO LOUDNESS 7 1 7 2 7 3 7 4 7 5 Importance of dialnorm Metadata Management Modes Using Fixed dialnorm Metadata 7 3 1 7 3 2 7 3 3 7 34 7 3 5 7 3 6 Setting dialnorm by Long Term Averaging Method Setting dialnorm for Production Content Not Conforming to the Target Loudness dialnorm and Loudness Quality Control Emission dialnorm Setting for Compliance with A 53 Fixed dialnorm Advantage Using Preset dialnorm Metadata 7 4 1 1 4 2 Implementation AC 3 Framesync Requirement Using Agile dialnorm Metadata 7 5 1 7 5 1 1 7 5 1 2 7 5 1 3 7 5 1 4 132 7 5 3 7 5
41. arget Therefore it is very important that dialnorm accurately indicates the loudness of the content dialnorm is also used to set the threshold of a somewhat hidden and inescapable overload protection process designed to protect down mixed audio from overloading consumer equipment Overload protection has ballistics appropriate for preventing overload but are far less than ideal for audio quality It is good practice to avoid overload protection This can be accomplished by ensuring that the dialnorm parameter accurately reflects the actual loudness of the audio content 9 1 4 Professional Encoding In AC 3 encoders the gain reduction and expansion characteristics of the RF and Line DRC modes are determined by a group of DRC profiles These profiles describe many parameters including the gain reduction or cut range gain expansion or boost range and attack and release times In between these ranges is a linear range or null zone where no gain reduction or expansion takes place It is expected that the majority of professionally mixed content will reside within the null range where the content will be delivered exactly as produced with no additional or continual modification Excursions beyond this null band might be to convey a specific artistic intent Note that the dialnorm parameter determines the position of this null band so again it is essential that the dialnorm parameter accurately indicate th
42. ation lab to serve as a performance baseline These two algorithms were simple mean square calculations the first used a simple frequency weighting filter while the second was unweighted The simple weighting filter was a revised low frequency B weighting curve RLB A comparison of the submitted loudness meters algorithms with the subjective database revealed that the simple frequency weighted mean square algorithm performed the best 11 38 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex A 12 March 2013 To verify the performance of this algorithm a second subjective database was created from formal subjective tests conducted at one of the test sites This test had 20 subjects providing a perceived loudness rating to 96 monophonic audio sequences and used the same subjective testing methodology as the first round of tests An analysis of the results from this second subjective database confirmed the performance of the simple frequency weighted averaging algorithm In an effort to extend the algorithm to multichannel audio signals a third round of tests was carried out using 144 audio sequences 48 monophonic 48 stereo and 48 multichannel with 20 subjects participating The test used the same subjective methodology as the previous two tests The reference consisted of English female speech with stereo ambience and low level background music The loudspeakers were configured as described in Recommendation ITU R BS 775 12 The
43. blishing and Maintaining Audio Loudness 12 March 2013 of avoiding this problem during production is to assign dialog and lead singer elements primarily or exclusively to the center channel 31 Alternatively the surround mix should be adjusted in production post production to mitigate loudness build up in the resultant down mix 5 2 6 File Based Measurement The measurement methods described above also apply to content that exists as digital files In addition file based storage makes it practical to automate the loudness measurements and to examine the dialnorm value if any that may have been assigned to the content The content s loudness may be adjusted if necessary to the desired or Target Loudness value by applying an overall gain correction or the dialInorm value may be re written to match the measured loudness of the content 5 2 7 Special Considerations for Measurement of Content Downstream Once finished content moves downstream from production e g ina MVPD or broadcast station facility it may not be possible or practical to measure the loudness that should correspond to the dialnorm value Moreover it might not be possible to distinguish between long form content and short form content or determine boundaries between long form and short form content Consequently loudness measurements made downstream may cross content boundaries Once finished content moves downstream and determination of boundaries between content segment
44. calculated for each 256 samples Audio Blocks represent 512 samples 5 3 ms of audio but because of the overlap add structure of the blocks the loudness of each audio block is represented by the average of two 256 sample measurements Loudness values are thus produced every 5 3 ms 54 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 The loudness of each channel is calculated individually then summed to get the overall loudness of the program The loudness weighting curve is shown in Figure F 8 The peak level calculation has to include the center and surround channel downmixing coefficients cmixlev and surmixlev selected ideally during the production process The output of the calculation is the maximum peak level of the worst case downmix condition Loudness and peak values are normalized by the overall program dialnorm value before being passed to the subsequent calculation stages The loudness and peak level data used for the compr gain word calculations represent these values over an entire AC 3 sync frame 32 ms The audio decoder includes an overlap and add thus even the coarsely timed compr gain words are applied smoothly The dynamic range control portion of the AC 3 system has to deal with all types of program material so it includes a method of adjusting the attack and decay time constants that are determined automatically depending upon the program material
45. ccurate loudness measurements 5 LOUDNESS MEASUREMENT Because loudness is a subjective phenomenon human hearing is the best judge of loudness When combined with a known mixing environment and using a consistent monitor level such as that 14 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 described in Section 10 of this RP experienced audio mixers using their sense of hearing can produce well balanced program sound content with remarkably consistent loudness If all programs and commercials were produced at a consistent average loudness and if the loudness of the mix is preserved through the production distribution and delivery chain listeners would not be subjected to annoying changes in loudness within and between programs When measuring audio signals there are two key parameters of interest the true peak level of the signal and its loudness The true peak measurement enables the mixer to protect the program from clipping and the loudness measurement allows the mixer to protect the listener from annoying variations in loudness Although the mixer balances a mix using his or her ears an objective loudness measurement helps to maintain consistent average loudness within and between programs The familiar VU and PPM meters measure neither the loudness nor the true peak levels of the signal The characteristics of many of the common electronic meters available are unknown and contribute to the incons
46. ct in the context of this document 10 5 2 440 Hz Sine Wave Tone http www atsc org refs a85 440Hz_left_Ch 20dB wav A sine wave tone is the simplest and most widely employed test signal in audio Typical uses include signal tracing and level setting A sine wave has two properties useful for testing first the level is constant so it can be measured with great accuracy and second a mid frequency sine wave tone does not stress the limits of any system so it passes through unprocessed The level is 20 dB relative to Full Scale dB FS The 440 Hz frequency was chosen for three reasons it is in the flattest portion of the equalization frequency response curve of the BS 1770 3 specification it is musically relevant as reference A on the musical scale and it is not harmonically related to the various sampling frequencies in use so that all code values within its range are exercised Use of a tone at a precise tuning frequency helps by being able to identify audibly when there is an error in playback sample rate compared to recorded sample rate During the development of this test signal it was found that producing a test sine wave at precisely 20 dB FS and then adding 1 LSB of triangular probability density function white noise dither as required for a distortion free test signal triggered some meters to read one division higher than 20 dB FS which could result in up to a 2 dB error in units found Thus the level was adjusted
47. cument 71 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex K 12 March 2013 Annex K Requirements for Establishing and Maintaining Audio Loudness of Commercial Advertising in Digital Television When Using Non AC 3 Audio Codecs K 1 INTRODUCTION AND SCOPE The recommendations in this Annex are based on other sections of this Recommended Practice This Annex contains the courses of action necessary to perform effective loudness control for digital television commercial advertising when using non AC 3 audio codecs K 2 LOUDNESS MANAGEMENT The Operator s goal is to present to the audience consistent audio loudness across programs commercials promotional material and public service announcements K 3 LOUDNESS FOR NON AC 3 CHANNELS It is vital that the delivery channel operate at an Operator selected loudness target value as measured in units of LKFS for content on the channel M K 4 COMMERCIAL ADVERTISING LOUDNESS MEASUREMENT It is vital that when loudness of short form content e g commercial advertising is measured it be measured in units of LKFS including all audio channels and all elements of the soundtrack over the duration of the short form content MI K 5 COMMERCIAL ADVERTISING AT THE POINT OF INSERTION In the case of insertion of short form content e g commercial advertising it is vital that the loudness measured as per section K 4 of the inserted short form content match the delivery cha
48. d historically it was felt that for most programs dialog would be the Anchor Element 62 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex H 12 March 2013 H 4 LOUDNESS MANAGEMENT Key Idea Goal is to present to the viewer consistent audio loudness across commercials programs and channel changes H 5 FCC REQUIREMENT ATSC document A 53 Part 5 2010 1 mandates the carriage of dialnorm and correctly set dialnorm values Key Idea Set the station AC 3 encoder s dialnorm to match the loudness of average Dialog Level of the content H 6 MEASUREMENT OF CONTENT AS DELIVERED See Section 5 of the RP H 6 1 Long Form Content A representative section of the content that is dominated by typical dialog i e not shouting or whispering should be isolated and measured In the absence of dialog the loudness of the element of the content that a reasonable viewer would focus on when setting the volume control should be measured If neither technique is possible or practical the loudness of the entire duration of the content should be measured If the content has significant periods of quiet see Section 5 2 of the RP Key Idea Measure the long form content loudness when dialog is present This value is the Dialog Level of the content The Dialog Level in units of LKFS should match the dialnorm value of the AC 3 encoder H 6 2 Short Form Content See Section 5 2 4 of the RP Key Idea Measure the loudn
49. d in producing the final mix of the content or that a reasonable viewer would focus on when setting the volume control BS 1770 Shorthand for ITU R BS 1770 3 This specifies an algorithm that provides a numerical value indicative of the perceived loudness of the content that is measured Loudness meters and measurement tools which have implemented the BS 1770 algorithm will report loudness in units of LKFS Dialog Level The loudness in LKFS units of the Anchor Element dialnorm An AC 3 metadata parameter defined in A 52 4 that is carried in the AC 3 bit stream This is used to indicate how far the average Dialog Level is below 0 LKFS Valid values are 1 to 31 Loudness values outside this range cannot be expressed by dialnorm The value of dialnorm is numerically equal to the absolute value of the Dialog Level The value of 0 is reserved LKFS loudness K weighted relative to full scale measured with equipment that implements the algorithm specified by BS 1770 3 A unit of LKFS is equivalent to a decibel Target Loudness A specified value for the Anchor Element i e Dialog Level established to facilitate content exchange from a supplier to an operator Agile metadata is an alternative to using the fixed metadata system approach See Section 7 5 of the RP for details concerning the agile metadata approach The term Dialog Level is based on dialog s widespread use as the anchor for mixing of content an
50. dialog and to measure its loudness accurately regardless of the overall dynamic range or style of the finished program If dialog is not the Anchor Element of the content e g a music program there is always some element the mix engineer uses as the Anchor Element around which the loudness of all the other program elements are balanced This element of the sound mix is what a reasonable viewer would focus on when setting their volume control and should be measured and reported as the Dialog Level of the content The situation when it is impractical to measure the Anchor Element is covered below Once finished content moves downstream from production loudness measurements may not duplicate the measurements made during the production process This point is discussed in other parts of this document It should be noted that content measured before the effective date of this document per the methods documented in a predecessor version or as permitted by A 53 Part 5 1 need not be measured again 5 2 1 Long Form Content Measurement During Production and Post Production M During the production or post production process it is generally possible to identify and isolate the Anchor Element typically dialog of the program sound mix using various processes Once isolated the Anchor Element or a representative sample of the Anchor Element should be measured using BS 1770 3 If the Anchor Element cannot be isolated and measured then the long term
51. dustry to manage audio loudness of content consistent with artistic intent See Section 5 2 of the RP for a short discussion of how best to use the technique and Annex A for a detailed discussion of BS 1770 and how it works Key Idea Use BS 1770 compliant measurement tools 1 6 CONTENT LOUDNESS DURING MIXING Key Idea With the monitor level set correctly always mix relying on your hearing Use a BS 1770 loudness monitoring tool to confirm what you hear 68 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex 12 March 2013 I 7 TARGET LOUDNESS FOR CONTENT WITHOUT METADATA For delivery or exchange of content without metadata the Target Loudness value should be 24 LKFS Minor measurement variations of up to approximately 2 dB about this value are anticipated due to measurement uncertainty and are acceptable Content loudness should not be targeted to the high or low side of this range The true peak level should be kept below 2 dB TP in order to provide headroom to avoid potential clipping due to downstream processing such as audio coding used in delivery Key Idea When generating content and the program delivery level requirement is unknown or has not been specified mix Dialog Level to 24 LKFS with true peaks below 2 dB TP 1 8 FCC REQUIREMENT ATSC document A 53 Part 5 1 mandates the carriage of dialnorm and correctly set dialnorm values Key Idea The station AC 3 encoder s dialnorm will b
52. e generated in the AC 3 encoder sent to the AC 3 decoder and applied in the AC 3 decoder The transmitted information includes the dialnorm data which is used to normalize the dialog or anchor point loudness of each program as a whole The dialnorm and compr gain words are sent to the AC 3 decoder once every 32 ms while the dynrng gain words are carried at 6 times this rate or each 5 3 ms F 2 1 Calculation of AC 3 Gain Words As shown in Figure F 5 the first task is to calculate the normalized loudness and peak levels of the program material further detail shown in Figure F 6 acmod tells the encoder how many channels the program has 53 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 Measure Apply Normalized Loudness dialnorm Loudness Multichannel Dialnorm Audio program value Measure eak level amp p Apply Normalized emixlev dialnorm Peak surmixlev worst case acmod amp downmix profile s Required User inputs Figure F 5 Calculation of AC 3 gain words by the encoder Part 1 Compute Static Gain boost ratio Normalized Loudness Compute Time Constant Attack ee dynrng Smooth and Limit Gain Words and threshold compr Compute Maximun Gain before Clipping Clip Level gt Figure F 6 Calculation of AC 3 gain words by the encoder Part 2 Normalized Peak level For the dynrng gain words the loudness is
53. e integrated into programs in a Category II room or better If used for final mixing apply the level and equalization recommended practice herein Trucks and booths for program mixing These spaces have special considerations due to their small room volume high background noise level high level of early reflections and communication needs in a production environment Headphone monitoring systems recommendations Used for ingest stations in crowded environments quality control in machine rooms and the like 10 3 Installation Sound systems should be installed insofar as practical to the recommended practices in the reference documents see references 12 14 16 and 17 in Type I II and IV spaces Type II spaces should also fulfill the requirements of the recommended practices if they are to be used for final mixing Attention is drawn to the requirement for equal distance spacing of the loudspeakers from the primary mix monitoring location or if that is impractical the need to use time delay to make the time of arrival at the mix location constant among the loudspeaker channels This is necessary due to the very great precision of human hearing in locating phantom images appearing between adjacent pairs of loudspeakers This requirement applies especially across the front channels and 33 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 if the surround loudspeakers are closer than the f
54. e loudness of the content There are five profiles defined in the AC 3 encoder The profiles are e Music Light e Music Standard e Film Light e Film Standard e Speech The differences between Music and Film DRC choices may be subtle to a typical listener but one or the other may be better applicable to certain types of content This can be best determined by monitoring with an appropriate emulator The Light versions of the profiles have a much wider null area Thus gain reduction or expansion begins farther away from average program audio resulting in less gain reduction or expansion than with the Standard version of the profile The Speech profile is as the name would suggest intended for programs that contain only speech a talk radio format for example This profile might introduce noticeable DRC artifacts on programs with music and effects Please refer to Annex F for more information about the DRC profiles 9 1 5 AC 3 DRC Choosing None There is also a choice called None that does not select one of the named DRC profiles The selection of None by the operator inhibits the generation of DRC control words None is an acceptable choice as long as the ramifications of not choosing a DRC profile are understood e The reversibility feature of the DRC system will not be available to the consumer 29 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 e Selecting No
55. e set to match the loudness of average Dialog Level of the content 1 9 MEASUREMENT OF POST PRODUCED CONTENT See Section 5 2 1 of the RP 1 9 1 Long Form Content A representative section of the content that is dominated by typical dialog i e not shouting or whispering should be isolated and measured either during or after the final mix In the absence of dialog the loudness of the element of the content that a reasonable viewer would focus on when setting the volume control should be measured If neither technique is possible or practical the loudness of the entire content should be measured If the content has significant periods of quiet see Section 5 2 3 of the RP Key Idea Measure the long form content audio when typical dialog is present and record this value as the Dialog Level of the content 1 9 2 Short Form Content See Section 5 2 4 of the RP Key Idea Measure the loudness of all audio channels and all elements of the soundtrack integrated over the duration of the short form content 69 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex 12 March 2013 S See Section 6 16 The LFE is not included 70 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex J 12 March 2013 Annex J Requirements for Establishing and Maintaining Audio Loudness of Commercial Advertising in Digital Television J 1 INTRODUCTION AND SCOPE The recommendations in this Annex are based on o
56. el Two level related features are built into the AC 3 system specified in the ATSC standard dialogue normalization and dynamic range control If these are set and used correctly they go a long way towards achieving the goals of maintaining the artistic integrity of mixes while delivering a dynamic range suitable for the conditions of the 7 Manufacturers of devices designed for this purpose should be consulted for more information 32 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 consumer The use of a standardized monitor layout standard electrical and acoustical reference levels and reference spectrum response by all program and interstitial providers makes for a common basis in monitoring Along with the features built into the AC 3 system practicing the recommendations in this section of the RP should help satisfy the mutual needs of producers and consumers A universal observation is that an identical sound pressure level is perceived as louder in small rooms such as control rooms than in large rooms such as cinemas The reference sound pressure level in this document has been tested for interchangeability with SMPTE RP 200 6 employed in large spaces The use of reference conditions for monitoring has been shown to improve interchangeability of program commercial and other interstitial mixes and thus is desirable 10 2 Characteristics of Rooms and Spaces Five categories of sound control rooms and p
57. ends production distribution and transmission practices needed to provide the highest quality audio soundtracks to the digital television audience This RP focuses on audio measurement production and postproduction monitoring techniques and methods to effectively control loudness for content delivery or exchange It recommends methods to effectively control program to interstitial loudness discusses metadata systems and use and describes modern dynamic range control This RP also includes specific information on loudness management at the boundaries of programs and interstitial content In addition the RP includes two Annexes which can function as short stand alone Quick Reference Guides to two specific communities of interest station MVPD engineers and audio production staff 1 1 Background and Introduction Consumers do not expect large changes in audio loudness from program to interstitials and from channel to channel The NTSC analog television system uses conventional audio dynamic range processing at various stages of the signal path to manage audio loudness for broadcasts This practice compensates for limitations in the dynamic range of analog equipment and controls the various loudness levels of audio received from suppliers It also helps smooth the loudness of program to interstitial transitions Though simple and effective this practice permanently reduces dynamic range and changes the audio before it reaches the audience
58. ess is a specified value for the Dialog Level established to facilitate content exchange from a supplier to an operator For delivery or exchange of content without metadata and where there is no prior arrangement by the parties regarding loudness the ATSC specifies a Target Loudness value of 24 LKFS which serves to establish a common operating level for use with that fixed value of dialnorm Minor measurement variations of up to approximately 2 dB about this value are anticipated due to measurement uncertainty and are acceptable Content loudness should not be targeted to the high or low side of this range Key Idea For content without metadata use the Target Loudness value of 24 LKFS H 8 METHODS TO EFFECTIVELY CONTROL PROGRAN TO INTERSTITIAL LOUDNESS Large loudness variation during transitions can be effectively managed by adhering to the following practices For operators using a fixed dialnorm system see Section 7 3 of the RP a Ensure that all content meets the Target Loudness and that the dialnorm value matches this value b Employ a file based scaling device to match Dialog Level of non conformant content to the target value 64 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex H 12 March 2013 c Employ a real time loudness processing device to match the Dialog Level of non conformant content to the target value Key Idea Ensure that all program and commercial audio content matches the dialnorm value of
59. ess of all audio channels and all elements of the soundtrack integrated over the duration of the short form content The value of the loudness measurement in units of LKFS should match the dialnorm value of the AC 3 encoder 12 The LFE is not included 63 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex H 12 March 2013 H 6 3 Newscasts or Other Live Programming The principle of measuring the loudness of the dialog of the content applies to live productions done in real time as the production progresses The intent of loudness measurements made during a live event is to guide the mixer to produce the content at a Loudness that matches the dialnorm setting of the station s AC 3 encoder A BS 1770 loudness meter may be helpful when mixing in noisy environments or when a consistent monitor level cannot be maintained See Section 5 2 2 of the RP Key Idea Use a BS 1770 meter to help ensure that real time content loudness matches the dialnorm setting of the AC 3 encoder H 6 4 File Based Content File based storage makes it practical to automate the loudness measurement and to adjust the content loudness and or the dialnorm value if any that may have been assigned to the content See Section 5 2 6 of the RP Key Idea Ensure file based content agrees with the station s dialnorm setting in the AC 3 encoder H 7 TARGET LOUDNESS TO FACILITATE PROGRAM EXCHANGE See section 6 of the RP Target Loudn
60. ex E for guidance on acceptable loudness changes across the boundaries between programs and interstitials e g commercials promos iii Describe the expected loudness of interstitial material for the program supplier in an effort to create awareness of this situation possible listener dissatisfaction and potential negative impact on the show b Ensure the use of the proper sound pressure level in rooms used to mix and monitor content soundtracks i Refer content suppliers to Section 10 of this RP endorsing the use of a suitable SPL monitoring level during content post production The selected SPL monitoring level should be appropriate for the size of the mixing room with consideration for the listening environment of the typical DTV audience Lower SPL monitoring levels in post production monitoring environments yield louder mixes and more contained appropriate dynamic range than traditional louder environments A properly selected post production or playback monitoring environment is essential in establishing appropriate mixing levels for digital television ii Consider specifying a maximum true peak value for the soundtrack This practice constrains dynamic range by reducing headroom It permits the audience to adjust the overall volume level with less risk of large loudness variation 4 The AC 3 DRC system should not be relied upon to control program to interstitial loudness variations 8 4 TV Station and MVPD Local Ad Insertion M
61. g Level of the short form content M This measurement method provides an acceptable loudness match between narrow dynamic range short form content and wide dynamic range long form content that is normalized using a dialog based loudness measurement Those choosing to create and deliver wide dynamic range short form content should note that the louder elements of this type of material will increase the loudness measured with a long term integrated method and consequently reduce the perceived Anchor Element loudness after normalization This can cause an unacceptable match to long form material measured with an anchor based method 5 2 5 Downmix Loudness Measurement and Monitoring During production the loudness of surround programming should be measured in both its surround mix format and in its 2 channel downmix This is necessary because of the high percentage of consumers experiencing the downmix of surround programming and the possible loudness disparity between the two formats MI The downmix loudness measurement should be performed using the same downmix metadata coefficients as carried in downstream AC 3 bitstreams or used by discrete downmix equipment This approach will identify any loudness build up in the 2 channel downmix Loudness build up is typically caused by content mixed in phase to the three front channels and can result in downmix loudness exceeding that of the surround mix by up to 3dB One method 17 ATSC A 85 2013 Esta
62. g and stopping the measurement period predictably A low level absolute gate was added to exclude any content below 70 LKFS effectively freezing the integration period and eliminating any measurement differences caused by an operator pushing a start stop button at different times This greatly simplifies the task of measuring the loudness of the dialog or Anchor Element when the audio is not continuous Since the remaining sound intensities above 70 LKFS contribute to the overall loudness measurement quiet sounds like environmental noises are included in the calculations reducing the measured overall loudness Perceptually however a quiet period during a program typically doesn t substantially affect the listener s impression of the overall loudness of the program A second relative gate was added to overcome this effect The threshold of the relative gate is set 10 dB below the absolute gated measurement The relative gate removes any blocks whose loudness is below this threshold Real time measurements can be made by binning all the blocked loudness measurements rather like a histogram and recalculating the relative gated loudness every 100 ms by ignoring any bins whose loudness is more than 10 dB below the absolute gated measurement Users should be aware that a long term integrated relative gated measurement of wide dynamic range content will indicate that the content is louder than an anchor based measurement of the
63. graphic M When the section header is checked the entire section is deemed critical 3 2 Treatment of Syntactic Elements This document contains symbolic references to syntactic elements used in the audio video and transport coding subsystems These references are typographically distinguished by the use of a different font e g restricted may contain the underscore character e g sequence_end_code and may consist of character strings that are not English words e g dynrng 3 3 Acronyms and Abbreviation The following acronyms and abbreviations are used within this document AC 3 Digital Audio Compression Standard AC 3 E AC 3 as described in ATSC A 52 4 ATSC Advanced Television Systems Committee dB decibel dB FS decibels relative to full scale sine wave per AES17 21 dB TP decibels true peak relative to full scale per BS 1770 Annex 2 3 DRC dynamic range control GPI general purpose interface LKFS loudness K weighted relative to full scale measured with equipment that implements the algorithm specified by BS 1770 3 A unit of LKFS is equivalent to a decibel ITU International Telecommunication Union 12 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 MVPD Multichannel Video Programming Distributor includes DBS service operators local cable system operators and cable multiple system operators SPL sound pressure level in decibels referenced
64. hat is capable of simultaneously satisfying different viewers with different listening environments and situations Two types of DRC control signals are calculated because there are two different scenarios where gain reduction expansion may be desired One situation is where a gentle reduction of dynamic range is desired This is default mode of many devices that that use baseband analog interconnections often available on red and white RCA connector outputs These connections are not a professional interface but are often an acceptable consumer interface This DRC control signal is called Line Mode in the encoder control interface in most professional products because it usually refers to the line level interconnection process The name of this metadata parameter is dynrng A second more aggressive DRC control signal is simultaneously calculated It is designed to be used in products that could be interconnected by an analog RF modulated signal The dynamic range of these RF modulated signals is usually less than signals using the line level interconnection process RF mode adds 11 dB of signal boost for loudness matching purposes with legacy analog systems This mode is the default for MVPD set top boxes as well as TVs with built in DTV tuners and small built in speakers The name of this metadata parameter is compr See CEA CEB11 25 for more information about loudness matching recommendations for consumer products The recommendations
65. he diagram In this graph the correlation of subjective assessments to measured values is 0 977 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex A 12 March 2013 20 7 T T T 7 T j vi i h H i O A 1st dataset 15 b ond dataset foo Peseeesperenee EES af ee J O 3rd dataset A ae A Or ee ie E haw pected ee a hE ERAR KPID T J E coe E AATE O E A A T ee hope Dar N 4 3 A De S AO o Or A EE 4h ToO EENS EVE ANETE VERT TETA 4 2 3 We oO 8 lt 5 EE An aasi i E E E E E E 4 wO Aoh E fe SEINE ry SF besten ee eai Lh ase ERREA J PO SE 15 fi fi fi i fi 15 10 5 0 5 10 15 20 Subjective Loudness dB Figure A 4 Combined results for all three datasets r 0 977 A refinement was the addition of two level driven gating mechanisms The basic BS 1770 loudness measurement is an integrated or averaged power measurement made over the entire duration of the content being measured The gated measurement algorithm works in a similar fashion but calculates the weighted average power in 400 ms blocks of time updating the average power of this interval every 100 ms Repeatable measurements especially over relatively short pieces of content depend on startin
66. he effects of reflections can be properly accounted for e Capture frequency domain information with sufficiently high resolution in the low frequencies to address the typical problems found in small rooms e Combine multiple measurements from the listening area to account for low frequency variations due to standing waves e Reduce modal ringing The requirements for effective room correction can be met by using FIR filters for the equalization solution However standard textbook FIR approaches are not sufficient Frequency weighting methods are typically employed in the FIR filter design in order to distribute the filter power non linearly with frequency thus allocating more correction power to the lower frequencies Furthermore simple spatial averaging of multiple measurements is also not sufficient The spatial distribution of acoustical problems is not uniform and some locations exhibit larger problems than others A non linear spatial weighting method also needs to be employed when combining the measurements 46 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex D 12 March 2013 Annex D Quick Reference on Reference Monitor Setup for Television D 1 PRODUCTION AND POSTPRODUCTION SPACE MONITOR SETUP 1 Copy the first test tone 1 kHz sine wave at 20 dB FS to a digital audio workstation and then copy it into each of the channels to be employed such as 5 1 2 Play the file setting the output level controls of
67. ialnorm value is set to match the measured loudness for the next stage of encoding In this case either the operator needs to modify the network originator s loudness to match the target value of the operator s system or the originator s loudness value as measured will be used as the dialnorm value in the next stage of AC 3 encoding At this re encoding stage the operator needs to also ensure that the other audio metadata parameters are set appropriately 65 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex H 12 March 2013 Key Idea If the network originator s feed is decoded to baseband ensure that the measured Dialog Level of the content matches the dialnorm setting of the next stage of AC 3 encoding H 11 AC 3 DYNAMIC RANGE CONTROL DRC The AC 3 system includes DRC profiles for Line mode and RF mode While choosing these parameters may be useful to the operator and viewer for limiting the overall loudness range DRC should not be relied upon to correct loudness variations between programs programs and commercials or between TV stations or cable channels and during channel changes See Section 8 3 and Annex F of the RP Key Idea AC 3 Dynamic Range Control should not be relied upon to mitigate program to commercial or station to station loudness variations 66 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex 12 March 2013 Annex I Quick Reference Guide for Audio Mi
68. ighting It should be measured at the position of the center of the head of a listener seated at the main mixing location with the meter oriented towards the channel under calibration Note that the body of the person making the measurement should be to one side of the meter not behind it relative to the source where the reflection off the body may affect the reading The standard acoustical reference level for each category and for various room volumes within the categories is given in Table 10 2 Table 10 2 Reference Sound Pressure Level Categories Room Volume in Cubic Feet SPL in dB re 20 pN m2 gt 20 000 85 10 000 lt 19 999 82 5 000 lt 9 999 80 1 500 lt 4 999 78 lt 1 499 76 Depends on room usage For editing purposes may be controlled by the editor for use with the material at hand For final program mixing follow the recommendations for categories II above lt 1 500 76 Use 2 cc coupler and set 440 Hz level to 74 dB Per SMPTE RP 200 6 Step 3 Dub duplicate the original speech file downloaded see below using unity gain into the center or mono channel of the playback device in use In the case where only two channel stereo monitoring exists dub the original speech into left and right channels with an attenuation of 3 dB into each of the channels and confirm that they are in sync to the sample Ensure that the signal path is the same as was used in step above so that
69. ill read differently on noise than on tone since the peak level of this stochastic random noise source is higher than its RMS level which in turn is higher than its average level On peak meters the range will be around 10 dB higher than the sine wave tone whereas on a VU meter which is an average responding RMS calibrated instrument to IEEE C16 5 1954 the level will read about 1 dB low Apply this signal to each channel in turn and adjust the appropriate controls affecting the monitor level only for the standard acoustical level In any one given installation the items that could affect monitor acoustical level are the monitor output level control on the console which it is suggested be set to a marked standard any room loudspeaker equalizer level controls and any power amplifier gain controls or powered loudspeaker input sensitivity controls Ordinary issues of headroom and signal to noise ratio through a chain of multiple units apply 8 The Standard Volume Indicator was first standardized in an American Standard originally published in 1942 26 revised in 1954 rewritten as an IEEE ANSI standard in 1991 and withdrawn in 1999 The ANSI designation C16 5 no longer exists 35 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 The standard acoustical level should be measured using a sound level meter preferably meeting the standard ANSI S1 4 29 Type 2 using slow reading 1 s integration time and C we
70. in this document and in CEA CEB11 were coordinated and are complementary 9 1 2 Monitoring There are benefits to using a system where the DRC gain words see Annex F are calculated in the encoder and applied at the decoder One benefit of this type of system design is that it allows the compressor limiter functions to be accurately previewed or emulated during production well before the content ever gets encoded Producers can check content as it is being produced to hear how they will sound using pre established dynamic range modes While the effects of this DRC system will be audible to mixers in a professional monitoring environment the process should be considered in the context of the typical consumer in a typical home environment where such gain reduction and expansion are usually not noticeable In most situations the effects of Line Mode DRC generally improve the portrayal of the content by better fitting the audio within the reproduction capabilities of the viewer s equipment and listening space 28 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 9 1 3 Relation to dialnorm All DRC calculations are relative to and based on the indicated loudness of content as represented by the dialnorm metadata parameter In other words the encoder needs to know how loud the content is intended to be so it can determine when the content is either too loud or too quiet dialnorm effectively sets this t
71. istent and confusing situation found in practice today This RP provides guidance that if followed will result in consistency in loudness and avoidance of signal clipping The specified measurement techniques are based on the loudness and true peak measurements defined by ITU R Recommendation BS 1770 Algorithms to measure audio programme loudness and true peak audio level 3 The details of both measurements are in Annex A 5 1 Overview of the BS 1770 Loudness Measurement Method Loudness is measured by integrating or averaging the frequency weighted power of the audio signals in all channels over the duration of the measurement The general structure of the algorithm is shown in Figure 5 1 L gt K Filter Average Power Channel Weighting R gt K Filter Average Power Channel Weighting Measured c K Filter Average Power Channel Weighting gt 5 L dness Ls gt K Filter Average Power Channel Weighting Rs K Filter Average Power Channel Weighting Figure 5 1 BS 1770 loudness algorithm The BS 1770 method was validated in listening tests by comparing its results to the relative subjective loudness of mono stereo and multichannel program material Measured loudness is reported as LKFS A unit of LKFS is the same measure as a decibel A 15 LKFS program can be made to match the loudness of a quieter 22 LKFS program by attenuati
72. ithout metadata matches the pre determined reversion dialnorm parameter Reversion may also be used to protect against a loss of metadata recognizing that the reversion metadata parameters may not exactly match that of the content In the event of a metadata loss all content being encoded under reversion will be subject to these parameters It is especially critical that channel mode be set in a fashion to protect all content under any circumstance The inadvertent use of 2 0 channel mode with 5 1 content will eliminate channels 3 6 of the encoded audio and put the content at risk 7 5 8 Advantages of Agile Metadata An agile system presents the most flexibility for the content provider without imposing creative limitations 8 METHODS TO EFFECTIVELY CONTROL PROGRAMN TO INTERSTITIAL LOUDNESS The ATSC digital television audio system AC 3 with its expanded dynamic range and new techniques for managing loudness presents the possibility of undesirable loudness changes at 24 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 transitions to and from various pieces of content if not managed properly This condition is known to annoy the audience by frequently forcing the listener to adjust the audio levels at transitions to maintain a comfortable volume This condition can be alleviated when proper DTV loudness management is applied AC 3 incorporates the necessary technology to mitigate variations in loudness during prog
73. m to optionally restrict the dynamic range of the content is designed into the DTV audio system In simple terms the AC 3 Dynamic Range Control DRC system can be considered a compressor limiter that is split in half the control signal is generated in the encoder and the control signal can be applied in the decoder With this system the audio itself is not modified from the original until it reaches the consumer s decoder so the dynamic range control may be considered reversible Details of how DRC works are provided in Annex F Metadata is always transmitted as part of the coded audio bit stream AC 3 encoders require certain metadata parameters to be specifically set DRC is one of these parameters and dialnorm or the indicated loudness parameter is another While separate from dialnorm correct operation of the DRC system is very dependent on the proper measurement of content and indication of the loudness of that content See Section 5 of this document for measurement guidelines The DRC system modifies the dynamic range of the decoded content by reducing the level of very loud portions of the content to avoid annoying the viewer and by raising the level of very quiet portions of the content so that they are better adapted to the listening environment If however the viewer has a home theatre or other listening environment that can reproduce the full dynamic range of the content the viewer can choose to turn off the DRC system allowing
74. mine the dynamic range compression down mixing matrix decoding and filtering used in certain operating modes of the professional encoder and consumer decoder Optimizing the setting of these parameters for each program may enhance the listening experience under varying listening conditions and 4 Tf the content is stored in rate reduced coded format it may be necessary to decode the content alter its level and then re encode the content 18 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 with certain content types However default values may be used without detriment to the listening experience Critical control metadata which includes two parameters that are critical for proper encoding and decoding e Channel mode acmod which should be chosen correctly to engage proper channel formatting in the decoder to match the content Improper use of this parameter may alter a transmission and cause the loss of dialog when encoding a 5 1 program e g encoding a 5 1 channel soundtrack with 2 0 metadata e Dialog level dialnorm which A 53 requires to be set correctly to prevent potentially severe loudness variation during content transitions on a channel and when channel changing across the DTV dial Incorrect dialnorm values can lead to a variation in loudness as large as 30 dB Apart from the dialnorm parameter default values may be used for most of the other metadata parameters with acceptable results O
75. mix Level When the stereo downmix is Lt Rt designates downmix level for the C channel when end user has no center speaker Lo Ro C Downmix Level When the stereo downmix is Lo Ro designates downmix level for the channel when end user has no center speaker Surround Downmix Level Designates downmix level for surround channels when end user has no surround speakers Note that the 0 999 dB setting discards the surround channels Lt Rt S Downmix Level When the stereo downmix is Lt Rt designates downmix level for surround channels when end user has no surround speakers Lo Ro S Downmix Level When the stereo downmix is Lo Ro designates downmix level for surround channels when end user has no surround speakers Extended bit stream parameters Not supported by all decoders 60 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex G 12 March 2013 Table G 4 Informational Metadata Function Description Audio Production Info Indicates whether the Mixing Level and Room Type parameter settings are carried in the bitstream Bitstream Mode Describes the audio service carried in the bitstream Copyright Indicates whether the encoded bitstream is copyright protected Mixing Level Indicates absolute acoustic SPL of main dialogue channel during final mixing session Room Type Indicates size and calibration of the mixing room used for the final mixing session Original Bitstream I
76. nce mixers and producers become more familiar with these parameters by monitoring using available emulation systems they can select values that further optimize the presentation of their program content 7 1 Importance of dialnorm Carriage of and correct setting of the value of dialnorm is mandatory for DTV broadcasting in the United States see ATSC A 53 Part 5 2010 Section 5 5 Dialogue Level 1 This RP identifies methods to ensure consistent digital television loudness through the proper use of dialnorm metadata for all content and thus comply with A 53 Many of the principles for successful management of dialnorm may also apply to the management of other AC 3 metadata parameters As indicated in Section 6 minor measurement variations of up to approximately 2 dB are anticipated and these may lead to minor variations between the value of dialnorm and the actual program loudness These minor variations are acceptable due to the comfort zone see Annex E however operators should not intentionally operate at the high or low side of this range 7 2 Metadata Management Modes The requirement for accurate dialnorm channel mode acmod and other metadata can be met in three different ways at the discretion of the operator e Fixed metadata The AC 3 encoder Dialog Level is fixed to a single value and the content Dialog Levels are conformed to that setting e Preset metadata AC 3 encoder presets are programmed e
77. ncoded audio content See Figure 7 1 Source 1 ATSC Audio Distribution Process AC3 ha ES arate Encoder istribution Internal Source n and yee Switching Values Audio Consumer AC 3 Audio Decoder Figure 7 1 Fixed metadata concept 7 3 1 Setting dialnorm by Long Term Averaging Method An operator can achieve a first approximation of compliance with A 53 by measuring the long term average loudness of the station output and setting the AC 3 encoder dialnorm parameter equal to this value The averaging period should be chosen to include all types of content If the Dialog Level of individual pieces of content deviate significantly from that long term average the dialnorm parameter will not properly reflect the Dialog Level of that content This situation should be addressed by the program originator or operator and corrected see Sections 7 3 2 and 7 3 3 This method may not apply to operators using content with intentionally wide dynamic range 7 3 2 Setting dialnorm for Production A content delivery specification should specify the Target Loudness for all content This establishes the anchor for layering of music and effects for the soundtrack Loudness should be measured with a meter using the BS 1770 recommendation 3 to confirm the average loudness of dialog The supplier should indicate the actual average loudness with the deliverable Cooperation between content supplier and recipient is necessary to achieve succes
78. ndicates whether the encoded AC 3 bitstream is the master version A D Converter Type Identifies the setting for type of A D converter 61 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex H 12 March 2013 Annex H Quick Reference Guide for Station and MVPD Engineers Audio Loudness Management H 1 INTRODUCTION Note This Quick Reference Guide is based on the ATSC Recommended Practice Techniques for Establishing and Maintaining Audio Loudness for Digital Television A 85 the RP and its full text is found as Annex H of the RP This Quick Reference Guide summarizes the recommendations in the RP and provides guidance to broadcasters and other video program distributors on controlling and maintaining consistent audio loudness of their TV stations and channels H 2 SCOPE This Quick Reference Guide is not intended to replace the complete RP Its scope is limited to a how to guide for television station operators and MVPDs Readers of this document are encouraged to review the complete RP for more detailed information and the background to this Guide In the event of a conflict between the Guide and the RP the RP takes priority over this Quick Reference Guide This Quick Reference Guide is based on the use of a fixed metadata system see Section 7 3 in the RP H 3 DEFINITIONS Anchor Element The perceptual loudness reference point or element around which other elements are balance
79. ne prevents the viewer from selecting a DRC choice or enabling features such as Late Night or Midnight in some consumer equipment that use the RF DRC mode e Dynamic range should be controlled in another fashion by the operator or by the program originator e There is a possibility that DTV sets with limited volume capacity will exceed their reproduction range e The RF mode DRC control words are also used for protection limiting Protection limiting prevents clipping in consumer decoders which could result from an inappropriate dialnorm setting combined with very dynamic programming Choosing None will not prevent the generation of protection limiting DRC control words which are very aggressive and not very artistic The protection limiting process has a very short attack time and a very long release time and might cause objectionable audible artifacts e Systems using ATSC signals as a source for SD distribution e g SD analog cable tiers will not be able to use the RF mode DRC to establish an acceptable analog SD signal with reduced dynamic range See ATSC Recommended Practice A 79 24 for additional guidance In order for the AC 3 dynamic range control system to be functional a profile other than None should be used by operators when appropriate 9 2 Dynamic Range Processing With Metadata Interface Reversible Audio metadata processors have been developed that allow creation of different Line Mode and
80. ng in Digital Television Annex K Requirements for Establishing and Maintaining Audio Loudness of Commercial Advertising in Digital Television When Using Non AC 3 Audio Codecs 2 REFERENCES All referenced documents are subject to revision Users of this Recommended Practice are cautioned that newer editions might or might not be compatible 2 1 Informative References The following documents contain information that may be helpful in applying this Recommended Practice 1 2 10 11 ATSC ATSC Digital Television Standard Part 5 AC 3 Audio System Characteristics Doc A 53 Part 5 2010 Advanced Television Systems Committee Washington D C 6 July 2010 IEEE Use of the International Systems of Units SI The Modern Metric System Doc TEEE ASTM SI 10 2002 Institute of Electrical and Electronics Engineers New York N Y ITU Algorithms to measure audio programme loudness and true peak audio level Recommendation ITU R BS 1770 3 International Telecommunications Union Geneva 2012 ATSC Digital Audio Compression AC 3 E AC 3 Standard Doc A 52 2012 Advanced Television Systems Committee Washington D C 17 December 2012 TEC Sound system equipment Part 17 Standard volume indicators Doc IEC 60268 17 1990 International Electrotechnical Commission Geneva SMPTE SMPTE RP 200 Relative and Absolute Sound Pressure Levels for Motion Picture Multichannel Sound Systems
81. ng it by 7 dB Complete details of loudness and true peak measurement methods are given in Annex A 2 The Low Frequency Effects LFE channel is not included in the BS 1770 algorithm 15 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 5 2 Overview of Making Loudness Measurements The ATSC A 53 Digital Television Standard mandates that the value of the dialnorm parameter present in the AC 3 or E AC 3 elementary stream indicates the loudness of the encoded audio content The goal for correctly managing loudness is to measure and label the average loudness of each piece of content regardless of the content s dynamic range The use of this measured average loudness value to normalize the content helps eliminate annoying changes of loudness from one piece of content to another and is intended to enable listeners to enjoy programming with minimal adjustment of their volume controls It is important to understand that in most cases the measured loudness of the Anchor Element closely represents the overall or average subjective loudness of any piece of content The loud and soft portions of a mix create the dynamics of the program around the Anchor Element Because of this the average loudness of the Anchor Element of the mix may be used as a proxy for the overall loudness of the content Dialog forms the Anchor Element in the majority of television programs During production it is usually possible to isolate the
82. nion Geneva Switzerland Soulodre G A and Lavoie M C Stereo and Multichannel Loudness Perception and Metering 119th Convention of the Audio Engineering Society San Francisco Preprint 6618 AES TD 1001 Multichannel sound systems and operations Audio Engineering Society New York NY www aes org technical documents AESTD1001 pdf free download Note that an exception was made in the 2001 edition of this document the reference listening levels in Section 7 4 1 of the document were deleted ITU Requirements for Loudness and True Peak Indicating Meters Recommendation ITU R BS 1771 1 International Telecommunications Union Geneva Switzerland Eargle J M Audio Monitoring in Contemporary Post Production Environments SMPTE Journal Society of Motion Picture and Television Engineers White Plains NY January 2005 Holman Tomlinson Surround Sound Up and Running Focal Press 2nd edition 2007 originally ISBN 10 0240808290 ISBN 13 978 0240808291 AES AES3 2003 Digital input output interfacing Serial transmission format for two channel linearly represented digital audio data Audio Engineering Society New York N Y Dolby Model DP569 Dolby Digital Multichannel Audio Encoder User s Manual Dolby Laboratories Inc San Francisco CA Dolby Dolby DP570 Multichannel Audio Tool User s Manual Dolby Laboratories Inc San Francisco CA AES AES17 1998 12004 AE
83. nnel s loudness target value within 2 dB MI 2 Per BS 1770 the LFE is not included 72
84. o signal This measurement is intended for mono stereo and multichannel signals A separate ITU R Recommendation BS 1771 15 defines the measurement of short term and momentary loudness The first phase of the ITU R study was to develop a subjective test method to examine the perceived or subjective loudness of typical monophonic program material A three member panel made up of ITU R Working Party 6P SRG3 members selected 48 test sequences consisting of a wide range of broadcast material and a reference sequence consisting of English female speech was chosen to establish a target loudness level The playback level of the reference sequence was set to 60 dB SPL A weighted slow In addition to the use of a single common reference sequence test subjects were encouraged to compare all test sequences to one another This test approach is similar to 9 where the subject is able to compare various test items to each other within a trial Each of the test sequences was repeated at two amplitude levels thus creating a total of 96 monophonic audio sequences for the subjects to match A total of 97 subjects participated at five different test sites The results of the subjective tests 10 formed the basis for evaluating the performance of various proposed loudness algorithms or meters Seven proponents submitted ten monophonic loudness meters algorithms for evaluation In addition to the meters two additional loudness algorithms were submitted by the evalu
85. o the absolute value of the Dialog Level The value of 0 is reserved Dolby E Dolby E is an audio data rate reduction technology designed for use in contribution and distribution which also conveys Dolby E metadata Dolby E metadata Metadata that is multiplexed into the Dolby E bitstream Each metadata element is classified as either a Professional or Consumer type of metadata Dolby E metadata is documented in SMPTE RDD 6 22 Downstream A point in in a distribution chain containing assembled content where some content boundaries may not be readily identifiable using automated methods DRC Profile A collection of parameters that describe how dynamic range control metadata is calculated file based scaling device A device used to apply an overall gain correction to audio content stored as files framesync Short term for frame synchronizer layback A post production technique where audio is re joined with the associated video after editing mixing or sweetening The term Dialog Level is based on dialog s widespread use as the Anchor Element for mixing of content 13 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 long form content Show or program related material or essence The typical duration is greater than approximately two to three minutes loudness A perceptual quantity the magnitude of the physiological effect produced when a sound stimulates the ea
86. ore AC 3 encoding Lowpass Filter Applies a lowpass filter to the main input channels before AC 3 encoding LFE Lowpass Filter Applies a 120 Hz lowpass filter to LFE channel before AC 3 encoding Surround 3 dB Attenuation Attenuates surround channels 3 dB before encoding Surround Phase Shift Function Before encoding creates the phase shifted surround channels necessary to create an Lt Rt output in a decoder that can be decoded using Dolby Pro Logic to L C R S Table G 3 Basic Control Metadata Decoder Description LFE Channel Indicates whether LFE channel is present Available only in channel modes 3 2 3 1 3 0 2 2 and 2 1 DRC Line Mode Profile Designates preset Dynamic Range Control DRC compression configuration for line mode decoding DRC RF Mode Profile Designates preset Dynamic Range Control DRC compression configuration for RF mode decoding Dolby Surround Mode Indicates whether a two channel encoded bitstream contains a Dolby Surround Lt Rt program and requires Dolby Pro Logic decoding Dolby Surround EX Mode Identifies if audio is encoded as a Dolby Surround EX stream and requires Surround EX decoding Preferred Stereo Downmix Designates preference for Lt Rt Pro Logic encoded or Lo Ro stereo only downmix Center Downmix Level Designates downmix level for the C channel when end user has no center speaker Lt Rt C Down
87. ort loudness in units of LKFS dB TP decibels true peak relative to full scale per BS 1770 Annex 2 3 Dialog Level The loudness in LKFS units of the Anchor Element dialnorm An AC 3 metadata parameter defined in A 52 4 that is carried in the AC 3 bit stream This is used to indicate how far the average Dialog Level is below 0 LKFS Valid values are 1 to 31 Loudness values outside this range cannot be expressed by dialnorm The value of dialnorm is numerically equal to the absolute value of the Dialog Level The value of 0 is reserved LKFS loudness K weighted relative to full scale measured with equipment that implements the algorithm specified by BS 1770 3 A unit of LKFS is equivalent to a decibel Target Loudness A specified value for the Anchor Element 1 e Dialog Level established to facilitate content exchange from a supplier to an operator This Quick Reference Guide is based on the use of a fixed metadata system see Section 7 3 in the RP Agile metadata is an alternative to using the fixed metadata system approach See Section 7 5 of the RP for details concerning the agile metadata approach The term Dialog Level is based on dialog s widespread use as the anchor for mixing of content and historically it was felt that for most programs dialog would be the Anchor Element 67 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex 12 March 2013 1 4 MONITORING ENVIRON
88. ost listeners to one of the six points shown in the diagram They could switch at will between the reference and test items as often as they wanted Once they had settled on the requested loudness difference the offset was recorded The questions were asked in random order and the order of presentation of the pairs of reference and test items was randomized between subjects The Reference item and Test items were taken from another experiment that provided a library of constant subjective loudness content Since the Reference and Test items were known to be equally loud the gain offset that the subjects applied to the test item in response to the questions from the experimenter were a direct measurement of the subjects comfort zone and the other critical loudness levels investigated It is interesting to note that a gain increase of two to three dB in level is enough to move the subjective loudness of a typical program out of the listener s Comfort Zone and toward the point 49 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex E 12 March 2013 at which they would like to turn the volume down There is more latitude available on the softer side of the comfortable volume point shown here as 0 The ambient noise level in the listening room used for the tests was quite low similar to that in a rural living room on a tranquil evening Since the Annoyingly Soft point can reasonably be expected to fall some
89. ostproduction spaces are defined They are listed in Table 10 1 which shows the characteristics for each type Table 10 1 Categories of Audio Control Rooms used in Television Production Characteristics Principal audio monitoring control rooms with specialized acoustics and sound systems Channel range up to 5 1 3 front 2 surround 0 1 low frequency enhancement Well isolated from other operations Widest frequency and dynamic ranges equal to best home cinemas properly aligned This type of room may be used for quality control at the network level for example checking program material for conformance to delivery requirements when a question arises at ingest stations Sound monitor quality dominates over production requirements in this category of room Broadcast organizations might be expected to have only a small number of such rooms Audio mostly production spaces with equipment needs and placement supplanting absolute audio monitoring conditions although audio monitoring is still expected to be good Channel number equal to highest number used for material originating in the room Good isolation from other operations This type of room may be used for program origination with its output occasionally subject to check in a Category room Low frequency range and headroom may be somewhat restricted compared to a Category room Audio editing spaces premix and prelay rooms and other spaces the output of which is typically expected to b
90. ot be maintained A running measurement of the production loudness from its start to any point thereafter may also provide an indication of the average loudness of the production 5 2 3 Measurement of Finished Long Form Content A section of the content that is representative of the Anchor Element typically dialog should be isolated and measured and reported as the Dialog Level of the long form content In the absence of a specific Anchor Element the loudness of the element of the content that a reasonable viewer would focus on when setting the volume control should be measured and reported as the Dialog Level of the long form content If neither technique is possible or practical then the long term integrated or average loudness of the full program mix of the content over the entire duration should be measured and reported as the Dialog Level of the long form content MI When measuring long form content momentary or short term loudness measurements as these terms are defined in BS 1771 15 are not expected to be one constant value Loudness variations during portions of the content that do not represent the Anchor Element may deviate from the target value These variations are acceptable due to the intended dynamic range of properly mixed content 5 2 4 Measurement of Short Form Content A measurement of the long term integrated or average loudness of the content s full program mix over its duration should be reported as the Dialo
91. ows for processing that may be more aggressive than the original five profiles The original audio is delivered to the consumer and the DRC values will be applied to the audio by default but can also be bypassed by the viewer if desired 30 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 9 3 Dynamic Range Processing Without Metadata Interface Irreversible In analog AM radio broadcasting automatic gain control AGC systems were created to ensure proper modulation of a broadcast station s carrier frequency Incorrect modulation could lessen coverage if the average level was too low and could cause interference if the peak level was too high This was made more difficult by FM systems which employ pre emphasis to increase the level of high frequencies prior to transmission to minimize noise Essentially these AGC systems often termed modulation controllers were then controlling the peak to average ratio of program audio and thus controlling dynamic range Unlike the metadata based approaches described above gain changes are immediately applied to the input signal prior to transmission to consumers thus the results are deemed irreversible The earliest AGC systems were wideband using a single level detector controlling a single gain element to control the entire audio bandwidth In the early days of broadcasting this may have been an acceptable practice because audio bandwidth was restricted and better audio
92. r measured loudness The magnitude of an audio signal when measured with equipment that implements the algorithm specified by BS 1770 3 It is an approximation of perceived loudness mixing level An optional metadata parameter in the AC 3 bit stream that allows indication of the absolute sound pressure level calibration of the mixing studio that produced the content Operator A television network broadcast station DBS service local cable system or cable multiple system operator MSO or other multichannel video program distributor MVPD short form content Advertising commercial promotional or public service related material or essence Also termed interstitial content The typical duration is less than approximately two to three minutes Target Loudness A specified value for the Anchor Element i e Dialog Level established to facilitate content exchange from a supplier to an operator true peak The maximum absolute level of the signal waveform in the continuous time domain measured per BS 1770 3 Its units are dB TP meaning decibels relative to nominal 100 true peak 4 THE AC 3 MULTICHANNEL AUDIO SYSTEM The ATSC AC 3 audio system is intended to deliver a reproduction of the original unprocessed content at the output of the AC 3 decoder in a receiver normalized to a uniform loudness It provides the ability for broadcasters to allow each listener the freedom to exert some control over the deg
93. ram to interstitial transitions These techniques are described below 8 1 Effective Solutions Large loudness variation during transitions can be effectively managed by ensuring dialnorm properly reflects the Dialog Level of all content 8 1 1 For Operators Using a Fixed dialnorm System See Section 7 3 M a Ensure that all content meets the Target Loudness and that long term loudness matches the dialnorm value b Employ a file based scaling device to match long term loudness of non conformant file based content to the target value c Employ a real time loudness processing device to match the loudness of non conformant real time content to the target value 8 1 2 For Operators Using an Agile dialnorm System See Section 7 5 W a Ensure that during program production post production or ingest content is measured see Section 5 2 and labeled with the correct dialnorm value matching the actual loudness of the specific content b Employ a file based measurement and authoring device to set dialnorm to the average loudness of the specific content c Employ a real time processing device to match content to a specific loudness Apply a dialnorm value matching the loudness of all content processed by this device 8 2 Adverse Conditions Notable conditions that may adversely impact program to interstitial transitions at content boundaries e Content suppliers often increase dramatic impact by using program dynamics and manipulating loudness to
94. ree of dynamic range reduction if any that best suits their listening conditions The dynamic range processing part of the system is described in Section 9 but its operation is predicated on having properly normalized content delivered to it The metadata parameter dialnorm is transmitted to the AC 3 decoder along with the encoded audio The value of the dialnorm parameter indicates the loudness of the Anchor Element of the content The dialnorm value of a very loud program might be 12 and of a soft one 27 There is an attenuator at the output of the AC 3 decoder that applies appropriate attenuation to normalize the content loudness to 31 LKFS If the dialnorm parameter accurately reflects the overall loudness of the content then listeners will be able to set their volume controls to their preferred listening loudness level and will not have to change the volume when the audio changes from program to advertisement and back again If all broadcasters use the system properly the loudness will also be consistent across channels Section 7 describes three methods of using audio metadata Fixed Preset and Agile Any one of these approaches will deliver consistent loudness to the listeners the broadcaster is free to use the method that best suits their operational practices Whichever approach is selected the system depends on transmitting a value of dialnorm that correctly represents the Dialog Level of the content which depends in turn on a
95. representing the broadcast broadcast equipment motion picture consumer electronics computer cable satellite and semiconductor industries ATSC Digital TV Standards include digital high definition television HDTV standard definition television SDTV data broadcasting multichannel surround sound audio and satellite direct to home broadcasting Note The user s attention is called to the possibility that compliance with this recommended practice may require use of an invention covered by patent rights By publication of this document no position is taken with respect to the validity of this claim or of any patent rights in connection therewith One or more patent holders have however filed a statement regarding the terms on which such patent holder s may be willing to grant a license under these rights to individuals or entities desiring to obtain such a license Details may be obtained from the ATSC Secretary and the patent holder Revision History Version Date A 85 2009 approved 4 November 2009 A 85 2011 successor document approved 25 July 2011 A 85 2013 successor document approved 12 March 2013 Recommended Practice A 85 2013 12 March 2013 is the successor document to Recommended Practice A 85 2011 25 July 2011 ATSC A 85 2013 1 2 2 REFERENCES 2 1 3 DEFINITION OF TERMS 3 1 3 2 3 3 3 4 4 THE AC 3 MULTICHANNEL AUDIO SYSTEM 5 LOUDNESS MEASUREMENT 5 1 5 2 Establishing
96. results of this third subjective test 13 resulted in the design of the multichannel loudness algorithm shown in Figure A 1 x Pre fiter H RLB RMS HG Xr gt Pre filter RLB m RMS Bi Gr xc gt Preflter H RLB 221 RMS ce gt sae xs gt Pre filter H RLB HH RMS F G A Xr gt Preilter H RLB HST RMS 1 Ge Figure A 1 Block diagram of multichannel loudness algorithm The loudness of each channel is measured separately given a channel weighting gain GL etc and summed to give the total resulting loudness value In addition to the RLB weighting curve shown in Figure A 2 a pre filter is added to account for the effects of the human head and its frequency response is shown in Figure A 3 The combination of the two filters is known as a K weighting filter 39 12 March 2013 I I I I I l l l l l l l l l l f I I I I Yo PSs i a A eR es aM eg a eG NP ge ar ae a NAP eer eh ea een ere a a x Sanand DONGAN Po ae E E a S eg A og ee ET T T A a R ee a canann a a a a ty nenenenT poa ene t Aie raa RR x hee ee eee ag yt ee Agree MS bel a an mal sa ah oO gt Vege ehh eee gp tle eg eis ee lea oe ae GI se gate Sl he ww Wl l l l l l ad ee ee ee nada o l l l l l oQ l l l l l f l l l l 1 AEE Lapeer ie Dece She Oba Seen Sedna iS RE eS EA Eb ga CP ES
97. rincipal part of the DRC data generation process is the peak limiting function The clip level is known 0 dB FS thus the difference between that and the normalized peak level of the program is the maximum allowable gain before clipping If the static gain words call for more gain the smooth and limit gain block limits the static gain words to the maximum allowable gain The time constant just computed ensures that the gain changes will not be abrupt enough to cause objectionable artifacts and are appropriate to the program material Note that there are two parallel processes going on one running at the audio block rate 5 3 ms generating dynrng gain words and the other running at the sync frame rate 32 ms generating the compr gain words 55 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 Gain smoothing by the overlap amp add process il al Li lit Cross fades between audio blocks Figure F 7 Gain smoothing This is a very simple idea of how the overlap and add transitions between audio blocks happen and the effect they have on the gain steps quantized into 0 2 dB steps for dynrng and 0 4 dB steps for compr applied by the dynamic range control system The triangular shape shown in Figure F 7 is an approximation for easy drawing of the actual windowing function applied to the data dynmng words apply to an entire group of 512 samples but each group of 512 samples overlaps the adj
98. ront loudspeakers to the listening location they should be delayed The reference documents show the expected anechoic frequency response of monitor loudspeakers and systems In situ measurements of loudspeakers in control rooms however show strong deviations from the anechoic response of the loudspeakers in particular due to room boundary loading conditions at very low frequencies with standing wave modal effects through the range typically from about 80 Hz to 500 Hz For this reason room equalization is highly desirable to the point of necessity for higher quality spaces A description of the concerns for equalizing rooms is given as Annex C It has been observed that the widest deviation in octave to octave balance of broadcast mixes occurs at the frequency extremes below 100 Hz and above 8 kHz This probably occurs due to the varying responses of monitor systems in these ranges For this reason the operational room response curve given in Figure 10 1 applies to monitoring in Type I spaces Type II and IV spaces may have somewhat restricted low frequency range and headroom compared to Type I spaces Frequency response in relative dB QQQ TAAN STINO DO NO 1 QQNQONOO TANMTOODONOD verre NAN 1 3 octave band center frequency Hz Figure 10 1 Operational room electro acoustic response curve Figure 10 1 is a quasi steady state measurement to be made
99. s is not practical loudness Lk measurements may be made by direct use of formula 2 of BS 1770 3 Such measurements should not be expected to replicate measurements made as recommended in other sections of this Recommended Practice 6 TARGET LOUDNESS AND TRUE PEAK LEVELS FOR CONTENT DELIVERY OR EXCHANGE M For delivery or exchange of content without metadata and where there is no prior arrangement by the parties regarding loudness the Target Loudness value should be 24 LKFS Minor measurement variations of up to approximately 2 dB about this value are anticipated due to measurement uncertainty and are acceptable due to the comfort zone see Annex E Content loudness should not be targeted to the high or low side of this range The true peak level should be kept below 2 dB TP in order to provide headroom to avoid potential clipping due to downstream processing such as audio coding used in delivery 7 METADATA MANAGEMENT CONSIDERATIONS IMPACTING AUDIO LOUDNESS An AC 3 encoder allows the setting of up to 28 metadata parameters concerning the characteristics of the accompanying audio in the bit stream see Annex G The parameters can be classified in three groups Informational metadata which includes seven optional parameters that can be used to describe the encoded audio These parameters have no affect on encoding or the decoded listening experience in the home Basic control metadata which includes 19 parameters that deter
100. s should be prevented by a combination of absorption and diffusion on that surface Consider locating the loudspeakers to prevent console face reflections from reaching the ears of the mix engineer 44 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex B 12 March 2013 Control Mid Bass Resonance Through Placement and Equalization A perfectly flat loudspeaker measured anechoically can sound very different in the lower mid range in installations depending on its room placement due to reinforcement and canceling caused by reflections from adjacent reflective surfaces near the loudspeaker This effect can be mitigated by avoiding placing loudspeakers equidistant from two walls by bringing the loudspeakers further from the reflective surfaces or by providing mid range absorption on the adjacent walls Since the effects are rather broad equalization can be very useful in flattening the response Chose Loudspeakers with Flat Response and Broad Dispersion The above measure cannot correct for loudspeakers with an irregular frequency response or where the response varies dramatically with listening angle Such loudspeakers cannot be trusted to represent the frequency balance of the final mix Choose loudspeakers that sound consistent as you move from on axis to a position at the fringe of the listening area Look for evidence that the loudspeaker frequency response is flat across the audio range Consider the use of a sub woofer
101. same content This is because the relative gated measurement tends to focus on the loudest parts of the mix and ignore the rest The result is that a relative gated measurement will tend to underestimate the loudness of the Anchor Element of content with a wide dynamic range when compared to an anchor based measurement of the same content Normalization will therefore tend to push the anchor dialog 41 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex A 12 March 2013 loudness down relative to the loudness of the content normalized to the relative gated measurement A 3 TRUE PEAK Modern digital audio systems have greatly simplified the manipulation and distribution of audio signals Peak metering of these signals typically takes the form of displaying the maximum absolute sample value over a given measurement period This narrow focus on peak sample values has made it easy to overlook the underlying continuous waveform This can lead to unanticipated audio overloads inconsistent peak readings and other hidden problems True peak level measurements provide a more accurate description of the audio signal which can be useful in preventing these problems Figure A 5 illustrates the potential for inaccurate readings when using typical sample peak meters In this diagram the maximum level of the continuous waveform is higher than the maximum sample value This under reporting of the audio level is small at low frequencies but c
102. sful loudness management when implementing this practice 7 3 3 Content Not Conforming to the Target Loudness If the operator needs to make use of content not conforming to the established Target Loudness value an offsetting gain or loss will need to be inserted to compensate If the difference is unknown it will be necessary to measure the content loudness before compensation is applied 20 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 7 3 4 dialnorm and Loudness Quality Control To ensure the proper match between dialnorm value and loudness the operator should make use of loudness metering during quality control and when necessary make compensating adjustments to ensure the loudness meets the target value 7 3 5 Emission dialnorm Setting for Compliance with A 53 An operator receiving content that is delivered at a fixed loudness where there is no gain adjustment or processing after the receiver should set the value of dialnorm in the emission AC 3 encoder to match the originator s specified loudness often specified in contract signal specification document etc If a fixed gain or loss is applied in the signal chain the AC 3 encoder dialnorm value should be offset accordingly from the originator s loudness For example if the originator delivers audio with a loudness of 24 LKFS and no gain or loss is incurred in the chain dialnorm would be set at 24 However if a gain of 3 dB is added corre
103. should be measured using the BS 1770 recommendation 3 to confirm that the average loudness for the entire length of the production matches the chosen dialnorm value See Section 5 7 5 5 Semi Agile Metadata An operator may use an agile metadata system but choose to simplify metadata authoring and insertion operations by specifying fixed Target Loudness values to be used by content providers 7 5 6 Impact of Metadata Loss on Content A risk associated with the use of an agile metadata system is the potential for a severe discrepancy in loudness between programs and between stations if metadata is lost All AC 3 encoders with external metadata input provide a reversion feature to mitigate the impact of metadata loss With this feature the encoder can be configured to either retain the most recent metadata value or revert to an operator defined preset While this feature can minimize the impact upon the consumer the error in loudness or other metadata parameters such as channel mode can still be significant The reversion parameter should be chosen to minimize the impact of metadata loss on the presented content 7 5 7 Fixed Agile Hybrid In some instances an operator may choose to intentionally use the reversion feature to accommodate content without metadata It is critical that operators choose appropriate settings for all metadata parameters of the reversion preset particularly ensuring that the loudness of the distributed content w
104. sponding to a loudness of 21 LKFS dialnorm would be set to 21 If instead a loss of 2 dB is introduced corresponding to a loudness of 26 LKFS dialnorm would be set to 26 If an audio loudness processor is utilized the AC 3 encoder dialnorm value should be set to the Dialog Level at the output of the audio processor 7 3 6 Fixed dialnorm Advantage A fixed dialnorm system poses minimal risk to the content Fixed dialnorm has the advantage of simplicity with no requirement for additional metadata equipment or data management This approach can be used with every AC 3 encoder and is the only approach possible when using an encoder without metadata input or external GPI control 7 4 Using Preset dialnorm Metadata If the operator needs to accommodate a small number of discrete changes to the dialnorm value or other metadata parameters some AC 3 encoding systems can be configured to change between preset metadata values via external control e g with a contact closure to a GPI This method requires GPI external triggers for accurate preset signaling from the automation playlist or switcher It is often used to switch between stereo and 5 1 encoding modes even when the dialnorm remains fixed at a single value See Figure 7 2 Source 1 Audio aes Distribution l Process Audio AC 3 i Storage Encoder i Distribution Internal Source n and Auto Metadata Audio Switching mation Presets Audio Consumer a AC 3 Audio
105. ssssssssssssssesssssssssseessssesssnsesssnsessssecssnsesssnsesssnesssneesssueessnuesssneesssseeessans 32 10 1 Background 32 10 2 Characteristics of Rooms and Spaces 33 10 3 Installation 33 10 4 Reference Level Calibration 35 10 5 Test Signal Details 36 10 5 1 Definition of 0 dB FS 36 10 5 2 440 Hz Sine Wave Tone 37 10 5 3 Band Limited Pink Noise 37 ANNEX A PROGRAM LOUDNESS weiiscssicvcccctsccssnsseesesestseesesssvescsecevesssesenveccnsssanccnnvsonesernestieenncen 38 A l Introduction 38 A 2 Program Loudness 38 A3 True Peak 42 ANNEX B ROOM ACOUSTICS AND LOUDSPEAKER PLACEMENT ccssseecessseeeeessees 44 ANNEX C ROOM CORRECTION snseies fe dcccsccsesticcseevdcaccntesasededvecbececicesecsevarensseencuedssoecesseenessceces 46 ANNEX D QUICK REFERENCE ON REFERENCE MONITOR SETUP FOR TELEVISION 47 D 1 Production and Postproduction Space Monitor Setup 47 D 2 Really Quick Reference Guide for Monitor Setup 48 ANNEX E LOUDNESS RANGES iimarishe ati een ceed 49 ANNEX F AC 3 DYNAMIC RANGE CONTROL DETAILG cccssseeeeesseeeeeseneeeenseeeeeesseeeesseeeeeneneeeens 51 F 1 DRC Overview 51 F 2 AC 3 DRC Metadata Placement in Bitstreams 52 F 2 1 Calculation of AC 3 Gain Words 53 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 E22 The DRV Loudness Weighting and Input Output Curves 56 F 2 3 DRC Encoder Parameters for Setting Metadata 58 F 2 4 DRC Control Available at the Decoder 58 ANNEX G AC 3 METADATA P
106. stitial boundary issues can be improved with the use of irreversible dynamic range processing not part of the AC 3 system 9 4 Consumer Experience Viewers may interact with the DRC process within AC 3 bit streams in several ways Generally the MVPD set top box and DTVs with integrated tuner decoders will default to a specific DRC mode Usually this will be RF Mode as these devices either contain RF modulators or feed TVs directly that may only support minimal dynamic range Often the cost and flexibility of the equipment will determine how many other choices the viewer might have Home theatre A V receivers will often have the most DRC listening choices but the menus used to navigate these choices may lack consistency and interoperability with other products In this context interoperability means that terminology used to describe options may differ or conflict with terminology used in other devices For example the concept of wide standard or normal and narrow are commonly used In this context wide refers to no DRC being engaged resulting in the original wide dynamic range being reproduced Standard engages the profile selected for line mode during encoding Narrow engages the profile selected for RF mode during encoding As noted earlier RF mode adds 11 dB of signal boost for loudness matching purposes with legacy analog systems 10 AUDIO MONITORING SETUP This section documents the
107. tandard Film Light Film Standard Disabled F 2 4 DRC Control Available at the Decoder e Very little commonality between the names used by consumer devices for Line RF or Off DRC modes 58 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 e Typical decoder calls Line Mode Normal RF Mode Late Night and no dynamic range compression Theatre e The individual consumer can typically choose one of these three modes e In some higher end decoders the listener can adjust the fraction of the Line or RF mode compression applied 59 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex G 12 March 2013 Annex G AC 3 Metadata Parameters Default values set in some AC 3 encoders may have no correlation to specific operator content and should not be relied upon The AC 3 encoder manufacturer should be consulted for guidance Function Table G 1 Critical Control Metadata Bitstream Variable Description Dialogue Level dialnorm _ Controls dialogue normalization Channel Mode Function acmod Designates the number and type of active channels Table G 2 Basic Control Metadata Encoder Description Bitrate Encoded bitrate RF Overmodulation Protection When enabled the AC 3 encoder includes pre emphasis in its calculations for RF Mode compression DC Filter Applies a DC blocking 3 Hz high pass filter bef
108. that viewer to enjoy the content exactly as it was originally mixed 27 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 When the DRC system is turned off by the viewer the dynamic range of the content will be identical to the input to the AC 3 encoder Because the application of the DRC system is selectable by the viewer the AC 3 system can provide multiple listening options for the viewer with the ability to accommodate different reproduction systems and listening environments 9 1 1 Line Mode and RF Mode DRC In the AC 3 audio encoder two control signals are calculated by the DRC system 1 One control signal is used to slightly reduce the dynamic range of the content suitable for directly connected audio components 2 The other control signal is used for more aggressive dynamic range control suitable either for equipment connected with analog RF modulated signals or for equipment that is otherwise incapable of handling the potential wide dynamic range of the original signal Either control signal will be selected as the default mode by the equipment manufacturer depending on the design and intended use of the product For viewers who desire the full dynamic range of the original signal these control signals could be ignored resulting in the reproduction of the content exactly as the content producer created it This method of using separate control and audio signals results in a reversible DRC system t
109. the workstation to unity gain the console input to unity gain and the console master level control to unity gain You may keep the monitor level control low for the time being 3 Set the level of each channel in turn for the sine wave test level for the meter in use such as 20 dB FS for digital meters or 0 VU for VU meters 4 Copy the second test tone band limited pink noise at 20 dB FS to each channel of the digital audio workstation 5 Play one channel at a time 6 Put a sound level meter at the position of the center of the head of the normal position for the operator pointing it at the loudspeaker channel in use Keep your body perpendicular to the meter off to one side as the level can be affected by a strong reflection off your body 7 Put the master monitor level control at a standardized repeatable setting that you will use in mixing 8 Set the individual channel level controls such as power amplifier gain controls or gain controls on powered loudspeakers for the Sound Pressure Level measured C weighted and slow reading shown in Table D 1 Table D 1 Reference Sound Pressure Level Categories Room Volume in Cubic Feet SPL in dB re 20 N m gt 20 000 85 10 000 lt 19 999 82 5 000 lt 9 999 80 1 500 lt 4 999 78 lt 1 499 76 Depends on room usage For editing purposes may be controlled by the editor for use with the material at hand For final program mixing follow the recommendations for categories I
110. ther sections of this Recommended Practice This Annex contains all the courses of action necessary to perform effective loudness control of digital television commercial advertising J 2 LOUDNESS MANAGEMENT The Operator s goal is to present to the audience consistent audio loudness across programs commercials promotional material and public service announcements The Operator should whenever possible present the content to the audience with the most accurate and highest quality sound free of any type of audio artifacts not contained within the original content as delivered by the content supplier J 3 DIALNORM FOR AC 3 BASED SYSTEMS ATSC document A 53 Part 5 1 Section 5 5 mandates the carriage of dialnorm and correctly set dialnorm values J 4 COMMERCIAL ADVERTISING LOUDNESS It is vital that when loudness of short form content e g commercial advertising is measured it be measured in units of LKFS including all audio channels and all elements of the soundtrack over the duration of the content M J 5 COMMERCIAL ADVERTISING AT THE POINT OF INSERTION In the case of insertion of short form content e g commercial advertising it is vital that the loudness measured as per Section J 4 of the inserted short form content match the dialnorm setting of this inserted AC 3 audio stream per Section J 3 M 17 See Section 7 1 of this document 18 Per BS 1770 the LFE is not included See section 8 4 of this do
111. this playback encounters the same gain conditions as the tone of step 1 and the noise of step 2 Play the speech which is recorded with a measured loudness of 24 LKFS and observe that its level is normal If no sound level meter is available and you have skipped here from Step 2 confirm that you have performed Step 1 if you have such equipment and adjust this track for the most comfortable listening level e This speech recording has been adjusted in level so that it measures 24 LKFS to standard BS 1770 3 http www atsc org refs a85 Speech_left_Ch 20dB wav Step 4 If there is a separate subwoofer feed the low frequency effects channel electrically with the signal below and set the level of the subwoofer for 4 dB re the main channels when measuring with a C weighted slow reading sound level meter http www atsc org refs a85 LFPinkNoise_ 20dB wav 10 5 Test Signal Details 10 5 1 Definition of 0 dB FS Uncertainty in measurement of the level of noise with electrical meters has been described above in section 10 2 The definition of level in digital audio systems is given in AES17 21 where a full scale sine wave is defined as 0 dB FS this definition applies in this document However many software programs indicate level on virtual meters by means of a conventional RMS calculation 36 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 leading to a full scale sine wave reading 3 01 dB FS which is incorre
112. to extend the low frequency response See the references Audio Monitoring in Contemporary Post Production Environments 16 and Surround Sound Up and Running 17 for further information 45 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex C 12 March 2013 Annex C Room Correction Room correction is needed in monitoring environments in order to address numerous problems that arise from the interaction of sound from loudspeakers and the room The negative effects of small room acoustics produce a number of audible artifacts including imaging distortion due to unwanted reflections and frequency response anomalies that are particularly problematic in the low frequency range Traditionally room equalization has been performed with analyzer equalizer systems that use a single microphone measurement of pink noise and parametric or graphic equalizers There are severe limitations in the performance of such systems because 1 A single room measurement cannot provide enough information about the low frequency performance in the listening area 2 Pink noise can only measure the magnitude response and does not have any time response information 3 Parametric or graphic equalizers do not provide enough resolution even with 30 bands 4 The IIR filters used in such equalizers can suffer from phase anomalies particularly as they become narrower Effective room correction needs to e Capture time domain information so that t
113. where above the ambient noise level in the listening environment the figure of 19 1 dB probably depends strongly on the ambient noise level The other points are far enough above the ambient that their relative distribution should not be affected 50 ATSC A 85 2013 Establishing and Maintaining Audio Loudness Annex F 12 March 2013 Annex F AC 3 Dynamic Range Control Details F 1 DRC OVERVIEW The AC 3 system is NOT intended to mimic the audio processing used in the current analog TV system It is instead intended to provide 1 a consistent dialog loudness between programs and 2 to allow individual listeners to reduce the dynamic range excursions around this common loudness level if they so desire Calculate Gain Apply Dialnorm Words and Gain Words 1 to 6 1to6 Channels Channels of Encoder AC3 Bitstream Decoder of Program Program Audio Audio Untouched Program Audio Dialnorm amp Gain Words Figure F 1 AC 3 DRC basic concept The basic concept of the dynamic range control part of the AC 3 system is to do all the calculation required to reduce the dynamic range of the program to fit the selected dynamic range profile in the AC 3 encoder then to send the original wide dynamic range version of the program content to the AC 3 decoder along with the dynamic range control DRC gain words that describe the gain changes necessary to reduce the dynamic range of the program to fit the selected profile See Figure F 1
114. with small low diffraction microphone s at and around the principal listening position for each main channel Left Right Center Left Surround and Right Surround in turn Further information is given in Annex C Attention is also drawn to the level alignment of the Low Frequency Effects or 0 1 channel Some confusion occurs due to its requirement to have 10 dB of in band gain compared to the main channels This does not mean that it should measure 10 dB greater in sound pressure level than the main channels when calibrating Because the LFE channel s bandwidth is more limited than that 34 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 of the main channels its level playing a pink noise source of the correct spectrum and electrical level will be approximately 4 dB greater than one of the main channels 10 4 Reference Level Calibration The procedure for calibrating reference sound pressure level varies in level and method across the various categories with the objective of making programs interchangeable across a range of listening conditions The steps below should be performed as follows Step 1 Perform electrical alignment of the system under calibration using the following test Signal 1 440 Hz sine wave tone at 20 dB FS recorded on the left channel of this stereo file Import this into a digital audio workstation and duplicate it for each channel http www atsc org refs a85 440Hz_left_Ch 20dB wav
115. xchange without metadata The set up of reference monitoring environments when producing for the expanded range of digital television with consideration for multiple listening environments in the home Methods to facilitate control of loudness at program interstitial boundaries Effective uses of audio metadata for production distribution and transmission of digital content Dynamic range control within AC 3 audio and contemporary conventional dynamic range control as an addition or alternative including recommendations for loudness and dynamics management at the boundaries of programs and interstitial content Note that content measured before the effective date of this document per the methods documented in a predecessor version or as permitted by A 53 Part 5 1 need not be measured again 1 2 Organization This document is organized as follows Section 1 Outlines the scope of this document and provides a general introduction Section 2 Lists references and applicable documents Section 3 Provides a definition of terms acronyms and abbreviations for this document Section 4 Explains the technical background of the AC 3 multichannel audio system Section 5 Explains audio loudness measurement based on techniques defined by ITU R Recommendation BS 1770 Section 6 Makes recommendations for exchange of content without metadata Section 7 Provide guidelines for the practical use of agile and fixed metadata
116. xers and Editors Creating Content Commercials Promotional Materials and Programming with Audio Loudness Management 1 1 INTRODUCTION Note This Quick Reference Guide is based on the ATSC Recommended Practice Techniques Establishing and Maintaining Audio Loudness for Digital Television A 85 the RP and its full text is found as Annex I of the RP This guide summarizes the recommendations in the RP to provide guidance to audio mixers and editors creating audio for digital television It promotes a goal of managing audio loudness of content consistent with artistic intent Content includes commercials promotional materials promos and programming The term interstitials applies to both commercials and promos 1 2 SCOPE This Quick Reference is not intended to replace the RP The scope is limited to a how to guide for audio mixers and editors involved with the creation of audio for television The reader of this document is encouraged to read the RP to educate themselves on the details and background to this guideline In the event of a conflict between the Guide and the RP the RP takes priority over this Quick Reference Guide 1 3 DEFINITIONS BS 1770 Shorthand for ITU R BS 1770 3 This specifies an algorithm that provides a numerical value indicative of the perceived loudness of the content that is measured Loudness meters and measurement tools which have implemented the BS 1770 algorithm will rep
117. xternal Decoder Control Figure 7 2 The preset metadata concept gt As noted in Section 3 4 dialnorm is expressed as an unsigned integer 21 ATSC A 85 2013 Establishing and Maintaining Audio Loudness 12 March 2013 7 4 1 Implementation The implementation of preset metadata is similar to fixed metadata Predetermined preset values are loaded into the AC 3 encoder to accommodate known differences in content loudness Compliance with A 53 then requires that content be delivered with loudness matching one of the preset values and that the automation system be programmed to change presets according to different content loudness values and channel modes 7 4 2 AC 3 Framesync Requirement Some AC 3 encoders reset and disrupt the audio bit stream output when a preset is changed Based on the type of ATSC encoder being used this may result in an audible glitch on air To avoid this potential problem it may be necessary to provide an AC 3 framesync for the output of the AC 3 encoder to stabilize the AC 3 source for the ATSC encoder 7 5 Using Agile dialnorm Metadata An agile metadata system allows setting different dialnorm values for different content that has different loudness This is accomplished by embedding the dialnorm parameter within the metadata bit stream accompanying the content at an upstream location The metadata is dis embedded just prior to the AC 3 encoder and connected to its external serial metadat

Download Pdf Manuals

image

Related Search

A/85:2013 85201 zip code 85203 zip code 85201 weather 85201 sales tax 85201 county name 8520 14th ave s 85201 time zone 8520100 cnae 85201 city 85201 hsmv 85201-3x060-tx 85201 area code 85201 zip 85201-3x260-vyf 8520 14th ave s seattle

Related Contents

la priere du rosaire - Paroisse Sainte Thérèse de l`Enfant Jésus  Weider WEBE1012 User's Manual  Peak Performer 1 RCP User Manual  Dell NAV3000 GPS Receiver User Manual  SERVICE MANUAL  Les rythmes scolaires - Maison de la jeunesse d`auxerre  iRhythms 3 Piece Subwoofer & Satellite Speaker System, Black  Istruzioni d`Uso - Amazon Web Services  Nouvelle Procédure Ventolin_mars 2013  取扱説明書 700-TAP017  

Copyright © All rights reserved.
Failed to retrieve file