CINERAMAX
06-07-07, 11:14 PM
In the quest to better understanding what an ideal post production room should be, I have come across this article from Munro Associates (http://munro.co.uk/Articles/roomswithaview.pdf). These guys design/build the large custom Dynaudio Acoustic loudspeakers.
http://cineramax.com/images/Dynaudio5_1.jpg EDITED: THIS ROOM'S DESIGN IS BY ANOTHER 3rd party (NOT MUNRO - the speaker supplier).
Rooms with a View
Andy Munro
Munro Associates
Sound and pictures have always been willing bedfellows but the road to the current, exalted
state of both has been both rocky and fraught with argument.
In the 50s the cinema was the only place the public could hear reproduced sound with any
realistic quality. Magnetic sound recordings transferred to optical film offered the best
quality available with optical heads (RCA galvanometers) capable of 1/3 harmonic distortion
of less than 0.5% at 100% light modulation and a frequency response from DC to 10 KHz (-
3dB), a creditable performance even by today’s standards. The first standard for optical
sound tracks was published in 1947 and the SMPTE 35mm standard was finally approved in
1967. Before that, every film company and manufacturer vied for superiority with many
formats and patents fighting for commercial acceptance.
By this time cinema had serious rivals, for Hi Fi equipment had become a mass consumer
product and television could produce a sense of reality from immediacy and live
entertainment, if not high-resolution transmission. Film fought back with a concerted effort
to raise standards to even higher levels and electronics came to the rescue with signal
processing and power amplification which could turn a visit to the larger theatres into a
spectacular experience. This created the need for specialist companies that could provide the
technical skills to turn great production quality into suitably impressive cinematic
presentation.
Dolby Labs in San Francisco created the modern form of surround sound in the mid 70s. The
trick was to use the optical sound track space on 35mm film to fit an encoded, analogue,
stereo signal which could then be processed to produce a centre front channel, left and right
stereo, plus a discrete sub bass channel and a ‘surround’ channel.
Several benefits were derived from the new format, which apply equally well to current
multi channel modes
¦ The centre channel contained all the dialogue and those on screen effects, which need
maximum spatial stability. This is the equivalent of the original mono optical track and the
Dolby Stereo track is mono compatible to this day. The single sound source ensures that
there is no phasing or combing of the dialogue, off axis of the centre line of the theatre,
which would be detrimental to intelligibility and naturalness of reproduction. The centre
channel is the absolute reference source and all other channels must be phase and amplitude
aligned to it.
¦ The left and right channels reproduce the music and sound effects in such a way as to
create a sense of space and distance, which matches the on screen action as closely as
possible. Some effects are also fed to the centre channel for maximum impact. Occasionally
dialogue will be panned into the stereo ‘picture’, to create the effect of movement or drama.
Munro Associates 4 January, 2003
2 of 9
¦ The surround channel was, and is, used for off screen sound effects and atmosphere, such
as wind and rain, spooky echoes, etc. etc. The bandwidth of the surround channel was
limited to 7 kHz for improved noise and distortion performance but with 5.1 formatting this
is not the case.
¦ The sub channel was used to good effect in movies such as Earthquake but it was always
considered as an option as far as film processing was concerned and the use of a sub
loudspeaker in domestic formats is mostly to allow reduction in size of the main
loudspeakers by filtering all signals below 120 Hz and recombining them in a single
channel. With large cinema systems the sub channel has come into its own and the advent of
Dolby SR and SRD has allowed much more use of dynamic range and high energy at low
frequencies.
With the advent of digital, discrete multi channel recording all of the limitations of Dolby
Stereo and Surround were removed at a stroke and yet there was an important need to
maintain backwards compatibility between the thousands of existing theatres with Dolby
equipment and a new generation with digital playback systems.
This was achieved quite neatly at the projection end of the chain by squeezing the digital
code into the space between the film perforations, leaving the optical track in its original
form. The total flexibility and compatibility of this system means that one print can be used
in any modern theatre. There is the added bonus that the analogue track automatically cuts in
if the digital track drops out. Although DVD will be used for other formats it will be heavily
influenced by the precedent for Dolby Digital (AC3) so I make no apologies for harping on
about the film format most common and accepted by the industry.
That said I do recognise the potential in DVD for more linear audio formats, which do have
potential for much higher fidelity that AC3 but that is another story all together.
Monitoring Format
The essential question at the moment is how do the smaller control room environments used
for DVD and TV mixing compare with the traditional film dubbing theatres, which have
been mixing 5.1 for several years?
To judge this it is important to recognise the relationship between the direct and reverberant
sound in each room.
The person judging the sound balance and quality must hear the correct blend of direct
energy from each loudspeaker and a pre determined amount of room reflection and
reverberant energy. The ratio between the two can be controlled by the following factors
¦ Loudspeaker directivity over the required frequency range
¦ Distance from the loudspeaker to the listening position
¦ Room volume and geometry
¦ Room acoustic and absorption
Munro Associates 4 January, 2003
3 of 9
It is by manipulating these factors that it possible to obtain a balance in a smaller room of
say 50 sq. metres, which is similar to a much larger theatre with a highly directional horn, at
least in the speech frequency band, which is the most important from an intelligibility point
of view.
To take these factors in reverse order, acoustics and room size are largely defined by
reverberation time, which varies directly with volume. The big question is always how can a
small control room with a reverberation time of 0.1s compare with a theatre of RT ten times
longer? It can’t and yet there is a distinct relationship, which if carefully controlled will
yield acceptable results. The key to this is the ratio between the direct sound of the
loudspeaker system and the reflected energy from the room, called logically, the Direct to
Reverberant Ratio.
The distance from the loudspeaker system to the mix position will greatly affect the balance
of direct and reflected energy that the engineer will hear. The point of equal balance
between the two is called the critical distance of which more later.
Loudspeaker directivity is the ratio of direct sound from a loudspeaker to the equivalent
sound output of an omni-directional (point source) loudspeaker. In some ways it is a
necessary consequence of the other variables and is often the most important element in
balancing front and rear loudspeakers in a 5.1 system.
All these factors can be tabulated as in Figure 1, which shows the data for a typical film
dubbing theatre, which I will use later to compare with a smaller room and different types of
loudspeaker.
Munro Associates 4 January, 2003
4 of 9
Fig 1
L W H Lw Q horn Q dome D S V a T60 LpH LpD
20 10 5 0 16 4 2.0 700 1000 0.4 0.50 -5 -10
20 10 5 0 16 4 4.0 700 1000 0.4 0.50 -10 -15
20 10 5 0 16 4 6.0 700 1000 0.4 0.50 -13 -17
20 10 5 0 16 4 8.0 700 1000 0.4 0.50 -15 -18
20 10 5 0 16 4 10.0 700 1000 0.4 0.50 -16 -19
20 10 5 0 16 4 12.0 700 1000 0.4 0.50 -17 -19
20 10 5 0 16 4 14.0 700 1000 0.4 0.50 -18 -19
20 10 5 0 16 4 16.0 700 1000 0.4 0.50 -18 -20
20 10 5 0 16 4 18.0 700 1000 0.4 0.50 -19 -20
20 10 5 0 16 4 20.0 700 1000 0.4 0.50 -19 -20
KEY
L length S room surface area
W width V room volume
H height a average absorption
Lw sound power of source speaker Sa total absorption
Q directivity index T60 reverb decay time
D distance to point of measurement Lp sound pressure at D
EPR electronic power required (amplifier) Dc critical distance Ld=Lr
Horn and Dome Selection
-25
-20
-15
-10
-5
0
2.0 4.0 6.0 8.0 10.0 12.0 14.0 16.0 18.0 20.0
Distance D
Spl Lp
LpH
LpD
The graph shows the sound level produced by a horn-loaded loudspeaker and a direct
radiating loudspeaker in the same theatre at varying distance from the screen. It can be seen
that although the horn produces a much higher sound level over the first 10 metres, when we
measure at a distance of 16 metres the levels are almost the same. This is because we are
well into the reverberant sound field and the direct sound no longer has much influence,
except that it arrives at the ear first and this why we can still perceive some kind of stereo
image, even when the sound is dominated by the room itself. The distance from the
loudspeaker at which the direct and reverberant sound levels are equal is called the critical
Munro Associates 4 January, 2003
5 of 9
distance, Dc, and in a large theatre it is important to design the acoustics and the sound
system so that all of the audience is beyond the critical distance and yet still within the
coverage angle of the loudspeakers. Ignorance of this fact accounts for the large number of
complaints received by cinema operators of the ‘too loud’, ‘couldn’t understand what he was
saying’ variety. Many small theatres and studios have the wrong type of speaker for the
volume of space and the audience size and less directional speakers would give a much more
‘cinematic’ experience.
It is important to realise that acoustically ‘dead’ rooms are extremely bad news when it
comes to even sound coverage as the front seats will be blasted and the people at back
drowning all around them with the crunching of pop corn! That is why concert halls are
designed to squeeze every last drop of energy out of the woodwork; it makes every seat
worth sitting in.The Grand Unified Theory of Sound
There is a general relationship for all these parameters, which has been defined over the
years in the form
Lp=Lw+10log{(Q/4?D2)+4/R)}
This formula is less daunting that it looks; Lp is the sound pressure at the listening distance
D, Lw is the sound power radiated by the loudspeaker (which is room independent).
Q, as already stated, is the directivity of the loudspeaker, 1 being omni directional and 30 a
very narrow mid range horn. R is the room absorption, which can be measured or designed
according to the required reverberation time.
Lets start by looking at a known quantity, a major league film dubbing theatre in downtown
Hollywood or even downtown Shepperton (as Fig. 1) and compare it with a small and
medium sized control room in Soho (Fig. 2).
We can say from prior knowledge and general consent that the reverb time of the larger
room should be around 0.5 seconds at 1000 Hz and the other rooms 0.23 and
Munro Associates 4 January, 2003
6 of 9
Fig 2
KEY
L length S room surface area
W width V room volume
H height a average absorption
Lw sound power of source speakSear total absorption
Q directivity index T60 reverb decay time
D distance to point of measuremLpent sound pressure at D
EPR electronic power required (amDpclifier) critical distance Ld=Lr
Room Speaker L W H Horn Q D S V a T60 Dc D/R ratio
Theatre A 20 10 5 nearfield 4 1.0 700 1000 0.37 0.50 4.54 15.1
Small A 7 5 3 nearfield 4 1.0 142 105 0.4 0.23 2.13 8.8
Theatre A 20 10 5 nearfield 4 3.0 700 1000 0.37 0.50 4.54 5.6
Medium B 10 7 3 dome 4 3.0 242 210 0.3 0.39 2.40 -0.4
Small A 7 5 3 nearfield 4 3.0 142 105 0.4 0.23 2.13 -0.8
Theatre C 20 10 5 90*180 8 15.0 700 1000 0.37 0.50 6.42 -5.4
Theatre D 20 10 5 90*90 16 15.0 700 1000 0.37 0.50 9.08 -2.4
Theatre E 20 10 5 90*40 32 15.0 700 1000 0.37 0.50 12.84 0.7
Theatre F 20 10 5 40*40 64 15.0 700 1000 0.37 0.50 18.16 3.7
Fig. 2 shows the effect of different loudspeakers at different distances, in each type of room,
on the all-important direct to reverberant ratio. It can be seen that in a large theatre a high Q
horn will give a positive D/R at 15m as will a near field monitor at 3m. The same near field
at 1m will give virtually anechoic monitoring!
It is also the case that near field monitors in small rooms need to be as un-directional as
possible if it is desired to emulate a theatrical sound field.
This is particularly true in the reproduction of sound in 5.1 formats. The object of the
exercise is to recreate a realistic sound field based on what we perceive as natural. That
means discrete visually interactive sounds from the front channels and indirect or ambient
sound from the rear, with the occasional directional effect to give interest to the mix.
This is a far cry from some of the audio-only 5.1 mixes I have heard which could cause
severe neck ache, not to mention a low irritation threshold once the novelty of the lead
guitarist hanging on the back wall begins to wear off. I am reminded of the early days of
stereo LPs, when ping-ponging sounds and trains panning across the living room were the
order of the day.
The most pleasing results will occur when the room (monitoring or listening) is not too dead
and the loudspeakers are very open and actually given some space to radiate before early
reflections contrive to destroy the spatial images of the mix.
I have noted that multi channel digital sound has created a demand for mixing theatres which
are if anything more live that the equivalent old analogue rooms.
I can only explain this in terms of naturalness and adherence to audio fidelity, which I think,
bodes very well for the future.
Many people are unsure about the best way to set out loudspeakers for DVD mastering.
Munro Associates 4 January, 2003
7 of 9
I can only give a clear answer in the context of mixing to picture and for this format I
strongly recommend the layout shown by Dolby, in their excellent literature.
As DVD will use the Dolby Digital format for most applications it is logical to set up ones
room and monitoring system as they recommend. Broadcast engineers have also adopted the
ITU-R specification and it is used for critical listening rooms. This internationally
recognised document is full of obscure references to methods of appraising audio but is
recommended for its depth and weight! The new AES document on the same subject tends
to go with the flow rather than challenge any of the existing preconceptions!
The front three loudspeakers should be positioned at an equal distance from the mix position
with an angle of 60 degrees subtended by the left and right. All three should be identical
models with a matched response within +/- 3dB in any 1/3 octave band between 250 Hz and
2 kHz, measured at the mix position. The limits can be slightly relaxed above and below
this range but for mastering purposes this would not be advisable. THX recommend an angle
of 45 degrees for mixing to picture, to reduce the danger of off axis phasing and to keep
panned action from falling off the picture!
The surround loudspeakers should be positioned at the same distance to the mix position as
the main front loudspeakers, at an angle of 110 degrees from the centre line. If this is not
possible then a digital delay should be used to bring the loudspeakers back into coincidence.
Ideally the surround units will be the same as the front ones but if that is not possible they
should be of the same manufacturer using identical driver technology. The polarity and
phase summing of each loudspeaker should be such that any two together will produce an
increase in sound level of between 3 and 6 dB at any frequency up to 10 kHz. Above that
frequency it is difficult to place the microphone so precisely that phase coincidence is
perfect (but many purists will insist on trying).
Several differences to this set up must be considered in the context of film sound
¦ The larger rooms used for film mixing dictate that several pairs of surround loudspeakers
are used, to ensure that everyone in the room hears the same delay and direct to reverb ratio.
The use of time delay and separate power amplifiers is recommended to achieve consistent
results. THX maintain strict criteria for surround loudspeaker placement in order to achieve
world wide compatibility between theatres and dubbing stages;
¦ The surround loudspeakers must often be placed at high level and angled to cover a large
area at constant level.
¦ The surround loudspeakers are rarely the same as the front ones, which are invariably
large, horn, loaded systems of considerable bulk.
¦ The angle subtended by the main left and right loudspeakers should be 45 degrees at the
mixing console. This assumes the loudspeakers are at the extremes of the screen, which begs
the question, ‘how do we adjust for different film aspect ratios?’ One solution is to use 5
loudspeakers at the front, the inner pair used for TV formats and the outer pair for wide
screen ratios. This is also a cue for 7.1 SDDS system compatibility and one step away from
top and bottom IMAX arrangements. The most speakers I ever placed in a front wall were
14, for a rock and roll movie- the earth moved!
Munro Associates 4 January, 2003
8 of 9
This apart it should be possible to maintain compatibility between the two formats so long as
good use is made of low directivity loudspeakers in small, dry rooms.
The LFE Channel
In a true 5.1 system the .1 is a discrete low frequency channel, which is used by the mixing
engineer to create special effects and to extend the normal frequency range of music and
other mixed events. The channel can also be used as a sub woofer system to accept a feed of
bass from the main loudspeaker channels whose loudspeakers may be too small or
unsuitable to reproduce high levels of very low frequency material. This mode is created in
the decoder of the playback system and is a user definable mode, not one which would
normally be considered in the mixing or mastering room, except for comparative use or plain
curiosity.
The mixing encoder channel contains a steep filter at 120 Hz, which prevents higher
frequencies leaking into the LFE track. Dolby recommend an 80 Hz filter in the console
channels to make sure nothing is even heard (and therefore localised by the final audience).
The LFE channel loudspeaker is set at a level 10 dB higher than the other channels in the
mixing theatre so it is important to do the same in any replay situation. This should be done
with band limited pink noise to smooth out localised pressure wave variations.
Most music mixing engineers do not send discrete material to the LFE channel as there is
enough mutual coupling of the main loudspeakers to produce more than enough low
frequency energy; in fact a well matched system will produce up to 10dB more bass than a
single loudspeaker and care must be taken to balance the mix accordingly. The LFE channel
would obviously be useful for extended effects such as cannons and organ pipes and music
reproduced down to 20 Hz or less will no doubt become more interesting as skills and
production values improve. We may even see an improvement in pop music mixes as
engineers (?) finally realise that small, closed box woofers, rolling off at 70 Hz do the
serious audiophile no favours at all.
More Acoustics
Several people have asked me how to ensure that all five loudspeakers sound the same and
the answer is basically down to a few key factors
¦ The room must be symmetrical about the centre loudspeaker axis, with windows and
doors placed so as to steer sound reflections away from the mix position.
¦ Early reflections (within 15ms of the direct sound) should have amplitude of no more
that 10dB below the direct sound, especially in any band between 250 Hz and 2 kHz.
Munro Associates 4 January, 2003
9 of 9
¦ Each loudspeaker should be placed at least 1 metre from any wall and not equidistant to
two walls (or they should be completely flush with the wall for large soffit mounting
systems).
¦ The room acoustic should be as diffused as possible with a reverberation time of
(0.25(V/100)^1/3) +/- .05s. In other words the reference value is 0.25 seconds for a 100m^3
room, increasing with room size. My favourite spaces are often about this value so I would
argue that the ISO/MPEG/SMPTE guys have got it about right.
¦ Room proportions should not be vastly different but never the same and cubic is a
disaster. Room modes will always dominate the low frequency performance, closely
followed by first reflections from floor and ceiling. 7m by 5m by 3m is a good starting point
for mastering rooms
¦ Background noise should be judged according to material and the final medium. Film
and pop music are often mixed in rooms with a noise floor of NR30 because a mix which is
too wide in dynamic range will cause problems on playback (noisy theatres, car radios,
Walkman in the street etc. A critical listening room needs to be very quiet so as to judge the
limits of other systems’ dynamics and NR10 to NR15 might be called for in some
circumstances. A good compromise is NR25 for most mixing rooms but ventilation systems
must be carefully matched in noise spectrum to create a masking effect.
Final Take
DVD is a new an exciting medium with huge potential for extending audio and video quality
beyond anything the average consumer has experienced before. The industry is suffering
greatly at the moment because indecision and format wars are diluting the message for
higher levels of audio performance. The music industry in particular has not grasped DVD
as a medium to be promoted as an art form. Film producers have embraced the new video
technology but where are the new pioneers of sound?
http://cineramax.com/images/Dynaudio5_1.jpg EDITED: THIS ROOM'S DESIGN IS BY ANOTHER 3rd party (NOT MUNRO - the speaker supplier).
Rooms with a View
Andy Munro
Munro Associates
Sound and pictures have always been willing bedfellows but the road to the current, exalted
state of both has been both rocky and fraught with argument.
In the 50s the cinema was the only place the public could hear reproduced sound with any
realistic quality. Magnetic sound recordings transferred to optical film offered the best
quality available with optical heads (RCA galvanometers) capable of 1/3 harmonic distortion
of less than 0.5% at 100% light modulation and a frequency response from DC to 10 KHz (-
3dB), a creditable performance even by today’s standards. The first standard for optical
sound tracks was published in 1947 and the SMPTE 35mm standard was finally approved in
1967. Before that, every film company and manufacturer vied for superiority with many
formats and patents fighting for commercial acceptance.
By this time cinema had serious rivals, for Hi Fi equipment had become a mass consumer
product and television could produce a sense of reality from immediacy and live
entertainment, if not high-resolution transmission. Film fought back with a concerted effort
to raise standards to even higher levels and electronics came to the rescue with signal
processing and power amplification which could turn a visit to the larger theatres into a
spectacular experience. This created the need for specialist companies that could provide the
technical skills to turn great production quality into suitably impressive cinematic
presentation.
Dolby Labs in San Francisco created the modern form of surround sound in the mid 70s. The
trick was to use the optical sound track space on 35mm film to fit an encoded, analogue,
stereo signal which could then be processed to produce a centre front channel, left and right
stereo, plus a discrete sub bass channel and a ‘surround’ channel.
Several benefits were derived from the new format, which apply equally well to current
multi channel modes
¦ The centre channel contained all the dialogue and those on screen effects, which need
maximum spatial stability. This is the equivalent of the original mono optical track and the
Dolby Stereo track is mono compatible to this day. The single sound source ensures that
there is no phasing or combing of the dialogue, off axis of the centre line of the theatre,
which would be detrimental to intelligibility and naturalness of reproduction. The centre
channel is the absolute reference source and all other channels must be phase and amplitude
aligned to it.
¦ The left and right channels reproduce the music and sound effects in such a way as to
create a sense of space and distance, which matches the on screen action as closely as
possible. Some effects are also fed to the centre channel for maximum impact. Occasionally
dialogue will be panned into the stereo ‘picture’, to create the effect of movement or drama.
Munro Associates 4 January, 2003
2 of 9
¦ The surround channel was, and is, used for off screen sound effects and atmosphere, such
as wind and rain, spooky echoes, etc. etc. The bandwidth of the surround channel was
limited to 7 kHz for improved noise and distortion performance but with 5.1 formatting this
is not the case.
¦ The sub channel was used to good effect in movies such as Earthquake but it was always
considered as an option as far as film processing was concerned and the use of a sub
loudspeaker in domestic formats is mostly to allow reduction in size of the main
loudspeakers by filtering all signals below 120 Hz and recombining them in a single
channel. With large cinema systems the sub channel has come into its own and the advent of
Dolby SR and SRD has allowed much more use of dynamic range and high energy at low
frequencies.
With the advent of digital, discrete multi channel recording all of the limitations of Dolby
Stereo and Surround were removed at a stroke and yet there was an important need to
maintain backwards compatibility between the thousands of existing theatres with Dolby
equipment and a new generation with digital playback systems.
This was achieved quite neatly at the projection end of the chain by squeezing the digital
code into the space between the film perforations, leaving the optical track in its original
form. The total flexibility and compatibility of this system means that one print can be used
in any modern theatre. There is the added bonus that the analogue track automatically cuts in
if the digital track drops out. Although DVD will be used for other formats it will be heavily
influenced by the precedent for Dolby Digital (AC3) so I make no apologies for harping on
about the film format most common and accepted by the industry.
That said I do recognise the potential in DVD for more linear audio formats, which do have
potential for much higher fidelity that AC3 but that is another story all together.
Monitoring Format
The essential question at the moment is how do the smaller control room environments used
for DVD and TV mixing compare with the traditional film dubbing theatres, which have
been mixing 5.1 for several years?
To judge this it is important to recognise the relationship between the direct and reverberant
sound in each room.
The person judging the sound balance and quality must hear the correct blend of direct
energy from each loudspeaker and a pre determined amount of room reflection and
reverberant energy. The ratio between the two can be controlled by the following factors
¦ Loudspeaker directivity over the required frequency range
¦ Distance from the loudspeaker to the listening position
¦ Room volume and geometry
¦ Room acoustic and absorption
Munro Associates 4 January, 2003
3 of 9
It is by manipulating these factors that it possible to obtain a balance in a smaller room of
say 50 sq. metres, which is similar to a much larger theatre with a highly directional horn, at
least in the speech frequency band, which is the most important from an intelligibility point
of view.
To take these factors in reverse order, acoustics and room size are largely defined by
reverberation time, which varies directly with volume. The big question is always how can a
small control room with a reverberation time of 0.1s compare with a theatre of RT ten times
longer? It can’t and yet there is a distinct relationship, which if carefully controlled will
yield acceptable results. The key to this is the ratio between the direct sound of the
loudspeaker system and the reflected energy from the room, called logically, the Direct to
Reverberant Ratio.
The distance from the loudspeaker system to the mix position will greatly affect the balance
of direct and reflected energy that the engineer will hear. The point of equal balance
between the two is called the critical distance of which more later.
Loudspeaker directivity is the ratio of direct sound from a loudspeaker to the equivalent
sound output of an omni-directional (point source) loudspeaker. In some ways it is a
necessary consequence of the other variables and is often the most important element in
balancing front and rear loudspeakers in a 5.1 system.
All these factors can be tabulated as in Figure 1, which shows the data for a typical film
dubbing theatre, which I will use later to compare with a smaller room and different types of
loudspeaker.
Munro Associates 4 January, 2003
4 of 9
Fig 1
L W H Lw Q horn Q dome D S V a T60 LpH LpD
20 10 5 0 16 4 2.0 700 1000 0.4 0.50 -5 -10
20 10 5 0 16 4 4.0 700 1000 0.4 0.50 -10 -15
20 10 5 0 16 4 6.0 700 1000 0.4 0.50 -13 -17
20 10 5 0 16 4 8.0 700 1000 0.4 0.50 -15 -18
20 10 5 0 16 4 10.0 700 1000 0.4 0.50 -16 -19
20 10 5 0 16 4 12.0 700 1000 0.4 0.50 -17 -19
20 10 5 0 16 4 14.0 700 1000 0.4 0.50 -18 -19
20 10 5 0 16 4 16.0 700 1000 0.4 0.50 -18 -20
20 10 5 0 16 4 18.0 700 1000 0.4 0.50 -19 -20
20 10 5 0 16 4 20.0 700 1000 0.4 0.50 -19 -20
KEY
L length S room surface area
W width V room volume
H height a average absorption
Lw sound power of source speaker Sa total absorption
Q directivity index T60 reverb decay time
D distance to point of measurement Lp sound pressure at D
EPR electronic power required (amplifier) Dc critical distance Ld=Lr
Horn and Dome Selection
-25
-20
-15
-10
-5
0
2.0 4.0 6.0 8.0 10.0 12.0 14.0 16.0 18.0 20.0
Distance D
Spl Lp
LpH
LpD
The graph shows the sound level produced by a horn-loaded loudspeaker and a direct
radiating loudspeaker in the same theatre at varying distance from the screen. It can be seen
that although the horn produces a much higher sound level over the first 10 metres, when we
measure at a distance of 16 metres the levels are almost the same. This is because we are
well into the reverberant sound field and the direct sound no longer has much influence,
except that it arrives at the ear first and this why we can still perceive some kind of stereo
image, even when the sound is dominated by the room itself. The distance from the
loudspeaker at which the direct and reverberant sound levels are equal is called the critical
Munro Associates 4 January, 2003
5 of 9
distance, Dc, and in a large theatre it is important to design the acoustics and the sound
system so that all of the audience is beyond the critical distance and yet still within the
coverage angle of the loudspeakers. Ignorance of this fact accounts for the large number of
complaints received by cinema operators of the ‘too loud’, ‘couldn’t understand what he was
saying’ variety. Many small theatres and studios have the wrong type of speaker for the
volume of space and the audience size and less directional speakers would give a much more
‘cinematic’ experience.
It is important to realise that acoustically ‘dead’ rooms are extremely bad news when it
comes to even sound coverage as the front seats will be blasted and the people at back
drowning all around them with the crunching of pop corn! That is why concert halls are
designed to squeeze every last drop of energy out of the woodwork; it makes every seat
worth sitting in.The Grand Unified Theory of Sound
There is a general relationship for all these parameters, which has been defined over the
years in the form
Lp=Lw+10log{(Q/4?D2)+4/R)}
This formula is less daunting that it looks; Lp is the sound pressure at the listening distance
D, Lw is the sound power radiated by the loudspeaker (which is room independent).
Q, as already stated, is the directivity of the loudspeaker, 1 being omni directional and 30 a
very narrow mid range horn. R is the room absorption, which can be measured or designed
according to the required reverberation time.
Lets start by looking at a known quantity, a major league film dubbing theatre in downtown
Hollywood or even downtown Shepperton (as Fig. 1) and compare it with a small and
medium sized control room in Soho (Fig. 2).
We can say from prior knowledge and general consent that the reverb time of the larger
room should be around 0.5 seconds at 1000 Hz and the other rooms 0.23 and
Munro Associates 4 January, 2003
6 of 9
Fig 2
KEY
L length S room surface area
W width V room volume
H height a average absorption
Lw sound power of source speakSear total absorption
Q directivity index T60 reverb decay time
D distance to point of measuremLpent sound pressure at D
EPR electronic power required (amDpclifier) critical distance Ld=Lr
Room Speaker L W H Horn Q D S V a T60 Dc D/R ratio
Theatre A 20 10 5 nearfield 4 1.0 700 1000 0.37 0.50 4.54 15.1
Small A 7 5 3 nearfield 4 1.0 142 105 0.4 0.23 2.13 8.8
Theatre A 20 10 5 nearfield 4 3.0 700 1000 0.37 0.50 4.54 5.6
Medium B 10 7 3 dome 4 3.0 242 210 0.3 0.39 2.40 -0.4
Small A 7 5 3 nearfield 4 3.0 142 105 0.4 0.23 2.13 -0.8
Theatre C 20 10 5 90*180 8 15.0 700 1000 0.37 0.50 6.42 -5.4
Theatre D 20 10 5 90*90 16 15.0 700 1000 0.37 0.50 9.08 -2.4
Theatre E 20 10 5 90*40 32 15.0 700 1000 0.37 0.50 12.84 0.7
Theatre F 20 10 5 40*40 64 15.0 700 1000 0.37 0.50 18.16 3.7
Fig. 2 shows the effect of different loudspeakers at different distances, in each type of room,
on the all-important direct to reverberant ratio. It can be seen that in a large theatre a high Q
horn will give a positive D/R at 15m as will a near field monitor at 3m. The same near field
at 1m will give virtually anechoic monitoring!
It is also the case that near field monitors in small rooms need to be as un-directional as
possible if it is desired to emulate a theatrical sound field.
This is particularly true in the reproduction of sound in 5.1 formats. The object of the
exercise is to recreate a realistic sound field based on what we perceive as natural. That
means discrete visually interactive sounds from the front channels and indirect or ambient
sound from the rear, with the occasional directional effect to give interest to the mix.
This is a far cry from some of the audio-only 5.1 mixes I have heard which could cause
severe neck ache, not to mention a low irritation threshold once the novelty of the lead
guitarist hanging on the back wall begins to wear off. I am reminded of the early days of
stereo LPs, when ping-ponging sounds and trains panning across the living room were the
order of the day.
The most pleasing results will occur when the room (monitoring or listening) is not too dead
and the loudspeakers are very open and actually given some space to radiate before early
reflections contrive to destroy the spatial images of the mix.
I have noted that multi channel digital sound has created a demand for mixing theatres which
are if anything more live that the equivalent old analogue rooms.
I can only explain this in terms of naturalness and adherence to audio fidelity, which I think,
bodes very well for the future.
Many people are unsure about the best way to set out loudspeakers for DVD mastering.
Munro Associates 4 January, 2003
7 of 9
I can only give a clear answer in the context of mixing to picture and for this format I
strongly recommend the layout shown by Dolby, in their excellent literature.
As DVD will use the Dolby Digital format for most applications it is logical to set up ones
room and monitoring system as they recommend. Broadcast engineers have also adopted the
ITU-R specification and it is used for critical listening rooms. This internationally
recognised document is full of obscure references to methods of appraising audio but is
recommended for its depth and weight! The new AES document on the same subject tends
to go with the flow rather than challenge any of the existing preconceptions!
The front three loudspeakers should be positioned at an equal distance from the mix position
with an angle of 60 degrees subtended by the left and right. All three should be identical
models with a matched response within +/- 3dB in any 1/3 octave band between 250 Hz and
2 kHz, measured at the mix position. The limits can be slightly relaxed above and below
this range but for mastering purposes this would not be advisable. THX recommend an angle
of 45 degrees for mixing to picture, to reduce the danger of off axis phasing and to keep
panned action from falling off the picture!
The surround loudspeakers should be positioned at the same distance to the mix position as
the main front loudspeakers, at an angle of 110 degrees from the centre line. If this is not
possible then a digital delay should be used to bring the loudspeakers back into coincidence.
Ideally the surround units will be the same as the front ones but if that is not possible they
should be of the same manufacturer using identical driver technology. The polarity and
phase summing of each loudspeaker should be such that any two together will produce an
increase in sound level of between 3 and 6 dB at any frequency up to 10 kHz. Above that
frequency it is difficult to place the microphone so precisely that phase coincidence is
perfect (but many purists will insist on trying).
Several differences to this set up must be considered in the context of film sound
¦ The larger rooms used for film mixing dictate that several pairs of surround loudspeakers
are used, to ensure that everyone in the room hears the same delay and direct to reverb ratio.
The use of time delay and separate power amplifiers is recommended to achieve consistent
results. THX maintain strict criteria for surround loudspeaker placement in order to achieve
world wide compatibility between theatres and dubbing stages;
¦ The surround loudspeakers must often be placed at high level and angled to cover a large
area at constant level.
¦ The surround loudspeakers are rarely the same as the front ones, which are invariably
large, horn, loaded systems of considerable bulk.
¦ The angle subtended by the main left and right loudspeakers should be 45 degrees at the
mixing console. This assumes the loudspeakers are at the extremes of the screen, which begs
the question, ‘how do we adjust for different film aspect ratios?’ One solution is to use 5
loudspeakers at the front, the inner pair used for TV formats and the outer pair for wide
screen ratios. This is also a cue for 7.1 SDDS system compatibility and one step away from
top and bottom IMAX arrangements. The most speakers I ever placed in a front wall were
14, for a rock and roll movie- the earth moved!
Munro Associates 4 January, 2003
8 of 9
This apart it should be possible to maintain compatibility between the two formats so long as
good use is made of low directivity loudspeakers in small, dry rooms.
The LFE Channel
In a true 5.1 system the .1 is a discrete low frequency channel, which is used by the mixing
engineer to create special effects and to extend the normal frequency range of music and
other mixed events. The channel can also be used as a sub woofer system to accept a feed of
bass from the main loudspeaker channels whose loudspeakers may be too small or
unsuitable to reproduce high levels of very low frequency material. This mode is created in
the decoder of the playback system and is a user definable mode, not one which would
normally be considered in the mixing or mastering room, except for comparative use or plain
curiosity.
The mixing encoder channel contains a steep filter at 120 Hz, which prevents higher
frequencies leaking into the LFE track. Dolby recommend an 80 Hz filter in the console
channels to make sure nothing is even heard (and therefore localised by the final audience).
The LFE channel loudspeaker is set at a level 10 dB higher than the other channels in the
mixing theatre so it is important to do the same in any replay situation. This should be done
with band limited pink noise to smooth out localised pressure wave variations.
Most music mixing engineers do not send discrete material to the LFE channel as there is
enough mutual coupling of the main loudspeakers to produce more than enough low
frequency energy; in fact a well matched system will produce up to 10dB more bass than a
single loudspeaker and care must be taken to balance the mix accordingly. The LFE channel
would obviously be useful for extended effects such as cannons and organ pipes and music
reproduced down to 20 Hz or less will no doubt become more interesting as skills and
production values improve. We may even see an improvement in pop music mixes as
engineers (?) finally realise that small, closed box woofers, rolling off at 70 Hz do the
serious audiophile no favours at all.
More Acoustics
Several people have asked me how to ensure that all five loudspeakers sound the same and
the answer is basically down to a few key factors
¦ The room must be symmetrical about the centre loudspeaker axis, with windows and
doors placed so as to steer sound reflections away from the mix position.
¦ Early reflections (within 15ms of the direct sound) should have amplitude of no more
that 10dB below the direct sound, especially in any band between 250 Hz and 2 kHz.
Munro Associates 4 January, 2003
9 of 9
¦ Each loudspeaker should be placed at least 1 metre from any wall and not equidistant to
two walls (or they should be completely flush with the wall for large soffit mounting
systems).
¦ The room acoustic should be as diffused as possible with a reverberation time of
(0.25(V/100)^1/3) +/- .05s. In other words the reference value is 0.25 seconds for a 100m^3
room, increasing with room size. My favourite spaces are often about this value so I would
argue that the ISO/MPEG/SMPTE guys have got it about right.
¦ Room proportions should not be vastly different but never the same and cubic is a
disaster. Room modes will always dominate the low frequency performance, closely
followed by first reflections from floor and ceiling. 7m by 5m by 3m is a good starting point
for mastering rooms
¦ Background noise should be judged according to material and the final medium. Film
and pop music are often mixed in rooms with a noise floor of NR30 because a mix which is
too wide in dynamic range will cause problems on playback (noisy theatres, car radios,
Walkman in the street etc. A critical listening room needs to be very quiet so as to judge the
limits of other systems’ dynamics and NR10 to NR15 might be called for in some
circumstances. A good compromise is NR25 for most mixing rooms but ventilation systems
must be carefully matched in noise spectrum to create a masking effect.
Final Take
DVD is a new an exciting medium with huge potential for extending audio and video quality
beyond anything the average consumer has experienced before. The industry is suffering
greatly at the moment because indecision and format wars are diluting the message for
higher levels of audio performance. The music industry in particular has not grasped DVD
as a medium to be promoted as an art form. Film producers have embraced the new video
technology but where are the new pioneers of sound?