Soundfield USA

SoundField and B Format

To understand SoundField B Format technology, it's best to examine the core issues Michael Gerzon attempted to resolve in creating the SoundField system. Those issues are: capturing 3 dimensional acoustical information and "point source" technology.

We'll start with the 3 dimensional capture issues:

What the heck is B Format anyway?
To understand B Format, its first necessary to understand that sound is a three-dimensional event. A way to envision the concept of three dimensions is to think of a photograph of a cloud laying flat on a table. You look down on the photograph, and you see depth and width of the cloud, a two dimensional representation of a cloud. While looking at the photo, you know it represents a cloud, but it cannot be confused for a real cloud because we know a real cloud has height as well as depth and width. No matter how good the photo is, no matter how high the resolution, it cannot be confused for the real thing. The missing information required to define the "real" cloud is the third dimensional component- the vertical information.

Now imagine a sculpture of a cloud. It has all 3 dimensional components in evidence-we see depth, width and height. If the sculptor was highly skilled, the sculpture might look like a real cloud sitting on the table. We could probably confuse a really accurate sculpture for a real cloud. The same thing is true in audio, that high resolution, two dimensional audio (stereo) can be very good indeed, but it is not even close to describing the real live event.

The "three dimension" concept can be confirmed with your own ears. When you hear a natural acoustical event, such as a bird singing in a tree, you can almost instantly locate the source as "in front of you, to the left and up there". If you heard the event in only two dimensions, you might know the bird was singing in front of you to the left, but you would not know if it was "up there" or "down there". Without the vertical information, you couldn't be sure exactly where the sound was coming from.

B Format compares to stereo just as our cloud story does to sight. Instead of defining the world of sound in just two dimensions, B Format defines it in three. B Format contains information to define front to back (1 dimension), left to right (2 dimensions) and up and down (three dimensions).

There is one more issue to understand B Format and three dimensional descriptions of audio events. It's called the "reference". Imagine that you are standing between two trumpets, both being played at the same time. We have sound emanating in all three dimensions from each trumpet, front to back, left to right and up and down, but we have two different "points of origin" of sound. These trumpets are in different locations when compared to our location, which can be thought of as the reference point to these two sounds. To fully describe to someone else the real event of hearing the two trumpets, we need to describe all three dimensions of the sound of each trumpet and the location of the two trumpets compared to where we were standing (the reference). So we really need to describe four elements, the 3 dimensional components of the trumpets and the reference point from which the trumpets were heard. Any attempt to accurately define the real acoustical event of standing between two trumpets playing at the same time in any less than three dimensions, including a central reference point, will fall short of being accurate.

The cloud and trumpet analogy give us a good understanding of the technical value of B Format and how it works, delivering 4 channels of audio containing the 3 dimensional components plus a reference. A particularly good scientific description of B Format can be found in Ron Streicher's "The New Stereo Soundbook". Ron discusses all types of microphone methods and recording practices. In chapter 13.11, Ron says: "Gerzon's process focused on the acoustical components of the SoundField at a particular point in space: the absolute sound pressure[the reference] and the three pressure gradients [the three dimensions of sound] that specify the cardinal directions-left/right, fore/aft, and up/down. By accurately preserving these four components, all information needed to recreate that point in the SoundField could be recorded and later reproduced precisely". Click here for a link to Ron Streicher and his book.

 

 

To go a little deeper, take a look at the drawing to the right. This drawing shows a graphical representation of B Format and how it "looks". B Format is a sphere, with four elements: an "X" plane (fore and aft or front to back), a "Y" plane (left to right), a "Z" plane (up and down), all with a central reference called "W". SoundField uses B Format as its core technology for documenting and translating the four elements of real acoustical events so they can be recorded on a 4 channel audio recorder. SoundField Microphones use input processing to convert the raw tetrahedral 4 capsule outputs into B Format. Using B Format as the basis for all further processing, X, Y, Z and W components can be manipulated to change any number of parameters, yielding a different microphone pattern, a Mid/Side output, a different "virtual" stereo mike angle (width) or a mono output. The beauty of B Format and SoundField Microphone Systems is that all options derived from B Format for output modes are based on the same reference information, so there are no phase difference issues to contend with. This highlights a key feature of SoundField called "point source" technology.

 More about the B Format "W" Reference and "Point Source"
Earlier we gave you the analogy of a listener standing between two trumpets. To fully describe to someone else the sound of the two trumpets, we needed to describe all three dimensions of the sound of each trumpet and the location of the two trumpets compared to where we were standing (the reference). The reference is a critical issue, for it establishes a single point from which all things are determined. Without that, relative position of the trumpets is unknown and a realistic image cannot be constructed.

When you set up a recording microphone on a stand, its capsule location within the microphone becomes the "reference" from where the audio originates. As long as there is only one microphone picking up audio, you have only one reference point and the microphone is a true single "point source". The minute we set up two microphones, we have two pick-up points, two references (two "W's") and we cease to have a "point source". Anytime you have two different pick-up locations of a single audio event, you introduce error if audio from those two locations is ever mixed together. In addition, as we have no central reference, we do not have a realistic image. Just like not being able to explain to someone else how the trumpets sounded without referring to your own position and the trumpets relative position.

With two microphones, the sound arrives at each microphone at a different time, and are therefore not phase coherent with each other. Unless the sound you are recording is very compact (like a single voice), and the two microphones are perfectly equidistant from the source, you will have problems staying free of phase cancellation and building a realistic image. Even with equidistant mike positioning, reflections from the environment can cause phase cancellation that impacts realism.

A choir or orchestra is a good example that invites error, as many would think to set-up 2, 3 or more microphones to record such as large source. The sound from one "end" of such a large source cannot be prevented from reaching a live microphone at the other "end". There could potentially be a significant distance between the two or three microphones. While this can be avoided by using a single mono microphone, it doesn't help us much when we want a realistic image. The ideal scenario for stereo imaging would be all microphones in the same exact location, (i.e. the same "reference point"), which we know is physically impossible.

Perhaps now you are seeing the value of SoundField point source technology, that only a SoundField presents a "single point source" multi-capsule microphone that delivers stereo. Single point recording solves all kinds of phase related and imaging related problems in audio, for it includes a single reference. In stereo mode, a SoundField provides two "virtual" microphones placed at the same exact location. In mid/side, the same thing is true, as is mono recording.

In multi-channel, B Format components X, Y, Z and W define just about everything we need to know about an acoustical event. Once we have this information, we can pull specific information from B Format to define any number of points within the 3 dimensional B Format sphere for an audio output. This means we can derive any kind of direction related information to yield any kind of multi-channel surround with precise localization.

 Do all SoundField microphones offer B Format?
All SoundField Microphone Systems use B Format internally for the base information from which all output options are derived. Pattern adjustment, stereo angle (width), Mid/Side and mono output forms are all derived from B Format. Actual B Format direct outputs are available on two of the three SoundField Models, the ST250 and the Mark 5. The ST250's B Format outputs are accessed by pushing a button on the front panel and using a special output "Y" cable on each of the two main Left /Right XLR outputs. The Mark 5 has balanced dedicated B Format on the back as well as balanced stereo or mid/side outputs. Only the Mark 5 SoundField Processor has the capability to "remix" B Format in post to change the mike's "position" relative to the reference in an existing B Format recording. The ability to alter the microphone location in post production is perhaps one of the more practical benefits of B Format for those working in stereo. The primary reason one would want to record B Format is if you intend to manipulate B Format later (such as with a Mark 5 Processor), or you intend to do multi-channel decode.

What's the future of B Format with 5.1?
Inexpensive Professional grade B Format to 5.1 surround decoding is a reality with the SP451 5.1 Decoder, and the SurroundZone Plug-in software for Nuendo, SADiE, and ProTools. This SoundField Surround Decoder takes B Format inputs and delivers L, C, R, RR, LR balanced XLR outputs, all with individual level controls and meters. This enables the SoundField to become a true 5.1 microphone. For more information on this, click here to go to the SoundField 5.1 Decoder page. For information on SurroundZone, click here.

Other B Format decoding
B Format as a professional's open ended format is likely to become even more common as multi-channel demand grows. The accuracy, simplicity and flexibility of B Format is unsurpassed by any system we know of for defining 3 dimensional events in the professional environment. Decoding B Format beyond 5.1 is not so far off in the future as it would seem. It is possible to decode B Format right now into 6.1, 7.1, 10.1, 16.1 or even more channels using a sufficiently flexible analog addition/subtraction matrix. Traditional 5 channel (LCR, LR, RR) surround can be defined by using only the W reference and the X &Y components. If we choose to use Z (vertical) information as well, we can derive surround that includes vertical localization.

One example of surround with vertical localization is a B Format to 10.1 system designed by Thomas Chen we saw at AES in San Francisco in 1998. Basically its an "upper" 5 channel and "lower" 5 channel system, offering similar horizontal localization to traditional surround systems but adding vertical components for precise up/down localization. Thomas Chen has written a few pages about this system, which will be posted here on this site in the very near future. He will present this information on B Format/16.1 at AES 99 in NYC. Soon we will also post a Third Party Development page on this site for other ideas and information on multi-channel decode and consumer formats derived from B Format.

Further Reading: www.ambisonic.net

Back to SoundfieldUSA.com Home Page

Top of Page