Sync and Surround Guide
Sync and Surround Guide
Version 9.0
Legal Notices
This guide is copyrighted 2010 by Avid Technology, Inc., (hereafter Avid), with all rights reserved. Under copyright laws, this guide may not be duplicated in whole or in part without the written consent of Avid. 003, 96 I/O, 96i I/O, 192 Digital I/O, 192 I/O, 888|24 I/O, 882|20 I/O, 1622 I/O, 24-Bit ADAT Bridge I/O, AudioSuite, Avid, Avid DNA, Avid Mojo, Avid Unity, Avid Unity ISIS, Avid Xpress, AVoption, Axiom, Beat Detective, Bomb Factory, Bruno, C|24, Command|8, Control|24, D-Command, D-Control, D-Fi, D-fx, D-Show, D-Verb, DAE, Digi 002, DigiBase, DigiDelivery, Digidesign, Digidesign Audio Engine, Digidesign Intelligent Noise Reduction, Digidesign TDM Bus, DigiDrive, DigiRack, DigiTest, DigiTranslator, DINR, DV Toolkit, EditPack, Eleven, EUCON, HD Core, HD Process, Hybrid, Impact, Interplay, LoFi, M-Audio, MachineControl, Maxim, Mbox, MediaComposer, MIDI I/O, MIX, MultiShell, Nitris, OMF, OMF Interchange, PRE, ProControl, Pro Tools M-Powered, Pro Tools, Pro Tools|HD, Pro Tools LE, QuickPunch, Recti-Fi, Reel Tape, Reso, Reverb One, ReVibe, RTAS, Sibelius, Smack!, SoundReplacer, Sound Designer II, Strike, Structure, SYNC HD, SYNC I/O, Synchronic, TL Aggro, TL AutoPan, TL Drum Rehab, TL Everyphase, TL Fauxlder, TL In Tune, TL MasterMeter, TL Metro, TL Space, TL Utilities, Transfuser, Trillium Lane Labs, Vari-Fi, Velvet, X-Form, and XMON are trademarks or registered trademarks of Avid Technology, Inc. Xpand! is Registered in the U.S. Patent and Trademark Office. All other trademarks are the property of their respective owners. Product features, specifications, system requirements, and availability are subject to change without notice. Guide Part Number 9329-65058-00 REV A 9/10
Documentation Feedback
At Avid, we are always looking for ways to improve our documentation. If you have comments, corrections, or suggestions regarding our documentation, email us at techpubs@avid.com.
contents
Contents
iii
iv
chapter 1
Synchronization Concepts
This section is designed to give you a basic understanding of synchronization as it relates to Pro Tools. ditional time code issues that affect how you use synchronization (such as pull-down). Your Avid dealer may be able to offer suggestions about working with synchronization in your studio.
For information on synchronizing Pro Tools software to other devices, refer to the Pro Tools Reference Guide.
If you are using a device that sends or receives time code (such as a video tape recorder or MIDI interface), refer that devices documentation for information on how it generates or receives time code and how to configure its options.
Synchronization Peripherals
Synchronization peripherals include a wide variety of devices, ranging from simple MTC-capable MIDI interfaces, to high precision time code synchronizers capable of handling many professional time code and clock standards used in the music, television, and film industries. Though many devices are capable of providing synchronization, only our SYNC peripherals deliver frame-edge accuracy when synchronizing Pro Tools|HD systems to external time code sources.
Synchronization Requirements
Synchronization requirements and support vary among different Pro Tools hardware systems. This guide notes whenever a feature requires a synchronization peripheral (such as a SYNC HD, SYNC I/O, or equivalent). See the guides for these peripherals for details on their installation and configuration. We strongly recommend that you research your synchronization requirements thoroughly. For example, if you are using Pro Tools for audio post production work for video, consult your video engineer or editor to determine what time code format will be used. Also, there may be ad-
Synchronization Concepts
Aspects of Synchronization
Synchronization in a digital audio workstation has two concepts that need to be independently considered:
Where are we? This is called the Positional Reference. How fast are we going? This is called the Clock Reference.
In the following example, Pro Tools uses time code for positional information, and a clock reference to maintain synchronization. Pro Tools is slaved, using the SYNC peripheral, to a video tape recorder, with Pro Tools and the VTR referenced to the same house video reference source (house sync). Example: Pro Tools slaved, through a SYNC peripheral, to a video tape recorder In this example, house sync is provided to the SYNC peripheral, which provides the master Loop Sync signal to the Pro Tools|HD system through a Pro Tools|HD audio interface. House sync is also provided to the VTR.
1 When you start the video tape, time code (LTC
To synchronize Pro Tools to another device (such as a tape machine or VTR) accurately over an extended period of time, Pro Tools needs to know where the device is and at what speed it is running. Some peripherals can provide only one of these references; for example, a black burst generator provides only a clock reference. Some peripherals, such as the SYNC HD, can provide both.
or VITC) is read from the tape and routed to the SYNC peripheral. The SYNC reads the time code position and continuously forwards the positional reference information down the SYNC serial connection to the Pro Tools card.
2 Pro Tools takes the first time code address it receives, and calculates the sample location in the session that corresponds to the address. Though you can work with Pro Tools using any standard time code format, it calculates internally in sample numbers. 3 Assuming that the time code address corre-
sponds to a sample number that is within the Pro Tools session, Pro Tools converts the time code address to a sample number within the session, and begins playing from that point. The point from which playback starts is the trigger point. The clock reference is used by Pro Tools and the VTR to keep them running at the same speed.
4 At this point, if the video tape is stopped, re-
wound, and started again, the process repeats, based upon a newly calculated trigger point.
tracks, so it has no application when working with audio tape recorders, but it does offer features for post production (such as still frame and slow speed time code address reading). There is also a non-SMPTE form of time code called MIDI Time Code (MTC) that some devices use to send timing information.
Synchronization Concepts
frame) or when the machine is paused. With LTC, the VTR must be running (usually at a minimum speed of about 1/10th normal playback speed) in order to capture a SMPTE time address.
Frame rate is the rate of speed that the time code is ascending through the frames. For example, when someone refers to a time code of 29.97, they usually are referring to using a frame count of 30 frames, but counting each frame at the speed of 29.97 frames per second.
LTC/VITC Auto-Switching
Many synchronizers and devices support automatic switching between LTC and VITC, depending on the speed, to get the best of both worlds (SYNC peripherals support auto-switching). For example, VITC might be used when a VTR is paused, or crawling frame-by-frame, while the synchronizer might automatically switch to LTC when fast-forwarding.
Bi-Phase/Tach
This electronic pulse stream is used by film mag recorders, film editing stations, and film projectors. You can use this format to synchronize Pro Tools if you have a SYNC peripheral. Unlike time code, Bi-Phase/Tach doesnt actually contain absolute location information. It simply supplies speed (based upon the frequency of the pulses) and direction, and therefore, relative position. Since the SYNC unit can count both the speed and direction of the stream of pulses, it can use a Bi-Phase/Tach source to deduce positional information from a starting address point. The difference between Bi-Phase and Tach formats is that Bi-Phase encodes rate and direction on a pair of signals using a format called phase-quadrature, while Tach encodes rate on one signal and direction on the other. For more information on Bi-Phase/Tach, see the SYNC HD Guide or SYNC I/O Guide.
When you work with NTSC video (the standard in North America and Japan), you will generally work with the NTSC color video standard: either 29.97 fps Non-Drop or 29.97 fps Drop frame. If you are working with PAL (the European standard), your frame rate is 25 fps.
Pro Tools supports the following SMPTE frame rates:
Synchronization Concepts
Note that even though time code addresses are skipped in drop frame format, actual frames of video material are not dropped.
At the end of a program that spans precisely one hour of drop frame time code (for example, 1:00:00;00 to 2:00:00;00), exactly one hour of real time has elapsed. Although it sounds complicated, drop frame time code allows broadcasters to rely on time code values when calculating the true length of programs, facilitating accurate program scheduling.
down, or pulling down by 0.1%, the production sound to compensate for the change in speed from film to NTSC video. (See Figure 1 on page 9.)
3:2 Pulldown
A film clip that lasts 1000 seconds consists of 24,000 film frames (pictures). If you want to transfer that film to 1000 seconds worth of NTSC color video, you have to fit 24,000 film frames into 29,970.02997 video frames. If we use the black and white NTSC video standard (30 fps) instead of 29.97 fps, the process of converting film frames to video frames is greatly simplified. Now instead of any fractional frames, we have 24,000 film frames going into 30,000 video frames (60,000 video fields). In the Telecine process (for NTSC color video), each odd film frame is copied to two video fields, and each even film frame is copied to three video fields, creating what is called a 3:2 Pulldown. The speed of the film is also pulled down to 23.976 fps in order to accommodate the slower speed of NTSC color video compared to NTSC black and white video (29.97 fps compared to 30 fps).
Film frames
Analog audio
Telecine simo or simu DAT 29.97 fps Sample rate conversion 48 kHz or A-D-A (analog stage)
11 22 23 34 44
Video fields 1 2 3 Video frames How film frames translate to video fields and frames in a Telecine transfer 4 5
Synchronization Concepts
The following table lists the applications for each of the frame rates and counts shown in Figure 1 on page 9.
Frame Rates and their Applications Frame Rate 25 fps 24 fps 23.976 fps Applications PAL Film, and 24P HD 24P pull down, for NTSC broadcast, and HD digital video 30 frame music production, and 30 frame recording for film Broadcast NTSC video
30 fps
29.97 fps
faster
25 fps
x104.1667% up
(4% pull up)
x96.0% down
(4% pull down)
24 fps
Telecine
30 fps
x100.1% up
(0.1% pull up)
x99.9% down
(0.1% pull down)
x100.1% up
(0.1% pull up)
x99.9% down
(0.1% pull down)
23.976 fps
Telecine
29.97 fps
Using Pro Tools in Pull Up or Pull Down modes requires a SYNC HD, SYNC I/O, or third party synchronizer.
Synchronization Concepts
192 I/O, to record the audio into Pro Tools at the proper sample rate. Or you may choose only to pull Pro Tools up or down on delivery of the audio. Because mistakes in pull factors can be expensive, good communication with other participants in the project is critical. Find out as soon as possible how your task relates to the rest of the project.
with 29.97 Drop Frame or Non-Drop Frame time code. While in Pull Down mode, you can work with your reference video and everything will remain synchronized and run at the proper speed (assuming your system is completely resolved). Once you are ready to lay back your completed project to an audio device running at film speed, deselect Pull Down in the Session Setup window, and from your synchronizer if it is not a SYNC HD or SYNC I/O. Then change your time code frame rate in the Pro Tools session back to 30 fps. Once Pull Down has been deselected, the audio played back from Pro Tools will synchronize perfectly with the edited film. Alternatively, you can pull down the source audio deck while recording audio into Pro Tools, work at 29.97 fps with no Pull-Down selected in Pro Tools, and then switch to 30 Non-Drop frame format, and select Pull Up during the delivery stage. You will have to perform a sample rate conversion on the audio either digitally or by using an analog stage (D-A-D). Only the inputs on the Digital I/O card (on HD I/O, HD MADI, or 192 I/O audio interfaces) feature real-time sample rate conversion.
Most simul-dats or simo-dats already have their audio pulled down, and should not be pulled down further.
Once all the audio has been recorded, and you are locked to a video work print (at video speed), enable Pull Down. If you are using a SYNC HD or SYNC I/O, select Pull Down in the Session Setup window. If your synchronizer is not an Avid product, select Pull Down on the front of your synchronizer, then enable Pull Down in the Pro Tools Session Setup window. At this point, it is highly recommended that you verify whether the video youre working with is striped
10 Sync & Surround Concepts Guide
You will also need to select the Audio Rate Pull Up/Down option in the Session Setup window.
mode, your active sample rate is 44.056 kHz (if the audio was recorded at 44.1 kHz) or 47.952 kHz (if the audio was recorded at 48 kHz).
signed for a final destination of video; to bring this audio back up to film speed you would have to pull up Pro Tools and record to a destination that is not Pulled Up.
You will also need to select the Audio Rate Pull Up/Down option in the Session Setup window.
Synchronization Concepts
11
12
chapter 2
Surround Concepts
Pro Tools HD and Pro Tools with Complete Production Toolkit 2 support multichannel mixing for surround sound. If you are new to surround mixing, read this chapter for an introduction to surround terminology and concepts.
For information on using Pro Tools surround features, refer to the Pro Tools Reference Guide.
Surround Concepts
13
Speaker Channels
Surround Format
1 2 3 4 4
C LR Cinema Stereo Quadraphonic Dolby Surround (Pro Logic) Film (Pro Tools default), for Dolby Digital SMPTE/ITU (Control|24) DTS (ProControl) Dolby Surround EX SDDS HD Standard LCR L R Lr Rr LCRS
5.1
L C R Ls Rs LFE
5.1
L R C LFE Ls Rs
6 7 8 8
Legend: L = Left; R = Right; C = Center, S = surround (mono); Ls = Left Surround; Rs = Right Surround; Lc = Left Center; Rc = Right Center; Cs = Center Surround; Lr = Left Rear; Rr = Right Rear; Lss = Left Side Surround; Rss = Right Side Surround; Lsr = Left Surround Rear; Rsr = Right Surround Rear; LFE = Low Frequency Effects (handled by a sub-woofer or bass management systems)
14
Speaker Layouts
The following two figures illustrate the speaker arrangements of each surround format. Speaker placement for each format is approximate. For proper placement, alignment, and calibration of surround monitoring systems, consult the documentation that came with your speakers and other monitoring equipment. The correct placement of speakers is crucial to accurate monitoring of any mix. This is especially true with multichannel mixing for surround sound.
Mono
Stereo
LCR
LCRS 3+1 for Dolby Surround S Often split to a pair of surround speakers
L L C 5.1 for DTS, Dolby Digital Sub R L C 6.1 for Dolby Surround EX Sub R
Ls
Rs
Ls
Cs
Rs
Surround Concepts
15
Sub
Lss
Rss
Ls
Rs
Lc
Rc
Sub
Ls
Rs
16
Surround Monitoring
In order to monitor your current multichannel mix in a given surround format (such as DTS or Dolby Digital), encoding and decoding equipment for that format is required. Pro Tools does not provide direct support for surround format processing, but encoding/decoding hardware or software is available from Avid and third-party manufacturers. Contact Avid or your dealer for information. Proper speaker placement, studio calibration, additional surround processors, monitors, and a properly calibrated system can lessen the variables that your mix will be subjected to when the audience eventually hears it in the theater, in their home, or elsewhere.
Sub
30
30
110
110
Mixing position
Ls
Rs
Figure 4. Example of speaker arrangement for 5.1 mixing and monitoring (SMPTE specification). Sub placement is variable, while all other speakers should be as close to the suggested angles as possible. Surround Concepts 17
Instructions for calibrating an HD I/O are included in the HD I/O Guide. Instructions for calibrating a 192 I/O audio interface are included with the 192 I/O documentation.
Calibrating Your Studio The first task for surround mixing should be getting your monitor speakers in proper position, alignment and calibration for accurate listening and mixing. Speaker position and alignment will vary according to each installation, and is very specific for different speakers, manufacturers, and for each surround format. Instructions for optimal speaker location should have been provided with the speaker and monitor system, so consult the documentation provided by its manufacturer. Proper speaker calibration ensures that your system plays back at accurately balanced levels. This lets you compensate for any variances in speaker position, alignment, or performance.
18
.1 Formats
The .1 in any multichannel mix format indicates an LFE (Low Frequency Effects) track in the mix. Whenever a decimal is part of a surround format, it indicates a discrete Sub channel in the surround playback system that is intended for playback through a subwoofer speaker. For example, Dolby Digital is a 5.1 format intended for playback through systems with five full-range speakers and one subwoofer speaker. Dolby Surround EX is a 6.1 format, with six full-range speakers and one subwoofer speaker. SDDS is a 7.1 format, with seven full-range speakers and one subwoofer speaker. Sub content will include the LFE channel of a .1 format mix, as well as the effects of any bass management in the playback system. See LFE on page 20.
.0 Formats
The .0 formats (5.0, 6.0, 7.0), while not associated with any specific surround formats, are useful in many mixes as sub-paths. For example, not all tracks will have content that needs to be routed to the LFE channel (the .1 channel). Assigning such tracks to a 5.0 path provides a 360 panner without LFE controls.
For professional results, always use professional-level products (such as the Dolby Model SDU4 Decoder, Model DP564 Decoder, or Dolby Surround Tools plug-in) when mix monitoring. Pro Logic-equipped consumer products have auto-balancing features which, if used for mix monitoring, may contribute to inappropriate mixing decisions
Surround Concepts
19
LFE
When a decimal such as .1 is present in the name of a surround format, it indicates the presence of an LFE channel. The LFE channel (for Low Frequency Effects, also known as Low Frequency Enhancement) refers most often to a specific track used in production (not during playback or decoding). LFE is the .1 component in mix formats such as 5.1 and 7.1. The reason the LFE component is referred to as .1 is that unlike the other five or seven speakers, LFE is not full bandwidth (or full-range). The LFE signal provides a discrete path for low end to the subwoofer, unaffected by bass management (if any). Low frequency effects include such things as thunder, explosions, and other bass effects. LFE is generally used to enhance sound effects in films. There are no set rules about whether or not you must employ an LFE channel.
Sub Content
When surround mixes are played back on systems that include one or more subs, subwoofer content comes from either or both of the following sources: All bass from all channels that is below the threshold of the bass management filter cutoff (if any) and Audio in the LFE channel (if any) A surround sub will play back all the lowest-frequency sounds from all of the other speaker channels, below a fixed frequency threshold. The threshold varies among different surround formats. See Where to Get More Information on Surround Technology on page 21 for a list of surround specification resources. In addition to the frequency cutoffs and other specifications of each surround format, playback system variables can affect sub content. See Surround Playback System Variables on page 22 for more information.
Pro Tools applies no filtering to LFE signals. Some delivery requirements may require filtering for the LFE track. See Mixing Formats and Surround Formats on page 13 for more information.
Divergence
Divergence is a surround mixing control that lets you set panning width. Full divergence results in discrete (or narrow) panning. Lower divergence settings result in progressively less discrete (or wider) panning. With Full (100%) divergence, tracks can be panned exclusively, or discretely, to a single speaker. Sounds panned to a single speaker are only audible in that speaker. When divergence is less than 100%, tracks will be audible in neighboring speakers even when panned directly to a single speaker in the grid. Lower divergence settings result in a progressively wider source signal.
20
Divergence Example
To understand divergence, it can be helpful to imagine the inside of a large movie theater. This is a good example because one of the most challenging playback variables one faces in multichannel mixing is the size of the intended listening environment. Unlike the typical living room, movie theaters are large spaces with speakers placed widely apart. Due to distance, sounds panned discretely to the front right speaker, for example, might be inaudible in the opposite corner. To avoid this problem, variable divergence lets you control the panning width, in order to widen the sound source. This results in signals spreading into adjacent speakers, even when panned 100% to an individual speaker.
Divergence options in Pro Tools are explained in the Pro Tools Reference Guide.
Surround Specifications
Dolby Surround/Pro Logic Dolby Digital Dolby Surround EX DTS SDDS THX
Web Site
www.dolby.com www.dolby.com www.dolby.com www.dtsonline.com www.sony.com www.thx.com
To anticipate the effects of surround encode and decode, as well as potential downmixing, professional surround mixes are monitored through appropriate encoding and decoding processors. Monitor controller systems let engineers hear their mix through different speakers and configurations for reference, as well as compare factors such as different bass management settings and their effect.
Surround Concepts
21
LCRS mix. To account for these anomalies, engineers mixing for Dolby Surround listen through hardware encode and decode processors for reference. Professional mixing and mastering engineers use encoders and decoders to precisely audition the effect of the encoding and decoding process, and make any adjustments necessary.
Some delivery media on which surround formats are distributed may have additional audio constraints, which could influence your work flow in Pro Tools. Dolby Digital audio on a DVD-Video disc, for example, requires a 48 kHz sampling rate. If your Pro Tools session isn't at 48 kHz, you must sample rate convert the audio before encoding it with Dolby Digital for DVD.
Both encoding and decoding, no matter how refined, represent additional processing stages applied to your mix before it reaches its ultimate destination, the audience. For example, because the Dolby Surround algorithm depends heavily on phase relationships, there is always a significant difference in a decoded LCRS output as compared to the original
22
Bass Management
Bass management is a function of the playback system. Bass management refers to a number of processes by which a playback system can control what will be heard through the Sub(woofer) and other channels. The purpose of bass management is to optimize low frequency reproduction and overall frequency response for your specific monitoring system. Bass management provides a way to tune bass response for variables, including room size and shape, the presence (or absence) of a subwoofer, and the frequency range of each speaker.
Filtering
In its simplest form, bass management applies high-pass filtering to the full-range speakers. Frequencies below the filter cutoff are summed with the LFE channel (if present) and then routed to the subwoofer. Different systems may offer additional bass management processes, including: Bass Redirection Mutes or unmutes the low-frequency signals filtered out of the full-range channels (all signals that fall below the bass management filter cutoff frequency). This is useful when the full-range speakers are truly fullrange, capable of accurately reproducing bass without distorting. Bass Extension Allows the bass (however it is derived) to be routed back through the full-range speakers. Though this may seem like a contradiction, its main purpose is to let sub content emanate from all around, rather than only from the subwoofer.
Surround Concepts
23
Avid
2001 Junipero Serra Boulevard Daly City, CA 94014-3886 USA
Product Information
For company and product information, visit us on the web at www.avid.com