Detailed Notes – Sound in Multimedia
1. Digital Audio
Definition:
Digital Audio = Sound represented in digital (binary) form using sampling and quantization.
- Analog Sound → Continuous waves.
- Digitization → Capturing sound by measuring its amplitude at regular intervals (Sampling
Rate) and storing each value as a binary number of fixed precision (Bit Depth).
Key Terms:
- Sampling Rate (Fs): Number of samples per second. Measured in Hertz (Hz).
Example: CD Audio uses 44.1 kHz → 44,100 samples/sec.
- Bit Depth (n): Number of bits used to store each sample.
Example: 16-bit allows 65,536 amplitude levels.
- Quantization Error: Difference between original analog signal and digitized signal due to
rounding off.
Advantages:
- High fidelity sound reproduction.
- Easy editing (cut, copy, mix, noise removal).
- Can be compressed (MP3, AAC).
Disadvantages:
- Requires large storage (1 min CD-quality stereo ≈ 10 MB uncompressed).
- Processing requires powerful CPU/GPU in real-time apps.
Applications:
- Podcasts, movies, computer games, music production.
2. MIDI Audio (Musical Instrument Digital Interface)
Definition:
MIDI = Standard protocol (1982) that allows musical instruments, computers, and devices
to communicate using digital messages.
How it works:
- Stores performance instructions (Note ON, Note OFF, Pitch, Velocity, Duration,
Instrument).
- Example message: 90 3C 40 → Play middle C with velocity 64.
File Size:
- Very small (few KB for a song).
Advantages:
- Compact storage.
- Can easily change instruments (e.g., Piano → Violin).
- Ideal for interactive music applications.
Disadvantages:
- No real sound recording, only instructions.
- Output quality depends on playback device’s synthesizer.
Applications:
- Electronic keyboards, music composing software (GarageBand, FL Studio), karaoke tracks.
3. MIDI vs. Digital Audio
Feature Comparison:
MIDI:
- Data Stored: Musical instructions
- File Size: Very small (KBs)
- Flexibility: Instrument & tempo easily changeable
- Real Sound: Cannot record human voice
- Example: .mid files
Digital Audio:
- Data Stored: Actual sound samples
- File Size: Large (MBs)
- Flexibility: Fixed once recorded
- Real Sound: Can record speech, effects, music
- Example: .wav, .mp3
Conclusion:
Use MIDI for music composition where size matters.
Use Digital Audio for natural sounds, speech, and high-quality media.
4. Multimedia System Sounds
Definition:
Short audio feedback given to users in multimedia systems.
Examples:
- “Ding” when new message arrives.
- “Beep” on error.
- “Click” on button press.
Purpose:
- Reinforce actions.
- Improve user experience.
- Provide accessibility (blind users can use sound cues).
Design Guidelines (Vaughan):
- Keep sounds short (<1 sec).
- Avoid distracting background noise.
- Use different tones for different actions.
5. Audio File Formats
Uncompressed Formats:
- WAV (Waveform Audio File Format): Developed by Microsoft/IBM. High quality, large size.
- AIFF (Audio Interchange File Format): Apple’s uncompressed audio.
Compressed (Lossy) Formats:
- MP3 (MPEG-1 Audio Layer 3): Removes inaudible frequencies. Small size, popular in
music.
- AAC (Advanced Audio Coding): Successor to MP3. Better compression. Used in iTunes,
YouTube.
- WMA (Windows Media Audio): Proprietary Microsoft format.
Compressed (Lossless) Formats:
- FLAC (Free Lossless Audio Codec): Open-source, retains original sound quality.
- ALAC (Apple Lossless Audio Codec).
System Sounds:
- .ogg (open-source, used in games & apps).
6. Vaughan’s Law of Multimedia Minimums
Definition (Tay Vaughan):
Multimedia projects must maintain minimum acceptable standards of quality for audio,
video, graphics, and interactivity.
For Audio:
- Minimum CD quality (44.1 kHz, 16-bit, stereo).
- Poor-quality sound can ruin the entire project even if visuals are excellent.
Key Principle: Bad sound is worse than no sound.
Implication in Projects:
- Always use a good microphone.
- Maintain proper sampling rates.
- Balance file size and quality (optimize using MP3/AAC).
7. Adding Sound to Multimedia Project
Process Steps:
1. Planning
- Decide purpose: Narration, Music, Effects, Background Score.
2. Recording
- Use quality microphones, avoid background noise.
- Store in uncompressed format initially (WAV/AIFF).
3. Editing
- Use tools: Audacity, Adobe Audition, GarageBand.
- Apply effects: Normalize, Echo, Noise reduction.
4. Compression
- Convert to MP3/AAC/OGG for smaller file size.
5. Integration
- Sync with animation or video.
- Example: Explosion sound with video frame.
6. Testing
- Check balance between narration and background music.
- Ensure cross-platform compatibility.
Best Practices (Vaughan):
- Narration should be clear and at front.
- Music should complement, not overpower.
- System sounds should be functional, not decorative.
Fig 1: Analog to Digital Conversion (Sampling)
Fig 2: MIDI Workflow
Fig 3: Adding Sound in Multimedia Project (Workflow)