Orchestration and Audio Engineering
Orchestration is the craft of arranging music for an ensemble: choosing instruments, voicing chords, doubling lines, balancing dynamics, and writing idiomatically for each player. Audio engineering is the technical discipline of capturing, processing, and reproducing sound. The two converge in modern production, where a film score may be 80 percent sampled and 20 percent live orchestra, with the same engineers managing both microphone choice for a Hollywood scoring stage and library balance in a digital audio workstation.
Orchestration
Standard Orchestra Families
Strings are typically organized into five sections: first violins, second violins, violas, cellos, and double basses, with the ratio 16/14/12/10/8 in a large symphony orchestra. Techniques include divisi (section players split parts), col legno (struck with the wood of the bow) and col legno tratto (drawn with the wood), pizzicato (plucked), spiccato (bouncing bow), sul tasto (over the fingerboard, producing a pale tone), sul ponticello (close to the bridge, producing a glassy edge), ricochet (thrown bow), sul G (forcing a melody onto the G string for a darker color), natural and artificial harmonics, and mute (con sordino) versus senza sordino.
Woodwinds: flutes (with piccolo, alto flute, and bass flute as auxiliaries), oboes (with English horn / cor anglais and the rare heckelphone), clarinets (B flat and A as standards, with E flat sopranino, basset horn, bass clarinet, and contrabass clarinet auxiliaries), and bassoons (with the contrabassoon).
Brass: French horns (typically four in a Romantic orchestra, six to eight in late Romantic and modern), trumpets (B flat, C, D, piccolo), trombones (tenor, bass, and alto), and tuba (with euphonium and baritone in band contexts).
Percussion: timpani (four pedal kettledrums covering roughly E2 to G3 collectively), bass drum, snare drum, crash and suspended cymbals, gong / tam-tam, triangle, tambourine, xylophone, marimba, glockenspiel (orchestra bells), vibraphone, tubular bells (chimes), and a wide field of auxiliary percussion.
Keyboard: piano, celesta, harp, organ, and harpsichord in period contexts.
Voices: soprano, mezzo-soprano, alto, tenor, baritone, bass, and the SATB choir (or larger configurations including divisi, double choir, and SSAATTBB).
Standard Orchestra Sizes
- Chamber orchestra: approximately 20 to 30 players.
- Classical-period orchestra (Haydn, Mozart): 30 to 40.
- Mid-Romantic (Beethoven 9, Brahms): 50 to 70.
- Late Romantic (Mahler, Strauss): 90 to 110, with Mahler’s Symphony No. 8 requiring more than 1,000 performers in its full form (orchestra plus multiple choirs and soloists).
- Modern film orchestra: 65 to 100 typical for major studio score, with split sessions to record larger forces in segments.
Transposing Instruments
The transpositions every score reader needs:
- B flat clarinet: sounds a major second below written. E flat clarinet: sounds a minor third above written.
- A clarinet: sounds a minor third below. Bass clarinet in B flat: sounds an octave plus a major second below.
- English horn: sounds a perfect fifth below written.
- French horn in F: sounds a perfect fifth below written.
- B flat trumpet: sounds a major second below written. C trumpet: at pitch. Piccolo trumpet: sounds an octave above.
- Soprano saxophone B flat: major second below. Alto saxophone E flat: major sixth below. Tenor saxophone B flat: octave plus major second below. Baritone saxophone E flat: octave plus major sixth below.
Practical Ranges
Working ranges (lowest practical to highest comfortable):
- Violin: G3 to A7 and above.
- Viola: C3 to E6.
- Cello: C2 to E6.
- Double bass: E1 to G4 (sounds an octave below written).
- Flute: C4 to D7. Piccolo: D5 to C8, sounding an octave above. Alto flute: G3 to G6 (sounds a perfect fourth below).
- Oboe: B flat 3 to A6. Cor anglais: E3 to B flat 5 (sounding).
- Clarinet B flat: D3 to B flat 6 (sounding). Bass clarinet: E flat 1 to G5 (sounding).
- Bassoon: B flat 1 to E flat 5. Contrabassoon: B flat 0 to B flat 4.
- French horn in F: F2 to F5 and above (the high pedal F is achievable by virtuosos).
- Trumpet: F sharp 3 to D6.
- Trombone: E2 to F5. Bass trombone: B1 to D5.
- Tuba: D1 to G4.
Voice ranges (typical, working singer):
- Soprano: C4 to A5 plus.
- Mezzo-soprano: A3 to F5.
- Alto / contralto: F3 to D5.
- Tenor: C3 to A4 plus (the high C tenor C5 is operatic).
- Baritone: G2 to F4.
- Bass: E2 to D4 (or lower for basso profondo).
Articulations
Strings: legato, staccato, staccatissimo, marcato, accent (sforzando), tenuto, portato (slurred staccato), spiccato (bouncing), ricochet (multiple notes per bow), jete (thrown), col legno battuto (struck with wood), col legno tratto (drawn with wood), bowed tremolo (rapid bow), measured tremolo, fingered tremolo.
Winds and brass: flutter-tongue, double-tonguing (T-K-T-K), triple-tonguing (T-T-K or T-K-T), pedal tones (brass), multiphonics (especially on saxophone and clarinet), flatterzunge, growl (jazz brass), smear, scoop.
Voicing and Balance
Four-part voice leading rules (taught in Bach chorale exercises and applicable to orchestration):
- No parallel perfect fifths or octaves between any two voices.
- Resolve leading tones up by step in the outer voice.
- Avoid voice crossing without specific intent.
- Keep each voice in its comfortable range.
Doubling: strings and woodwinds in octaves give weight without losing clarity; horns can fortify a melody warmly; brass fortify a tutti; the bass line is the structural foundation and must always be present unless deliberately omitted.
Density: the audible difference between a solo flute and tutti unison brass is much larger than the difference in sound pressure level alone, because register and instrument timbre also drive perceived presence.
Harmonic balance: open voicings (wide spacing between adjacent chord tones) sound spacious; closed voicings sound compact and integrated. Spread chords through the orchestra by family. The chord root is essential to keep in the bass register or risk losing harmonic clarity.
Famous Treatises
- Hector Berlioz, Grand traite d’instrumentation et d’orchestration modernes (1843, revised 1855 by Berlioz with Richard Strauss editing a major 1904 edition).
- Nikolai Rimsky-Korsakov, Principles of Orchestration (published posthumously 1913).
- Walter Piston, Orchestration (1955).
- Cecil Forsyth, Orchestration (1914, second edition 1935).
- Kent Wheeler Kennan, The Technique of Orchestration (1952, multiple editions).
- Alfredo Casella, La tecnica dell’orchestra contemporanea (1950).
- Gardner Read, Orchestral Communication (1969) and Orchestral Technique (1969).
- Samuel Adler, The Study of Orchestration (1989, fourth edition 2016): the modern American standard textbook.
Hollywood Orchestrators and Composers
John Williams orchestrates much of his own work, with assistance from John Neufeld, Conrad Pope, and others. Hans Zimmer’s collaborative model at Remote Control Productions in Santa Monica involves composers Lorne Balfe, Steve Mazzaro, Benjamin Wallfisch, Henry Jackman, Tom Holkenborg (“Junkie XL”), Brian Tyler, and others, with extensive use of orchestrator-arrangers such as Conrad Pope and Edward Trybek.
Other major film composers: Michael Giacchino (The Incredibles, Up, Star Trek, Coco, The Batman), James Newton Howard (Fantastic Beasts, The Hunger Games, frequent Shyamalan collaborator), Howard Shore (Lord of the Rings), Ludwig Goransson (Black Panther, Oppenheimer, The Mandalorian; Oppenheimer Oscar 2024), Hildur Gudnadottir (Joker Oscar 2020, Chernobyl Emmy, Tar). Trevor Morris in television (The Tudors, Vikings).
Sample Libraries
Modern composers routinely “mock up” scores in MIDI using sample libraries that have grown to multi-hundred-gigabyte instruments:
- Spitfire Audio BBC Symphony Orchestra (multiple sizes), Spitfire Symphony Orchestra, Spitfire Studio Strings.
- Vienna Symphonic Library (the long-running Austrian library).
- Hollywood Orchestra by EastWest.
- Cinematic Studio Series by Alex Wallbank (Cinematic Studio Strings, Brass, Winds, Solo Strings).
- Heavyocity percussion and synth.
- Native Instruments Symphony Series (Action Strings, Brass).
- Audio Imperia Areia, Talos, Nucleus.
- Orchestral Tools Berlin Series (Strings, Woodwinds, Brass).
- ProjectSAM Symphobia, True Strike.
- Embertone solo instruments (Friedlander Violin).
- 8Dio Adagio Strings.
- Cinesamples and Strezov Sampling Storm Choir.
The live-versus-sampled debate has shifted as libraries improved through the 2010s. Mid-budget films, television, video games, and trailer music are predominantly sampled; tentpole studio releases still use live orchestra (London Symphony Orchestra, Hollywood Studio Symphony, Vienna RSO, Nashville Scoring Orchestra, Macedonian Radio Symphony Orchestra) for the perceived emotional authenticity and the marketing value.
Audio Engineering
Recording Chain
The recording signal chain runs source → microphone → microphone preamp → analog-to-digital conversion → digital storage. Each stage adds character.
Microphone preamps add gain, color, and sometimes deliberate nonlinearity:
- Neve 1073, 1081, 88R: the classic British color preamp, with transformer-coupled inputs and outputs.
- API 312, 512, 3124: the American answer, brighter and more transient-quick.
- Solid State Logic SSL: clean preamps in the 4000, 9000, and modern consoles.
- Avalon U5, VT-737sp: smooth tube and solid-state designs popular in vocal chains.
- Universal Audio 6176 (tube preamp with 1176 compressor) and 710 Twin-Finity (tube/solid hybrid).
- Manley Voxbox: high-end vocal chain.
- Grace Designs M101, M201, M501: transparent boutique designs.
- Millennia HV-3D: ultra-transparent reference preamp.
- Great River MP-2NV: Neve-inspired discrete.
- Universal Audio Apollo X: built-in interface preamps with Unison technology emulating classic preamps.
Analog-to-Digital Conversion
Bit depth and sample rate: 16-bit / 44.1 kHz is CD quality; 24-bit / 48 kHz is the production standard; 24-bit / 96 kHz and 32-bit float are common in high-end work. Integer versus floating-point: 32-bit float file formats prevent intersample overload but require conversion to integer for many distribution paths.
Major converter chipsets: ESS Technology Sabre (the leading high-end DAC chip family), AKM AK4499, Cirrus Logic, and Burr-Brown (Texas Instruments).
Converter manufacturers:
- Lavry Engineering (Black and Gold series).
- Apogee Symphony I/O Mk II and Mk III, Apogee Element.
- Universal Audio Apollo X16D, X16, X8P, X4.
- RME ADI-2 Pro FS, Fireface UFX III, UCX II, MADIface XT.
- Antelope Audio Orion Studio Synergy Core.
- Prism Sound Atlas, Lyra.
- Solid State Logic SSL Big SiX, SSL 2, SSL 12, SSL 18.
- MOTU 16A, 8M.
- Audient EVO 16, iD44, ASP.
- Focusrite Scarlett (entry to mid), RedNet (Dante networked).
Clock jitter (timing instability) was historically a major concern for sound quality; modern converters with femtosecond clocks have effectively neutralized jitter audibility for properly synchronized systems.
Studio Monitors
Near-field studio monitor reference standards:
- Genelec 8030C, 8330A SAM, 8341A “The Ones” coaxial: Finnish; the de facto modern reference for transparency and waveguide design.
- Neumann KH 120 II, KH 150 (with built-in DSP and MA1 alignment).
- Adam Audio A7V, A4V, A77H (with the ribbon X-ART tweeter).
- Yamaha NS-10 (legacy reference, deliberately mid-forward to expose mix problems; production discontinued 2001 but used widely).
- Auratone 5C (“the cube”): single-driver mono check reference, recently reissued.
- Focal Solo6, Twin6, Trio11.
- PMC twotwo.6, twotwo.8, Result6.
- ATC SCM25A, SCM45A, 50ASL.
- Eve Audio SC205, SC207.
- KRK Rokit 5, 7, 8 (entry budget).
Mid-field and main monitors include the Genelec 8351A and 8361A, custom Augspurger main monitors found in many top studios, ATC mains, and Quested mains.
Reference Headphones
- Sennheiser HD 600, HD 650, HD 660S2, HD 800S: the modern open-back references.
- Beyerdynamic DT 770 Pro (closed, 80 ohm and 250 ohm versions), DT 990 Pro, DT 880 Pro, DT 1990 Pro.
- Audeze LCD-X, LCD-2, MM-500: planar magnetic.
- Focal Clear MG, Stellia, Utopia 2022.
- AKG K712, K240 Studio, K371.
- Sony MDR-7506, MDR-CD900ST (the Japan studio standard).
- Audio-Technica ATH-M50x, R70x.
- Neumann NDH 20, NDH 30.
Mixing Concepts
Gain staging: nominal levels in professional analog gear are +4 dBu RMS (consumer is -10 dBV); digital reference levels are typically aligned to -18 dBFS or -20 dBFS to match analog +4 dBu RMS (EBU R68 / SMPTE RP 155).
Loudness and metering: peak meters show instantaneous sample value; RMS shows energy; LUFS (Loudness Units Full Scale, defined in ITU-R BS.1770) standardize perceived loudness measurement with K-weighting.
Panning: mono compatibility check; correlation meter for phase coherence; mid/side processing isolates the mono center from the side stereo information.
Equalization: subtractive EQ first (cut what is excessive before boosting); high-pass filter for rumble below typical instrument fundamental; carve narrow Q notches for resonances; broad Q for tonal shaping.
Dynamics: compressors with attack, release, ratio, knee, and makeup gain controls. Parallel (New York) compression blends a heavily-compressed copy back with the dry. Multiband compressors target frequency ranges. Sidechain compression pumps another signal (kick ducking bass, EDM pumping bass under kick).
Effects:
- Reverb types: rooms (short, dense), plates (smooth, vintage), halls (long, large), chambers (spatial); pre-delay controls perceived size; the classic plate reverb is the EMT 140.
- Delay: slap (under 100 ms), 1/8, 1/4, dotted patterns for groove.
- Modulation: chorus, flanger, phaser.
- Saturation: tape, tube, transformer; harmonics add perceived loudness without peak level increase.
Stereo enhancement: the Haas effect (precedence effect) localizes a sound to whichever channel arrives first within roughly 30 ms; subtle widening via short delay difference between channels. Mid/Side EQ processes mono (sum) and stereo (difference) signals independently. SPL Transient Designer-style processors independently boost or reduce attack and sustain envelopes.
Mix workflow: top-down (overall balance first, then refine subgroups, then refine individual elements) versus bottom-up (build each element first, then assemble). The SSL G-series bus compressor at 2:1 ratio, 30 ms attack, auto release, and 2 to 4 dB gain reduction on the mix bus is a near-universal “glue” treatment.
Mastering
Mastering is the final stage between mix and distribution. Goals: translate the mix across playback systems (car, earbuds, club, broadcast), achieve consistent tonal balance, set appropriate loudness, and produce album-level consistency.
Mastering tools:
- Mastering EQ: gentle broad shapes; Maag EQ4 air band, Manley Massive Passive, Weiss EQ1, SPL Iron, Pultec EQP-1A (vintage), Sontec MES-432.
- Mastering compression: Manley Variable Mu (Vari-Mu tube compressor), Shadow Hills Mastering Compressor, SSL G-series bus compressor.
- Limiting: FabFilter Pro-L 2 (the modern transparent reference), Sonnox Oxford Limiter, Slate FG-X, Massey L2007, Brainworx bx_limiter, IK Multimedia Stealth Limiter. True-peak limiting (-1 dBTP) prevents intersample peak overload on lossy codec playback.
- Harmonic processing: SPL Iron, Tube-Tech CL 1B, Maag Magnum-K.
- De-essing: WAVE FabFilter Pro-DS, Sonnox SuprEsser, oeksound soothe2.
- Mid/Side: Brainworx bx_digital V3, FabFilter Pro-Q 4.
Loudness standards: streaming normalization changed loudness practice from the late-2010s onward.
- Spotify and YouTube target -14 LUFS integrated.
- Apple Music targets -16 LUFS integrated.
- Tidal targets -14 LUFS.
- EBU R128 broadcast standard: -23 LUFS with +/- 1 LU tolerance.
- US ATSC A/85 broadcast: -24 LUFS with +/- 2 LU tolerance.
- Cinema (DCI): approximately -27 LUFS reference dialog.
The “loudness wars” of the 1990s through 2010s drove pop masters to -6 LUFS or louder with heavy limiting; streaming normalization reversed the incentive by 2015. Modern pop masters target -10 to -8 LUFS, with true peak below -1 dBTP.
Major mastering houses and engineers:
- Sterling Sound (originally Manhattan, now also Edgewater NJ): Greg Calbi (Bowie, Lou Reed, John Lennon), Ted Jensen (Eagles Hotel California, Norah Jones, Green Day), Tom Coyne (deceased 2017; Adele 21, Taylor Swift 1989), Joe LaPorta, Chris Gehringer.
- Bernie Grundman Mastering (Los Angeles): Bernie Grundman, Chris Bellman, Mike Bozzi, Patricia Sullivan.
- Gateway Mastering (Portland Maine): Bob Ludwig (Led Zeppelin remasters, Daft Punk Random Access Memories, Beatles 2009 remasters; retired 2023).
- Digital Domain (Florida): Bob Katz (creator of the K-system, author of Mastering Audio).
- Mastering Lab (founded by Doug Sax): legacy of analog mastering.
- Howie Weinberg Mastering (Los Angeles).
- 11th Hour Mastering: Emily Lazar; first woman to win the Best Engineered Album (Non-Classical) Grammy in 2019 for Beck’s Colors.
- Glenn Schick Mastering (Atlanta).
- Black Saloon Studios (London): Mandy Parnell.
- Mike Wells Mastering (Hollywood).
- Maor Appelbaum Mastering (Los Angeles).
- Joe Hutchinson at Joe’s Mastering.
Mastering for vinyl requires additional steps. The Neumann VMS80 and VMS70 lathes (no longer manufactured but maintained) cut lacquer masters. Pre-master EQ moves bass to mono below approximately 150 Hz, de-essing to prevent inner-groove sibilance distortion, RIAA pre-emphasis applied by the cutting amplifier. From lacquer, the metalwork goes to electroplating to produce the stamper.
Restoration
iZotope RX (currently version 11) is the standard audio restoration suite, with modules including Voice De-noise, Dialogue Isolate, De-click, De-crackle, De-hum, De-clip, Spectral Repair, Mouth De-click, De-rustle, Mouth De-click, Music Rebalance (source-separation isolation of stems), and the Stem Composer 2024 update. Cedar Audio (CEDAR DNS, Retouch) is the high-end alternative used in film post and forensic work. Sonnox Restore and CrumplePop AudioDenoise are commercial competitors.
Microphone Selection by Source
Vocals:
- Neumann U 47 (vintage, original 1949): the classic vocal large-diaphragm tube condenser; modern reissue Neumann U 47 fet, Telefunken U47 reissue.
- Neumann U 87 Ai: the workhorse studio vocal mic.
- Sony C-800G: the modern pop and hip-hop vocal standard.
- Shure SM7B: the dynamic broadcast mic used heavily for podcast and pop vocals.
- Electro-Voice RE20: the broadcast standard, also used on kick drum and brass.
- AKG C12 (vintage), C414: jazz and classical vocal options.
- Manley Reference Cardioid.
- Mojave Audio MA-300 and MA-200 tube condensers.
Acoustic guitar:
- Neumann KM 84 / KM 184 small-diaphragm cardioid.
- AKG C 451 B.
- Schoeps CMC 6 / MK 4 series.
- DPA 4011 small-diaphragm.
- Royer R-121 ribbon (warm color, often blended with a condenser).
Electric guitar:
- Shure SM57: the universal close mic on guitar cabinets.
- Sennheiser MD 421: tighter alternative.
- Royer R-121 and R-122 ribbon: warm cab mic.
- Beyerdynamic M 160 ribbon.
- Combining a 57 with a ribbon and a room mic is standard.
Drums:
- Kick: Shure Beta 52A or AKG D 112 inside; sub-kick subwoofer-as-mic or Yamaha SubKick outside.
- Snare top: Shure SM57.
- Snare bottom: Shure SM57 or sennheiser e 905 (flipped polarity).
- Toms: Sennheiser MD 421 or Audix D series.
- Hi-hat: AKG C 451 B or Neumann KM 184.
- Overheads: Neumann KM 84, AKG C 414, Coles 4038 ribbon.
- Room mics: AEA R88, large-diaphragm condensers.
Stereo techniques:
- XY: two cardioid mics at 90 degrees, capsules coincident; mono-compatible.
- ORTF: two cardioid mics at 110 degrees, capsules 17 cm apart; the French national radio standard.
- NOS: two cardioid mics at 90 degrees, capsules 30 cm apart.
- Blumlein: two figure-8 mics at 90 degrees, coincident; the original 1931 Alan Blumlein patent.
- Decca Tree: three omnidirectional mics (typically Neumann M 50 or Schoeps MK 2H) in a T configuration above the conductor.
- Mid-side (MS): one cardioid forward plus one figure-8 sideways; sum-and-difference matrix produces stereo with adjustable width in post.
Digital Audio Workstations
The major DAWs:
- Pro Tools (Avid): the industry standard for film, TV, and major-label music; HD and HDX hardware integration; first released 1989 (Sound Tools).
- Logic Pro (Apple): Mac-only; included with one-time purchase; popular with songwriters and Apple-platform producers.
- Cubase (Steinberg): originated MIDI sequencing concept; ASIO driver standard; strong in Europe.
- Ableton Live: clip-launching paradigm; electronic music and live performance dominant.
- FL Studio (Image-Line): hip-hop and electronic production; pattern-based.
- Studio One (PreSonus): clean modern workflow.
- Reaper (Cockos): lightweight, scriptable, very affordable license.
- Bitwig Studio: modular and clip-launching hybrid.
- Digital Performer (MOTU): film scoring with strong notation integration.
- Nuendo (Steinberg): post-production-focused Cubase variant.
Plugin Formats
VST (Steinberg, since 1996); VST3 (since 2008); AU (Apple Audio Units); AAX (Avid Audio eXtension, Pro Tools only since 11); CLAP (CLever Audio Plug-in API, 2022, open-source-friendly modern format). RTAS and TDM (legacy Pro Tools formats, discontinued).
Major Plugin Companies
- FabFilter: Pro-Q 4, Pro-C 2, Pro-L 2, Pro-MB; the modern reference.
- Waves: large catalog, subscription Waves Now.
- Universal Audio: native (since 2022 abandonment of UAD-2 hardware requirement); LA-2A, 1176, Pultec EQP-1A, Manley Massive Passive emulations.
- iZotope: Ozone (mastering), Neutron (mixing assistant), Nectar (vocal), RX (restoration), Tonal Balance Control.
- Soundtoys: EchoBoy, Decapitator, Little AlterBoy, Crystallizer.
- Plugin Alliance: Brainworx, SPL, Maag, Vertigo emulations.
- Slate Digital: Virtual Mix Rack, Virtual Tape Machines, FG-X.
- Eventide: H3000, H910 Harmonizer, Blackhole reverb.
- Valhalla DSP: Valhalla Room, Plate, Shimmer, Delay.
- Acustica Audio: Nebula and Acqua plugins using volterra-kernel sampling.
- oeksound: soothe2 (dynamic resonance suppression).
- Sonible: smart:EQ, pure:limit AI-assisted plugins.
- TDR (Tokyo Dawn Records): Kotelnikov, Nova, SlickEQ.
- Klanghelm: TAL: U-He.
- Native Instruments: Komplete bundle.
DSP Concepts: Sampling and Aliasing
Nyquist-Shannon sampling theorem: sample rate must exceed twice the highest frequency component to avoid aliasing. 44.1 kHz captures up to 22.05 kHz; 48 kHz to 24 kHz; 96 kHz to 48 kHz. Anti-aliasing filter (low-pass) before A/D conversion removes above-Nyquist content. Reconstruction filter at D/A removes mirror images above Nyquist. Oversampling in plugins (typically 2x, 4x, 8x) reduces audible aliasing from nonlinear processing (saturation, distortion). Intersample peaks: between samples, the analog reconstruction can exceed the sample values; true-peak metering accounts for this.
Convolution and IR-Based Processing
Impulse responses (IRs) capture the frequency and time-domain behavior of a linear time-invariant system. Convolution reverbs (Altiverb by Audio Ease, IR-1 by Waves, Liquidsonics Cinematic Rooms) load space IRs. Speaker IRs (cabinet emulations by Two Notes, ML Sound Lab, OwnHammer, 3Sigma Audio) are standard for guitar amp simulators. Impulse responses are also used for microphone emulation (Slate VMS) and outboard gear emulation when nonlinearities are negligible.
Acoustics and Studio Design
Room modes: standing waves at room dimension related frequencies. Axial modes (between two parallel surfaces) are strongest. Bass traps (porous absorbers, membrane absorbers, Helmholtz resonators) damp low-frequency modes. Diffusion (Schroeder diffusers, primitive root diffusers) scatters mid and high frequencies, preventing flutter echo without dead room sound. RT60: reverberation time to decay 60 dB. Critical listening rooms target 0.2 to 0.4 seconds. Standards: EBU R128 / ITU-R BS.1116 listening room recommendations. Major studio designers: John Storyk (Walters-Storyk Design Group: Jimi Hendrix’s Electric Lady 1970, hundreds of studios), Sam Toyoshima (Toyoshima Acoustic Consultants), Russ Berger Design Group, Vincent van Haaff (Waterland Design).
Adjacent
- orchestral-history-and-composition in Music for the historical evolution of orchestral writing
- mixing-and-mastering-detailed in Music for deeper mix engineering technique
- live-sound-and-acoustics in Music for live PA and room acoustics
- film-and-game-music in Music for the screen-music ecosystem
- sampling-and-virtual-instruments in Music for sample library technology
- psychoacoustics-and-hearing in Music for the underlying perceptual science
Notation Software
Sibelius (Avid): widely used in publishing and film orchestration; subscription model since 2017. Finale (MakeMusic): the longtime US competitor; announced end-of-life August 2024 (no further development). Dorico (Steinberg): the modern entrant by the former Sibelius development team; rapidly gaining share post-Finale EOL. MuseScore: open-source; popular for personal and educational use. LilyPond: code-based engraving; preferred by some for typographic quality.
Time-Domain Concepts
Sample: discrete amplitude measurement at a moment in time. Frame: a multichannel sample (e.g., a stereo sample is two samples in one frame). Buffer: a block of samples processed together; buffer size affects latency. Latency = buffer size / sample rate (e.g., 128 samples at 48 kHz = 2.67 ms per direction). Round-trip latency (RTL) is double this plus converter and driver overhead. Live monitoring requires low RTL (under 10 ms preferred).
Audio File Formats
WAV (RIFF container, typically PCM): lossless, the studio interchange standard. AIFF (Apple Interchange File Format): lossless, Apple equivalent. FLAC (Free Lossless Audio Codec): lossless compression. ALAC (Apple Lossless): Apple’s lossless format. MP3 (MPEG-1 Audio Layer III): lossy, 128 to 320 kbps common. AAC (Advanced Audio Coding): lossy, more efficient than MP3; used by iTunes and YouTube. Opus: open lossy codec; superior at low bitrates; used by WhatsApp, Zoom, Discord. DSD (Direct Stream Digital): 1-bit format used by SACD; DSD64, DSD128, DSD256.
Compression Codecs in Detail
Perceptual coding uses psychoacoustic masking models. Frequency masking: a loud tone masks adjacent quieter tones. Temporal masking: a loud event masks quieter sounds shortly before and after. MP3 uses MDCT (Modified Discrete Cosine Transform) on overlapping windows. AAC improves with better filter banks and prediction. Opus combines CELT (low-latency) and SILK (speech) modes.
MIDI Protocol
MIDI 1.0: 1983 specification; 31.25 kbaud serial; 7-bit data values; channels 1-16. MIDI 2.0: 2020 specification; 16-bit and 32-bit data resolution; bidirectional; backward compatible. MIDI Polyphonic Expression (MPE): each note gets its own channel for independent expression. MIDI Show Control (MSC) for theater and entertainment. MIDI Time Code (MTC) for synchronization. General MIDI (GM) standard sound bank; GM2, GS, XG extensions.
Surround and Immersive Audio
5.1 surround: L, C, R, Ls, Rs plus LFE; cinema standard since the 1990s. 7.1 surround: adds Lrs, Rrs rear surrounds. Dolby Atmos: object-based audio with up to 128 simultaneous objects; height channels added. Atmos for Music: streaming-distributed Atmos via Apple Music, Tidal, Amazon Music. Dolby Atmos Renderer software for home and mobile playback. DTS:X: Atmos competitor. Sony 360 Reality Audio: object-based for streaming. Ambisonics: B-format (W, X, Y, Z) for spherical capture; first-order through higher-order.
Console Workflows
Solid State Logic Duality, AWS, Origin, ORIGIN 32: hybrid analog console with DAW automation. SSL 4000 G/E series: the 1980s-90s pop and rock console standard. Neve VR, 88R, Genesys: large-format analog with summing. API Vision and Legacy AXS: discrete analog with VCA automation. Avid S6 and S6L: digital control surfaces for Pro Tools.
Vocal Production Techniques
Comping: take selection across multiple recorded vocal takes. Tuning: Antares Auto-Tune (since 1997, written by Andy Hildebrand), Melodyne (Celemony, capable of polyphonic editing), Logic Pro Flex Pitch, Steinberg VariAudio. Vocal tracking: layered double-tracking, octaves, harmonies, ad-libs. De-essing strategies: dynamic EQ (FabFilter Pro-DS, oeksound soothe2) versus traditional broadband de-esser. Vocal effects chains: aural exciter, harmonic enhancer, parallel saturation.
Drum Replacement and Augmentation
Drumagog (the original 2002 plugin). Slate Trigger 2. SPL DrumXchanger. Logic Pro Drum Replacement. Toontrack Superior Drummer 3 sample library. BFD3 (FXpansion now inMusic). EZdrummer 3 (Toontrack). Steven Slate Drums. Getgood Drums.
Modern Production Trends
Stem-based delivery: trackouts, alternates, instrumental, TV mix, beds. Streaming optimization: -14 LUFS targeting; mono compatibility on smart speakers. Spatial audio mixes for Apple Music: Atmos delivery. Real-time collaboration: Avid Cloud Collaboration, Soundtrap, BandLab, Splice. Stem separation AI: Spleeter (Deezer), Demucs (Meta), iZotope RX 11 Music Rebalance, LALAL.AI, Ultimate Vocal Remover. AI mastering: LANDR (since 2014), iZotope Ozone AI Master Assistant, BandLab Mastering, eMastered.
Famous Producers and Studios
Rick Rubin (Shangri-La Studios Malibu): minimalist production legend. Quincy Jones (deceased 2024): Thriller, Off the Wall, We Are the World. George Martin (deceased 2016, AIR Studios London): Beatles producer. Brian Eno: ambient music pioneer, U2, Talking Heads, Coldplay. Nile Rodgers (Daft Punk Random Access Memories, Chic, David Bowie, Madonna). Pharrell Williams (Neptunes, N.E.R.D., countless hits). Max Martin (Cheiron, MXM, dozens of #1 hits including Britney Spears, Backstreet Boys, Taylor Swift, The Weeknd). Dr. Dre (Aftermath, beats by dre). Jack Antonoff (Bleachers, Taylor Swift, Lana Del Rey, Lorde). Finneas O’Connell (Billie Eilish): bedroom-studio Atmos productions. Mark Ronson (Uptown Funk, Amy Winehouse, Adele co-writes). Greg Kurstin (Adele Hello, Sia). Pharrell + Chad Hugo (Neptunes). Timbaland. Andrew Watt (Post Malone, Ozzy Osbourne, Rolling Stones Hackney Diamonds 2023).
Iconic Studios
Abbey Road Studios (London). Capitol Studios (Los Angeles). Sunset Sound (Los Angeles). Sound City (Van Nuys, immortalized in Dave Grohl’s 2013 documentary). Electric Lady Studios (NYC, Jimi Hendrix founded 1970). Henson Recording Studios (former A and M Studios, Los Angeles). Power Station (NYC, now Avatar Studios then Power Station NY). Ocean Way Recording (Nashville and LA branches). Real World Studios (Peter Gabriel, England). Conway Recording (Los Angeles). RCA Studio B (Nashville). Muscle Shoals (Alabama). Air Studios (London).
Famous Engineers and Mix Engineers
Bruce Swedien (Quincy Jones, Michael Jackson Thriller; deceased 2020). Bob Clearmountain (legendary mix engineer: Bruce Springsteen, Bryan Adams, Tom Petty). Chris Lord-Alge (CLA, his brother Tom Lord-Alge): Green Day, Nickelback, Carrie Underwood; the modern dense-pop mix sound. Andy Wallace (Nirvana Nevermind mix, dozens of grunge and metal). Tom Elmhirst (Beyonce, Mark Ronson, Frank Ocean, Adele). Spike Stent (Madonna, Muse, Lady Gaga, Coldplay, Beyonce). Manny Marroquin (Bruno Mars, John Legend, Alicia Keys, Lizzo). Serban Ghenea (Taylor Swift, Justin Timberlake, Beyonce). Dave Pensado (countless R and B and pop; YouTube show Pensado’s Place). Michael Brauer (My Morning Jacket, John Mayer, Coldplay). Jaycen Joshua (J Cole, Kanye West, Beyonce). Mike Dean (Kanye West, Travis Scott, Jay-Z). Leslie Brathwaite (Pharrell, Janelle Monáe).
Famous Mastering Engineers Beyond US
Mandy Parnell (Black Saloon Studios, London): Bjork, FKA twigs, The xx. Stephen Marcussen (Marcussen Mastering, Hollywood): Rolling Stones, Tom Petty. Joe Hutchinson Jr. Patricia Sullivan (Bernie Grundman Mastering): Janet Jackson, Brandy. Gavin Lurssen (Lurssen Mastering, LA): Sheryl Crow, Foo Fighters, T-Bone Burnett. John Davis (Metropolis Mastering London): Lana Del Rey, U2. Frank Arkwright (Abbey Road London): Arctic Monkeys, Doves. Miles Showell (Abbey Road London): Bowie reissues, half-speed master vinyl. Geoff Pesche (Abbey Road London): Coldplay, Gorillaz. Gareth Jones. Adam Ayan (Gateway Mastering, Maine).
Live Sound Engineers
Robert Scovill (Tom Petty, Rush, Def Leppard); pioneer of digital PA. Dave Natale (Rolling Stones, Fleetwood Mac). Brad Madix (multiple major arena tours). Buford Jones (Pink Floyd Wall tour 1980-81 design). Big Mick Hughes (Metallica FOH). Sean Spuehler (Daft Punk).
Console Generations
1960s vintage: Pultec, Fairchild 670 (early limiters). 1970s: Neve A series (now revered for sonic character), API 1604, Trident A, Helios. 1980s: SSL 4000 E and G (the pop and rock standard), Neve VR. 1990s: Neve 88R, SSL 9000J, AMS Neve Capricorn (early digital). 2000s: digital consoles (Yamaha DM2000, Digidesign Icon, Sony Oxford OXF-R3). 2010s-2020s: hybrid analog with DAW integration (SSL Duality, Origin, ORIGIN 32; API Vision Legacy; AMS Neve Genesys Black).
Synthesizers Major Lines
Moog: Minimoog Model D, Moog One, Subsequent 37, Voyager, Mother-32, DFAM, Subharmonicon. Sequential: Prophet-5, Prophet 6, Prophet X, OB-6, Take 5. Oberheim: OB-X, OB-Xa, OB-8, Matrix-12, OB-X8 (reissue 2022). Roland: Jupiter-8, Juno-60, Juno-106, D-50, JD-800, SH-101, System-700, modern AIRA series, Roland Cloud. Yamaha: DX7, CS-80, Montage M, MODX+. Korg: MS-20, Wavestate, Modwave, Opsix, Minilogue XD. Modular Eurorack: Doepfer, Make Noise, Mutable Instruments (now public domain after 2022 closure), Intellijel, 4ms, Befaco. Software synthesis: Native Instruments Massive X, FabFilter Twin 3, Arturia V Collection, U-He Diva, Spectrasonics Omnisphere, Synthogy Ivory.
Effects Pedals and Outboard
Eventide H3000, H9, H910 Harmonizer. Strymon: BigSky, TimeLine, El Capistan, Volante, Cloudburst, NightSky. Empress Effects: ZOIA, Reverb. Chase Bliss Audio: Mood, Dark World, Generation Loss, MOOD MKII. Universal Audio outboard: 1176, LA-2A, Distressor by Empirical Labs. Lexicon 480L (the 1980s classic). TC Electronic M-One, M3000, System 6000. Bricasti Design M7 (modern reverb reference). Cooper Time Cube, AKG BX-20.
Vinyl Manufacturing
Lacquer cutting: Neumann VMS70, VMS80, Scully, Westrex. Master cutting engineers: Steve Hoffman, Bob Pollard, Bernie Grundman, Kevin Gray, George Marino (deceased). Pressing plants: GZ Media (Czech Republic, largest), United Record Pressing (Nashville), QRP (Quality Record Pressings), RTI (Record Technology Inc.). Electroplating: lacquer to silver, then father, mother, stamper. Vinyl resurgence: from sub-USD 100 million annual US sales to USD 1.5 billion 2024.
Cassette and Tape Renaissance
Boutique cassette labels: Burger Records (closed 2020), Cassette Store Day movement. Reel-to-reel tape: Studer A800, Otari MX-80, Ampex ATR-102 (Daniel Lanois favorite). Tape emulation plugins: Universal Audio Studer A800, Slate Digital Virtual Tape Machines, Waves Kramer Master Tape. Tape compression characteristics: high-frequency rolloff, low-frequency bump, mid-range warmth, harmonic saturation.
Tracking Techniques
Live tracking: full band performing together to capture chemistry. Overdubbing: layer by layer construction. Isolation: gobos, vocal booths, baffles. Headphone monitoring with click track and cue mix. Talkback intercom between control room and live room. Multi-room tracking: drum room with isolated guitar amp room.
Vocal Booth Acoustics
Treatment: bass traps, absorbers, diffusion. Avoid parallel walls. Low-frequency control especially important in small booths. Vocal Booth To Go portable acoustic enclosures. WhisperRoom isolation booths. Studiobricks modular booths.
Spatial Audio Production Workflow
Dolby Atmos Production Suite (DAW-integrated renderer). Dolby Atmos Music Panner. Bed channels and object channels. ADM (Audio Definition Model) BWF metadata. Mastering for Atmos: Dolby Atmos Renderer, Dolby Atmos Mastering Suite. Encoding to Atmos DD+JOC or AC-4 for delivery. Apple Spatial Audio with head tracking on AirPods Pro/Max.
Notable Sample Library Companies
Spitfire Audio (London): BBC Symphony Orchestra series; Albion ONE; Hans Zimmer Strings (HZS). Vienna Symphonic Library (Austria): Synchron Series, Vienna Instruments. Orchestral Tools (Berlin): Berlin Strings, Berlin Brass, Berlin Woodwinds, Metropolis Ark, SINE Player. EastWest (Hollywood Orchestra Opus Edition; Composer Cloud subscription). Cinesamples (CineStrings, CineBrass, CineWinds, CinePerc). Audiobro (LA Scoring Strings, Modern Scoring Strings). 8Dio (Adagio series, Insolidus choir). Heavyocity (Damage, Forzo, Master Sessions). ProjectSAM (Symphobia, True Strike, Animator). Embertone (Friedlander Violin, Joshua Bell Violin). Native Instruments Symphony Series. LASS by Audiobro.
Recent Spatial Mixing Releases
Dolby Atmos Music albums available on Apple Music, Tidal, Amazon Music HD. Hans Zimmer Hollywood Studio Symphony Atmos mixes. The Beatles Get Back Atmos remixes by Giles Martin. Pink Floyd Dark Side of the Moon 50th Anniversary Atmos. Major label catalog rework: Universal Music Group Atmos delivery program.
Music AI Ethics and Industry
RIAA lawsuits: vs. Suno (June 2024) and Udio AI music generation companies. Voice cloning: Drake “Heart on My Sleeve” AI track 2023 removed. SAG-AFTRA 2023 strikes including AI use limits. Concert tours AI integration: ABBA Voyage (London, since 2022). Stem separation services: LALAL.AI, Moises, Vocal Remover. AI mastering: LANDR, eMastered, BandLab, Ozone AI.
Notable Recording Equipment Manufacturers
Microphones: Neumann, AKG (Harman), Shure, Sennheiser, Audio-Technica, Royer Labs, Coles, Telefunken, Manley Labs, Mojave Audio, Soyuz, Lewitt, Earthworks. Preamps: Neve (AMS Neve), API, Heritage Audio, BAE Audio, Avalon, Universal Audio, Grace Designs, Millennia Media. Compressors: Universal Audio (1176, LA-2A), Empirical Labs (Distressor), Tube-Tech, SSL (G-series Bus Comp, Fusion), Smart Research C2. EQs: Pultec, API 550, Neve 1073, SSL 4000, Maag, Manley Massive Passive, GML.
Frequency Spectrum Guidance
20-60 Hz: sub bass; felt more than heard; kick fundamental, bass drop. 60-200 Hz: bass; chest punch. 200-500 Hz: muddiness band; carve here for clarity. 500 Hz-2 kHz: midrange; vocal fundamentals. 2-5 kHz: presence; ear sensitivity peak; vocal intelligibility. 5-10 kHz: brilliance; cymbals, vocal sibilance. 10-20 kHz: air; cymbals, breath, ambience.