Audio Production, Mixing & Mastering — Music Reference
A practitioner-oriented reference for the full audio production chain: capture, edit, mix, master, monitor. Covers the canonical analog gear, the modern digital toolchain (2024-2026), and the immersive + AI-assisted workflows that have become standard in the last 24 months.
Companion notes: music-theory-essentials, signal-processing-dsp, acoustics-noise-control, fft-spectral.
1. Signal Flow — The Big Picture
The canonical recording signal path, source to monitor:
Acoustic source → Microphone (transducer)
→ Preamp (mic-level → line-level, ~+40-60 dB gain)
→ A/D converter (analog voltage → PCM samples)
→ DAW (digital audio workstation; storage + plugin processing)
→ D/A converter (PCM → analog voltage)
→ Monitor amplifier (line → speaker level)
→ Loudspeaker or headphone driver (electrical → acoustic)
→ Listener's ears + room acoustics
Key transitions and the failure modes at each one:
- Source → mic: distance, off-axis coloration, room reflections bleed in.
- Mic → preamp: gain staging — too little gain leaves the converter under-fed (raises noise floor); too much clips the preamp.
- Preamp → A/D: aim for peaks around -12 to -6 dBFS at 24-bit. There is zero reason to track hot in 2026; the converter’s dynamic range (~120 dB at 24-bit) far exceeds any source.
- DAW: stay in 32-bit float internally — sums never clip mathematically, only at output.
- D/A → monitors: the room is the final filter and is almost always the weakest link.
Reference levels: +4 dBu = 1.228 Vrms (pro line); -10 dBV = 0.316 Vrms (consumer); 0 dBFS = full-scale digital; analog 0 VU typically calibrated to -18 dBFS (EBU) or -20 dBFS (SMPTE).
Sample rates and bit depths: 44.1 kHz / 24-bit is the universal release standard; 48 kHz for film + broadcast; 88.2 / 96 kHz when working with heavy nonlinear processing (saturation, pitch-shift) to push aliasing artifacts above audibility; 192 kHz is mostly marketing. Internal mix bus: 32-bit float.
2. Microphones
2.1 Dynamic
Moving-coil capsule; rugged, no phantom power, lower sensitivity, smoother high-end. Workhorses for loud sources.
- Shure SM57 — universal snare top + guitar amp mic. Presence peak around 6 kHz.
- Shure SM58 — live vocals, ball-grille version of the SM57 capsule.
- Electro-Voice RE20 — broadcast/voiceover staple, also kick + bass cab; Variable-D design rejects proximity effect.
- Sennheiser MD 421 — toms, guitar cabs, broadcast; 5-position bass roll-off.
- Shure SM7B — vocal podcast/broadcast standard; needs ~60 dB of clean gain (pair with Cloudlifter CL-1 or FetHead).
- Electro-Voice RE320 — broadcast + kick-drum dual-voice variant of the RE20.
2.2 Condenser — Large Diaphragm (LDC)
Capsule >1”, externally polarized or electret; high sensitivity, extended top end, requires +48 V phantom (except tube models with dedicated PSU).
- Neumann U 87 Ai — the lead-vocal default of countless studios; cardioid/omni/figure-8.
- AKG C414 XLII — nine polar patterns; brighter sibling to the C414 XLS.
- Neumann TLM 103 — single-pattern fixed cardioid, large K87-derived capsule.
- Sony C-800G — hip-hop and modern pop vocal go-to; Peltier-cooled tube.
- Telefunken U47 (reissue) and Wunder CM7 — classic German vocal sound, M7 capsule.
- Austrian Audio OC818 — modern dual-output capsule, Vienna-built post-AKG-Vienna.
- Lewitt LCT 440 Pure / LCT 540 Subzero — budget-to-mid LDCs that consistently outperform their price.
- Shure MV7+ (2024) — USB/XLR hybrid podcast mic, replaces the MV7; on-board DSP and touch panel.
2.3 Condenser — Small Diaphragm (SDC)
Capsule <1”; flatter response, faster transient, better off-axis behavior. Pencil mics; pairs for stereo.
- Neumann KM 184 — overheads, acoustic guitar, hi-hat; bright but tame.
- AKG C451 B — hi-hat, acoustic, drum overheads; classic presence lift.
- DPA 4011 / 4006 — high-end orchestral capsules.
- Schoeps CMC 6 + MK 4 — broadcast and classical reference.
- Earthworks SR40 — flat to 40 kHz, used for measurement-grade capture.
2.4 Ribbon
Corrugated aluminum/duralumin ribbon in a magnetic field; figure-8 inherently; warm, rolled-off top end, fast transient response. Most modern ribbons take phantom; older RCAs do not — phantom on a passive ribbon can destroy it.
- Royer R-121 — modern active/passive ribbon; guitar cabs, brass, drum room.
- Royer R-10 / R-122 MKII — affordable + active versions.
- Coles 4038 — BBC-era ribbon, still the orchestral and drum-room reference.
- AEA R84 / R44 — RCA-44 lineage modern builds.
- sE Electronics Voodoo VR1 / VR2 — active ribbons, extended top end.
2.5 USB and MEMS / Lavalier
- Shure MV7+ and Audio-Technica AT2020USB-XP (2024) — content-creator workhorses.
- Rode NT-USB+ — built-in DSP, zero-latency monitoring.
- Sanken COS-11D — film/TV lavalier standard; omni, sweat-resistant.
- DPA 4061 / 4060 — high-fidelity miniature lavs.
- DPA 6060 subminiature — broadcast clip-on.
- MEMS capsules (Knowles, Infineon, ICS-40720) — phone and earbud capture, increasingly used in pro lav designs for low handling noise.
3. Preamps and Interfaces
3.1 Preamps — Topology Families
- Transformer-coupled discrete (Neve) — Marinair/Carnhill input, class-A discrete; the 1073/1081/Shelford “Neve sound” — saturated bottom, midrange thickness.
- Op-amp + transformer (API) — 2520 op-amp + API 2503 output transformer; punchy, fast.
- Transformerless solid-state (Millennia, Grace, GML) — clean, ruler-flat, the “wire-with-gain” school.
- Tube (Manley, Telefunken V72/V76, Universal Audio LA-610) — harmonic generation at higher gain.
- SSL VHD / SuperAnalogue — clean by default, switchable variable harmonic distortion.
3.2 Interfaces (with built-in preamps and conversion)
- Universal Audio Apollo x6 / x8 / x8p / x16 — Thunderbolt, UAD-2 DSP onboard for tracking through UAD plugins at near-zero latency; Unison preamp emulation (1073, API, Neve 88RS, Helios).
- Audient iD14 MkII / iD44 MkII / iD48 — class-A console preamps borrowed from the ASP8024 console.
- Apogee Symphony Desktop / Symphony I/O MkII — top-tier conversion; Apogee Alloy mic emulation.
- RME Fireface UFX III / UCX II / Babyface Pro FS — driver stability legend; SteadyClock FS clocking.
- Focusrite Scarlett 4th Gen (2023) — bedroom-studio default; auto-gain and clip-safe at the consumer tier.
- AVID Pro Tools | Carbon and HDX — Pro Tools-native hybrid DSP/native rigs.
- SSL 12 / SSL 2 MK2 (2024) — SSL preamp coloration in a desktop unit.
- MOTU Ultralite-mk5 / 828es / 1248 AVB — solid bus-powered Thunderbolt/USB.
- Cranborne Audio Camden 500 / Camden EC1 / Camden 250 — vari-distortion 500-series preamps; “Mojo” saturation circuit.
- Topping E70 / D90 III / Pre90 — measurement-darling DA/preamp stack.
- Audient EVO 16 — affordable 16-input, smart-gain.
3.3 A/D and D/A Conversion
Converter chips you’ll see in pro gear (2024-26):
- AKM AK4499EX / AK4497 — flagship multibit ΔΣ DAC chips (Asahi Kasei post-fire recovery).
- ESS Sabre ES9038PRO / ES9039PRO — used in Apogee, Topping, and most “ultra” interfaces.
- Burr-Brown PCM4222 — Apogee Symphony reference ADC.
- Cirrus Logic CS43198 / CS5366 — common in portable + audio interfaces.
- RME uses a custom front-end around AKM/ESS plus SteadyClock FS for jitter reduction.
For interview/conversion purposes: THD+N around -110 dB and dynamic range ~120 dB is the floor of pro-grade conversion in 2026. Differences below this are inaudible; choose on driver stability and feature set.
4. Tracking — Per-Instrument Mic Strategies
4.1 Vocals (Lead)
- Mic: U 87 / TLM 103 / Sony C-800G / SM7B for darker voices.
- Distance: 6-12 in, on-axis, slightly above the mouth to reduce sibilance.
- Pop filter 4-6 in from the capsule; reflection filter or treated booth behind.
- Chain: mic → Neve 1073 / API 512c / Manley VOXBOX → LA-2A or 1176 (3-5 dB GR) → Pultec EQP-1A → A/D.
- Tracking comp: light (3-5 dB GR), conservative attack; keep dynamics for the mix.
4.2 Drums
- Kick: AKG D112 or Shure Beta 91A inside; Sennheiser MD 421 or RE20 outside for sub.
- Snare top: SM57, 1-2 in above the head, 45° pointing at the center.
- Snare bottom: SM57 or Sennheiser e904, polarity-flipped relative to top.
- Toms: Sennheiser MD 421 (clip-on or stand) or Audix D2/D4.
- Hi-hat: AKG C451 B or Neumann KM 184, 4-6 in from bell, edge-on to avoid air wash.
- Overheads: matched pair of small-diaphragm condensers (KM 184, C451 B, Coles 4038 ribbons) — XY, ORTF, spaced pair, or Glyn Johns 3-mic method.
- Room: Coles 4038 / Royer R-121 / U 87 in omni, 8-12 ft back, compressed heavily for “smash” track.
- Phase: measure top-to-bottom snare phase relationship; flip the bottom if necessary; check kick-in vs. kick-out polarity.
4.3 Bass
- DI + amp blend: Radial JDI passive or Avalon U5 active DI for direct; mic’d cab with SM57, RE20, or MD421 in front of the speaker (slightly off-center for less ice-pick top).
- Amp sim alternative: Neural DSP Parallax / Darkglass Ultra plugins.
4.4 Electric Guitar
- Classic Fredman: SM57 on-axis to cone edge + SM57 off-axis at 45°, capsules touching.
- SM57 + Royer R-121 blend — bright SM57 with the smooth R-121.
- Sennheiser MD 421 for chunky low-mids.
- Room mic 6-10 ft back when the room is good.
- Amp sim: Neural DSP (Archetype: Plini, Nolly, Petrucci, Gojira), Universal Audio UAFX pedals + plugins, STL Tones, Helix Native.
4.5 Acoustic Guitar
- Spaced pair: KM 184 / C451 B at 12th fret + lower bout, ~12 in away, slight angle inward.
- X/Y or ORTF: pair of KM 184s aimed at 12th fret and bridge.
- LDC alone: U 87 in cardioid, 8-12 in from the 12th fret.
4.6 Piano
- X/Y above strings for upright/grand mid-field.
- AB pair ~2 ft apart, ~8 in above strings — pop/rock standard.
- Decca tree (M50/CMC6 + LDC) over the player for classical capture.
- Spot mics under the lid for definition.
4.7 Orchestral / Choir
- Decca tree: three spaced omnis (Neumann M50 or DPA 4006) in a T configuration above the conductor.
- Outriggers: spaced omnis ~3 m further apart for stereo width.
- Spot mics: section spots on first violins, woodwinds, brass, percussion.
- Hall mics: AB or Blumlein pair at the back of the hall for ambience.
5. Editing — Pre-Mix Cleanup
5.1 Comping
Take multiple performance passes (“takes”); use the DAW’s playlist/take-comping view (Pro Tools Playlists, Logic Take Folders, Cubase Lanes, Studio One Layers) to assemble the best phrase from each. Crossfade boundaries; aim for equal energy crossfades to avoid level dips.
5.2 Gain Staging
Normalize clip gain so peaks sit at -12 to -6 dBFS before plugins. With 32-bit float internally this is about preserving headroom for plugin emulation models (which often have analog-referenced sweet spots around -18 dBFS RMS).
5.3 Fades and Edits
Always fade in/out across cuts to avoid clicks. Use logarithmic fades for natural decay. Hide edits on transients; never edit across a sustained vowel without crossfade.
5.4 Time-Alignment
- Manual: zoom to sample level, drag the later mic’s first transient to align with the close mic (e.g., kick out vs. kick in).
- Automatic: Sound Radix Auto-Align Post 2, Pro Tools Snap-to-Transient, Cubase Audio Alignment.
- Polarity sweeps as the final check.
5.5 Pitch Correction
- Auto-Tune Pro X (Antares, 2023-24) — the original since 1997, “graph” + “auto” modes.
- Melodyne 5 Studio (Celemony) — DNA Direct Note Access lets you edit individual notes inside a chord; the surgical pitch + timing reference.
- Waves Tune Real-Time — low-latency live correction; the modern “T-Pain” effect baseline.
- Synchro Arts Revoice Pro 5 / Vocalign Project 6 — align doubles + harmonies to the lead’s timing and pitch.
5.6 Restoration and Repair
- iZotope RX 11 (2024) — industry-standard suite: Spectral Repair, De-noise, De-click, De-hum, Mouth De-click, Dialogue Isolate, Music Rebalance (AI stems), Voice De-noise.
- Acon Digital Acoustica Premium Edition — competitor with Restoration Suite 2.
- Accentize dxRevive Pro (2024) — AI dialog de-reverb + restore.
- CEDAR Studio — high-end forensic/post.
6. EQ — Equalization
6.1 EQ Types
- Parametric — frequency, gain, Q (bandwidth) all adjustable; the main mixing tool.
- Graphic — fixed bands (typically 1/3-octave); live sound and broad tonal shaping.
- Dynamic — gain change depends on signal level above a threshold (combines compression + EQ).
- Linear-phase — zero phase shift across bands; pre-ringing artifact, use sparingly (mastering, parallel chains).
- Minimum-phase — natural analog behavior, phase rotates around boost/cut frequency.
6.2 Filter Shapes
- Bell (peaking) — boost/cut around center frequency.
- Shelf — boost/cut everything above (high shelf) or below (low shelf) a corner.
- High-pass / Low-pass — remove energy below/above a cutoff at a slope (12, 24, 48 dB/oct).
- Notch — narrow Q cut, kills resonances or hum.
- Tilt — single control rotates the spectrum.
6.3 Classic Analog EQs and Their Emulations
- Pultec EQP-1A — passive program EQ; the famous “boost and cut at the same frequency” low-end trick on bass + kick (cut 100 Hz, boost 100 Hz, shelf wider).
- Manley Massive Passive — passive parallel-band tube EQ; bold, musical wide curves.
- SSL G-Series 4000 / 9000 channel EQ — Black knob (E) vs. Brown knob (G) variants; pop/rock console standard.
- Neve 1073 / 1081 — three musical bands with stepped frequencies; mid-forward.
- API 550A (3-band) / 550B (4-band) / 560 (graphic) — Proportional-Q; the API “punch.”
- GML 8200 — pristine surgical mastering EQ (Massenburg).
6.4 Modern Digital EQs
- FabFilter Pro-Q 4 (2024) — dynamic + linear-phase + natural-phase modes, spectrum grab, surgical and musical.
- Soothe 2 (oeksound) — dynamic resonance suppressor; tames vocal sibilance + harsh guitar resonances.
- DMG Audio EQuilibrium — every EQ topology in one plugin (Brown, Crisp, Vintage, etc.).
- Brainworx bx_digital V3 / bx_console — M/S EQ, console-channel emulations.
- Acustica Audio Diamond / Pink / Amber / Gold / Sand / Silver — convolution-based vintage console emulations (Vibe / sampled approach).
- Waves Q10 / SSL E + G / API 550 / Scheps Omni Channel — broad budget catalog.
6.5 EQ Approach
- Subtractive first — cut problems (mud 200-400 Hz, harsh 2-4 kHz, sizzle 6-8 kHz) before boosting.
- HPF everything that doesn’t need sub — vocals at 80-100 Hz, hi-hat at 200 Hz, etc.
- Boost broadly, cut narrowly is a rough rule; cuts can be wide too when shaping tone.
- Solo cautiously — EQ moves that sound right in solo often sound wrong in context.
- Reference: bypass frequently; compare against unprocessed.
7. Compression and Dynamics
7.1 Topologies
- FET (1176-style) — fast attack (down to ~20 µs), aggressive; vocals, drums, parallel “smash.”
- Universal Audio 1176LN; emulations: UAD 1176 Rev A/E, Waves CLA-76, FabFilter Pro-C 2 (FET mode), Klanghelm DC8C 3.
- Optical (LA-2A / LA-3A) — program-dependent attack/release via electroluminescent panel + photocell; smooth, slow, “self-leveling”; vocals + bass.
- Teletronix LA-2A; UAD LA-2A, Waves CLA-2A, Black Rooster Audio VLA-2A.
- VCA (SSL G-Master, dbx 160, API 2500) — fast, transparent, tight knee; bus and individual.
- SSL G-Master Buss Comp (“the SSL mix glue”); dbx 160A on snare; API 2500.
- Vari-Mu (Fairchild 670, Manley Variable Mu) — tube-bias modulation, slowest, smoothest, used on mix bus + mastering.
- Manley Variable Mu; UAD Fairchild 670 Legacy + Tube-Tech CL 1B.
- Compander/diode-bridge (Neve 33609, 2254) — gentle bus compression, vintage glue.
7.2 Controls
- Threshold — level above which compression engages.
- Ratio — input:output above threshold. 2:1 gentle, 4:1 moderate, 8:1 firm, 20:1+ limiting.
- Attack — time before compression reaches full ratio. Fast (~1 ms) flattens transients; slow (~30-100 ms) preserves them.
- Release — time to return to unity after signal drops below threshold. Match to the program tempo and groove.
- Knee — soft (smooth onset) or hard (sharp).
- Makeup gain — restores output level after gain reduction.
- Sidechain — external trigger (e.g., kick triggers bass ducking) or filtered internal (HPF the detector around 100 Hz so kicks don’t pump the whole mix).
7.3 Rules of Thumb
- Vocals: LA-2A (3-5 dB GR, slow program) → 1176 (4:1, fast attack, fast release, 3-5 dB GR) — the “compressor in series” trick.
- Snare: 1176 or dbx 160 — slow attack to let the transient through, fast release.
- Kick: 4:1, medium attack (~30 ms), medium release; let the click come through.
- Bass: LA-2A or 1176 in 4:1, slow attack — even out the level.
- Drum bus: SSL G or API 2500, 4:1, fast attack, auto release, 2-4 dB GR.
- Mix bus: SSL G or Shadow Hills Mastering Compressor, 2:1 or 4:1, ~1-2 dB GR, slow attack.
7.4 Modern Digital Compressors
- FabFilter Pro-C 2 — six modes (Clean, Classic, Opto, Vocal, Mastering, Bus, Pumping); excellent metering.
- Cytomic The Glue — SSL Bus Compressor emulation; the producer’s secret weapon for cheap, transparent glue.
- Klanghelm DC8C 3 — minimalist but deeply tweakable.
- Tokyo Dawn Labs Kotelnikov GE — mastering-grade VCA + program-dependent release.
- iZotope Neutron 5 — AI-assisted mix compressor.
- Sonible smart:comp 2 — AI-set thresholds + spectral-aware compression.
7.5 Multiband + Dynamics Variants
- Multiband compression — splits signal into bands, compresses each independently; powerful but easy to over-use.
- FabFilter Pro-MB, iZotope Ozone Dynamics, Waves C6, TDR Nova GE.
- Multipressor (Logic) — multiband baked into Logic.
- De-esser — narrowband compressor at 5-9 kHz on vocal sibilance.
- FabFilter Pro-DS, Waves DeEsser, oeksound Soothe 2.
- Transient designer — independent attack/sustain control without level threshold.
- SPL Transient Designer, Native Instruments Transient Master, Waves Smack Attack.
- Expander / Gate — opposite of compression; below threshold, signal is reduced. Drum tracks, dialogue noise floor.
- Limiter — extreme ratio (∞:1), brick-wall ceiling; mastering’s final stage.
7.6 Parallel + Sidechain
- Parallel compression (“New York compression”) — duplicate the bus, smash a copy with 1176 (all-buttons-in), blend underneath. Adds density without losing transients.
- Sidechain ducking — kick triggers bass compression so the low end stays clean. EDM uses this aggressively for the “pump.”
- External sidechain — feed any source as the detector input (vocal ducks the synth pad, for example).
7.7 Mid-Side Processing
Decode stereo into Mid (sum, L+R) and Side (difference, L-R), process independently, re-encode.
- Widen the high end: shelf-boost Side > 5 kHz.
- Tighten the low end: HPF Side below 120 Hz, mono’d bottom.
- Mono-compatible imaging: critical for vinyl, AM radio, club PA.
- Tools: Brainworx bx_digital V3, FabFilter Pro-Q 4, iZotope Ozone Imager 2.
7.8 Automation and Bussing
- Automation modes: read, write, touch, latch, trim. Volume rides on vocals/instruments through verses/choruses; sends to reverb/delay automated for transitions.
- Bus structure typical:
- DRUMS bus → drum compression + EQ
- VOX bus (lead + BGV subgroups)
- GTR bus / SYNTH bus
- FX RETURNS bus (reverb + delay aux returns)
- MASTER bus (final SSL G + tape + EQ + limiter chain)
- VCA/group faders for grouped control without summing audio (Pro Tools VCAs, Logic stack groups).
8. Reverb
8.1 Reverb Types
- Plate — large suspended metal plate driven by transducer; dense, bright, vocal-friendly.
- EMT 140 / EMT 240 — vocal + snare classic.
- Spring — coiled springs; characteristic “boing” + drip.
- Fender amp tank, AKG BX-20.
- Room — natural small/medium acoustic space.
- Chamber — purpose-built acoustic reverb room (e.g., Capitol Studios).
- Hall — large concert-hall reverberation, long decay, dense late field.
- Cathedral — extreme tail, mostly orchestral/cinematic.
8.2 Algorithmic vs. Convolution
- Algorithmic — DSP networks (Schroeder, Moorer, FDN) generating dense diffuse decay; tweakable in real time.
- Lexicon 224 / 480L / 960L — the cinematic standard.
- Bricasti M7 — modern high-end hardware, “magic” lush hall.
- TC Electronic System 6000.
- Convolution — impulse responses (IRs) of real spaces or hardware; ultra-realistic, less morphable.
- Audio Ease Altiverb 8 — film standard, hundreds of sampled venues.
- LiquidSonics Cinematic Rooms Pro — fused convolution + algorithmic for shifting tails.
- Waves IR-L / IR-1.
8.3 Modern Plugins
- Valhalla DSP — Room, Plate, VintageVerb, Delay, Supermassive (free), Shimmer. The price-to-quality ratio in modern reverb.
- FabFilter Pro-R 2 (2024) — spectrum-EQ-style reverb shaping.
- UAD Lexicon 224 / EMT 140 / EMT 250 — hardware-grade emulations.
- Eventide Blackhole / Stereoroom / SP2016 Reverb.
- Soundtoys Little Plate / Crystallizer.
- 2C Audio B2 / Aether.
8.4 Reverb Usage
- Pre-delay 20-80 ms separates the dry signal from the reverb tail (clarifies the source).
- High-cut the return at 6-10 kHz so reverb doesn’t sizzle.
- Low-cut the send around 200-300 Hz so low-end doesn’t muddy.
- Short (0.5-1.2 s plate) for vocals; medium (1.5-2.5 s hall) for ensemble; long (3-8 s) for cinematic.
- Send/return, not insert, so multiple sources share the same reverb space.
- Ducking the reverb to the dry vocal (sidechain) keeps lyrics intelligible.
9. Delay
9.1 Analog and Tape Echo
- Echoplex EP-2 / EP-3 — tape echo on a continuous loop; warmth + saturation in the regeneration path.
- Roland Space Echo RE-201 / RE-301 — tape + spring reverb combo; the dub + reggae sound.
- Electro-Harmonix Memory Man / Deluxe Memory Man — bucket-brigade analog delay (BBD).
- Maestro Echoplex (USED on Page, Hendrix, Beatles).
9.2 Digital Delay
- TC Electronic 2290 — pristine clean digital delay (1985), still a reference.
- Lexicon PCM 42 — early digital with modulation.
- Eventide H3000 / H9000 — multi-tap, harmonized delays, pitch-shifted FX.
9.3 Modern Plugins
- Soundtoys EchoBoy 5 — five tape modes + style presets, the modern producer’s default delay.
- Valhalla Delay — tape, BBD, HiFi, ghost modes, pitch-shifted.
- UAD Galaxy Tape Echo / Cooper Time Cube / Roland RE-201.
- Eventide UltraTap / MicroPitch.
- FabFilter Timeless 3.
9.4 Delay Techniques
- Tempo-sync to project BPM (1/8, dotted 1/8, 1/4) for rhythmic interplay.
- Ping-pong — alternating L/R taps for stereo width without phase issues.
- Diffusion — soft-edged repeats that bleed into reverb-like territory.
- Slap delay — 60-120 ms single repeat; classic Sun Records / Elvis / rockabilly.
- Haas effect — sub-30 ms delay on one channel widens perceived stereo without imaging artifacts.
10. Saturation, Harmonic Distortion, Tape
10.1 Why Saturate
Adds harmonics (mostly even-order from tubes, odd-order from transistors, both with mild tape) that the ear perceives as warmth, weight, presence, and “glue.” Gentle saturation also acts as a soft limiter on transients.
10.2 Hardware Source Models
- Tape: Studer A800 / A820, Ampex ATR-102, MCI JH-110.
- Tube: Pultec EQP-1A (tube stage), Manley VOXBOX, Fairchild 670.
- Transformer: Neve 1073 input/output, API 312/325, Cinemag/Carnhill/Lundahl iron.
- Transistor / FET: SSL VHD, Empirical Labs Distressor (especially Nuke + Dist modes).
- Console summing: SSL 4000G, API 1608, Neve 8068.
10.3 Plugin Saturators
- UAD Studer A800 / Ampex ATR-102 / Oxide Tape — gold-standard tape emulations.
- Slate Digital Virtual Tape Machines (VTM).
- Softube Tape — three machines (A, B, C) — slick UI, lower CPU.
- Waves Kramer Master Tape / J37 Tape.
- iZotope Vinyl (free) — vinyl artifact emulation.
- FabFilter Saturn 2 — multiband distortion + saturation.
- Soundtoys Decapitator / Radiator — aggressive saturation flavors.
- Klanghelm SDRR — micro-CPU tape + tube.
- Acustica Audio Taupe / Lemon / Sienna — sampled tape behavior.
- Tone Empire Reelight LE (free) — entry-level tape.
10.4 Where to Use
- Bass: 1-3 dB drive on tape model adds harmonic weight that translates on small speakers.
- Drum bus: subtle tape softens cymbal peaks, glues kit.
- Vocals: light tube on the channel adds presence without EQ.
- Mix bus: parallel-blended tape — full saturated copy at -15 dB mixed in.
- Master bus: Ampex ATR or Studer A800 emulation at low drive, calibrated to -18 dBFS = 0 VU.
11. Modulation Effects
- Chorus — pitch-modulated short delays summed with dry; thickens, widens.
- Roland Dimension D, Boss CE-1, Eventide TriceraChorus.
- Flanger — comb-filter delay sweep; “jet plane” sweep.
- MXR Flanger, EHX Electric Mistress, UAD MXR Flanger.
- Phaser — all-pass filter cascade; rotating notches.
- MXR Phase 90, EHX Small Stone, Soundtoys PhaseMistress.
- Tremolo — amplitude modulation.
- Fender amp tremolo, Soundtoys Tremolator.
- Auto-pan — LFO-driven panning.
- Logic Tremolo, Cableguys PanShaper.
- Rotary — Leslie speaker emulation.
- UAD Leslie 122, Native Instruments B4, IK Multimedia Leslie.
12. Pitch — Tuning, Harmony, Doubling
- Antares Auto-Tune Pro X (2024) — original real-time pitch correction (1997); “Auto” mode for the T-Pain effect, “Graph” mode for note-by-note.
- Celemony Melodyne 5 Studio (2024) — DNA Direct Note Access (since 2009) for polyphonic editing; pitch + timing + amplitude per note; the most surgical pitch tool.
- Waves Tune Real-Time — sub-10 ms latency live correction.
- Synchro Arts Revoice Pro 5 + Vocalign Project 6 — automatic alignment of doubles/harmonies to a guide vocal’s pitch and timing.
- iZotope Nectar 4 — vocal channel strip with pitch correction.
- Soundtoys Little AlterBoy — pitch-shift + formant + robot voice.
- Pitch shifters / harmonizers: Eventide H3000 H910, Eventide UltraPitch, Soundtoys Crystallizer.
13. Mastering — The Final Stage
Mastering is the bridge between the mix and the distribution chain: corrective + cohesive processing, loudness compliance, format-specific deliverables.
13.1 Canonical Chain
Mix bounce (24-bit, 0 to -6 dBFS peak)
↓
1. Subtractive corrective EQ (cut problem resonances)
↓
2. Multiband compression (control unruly bands)
↓
3. Saturation / tape (glue, harmonic body)
↓
4. Bus/Glue compression (1-2 dB GR, slow attack)
↓
5. Tonal-balance EQ (musical broad-strokes)
↓
6. Stereo / M-S processing (image control)
↓
7. Limiter (final loudness + true-peak ceiling)
↓
8. Dither (only when reducing bit depth)
↓
Master file (24-bit / 16-bit, 44.1 / 48 kHz)
13.2 Loudness Standards (LUFS)
LUFS (Loudness Units relative to Full Scale) replaced peak metering for loudness measurement; ITU-R BS.1770-5 defines the K-weighting + gating algorithm. Targets in 2026:
| Destination | Integrated LUFS | True Peak |
|---|---|---|
| EBU R128 broadcast (TV/radio EU) | -23 LUFS (±1) | -1 dBTP |
| ATSC A/85 (US broadcast) | -24 LKFS | -2 dBTP |
| Spotify | -14 LUFS (normalization on) | -1 dBTP |
| Apple Music | -16 LUFS (Sound Check) | -1 dBTP |
| YouTube | -14 LUFS | -1 dBTP |
| Tidal | -14 LUFS | -1 dBTP |
| Amazon Music | -14 LUFS | -2 dBTP |
| Cinema (Dolby) | -23 to -27 LUFS | -2 dBTP |
| Audiobook (ACX) | -23 to -18 LUFS, peak ≤ -3 dBFS, noise ≤ -60 dBFS | — |
Streaming services normalize louder masters down; you do not need to push past target — over-limited masters get attenuated AND retain the squashed-transient artifacts.
13.3 True-Peak (TP) Limiting
Inter-sample peaks can exceed sample-domain 0 dBFS by up to ~3 dB after D/A reconstruction or after encoding to lossy formats (MP3, AAC, Opus). True-peak limiters oversample (4-8×) to catch these. Ceiling of -1 dBTP is the modern recommendation, -2 dBTP for safety with heavy lossy encoding.
13.4 Limiters and Maximizers
- FabFilter Pro-L 2 — 8 algorithm styles, true-peak limiting, oversampling up to 32×. The transparent modern reference.
- Waves L2 / L3 Ultramaximizer / L3-LL — IDR dithering, the original look-ahead limiter (L1, 1994).
- iZotope Ozone 11 Maximizer (2024) — IRC IV; Modern, Transient, and Soft modes; integrated with Ozone’s full mastering suite.
- Sonnox Oxford Limiter v3.
- DMG Limitless / Compassion.
- Sonible smart:limit (2023) — AI-assist for limiter settings against a target curve.
- Newfangled Audio Elevate — multiband limiter with intelligent transient retention.
13.5 Mastering EQs and Bus Compressors
- EQs: FabFilter Pro-Q 4 (linear-phase mode), DMG EQuilibrium, Maag EQ4, Brainworx bx_digital V3, Massive Passive (UAD/Plugin Alliance), Manley Massive Passive Mastering.
- Bus compressors: Shadow Hills Mastering Compressor, Manley Variable Mu, SSL G-Master, Tokyo Dawn Kotelnikov GE, FabFilter Pro-MB, Maselec MLA-4.
- Multiband: iZotope Ozone Dynamics, FabFilter Pro-MB, Waves C6.
- Imaging: iZotope Ozone Imager 2 (free), Brainworx bx_stereomaker, Waves S1 Imager.
- Reference: iZotope Tonal Balance Control 2 — compare against genre-specific target curves.
13.6 Dither
Reducing bit depth (24 → 16 for CD/16-bit deliverable) introduces quantization distortion; dither adds shaped noise that decorrelates the distortion, sounding like benign hiss instead.
- TPDF dither (triangular) — baseline; flat noise.
- Noise-shaped dither — moves noise to less-audible high frequencies; iZotope MBIT+, POW-r 1/2/3, Apogee UV22HR, Waves IDR.
- Dither once, last in the chain. Never dither again afterward.
13.7 Deliverables (2026)
- DDP image (Disc Description Protocol) — CD replication master.
- 24-bit / 44.1 kHz WAV — primary digital distribution; streaming services accept this and downsample as needed.
- 24-bit / 48 kHz WAV — for video sync.
- MFiT / Apple Digital Masters — passes Apple’s metadata + headroom requirements.
- MQA — declining in 2024-26 after Tidal dropped it; FLAC-based hi-res replacing it.
- Dolby Atmos ADM BWF — immersive deliverable (see §16).
14. Monitoring — Speakers and Headphones
14.1 Studio Monitors
Near-field (~1-1.5 m listening distance, anechoic + sealed cabinets):
- Genelec 8030C / 8040B / 8050B / 8351B — Finnish-built; auto-calibrated with GLM. Smooth coaxial designs in higher tiers.
- Genelec The Ones (8331 / 8341 / 8351 / 8361) — coincident point-source design; flagship near-field reference.
- Neumann KH 120 II / KH 150 / KH 310 (2023-24 redesigned) — flat, neutral, surgically detailed; KH 150 has Neumann.Control DSP room correction.
- Adam Audio A4V / A7V / S2V / S3V / S5V — X-ART ribbon tweeters; bright clinical detail.
- Focal Trio6 BE / Solo6 BE / ST6 — beryllium tweeter, French build.
- Dynaudio LYD 5/7/8 / Core Series.
- PSI Audio A17-M / A23-M — Swiss, ultra-flat.
Mid-field (~2-3 m, larger control rooms):
- ATC SCM25A Pro / SCM45A / SCM50ASL Pro — soft-dome tweeter + mid-dome (3” SL); classical + post + high-end pop reference.
- PMC twotwo.6 / twotwo.8 / IB1S — transmission-line; mastering reference.
- Genelec 1238A / 1234A — main monitors.
Legacy + supplemental nearfields:
- Yamaha NS-10M — discontinued 2001, still the universal “if it sounds good on these, it’ll translate” reference. Paired with classic Auratone reference.
- Auratone 5C Sound Cube — single-driver mono cube; midrange-only “telephone check.” Modern version: Avantone MixCube.
- Avantone Mixcube (active + passive) — modern Auratone replacement.
14.2 Headphones
Open-back (mixing, mastering, no isolation):
- Sennheiser HD 600 / HD 650 / HD 660S2 — 300/150/300 Ω; neutral references; HD 600 is the engineer’s default.
- Audeze LCD-X / LCD-2 / LCD-MX4 — planar magnetic; weighty, accurate.
- Focal Clear MG Pro / Utopia / Clear MG — beryllium domes; airy and detailed.
- Beyerdynamic DT 1990 Pro / DT 1770 Pro (closed).
- HiFiMan Sundara / Arya Stealth / Susvara.
Closed-back (tracking, isolation):
- Sony MDR-7506 — broadcast standard for 30+ years; sub-$100 reliability.
- Audio-Technica ATH-M50x / M70x.
- Beyerdynamic DT 770 Pro / DT 1770 Pro.
- Sennheiser HD 280 Pro / HD 25.
- Shure SRH1540 / SRH840A.
Headphone correction (2024-26): Sonarworks SoundID Reference 5, Slate VSX (modeled rooms), dSONIQ Realphones 2, Waves Nx — bring headphone response toward a target curve so mixes translate.
14.3 Translation Discipline
- A/B: switch between near-fields and mono cube/laptop speakers + earbuds.
- Reference tracks: 2-3 commercial mixes in the same genre, level-matched to your mix.
- Multiple volume levels: quiet (where treble + bass appear weak — Fletcher-Munson), moderate (decision level), loud (sanity check, not mixing level).
15. Room Acoustics and Treatment
Even the best monitors are useless in an untreated room. The room imposes:
- Modal resonances at frequencies determined by dimensions (
f = c / (2L)for axial modes); dominant below ~250 Hz. - Early reflections (5-30 ms) from desk + sidewalls + ceiling — comb-filter the direct sound.
- Reverberation time (RT60) — control room target 0.2-0.4 s, fairly flat across 100 Hz to 4 kHz.
15.1 Treatment Categories
- Bass traps — porous absorbers (rockwool / fiberglass) thick enough (≥4-6 in) to be effective below 200 Hz. Corner-loaded for maximum efficiency. Membrane / tuned diaphragmatic traps for surgical low-end control.
- Broadband absorption — 2-4 in panels on first-reflection points (sidewalls + ceiling cloud above the engineer).
- Diffusion — Schroeder QRD / skyline diffusers behind the engineer, restoring liveness without comb-filtering.
- First reflection points — mirror-test from the listening position; treat all points where you see a monitor.
15.2 Vendors
- GIK Acoustics — affordable + custom; the small-studio default.
- Vicoustic — modular European panels with decorative options.
- RPG Acoustics / RPG Diffusor Systems — high-end QRDs + bass traps (Modex Plate, Skyline).
- Auralex — entry-tier foam + panels (overhyped 1-inch foam is largely cosmetic; not bass-effective).
- Primacoustic — broadband panels + ceiling clouds.
- ATS Acoustics — DIY-friendly + pre-built.
15.3 Room Correction DSP
For control rooms past acoustic treatment, DSP supplements (does not replace) physical treatment:
- Sonarworks SoundID Reference — measurement-mic + correction curve.
- Genelec GLM — included with Genelec SAM monitors.
- Neumann MA 1 / Neumann.Control — KH-Series room calibration.
- Dirac Live — multipoint room correction.
- Trinnov D-Mon / ST2 — high-end studio room correction.
16. DAWs — Digital Audio Workstations
16.1 Major DAWs (2026)
| DAW | Strengths | Used For |
|---|---|---|
| Avid Pro Tools 2024.10 | Industry standard for post + recording; AAX format | Film, TV, dialog, large session recording |
| Apple Logic Pro 11 (2024) | Bundled instruments, Stem Splitter (AI), Session Players | Pop, songwriting, all-in-one |
| Ableton Live 12 (2024) | Session view, MIDI tools, M4L | Electronic, hip-hop, live performance |
| Image-Line FL Studio 21 | Step sequencer, pattern-based workflow | Hip-hop, EDM, beat-making |
| Steinberg Cubase 13 / Nuendo 13 | MIDI heritage, post Nuendo for film | Film, classical, German pop/rock |
| PreSonus Studio One 6.6 | Drag-and-drop workflow, mastering project page | All-around, mastering integration |
| Cockos Reaper 7 | Cheap ($60-225), customizable, lightweight | Indie, multi-platform, scripting |
| Bitwig Studio 5 | Modular signal flow, CLAP-native | Sound design, electronic |
| Reason Studios 13 | Rack-style modular instruments | Sound design, hip-hop |
| BandLab Cakewalk by BandLab | Free, full DAW | Hobbyists, education |
| MOTU Digital Performer 11 | Film scoring, MIDI editing | Film + TV scoring |
16.2 Plugin Formats
- VST3 (Steinberg, 2008) — cross-platform, most plugins.
- AU (Audio Unit) — Apple, Logic Pro / GarageBand.
- AAX — Avid, Pro Tools-only; AAX DSP for HDX hardware.
- LV2 — open source, Linux.
- CLAP (CLever Audio Plug-in, 2022) — new open standard by Bitwig + u-he, MIT-licensed, threaded audio, polyphonic modulation; uptake accelerating 2024-26 (Bitwig, Reaper, FL Studio, Studio One supporting; Logic + Pro Tools holdouts).
- VST2 — legacy, Steinberg deprecated; many plugins still ship it.
17. Immersive Audio (3D / Spatial)
17.1 Dolby Atmos
Object-based + bed-channel audio. Up to 128 audio objects + 7.1.2 bed, rendered to listener’s playback configuration (5.1, 7.1, 7.1.4, 9.1.6, headphones via binaural).
- Mastered for: Apple Music (since 2021), Tidal, Amazon Music HD, Netflix, Disney+, cinema (Atmos Cinema). Spotify supports stereo only as of 2026.
- Workflow:
- Mix in DAW with Atmos panner plugin (Pro Tools, Logic, Nuendo, Studio One).
- Send to Dolby Atmos Renderer (standalone or DAW-integrated in Logic 11).
- Monitor in 7.1.4 (4 height) or 9.1.6 setup; binaural monitor on headphones.
- Deliver ADM BWF (Audio Definition Model Broadcast WAV) — single file with object metadata.
- Loudness target: -18 LUFS dialog-gated; -1 dBTP.
17.2 Apple Spatial Audio with Dolby Atmos
Apple’s branding for Atmos delivery + head-tracked binaural on AirPods (since 2021). Personalized Spatial Audio (since iOS 16) takes a TrueDepth scan of the listener’s ear to customize HRTF.
17.3 Sony 360 Reality Audio (360RA)
Object-based, MPEG-H 3D Audio under the hood; supports up to 24 objects. Distribution via Tidal, Amazon Music, nugs.net, Deezer (limited). Authoring with the Sony 360 Walkmix Creator and 360 RA Creative Suite.
17.4 Other Formats
- DTS:X — competitor to Atmos, used in some cinemas + Blu-ray.
- Auro-3D — channel-based 9.1 / 11.1 / 13.1 immersive; ITU layer + height layer + voice-of-god.
- MPEG-H 3D Audio — broadcast immersive (Korea, Brazil, Germany).
- Sennheiser AMBEO — Ambisonic microphone + ecosystem; first-order + fourth-order ambisonic capture.
17.5 Binaural and Ambisonics
- Binaural mixing: HRTF convolution puts a stereo source in 3D space on headphones (Dolby Atmos Production Suite, Waves Nx, dearVR Pro).
- Ambisonics: B-format (W, X, Y, Z) channel-agnostic sound field. Useful intermediate for VR / 360 video.
- Bracket Audio (2024-25) — binaural and HRTF-personalized monitoring solutions.
18. 2024-2026 AI-Assisted Production Trends
18.1 AI Mastering Services
- iZotope Ozone 11 Mastering Assistant (2023; 11.x in 2024-25) — analyzes the master and suggests EQ + dynamics + maximizer settings against a target reference.
- LANDR — cloud-based AI mastering (since 2014); the original; tiered subscription.
- eMastered — Grammy-engineer-trained models; reference-track mastering.
- CloudBounce / BandLab Mastering / Cryo Mix — competitors.
- Sonible smart:limit / smart:EQ 4 / smart:comp 2 — AI-set thresholds, learned from a target curve.
18.2 AI Mix Assistants
- iZotope Neutron 5 (2024) — Mix Assistant analyzes your tracks, sets EQ + comp + dynamic balance.
- iZotope Nectar 4 — Vocal Assistant.
- Mixed In Key Captain Chords / Pro Tools Sketch — songwriting AI.
18.3 AI Stem Separation (the big one)
Source separation has gone from research to commodity since ~2022:
- iZotope RX 11 Music Rebalance (2024) — separates Vocals, Bass, Drums, Other; gain-rebalance or extract.
- LALAL.AI — web-based; multiple model versions (Phoenix, Orion).
- AudioShake — pro-grade stem separation, used in remastering catalog releases.
- Audionamix XTRAX Stems 2 / TRAX SP Pro — original commercial separation.
- Spleeter (Deezer, 2019, free) — open-source baseline (lower quality vs. 2024 commercial models).
- Logic Pro 11 Stem Splitter (2024) — built-in, Vocals/Bass/Drums/Other.
- Moises.ai — practice-focused with stems + pitch + tempo.
- Demucs / HTDemucs (Hybrid Transformer Demucs) — Meta research, state-of-the-art open model (2023).
Uses: remixing, restoration, mash-ups, live sampling, sample-clearance reissues, removing vocals for instrumental versions.
18.4 AI Voice and Restoration
- Resemble AI / ElevenLabs / Descript Overdub — voice cloning + TTS (consent + rights questions still active in 2026).
- Voice-AI Voice Cloning — commercial speech voice models.
- Sonantic (Spear, acquired by Spotify 2022) — emotive voice synthesis.
- Accentize dxRevive / dxRevive Pro — AI dialog restoration (de-reverb + intelligibility).
- iZotope RX 11 Voice De-noise + Dialogue Isolate — voice-aware spectral repair.
- Audionamix VocalRePair.
18.5 Cautions (2026 reality check)
- AI mastering produces a competent default master; for any release of importance, a human mastering engineer still hears + decides better.
- Stem separation is excellent for many genres but still leaves artifacts on dense mixes; for archival/release-quality re-stems, use multiple model passes + manual cleanup.
- Voice cloning carries legal + ethical exposure; the U.S. NO FAKES Act (proposed 2024, pending 2026) and EU AI Act (in force 2025) regulate synthetic voice use.
19. Cross-References
- music-theory-essentials — harmony + rhythm context that drives mixing decisions (groove, ensemble, dynamics).
- signal-processing-dsp — sampling, filtering, FFT, windowing, the math underneath every plugin.
- acoustics-noise-control — room modes, absorption coefficients, noise floor measurement, RT60 calculation.
- fft-spectral — spectral analysis, windowing, STFT, the foundation of every spectrum analyzer + dynamic EQ.
20. Citations and Further Reading
Foundational Books
- Bobby Owsinski — The Mixing Engineer’s Handbook (4th ed., 2017) and The Mastering Engineer’s Handbook (4th ed., 2017). Industry-interview-based, practical.
- Bob Katz — Mastering Audio: The Art and the Science (3rd ed., 2014). Loudness, dither, K-system metering originator.
- David Miles Huber + Robert E. Runstein — Modern Recording Techniques (10th ed., 2024). The university audio-engineering textbook.
- Mike Senior — Mixing Secrets for the Small Studio (2nd ed., 2018). Bedroom-studio-translated mixing knowledge.
- Roey Izhaki — Mixing Audio: Concepts, Practices, and Tools (4th ed., 2024).
- Alex Case — Sound FX: Unlocking the Creative Potential of Recording Studio Effects (2012).
- F. Alton Everest + Ken Pohlmann — Master Handbook of Acoustics (7th ed., 2022). Room acoustics reference.
Periodicals
- Sound on Sound (UK) — Hugh Robjohns + Paul White; deep-dive technique articles since 1985, monthly through 2026.
- Mix Magazine — US trade pub, studio profiles + tech reviews.
- Tape Op — independent magazine, free print + online, engineer-interview-heavy.
- Resolution Magazine — UK pro-audio + post.
- Production Expert (online) — Pro Tools-leaning daily content.
Standards
- AES17-2020 — measurement of digital audio equipment.
- AES31-3-2008 — file format for audio interchange (BWF).
- EBU R128 / R128 s1 / R128 s2 / R128 s3 — loudness for broadcast, streaming, podcast, short-form.
- ITU-R BS.1770-5 — loudness measurement algorithm.
- ITU-R BS.2076-2 — Audio Definition Model (ADM), basis for Atmos delivery.
- SMPTE ST 2098-x — immersive audio for cinema.
Engineers and Producers to Study
- Bob Clearmountain — pop/rock mix architect; Bruce Springsteen, Bryan Adams, Rolling Stones. SSL + parallel + reverb worship.
- Chris Lord-Alge (CLA) and Tom Lord-Alge — modern rock/pop, aggressive parallel compression.
- Andrew Scheps — Adele, Red Hot Chili Peppers; mix-bus-in-the-box pioneer.
- Michael Brauer — Coldplay, John Mayer; multi-bus compression chains.
- Tony Maserati — Beyoncé, Mary J. Blige; vocal production reference.
- Serban Ghenea — Taylor Swift, Bruno Mars, BTS; modern pop mix sound.
- Jaycen Joshua — Beyoncé, Rihanna, Justin Bieber; modern R&B/hip-hop.
- Manny Marroquin — modern pop mixer (top of Mix With The Masters lecture series).
- Bob Ludwig (mastering, Gateway Mastering) — 50+ years; canonical mastering reference.
- Bernie Grundman (Bernie Grundman Mastering) — Thriller, Aja, countless classics.
- Bob Katz — mastering + author; coined K-system metering.
- Emily Lazar (The Lodge) — first woman to win mastering Grammy (2020).
- Mandy Parnell (Black Saloon) — modern mastering, ambient/electronic.
Online Education
- Mix With The Masters — multi-hour seminar videos from working mix engineers; flagship subscription pro-audio education.
- pureMix.net — engineer-led tutorials, Fab Dupont’s platform.
- Sound on Sound’s weekly tutorials and archived columns.
- The Recording Revolution / Produce Like A Pro / Warren Huart — YouTube-tier instruction.
- Berklee Online — university-level audio engineering certificates.
End of reference. This note is a snapshot of practice and tooling at 2026-05-17; verify version numbers and confirm current loudness specs against the destination platform’s published guidelines before delivery.