Enchant.
Tools / Audio Concepts / 9. Spatial Audio & Localization
9. Spatial Audio & Localization · Concept 10 of 12

Binaural Rendering

Faking real 3D sound over ordinary headphones by recreating the exact cues your two ears would naturally receive.

Binaural Rendering: one source, two different ear signals Recreate the timing gap (ITD), loudness gap (ILD) & ear-shape filter (HRTF) for each ear SOUND SOURCE (to the left, 90 degrees) head shadow blocks far ear NEAR EAR first & loud FAR EAR later & quieter ITD up to ~0.65 ms extra path = head width (~21 cm) / 343 m/s LEFT CUP (L) louder, earlier + near-ear HRTF ILD up to ~20 dB RIGHT CUP (R) quieter, later + far-ear HRTF head shadow HEADPHONES ONLY: each ear must get its own signal (speakers crosstalk & kill it) ITD = timing gap (<1.5 kHz) | ILD = loudness gap (>1.5 kHz) | HRTF = pinna/head colouring (4-16 kHz)

One mono source becomes two different ear signals: the near ear hears it first and louder (ITD + ILD), then your brain rebuilds the 3D position.

What it is

Faking true 3D sound over normal headphones by recreating the exact timing, loudness and ear-shape cues your two ears would receive in real life.

Key facts

How it works

  1. Capture or model how each ear colours a sound from every direction (the HRTF).
  2. Take a mono source and split it into a left and right ear path.
  3. Apply the correct timing gap (ITD) and loudness gap (ILD) for the target direction.
  4. Convolve each ear's signal with that direction's HRIR (head + pinna filtering).
  5. Add early reflections and reverb for distance and room feel.
  6. Feed L to left cup, R to right cup; add head-tracking so the scene stays fixed when the head turns.

Real examples

How it helps in live sound

Everyday analogy

It's like fitting each ear with its own tiny stethoscope that hears exactly what a real ear would, so your brain swears the sound is genuinely behind or above you.

Watch out

Myth: binaural makes any playback 3D. Truth: it collapses to flat or weird on loudspeakers because crosstalk destroys the per-ear cues - it only works through isolated headphones.

Fun fact

Your brain reads a timing gap as small as ~10 microseconds (millionths of a second) between your ears to place a sound left or right - faster than any drum hit you can consciously perceive.

Key takeaways

  • Binaural = recreate the exact two cues each ear gets: timing (ITD) and loudness (ILD), plus ear-shape (HRTF) colouring.
  • HRTF/HRIR is the per-direction filter of head + pinna + torso; convolve audio with it to place a sound.
  • Headphones mandatory - speaker crosstalk kills the effect.
  • Pinna notches at 4-16 kHz cue up/down and front/back; head-tracking fixes front/back confusion.
  • Backbone of VR, 3D video and spatial music; great for IEMs and headphone streams, useless for the PA.
← Previous
Ambisonics
☰ All 123 concepts

Need the gear and a crew who know this stuff?

Enchant Entertainment hires and operates sound, lighting and staging across Perth and regional WA.

Get a quoteAll concepts