Enchant.
Tools / Audio Concepts / 8. Psychoacoustics (Perception Layer)
8. Psychoacoustics (Perception Layer) · Concept 7 of 18

Auditory Scene Analysis

How your brain sorts one messy wall of sound into separate things you can recognise.

Auditory Scene Analysis: one wave in, many sources out 3 sources play at once KKick VVoice GGuitar they SUM into one messy pressure wave EAR + BRAIN split by pitch, onset, direction heard as 3 streams Kick Voice Guitar Cues the brain uses: shared fundamental (harmonicity) & common onset = FUSE one source pitch + timbre + ITD/ILD direction = SEPARATE streams over time

One summed pressure wave enters the ear; the brain uses harmonicity, common onset and direction to rebuild it into separate kick, voice and guitar streams.

What it is

Your brain's automatic process of splitting one mixed sound wave into separate, recognisable sources like voice, kick and guitar.

Key facts

How it works

  1. Ears send one summed waveform; cochlea splits it by frequency into critical bands.
  2. Brain groups partials that share a fundamental (harmonicity) into one source.
  3. Bits that start and stop together (common onset) get fused into the same object.
  4. Pitch, timbre and direction link sounds over time into separate 'streams'.
  5. ITD and ILD between ears assign each stream a location in space.
  6. You consciously hear distinct objects: voice here, kick there, guitar over there.

Real examples

How it helps in live sound

Everyday analogy

Like glancing at a crowded room and instantly seeing separate people instead of a coloured blur, your brain instantly carves one wall of sound into a voice, a drum and a guitar.

Watch out

Myth: louder = clearer. Truth: clarity comes from SEPARATION (distinct EQ band, pan position and timing) not volume; cranking a buried source just masks everything else.

Fun fact

The precedence effect means a sound copied to your other ear up to ~35 ms later is still heard as ONE source from the first ear, the trick behind why a stereo PA and delay towers don't sound like an echo chamber.

Key takeaways

  • One waveform hits the ear; the brain rebuilds many sources from it.
  • Fusion cues: shared fundamental, common onset, common movement.
  • Streaming cues: pitch, timbre and direction link sounds over time.
  • Location comes from ITD (timing) and ILD (level) between your two ears.
  • Mix to HELP it: unique EQ band + pan position + timing per source.
← Previous
Critical Bands
☰ All 123 concepts

Need the gear and a crew who know this stuff?

Enchant Entertainment hires and operates sound, lighting and staging across Perth and regional WA.

Get a quoteAll concepts