Enchant.
Tools / Audio Concepts / 6. Audio Compression & Perceptual Coding
6. Audio Compression & Perceptual Coding · Concept 5 of 6

Psychoacoustic Masking Model

A built-in map of when a louder sound makes a nearby quieter sound impossible to hear, so the coder knows what it can safely delete.

Psychoacoustic Masking Model A loud tone raises the hearing threshold and hides quieter nearby sounds Frequency (pitch) -> Loudness (dB SPL) threshold of hearing (quiet limit) masking threshold (raised by the masker) MASKER loud kick @ 80 dB HEARD above curve MASKED quiet guitar deleted free bin it SMR = bits spent here masker (loud) audible signal (kept) masked (deleted) masking threshold

A loud masker lifts the hearing threshold (blue curve); anything beneath it (amber) is inaudible and deleted for free, while audible peaks (green) keep their bits.

What it is

A coder's predictive map of when a loud sound makes a nearby softer sound inaudible, so the hidden sound can be deleted for free.

Key facts

How it works

  1. Codec splits audio into many narrow frequency bands (critical bands / sub-bands).
  2. It measures the loudest energy (the masker) in each band frame by frame.
  3. It calculates the masking threshold curve: the level below which sound is inaudible.
  4. Anything sitting under that curve is flagged as masked and gets few or zero bits.
  5. It allocates the saved bits to audible parts and pushes quantisation noise under the curve.
  6. Result: smaller file, no audible difference because deleted sound was already inaudible.

Real examples

How it helps in live sound

Everyday analogy

Like a truck roaring past kills your ability to hear a whisper beside you, the model maps exactly which quiet sounds the loud ones swallow so it can bin them for free.

Watch out

Myth: lossy codecs just chop off high frequencies. Truth: they delete masked sound across all frequencies, guided by the masking model, and keep audible highs.

Fun fact

A loud sound can mask a quieter one that has not even started yet: pre-masking erases sounds up to about 5 ms BEFORE the loud event because your brain processes the loud spike first.

Key takeaways

  • Loud sounds hide nearby quieter sounds in pitch AND in time.
  • Masking raises the threshold of hearing; anything under it is deleted for free.
  • Critical bands (approx 24 Bark) are the ear's channels where masking bites hardest.
  • Post-masking lasts 50-200 ms; pre-masking only approx 2-5 ms.
  • Codecs hide quantisation noise under the masking curve so loss is inaudible.
  • On a gig, do not chase detail nobody can hear under a louder source.
← Previous
Bit Allocation Theory
☰ All 123 concepts

Need the gear and a crew who know this stuff?

Enchant Entertainment hires and operates sound, lighting and staging across Perth and regional WA.

Get a quoteAll concepts