Enchant.
Tools / Audio Concepts / 6. Audio Compression & Perceptual Coding
6. Audio Compression & Perceptual Coding · Concept 1 of 6

Perceptual Coding

A way of shrinking audio files by throwing away the sounds your ears would never have noticed anyway.

Perceptual Coding: hide quiet sounds under loud ones Anything UNDER the masking curve gets thrown away Frequency Level (dB) 20 Hz 20 kHz Threshold of hearing Masking curve LOUD tone (masker) KEPT (audible) DELETED (hidden, ear can't hear) WAV 1411 kbps → MP3 128 kbps (~11x smaller)

Sounds under the blue masking curve are inaudible, so the coder deletes them and the file shrinks roughly 11x.

What it is

Shrinking audio by deleting sounds your ears physically cannot hear, using a model of human hearing.

Key facts

How it works

  1. Split the audio into short time frames (MP3 frame = 1152 samples, ~26 ms at 44.1 kHz).
  2. Transform each frame into frequency bands (filterbank + MDCT) so you see what pitches are present.
  3. Run the psychoacoustic model: calculate the masking threshold for every band right now.
  4. Assign more bits to audible parts, fewer or zero bits to masked/inaudible parts.
  5. Quantise so the error noise stays buried under the masking threshold, then Huffman-code the result.
  6. Pack into frames with a header; decoder reverses it back to playable audio.

Real examples

How it helps in live sound

Everyday analogy

Like packing a suitcase and leaving out clothes you know you'll never wear, so it's far lighter but you still have everything you'll actually reach for.

Watch out

Myth: 'higher bitrate always sounds better.' Truth: above ~256 kbps AAC most people can't pick it from the original, but no bitrate undoes data already thrown away.

Fun fact

The MP3's tuning was perfected using Suzanne Vega's a cappella 'Tom's Diner', earning her the nickname 'the mother of the MP3'.

Key takeaways

  • It deletes sound your ear can't detect, not random data.
  • Masking is the core trick: loud sounds hide nearby quiet ones, in pitch and in time.
  • Lossy = permanent loss; lossless only halves the size but keeps everything.
  • Bitrate sets quality vs size: 128 kbps small/rough, 320 kbps near-transparent.
  • For live PA, feed full-quality files; save perceptual codecs for delivery only.
← Previous
Redundancy
☰ All 123 concepts

Need the gear and a crew who know this stuff?

Enchant Entertainment hires and operates sound, lighting and staging across Perth and regional WA.

Get a quoteAll concepts