Enchant.
Tools / Audio Concepts / 5. Information Theory (The Deep Root)
5. Information Theory (The Deep Root) · Concept 5 of 6

Rate Distortion Theory

It is the rulebook for the trade between making a file smaller and how much sound quality you are willing to lose doing it.

Rate Distortion Theory: bits vs quality Smallest bitrate that still hits the quality you want Rate (bitrate) Distortion D more loss / worse quality → R(D) curve you cannot go below this achievable IMPOSSIBLE region (no codec lives here) the KNEE diminishing returns → 320 kbps full, low loss 128 kbps small, brittle highs Original: Low bitrate: fine detail tossed +1 bit = +6 dB SNR

The R(D) curve sets the hard floor: spend bits where the ear hears them, and past the knee extra bitrate buys nothing.

What it is

The maths of the trade between file size (bitrate) and how much sound quality you accept losing.

Key facts

How it works

  1. Pick a target: file size or bitrate you can afford.
  2. R(D) curve tells you the lowest bitrate that still hits the quality you want.
  3. Encoder runs a psychoacoustic model to find what the ear can't hear.
  4. It spends its bit budget on audible detail, dumps the masked stuff first.
  5. Result: smallest file for that quality, or best quality for that size.
  6. Push bitrate too low and distortion climbs fast past the curve's knee.

Real examples

How it helps in live sound

Everyday analogy

Like choosing photo quality on your phone: each step smaller saves space but blurs detail, and the theory marks the smartest spot to draw that line.

Watch out

Myth: 'higher bitrate always sounds better.' Truth: past the transparency knee (~256-320 kbps MP3) extra bits add size but no audible gain, while the first bits matter most.

Fun fact

The 44.1 kHz CD sample rate was chosen so audio fit on existing Sony PCM video recorders, not for any acoustic reason.

Key takeaways

  • R(D) = the hard floor: lowest bitrate for a given quality, you can't cheat it.
  • Each extra bit/sample buys ~6 dB SNR; gains shrink as you climb.
  • Lossy codecs dump sound the ear can't hear (masking), not random data.
  • 320 kbps sounds fuller than 128 kbps because it keeps ~2.5x more bits.
  • Diminishing returns: first bits = huge quality jump, last bits = waste.
← Previous
Source Coding Theory
☰ All 123 concepts

Need the gear and a crew who know this stuff?

Enchant Entertainment hires and operates sound, lighting and staging across Perth and regional WA.

Get a quoteAll concepts