LUFS explained dynamic range music true peak audio loudness war

Understanding Audio Quality Metrics: LUFS, Dynamic Range, and True Peak

A practical guide to measuring audio quality. Learn what LUFS, dynamic range, and true peak actually mean, how to spot bad masterings, and how to compare different releases of the same album.

· 10 min read

Why Numbers Matter More Than Format

There’s a persistent myth in the audiophile world that better formats automatically mean better sound. Buy the hi-res version, get better audio. Simple, right?

Not even close.

A well-mastered CD at 16-bit/44.1 kHz will absolutely destroy a brickwalled 24/192 remaster that’s been crushed to -6 LUFS. Format tells you the container’s potential. Metrics tell you what’s actually inside it. You can put garbage in a crystal vase — it’s still garbage.

This is something the hi-res marketing machine doesn’t want you thinking about. They’d rather sell you the same album again at a higher sample rate than admit the 1987 CD pressing sounds better than their shiny new remaster. But the numbers don’t lie. LUFS, dynamic range, true peak, clipping count — these measurements reveal the actual quality of a recording’s mastering, regardless of what format it’s wrapped in.

So before you spend another dollar chasing sample rates, let’s talk about the metrics that actually matter.

LUFS: How Loud Is This Track?

LUFS stands for Loudness Units relative to Full Scale. It’s the international standard for measuring perceived loudness, defined by the ITU-R BS.1770 standard. Unlike raw decibel measurements, LUFS accounts for how human ears actually perceive sound.

The key insight is K-weighting. Before measuring loudness, the signal passes through two filters: a high shelf that boosts frequencies around 1.5 kHz by about 4 dB (modeling how your head and ears shape incoming sound), and a high-pass filter that rolls off everything below 38 Hz (because deep bass doesn’t contribute much to perceived loudness). This K-weighted signal is then measured in 400-millisecond blocks, and a dual-gated integration process throws out silent passages and very quiet sections before computing the final number.

The result is integrated loudness — a single number that represents how loud a track feels to a human listener across its full duration.

What do the numbers mean in practice?

LUFS ValueWhat It Sounds LikeTypical Examples
-23 to -20Quiet, spacious, full dynamic rangeClassical recordings, film scores
-18 to -14Moderate, comfortable listeningJazz, acoustic, well-mastered rock
-14Spotify’s normalization targetMost modern streaming masters
-12 to -10Loud, limited dynamicsMainstream pop, EDM
-8 to -6Crushed, fatiguingLoudness war casualties (Death Magnetic, anyone?)

A difference of 1 LUFS is roughly perceptible. A difference of 3 LUFS is obvious. If you’re comparing two versions of the same album and one is at -10 LUFS while the other sits at -14 LUFS, the quieter one almost certainly has more dynamic headroom and sounds more natural — even though your brain’s first instinct is to prefer the louder version.

That instinct, by the way, is exactly what the loudness war exploits.

Dynamic Range: The Space Between Loud and Quiet

Dynamic range is the distance between the loudest and quietest moments in a recording. It’s what makes music breathe. When a snare drum cracks above a quiet verse, when a symphony swells from pianissimo to fortissimo, when a guitar solo tears through a mix — that’s dynamic range doing its job.

The DR14 measurement (named after the scale’s practical ceiling of about 14 dB for most music) captures this by analyzing each channel of audio in blocks, taking the loudest 20% of those blocks by RMS energy, and comparing that against the second-highest peak. The result is a single number in dB.

Here’s a rough guide:

DR ValueQualityExamples
DR12 - DR14+ExcellentSteely Dan’s Aja, most vinyl-era masters
DR9 - DR11GoodWell-mastered modern rock, quality remasters
DR6 - DR8MediocreTypical modern pop/rock masters
DR3 - DR5PoorLoudness war victims, heavily compressed

Take Steely Dan’s Aja — an album legendary for its meticulous engineering. The original CD master comes in around DR13. Every instrument has room to exist in its own space. The drum hits have punch, the bass has weight, and the mix has depth you can practically walk into.

Now compare that to a typical modern pop production mastered to DR5. Everything sits at roughly the same level. There’s no punch because there’s no contrast. The drums hit at the same volume as the vocals, which hit at the same volume as the synths. It’s loud, sure. But it’s also flat and exhausting to listen to for more than twenty minutes.

Compression isn’t inherently evil — every genre has an appropriate dynamic range, and a punk record mastered to DR14 would sound wrong. But there’s a difference between artistic dynamic control and slamming everything into a limiter because someone decided louder equals better on a Spotify playlist.

True Peak: The Clipping You Can’t See

Here’s something most people don’t realize: the loudest point in a digital audio signal doesn’t necessarily land on an actual sample.

Digital audio is a series of discrete sample points. When your DAC reconstructs the analog waveform between those points, the continuous signal can overshoot any individual sample value. Two consecutive samples might both read -0.5 dBFS, but the reconstructed curve between them could peak at +0.3 dBFS. That’s an inter-sample peak, and it’s invisible to anyone only checking sample values.

True peak detection solves this by oversampling — inserting interpolated points between the real samples to estimate what the reconstructed waveform actually does. The ITU-R BS.1770 standard specifies 4x oversampling: for every pair of adjacent samples, three intermediate values are calculated using a windowed sinc filter. The highest absolute value across all original and interpolated samples is the true peak.

Why does this matter? Because when inter-sample peaks exceed 0 dBFS, your DAC clips. It can’t output a value higher than its maximum, so the waveform gets flattened at the ceiling. The result is distortion — sometimes subtle, sometimes not, but always a degradation of the original signal.

This is why streaming services require true peak levels below -1 dBTP (decibels true peak). Apple Music specifies -1 dBTP. Spotify and YouTube target -1 dBTP as well. The extra 1 dB of headroom accounts for the lossy codec’s own reconstruction introducing additional inter-sample peaks.

A recording with a true peak of -0.1 dBTP is living dangerously. One at -3 dBTP has comfortable headroom. And if the true peak exceeds 0 dBTP, something has already gone wrong in the mastering chain.

The Loudness War — And Why It’s (Slowly) Ending

The loudness war started in earnest in the mid-1990s, when mastering engineers began pushing average loudness levels higher and higher. The logic was simple and self-reinforcing: a louder track grabs attention on radio, sounds more impressive in a quick A/B comparison, and stands out on a playlist. So every release had to be at least as loud as the last one.

The cost was dynamic range. You can’t make a track louder without reducing the gap between its quiet and loud parts. Limiters, clippers, and multiband compressors were cranked harder with each passing year. Albums that were mastered at -14 LUFS in 1990 were remastered at -8 LUFS in 2005. Metallica’s Death Magnetic (2008) became the poster child — so heavily clipped that the Guitar Hero game files sounded better than the official CD release.

It was insane. And it went on for nearly two decades.

Streaming normalization is what’s finally turning the tide. When Spotify normalizes everything to -14 LUFS, a track mastered at -8 LUFS doesn’t sound louder — it just sounds worse, because all that dynamic range was sacrificed for nothing. The algorithm turns it down to the same perceived loudness as everything else, and now you’re left with a crushed, lifeless version competing against tracks that kept their dynamics intact.

This doesn’t mean mastering doesn’t matter anymore. It absolutely does. A skilled mastering engineer working to a -14 LUFS target can make a track that sounds punchy, clear, and loud enough within that framework — while preserving the dynamic intent of the mix. Bad mastering is still bad mastering, regardless of the loudness target.

But the economic incentive to brickwall everything is finally, mercifully, fading.

Comparing Releases: Which Version Sounds Best?

Here’s where this gets practical. If you’ve been collecting music for any length of time, you probably own multiple versions of the same album. The original CD. A “remastered” edition. Maybe a hi-res FLAC download. They’re all supposedly the same music, but they can sound dramatically different.

The 2003 remaster might have been pushed 4 dB louder than the original, crushing the dynamics in the process. The hi-res version might be the same loud remaster upsampled to 24/96 — more bits, but no more music. Or the hi-res version might genuinely be struck from the original analog tapes at a higher resolution, with careful mastering that preserves the original dynamic intent.

You can’t tell from the format. You can’t tell from the marketing copy. You can tell from the metrics.

Comparing releases across LUFS, dynamic range, true peak, and clipping count reveals which version was mastered with the most care. The one with higher DR and lower LUFS is almost always the more natural, more listenable version. The one with zero clipping events was treated with more respect than the one showing 47 clips. The one where the true peak sits at -1.5 dBTP was mastered by someone who understood the delivery chain.

This kind of comparison is exactly what Echobox was built to do. Point it at your library, and it’ll tell you which pressing of Dark Side of the Moon actually sounds the best — regardless of which one has the fanciest packaging.

How We Measure This in Echobox

We don’t cut corners on analysis. Every track in your library goes through a full-decode analysis pipeline that runs in the background after your initial library scan. Here’s what happens under the hood.

For loudness, we implement ITU-R BS.1770 K-weighting with dual-gated integration. The signal passes through the standard pre-filter (high shelf at ~1.5 kHz) and RLB high-pass (38 Hz cutoff), then gets measured in overlapping 400 ms blocks. We apply both the absolute gate (-70 LUFS) and the relative gate (-10 LU below the preliminary measurement) before computing integrated loudness. This isn’t a quick RMS estimate — it’s the same methodology that broadcast standards require.

True peak detection uses 4x oversampling with a 12-tap windowed sinc filter. For every input sample, we compute three interpolated values between adjacent samples, tracking the maximum absolute value across all channels. The result is a true peak measurement that catches inter-sample overs that raw sample peak would miss entirely.

Dynamic range follows the DR14 specification: per-channel block analysis, top-20% RMS selection, and comparison against the second-highest peak. We take the minimum DR across channels for a conservative, honest number.

Clipping detection scans for runs of three or more consecutive samples at full scale (|sample| >= 0.9999). A single sample hitting 1.0 can happen naturally. Three in a row means the signal was truncated by a limiter — the waveform has been flattened at the ceiling. We count clipping events, not individual samples, giving you a meaningful measure of how aggressively a track was limited.

All of these measurements get aggregated at the album level: average LUFS, LUFS spread (how consistent the mastering is across tracks), average DR, and total clipping count. When you’re comparing releases, we line up these numbers side by side across every dimension — dynamic range, loudness, clipping, peak headroom, even hi-res plausibility. The comparison tells you which version was mastered with more care, backed by data instead of opinion.

Everything runs on a background thread after your library scan completes. You don’t have to configure anything, wait for it, or think about it. Scan your library, and the numbers are there when you want them. We cache results per analyzer version, so if we improve an algorithm, your library gets re-analyzed automatically.

If you’re the kind of listener who cares about parametric EQ tuning or bit-perfect output chains, you already know that the signal chain matters. Analysis metrics are the other half of that equation — they tell you whether the source material deserves all that care in the first place.

Trust the Numbers, Not the Sticker

The bottom line is this: a “hi-res” badge, a “remastered” sticker, or a higher price tag tells you nothing about actual audio quality. LUFS tells you how loud it is. Dynamic range tells you how much it breathes. True peak tells you whether it clips. And clipping count tells you whether someone cared enough to avoid distortion.

Format is the ceiling. Mastering is the floor. And most of the time, the floor matters more. Next time you’re deciding between two versions of a favorite album, don’t reach for the one with more bits — reach for the one with more dynamics.


Related Guides


Try Echobox

Experience what these guides describe — precision playback on Android.

One email per milestone. No noise.