QUOTE(Roseval @ Jul 6 2008, 15:11)

The question of sampling rate and bit depth are a bit unclear to me.
Sampling rate is the easiest to understand, as long as you can’t prove Nyquist wrong, a sampling rate double our hearing threshold is good enough. In real life there is a thing called technology so we have to deal with a couple of problems which don’t exist in the pure mathematical world. As Dunn phrased it quit nicely:
A direct effect of the higher sampling rate is that for an identical filter design the time
displacements will scale inversely with sample rate. Hence an improvement can be
made just from raising the sample rate - even for those who cannot hear above
20kHz.
As far as I knew, there is no theorem proving that a minimum as X bits is needed to reproduce all the details in the signal. We have no Nyquist for the bit depth.
From another post in this forum I learned that flipping even the LSB in 16 bit PCM is clearly audible. I’m inclined to think that a resolution well above the hearing threshold is a bit to ‘rough’
Listening to recordings at a higher resolution (24 bits) should bring an improvement in detail.
Can anybody explains why this doesn’t seems to work in practice?
a) in practice, the average person (and sadly enough, many audio engineers) these days has far more hearing damage than they did 10 years ago, due to noise pollution, poorly mixed/mastered recordings, intentionally distorted musical content, excessive compression, improper EQ (can you say "Smiley face"), and high volume volume listening.
b) In practice, the average person has not developed any ability to appreciate what's missing from lower-resolution audio. They simply don't know what to listen for, and haven't developed an appreciation for what larger wordlengths combined with greater dynamic range bring to the table.
c) The best way to do any kind of test is to use a live source as a control in any experiment. Because of the nature of the potential benefits of greater dynamic range, perceptions of sound reproduction methodologies should not be compared to themselves, but rather to the live source. Most "listening tests," blind or otherwise, are not done this way, and the listener from the start has no reference from which to evaluate the quality representation of low level signals, reverb tails, room reflections, etc. They don't hear the real tails and reflections from the beginning, therefore they have little to compare it to.
d) The experiment part of the test should involve raw, unmastered, unprocessed recordings (with the exception of dither/noise shaping that might be used in the creation of the 16-bit recording).
To make a *loose* analogy, it's like comparing 2 glasses to see which one is more full, without having a sense as to how tall the glasses are. You can compare the amount of liquid in the bottom of the glasses with respect to the bottom of the glasses... but how full the glasses are, and whether the differences between the contents of the glasses are significant, ultimately depend on how tall the glass is. If you don't have the live reference, you cannot as easily perceive the differences (and therefore value) between the live source and the test reproductions.
These are the things greater dynamic range bring to the table. The loss of detail due to poor representation afforded by shorter word lengths occurs in the smoothness of the dynamic changes (waveform amplitude changes) in lower level signals/components of the recording. These are concepts that only a trained ear can discern. It's not hard to train yourself to hear these "features" of a signal, provided that your hearing is not damaged in the manner many peoples' ears are these days. I can instantaneously hear, on my system, the difference between a raw 16-bit recording, and a 24 bit recording recorded at the same levels. It is night and day. Note the word of the use *recording*, and not mixing/mastering/final product.
d) As has been said many times, the greatest value and first bottleneck comes from the initial recording's wordlength and dynamic range afforded by analog circuitry.
When you start with more significant bits, you have the ability to end up with more significant bits in the final product. Merely playing back a PCM stream that uses 24-bit words doesn't imply that the extra bits are significant by itself. Many listening tests involve a normalized 24 bit recording and a normalized 16 bit recording that has been dithered and noise shaped from the same 24 bit source. There is far less audible difference between those two sources, assuming a good dither/noise shaping algorithm is used. However, that test is not the test that validates the value of recording, mixing, and mastering in the 24 bit domain.
In the case of listening back to a raw recording where signal levels were not maximized to take advantage of all of the available dynamic range (i.e. not peaking near 0dBFS) at the time of the actual recording, a 24 bit recording will blow away it's 16bit counterpart in perceived quality for anyone who isn't too deaf to hear reverb tails, overtones and harmonics, and high frequency sounds that usually are not utilizing the full dynamic range available to begin with.
Finally, you used the phrase "above the hearing threshold." The value of 24-bit recording comes into play towards the bottom of the hearing threshold.
Anyone with a *trained ear* who can't hear what raw 24-bit recordings are capable of doesn't either have the proper source material (i.e. source material with at least 19 significant bits of true signal), or they don't have a capable reproduction system.
There's nothing religious about the argument whatsoever, and any pro audio engineer worth their weight in ears will tell you how audibly valuable the 24-bit domain can be. Where people get lost in evaluating these "listening tests" is in understanding what their expectations should be given the test subjects' listening experience, hearing, the quality of the source material, the production methodologies employed, and the reproduction system and environment. People then end up thinking that one test's results regarding a *mastered* recording in some magazine is capable of proving something about all aspects of the 24-bit domain..... The fact that a fully mastered 16-bit recording from a 24bit source can be made to sound as good as its 24bit counterpart is a testament to the fact that there *is* an audible difference between 16-bit and 24-bit *raw source* material.
Folks that question what many audiophiles and pro audio engineers claim to be able to hear generally never make any effort to train their ear to be able to hear them. They'd prefer to jump on the popular bandwagon of folks ready to patently dismiss what isn't blatantly obvious to them at first. In my opinion, the inability to hear the benefits are attributed to either lazyness, deafness, psychological barriers, or lack of proper reproduction environment.
My background and basis for being able to hear what I'm talking about comes purely from listening to raw and/or lightly mastered 24-bit recordings, on good monitors, with a good D/A, in a good room, with little to no EQ, and absolutely no compression. Compare that to the same source's 16-bit counterpart, and it's no contest. Many audiophiles I know don't want to hear any processing at all. They want to hear the pure interaction between mic diaphragm, analog components, cables, and drivers. They want to be able to perceive subtle colorations of components as they add or subtract from the reproduction experience. People who blindly bash audiophiles and audiophile jargon generally don't understand what it is audiophiles are interested in hearing in their music.
-DH