I wonder if comparative tests have been done on HA about judging lossy codecs on headphones and speakers.
When public listening tests are done, is one required to specify listening conditions ?
It's my impression that the optimization of codecs is largely based on feedback from headphone listeners (at least on HA).
Are there any statistics about that and if so, does it matter ?
Although I like headphones in some situations, I wouldn't use them to make important sonical judgements during mixing or mastering.
Frequency response of headphones (
examples) is often far from "flat", which can have quite some influence on masking properties.
If you can't feed your hifi stereo system with the sound from your pc, putting various samples on a cd would seem a workable solution.
You could do an ABX test by using 3 samples of each fragment: original, coded version and X (one of the previous ones).
One file will be used twice, which costs more discspace, but it makes testing easier IMO.
You'll need someone (or a tool) to put the files in random order to make the test double blind.
Please do report your findings. It's an interesting issue.