First, I’ve encoded many classical samples with latest encoder, with –quality 4 (--radio) profile. After two encouraging results on collective tests, musepack’s quality and reputation are now well-established at this bitrate (~135 kbps). I’ve experienced it myself, and I clearly admit that MPC at mid-bitrate performs really well… with some exceptions.
In reality, the same collective tests revealed serious issue with musepack, especially with classical music: bloated bitrate with some samples first (up to 180-210 kbps on harpsichord), and the opposite with some other samples, especially low-volume moments. With the sample named Debussy.wav (opera part: two people speaking/singing with a small orchestral accompaniment wreathed in recording noise), musepack performance was much worse than other competitors during Roberto’s 2nd 128 multiformat collective test (worse than WMA and atrac3!). This unusual low performance was probably correlated to a strange bitrate plunge (~100 kbps), not bad by itself (as long as it not hurts, lower bitrate are always welcome), but really questionable when quality is so strangely bad. Of course, Debussy.wav is only one sample, but it’s not a hard task for someone enjoying classical music to find numerous moments of low-volume including recording noise…
My own experience of musepack at mid-bitrate is not a big one, and a good half of it could be found on the first (and partially flawed) listening test I’ve published twenty months ago (here). In October 2003, musepack 1.14 performed equally with vorbis, slightly better than lame 3.90.3, but worse than WMA9Pro (winner) and QuickTime AAC. In other words, the relative performance of MPC –radio was not as enjoying than quality obtained with less specific music genre. Last but not least, with all progress made during this time by AAC (both Nero & Apple implementations – and faac too), lame, vorbis and maybe WMA(pro and only pro), musepack could seriously pretend to be the potential looser among all other modern audio format.
Yesterday, by listening quickly to various classical samples (I’m building a full library of very short samples, yet unfinished), I was simply amazed by poor performance of musepack at this bitrate on my music (I insist: the encoder performs differently and much better with everything else than the general category of ‘classical’). I’m now used to live with ABR encodings coming from LAME 3.97a, and I could swear with a great confidence that LAME is free of most issues audible with musepack: ringing, lifeless sound, and a typical artifact of mpc which reminds me for unknown reasons a washing machine
Few words about musepack –radio history (and correct me if I’m wrong). Frank Klemm worked on musepack at mid/low bitrate few time after he released mppenc 1.00 (it must be 1.02, 1.04 or 1.06), by introducing a tool called PNS (different beast IIRC from MPEG-4 AAC Perceptual Noise Substitution). As a consequence, the use of PNS lowered the bitrate of some MPC profiles (radio, thumb, telephone) without jeopardizing the output quality. It means that 1.01j --radio will systematically be weightier than 1.15u --radio (~15 kbps) and it implies that both settings are not directly comparable. The comparison I made is logically unfair. But it nevertheless reveals unexpected drawbacks of former musepack tunings, and (I hope so) will maybe open the door of further improvements
As example, Debussy’s encodings (replaygained with high value) could be downloaded here:
• 1.01j
• 1.15u
• Debussy (lossless)
Ok, and then? Now that you’ve read my awful literature, you’re probably disappointed to not see any table of results, ABX score or ANOVA analysis. No? It’s simply because I haven’t made a complete comparison between 1.01j and 1.15u at --radio profile… in order to make one at the famous --standard setting. It might sound totally absurd to compare a three-year old version of mppenc (released on May 2002) to the most recent version of the encoder (February 2005). But if tunings made three years ago hurts on several classical music samples at --radio setting, it is natural to speculate about possible regression with other profiles. But I don’t really like unwarranted speculation (probably as dangerous -if not more- than placebo), and I’d rather make a listening test in order to obtain results, whatever they are, than being suspicious or throwing doubt on current release of musepack. My biggest risk is to hear nothing (good or wrong) between both encoders. But of course, if I have created a long topic and smashed the record of grammatical mistake per square centimeter, it’s not to tell to everybody that I can’t hear something wrong between two encoders considered for several years as transparent to most people


