A couple of weeks ago we had a minor debate about the quality of the modern lossy encoders at a Finnish AV forum. My plan was to provide the forum users an opportunity to test their beliefs and hearing by providing a few test samples and instructions how to use the foo_abx tool.
I prepared five samples of different genres from my collection. I thought the selected samples would be a bit above average in complexity - nothing like killer samples, but not too easy for the encoders. I encoded the samples using three different quality levels and two encoders: Vorbis b4.5 (-q 1.5, -q 4.25 and -q 6.25) and LAME 3.97b1 (-V8, -V5 and -V2; all --vbr -new). The idea was to explain something like this:
1. the lowest quality is useful e.g. with portables, but not actually "hifi" quality
2. the middle quality is very good and finding differences is not easy
3. the highest quality is transparent or almost transparent.
My plan didn't work out. After ABX testing the lowest quality samples I realized that my samples are going to be way too easy for the encoders at the higher quality levels. For example, one of the local audio gurus who accepts only lossless files tried to ABX one of the samples. He could ABX MP3 -V8 and didn't like it, but the Vorbis -q 1.5 sample made him almost angry because he couldn't ABX it. He used his high-end speakers instead of headphones, but this tells something about the Vorbis quality anyway. There was no sense to continue the test with the higher quality samples.
I had no plans to publish any results here, but because of the ongoing debate about the 128 kbps test here is a nice example:
The test sample
hot_tequilla_brown.flac (genre: ~electronic/pop/funk (?), 21 s, 2.63 MB)
- This sample produces 91-96 kbps (my overall test target was ~88 kbps for this quality level).
The tested lossy files are available in this package: lossy_samples.zip (767 kB)
My ABX results
LAME 3.97 beta1 -V8 --vbr-new ~94 kbps
CODE
foo_abx v1.2 report
foobar2000 v0.8.3
2005/11/13 00:01:32
File A: file://E:\test\Monkey's Audio 3.99 High\hot_tequilla_brown.ape
File B: file://E:\test\LAME 3.97 beta 1 -V8 --vbr-new\hot_tequilla_brown.mp3
00:01:35 : Test started.
00:02:04 : 01/01 50.0%
00:02:09 : 02/02 25.0%
00:02:17 : 03/03 12.5%
00:02:29 : 04/04 6.3%
00:02:35 : 05/05 3.1%
00:02:41 : 06/06 1.6%
00:02:51 : 07/07 0.8%
00:03:04 : 08/08 0.4%
00:03:08 : Test finished.
----------
Total: 8/8 (0.4%)
foobar2000 v0.8.3
2005/11/13 00:01:32
File A: file://E:\test\Monkey's Audio 3.99 High\hot_tequilla_brown.ape
File B: file://E:\test\LAME 3.97 beta 1 -V8 --vbr-new\hot_tequilla_brown.mp3
00:01:35 : Test started.
00:02:04 : 01/01 50.0%
00:02:09 : 02/02 25.0%
00:02:17 : 03/03 12.5%
00:02:29 : 04/04 6.3%
00:02:35 : 05/05 3.1%
00:02:41 : 06/06 1.6%
00:02:51 : 07/07 0.8%
00:03:04 : 08/08 0.4%
00:03:08 : Test finished.
----------
Total: 8/8 (0.4%)
LAME was easy to ABX because of the obvious lowpass. I think -V8 is too low setting for anything that contains high frequencies. (Though, in general my high frequency hearing is not excellent. I can't ABX a lowpass over 16 kHz)
LAME at -V5 was much better with this sample. It sounded fine in casual listening, but I didn't ABX it.
Vorbis aoTuV beta 4.5 -q 1.5 ~96 kbps
CODE
foo_abx v1.2 report
foobar2000 v0.8.3
2005/11/13 00:27:18
File A: file://E:\test\Monkey's Audio 3.99 High\hot_tequilla_brown.ape
File B: file://E:\test\Vorbis aoTuV beta 4.5 -q1,5\hot_tequilla_brown.ogg
00:27:21 : Test started.
00:32:10 : 01/01 50.0%
00:32:50 : 02/02 25.0%
00:33:14 : 02/03 50.0%
00:33:27 : 03/04 31.3%
00:34:26 : 04/05 18.8%
00:35:00 : 05/06 10.9%
00:35:12 : 06/07 6.3%
00:35:25 : 07/08 3.5%
00:37:55 : 08/09 2.0%
00:38:09 : 09/10 1.1%
00:38:29 : 10/11 0.6%
00:38:48 : Test finished.
----------
Total: 10/11 (0.6%)
foobar2000 v0.8.3
2005/11/13 00:27:18
File A: file://E:\test\Monkey's Audio 3.99 High\hot_tequilla_brown.ape
File B: file://E:\test\Vorbis aoTuV beta 4.5 -q1,5\hot_tequilla_brown.ogg
00:27:21 : Test started.
00:32:10 : 01/01 50.0%
00:32:50 : 02/02 25.0%
00:33:14 : 02/03 50.0%
00:33:27 : 03/04 31.3%
00:34:26 : 04/05 18.8%
00:35:00 : 05/06 10.9%
00:35:12 : 06/07 6.3%
00:35:25 : 07/08 3.5%
00:37:55 : 08/09 2.0%
00:38:09 : 09/10 1.1%
00:38:29 : 10/11 0.6%
00:38:48 : Test finished.
----------
Total: 10/11 (0.6%)
Very difficult to ABX. It took me 5 minutes to find a passage where I could possibly hear a difference. Vorbis was almost transparent with this sample. In casual listening I couldn't hear any problems.
Window Media Audio
Today I tested WMA standard with the same sample. I had never properly tried WMA at VBR quality 25.
WMA 9.1 Standard VBR25 ~91 kbps
CODE
foo_abx v1.2 report
foobar2000 v0.8.3
2005/11/28 17:38:04
File A: file://E:\test\Monkey's Audio 3.99 High\hot_tequilla_brown.ape
File B: file://E:\test\WMA9.1 STD VBR25\hot_tequilla_brown.wma
17:38:06 : Test started.
17:39:10 : 01/01 50.0%
17:39:42 : 02/02 25.0%
17:40:24 : 03/03 12.5%
17:40:42 : 04/04 6.3%
17:41:14 : 05/05 3.1%
17:42:03 : 06/06 1.6%
17:42:44 : 07/07 0.8%
17:43:14 : 07/08 3.5%
17:44:11 : 08/09 2.0%
17:47:26 : 08/10 5.5%
17:47:44 : 08/11 11.3%
17:49:04 : 09/12 7.3%
17:49:46 : 10/13 4.6%
17:50:37 : 11/14 2.9%
17:53:24 : 12/15 1.8%
17:54:03 : 13/16 1.1%
17:54:34 : 14/17 0.6%
17:54:54 : 15/18 0.4%
17:58:55 : 16/19 0.2%
17:59:53 : 17/20 0.1%
18:00:01 : Test finished.
----------
Total: 17/20 (0.1%)
foobar2000 v0.8.3
2005/11/28 17:38:04
File A: file://E:\test\Monkey's Audio 3.99 High\hot_tequilla_brown.ape
File B: file://E:\test\WMA9.1 STD VBR25\hot_tequilla_brown.wma
17:38:06 : Test started.
17:39:10 : 01/01 50.0%
17:39:42 : 02/02 25.0%
17:40:24 : 03/03 12.5%
17:40:42 : 04/04 6.3%
17:41:14 : 05/05 3.1%
17:42:03 : 06/06 1.6%
17:42:44 : 07/07 0.8%
17:43:14 : 07/08 3.5%
17:44:11 : 08/09 2.0%
17:47:26 : 08/10 5.5%
17:47:44 : 08/11 11.3%
17:49:04 : 09/12 7.3%
17:49:46 : 10/13 4.6%
17:50:37 : 11/14 2.9%
17:53:24 : 12/15 1.8%
17:54:03 : 13/16 1.1%
17:54:34 : 14/17 0.6%
17:54:54 : 15/18 0.4%
17:58:55 : 16/19 0.2%
17:59:53 : 17/20 0.1%
18:00:01 : Test finished.
----------
Total: 17/20 (0.1%)
This was not easy, but I could hear the difference at a certain passage, mostly because of the slight lowpass and perhaps a bit narrower stereo width. But after the first seven tries my ears got tired and I had difficulties to ABX. I wanted to be sure and continued through 20 trials. It took over 20 minutes. In general I couldn't hear any obvious problems. (I didn't expect VBR25 to be this good. Am I becoming deaf?) Since I tested Vorbis two weeks ago I cannot directly compare these two codecs.
The test gear used: Terratec DMX 6fire 24/96 soundcard, Harman/Kardon AVI 200 MKII amp, KOSS HV/1A headphones.
Edit: typo
Edit 2: changed the lossless sample to FLAC format
Edit 3: added the lossy samples
EDIT 4:
I removed the samples to make room. PM me if you like to try them.
