Public MP3 Listening Test @ 128 kbps - FINISHED |
Public MP3 Listening Test @ 128 kbps - FINISHED |
Nov 24 2008, 22:30
Post
#1
|
|
![]() Group: Members Posts: 3620 Joined: 14-May 03 From: Bad Herrenalb Member No.: 6613 |
The much awaited results of the Public, MP3 Listening Test @ 128 kbps are ready - partially. So far, I only uploaded an overall plot along with a zoomed version. The details will be available in the next few days. You can also download the encryption key along with the submitted results on the results page that is located here: http://www.listening-tests.info/mp3-128-1/results.htm
The results show that all encoders are tied on first place, except l3enc which of course comes out last being the low anchor. What is interesting to see is how the MP3 codec actually evolved since its first days (l3enc was the first MP3 software encoder back in 1994 when it was released) and how it is still competitive with newer formats like AAC or Ogg Vorbis. Another very interesting thing, which was also one of the goals for this test, is that Fraunhofer and especially Helix, which both outperform LAME in terms of encoding speed, are still very competitive. While statistically being tied to LAME on first place, Helix actually even received a higher rating than LAME 3.98.2 - and this at 90x encoding speed! Even FhG received a slightly higher score at least against LAME 3.97 which was the recommended encoder by the Hydrogenaudio community for a long time. But again, statistically, they are all tied so there is no quality winner. ![]() The quality at 128 kbps is very good and MP3 encoders improved a lot since the last test. This was the last test conducted by me at this bitrate. It's time to move to bitrates like 96 kbps or 80 kbps. Here is a zoomed version of the plot showing the competitors only and leaving out the low anchor l3enc. ![]() Finally, I would like to thank everyone who participated! EDIT: Whoops, the link to the results was pointing to the 64 kbps multiformat test by mistake. Corrected now. This post has been edited by Sebastian Mares: Nov 24 2008, 22:46 -------------------- http://listening-tests.hydrogenaudio.org/sebastian/
|
|
|
|
![]() |
Nov 24 2008, 23:25
Post
#2
|
|
|
Group: Banned Posts: 185 Joined: 1-July 08 Member No.: 55148 |
They are all techincally tied, but Helix outperformed all of them. Also, the encoding speed compared to LAME is absurd faster. Could these two arguments qualify Helix for the new recommended MP3 encoder? (LAME being the second recommended)
|
|
|
|
Nov 25 2008, 00:43
Post
#3
|
|
|
Group: Members Posts: 2257 Joined: 9-October 05 From: Dormagen, Germany Member No.: 25015 |
... Could these two arguments qualify Helix for the new recommended MP3 encoder? ... I've never been too happy with recommendations especially when it's about just one encoder. I was especially unhappy with recommending Lame 3.97. There was also a listening test where Lame 3.97 came out great, with a bigger quality difference against the contenders compared to the more or less equal scores in this test as far as average score is concerned. It was after the test that 3.97's 'sandpaper problem' became known. The question is how to weigh it, the question is: how annoying is it for the person who reads the recommendation? It may be negligible, it may be a big issue. The problem is that we can't test encoders on the universe of music. We can get significant experience with encoders, that's why Sebastian's test is important. But we should always take the results with a grain of salt. There's also the question what kind of a result you have in focus. Usually people concentrate on the average result of an encoder averaged over all samples. But is this really the real thing which is most important? That's a very personal question. You can look at worst case behavior which is what I do in the first place. To me it's more important that my favorite encoder has a low number of scores below 4.0, and - at best - there is no sample with a score below 3.0. But this too has to be taken with a grain of salt. A bad score on an (to me) exotic sample doesn't count much to me, but it has a very high impact if it happens with music of my favorite genre. So evaluating an encoder is more than just looking at the average scores of a listening test. Instead of giving a rather strong recommendation as was done so far I'd prefer if we had a weaker suggestion, kind of: When targeting at a quality which can be achieved with ~128 kbps on average the most recent mp3 listening test has shown that the current versions of Lame, Helix, Fraunhofer, iTunes all do an excellent job. Quality differences between them were negligible within this test as far as the average outcome was concerned, with XXX and YYY having the best consistency in high quality (in case it turns out that such a statement can be made). -------------------- lame3100i -V0.5+ --adbr_short 480
|
|
|
|
Nov 25 2008, 00:48
Post
#4
|
|
|
Moderator Group: Super Moderator Posts: 3934 Joined: 29-September 01 Member No.: 73 |
Excuse me, but what is the correspondance between the files and the encoders ?
|
|
|
|
Nov 25 2008, 01:16
Post
#5
|
|
![]() Group: Developer Posts: 2980 Joined: 2-December 07 Member No.: 49183 |
Excuse me, but what is the correspondance between the files and the encoders ? As shown on the pictures above -- samplexx_1.mp3 is encoded by iTunes, samplexx_2.mp3 - lame 3.98.2 etc. Of course, ABC/HR tool randomizes order of samples every time you load abc/hr config file (Samplexx.ecf) Sorry, it's not clear to me how many subjects participated. Can you point me to that in the graph? Downloaded results.rar: 39 - 26 - 26 - 27 - 30 - 26 - 26 - 26 - 26 - 26 - 27 - 26 - 29 - 30. |
|
|
|
Sebastian Mares Public MP3 Listening Test @ 128 kbps - FINISHED Nov 24 2008, 22:30
/mnt Wow am really shocked, Helix (Xing) has performed ... Nov 24 2008, 22:52
Sebastian Mares I kept telling you guys that the results will be q... Nov 24 2008, 22:56
Neasden Does that make Helix the new recommended MP3 encod... Nov 24 2008, 23:00
/mnt QUOTE (Neasden @ Nov 24 2008, 23:00) Does... Nov 24 2008, 23:05
Neasden Yes it is, I just noticed it!
I can't bel... Nov 24 2008, 23:14
greynol I don't think open source has anything to do w... Nov 24 2008, 23:14
Sebastian Mares If you submitted results, I recommend you look at ... Nov 24 2008, 23:14
halb27 I am curious about the detailed results as my inte... Nov 24 2008, 23:22
Canar Thank you very much Sebastian. We have some things... Nov 24 2008, 23:51
DigitalDictator This is indeed surprising. I'm sure I've s... Nov 24 2008, 23:56
halb27 QUOTE (DigitalDictator @ Nov 25 2008, 00... Nov 25 2008, 00:11

Pio2001 AAAAAAAAAAAARRRRRRRRGGGGHGHHHHHH !!
They... Nov 25 2008, 00:34
melomaniac I analyzed my results and the ranking of the encod... Nov 25 2008, 09:19
Pio2001 QUOTE (melomaniac @ Nov 25 2008, 09:19) A... Nov 25 2008, 14:04
ZinCh This is only 128k tests, so Helix in the winner in... Nov 24 2008, 23:59
Neasden I encoded a few tracks using -V150 (VBR range 0-15... Nov 25 2008, 00:07
greynol >Helix can be recomended for 128k encoding.
Ho... Nov 25 2008, 00:10
kwanbis Wow (even if the difference between LAME 3.98.2 an... Nov 25 2008, 00:42
krabapple Sorry, it's not clear to me how many subjects ... Nov 25 2008, 01:02
Pio2001 Thanks, I analyzed my own results. That's what... Nov 25 2008, 01:37
Raiden QUOTE (Pio2001 @ Nov 25 2008, 01:34) Oh, ... Nov 25 2008, 02:12
Canar QUOTE (Raiden @ Nov 24 2008, 17:12) agree... Nov 25 2008, 02:32
sld Regarding statistics... the confidence intervals w... Nov 25 2008, 04:51
Sebastian Mares QUOTE (sld @ Nov 25 2008, 04:51) Regardin... Nov 25 2008, 09:02
Squeller Is this claim correct? There has been no improveme... Nov 25 2008, 08:46
halb27 Zoomed view is formally correct, but has a tendenc... Nov 25 2008, 09:19
Squeller QUOTE (halb27 @ Nov 25 2008, 10:19) Zoome... Nov 25 2008, 09:28
halb27 QUOTE (Squeller @ Nov 25 2008, 10:28) Bas... Nov 25 2008, 09:44
melomaniac QUOTE (Squeller @ Nov 25 2008, 09:28) Abo... Nov 25 2008, 09:46
sld QUOTE (Squeller @ Nov 25 2008, 16:28) Bas... Nov 25 2008, 20:05
memomai Just confused. Helix worse than lame, Helix better... Nov 25 2008, 10:08
halb27 QUOTE (memomai @ Nov 25 2008, 11:08) Just... Nov 25 2008, 10:30
Alexxander QUOTE (halb27 @ Nov 25 2008, 10:30) ...
W... Nov 25 2008, 10:55
halb27 QUOTE (Alexxander @ Nov 25 2008, 11:55) .... Nov 25 2008, 13:04
Alexxander Before anything I have to thank Sebastian again fo... Nov 25 2008, 10:44
muaddib It is not good to conclude, from the results of th... Nov 25 2008, 11:15
Alexxander QUOTE (muaddib @ Nov 25 2008, 11:15) It i... Nov 25 2008, 12:37
Jan S. Wouldn't it be possible to compare the varianc... Nov 25 2008, 12:33
Sebastian Mares QUOTE (Jan S. @ Nov 25 2008, 12:33) Would... Nov 25 2008, 12:44
robert I would be more interested in Quartile, instead of... Nov 25 2008, 13:01
kwanbis QUOTE (robert @ Nov 25 2008, 12:01) I wou... Nov 25 2008, 13:06
Sebastian Mares QUOTE (robert @ Nov 25 2008, 13:01) I wou... Nov 25 2008, 14:47
westgroveg If anything the test shows samples where LAME need... Nov 25 2008, 13:55
Alex B QUOTE (westgroveg @ Nov 25 2008, 14:55) I... Nov 25 2008, 14:51
uart QUOTE (Alex B @ Nov 25 2008, 05:51) In ge... Nov 25 2008, 16:30
/mnt Just try some Metal tracks on Helix at V60, I guar... Nov 25 2008, 15:19
Neasden /mnt told me that Helix is not gapless, which is t... Nov 25 2008, 16:07
Synthetic Soul QUOTE (Neasden @ Nov 25 2008, 15:07) /mnt... Nov 25 2008, 17:21
Alexxander QUOTE (Synthetic Soul @ Nov 25 2008, 17:2... Nov 25 2008, 18:27
DigitalDictator Why would Helix struggle with metal? IIRC it also ... Nov 25 2008, 16:15
/mnt QUOTE (DigitalDictator @ Nov 25 2008, 16... Nov 25 2008, 16:38
Sebastian Mares Well, one thing you have to consider is that Helix... Nov 25 2008, 16:32
IgorC I can confirm statements of /mnt . Helix isn't... Nov 25 2008, 17:23
ff123 QUOTE (IgorC @ Nov 25 2008, 08:23) I can ... Nov 25 2008, 17:26
Alex B QUOTE (Sebastian Mares @ Nov 25 2008, 17... Nov 25 2008, 17:34
guruboolez The 2 or 3 first seconds were already ignored in t... Nov 25 2008, 18:04
TechVsLife @sebastian mares: thanks for the test!
@gurub... Nov 25 2008, 21:44
Sebastian Mares QUOTE (ff123 @ Nov 25 2008, 17:26) QUOTE ... Nov 25 2008, 18:08
Alex B QUOTE (Sebastian Mares @ Nov 25 2008, 19... Nov 25 2008, 18:50
Jillian I like the part where test result (quality and enc... Nov 25 2008, 19:13
halb27 QUOTE (Jillian @ Nov 25 2008, 20:13) I li... Nov 25 2008, 21:13
Canar QUOTE (Neasden @ Nov 25 2008, 07:07) /mnt... Nov 25 2008, 19:34
Alex B QUOTE (Synthetic Soul @ Nov 25 2008, 18:2... Nov 25 2008, 19:36
Canar Gapless encoding using Helix is possible by encodi... Nov 25 2008, 19:51
Neasden Do you mean having to "split" manually t... Nov 25 2008, 20:06
Alex B QUOTE (Neasden @ Nov 25 2008, 21:06) Do y... Nov 25 2008, 20:33
guruboolez QUOTE (Alex B @ Nov 25 2008, 21:33) Perso... Nov 25 2008, 21:00
Synthetic Soul QUOTE (guruboolez @ Nov 25 2008, 20:00) T... Nov 25 2008, 21:07
Sebastian Mares Created 7 / 14 sample graphs...
BTW, Tukey's ... Nov 25 2008, 20:10
kwanbis Sebastian, where it says, "The results are gr... Nov 25 2008, 20:18
Bodhi Great Job once again. Thank you Sebastian! Nov 25 2008, 20:44
Neasden QUOTE As Canar said, it's a hack.
That alone ... Nov 25 2008, 20:58
Neasden unless someone patches Helix and release it with t... Nov 25 2008, 21:05
Sebastian Mares In case you are interested, here is a quick and di... Nov 25 2008, 21:10
halb27 QUOTE (Sebastian Mares @ Nov 25 2008, 22... Nov 25 2008, 21:26
benski QUOTE (Sebastian Mares @ Nov 25 2008, 15... Nov 25 2008, 21:50
Canar The point I was trying to make was that though Hel... Nov 25 2008, 21:14
Neasden in this graph, LAME 3.98.2 seems the more stable e... Nov 25 2008, 21:42
Alex B QUOTE (Neasden @ Nov 25 2008, 22:42) in t... Nov 25 2008, 22:06
Sebastian Mares The graphs for all samples are available on the re... Nov 25 2008, 22:18
halb27 QUOTE (Sebastian Mares @ Nov 25 2008, 23... Nov 25 2008, 22:25
singaiya Is anybody else not surprised that each contender ... Nov 25 2008, 22:21
Sunhillow Thank you for this great checkup, Sebastian! I... Nov 25 2008, 22:35
Zilog Jones I, as many others was also very surprised by the r... Nov 25 2008, 22:53
Sebastian Mares QUOTE (Zilog Jones @ Nov 25 2008, 22:53) ... Nov 25 2008, 23:24
sizetwo Without adding fuel to the fire, I think its stran... Nov 25 2008, 23:16
sld QUOTE (sizetwo @ Nov 26 2008, 06:16) Does... Nov 26 2008, 05:14
singaiya QUOTE (sld @ Nov 25 2008, 20:14) You shou... Nov 26 2008, 05:46
DigitalDictator I've been asking this a couple of times, but I... Nov 25 2008, 23:30
sizetwo QUOTE Statistically, for the people who tested and... Nov 25 2008, 23:34
guruboolez QUOTE (sizetwo @ Nov 26 2008, 00:34) Sorr... Nov 25 2008, 23:52
kwanbis QUOTE (guruboolez @ Nov 25 2008, 22:52) N... Nov 26 2008, 00:36
/mnt I have posted some ABX logs and samples of tracks ... Nov 26 2008, 00:49
JasonQ Good test. Good to see that Helix had a solid sho... Nov 26 2008, 06:18
sizetwo QUOTE (guruboolez @ Nov 25 2008, 15:52) Q... Nov 26 2008, 07:18![]() ![]() |
|
Lo-Fi Version | Time is now: 20th May 2013 - 23:25 |