Public Listening Test [2010], Discussion |
![]() ![]() |
Public Listening Test [2010], Discussion |
Feb 11 2010, 15:59
Post
#176
|
|
|
Group: Developer Posts: 1126 Joined: 11-February 03 From: Germany Member No.: 4961 |
Very interesting! Have you made this by hand or is there a tool that could do that?
This post has been edited by rpp3po: Feb 11 2010, 16:17 |
|
|
|
Feb 11 2010, 16:58
Post
#177
|
|
|
Group: Members Posts: 73 Joined: 14-December 06 Member No.: 38681 |
Very interesting! Have you made this by hand or is there a tool that could do that? mp4box can split streams, but I just extracted each frame into separate file "mp4box -raws 1 emese_looped.m4a" Also, I made simple m-script to * remove silence * trim emese to contain multiple of 1024 samples * compensate encoder delay (add silence and skip first 2 aac frames latter) * write loop using writewav (it can append) * collect bitrate data |
|
|
|
Feb 11 2010, 20:04
Post
#178
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
I see. Then I wonder what the graph for QuickTime TVBR would look like...
Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Feb 12 2010, 03:28
Post
#179
|
|
|
Group: Developer Posts: 1126 Joined: 11-February 03 From: Germany Member No.: 4961 |
I am also very interested to see a comparison to the TVBR version, if you find the time. Sadly I'm not practiced in m myself.
emese_qt_tvbr_60_highest.m4a ( 179.52K )
Number of downloads: 109 |
|
|
|
Feb 12 2010, 11:50
Post
#180
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
Using a modified version of faad released by Ivan Dimkovic some years ago, this is the bitrate distribution I get with your sample, rpp3po:
This post has been edited by guruboolez: Feb 12 2010, 12:03 |
|
|
|
Feb 12 2010, 12:30
Post
#181
|
|
|
Group: Members Posts: 73 Joined: 14-December 06 Member No.: 38681 |
The CVBR looks different since I had to update quicktime to use qtaacenc.
![]() Note this is bitrate distribution for looped emese sample. And each point corresponds to bitrate of one repetition of emese sample. EDIT: legend This post has been edited by .alexander.: Feb 12 2010, 12:34 |
|
|
|
Feb 13 2010, 16:38
Post
#182
|
|
|
Group: Developer Posts: 1126 Joined: 11-February 03 From: Germany Member No.: 4961 |
I would never have thought, that the average bitrate variation for CVBR would be that large depending on iterations. Could that effect be mitigated by adding pre-silence to the test samples?
|
|
|
|
Feb 13 2010, 17:11
Post
#183
|
|
|
Group: Members Posts: 1315 Joined: 3-January 05 From: Argentina, Bs As Member No.: 18803 |
As AlexB has found that Divx hadn't suitable VBR option for 128 kbps http://www.hydrogenaudio.org/forums/index....st&p=686087
I still didn't receive the answer from Divx developer.One week has passed. He answered very late before. Let's make decision by ourselfs. Possible Divx settings: a) v4/v5 alternate setting which produces more close bitrate to other competitors on specific sample. I know it's not the best idea but still an option. b) CBR 128. This post has been edited by IgorC: Feb 13 2010, 17:16 |
|
|
|
Feb 13 2010, 17:19
Post
#184
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
I would never have thought, that the average bitrate variation for CVBR would be that large depending on iterations. Could that effect be mitigated by adding pre-silence to the test samples? Probably not, pre-silence doesn't empty the bit reservoir. I think the best thing would be to concatenate all test samples into a single file (it seems this is commonly done while standardizing MPEG coders), and then to prepend a few seconds of noise to the beginning of that file so that the first sample is encoded with a half empty bit reservoir state. Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Feb 13 2010, 17:46
Post
#185
|
|
|
Group: Developer Posts: 1126 Joined: 11-February 03 From: Germany Member No.: 4961 |
That sounds like a good approach and not too much overhead. A little script employing mp4box could be used to produce readily cut file sets after encoding.
I think Divx should be CBR 128, with the developers' option to provide a better matching preset if they want to improve their chances in the competition. This post has been edited by rpp3po: Feb 13 2010, 18:03 |
|
|
|
Feb 13 2010, 18:19
Post
#186
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
Agreed. Agreed.
-------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Feb 13 2010, 21:36
Post
#187
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
During the last listening tests the two first seconds of each encoding were discarded (by ABC/HR). Isn't it enough to avoid some technical issues?
This post has been edited by guruboolez: Feb 13 2010, 21:37 |
|
|
|
Feb 14 2010, 08:29
Post
#188
|
|
|
Group: Members Posts: 288 Joined: 14-August 06 Member No.: 34027 |
During the last listening tests the two first seconds of each encoding were discarded (by ABC/HR). Isn't it enough to avoid some technical issues? Why on earth would you throw out the first two seconds??? |
|
|
|
Feb 14 2010, 10:36
Post
#189
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
It was explained some years ago. Some elements of answers:
http://www.hydrogenaudio.org/forums/index....c=29555&hl= |
|
|
|
Feb 14 2010, 13:04
Post
#190
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
Yes, but some test samples are only 5-6 seconds long. I wouldn't want to throw away precious 25 or more percent of that.
Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Feb 14 2010, 14:10
Post
#191
|
|
![]() Group: Members Posts: 1303 Joined: 14-September 05 From: Helsinki, Finland Member No.: 24472 |
In addition to the link Guruboolez provided, here are some other relevant links:
http://www.hydrogenaudio.org/forums/index....mp;#entry343318 http://www.hydrogenaudio.org/forums/index....mp;#entry447231 http://www.hydrogenaudio.org/forums/index....mp;#entry382267 Edit: these are the initial related posts in the linked threads. Related replies may follow after a few unrelated other replies. Yes, but some test samples are only 5-6 seconds long. I wouldn't want to throw away precious 25 or more percent of that. So as an AAC developer, can you tell if your AAC encoder needs some time to adopt to the content before it provides the best possible quality? Regardless of the encoder specific behavior, I think that if you want to test a certain very short passage you should simply encode a sample that starts 2 seconds earlier. If the original source track actually starts with the critical passage that is intended to be tested you can always configure that sample to start from the beginning. It would be fair for all encoders. You would then be testing how the encoders can handle a track that starts with such content. iTunes CVBR may be a real problem if its behavior is inconsistent. We discussed about a bit similar problem when WMA 2-pass VBR was one of the possible test contenders. In general, the only really correct and fair way to simulate a real life usage situation would be to encode the complete original source tracks, decode the encoded tracks, cut the test samples from the decoded tracks, and store the samples in a lossless format. This has been discussed in the past, but it has never been a viable option for various practical reasons. This post has been edited by Alex B: Feb 14 2010, 14:48 -------------------- http://listening-tests.freetzi.com
|
|
|
|
Feb 14 2010, 14:43
Post
#192
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
Yes, but some test samples are only 5-6 seconds long. I wouldn't want to throw away precious 25 or more percent of that. Chris If needed I can easily upload a longer sample for emese (though I'm not convinced that this kind of extreme sample really belongs to this listening test). |
|
|
|
Feb 14 2010, 15:15
Post
#193
|
|
|
Group: Developer Posts: 1126 Joined: 11-February 03 From: Germany Member No.: 4961 |
For AAC at ~128 kbit/s rates, extreme samples like emese are the bread & butter of this test.
This post has been edited by rpp3po: Feb 14 2010, 15:20 |
|
|
|
Feb 14 2010, 16:25
Post
#194
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
That's right for a public listening test. That's also why such test at 128 kbps test is already doomed.
If you don't want an « all-tied » conclusion like previous tests at this bitrate you necessary have to feed the listeners with extremely difficult to encode samples - which are also unrepresentative ones (by unrepresentative, I mean that the tested material won't correspond to what people are listening on a daily basis). Such methodology is - be sure of that - a very good argument against the validity of the test. Of course we may put one or two special samples for pedagogic purpose, in order to show that lossy encoders could fail on extreme material. But not more. With more « musical » (difficult but normal) samples the conclusion should be the same than previous LT at 128 kbps : a complete statu quo. I don't think it's worth to put so much energy to show that 128 kbps encoders are equally transparent for a panel of 25...30 persons. http://www.listening-tests.info/mp3-128-1/results.htm http://www.listening-tests.info/mf-128-1/results.htm This post has been edited by guruboolez: Feb 14 2010, 16:26 |
|
|
|
Feb 14 2010, 16:43
Post
#195
|
|
|
Group: Developer Posts: 1126 Joined: 11-February 03 From: Germany Member No.: 4961 |
A "representative" test would just show what we already know. Why should anymore time be wasted on that? There is no need to prove again that all tested encoders can be transparent for most material. The 2010 edition tries to explore the tiny range between most and 100%. Which encoder comes closest? An thus it bears the potential to actually produce new results, we didn't know already: Like is TVBR really better than CVBR or even the other way around, if TVBR choked on some content? How does Nero compare to QT? And what about the new contenders. All of them should be perfectly fine for most material, we don't need to test that again. And going for more representative music selection at lower bitrates would just show which encoder produces the best low bitrate results and not necessarily tell anything about transparency potential in the bitrate range people actually use.
This post has been edited by rpp3po: Feb 14 2010, 16:45 |
|
|
|
Feb 14 2010, 19:05
Post
#196
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
If I understand your last message correctly, I think I'm very far to share any interest in the future test (I apologize from discovering it right now: I haven't read the full debate).
Extreme samples were usually helpful to test encoders under stressing situation and then validate the choice of any high bitrate settings (like lame --alt-preset, musepack, lossywav, etc...). People using such high lossy encodings expect transparent results even in extreme cases (at least in most extreme ones). But 128 kbps is very far from high bitrate and I don't think many people would expect from such low bitrate a robust, artefact-free, music library. So if I understand all this correctly the future test should tell us how good (or bad) are 128 kbps encodings under extraordinary situations but won't give us any idea about how different they could be with music for daily usage (I assume it was what people are expecting from a test at 128 kbps). If I'm right, the practical interest in such test seems very limited. At least to me. Regards. |
|
|
|
Feb 14 2010, 20:20
Post
#197
|
|
|
Group: Developer Posts: 1126 Joined: 11-February 03 From: Germany Member No.: 4961 |
But 128 kbps is very far from high bitrate and I don't think many people would expect from such low bitrate a robust, artefact-free, music library. The general sentiment was that 128 is even a very high bitrate for an AAC listening test. It was expected that at that bitrate, with normal samples, only very few participants would be able to contribute anything at all, because most wouldn't hear a difference. |
|
|
|
Feb 14 2010, 21:36
Post
#198
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
128 kbps is indeed high for a public listening test of modern encoders. That's why I really think it would be judicious to not make a new one rather than forcing the chance to get detailed results by using a lot of meaningless samples.
I realize that shouldn't have open the debate about the chosen bitrate. I'm sorry for that and will stop it right now. I repeat my offer: if needed I can upload a longer version of the emese sample (from the original CD) and maybe other samples if I can. |
|
|
|
Feb 14 2010, 23:20
Post
#199
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
Thanks for the offer, guruboolez! I'm sure we will need to make use of it.
... but won't give us any idea about how different they could be with music for daily usage ... Correct, but as rpp3po mentioned, we already know that. Except maybe for the DivX encoder, all encoders in question have been developed for years, so we can expect them to perform equally well (i.e. with satistically insignificant quality differences on average) on daily music. The question is how large the quality differences are for extremely critical material. Sure, an "average" reader might not care about such material, but such a reader can refer to the previous listening tests you referred to. By the way, related question: Do you - or does anyone else, for that matter - think a 96-kb test with less critical samples will show greater differences between the encoders? Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Feb 14 2010, 23:32
Post
#200
|
|
![]() Group: Members (Donating) Posts: 3474 Joined: 7-November 01 From: Strasbourg (France) Member No.: 420 |
Honestly, I don't think it will. I strongly believe that the difference between contenders must be very strong and immediately obvious to appear on the final plots. And I'm pretty sure that the gap between Nero & Apple is far to be large enough (assuming that a gap really exists) for expecting such results.
|
|
|
|
![]() ![]() |
|
Lo-Fi Version | Time is now: 23rd May 2013 - 23:29 |