Help - Search - Members - Calendar
Full Version: Multiformat 48 kbps Listening Test
Hydrogenaudio Forums > Hydrogenaudio Forum > Listening Tests
Pages: 1, 2, 3, 4
Egor
Make sure you're ripping a CD without pre-gap audio (no INDEX 00) and check if DAE error correction is enabled in WMP's preferences.

Hope what you've found isn't an audio watermarking smile.gif .
Sebastian Mares
The CD I tested with has no gaps (I checked with EAC with both my Plextor PX-755A and my LG E10L). DAE error correction is not checked, but the CD is in perfect condition and I ripped it several times with EAC and burst mode without any error or speed decrease.

You watermarking idea isn't bad.
Sebastian Mares
I ripped the same song with WMP once more and the first WMP rip is identical with the second one. I also encoded the WAV file using the VB script and here, the first version is identical with the second one. I am wondering why each encoder produces a different file.
Egor
Try to rip with WMP in PCM, then convert the resulting file to WAV.
Rip the same track with EAC in PCM (adjust no offset correction).
Then bit-compare these two WAVs using EAC's wav comparison function, this will show any offset added by WMP.
Sebastian Mares
Damn, wait a second - EAC is configured to shift data by 30 samples with my PX-755A. I cannot change the read offsets because of AccurateRip, but I will compare WMP vs. Winamp or iTunes.
Sebastian Mares
fb2k vs. WMP:

QUOTE
Comparing:
"C:\Neuer Ordner\10 Rock Da Jam.wav"
"C:\Neuer Ordner\10_CD Track 10.wav"
No differences in decoded data found.



Converted a WAV to WMA using the VBS file and compared it to a fresh WMA rip made with WMP:

QUOTE
Comparing:
"C:\Neuer Ordner\1_10 Rock Da Jam.wma"
"C:\Neuer Ordner\10 Rock Da Jam.wma"
differences found: 20243806 sample(s), starting at 0.1524263 second(s), peak: 0.8087769 at 210.4971429 second(s), 2ch

Sebastian Mares
Guru, did you test if this also happens with WMA Standard?
guruboolez
No, not yet. I was waiting to similar reports (it would be pointless to investigate further if the problems comes from my PC, drive or wrong software installation).
I'm going to bed now. I'll try this tomorrow if nobody will test it in the meantime.
Bye smile.gif
Sebastian Mares
I just encoded with Windows Media Player and then with foobar2000 and converted the fb2k rip to WMA 9.2 Standard, Q25_44_2:

QUOTE
Comparing:
"C:\Neuer Ordner\1_10_CD Track 10.wma"
"D:\Eigene Musik\Verschiedene Interpreten\Just the Best Vol. 12\10 Rock Da Jam.wma"
No differences in decoded data found.



I think the problem occurs with WMA Professional 10 only.

Edit: I used Q25 because that is the lowest setting in WMP.
Egor
Does WMP show identical "File information" (encoding options, bitrate) for those different WMA10Pro tracks?

Edit: you need to start playback in order to see "File information".
Sebastian Mares
Yes-sir.
Egor
QUOTE(Sebastian Mares @ Nov 5 2006, 14:48) *
Yes-sir.
[offtopic]
Yassir? That made me remember that old joke laugh.gif
[/offtopic]
Ivan Dimkovic
QUOTE

My question now: isn't Nero ABR affected by the same problem?


No, because the "window" in our ABR mode is just couple of seconds big. That is easy to test.
Sebastian Mares
QUOTE(Egor @ Nov 5 2006, 10:49) *

QUOTE(Sebastian Mares @ Nov 5 2006, 14:48) *
Yes-sir.
[offtopic]
Yassir? That made me remember that old joke laugh.gif
[/offtopic]


I know that one from here: http://www.youtube.com/watch?v=3rJT5IpyAok biggrin.gif
Alex B
Ivan,

Why would Nero ABR be better than VBR at about 48 kbps average bitrate?

Is this true also with complete tracks that possibly have long difficult passages and long easy passages?


Edit

I mean if the VBR q value was selected by testing a large number of various complete tracks that represent a complete audio library.
guruboolez
I compared the WMA Std CBR encodings directly created by WMP to the output created by Winamp and WMCmd.vbs script (based on the .wav file I got with WMP ripping tool), and all three encodings are different.

The difference is nonetheless small (peak is low and several parts are identical). Length is also different:
CODE

ENCODING TOOL        SAMPLES/fb2k    SAMPLES/Audition

• Windows Media Player  6043596        6043596
• Winamp 5.31           6058414        6043648 [-14766]
• WMCmd.vbs/fb2k        6043640        6043640
___
• PCM source            6041700        6041700

I have no explanation about these difference. I also wonder why fb2k and Audition aren't counting the same number of sample of Winamp's encoding and only this one.
Ivan Dimkovic
QUOTE(Alex B @ Nov 5 2006, 13:32) *

Ivan,

Why would Nero ABR be better than VBR at about 48 kbps average bitrate?


I guess it would not be better. If I recall the test of various bit rate allocation modes in Nero AAC encoder, one VBR method scored first with the highest SDG - but all modes were tied. We used that VBR algorithm in the retail version of the encoder.


QUOTE
Is this true also with complete tracks that possibly have long difficult passages and long easy passages?


ABR buffer is limited in size - so, if we are talking about 30 seconds of silence and 30 seconds of hard to encode signals - I don't think ABR would be much different from CBR.
Sebastian Mares
QUOTE(guruboolez @ Nov 5 2006, 15:56) *

I compared the WMA Std CBR encodings directly created by WMP to the output created by Winamp and WMCmd.vbs script (based on the .wav file I got with WMP ripping tool), and all three encodings are different.

The difference is nonetheless small (peak is low and several parts are identical). Length is also different:
CODE

ENCODING TOOL        SAMPLES/fb2k    SAMPLES/Audition

• Windows Media Player  6043596        6043596
• Winamp 5.31           6058414        6043648 [-14766]
• WMCmd.vbs/fb2k        6043640        6043640
___
• PCM source            6041700        6041700

I have no explanation about these difference. I also wonder why fb2k and Audition aren't counting the same number of sample of Winamp's encoding and only this one.


Now that is weird - WMA Standard CBR doesn't cause problems here. huh.gif
Alex B
I tried a WMP11 ripped wave file (which was bit to bit identical with an EAC ripped track).

I encoded it with Winamp 5.31 & Transcoder @ WMA Pro 48 kbps 1-pass CBR.

The resulting WMA Pro file was different from a file that I ripped directly with WMP11 in the same format.


Edit

WMP -> Winamp

I tried to disable everything that could have changed the result in Winamp output options. (In case Transcoder uses them.)
benski
A test with WMA lossless would quickly determine whether or not different input data is reaching the encoder (my own tests with winamp+wmalossless showed as being identical to the original WAV)

My guess is that either 1) WMP is using special tunings or 2) WMP is watermarking (http://windowssdk.msdn.microsoft.com/en-us...y/aa391543.aspx)

edit: URL
Alex B
QUOTE(Ivan Dimkovic @ Nov 5 2006, 17:20) *

QUOTE(Alex B @ Nov 5 2006, 13:32) *

Ivan,

Why would Nero ABR be better than VBR at about 48 kbps average bitrate?


I guess it would not be better. If I recall the test of various bit rate allocation modes in Nero AAC encoder, one VBR method scored first with the highest SDG - but all modes were tied. We used that VBR algorithm in the retail version of the encoder.


QUOTE
Is this true also with complete tracks that possibly have long difficult passages and long easy passages?


ABR buffer is limited in size - so, if we are talking about 30 seconds of silence and 30 seconds of hard to encode signals - I don't think ABR would be much different from CBR.


If the VBR mode has no specific known problems at this bitrate/quality range I suppose it would be better to use VBR then (as usual).

In real life VBR should be better at least for stuff like classical symphonies, progressive rock, movie soundtracks etc. (i.e. if the audio signal is not something like a highly compressed pop tune without any long term dynamics variation).

Edit: typo
Sebastian Mares
benski, it's not that WMP is the only encoder that produces different outputs. Winamp and WME also produce different files. Funny enough, my test WMP vs. WME with WMA Standard showed no difference while Guru and Alex spotted differences. Maybe I should get in touch with Zambelli again?
Alex B
QUOTE
In real life VBR should be better at least for stuff like classical symphonies, progressive rock, movie soundtracks etc. (i.e. if the audio signal is not something like a highly compressed pop tune without any long term dynamics variation).


- If this "compressed pop tune" happens to be very difficult for the encoder VBR should be better also in this case. VBR should be able to use a higher average bitrate for the complete track if needed.
Alex B
QUOTE(Sebastian Mares @ Nov 5 2006, 18:05) *
benski, it's not that WMP is the only encoder that produces different outputs. Winamp and WME also produce different files. Funny enough, my test WMP vs. WME with WMA Standard showed no difference while Guru and Alex spotted differences. Maybe I should get in touch with Zambelli again?


I revisited this thread: http://www.hydrogenaudio.org/forums/index....showtopic=44760.

You should try to find out if WMP11 really uses some techniques that are not available with the "WMCmd.vbs" method.

Undocumented watermarking in the audio stream itself and only with the WMA Pro format would be rather strange. I hope the explanation is not that.
benski
QUOTE(Sebastian Mares @ Nov 5 2006, 11:05) *

benski, it's not that WMP is the only encoder that produces different outputs. Winamp and WME also produce different files. Funny enough, my test WMP vs. WME with WMA Standard showed no difference while Guru and Alex spotted differences. Maybe I should get in touch with Zambelli again?


I'm showing no differences on a machine with WM Codecs version 9.1, but differences on a machine with WM Codecs 9.2/10 (both Standard CBR and WMA Pro CBR). The three encoders (WMP, Winamp [via WMF SDK], EAC [via WMEncoder.exe]) are all showing different results. If watermarking is the reason, perhaps WMP and WME are using slightly different watermarking settings (so it could be determine which application was used).

Test with Standard 128 CBR:
QUOTE

Comparing:
"C:\temp\EAC_9_2_128_CBR.wma"
"C:\temp\Winamp_9_2_128_CBR.wma"
differences found: 3592144 sample(s), starting at 2.0288662 second(s), peak: 0.1263123 at 92.3111791 second(s), 1ch

Comparing:
"C:\temp\Winamp_9_2_128_CBR.wma"
"C:\temp\WMP_9_2_128_CBR.wma"
differences found: 2361123 sample(s), starting at 2.0288662 second(s), peak: 0.1157227 at 45.3302041 second(s), 1ch

Comparing:
"C:\temp\EAC_9_2_128_CBR.wma"
"C:\temp\WMP_9_2_128_CBR.wma"
differences found: 3604875 sample(s), starting at 2.0767574 second(s), peak: 0.1263123 at 92.3111791 second(s), 1ch
Sebastian Mares
Well, if differences are only marginal and don't affect quality, I guess we should go with WMCmd.vbs because it can be used for batch encoding. WMP does not encode to Q10 WMA Standard. I could also use Winamp, but I just uninstalled it again. tongue.gif
Sebastian Mares
I was talking with Roberto about the problems of testing WMA 2-pass VBR the other day and was wondering about one thing - is only 2-pass VBR affected by the issue I described here or does this affect all VBR modes actually. Therefore, I asked both Ivan and Gabriel how their VBR implementations work and whether or not it is true that "free" VBR will always allocate the same number of bits to a given sample, regardless of the fact that it's part of a full song or the sample was encoded as-is: as an already extracted part of a track. While Ivan confirmed my initial thoughts, Nero producing two more or less identical encodes, Gabriel said this is not the case with LAME. He explained that LAME is using a variable ATH level whichs value is based on the previous loudness. Therefore, encoding a full track is not the same as encoding a sample - even if VBR was used, the sample encoded as-is will not be the same as the sample encoded from the whole track.
I am now wondering how big the effect is. Does this "news" render all previous listening tests based on samples as useless with regards to LAME?
guruboolez
It's for that reason Gabriel suggested 2 years ago (and sometimes recalled it) that testers should discard the first one or two seconds from the tested files.
And if I remember correctly it was done for the last listening tests (an option allows this in ABC/HR).

It needs to be confirmed by Gabriel anyway.
Sebastian Mares
OK, so it's not something that affects the whole encode, but only the first few samples.
Ivan Dimkovic
@Sebastian,

I would not treat LAME variable ATH as such a problem for the listening test. Fact is that many psychoacoustic models take into account the previous samples - and it is not just variable ATH.

For example, there is temporal post-masking phenomenon - which would create different bit distributions for a given sample, based on the loudness of the samples in the past - however, this phenomenon is very local in time - e.g. maximum duration is approx. 200 ms (unless encoder is buggy)

Also, some encoders are using time-domain methods to estimate tonality of the signal - for example, if the masker is behaving unpredictable in the time domain in the past, encoder might judge the masker as being "noisy" - and this can mean up to 20+ dB in the masker power difference.

Additionally, in SBR you might get slightly different results as there is usually small "SBR Reset" flag being sent every second or so (depending on the encoder) - the difference between two encodings of the same sample, but located in the different region is also not big, but it is definitely there.

Etc..

These are just a few factors that might render samples encoded with different quantization resolution depending on the past samples. However, all of these differences IMHO are not so relevant for a listening test.

I think just adding 2-3 seconds of "run-in" is more than enough to make a fair test.
Alex B
QUOTE
I think just adding 2-3 seconds of "run-in" is more than enough to make a fair test.

Isn't cutting the first two seconds off in the ABC/HR options the usual practice?

However, this may be a problem with very short samples or samples that start with audio signal that is meant to demostrate a specific problem. Here is an example of such a sample: http://www.hydrogenaudio.org/forums/index....st&p=420360

The first two or three seconds seem to be problematic for all MP3 encoders at about 128 kbps. The sample is also from the very beginning of a real audio track so it is not artificial.

Perhaps a few seconds of some PCM material could be addded before the sample, but should this be digital silence or some average audio material? Would a few seconds of silence make the encoder behave differently when the real sample starts? If the sudden signal change alters the encoding result we would need to know what is the encoder "default" before it starts adjusting its parameters and use an audio signal that would not change this default if possible.

Edit

Naturally it is possible to decode the sample and add an audio signal after that. The only downside would be the larger file size of the lossless test sample.
Gabriel
Most modern audio and video encoders will produce different results based on previous samples. It might be because of detection methods (predictability, ATH level,...) or because the encoder is "learning" (mostly video encoders).

In both cases, discarding a few seconds at start (those discarded data beeing similar to tested range - ie no "scene cut") are enough to compensate for this behaviour.

In the Java ABC/HR, up to now, we have to trick it by adjusting the "sample delay" by 2 seconds. (would be nice to be able to specify a testing range instead of this hack)
Alex B
So the correct approach for my example sample would be to encode it as it is (since it is from the beginning of a real audio track), decode it and add at least two seconds of digital silence in the beginning.

If some other "too short" sample is from the middle of the audio track, a longer passage of the same track should be encoded. At least it should start more than two seconds before the intended sample starting point.*

Edit:

*If preferred, this type of encoded sample can be cutted to the intended length after decoding. In this case at least two seconds of silence must be added in the beginning if the sample is going to be used with the two second Java ABC/HR delay setting.
Gabriel
QUOTE(Alex B @ Nov 6 2006, 13:06) *

So the correct approach for my example sample would be to encode it as it is (since it is from the beginning of a real audio track), decode it and add at least two seconds of digital silence in the beginning.

No.

You encode it as it is, and do not test the first 2 seconds

or

You add two seconds of something at the beginning, encode it, and do not test the first 2 seconds.

(first solution is highly preferable)
Alex B
QUOTE(Gabriel @ Nov 6 2006, 13:28) *

QUOTE(Alex B @ Nov 6 2006, 13:06) *

So the correct approach for my example sample would be to encode it as it is (since it is from the beginning of a real audio track), decode it and add at least two seconds of digital silence in the beginning.

No.

You encode it as it is, and do not test the first 2 seconds

or

You add two seconds of something at the beginning, encode it, and do not test the first 2 seconds.

(first solution is highly preferable)


The example sample demonstrates a problem in the first few seconds of the track. It represents a real life situation. Just try for example the L3enc version I uploaded. The guitar chords in the very beginning are very bad.

I am not removing the first two seconds when I listen to this track outside a listening test.
Alex B
Out of curiosity, I tried the first three seconds of this AC/DC sample with aoTuV b5 @ -q-1, Nero AAC @ ABR 48kbps and l3enc MP3 @ 128 kbps.

Foobar ABX result was 10/10 for all three when compared with the reference.

In my opinion Vorbis and l3enc produced unusable quality. Nero AAC was much better, I would say "slightly annoying".


Edit: I used "-br 48000" with Nero Digital cl encoder v. 1.0.0.2.
Sebastian Mares
Ivan, do you still recommend ABR or is it OK if VBR used?

Does anyone mind the following settings:

Ogg Vorbis AoTuV AO; aoTuV b5 [20061024] (based on Xiph.Org's libVorbis): q-1.0

Nero HE-AAC Nero AAC codec / May 1 2006: VBR, Q0.20

WMA Standard Windows Media Audio 9.2: VBR Quality 10, 44 kHz, stereo 1-pass VBR

WMA Professional Windows Media Audio 10 Professional: 48 kbps, 44 kHz, 2 channel 16 bit 1-pass CBR

The settings were chosen so that all encoders reach more or less the same bitrate with my material. Bitrate tables are welcome.

Edit: WMA Professional will reach 48 kbps with all material because it encodes with CBR. The other encoders produce ~50 kbps.

If developers and majority of the community agrees with this, I suggest we should start discussing samples. Should we use some samples from the HE-AAC test? I also have some files I would like to post (in case I didn't already), like a Vangelis and a Uriah Heep one.
Ivan Dimkovic
QUOTE

Ivan, do you still recommend ABR or is it OK if VBR used?


I'm fine with both - ABR should provide less quality deviation, but VBR should score a bit higher on average.

Up to you guys.
Sebastian Mares
Sorry, but I am afraid I did not understand. What do you mean with "ABR should provide less quality deviation"? blush.gif
Ivan Dimkovic
I meant - ABR quality (subjective grade) is more consistent, with "shorter" confidence intervals than VBR at that bitrate.

This is because VBR mode could undercode some samples and they would sound slightly less good than when they are coded with ABR mode.

However at average VBR is indeed a bit better.

Sebastian Mares
QUOTE(Gabriel @ Nov 6 2006, 12:28) *

You encode it as it is, and do not test the first 2 seconds

or

You add two seconds of something at the beginning, encode it, and do not test the first 2 seconds.

(first solution is highly preferable)


Gabriel, but what if a song doesn't start "fading in" but like Alex B pointed out with the AC/DC sample?
Sebastian Mares
How many samples should we use, 12?
benski
QUOTE(Sebastian Mares @ Nov 6 2006, 10:46) *

Ivan, do you still recommend ABR or is it OK if VBR used?


Whatever mode was used in the HE-AAC listening test should be used for this test, also.
IgorC
QUOTE(Sebastian Mares @ Nov 6 2006, 09:45) *

How many samples should we use, 12?

Last time there were 18 samples in multi-aac test. Now it's multi-codec test. So more people should be interesting in it. 18-20 samples?
Sebastian Mares
Well, I think 18 samples is maximum.
Gabriel
QUOTE(Sebastian Mares @ Nov 6 2006, 18:36) *

Sorry, but I am afraid I did not understand. What do you mean with "ABR should provide less quality deviation"? blush.gif

What Ivan is telling is that he's not totally confident in his VBR mode ;-)

Full VBR is a matter of trusting your psymodel, which most of the time is not perfect. If your codec is efficient enough compared to competitors, it's usually safer to rely on ABR (ie VBR is not worth the risk if you are good enough).
(now you know why iTunes is ABR and not fully VBR, and why it is recommended to use Lame in VBR)


QUOTE(Sebastian Mares @ Nov 6 2006, 19:35) *

Gabriel, but what if a song doesn't start "fading in" but like Alex B pointed out with the AC/DC sample?

If you really want to test the start of your sample, you would have two choices:

*re-rip the samples with 2 extra seconds at the beginning
*add 2 seconds of silence at the start of the sample
Ivan Dimkovic
QUOTE

Full VBR is a matter of trusting your psymodel, which most of the time is not perfect. If your codec is efficient enough compared to competitors, it's usually safer to rely on ABR (ie VBR is not worth the risk if you are good enough).


Actually,

Looking here:

http://www.hydrogenaudio.org/forums/index....showtopic=41191

It looked like Nero VBR @48 kbits/s was just a bit better than ABR.

However, at such a low bit-rate I don't believe there are big benefits of using true VBR - there is not too much space to scale the bit-rate down before sound start to degrade a lot - which means that there won't be space to scale it up, either - in case of need.

So, ABR should do just fine.
Sebastian Mares
OK, ABR for Nero then. If everything else is fine, we should focus on samples now.
Alex B
We should remember that in the test we should use a setting that should produce the best average quality with various complete audio tracks. So if Ivan recommends ABR to users who are going to encode a complete audio library at about 48 kbps then it should be used.

If the recommendation is VBR then it should be tested even if a certain set of selected test samples would possibly result a bit better quality in ABR mode... *

Edit

* ... or when the ABR mode would be a safer choice for winning this particular test, like Gabriel explained.
Alex B
Here's a bitrate table and graph in Excel format. I used my usual set of 25 various full length tracks:

bitrates_48kbps_test.xls

Average bitrates:

Nero Digital 1.0.0.2 -br 48000 => 48 kbps
Nero Digital 1.0.0.2 -q 0.21 => 50 kbps
Nero Digital 1.0.0.2 -q 0.20 => 48 kbps
WMA 10 Pro CBR 48 kbps => 48 kbps
WMA 9.2 standard VBR10 => 47 kbps
Vorbis aoTuV beta 5 -q -1 => 49 kbps


Some of you may find the following screenshot interesting too. Some track peaks of my test file set, starting from the highest peak:

IPB Image

Any comments?


EDIT

I tested Nero -q 0.2 and changed Nero -q 0.205 to -q 0.2 since it is the selected test option (it was: Nero -q 0.205 => 49 kbps). Also the linked Excel file is updated.
This is a "lo-fi" version of our main content. To view the full version with more information, formatting and images, please click here.
Invision Power Board © 2001-2008 Invision Power Services, Inc.