HE AAC Audio Quality & Tuning |
![]() ![]() |
HE AAC Audio Quality & Tuning |
May 23 2012, 15:39
Post
#1
|
|
|
Group: Members Posts: 7 Joined: 23-May 12 Member No.: 100087 |
I am new to the lossy coding world. I recently became a DROID owner and, in an effort to get down to carrying one device, I ripped all of my music to my DROID at 64 kbps. In the environments I often listen in (Airports, Planes, Rental Cars, Taxi's, etc.) this is good enough for me and has the great benefit of using minimal storage space. The efficiency of a codec that provides satisfactory audio quality while using lower bitrates appeals to me.
My question is whether this codec is still being developed and "tuned"? I guess I am curious as to whether HE-AAC is now about as good as it will ever get. For example, is the crossover point from real sound to where the SBR begins as finely tuned as possible? Will we ever get to the point where a 48 kbps rip is perceived qualitatively by a general audience like a 128 kbps AAC file is today? Thanks! |
|
|
|
May 23 2012, 15:54
Post
#2
|
|
![]() Server Admin Group: Admin Posts: 4808 Joined: 24-September 01 Member No.: 13 |
Which codec?
HE-AAC is a standard. There are several implementations of it. |
|
|
|
May 23 2012, 17:31
Post
#3
|
|
|
Group: Members Posts: 1315 Joined: 3-January 05 From: Argentina, Bs As Member No.: 18803 |
My question is whether this codec is still being developed and "tuned"? Yes, Fraunhofer and Apple HE-AAC encoders have been updated not so long time ago. For HE-AAC ~64 kbps (variable bitrate VBR) I'd go with Fraunhofer encoder from the latest Winamp. Will we ever get to the point where a 48 kbps rip is perceived qualitatively by a general audience like a 128 kbps AAC file is today? No, that won't happen. Developers improve a quality of both HE-AAC and LC-AAC. HE-AAC is excelent for low bitrates. For bitrates more than 80 kbps use LC-AAC. Some wiki articles about this format will be a good start. This post has been edited by IgorC: May 23 2012, 17:36 |
|
|
|
May 23 2012, 18:59
Post
#4
|
|
|
Group: Members Posts: 7 Joined: 23-May 12 Member No.: 100087 |
|
|
|
|
May 27 2012, 11:03
Post
#5
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
For HE-AAC ~64 kbps (variable bitrate VBR) I'd go with Fraunhofer encoder from the latest Winamp. Thanks a lot, I take that as a compliment Will we ever get to the point where a 48 kbps rip is perceived qualitatively by a general audience like a 128 kbps AAC file is today? I agree with Igor. 48-kbps HE-AAC uses parametric tools for the high frequencies, which don't allow you to perfectly reconstruct the original spectrum. Anyone with normal hearing who concentrates during a comparison test will hear the degradation over the 128-kbps encoding, I would say. Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
May 28 2012, 16:22
Post
#6
|
|
|
Group: Members Posts: 1315 Joined: 3-January 05 From: Argentina, Bs As Member No.: 18803 |
Chris,
The Average scores are: Apple HE-AAC - 2.85 FhG HE-AAC - 3.01 One would think that the difference is pretty small. Yes, but I've performed the test on these codecs before and the results were the same. Let me see if I can find those results on my hard disk later. Also the scores are pretty low because I didn't include the low anchor. The quality is actually pretty good and if it was a public test the average scores would be in range of 3.9-4.0. ![]() CODE Apple HEAAC FhG HEAAC 3 3,7 4,1 4 1,8 4,3 3,2 3,2 3,1 2,8 1,5 1 2 2,4 3 3 2 3,1 1 1,7 3 3,1 1 1 2,7 3,5 3,2 3,6 3,8 3,8 1 1 3,8 3,8 2,5 2,5 4 3,6 3,4 3,1 2 2,5 3,3 3,5 3,4 3,4 3,5 4 3,4 2,9 3,3 3,3 3,3 3,6 3,4 3,2 4 3,8 2,8 1,8 I won't post here particular ABC/HR logs as this test wasn't meant to be published. Nothing special, just private stuff. Those are 30 samples from public test 64 kbps (2011). There were some pathological cases where both encoders don't perform well. Those are classical transient samples like castanets, fatboy and eig. FhG did a good job comparing to Apple on all samples except the last one. Sample30. It was also a difficult sample for FhG LC-AAC encoder in last public AAC test. It will be great to see an updates of your encoder. |
|
|
|
May 29 2012, 20:47
Post
#7
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
Awesome, thanks a lot, Igor!
FhG did a good job comparing to Apple on all samples except the last one. Sample30. It was also a difficult sample for FhG LC-AAC encoder in last public AAC test. Yeah, that Sample30 (or 20 in the 96-kbps test) is a pain. I recently added a minimally invasive patch which gives a bit more (hopefully enough) bit-rate to that item without affecting others much. Once that encoder version becomes publicly available, I'll let you know. Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Jun 22 2012, 16:00
Post
#8
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
I recently added a minimally invasive patch which gives a bit more (hopefully enough) bit-rate to that item without affecting others much. Once that encoder version becomes publicly available, I'll let you know. Bump. Just found a forum post saying that Winamp 5.63 has been released. Relevant quotes from the release notes: * Improved: [enc_fhgaac] Added VBR mode 6, plus other minor quality improvements * Fixed: [aacdec] Memory leak when re-syncing to AAC stream * Updated: [aacdec] Fraunhofer AAC Decoder v1.762.208 Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Jun 22 2012, 19:45
Post
#9
|
|
|
Group: Members Posts: 1315 Joined: 3-January 05 From: Argentina, Bs As Member No.: 18803 |
Good news.
Eagerly waiting to get home and try it. Thank You. 22 June. This post has been edited by IgorC: Jun 22 2012, 20:09 |
|
|
|
Jun 22 2012, 21:43
Post
#10
|
|
|
Group: Members Posts: 32 Joined: 22-June 12 Member No.: 100900 |
I recently added a minimally invasive patch which gives a bit more (hopefully enough) bit-rate to that item without affecting others much. Once that encoder version becomes publicly available, I'll let you know. Bump. Just found a forum post saying that Winamp 5.63 has been released. Relevant quotes from the release notes: * Improved: [enc_fhgaac] Added VBR mode 6, plus other minor quality improvements * Fixed: [aacdec] Memory leak when re-syncing to AAC stream * Updated: [aacdec] Fraunhofer AAC Decoder v1.762.208 Chris Hi i have tried encoding in vbr 6 and i find there is hardly any increase in bitrate between 5 and 6. Could you please give me some more info in the changes? |
|
|
|
Jun 22 2012, 21:56
Post
#11
|
|
|
Winamp Developer Group: Developer Posts: 662 Joined: 17-July 05 From: Ashburn, VA Member No.: 23375 |
Hi i have tried encoding in vbr 6 and i find there is hardly any increase in bitrate between 5 and 6. Could you please give me some more info in the changes? I just tested on a "difficult" (kraftwerk-style electronic music) track and saw a substantial difference, 222 kbps for VBR5 and 277 kbps VBR6. For "easy" material, particularly when lacking in high frequency energy, you might not see much of a difference. |
|
|
|
Jun 22 2012, 22:08
Post
#12
|
|
|
Group: Members Posts: 32 Joined: 22-June 12 Member No.: 100900 |
Hi i have tried encoding in vbr 6 and i find there is hardly any increase in bitrate between 5 and 6. Could you please give me some more info in the changes? I just tested on a "difficult" (kraftwerk-style electronic music) track and saw a substantial difference, 222 kbps for VBR5 and 277 kbps VBR6. For "easy" material, particularly when lacking in high frequency energy, you might not see much of a difference. Ok thanks for the info, yeah the audio was prob easy to encode like you said, its good to see updates on this already excellent encoder |
|
|
|
Jun 26 2012, 03:39
Post
#13
|
|
|
Group: Members Posts: 7 Joined: 23-May 12 Member No.: 100087 |
Is there a reference or an explanation somewhere for what frequency SBR begins when using HE AAC? It would seem to me that the LC "cut point" would be higher in frequency the higher the bit rate used, so that 48 kbps HE AAC has more use of SBR, relatively, than 80 kbps? Is this correct?
|
|
|
|
Jun 26 2012, 04:07
Post
#14
|
|
|
Group: Members Posts: 4138 Joined: 2-September 02 Member No.: 3264 |
|
|
|
|
Jun 26 2012, 17:20
Post
#15
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
Is there a reference or an explanation somewhere for what frequency SBR begins when using HE AAC? It would seem to me that the LC "cut point" would be higher in frequency the higher the bit rate used, so that 48 kbps HE AAC has more use of SBR, relatively, than 80 kbps? Is this correct? Correct. The SBR start (or cross-over) frequency is chosen by the HE-AAC encoder from a table of possible frequencies (or rather, fractions of the sampling rate). The creation of the table is described in the ISO/IEC standard and a bit complicated, so I won't go into detail. Just so much: the minimum start frequency is around 4 kHz, the maximum is half the sampling rate (in "downsampled" SBR mode) or a quarter of the sampling rate ("normal" SBR mode). So what saratoga mentioned is just one possible - and rare - configuration. Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Jun 26 2012, 19:03
Post
#16
|
|
|
Group: Members Posts: 4138 Joined: 2-September 02 Member No.: 3264 |
Is there a reference or an explanation somewhere for what frequency SBR begins when using HE AAC? It would seem to me that the LC "cut point" would be higher in frequency the higher the bit rate used, so that 48 kbps HE AAC has more use of SBR, relatively, than 80 kbps? Is this correct? Correct. The SBR start (or cross-over) frequency is chosen by the HE-AAC encoder from a table of possible frequencies (or rather, fractions of the sampling rate). The creation of the table is described in the ISO/IEC standard and a bit complicated, so I won't go into detail. Just so much: the minimum start frequency is around 4 kHz, the maximum is half the sampling rate (in "downsampled" SBR mode) or a quarter of the sampling rate ("normal" SBR mode). So what saratoga mentioned is just one possible - and rare - configuration. I didn't realize that. How does that work with LC backwards compatibility? Would you get a badly distorted file when listening on an LC only decoder? |
|
|
|
Jun 26 2012, 19:32
Post
#17
|
|
|
Group: Developer Posts: 618 Joined: 6-December 08 From: Erlangen Germany Member No.: 64012 |
I didn't realize that. How does that work with LC backwards compatibility? Would you get a badly distorted file when listening on an LC only decoder? Depends on what you consider "badly distorted". It's playable but the frequency bandwidth is limited to that of the AAC LC core coder signal, so it will sound dull/muffled. If the SBR cross-over frequency is 4 kHz, it will probably sound like coming from a telephone on an LC-only decoder. Chris -------------------- If I don't reply to your reply, it means I agree with you.
|
|
|
|
Jun 26 2012, 20:11
Post
#18
|
|
|
Winamp Developer Group: Developer Posts: 662 Joined: 17-July 05 From: Ashburn, VA Member No.: 23375 |
Side note: This is also the rationale behind the so-called "downsampled SBR" mode (which isn't exposed in the Winamp AAC encoder at the moment, as I'm not sure how well it's tuned). If you run the LC core @ 44.1khz and the SBR @ 88.2khz, and downsample to 44.1khz in the decoder, you can now effectively move the crossover point above 11,025hz
Edit: I should have fully read what Chris wrote before replying, as he already mentions this This post has been edited by benski: Jun 26 2012, 20:14 |
|
|
|
Jun 26 2012, 21:00
Post
#19
|
|
|
Group: Members Posts: 7 Joined: 23-May 12 Member No.: 100087 |
In my recent experience, there is a surprisingly large difference in sound quality between music at 64kbps and 80kbps, both encoded with iTunes HE-AAC, which is why I posed the question about SBR. Much more pronounced than I expected. An interesting comparison (perhaps to assess how well SBR really works) would be a number of 80kbps HE-AAC samples to 96kbps LC-AAC samples.
My paramount objective is to keep file size small so I can fit everything onto my phone, but I may move to 80kbps because of what I perceive to be much better results. Thanks for the answers! |
|
|
|
Jun 27 2012, 00:26
Post
#20
|
|
![]() Group: Developer Posts: 2986 Joined: 2-December 07 Member No.: 49183 |
The crossover frequency for Apple HE-AAC (CVBR 64 and CVBR 80) is ~9600 Hz.
(Decode M4A file to WAV and make a spectrogram with SoX: sox.exe decoded.wav -n remix 1,2i spectrogram -w K -z 90 ) |
|
|
|
Jun 27 2012, 01:17
Post
#21
|
|
|
Group: Members Posts: 1315 Joined: 3-January 05 From: Argentina, Bs As Member No.: 18803 |
An interesting comparison (perhaps to assess how well SBR really works) would be a number of 80kbps HE-AAC samples to 96kbps LC-AAC samples. Some time ago I've performed a personal blind test between Apple HE-AAC/LC-AAC both at 80 kbps. http://www.hydrogenaudio.org/forums/index....c=74781&hl= I've prefered a fair but lowpassed LC-AAC (15 kHz) instead of parametrically extended bandwidth (SBR) up to ~20 kHz During the last ye/ar I've performed some blind tests with Apple HE and LC codecs on another sets of samples and other hardware. The results were same. It will be great to see your own comparison. The crossover frequency for Apple HE-AAC (CVBR 64 and CVBR 80) is ~9600 Hz. That can explain why Apple HE-AAC CVBR 64 and CVBR 80 have very similar quality in my personal comparison . This post has been edited by IgorC: Jun 27 2012, 01:19 |
|
|
|
Jun 27 2012, 01:50
Post
#22
|
|
|
Group: Members Posts: 7 Joined: 23-May 12 Member No.: 100087 |
I may try to do this if I can spare the time - are there web resources that can facilitate ABX testing for a solo practitioner?
From a purely unscientific point of view, at such low bit rates, it would seem like the addition of 16,000 additional bits would inherently result in better sound (64kbps to 80kbps). But I agree that one's ears are the ultimate arbiter. My own are not perfect any more - too much time in loud environments, either music or travel or rifle ranges in the military...! This post has been edited by db1989: Jun 27 2012, 06:30
Reason for edit: deleting unnecessary full quote of above post
|
|
|
|
Jun 27 2012, 06:34
Post
#23
|
|
|
Group: Super Moderator Posts: 4360 Joined: 23-June 06 Member No.: 32180 |
are there web resources that can facilitate ABX testing for a solo practitioner? IgorC’s linked test specifically mentions ABC/HR:QUOTE ( http://abchr.sourceforge.net/ ) ABC/HR is an audio comparison tool that allows users to self-conduct double-blind listening tests of digital audio codecs. From a purely unscientific point of view, at such low bit rates, it would seem like the addition of 16,000 additional bits would inherently result in better sound (64kbps to 80kbps). Again, in IgorC’s test, it did.
|
|
|
|
Jun 30 2012, 01:03
Post
#24
|
|
|
Group: Members Posts: 7 Joined: 23-May 12 Member No.: 100087 |
Some time ago I've performed a personal blind test between Apple HE-AAC/LC-AAC both at 80 kbps.
http://www.hydrogenaudio.org/forums/index....c=74781&hl= I've prefered a fair but lowpassed LC-AAC (15 kHz) instead of parametrically extended bandwidth (SBR) up to ~20 kHz During the last ye/ar I've performed some blind tests with Apple HE and LC codecs on another sets of samples and other hardware. The results were same. It will be great to see your own comparison. Turns out I have the same preferences as you for classical music, but the opposite for rock music. |
|
|
|
![]() ![]() |
|
Lo-Fi Version | Time is now: 26th May 2013 - 09:26 |