Wavelets, why use in vorbis? |
![]() ![]() |
Wavelets, why use in vorbis? |
Feb 11 2004, 01:11
Post
#1
|
|
![]() Group: Members Posts: 47 Joined: 11-January 04 From: Old Saybrook, Connecticut, USA Member No.: 11142 |
as i have heard, wavelets are to be implemented at some point in vorbis (if ever). I did some reading about them (most of it i didn't understand) and was curious as to how it would be used to encode music.
|
|
|
|
Feb 11 2004, 01:31
Post
#2
|
|
![]() Group: Developer Posts: 1245 Joined: 16-December 02 From: Australia Member No.: 4097 |
Wavelets have certain desirable properties that make them attractive in coding. First thing is that they can be easily implemented in a subband coding algorithm (like in MPC or MPEG-1 Layer 2) so they enjoy low complexity decoding. Secondly, they have a good balance between temporal resolution as well as frequency, so they tend to handle transients better than transform coders, which have great frequency resolution but terrible temporal resolution.
This post has been edited by QuantumKnot: Feb 11 2004, 01:34 |
|
|
|
Feb 11 2004, 01:56
Post
#3
|
|
![]() Group: Members Posts: 47 Joined: 11-January 04 From: Old Saybrook, Connecticut, USA Member No.: 11142 |
are they difficult to implement? have they been implemented in any other codecs?
|
|
|
|
Feb 11 2004, 02:06
Post
#4
|
|
![]() Group: Developer Posts: 1245 Joined: 16-December 02 From: Australia Member No.: 4097 |
QUOTE (MGuti @ Feb 11 2004, 10:56 AM) are they difficult to implement? have they been implemented in any other codecs? Not at all. Implementing wavelets is often as simple as replacing the filter coefficients of a subband coder. I'm not sure if wavelets have been used in other audio coders (I assume MPC and MPEG-1 Layer 2 use QMFs) but I'm sure they've been tried before since they were heavily hyped in the signal processing community over the past few years. Wavelets were tried in video coding, image coding, speech recognition, etc. The most successful application of wavelets has been in image coding where they are astronomically superior to transform coders. I don't think they has found a place in audio (yet). I guess the good thing about Vorbis II is, when Monty does get the wavelets to work, the adoption of it will be easier since Vorbis I has already made a moderate impact. Essentially you could say, Vorbis I has provided the customer base for Vorbis II, a wavelet-based coder, which is certainly an advantage over some unknown company that has released their own unknown wavelet coder. |
|
|
|
Feb 11 2004, 02:10
Post
#5
|
|
![]() Rarewares admin Group: Members Posts: 7515 Joined: 30-September 01 From: Brazil Member No.: 81 |
QUOTE (MGuti @ Feb 10 2004, 10:56 PM) are they difficult to implement? have they been implemented in any other codecs? http://www.hydrogenaudio.org/forums/index....indpost&p=27546 -------------------- Get up-to-date binaries of Lame, AAC, Vorbis and much more at RareWares:
http://www.rarewares.org |
|
|
|
Feb 11 2004, 02:19
Post
#6
|
|
|
Group: Members Posts: 160 Joined: 16-January 03 Member No.: 4597 |
I distinctly remember somebody saying, perhaps even on this board, that attempts to implement wavelet-based audio compression did not provide any significant advantage over other methods.
Edit: thanks Roberto, that's what I meant. ;-) This post has been edited by Doctor: Feb 11 2004, 02:20 |
|
|
|
Feb 11 2004, 02:23
Post
#7
|
|
![]() Group: Developer Posts: 1245 Joined: 16-December 02 From: Australia Member No.: 4097 |
QUOTE (Doctor @ Feb 11 2004, 11:19 AM) I distinctly remember somebody saying, perhaps even on this board, that attempts to implement wavelet-based audio compression did not provide any significant advantage over other methods. Edit: thanks Roberto, that's what I meant. ;-) Or perhaps you meant the thread about the telephone conversation with Frank Klemm where he said he sees no advantages of subband coders (eg. MPC) over transform coders (eg. AAC, Vorbis), other than the psychoacoustic model. This post has been edited by QuantumKnot: Feb 11 2004, 02:24 |
|
|
|
Feb 11 2004, 02:58
Post
#8
|
|
![]() Rarewares admin Group: Members Posts: 7515 Joined: 30-September 01 From: Brazil Member No.: 81 |
QUOTE (QuantumKnot @ Feb 10 2004, 11:23 PM) Or perhaps you meant the thread about the telephone conversation with Frank Klemm where he said he sees no advantages of subband coders (eg. MPC) over transform coders (eg. AAC, Vorbis), other than the psychoacoustic model. Nope. He was probably referring to this post: http://www.hydrogenaudio.org/forums/index....indpost&p=26677 And, most importantly, http://www.hydrogenaudio.org/forums/index....indpost&p=24193 This post has been edited by rjamorim: Feb 11 2004, 02:59 -------------------- Get up-to-date binaries of Lame, AAC, Vorbis and much more at RareWares:
http://www.rarewares.org |
|
|
|
Feb 11 2004, 03:07
Post
#9
|
|
|
Group: Members Posts: 160 Joined: 16-January 03 Member No.: 4597 |
Roberto: exactly. ;-)
|
|
|
|
Feb 11 2004, 03:14
Post
#10
|
|
![]() Group: Members Posts: 47 Joined: 11-January 04 From: Old Saybrook, Connecticut, USA Member No.: 11142 |
so, from what i've read here, theres no real reason to get excited about wavelets in vorbis. although it could potentially improve quality in some areas, it is no better than the commonly used methods.
alas, and i was excited for a while about what wavelets might have accomplished for vorbis (if it was so great everyone would be using it i guess). |
|
|
|
Feb 11 2004, 03:23
Post
#11
|
|
![]() Rarewares admin Group: Members Posts: 7515 Joined: 30-September 01 From: Brazil Member No.: 81 |
QUOTE (MGuti @ Feb 11 2004, 12:14 AM) (if it was so great everyone would be using it i guess). Very good point. MPEG seems to be rather fast to add state-of-the-art technologies to their codecs (SBR, PS, H264, etc.) This post has been edited by rjamorim: Feb 11 2004, 03:26 -------------------- Get up-to-date binaries of Lame, AAC, Vorbis and much more at RareWares:
http://www.rarewares.org |
|
|
|
Feb 11 2004, 08:38
Post
#12
|
|
![]() Group: Developer Posts: 1245 Joined: 16-December 02 From: Australia Member No.: 4097 |
Compared with images, audio certainly is a different beast.
|
|
|
|
Feb 11 2004, 08:55
Post
#13
|
|
![]() Group: Members Posts: 158 Joined: 16-May 03 From: nsk.su Member No.: 6653 |
At least wavelets have advantage over PQF, because they're lossless and hence do not add noise. Also, you can make non-uniform transform with different time-frequency resolution in different parts of the spectrum (some static choices like one band for 11025-22050 to use some sbr-like method, or dynamic choices - better time resolution in parts with transients and better frequency resolution in tonal parts - something like mp3's mixed blocks, but easier to implement). But since audio signals usually have a lot of separated frequencies wavelets do not have such great advantages like they do in image coding.
-Eugene -------------------- The greatest programming project of all took six days; on the seventh day the programmer rested. We've been trying to debug the !@#$%&* thing ever since. Moral: design before you implement.
|
|
|
|
Feb 11 2004, 09:01
Post
#14
|
|
![]() Group: Developer Posts: 1245 Joined: 16-December 02 From: Australia Member No.: 4097 |
Just a minor sidepoint....the new buzzword in image coding literature these days seems to be 'curvelets'. I'm sure it will be a very long time till we see much mention of 'curvelets' for audio coding.
|
|
|
|
Feb 11 2004, 09:03
Post
#15
|
|
|
Nero MPEG4 developer Group: Developer Posts: 1466 Joined: 22-September 01 Member No.: 8 |
Despite some theoretical backings that Wavelets might be better than subband/transform filterbanks in audio codecs (namely because of possibility of designing T/F resolution similar to critical band analysis in the human ear) so far there has not been >any< proof that a wavelet audio coder might achieve better performance than current state of the art.
Except some marketing claims from various industry start-ups (that are probably gone now), and few university projects - I haven't seen any successul solution that could suprass even MP3, leave alone some better coding solutions. In the last scientific test I know of, it has been shown that switched 128/1024 MDCT filterbank has best properties for audio coding (this filterbank is used in AAC and Vorbis - and probably WMA) I'd like to see some breakthrough there, but I am not very optimistic about wavelet technology as a filterbank in audio coder. |
|
|
|
![]() ![]() |
|
Lo-Fi Version | Time is now: 19th June 2013 - 14:13 |