What's the meaning of "sample rate" in MP3/Vorbis/AAC?, Considering the audio was transformed |
What's the meaning of "sample rate" in MP3/Vorbis/AAC?, Considering the audio was transformed |
Oct 7 2012, 03:44
Post
#1
|
|
![]() Group: Members Posts: 89 Joined: 3-November 04 Member No.: 17971 |
What's the meaning of "sample rate" in MP3/Vorbis/AAC? Wouldn't it be meaningless after transform, same as bit depth?
|
|
|
|
![]() |
Oct 7 2012, 05:05
Post
#2
|
|
|
Group: Members Posts: 4133 Joined: 2-September 02 Member No.: 3264 |
No, the transform occurs on sampled data and so the sample rate determines which samples correspond to which frequencies.
|
|
|
|
Oct 9 2012, 05:21
Post
#3
|
|
![]() Group: Members Posts: 89 Joined: 3-November 04 Member No.: 17971 |
I guess the way I put it does seem like a retread. The thing is, some statements above make it sound like it could be an implementation problem rather than a complete inherent impossibility.
JAZ says "the compressed data is specific for one max frequency". Makes it sound like he refers to decoding to higher frequencies. Then the mention of Opus using 48kHz internally for all sample rates (though, afterwards I found an old discussion that suggest 44.1kHz *is* a problem). Benski says "If you didn't know the sample rate, you wouldn't be able to decode the audio", makes it sound like just needing the rate for the math. And the "easily" in "can decode easily to sample rates that are multiples" like a problem of complexity. ...sample those sinusoids at any intervals... Could you explain what you mean more precisely? Its difficult to understand what your idea is. This is what I was wondering about. Perhaps it's just my oversimplified idea of what the transformed data represents. Taken to an extreme, let's say you have 10 seconds of 1kHz sine wave at a fixed level. If encoding ultimately just means storing the frequency, level, and phase, it seems you should be able to recreate that 1kHz sine wave with as many or as few samples as you'd like. What would actually be stored in such a case? the sample rate determines which samples correspond to which frequencies. What does "which samples correspond to which frequencies" mean? This post has been edited by sheh: Oct 9 2012, 05:22 |
|
|
|
Oct 9 2012, 19:03
Post
#4
|
|
|
Group: Members Posts: 4133 Joined: 2-September 02 Member No.: 3264 |
Benski says "If you didn't know the sample rate, you wouldn't be able to decode the audio", makes it sound like just needing the rate for the math. And the "easily" in "can decode easily to sample rates that are multiples" like a problem of complexity. I'm interpreting your question to actually ask "Is there some way to start with (M)DCT transformed data and then transform to time domain at an arbitrary sample rate that is algorithmically more efficient than a combination of inverse (M)DCT and resampling. My answer is no. The choice of domain makes no difference with respect to sampling and starting or ending in one domain or the other is orthogonal to questions about sampling. I can see no way to exploit one to make the other easier. If encoding ultimately just means storing the frequency, level, and phase, it seems you should be able to recreate that 1kHz sine wave with as many or as few samples as you'd like. What would actually be stored in such a case? The process of fitting a function to data for purposes of changing the sampling rate is called resampling. So your idea here is probably something like "resampling then fourier transforming". This will work but its no different than the usual process once you actually implement it. You've just expressed the same process in a different way. the sample rate determines which samples correspond to which frequencies. What does "which samples correspond to which frequencies" mean? Are you familiar with the discrete fourier transform (or fast fourier transform)? In these (and all) discrete fourier transforms, the choice of sampling rate determines which samples in the Fourier domain contain which frequencies. This post has been edited by saratoga: Oct 9 2012, 19:04 |
|
|
|
sheh What's the meaning of "sample rate" in MP3/Vorbis/AAC? Oct 7 2012, 03:44
pdq Still. since the data are stored as frequencies an... Oct 7 2012, 14:23
[JAZ] When we say that lossy streams don't have a st... Oct 7 2012, 14:44
jensend I'm not intimately acquainted with the details... Oct 7 2012, 16:46
benski fs is part of the MDCT equation. If you didn... Oct 7 2012, 18:32
sheh Okay, so the original fs needs to be plugged in so... Oct 8 2012, 18:33
lvqcl Winamp MP3 decoder still have Full/Half/Quarter qu... Oct 7 2012, 18:56
benski QUOTE (lvqcl @ Oct 7 2012, 13:56) Winamp ... Oct 10 2012, 16:16
saratoga Using a dct means you could easily drop the sample... Oct 7 2012, 19:00
saratoga QUOTE (sheh @ Oct 8 2012, 13:33) Okay, so... Oct 8 2012, 23:41
jensend I still think you're wrong here. Once you... Oct 7 2012, 19:57
saratoga QUOTE (jensend @ Oct 7 2012, 14:57) I sti... Oct 7 2012, 20:16
Dynamic I think one problem is that we're never doing ... Oct 8 2012, 23:04
[JAZ] Mmm.. the tranformation that (most) lossy codecs d... Oct 9 2012, 19:03
Dynamic QUOTE ([JAZ] @ Oct 9 2012, 19:03)... Oct 9 2012, 22:22
pdq So am I understanding correctly that 99.9% of deco... Oct 10 2012, 17:12
saratoga QUOTE (benski @ Oct 10 2012, 11:16) QUOTE... Oct 10 2012, 17:30
[JAZ] QUOTE (Dynamic @ Oct 9 2012, 23:22) As [J... Oct 10 2012, 17:52
saratoga QUOTE ([JAZ] @ Oct 10 2012, 12:52... Oct 10 2012, 18:20
[JAZ] Winamp with the in-house decoder (i.e. not the fra... Oct 10 2012, 18:41
saratoga QUOTE ([JAZ] @ Oct 10 2012, 13:41... Oct 10 2012, 18:55
Dynamic My recollection is that Winamp 1.7 running under W... Oct 11 2012, 16:14
sheh So sample rate matters. I suppose without being fa... Oct 11 2012, 23:20![]() ![]() |
|
Lo-Fi Version | Time is now: 23rd May 2013 - 21:16 |