Lately I've read a lot about dithering here on HA and done some "experiments", mainly resampling artificial signals using SSRC and CEP with different settings. What sparked my interest: I've got a live concert DVD with an uncompressed 48 kHz, 16-bit, 2-channel audio stream, and I wanted to create an audio CD as an "archive" plus mp3 files for portable use (--alt-preset standard), each with decent quality. OK, I admit: best possible quality, wasting a lot of time and gaining a lot of knowledge.
What I'll write next is mainly what I filtered out of what I've read, so if I'm wrong, please correct me.
For decent resampling it's necessary to raise the resolution in a first step, e.g. from 16 bit to 24 or 32 bit. The next step is changing the sample rate, using algorithms I don't have a clue about.
1. Could someone explain to me why it is like that, please? Maybe it's possible to generate an example for this intentionally, e.g. "Generate a 100Hz sine tone in CEP - resample it from 44.1kHz to 48kHz and the other way round 10 times using ... settings and you'll hear ..."?
2. I've read a post here by Frank Klemm IIRC but I can't find it anymore. It made me think that it's a good idea to apply noise with the same frequency response as the background noise of the source if you can suppose that noise shaped dithering has been applied to the source already (e.g. audio stream of a DVD). What do you think of this?
3. To create noise for this "customised" dithering I would create white noise with CEP, do a frequency analysis of the source's background noise, apply an equalizer filter to the white noise that matches that analysis, reduce the volume of the resulting shaped noise and add it to the already downsampled but still 32-bit source. Then I'd lower the resolution to 16 bit (truncating, no extra dithering). Any suggestions to improve this?
4. What amplitude should my "customised" shaped noise have before I add it to the source signal?
5. I'm wondering about the same in CEP: There's a "Dither depth (bits)" setting. What value gives the best "avoid distortion/add noise" ratio when using CEP's built-in dithering, and why?
6. AFAIK the --alt-presets work better with 44.1 kHz than with 48 kHz input, and there's some lowpass applied. So my idea is to use noise-shaped dithering at 48 kHz (noise in the 20 kHz+ frequency range) or even add a 22.05 kHz sine tone with +/-1 amplitude to the source, because both will be completely cut away by LAME's lowpass. Will this work, or could there be a bad surprise?
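
To make questions 4 and 5 a bit more concrete, here's a rough Python sketch of the effect I mean. It has nothing to do with how CEP or SSRC actually work internally; it's just my own toy model. The assumptions are mine: a 100 Hz tone with an amplitude of 0.4 LSB (so it sits below one 16-bit step), rounding instead of truncating, and plain white triangular (TPDF) dither with a peak of +/-1 LSB rather than the shaped noise from question 3. All names (`tone`, `quantize`, `estimated_amplitude`) are made up for illustration.

```python
import math
import random

FS = 48000       # sample rate in Hz (same as the DVD stream)
FREQ = 100       # test tone frequency in Hz
AMP_LSB = 0.4    # tone amplitude in units of one 16-bit step (assumed value)
N = FS           # one second, i.e. a whole number of tone periods


def tone(n):
    """High-resolution (float) sine, scaled so that 1.0 equals one LSB."""
    return [AMP_LSB * math.sin(2 * math.pi * FREQ * i / FS) for i in range(n)]


def quantize(samples, dither_lsb=0.0, rng=None):
    """Round to whole LSBs, optionally adding TPDF dither first.

    dither_lsb is the peak dither amplitude in LSBs; the difference of two
    uniform random numbers gives a triangular distribution in (-1, 1).
    """
    out = []
    for x in samples:
        d = 0.0
        if dither_lsb > 0:
            d = dither_lsb * (rng.random() - rng.random())
        out.append(round(x + d))
    return out


def estimated_amplitude(quantized):
    """Correlate the quantized signal with the original sine's shape
    to see how much of the 100 Hz tone survived."""
    acc = 0.0
    for i, q in enumerate(quantized):
        acc += q * math.sin(2 * math.pi * FREQ * i / FS)
    return 2.0 * acc / len(quantized)


signal = tone(N)
plain = quantize(signal)                              # no dither
dithered = quantize(signal, 1.0, random.Random(1))    # +/-1 LSB TPDF dither

amp_plain = estimated_amplitude(plain)
amp_dithered = estimated_amplitude(dithered)
print("tone amplitude without dither:", amp_plain)
print("tone amplitude with dither:   ", amp_dithered)
```

Without dither every sample rounds to zero, so the tone is simply gone; with dither the output is noisy but the tone is still in there at roughly its original level. Changing the `1.0` in the `quantize` call is my stand-in for playing with the noise amplitude, which is exactly the trade-off I'm asking about.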
Thanks for your answers
tigre