Hi, can someone tell me, or point me to some links, where I can find answers to questions like these:
1. In speech coding we typically use frames of around 20 ms for prediction. What is a suitable frame length (in time) for 44.1 kHz audio? (Hereafter, "audio" refers to 44.1 kHz, 16-bit, CD-quality audio.)
2. Unlike speech, audio can contain more than one frequency at a time, so how do we predict pitch for it? Do we need to use a very small frame length, over which the pitch does not change?
3. Speech follows a characteristic spectral pattern: a plot of the speech spectral envelope shows 3 peaks of decreasing magnitude. Does audio follow any such pattern that helps prediction using FIR linear prediction?
4. What other properties of audio are used in linear prediction models for music?
5. In LPC we model the vocal tract as part of the source model for speech. When we apply an LPC model to audio, do we model the tube of the musical instrument in the same way?
P.S. I understand that most audio coders use DCT/MDCT-based algorithms, but I am interested in linear prediction of audio, which is used (at least) in lossless audio coding such as FLAC.
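
To make the questions concrete, here is a minimal sketch of what I mean by FIR linear prediction on a single frame. Everything in it is an assumption for illustration only: the 20 ms frame is just the speech figure from question 1 carried over to 44.1 kHz, the predictor order of 8 is arbitrary, the two-tone test signal stands in for "more than one frequency at a time", and the helper names `lpc_coefficients` and `prediction_residual` are made up. It uses the plain autocorrelation method with floating-point coefficients and is not how FLAC actually does it (FLAC quantizes the coefficients and entropy-codes the residual).

```python
import numpy as np

def lpc_coefficients(frame, order):
    """Autocorrelation-method LPC: solve the normal equations R a = r."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    # Symmetric Toeplitz matrix built from lags 0..order-1
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    # Tiny diagonal loading (an assumption) keeps nearly singular frames solvable
    R += 1e-9 * r[0] * np.eye(order)
    return np.linalg.solve(R, r[1:order + 1])

def prediction_residual(frame, coeffs):
    """e[n] = x[n] - sum_k a[k] * x[n-k]; the first `order` samples are left as-is."""
    frame = np.asarray(frame, dtype=float)
    order = len(coeffs)
    residual = frame.copy()
    for n in range(order, len(frame)):
        past = frame[n - order:n][::-1]  # x[n-1], x[n-2], ..., x[n-order]
        residual[n] = frame[n] - np.dot(coeffs, past)
    return residual

fs = 44100                               # sample rate in Hz
frame_ms = 20                            # frame length borrowed from speech coding
frame_len = int(fs * frame_ms / 1000)    # 882 samples per frame
order = 8                                # arbitrary predictor order

# Two simultaneous tones as a stand-in for polyphonic audio
t = np.arange(frame_len) / fs
frame = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 1320 * t)

a = lpc_coefficients(frame, order)
e = prediction_residual(frame, a)

print("samples per frame:", frame_len)
print("signal energy:    ", np.sum(frame[order:] ** 2))
print("residual energy:  ", np.sum(e[order:] ** 2))
```

The ratio of residual energy to signal energy is the quantity I would expect to depend on the frame length and on how many simultaneous frequencies the frame contains, which is what questions 1 and 2 are really about.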
---
dev
