I am wondering what the fundamental difference is between speech coding and audio coding that makes a speech coder (such as Speex, which utilises the CELP algorithm) perform better, in terms of output quality, than a general audio coder (such as LAME, which utilises the MP3 coding algorithm) when given speech input. Likewise, why does a general audio coder tend to produce better results when given non-speech audio input, even at low bitrates?
Also, if there is any reading material that would help me understand their fundamental differences, could you please point me to it?
