Detecting only human speech under varying SNR conditions

2013-11-28 06:31:39

Hi all :-),

I am currently working on an academic project to detect human speech under varying SNR conditions. I have implemented a method (using SNR band energy and SNR peaks in the frequency domain) that works fine for detecting voice activity, but it fails to detect only human speech: tapping sounds, horn sounds, and keyboard sounds are also detected as voice. I have tried speech feature extraction, but I am not able to make decisions because the threshold values vary across environments.
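To make the problem concrete, here is a minimal sketch of a band-SNR detector of this general kind. It is illustrative only and not the exact implementation: the function names, the 8 kHz sample rate, the 256-sample frame, the band edges, and the 6 dB threshold are all assumptions chosen for the example.

```python
import numpy as np

def frame_psd(frame):
    """Hann-windowed periodogram (power per FFT bin) of one frame."""
    return np.abs(np.fft.rfft(frame * np.hanning(len(frame)))) ** 2

def band_snr_db(frame, noise_psd, sample_rate, bands):
    """SNR in dB for each (low_hz, high_hz) band, relative to a noise PSD estimate."""
    psd = frame_psd(frame)
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    eps = 1e-12  # avoids log(0) and division by zero
    snrs = []
    for low_hz, high_hz in bands:
        mask = (freqs >= low_hz) & (freqs < high_hz)
        ratio = (psd[mask].sum() + eps) / (noise_psd[mask].sum() + eps)
        snrs.append(10.0 * np.log10(ratio))
    return np.array(snrs)

def is_active(frame, noise_psd, sample_rate, bands, threshold_db=6.0):
    """Flag the frame as active if any band exceeds the (assumed) SNR threshold."""
    return bool(np.any(band_snr_db(frame, noise_psd, sample_rate, bands) > threshold_db))

# Assumed example settings, not values from the actual project.
SAMPLE_RATE = 8000
FRAME_LEN = 256
BANDS = [(300, 1000), (1000, 2000), (2000, 3400)]

rng = np.random.default_rng(0)

# Noise PSD estimated by averaging periodograms of noise-only frames.
noise_frames = 0.01 * rng.standard_normal((50, FRAME_LEN))
noise_psd = np.mean([frame_psd(f) for f in noise_frames], axis=0)

t = np.arange(FRAME_LEN) / SAMPLE_RATE
background = 0.01 * rng.standard_normal(FRAME_LEN)

# A 500 Hz tone stands in for a voiced sound.
tone = background + 0.5 * np.sin(2 * np.pi * 500 * t)

# A single-sample impulse stands in for a tap or key click.
click = background.copy()
click[FRAME_LEN // 2] += 1.0

for name, frame in [("background", background), ("tone", tone), ("click", click)]:
    print(name, is_active(frame, noise_psd, SAMPLE_RATE, BANDS))
```

In this sketch the click is flagged as active just like the tone, because an impulse spreads energy across every band. Band energy alone measures how loud a frame is relative to the noise floor, not whether the sound is speech.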

Could someone please suggest how to detect only human voice activity? Any suggestions will be very helpful.

Thank you,
ksam917
