Detecting only human speech in different SNR conditions

Hi all :-),

I am currently working on an academic project to detect human speech under varying SNR conditions. I have implemented a method (using SNR band energy and SNR peaks in the frequency domain) that works fine for detecting voice activity, but it fails to isolate human speech: tapping sounds, horn sounds, and keyboard sounds are also detected as voice. I have tried speech feature extraction, but I am not able to make reliable decisions because the threshold values vary between environments.

Could someone please suggest how to detect only human voice activity? Any suggestions would be very helpful.

Thank you,
ksam917