Train the Deep Learning Ahem Detector with two sets of audio files, “a negative sample with clean voice/sound” (minimum 3 minutes) and “a positive one with ‘ahem’ sounds concatenated” (minimum 10s) and it will detect “ahems” in any voice sample thereafter.
It was developed by the people behind the Data Science at Home podcast and can be used to automatically remove “ahems” from episodes.
deeplearning-ahem-detector [Worldofpiggy/Github]
(via 4 Short Links)