Skip to content

Standalone VAD: Standalone Voice Activity Detection


Warning

Do not use this software-based VAD in conjunction with a hardware-based VAD. The interaction between the two will adversely affect performance.

Overview

Voice Activity Detection (VAD) preprocesses an audio stream to determine whether or not it contains voice. It is typically used as a gate in front of higher-CPU algorithms that perform more complex tasks such as speech recognition, keyword spotting, voice authentication, etc. in order to optimize power consumption. There are several different methods for performing VAD, each with its own accuracy, latency and resource trade-offs.

Amazon Wake Word Engine offers a way to do this through its own software-based implementations.

EnergyDetection

EnergyDetection is an acoustic energy detector that triggers off any acoustic energy in the input, be it human or otherwise. See EnergyDetector for more details.

FAQ

See the VAD section of the FAQ.