Wake Word Pre-Roll Overview

Definition

The following diagram illustrates the structure of a typical Alexa utterance, and defines the term "wake word pre-roll". pre_roll_in_alexa_utterance The pre-roll audio segment is the first segment of a wake word-initiated audio utterance, preceding the wake word audio segment.

AVS Pre-Roll Streaming Requirements

AVS Streaming Requirements mandate that each AVS SpeechRecognizer request MUST provide a 500 millisecond span of pre-roll audio ahead of the wake word audio segment. Additional context is provided in the linked requirements document regarding the importance of pre-roll for optimal AVS performance, taking cloud-based wake word verification as an example.

Please read the AVS streaming requirements document linked above in full, and also note that although the document is titled "Requirements for Cloud-Based Wake Word Verification", pre-roll audio has an importance over and above just cloud-based wake word verification. Many downstream audio algorithms require or benefit from the additional pre-wake-word acoustic context that pre-roll provides, including media-induced wake suppression and automatic speech recognition. Failure to send sufficient pre-roll will degrade the AVS response to your device's requests.

More Information

See the Pre-Roll Integration section in this guide for more details.