Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features
System-directed speech detection (DDSD) is the binary classification job of distinguishing between queries directed at a voice assistant versus aspect ...