Data for: Ok Google, What Am I Doing? Acoustic Activity Recognition Bounded by Conversational Assistant Interactions
收藏DataCite Commons2026-03-31 更新2026-05-05 收录
下载链接:
https://dataverse.tdl.org/citation?persistentId=doi:10.18738/T8/OCWAZW
下载链接
链接失效反馈官方服务:
资源简介:
An annotated dataset of audio interactions with a conversational assistant with background sounds of 19 activities. </br>
<br/>Abstract: Conversational assistants in the form of stand-alone devices such as Amazon Echo and Google Home have become popular and embraced by millions of people. By serving as a natural interface to services ranging from home automation to media players, conversational assistants help people perform many tasks with ease, such as setting timers, playing music and managing to-do lists. While these systems offer useful capabilities, they are largely passive and unaware of the human behavioral context in which they are used. In this work, we explore how off-the-shelf conversational assistants can be enhanced with acoustic-based human activity recognition by leveraging the short interval after a voice command is given to the device. Since always-on audio recording can pose privacy concerns, our method is unique in that it does not require capturing and analyzing any audio other than the speech-based interactions between people and their conversational assistants. In particular, we leverage background environmental sounds present in these short duration voice-based interactions to recognize activities of daily living. We conducted a study with 14 participants in 3 different locations in their own homes. We showed that our method can recognize 19 different activities of daily living with average precision of 84.85% and average recall of 85.67% in a leave-one-participant-out performance evaluation with 30-second audio clips bound by the voice interactions. </br>
<br/>IRB approved under ID: 2016020035-MODCR01
提供机构:
Texas Data Repository
创建时间:
2021-01-14



