Abstract: We introduce EPIC-SOUNDS, a large-scale dataset of audio annotations capturing temporal extents and class labels within the audio stream of the egocentric videos. We propose an annotation ...