The field of audio-visual event localisation and scene understanding explores how systems can jointly analyse auditory and visual cues to accurately identify, segment and classify events within ...
Demonstration of different visual-inertial odometry methods: (a) traditional VIO methods, which rely on handcrafted features and geometry-based optimization; (b) existing deep learning-based methods, ...