Abstract: This paper describes a background audio and speech generation system for the Inspirational and Convincing Audio Gener-ation Challenge 2024. Our system mainly includes three modules, namely, ...
Abstract: The integration of thermal imaging data with Multimodal Large Language Models (MLLMs) constitutes an exciting opportunity for improving the safety and functionality of autonomous driving ...