JavaScript Language Background Image

Vivid Background Audio Generation Based on Large Language Models and AudioLDM

Abstract: This paper describes a background audio and speech generation system for the Inspirational and Convincing Audio Gener-ation Challenge 2024. Our system mainly includes three modules, namely, ...

IEEE

Multimodal Large Language Models (MLLMs) for Object Detection from Thermal Images: Initial Experiments

Abstract: The integration of thermal imaging data with Multimodal Large Language Models (MLLMs) constitutes an exciting opportunity for improving the safety and functionality of autonomous driving ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Vivid Background Audio Generation Based on Large Language Models and AudioLDM

Multimodal Large Language Models (MLLMs) for Object Detection from Thermal Images: Initial Experiments

Trending now