Abstract: Many recent multimodal trackers prioritize RGB as the dominant modality, treating other modalities as auxiliary and fine-tuning separately on various multimodal tasks. This imbalance ...
Abstract: Visual Working Memory (VWM) is the ability to maintain task-relevant visual information over a brief delay after direct visual input has been removed, and it is essential for learning new ...