Text Align JavaScript

Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment

Abstract: Text-to-video retrieval systems have recently made significant progress by utilizing pre-trained models trained on large-scale image-text pairs. However, most of the latest methods primarily ...

IEEE

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

Abstract: Text-based Visual Question Answering (TextVQA) focuses on answering questions about the scene text in images. Most works in this field uses transformer based models to modeling the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment

Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

Trending now