Video Understanding Python Arrays

Human brain cells on a chip learned to play Doom in a week

Neuron-powered computer chips can now be easily programmed to play a first-person shooter game, bringing biological computers a step closer to useful applications ...

GitHub

EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting

where Submission folder contains 5 videos from HOI4D and 4 videos from EPIC-KITCHENS, which are used to generate the results in Table 1 and Figure 3 of the paper, Webpage folder contains 2 additional ...

IEEE

Ego-VPA: Egocentric Video Understanding with Parameter-Efficient Adaptation

Abstract: Video understanding typically requires fine-tuning the large backbone when adapting to new domains. In this paper, we leverage the egocentric video foundation models (EgoVFMs) based on video ...

IEEE

Domain Adaptation of VLM for Soccer Video Understanding

Abstract: Vision Language Models (VLMs) have demonstrated strong performance in multi-modal tasks by effectively aligning visual and textual representations. However, most video understanding VLM ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results