
DepthAnything/Video-Depth-Anything - GitHub
Jan 21, 2025 · This work presents Video Depth Anything based on Depth Anything V2, which can be applied to arbitrarily long videos without compromising quality, consistency, or …
PKU-YuanGroup/Video-LLaVA - GitHub
😮 Highlights Video-LLaVA exhibits remarkable interactive capabilities between images and videos, despite the absence of image-video pairs in the dataset.
Video-R1: Reinforcing Video Reasoning in MLLMs - GitHub
Feb 23, 2025 · Inspired by DeepSeek-R1's success in eliciting reasoning abilities through rule-based RL, we introduce Video-R1 as the first work to systematically explore the R1 paradigm …
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model …
Jun 3, 2024 · Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding This is the repo for the Video-LLaMA project, which is working on empowering …
GitHub - Kosinkadink/ComfyUI-VideoHelperSuite: Nodes related …
Load Video Converts a video file into a series of images video: The video file to be loaded force_rate: Discards or duplicates frames as needed to hit a target frame rate. Disabled by …
GitHub - lllyasviel/FramePack: Lets make video diffusion practical!
Apr 17, 2025 · Lets make video diffusion practical! Contribute to lllyasviel/FramePack development by creating an account on GitHub.
Wan: Open and Advanced Large-Scale Video Generative Models
Feb 25, 2025 · Wan: Open and Advanced Large-Scale Video Generative Models In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models …
Download the Google Meet app
With the Google Meet app, you can: Create or join scheduled or instant cloud-encrypted Google Meet meetings with a link. Ring directly to a Google Workspace, personal account, or phone …
GitHub - stepfun-ai/Step-Video-T2V
We present Step-Video-T2V, a state-of-the-art (SoTA) text-to-video pre-trained model with 30 billion parameters and the capability to generate videos up to 204 frames. To enhance both …
GitHub - visomaster/VisoMaster: Powerful & Easy-to-Use Video …
VisoMaster is a powerful yet easy-to-use tool for face swapping and editing in images and videos. It utilizes AI to produce natural-looking results with minimal effort, making it ideal for both …