About 92,600,000 results
Open links in new tab
  1. DepthAnything/Video-Depth-Anything - GitHub

    Jan 21, 2025 · This work presents Video Depth Anything based on Depth Anything V2, which can be applied to arbitrarily long videos without compromising quality, consistency, or …

  2. PKU-YuanGroup/Video-LLaVA - GitHub

    😮 Highlights Video-LLaVA exhibits remarkable interactive capabilities between images and videos, despite the absence of image-video pairs in the dataset.

  3. Video-R1: Reinforcing Video Reasoning in MLLMs - GitHub

    Feb 23, 2025 · Inspired by DeepSeek-R1's success in eliciting reasoning abilities through rule-based RL, we introduce Video-R1 as the first work to systematically explore the R1 paradigm …

  4. Video-LLaMA: An Instruction-tuned Audio-Visual Language Model …

    Jun 3, 2024 · Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding This is the repo for the Video-LLaMA project, which is working on empowering …

  5. GitHub - Kosinkadink/ComfyUI-VideoHelperSuite: Nodes related …

    Load Video Converts a video file into a series of images video: The video file to be loaded force_rate: Discards or duplicates frames as needed to hit a target frame rate. Disabled by …

  6. GitHub - lllyasviel/FramePack: Lets make video diffusion practical!

    Apr 17, 2025 · Lets make video diffusion practical! Contribute to lllyasviel/FramePack development by creating an account on GitHub.

  7. Wan: Open and Advanced Large-Scale Video Generative Models

    Feb 25, 2025 · Wan: Open and Advanced Large-Scale Video Generative Models In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models …

  8. Download the Google Meet app

    With the Google Meet app, you can: Create or join scheduled or instant cloud-encrypted Google Meet meetings with a link. Ring directly to a Google Workspace, personal account, or phone …

  9. GitHub - stepfun-ai/Step-Video-T2V

    We present Step-Video-T2V, a state-of-the-art (SoTA) text-to-video pre-trained model with 30 billion parameters and the capability to generate videos up to 204 frames. To enhance both …

  10. GitHub - visomaster/VisoMaster: Powerful & Easy-to-Use Video …

    VisoMaster is a powerful yet easy-to-use tool for face swapping and editing in images and videos. It utilizes AI to produce natural-looking results with minimal effort, making it ideal for both …