About 92,000,000 results
Open links in new tab
  1. DepthAnything/Video-Depth-Anything - GitHub

    Jan 21, 2025 · This work presents Video Depth Anything based on Depth Anything V2, which can be applied to arbitrarily long videos without compromising quality, consistency, or …

  2. Video-R1: Reinforcing Video Reasoning in MLLMs - GitHub

    Feb 23, 2025 · Video-R1 significantly outperforms previous models across most benchmarks. Notably, on VSI-Bench, which focuses on spatial reasoning in videos, Video-R1-7B achieves a …

  3. GitHub - DAMO-NLP-SG/Video-LLaMA: [EMNLP 2023 Demo] …

    Jun 3, 2024 · Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding This is the repo for the Video-LLaMA project, which is working on empowering …

  4. GitHub - MME-Benchmarks/Video-MME: [CVPR 2025] Video …

    We introduce Video-MME, the first-ever full-spectrum, M ulti- M odal E valuation benchmark of MLLMs in Video analysis. It is designed to comprehensively assess the capabilities of MLLMs …

  5. GitHub - k4yt3x/video2x: A machine learning-based video super ...

    A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018. - k4yt3x/video2x

  6. Download the Google Meet app

    With the Google Meet app, you can: Create or join scheduled or instant cloud-encrypted Google Meet meetings with a link. Ring directly to a Google Workspace, personal account, or phone …

  7. GitHub - Lightricks/LTX-Video: Official repository for LTX-Video

    LTX-Video is the first DiT-based video generation model that contains all core capabilities of modern video generation in one model: synchronized audio and video, high fidelity, multiple …

  8. GitHub - wxbool/video-srt-windows: 这是一个可以识别视频语音自 …

    这是一个可以识别视频语音自动生成字幕SRT文件的开源 Windows-GUI 软件工具。. Contribute to wxbool/video-srt-windows development by creating ...

  9. Generate Video Overviews in NotebookLM - Google Help

    Video Overviews, including voices and visuals, are AI-generated and may contain inaccuracies or audio glitches. NotebookLM may take a while to generate the Video Overview, feel free to …

  10. 【EMNLP 2024 】Video-LLaVA: Learning United Visual ... - GitHub

    😮 Highlights Video-LLaVA exhibits remarkable interactive capabilities between images and videos, despite the absence of image-video pairs in the dataset.