SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.
Video Chain of Thought: code for the ICML 2024 paper "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"
🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.
Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models
[ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos
A Gradio-based demonstration for the AllenAI SAGE-MM-Qwen3-VL-4B-SFT_RL multimodal model, specialized in video reasoning tasks. Users upload MP4 videos, provide natural language prompts (e.g., "Describe this video in detail" or custom questions), and receive detailed textual analyses.
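A demo with this upload-prompt-analyze flow can be sketched with Gradio's `Interface` API. The `answer_video_question` stub and all labels below are illustrative assumptions, not the repository's actual code; a real implementation would run the SAGE-MM-Qwen3-VL-4B-SFT_RL model in place of the placeholder:

```python
def answer_video_question(video_path: str, prompt: str) -> str:
    """Placeholder inference step: a real demo would sample frames from
    the uploaded video and run the multimodal model on them."""
    return f"Analysis of {video_path} for prompt: {prompt}"

def build_demo():
    # Gradio is imported lazily so the inference stub above stays
    # usable without the UI dependency installed.
    import gradio as gr
    return gr.Interface(
        fn=answer_video_question,
        inputs=[gr.Video(label="MP4 video"), gr.Textbox(label="Prompt")],
        outputs=gr.Textbox(label="Model analysis"),
    )

if __name__ == "__main__":
    build_demo().launch()
```

The lazy import keeps the inference function separable from the UI layer, which is a common pattern when the model code is also exercised from scripts or tests.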
🎥 Generate videos with advanced multimodal reasoning to enhance understanding and interaction, pushing the boundaries of video content creation.
🎥 Explore cutting-edge research focused on reasoning with video models, featuring key papers and projects in the field of video intelligence.