Skip to content
View simplewhite9's full-sized avatar

Block or report simplewhite9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 57 4 Updated Oct 2, 2025

[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)

Python 817 137 Updated Nov 5, 2025

[NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding

Python 46 1 Updated Sep 21, 2025

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

Python 160 18 Updated Sep 23, 2025

Sparking "Thinking with Videos" via Reinforcement Learning

Python 145 6 Updated Oct 30, 2025

[NeurIPS 2025🔥]Main source code of SRPO framework.

Python 188 19 Updated Nov 25, 2025

An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,675 196 Updated Jan 10, 2026

Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents

Jupyter Notebook 28 5 Updated Nov 24, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,337 1,604 Updated Jan 30, 2026

Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation) using a novel three-stage RL curriculum. Includes the Time-…

Python 63 2 Updated Jun 11, 2025

[CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online

Python 89 5 Updated Oct 7, 2025

FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)

Python 34 4 Updated Apr 17, 2025

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Python 46 2 Updated Nov 25, 2025

Online video temporal grounding

Python 14 Updated Oct 20, 2025

Official PyTorch Implementation for Advancing Bayesian Optimization via Learning Correlated Latent Space (CoBO)

Python 18 3 Updated Apr 22, 2025

[NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"

31 Updated Nov 15, 2025

Official PyTorch implementation of "Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection" (CVPR 2024).

Python 41 8 Updated Apr 19, 2024

Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025

30 2 Updated Jul 30, 2025

Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)

Python 20 Updated Aug 1, 2025

[ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 509 16 Updated Nov 18, 2025

CoS: Chain-of-Shot Prompting for Long Video Understanding

Python 53 5 Updated Feb 13, 2025

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,076 139 Updated Dec 20, 2025

[NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding

Jupyter Notebook 160 4 Updated Jul 12, 2025

Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025

Python 15 1 Updated Jan 15, 2025

Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main)

Python 12 Updated Mar 10, 2025

Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning", AAAI 2025

Python 23 1 Updated Jan 26, 2025

Official Implementation (Pytorch) of the "LLaMo: Large Language Model-based Molecular Graph Assistant", NeurIPS 2024

Python 33 4 Updated Feb 12, 2025

Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024

Python 35 Updated Oct 31, 2025

Official Implementation (Pytorch) of "DAVI: Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems", ECCV 2024 Oral paper

Python 74 4 Updated Aug 16, 2024

Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".

Python 32 2 Updated Mar 10, 2025
Next