long8v

🤓

Happy Research

JeongYeon Nam long8v

46 followers · 118 following

twelve-labs
Seoul

Stars

thunlp / LLaVA-UHD

LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs

Python 409 21 Updated Dec 20, 2025

mit-han-lab / streaming-vlm

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 815 53 Updated Oct 15, 2025

tokenbender / mHC-manifold-constrained-hyper-connections

implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880

Python 202 15 Updated Jan 4, 2026

YanjieZe / awesome-humanoid-robot-learning

A Paper List for Humanoid Robot Learning.

1,491 70 Updated Jan 6, 2026

naver-ai / mambamia

Official Implementation of MambaMia (AAAI-26 Oral)

3 Updated Dec 21, 2025

naver-ai / LLaVA-AV-SSM

Official repository of the paper "Does audio matter for modern video-LLMs and their benchmarks?"

3 Updated Nov 24, 2025

2U1 / Qwen-VL-Series-Finetune

An open-source implementation for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,565 189 Updated Dec 19, 2025

Siyou-Li / QTSplus

Query-aware Token Selector (QTSplus), a lightweight yet powerful visual token selection module that serves as an information gate between the vision encoder and LLMs.

Python 129 9 Updated Nov 29, 2025

Becomebright / ReKV

Official PyTorch Code of ReKV (ICLR'25)

Python 88 6 Updated Nov 4, 2025

OpenGVLab / VideoChat-Flash

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 492 14 Updated Nov 18, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 3,258 408 Updated Jan 9, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 2,074 271 Updated Jan 10, 2026

OpenBMB / ChatDev

ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration

Python 28,148 3,558 Updated Jan 10, 2026

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 961 93 Updated Sep 10, 2025

yunlong10 / Awesome-Video-LMM-Post-Training

🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training

Python 225 10 Updated Nov 21, 2025

showlab / videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Python 616 66 Updated Nov 26, 2025

KlingTeam / VANS

Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Python 83 Updated Dec 1, 2025

TencentARC / ARC-Chapter

Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

33 1 Updated Nov 19, 2025

hrlics / HoPE

[NeurIPS 2025] HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models

Python 22 1 Updated Nov 30, 2025

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,372 271 Updated Jan 9, 2026

zai-org / GLM-V

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,112 145 Updated Dec 18, 2025

BytedanceDouyinContent / SAIL-RL

SAIL-RL: Guiding MLLMs in When and How to Think via Dual-Reward RL Tuning

9 Updated Nov 8, 2025

dmlc / decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 2,394 214 Updated Jul 17, 2024

marinero4972 / Open-o3-Video

Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"

Python 127 7 Updated Dec 18, 2025

bytedance / video-SALMONN-2

video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsin…

Python 136 15 Updated Dec 22, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,395 53 Updated Dec 30, 2025

allenai / molmo

Code for the Molmo Vision-Language Model

Python 854 83 Updated Dec 12, 2024

EvolvingLMMs-Lab / lmms-engine

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 693 27 Updated Jan 9, 2026

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,065 139 Updated Dec 18, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 40,038 5,134 Updated Jan 8, 2026
