[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.

Python 6,417 775 Updated Apr 16, 2026

baidubce / Qianfan-VL

Qianfan-VL: Domain-Enhanced Universal Vision-Language Models

381 28 Updated Mar 18, 2026

facebookresearch / vjepa2

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 3,648 439 Updated Mar 23, 2026

zhiyuandaily / Any2Full

Any2Full: Prompting Depth Anything for Depth Completion in One Stage

Python 37 2 Updated Mar 14, 2026

EnVision-Research / DVD

DVD: Deterministic Video Depth Estimation with Generative Priors

Python 284 21 Updated Apr 7, 2026

PKU-YuanGroup / Helios

Helios: Real Real-Time Long Video Generation Model

Python 1,703 130 Updated Apr 16, 2026

Aryan-Garg / gQIR

CVPR 2026 - Generative Quanta Image Reconstruction

Jupyter Notebook 5 1 Updated Apr 9, 2026

OpenVGLab / OmniLottie

[CVPR 2026🔥] 🧑‍🎨 OmniLottie, an open-sourced multi-modal instructed vector animation generator that produces Lottie JSONs.

Python 638 36 Updated Apr 6, 2026

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 115,380 19,257 Updated Apr 18, 2026

andimarafioti / faster-qwen3-tts

Real-time text-to-speech with Qwen3-TTS

Python 872 123 Updated Apr 17, 2026

tue-mps / videomt

[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).

Python 194 18 Updated Mar 4, 2026

huggingface / skills

Give your agents the power of the Hugging Face ecosystem

Python 10,208 636 Updated Apr 16, 2026

XciD / claude-bar

Swift 11 1 Updated Mar 14, 2026

Rolling-Sink / Rolling-Sink

Official implementation of "Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion"

Python 88 4 Updated Mar 30, 2026

kyutai-labs / hibiki-zero

A real-time and multilingual speech translation model

Python 234 22 Updated Feb 13, 2026

TIGER-AI-Lab / OpenResearcher

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Python 662 70 Updated Apr 16, 2026

Soul-AILab / SoulX-Singer

Official inference code for SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis

Python 562 63 Updated Apr 13, 2026