Stars
Official implementation of "Rethinking Garment Conditioning in Diffusion-based Virtual Try-On (Re-CatVTON)"
An official implementation of "Hulk: A Universal Knowledge Translator for Human-Centric Tasks"
akanametov / yolo-face
Forked from ultralytics/ultralytics
YOLO Face 🚀 in PyTorch
Effortless data labeling with AI support from Segment Anything and other awesome models.
The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models
PyTorch implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
State-of-the-art 2D and 3D Face Analysis Project
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
The code repository of the paper (AAAI-2025) "CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing"
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A summary of patent information database URLs from all over the world.
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
