Stars
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
A Comprehensive Benchmark Suite for AI Story Visualization
A collection of resources on personalized image generation.
A Training-free Iterative Framework for Long Story Visualization
Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
[CVPR 2025] Official code repository for "Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach"
[NeurlPS2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution
Let us democratise high-resolution generation! (CVPR 2024)
[ACM MM 2024] Reasoning and Correcting Diffusion for HOI Generation
The repository for the paper "Image Inversion: A Survey from GANs to Diffusion and Beyond".
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
[ICCV'21] Official PyTorch implementation for paper "Spatially Conditioned Graphs for Detecting Human–Object Interactions"
This is a warehouse for MobileNetV4-Pytorch-model, can be used to train your image-datasets for vision tasks.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
The official code of OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance (NeurIPS 2024)
Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
Official inference repo for FLUX.1 models