Stars
Let's make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
A lightweight, powerful framework for multi-agent workflows
The official Python library for the OpenAI API
Knowledge base for software engineers and research engineers
A generic, spec-compliant, thorough implementation of the OAuth request-signing logic
[CVPR 2025 Workshop] CatV2TON is a lightweight DiT-based visual virtual try-on model, capable of supporting try-on for both images and videos.
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) …
Solutions for Object Oriented Design Problems
Official inference repo for FLUX.1 models
Official repository of In-Context LoRA for Diffusion Transformers
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
[ICCV 2021] GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion
High-resolution models for human tasks.
🌀 OpenAPI to TypeScript codegen. Production-ready SDKs, Zod schemas, TanStack Query hooks, and 20+ plugins. Used by Vercel, OpenCode, and PayPal.
Python scripts for the Segment Anything 2 (SAM2) model in ONNX
Hiera: A fast, powerful, and simple hierarchical vision transformer.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
OpenTelemetry backend in a Docker image
[ECCV 2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
FOSS Image background remover with 10 open source rmbg models
PyTorch code and models for the DINOv2 self-supervised learning method.
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
