Skip to content

ncnn implementation of Z-Image image generater

License

Notifications You must be signed in to change notification settings

nihui/zimage-ncnn-vulkan

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Z-Image ncnn Vulkan

❗ ❗ ❗ This software is in the early development stage, it may bite your cat

CI download

ncnn implementation of Z-Image image generater.

zimage-ncnn-vulkan uses ncnn project as the universal neural network inference framework.

Download Windows/Linux/macOS Executable for Intel/AMD/NVIDIA/Apple-Silicon GPU

https://github.com/nihui/zimage-ncnn-vulkan/releases

This package includes all the binaries required. It is portable, so no CUDA or PyTorch runtime environment is needed :)

prepare model files

Download the z-image-turbo and z-image model folders to the same directory as the executable file

https://huggingface.co/nihui-szyl/z-image-ncnn/tree/main

About Z-Image

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

https://github.com/Tongyi-MAI/Z-Image

Usages

requirements

  • Minimum (Linux / macOS): 16GB RAM, any Vulkan capable GPU

  • Minimum (Windows):

    Due to WDDM limitations: Vulkan applications can only use half of the system RAM.

    The following condition shall be met: (Half of system RAM) + (GPU memory) >= 16GB

    Examples of valid combinations:

    • Any amount RAM, 16GB dedicated GPU
    • 16GB RAM, 8GB dedicated GPU
    • 24GB RAM, 4GB dedicated GPU
    • 32GB RAM, any Vulkan capable GPU
  • Recommended: 32GB RAM, 16GB dedicated GPU with tensorcore/matrix hardware

Example Command

zimage-ncnn-vulkan.exe -p "風的彷徨." -o output.png

Full Usages

Usage: zimage-ncnn-vulkan -p prompt -o outfile [options]...

  -h                   show this help
  -p prompt            prompt (default=rand)
  -n negative-prompt   negative prompt (optional)
  -o output-path       output image path (default=out.png)
  -s image-size        image resolution (default=1024,1024)
  -l steps             denoise steps (default=auto)
  -r random-seed       random seed (default=rand)
  -m model-path        z-image model path (default=z-image-turbo)
  -g gpu-id            gpu device to use (-1=cpu, default=auto)

If you encounter a crash or error, try upgrading your GPU driver:

Build from Source

  1. Clone this project with all submodules
git clone https://github.com/nihui/zimage-ncnn-vulkan.git
cd zimage-ncnn-vulkan
git submodule update --init --recursive --depth 1
  1. Build with CMake
mkdir build
cd build
cmake ../src
cmake --build . -j 4

Sample Images

expand for full command

zimage-ncnn-vulkan.exe -p "A half-length portrait in the warm light of a convenience store late at night. An East Asian beauty, holding milk, meets your gaze in front of the freezer." -r 2065820720

zimage

expand for full command

zimage-ncnn-vulkan.exe -p "A Suzhou garden at night under a bright autumn moon, silvery light bathing the pavilions and rocks, paper lanterns glowing warmly along the corridors, the pond reflecting the moon and lantern light, cool color palette with accents of warmth, mysterious and poetic atmosphere." -r 1593388185

zimage

expand for full command

zimage-ncnn-vulkan.exe -p "Cinnamoroll, Sanrio character, white puppy with long floppy ears, blue eyes, chubby cheeks, plump tail like a cinnamon roll, simple blue bow on right ear, high-quality anime figure, prize figure style, seamless joints, glossy paint finish, detailed sculpting, sitting pose with head tilted, innocent expression, standing on a fluffy cloud-shaped display base, soft gradient pastel background, studio lighting, product photography, sharp focus, shallow depth of field" -r 1362178531

zimage

expand for full command

zimage-ncnn-vulkan.exe -p "a cute girl taking a mirror selfie, full-body view in a full-length mirror, holding a large smartphone in front of her face, the phone screen is visible and glowing, slightly obscuring her eyes or lower face, long twin tails, soft makeup, blush, wearing a trendy and cute outfit, in a cozy, clean bedroom with soft lighting, mirror may have a decorative frame, shallow depth of field, digital photo style, cinematic lighting, sharp focus on the phone and reflection, natural skin texture, Instagram aesthetic, candid feeling" -r 2029158043

zimage

expand for full command

zimage-ncnn-vulkan.exe -p "風的彷徨." -r 77

zimage

expand for full command

zimage-ncnn-vulkan.exe -p "風的彷徨." -r 777

zimage

expand for full command

zimage-ncnn-vulkan.exe -p "Anya Forger hugging a large white dog (Bond), both looking happy, in a park, sunny day, grass field, action shot, from Spy x Family" -r 499008670

zimage

expand for full command

zimage-ncnn-vulkan.exe -p "traditional Chinese handscroll painting in the style of “清明上河图”, horizontal panorama, bustling riverside cityscape in Northern Song Dynasty, Bianliang city, with Rainbow Bridge at the center, boats on the river, shops along the street, teahouses, inns, hundreds of diverse people engaged in daily activities, travelers, merchants, scholars, entertainers, children playing, ink and color on silk, fine brushwork for architecture and figures, subtle expressive brushwork for trees and landscape, soft natural colors, high aerial viewpoint, continuous narrative, lively, prosperous, detailed, harmonious, capturing a complete day of city life, extremely wide aspect ratio, vanishing perspective, Chinese seals and calligraphy on the margins, slight aged paper texture" -r 1321390901 -s 2048,512

zimage

Original Z-Image Project

Other Open-Source Code Used