CivArchive
    Z Image Base - Text Encoder (FP4)
    Preview 119003007

    We'll have Z-Image-Base available for on-site generation shortly! Stay tuned!

    Z-Image is a powerful and highly efficient image generation model with 6B parameters. It is currently has three variants:

    • 🚀 Z-Image-Turbo – A distilled version of Z-Image that matches or exceeds leading competitors with only 8 NFEs (Number of Function Evaluations). It offers ⚡️sub-second inference latency⚡️ on enterprise-grade H800 GPUs and fits comfortably within 16G VRAM consumer devices. It excels in photorealistic image generation, bilingual text rendering (English & Chinese), and robust instruction adherence.

    • 🧱 Z-Image-Base (this model) – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.

    • ✍️ Z-Image-Edit – A variant fine-tuned on Z-Image specifically for image editing tasks. It supports creative image-to-image generation with impressive instruction-following capabilities, allowing for precise edits based on natural language prompts.

    Original ComfyUI Models: https://huggingface.co/Comfy-Org/z_image

    Original HF Repo: https://huggingface.co/Tongyi-MAI/Z-Image

    Checkpoint
    ZImageBase

    Details

    Downloads
    530
    Platform
    CivitAI
    Platform Status
    Available
    Created
    1/27/2026
    Updated
    2/1/2026
    Deleted
    -

    Files

    zImageBase_textEncoderFP4.safetensors

    Mirrors