CivArchive
    ERNIE Image Quants - Turbo Mixed NVFP4
    Preview 127727808
    Preview 127727809
    Preview 127727811
    Preview 127727810
    Preview 127727812

    Ernie in different quants:

    • Mixed FP8 - Mostly fp8_e4m3 some are not quantized, fast.

    • Mixed NVFP4 - NVFP4 except final layers to give a higher quality finish, faster than FP8

    • NVFP4 - Mostly NVFP4 - Fastest

    Note: You will only see speedups from NVFP4 on Blackwell series NVIDIA cards.

    https://ernie.baidu.com/blog/posts/ernie-image/

    text_encoders

    vae

    Model Storage Location

    📂 ComfyUI/
    ├── 📂 models/
    │   ├── 📂 diffusion_models/
    │   │   └── ernie-image-turbo-nvfp4.safetensors
    │   ├── 📂 text_encoders/
    │   │   ├── ministral-3-3b.safetensors
    │   │   └── ernie-image-prompt-enhancer.safetensors
    │   └── 📂 vae/
    │       └── flux2-vae.safetensors

    Description

    Has the more important layers as FP8 the rest as NVFP4 to give a good mix of the two

    Checkpoint
    Ernie

    Details

    Downloads
    20
    Platform
    CivitAI
    Platform Status
    Available
    Created
    4/17/2026
    Updated
    4/17/2026
    Deleted
    -

    Files

    ernieImageQuants_turboMixedNVFP4.safetensors