CivArchive
    Folder Image Captioner with Qwen-VL WF - v1.0
    Folder Image Captioner with Qwen-VL

    This ComfyUI workflow allows you to batch caption entire folders of images quickly and efficiently.

    It loads images from a selected folder, resizes them if needed, and generates detailed captions using Qwen-VL-Mod (Qwen3-VL-8B-Instruct-Abliterated). It then saves each original image together with a caption file that shares the same base filename (e.g., photo.jpg + photo.txt).
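
    The load, caption, save flow described above can be sketched outside ComfyUI in plain Python. This is only an illustration of the filename pairing, not the workflow's actual implementation: `caption_folder`, `IMAGE_EXTS`, and the `captioner` callable are hypothetical names, the resize step is omitted, and the callable stands in for whatever model call produces the caption text.

```python
from pathlib import Path
import shutil
from typing import Callable

# Assumed set of image extensions; the real workflow may accept others.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}


def caption_folder(src: Path, dst: Path, captioner: Callable[[Path], str]) -> list[Path]:
    """Copy each image in src to dst and write a caption file with the same base name.

    `captioner` is a hypothetical stand-in for the vision-language model call.
    Returns the paths of the caption files that were written.
    """
    dst.mkdir(parents=True, exist_ok=True)
    written = []
    for image in sorted(src.iterdir()):
        if image.suffix.lower() not in IMAGE_EXTS:
            continue  # skip non-image files
        shutil.copy2(image, dst / image.name)
        # photo.jpg -> photo.txt, so trainers can match image and caption by stem
        caption_path = dst / image.with_suffix(".txt").name
        caption_path.write_text(captioner(image).strip() + "\n", encoding="utf-8")
        written.append(caption_path)
    return written
```

    Calling `caption_folder(Path("input"), Path("dataset"), my_captioner)` with any function that maps an image path to a string would leave `dataset/` holding image and caption pairs that share a base name.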

    Ideal for creating training datasets for LoRAs, character fine-tuning, or any project that requires consistent captions.

    Features:

    • Batch processing directly from folder

    • Saves image + caption with the same name

    • High detail and accuracy thanks to Qwen-VL

    • Captions describe the pose, camera angle, lighting, and location shown in the original image

    Required Custom Nodes:

    • ComfyUI Custom Nodes

    • Qwen-VL-Mod (or Qwen3-VL-8B-Instruct-Abliterated)

    • Resize Image v2

    • Load Image Dataset from Folder

    • Save Image and Text Dataset to Folder

    Created by: bobgus39

    Original profile: https://civarchive.com/user/bobgus39

    Usage: Simply select your image folder and run the workflow. The captions will respect the original pose, camera angle, lighting, and background/location of each image, making them perfect for training consistent characters or scenes.

    Comments (1)

    lowered99gt739 · Apr 3, 2026 · 1 reaction
    CivitAI

    seems to work well, but the descriptions are WAY too long for something like ostris. Trying the "simple description" selection, hopefully with more concise prompts.

    Workflows
    Qwen

    Details

    Downloads: 181
    Platform: CivitAI
    Platform Status: Available
    Created: 4/2/2026
    Updated: 5/27/2026
    Deleted: -

    Files

    folderImageCaptioner_v10.zip

    Mirrors

    HuggingFace (1 mirror)
    CivitAI (1 mirror)