Folder Image Captioner with Qwen-VL
This ComfyUI workflow allows you to batch caption entire folders of images quickly and efficiently.
It loads images from a selected folder, resizes them if needed, generates high-quality detailed captions using Qwen-VL-Mod (Qwen3-VL-8B-Instruct-Abliterated), and saves both the original image and its corresponding caption file with the exact same filename (e.g., photo.jpg + photo.txt).
Ideal for creating training datasets for LoRAs, character fine-tuning, or any project that requires consistent captions.
Features:
Batch processing directly from folder
Saves image + caption with the same name
High detail and accuracy thanks to Qwen-VL
Maintains the same pose, camera angle, lighting, and location from the original image
Required Custom Nodes:
ComfyUI Custom Nodes
Qwen-VL-Mod (or Qwen3-VL-8B-Instruct-Abliterated)
Resize Image v2
Load Image Dataset from Folder
Save Image and Text Dataset to Folder
Created by: bobgus39 Original profile: https://civarchive.com/user/bobgus39
Usage: Simply select your image folder and run the workflow. The captions will respect the original pose, camera angle, lighting, and background/location of each image, making them perfect for training consistent characters or scenes.
Description
FAQ
Comments (1)
seems to work well, but the descriptions are WAY too long for something like ostris. Trying the "simple description" selection, hopefully with more concise prompts.
