I used a dataset of around 400 images in total. All of them were captioned manually, but the captions aren't perfect because there were so many to do.
Use n1kepr0 in your prompt, for example "black n1kepr0 shorts". Lots of variations should work.
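If you want to try a bunch of variations quickly, something like this rough sketch can build the prompt strings for you. Only the n1kepr0 token and the "black ... shorts" pattern come from the description above; the helper name `build_prompts`, the other colours, and the extra tag are made-up examples, so swap in whatever you like.

```python
# Token taken from the usage example above; everything else here is illustrative.
TRIGGER = "n1kepr0"

def build_prompts(colors, extras=("",)):
    """Return one prompt per colour/extra combination, each containing the token."""
    prompts = []
    for color in colors:
        for extra in extras:
            base = f"{color} {TRIGGER} shorts"
            # Append the extra tag only when one is given.
            prompts.append(f"{base}, {extra}" if extra else base)
    return prompts

# Hypothetical colours and tag, just to show the shape of the output.
prompts = build_prompts(["black", "grey"], extras=("", "gym"))
for p in prompts:
    print(p)
```

Paste the resulting strings into whatever UI or script you generate with. This only builds text and doesn't load the model itself.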
Hope you enjoy :)
Nike_Pros.safetensors
203269_training_data.zip