This is tests of training a flux model, and all licensing conditions for the Flux Dev model apply. You can find the details here: https://huggingface.co/black-forest-labs/FLUX.1-dev. It's a great model!
A huge thank you to Datacrunch.io for generously providing their resources for training and support. I highly recommend them.
This is Checkpoint. CLIP and VAE is already on
This is my model trained on Flux, and it aims to create more diverse and realistic faces. The results show improvements in anatomy, facial features, and overall realism. However, a side effect is that noise appears in the images. This can be corrected with any upscaler - just find one that works for you. The model was trained on approximately 7,000 images over the course of several hours using powerful H100 cards. If the test results are mostly positive, I will train it on a larger dataset.
The number of training steps was limited due to budget constraints. Additionally, I used a small portion of my dataset, only 7,000 images, to reduce training time. When I get the opportunity, I plan to train a more refined model. You can use this model as you like, but please remember to adhere to the Black Forest Lab license terms.
I welcome any feedback in the comments - feel free to share any errors or shortcomings of the model. This is just an experiment.
Description
This is the NF4 version. This is the checkpoint with Unet, Clip and VAE inside. I've made this version smaller for you. I hope you are happy and can test it.
FAQ
Comments (35)
Since you have NF4, can you also make a Q4_1 version?
It has better support in Comfy UI
https://github.com/city96/ComfyUI-GGUF
Promising model. I'll wait for your next version when you fix the noise issues.
Your side by samples are simply the same image repeated. There is no difference. I lined them up and they are identical.
The NF4 model is clearly different. I prefer the others (who has identical output as you point out - unless there is a mix-up and the output of one model was duplicated in the samples)
If you look closely, there is a difference, but it is not big. The quality drops, but overall the image remains the same.
there huge difference but i have a monitor 2k , i think is be super good if you buy a new monitor , with time the monitor start to dimm and the quality image gone. i have the 27GL83A-B
Not sure if you're viewing on a mobile device, but on a monitor the difference is very clear. The NF4 versions have more natural imperfections in facial anatomy and skin tone, making them a bit more "photorealistic". The v1 seems more "plastic". Definitely a WIP, but on a great trajectory.
Oh I see I was looking at the "U Net Only" model. I do see the differences on the NF4 model.
Seems you trained this on the broken trainer, it looks like it was passed through a mesh cloth. It's all garbled and pixelated at closeup.
great results in ForgeUI. thanks a lot !
Thanks for sharing! How did you turn it into an NF4?
Thank your for your work, it's promising :) Please keep going !
What trainer did you use?
How can I train flux model with custom dataset
Solid start but will wait for an update when you fix noise issues.
I'm pretty new to this. I was able tu run perfectly the U-NET version. But I'd like to know how can I run the NF4 version. I've tried many things considering it included the CLIP and VAE, but nothing work. What are the correct nodes for it?
Thanks!
can I run this with comfyUI, I am getting an error, torch.Size([9437184, 1]) from checkpoint, the shape in current model is torch.Size([6144, 3072]).
if you are using the nf4 model you have to use a nf4 loader. sounds like you are loading the model wrong
That torch size happens because you are using a Lora. Bypass the Lora. NF4 and Loras work in Forge only, until now
full compatible with Ruined Fooocus 1.55 ! thanks a lot !
Pleas tell me, how? I put the model in the model folder, Ruined Fooocus 1.56 sees the model, loads it and it says type unknown. When I click generate, it just crashes. How do you get it to work?
@superuser111 did you find out?
is it dreambooth training?
is anyone having issues generating images with Realistic FLUX? I am.
Hi! What's problem you have?
@Sa_May i am getting an error. i use this model in both unet and load model nodes.
@Sa_May Error(s) in loading state_dict for Flux: size mismatch for img_in.weight: copying a param with shape torch.Size([98304, 1]) from checkpoint, the shape in current model is torch.Size([3072, 64]). size mismatch for time_in.in_layer.weight: copying a param with shape torch.Size([393216, 1]) from checkpoint, the shape in current model is torch.Size([3072, 256]). size mismatch for time_in.out_layer.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for vector_in.in_layer.weight: copying a param with shape torch.Size([1179648, 1]) from checkpoint, the shape in current model is torch.Size([3072, 768]). size mismatch for vector_in.out_layer.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for guidance_in.in_layer.weight: copying a param with shape torch.Size([393216, 1]) from checkpoint, the shape in current model is torch.Size([3072, 256]). size mismatch for guidance_in.out_layer.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for txt_in.weight: copying a param with shape torch.Size([6291456, 1]) from checkpoint, the shape in current model is torch.Size([3072, 4096]). size mismatch for double_blocks.0.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.0.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.0.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.0.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.0.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.0.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.0.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.0.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.0.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.0.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.1.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.1.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.1.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.1.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.1.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.1.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.1.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.1.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.1.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.1.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.2.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.2.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.2.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.2.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.2.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.2.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.2.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.2.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.2.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.2.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.3.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.3.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.3.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.3.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.3.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.3.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.3.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.3.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.3.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.3.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.4.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.4.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.4.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.4.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.4.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.4.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.4.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.4.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.4.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.4.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.5.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.5.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.5.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.5.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.5.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.5.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.5.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.5.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.5.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.5.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.6.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.6.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.6.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.6.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.6.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.6.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.6.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.6.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.6.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.6.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.7.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.7.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.7.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.7.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.7.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.7.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.7.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.7.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.7.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.7.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.8.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.8.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.8.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.8.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.8.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.8.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.8.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.8.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.8.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.8.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.9.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.9.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.9.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.9.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.9.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.9.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.9.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.9.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.9.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.9.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.10.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.10.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.10.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.10.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.10.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.10.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.10.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.10.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.10.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.10.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.11.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.11.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.11.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.11.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.11.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.11.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.11.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.11.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.11.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.11.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.12.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.12.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.12.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.12.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.12.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.12.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.12.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.12.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.12.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.12.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.13.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.13.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.13.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.13.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.13.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.13.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.13.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.13.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.13.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.13.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.14.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.14.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.14.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.14.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.14.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.14.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.14.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.14.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.14.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.14.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.15.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.15.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.15.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.15.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.15.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.15.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.15.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.15.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.15.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.15.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.16.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.16.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.16.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.16.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.16.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.16.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.16.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.16.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.16.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.16.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.17.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.17.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.17.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.17.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.17.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.17.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.17.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.17.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.17.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.17.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.18.img_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.18.img_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.18.img_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.18.img_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.18.img_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for double_blocks.18.txt_mod.lin.weight: copying a param with shape torch.Size([28311552, 1]) from checkpoint, the shape in current model is torch.Size([18432, 3072]). size mismatch for double_blocks.18.txt_attn.qkv.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for double_blocks.18.txt_attn.proj.weight: copying a param with shape torch.Size([4718592, 1]) from checkpoint, the shape in current model is torch.Size([3072, 3072]). size mismatch for double_blocks.18.txt_mlp.0.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([12288, 3072]). size mismatch for double_blocks.18.txt_mlp.2.weight: copying a param with shape torch.Size([18874368, 1]) from checkpoint, the shape in current model is torch.Size([3072, 12288]). size mismatch for single_blocks.0.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.0.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.0.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.1.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.1.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.1.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.2.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.2.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.2.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.3.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.3.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.3.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.4.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.4.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.4.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.5.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.5.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.5.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.6.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.6.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.6.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.7.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.7.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.7.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.8.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.8.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.8.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.9.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.9.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.9.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.10.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.10.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.10.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.11.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.11.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.11.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.12.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.12.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.12.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.13.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.13.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.13.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.14.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.14.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.14.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.15.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.15.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.15.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.16.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.16.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.16.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.17.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.17.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.17.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.18.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.18.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.18.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.19.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.19.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.19.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.20.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.20.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.20.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.21.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.21.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.21.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.22.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.22.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.22.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.23.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.23.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.23.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.24.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.24.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.24.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.25.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.25.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.25.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.26.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.26.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.26.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.27.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.27.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.27.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.28.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.28.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.28.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.29.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.29.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.29.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.30.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.30.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.30.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.31.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.31.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.31.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.32.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.32.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.32.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.33.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.33.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.33.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.34.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.34.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.34.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.35.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.35.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.35.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.36.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.36.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.36.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for single_blocks.37.linear1.weight: copying a param with shape torch.Size([33030144, 1]) from checkpoint, the shape in current model is torch.Size([21504, 3072]). size mismatch for single_blocks.37.linear2.weight: copying a param with shape torch.Size([23592960, 1]) from checkpoint, the shape in current model is torch.Size([3072, 15360]). size mismatch for single_blocks.37.modulation.lin.weight: copying a param with shape torch.Size([14155776, 1]) from checkpoint, the shape in current model is torch.Size([9216, 3072]). size mismatch for final_layer.linear.weight: copying a param with shape torch.Size([98304, 1]) from checkpoint, the shape in current model is torch.Size([64, 3072]). size mismatch for final_layer.adaLN_modulation.1.weight: copying a param with shape torch.Size([9437184, 1]) from checkpoint, the shape in current model is torch.Size([6144, 3072]).
Show Report
Find Issues
Are you using a model version nf4 but not using a special nodes for it?
@Sa_May thats very possible, ill use the unet model that node i use.
@Noob_ee If you use unet, then you need to use clip and vae separately. Perhaps you mixed up the versions and downloaded NF4 - this is also unet, But you need a special node
@Sa_May correct, all fixed, thank you.
Details
Files
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.









