Holy shit I was looking at the new SD 3
First Text encoder standard open clip 200MB
Second Text Encoder CLIP Vit-L used in XL 2GB
Third Text Encoder T5XXL 10GB
OK I cant even fit the TE in memory let alone the Unet
The pruned combined model isn't up yet but Im guessing it will be at least 12GB
If not over the 16GB threshold wich would put it into the 0.1% of users.
First Text encoder standard open clip 200MB
Second Text Encoder CLIP Vit-L used in XL 2GB
Third Text Encoder T5XXL 10GB
OK I cant even fit the TE in memory let alone the Unet
The pruned combined model isn't up yet but Im guessing it will be at least 12GB
If not over the 16GB threshold wich would put it into the 0.1% of users.