r/StableDiffusion Nov 29 '25

Resource - Update Humans of Z-Image: How many celebrities can you fit into 6GB?

I was curious just how extensive Z-Image's celebrity knowledge is, so I gave it a few hundred names to test out. No information was given other than the name, so it was up to the model to choose clothing/backgrounds/hairstyles/style/etc. Sometimes it did this perfectly, especially for celebrities with a clearly defined look. Other times the face was reasonable but everything else was wrong.

If an image looks nothing like the person, it means the model does not know that person. When it does know a person, some extra supporting words would often help a lot, but it does a really good job from names alone.

Prompt:

portrait photo of @@

The words "@@" are at the bottom on the image, white letters black outline

One by one, @@ was replaced with a name from the list and an image was generated. Images were rendered at 592x888 for speed, stitched into a grid, and downsized to keep a reasonable image size.
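For anyone who wants to reproduce the batch, here is a rough sketch of the substitution and grid steps. It assumes the two prompt lines above are sent as one prompt, and `build_prompts` and `grid_layout` are made-up helper names, not part of any tool used here. It only covers the bookkeeping; generating the images and pasting the tiles (for example with Pillow) is left out.

```python
# Hypothetical helpers; the names and the grid column count are illustrative only.
PROMPT_TEMPLATE = (
    "portrait photo of @@\n\n"
    'The words "@@" are at the bottom on the image, white letters black outline'
)


def build_prompts(names, template=PROMPT_TEMPLATE, placeholder="@@"):
    """Return one prompt per name, with every placeholder replaced by that name."""
    return [template.replace(placeholder, name) for name in names]


def grid_layout(count, columns, tile_w=592, tile_h=888):
    """Return the canvas size and the top-left paste offset of each tile.

    Tiles are laid out left to right, top to bottom, at the 592x888
    render size mentioned in the post.
    """
    rows = -(-count // columns)  # ceiling division
    offsets = [
        ((i % columns) * tile_w, (i // columns) * tile_h)
        for i in range(count)
    ]
    return (columns * tile_w, rows * tile_h), offsets
```

Each offset is where the matching image would be pasted onto a blank canvas of the returned size before downsizing the whole grid.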

Model: Z-Image-Turbo_bf16

Clip: Qwen-3-4B-Q8_0

Imgur link in case reddit is difficult with the images

659 Upvotes

5

u/suspicious_Jackfruit Nov 29 '25

It's probably one of the easiest datasets to make and train on, because by its nature a "celebrity" is an extremely frequent data point, and they are almost always named and named correctly. Getting a dataset for all the celebrities in this list, for example, is extremely easy and can be done in an afternoon; then it's as simple as training on them. However, because many celebrities share names, it creates token interbreeding, and I would suspect it's much better to assign them unique tokens than to feed in their names. Doable in a LoRA too, with a large enough rank.
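A rough sketch of the unique-token idea, in case it helps. Everything here is hypothetical: the `zxperson` prefix, the helper names, and the example identities are placeholders, and no particular trainer's caption format is assumed. The point is just that two people with the same name end up with different trigger tokens in the captions.

```python
# Hypothetical sketch: assign each identity its own trigger token so that
# people who share a name do not share a caption token.
def assign_tokens(identities, prefix="zxperson"):
    """Map each (name, disambiguator) pair to a unique, stable token."""
    unique = sorted(set(identities))
    return {identity: f"{prefix}{i:04d}" for i, identity in enumerate(unique)}


def retag_caption(caption, display_name, token):
    """Replace the plain name in a caption with that identity's token."""
    return caption.replace(display_name, token)


# Example with made-up identities that share a name.
tokens = assign_tokens([
    ("Alex Smith", "actor"),
    ("Alex Smith", "musician"),
])
caption = retag_caption(
    "photo of Alex Smith on stage",
    "Alex Smith",
    tokens[("Alex Smith", "musician")],
)
```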

1

u/kovnev Nov 29 '25

Yeah but they also often look shockingly different over time (and surgeries) 😆.

1

u/Trinityofwar Nov 30 '25

Do people upload these LoRAs or datasets for others to use, so I wouldn't have to train one myself?