@Junoh-Kang
Sorry for the late reply. I looked into examples/textual_inversion in Diffusers and understood that a placeholder token such as <cat-toy> is used in the prompt, e.g., prompt = "A <cat-toy> backpack". The tokenizer likely splits <cat-toy> into <, cat, -, toy, and >, and an attention map is stored for each piece. If you want a single attention map for the whole <cat-toy> token, you can modify the resize_and_save function in utils.py to sum the attention maps for <, cat, -, toy, and > before normalizing them, and save the result as the attention map for <cat-toy>. I think this should solve the issue.
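As a rough illustration of the merging step, here is a minimal sketch. It assumes the attention maps are available as a dict keyed by token string with tensors of identical shape (the actual structure inside resize_and_save in utils.py may differ); the function name merge_placeholder_maps and the dict layout are hypothetical, not part of this repo's API.

```python
import torch

def merge_placeholder_maps(attn_maps, subtokens, merged_name="<cat-toy>"):
    """Sum the attention maps of the subtokens produced for a textual-inversion
    placeholder, renormalize, and store the result under the placeholder name.

    attn_maps: dict mapping token strings (e.g. "<", "cat", "-", "toy", ">")
               to 2D attention tensors of identical shape (assumed layout).
    subtokens: list of subtoken strings to merge.
    merged_name: key under which the combined map is stored.
    """
    merged = torch.stack([attn_maps[t] for t in subtokens]).sum(dim=0)
    merged = merged / merged.max()  # normalize to [0, 1] before saving
    attn_maps[merged_name] = merged
    # Optionally drop the per-subtoken maps so only the merged one is saved.
    for t in subtokens:
        attn_maps.pop(t, None)
    return attn_maps

# Example: attn_maps = merge_placeholder_maps(attn_maps, ["<", "cat", "-", "toy", ">"])
```

The same idea carries over to however resize_and_save actually stores the maps: sum the subtoken maps first, normalize once, and write out a single image for the placeholder.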
Thank you for your code. How can I visualize a prompt that contains textual inversion tokens?