Stable Diffusion Interpretability and Analysis

Investigated layers and attention heads in the CLIP Text Encoder in stable diffusion models, focusing on prompts involving negation and bias.
Determining spurious correlations and adapting the model to improve zero-shot performance on challenging prompts.

Quick Links:

Report

Slide Deck