Stable Diffusion Interpretability and Analysis

  • Investigated layers and attention heads in the CLIP Text Encoder in stable diffusion models, focusing on prompts involving negation and bias.
  • Determining spurious correlations and adapting the model to improve zero-shot performance on challenging prompts.

Quick Links:

Slide Deck