Eval-Learn

A benchmarking framework for evaluating concept unlearning techniques in text-to-image diffusion models.

Unlearning techniques modify or constrain Stable Diffusion to prevent it from generating specific concepts — nudity, violence, artistic styles, named individuals. Eval-Learn provides a common interface to run, compare, and evaluate these techniques under consistent conditions.

What it includes

13 techniques — ESD, MACE, UCE, SSD, CA, CoGFD, TraSCE, AdvUnlearn, SAeUron, SAFREE, SLD, Concept Steerers, Free Run
9 metrics — ASR I2P, ASR P4D, ASR MMA-Diffusion, ASR Ring-A-Bell, FID, CLIP Score, ERR, TIFA, UA-IRA
2 evaluation modes — single metric or multiple metrics per technique run

Hardware

A CUDA GPU is required. Inference-only techniques need ~5 GB VRAM; training-based techniques peak at 10–16 GB during the training phase. See GPU Requirements for details.