EGLOCE: Training-Free Energy-Guided Latent Optimization for Concept Erasure

2026-04-10Computer Vision and Pattern Recognition

Computer Vision and Pattern Recognition
AI summary

The authors introduce a new method called EGLOCE to remove unwanted concepts (like explicit content or copyrighted styles) from images generated by text-to-image models. Unlike previous methods that need retraining or weaken other details, EGLOCE works during the image creation process without changing the model. It uses two guiding energies to steer the image away from unwanted ideas while keeping the original prompt's meaning. Their tests show EGLOCE works better at removing concepts while keeping image quality intact.

text-to-image diffusion modelsconcept erasureenergy-guided samplinglatent spacegradient descentinference-timedual-objective frameworksemantic alignmentplug-and-play integration
Authors
Junyeong Ahn, Seojin Yoon, Sungyong Baik
Abstract
As text-to-image diffusion models grow increasingly prevalent, the ability to remove specific concepts-mostly explicit content and many copyrighted characters or styles-has become essential for safety and compliance. Existing unlearning approaches often require costly re-training, modify parameters at the cost of degradation of unrelated concept fidelity, or depend on indirect inference-time adjustment that compromise the effectiveness of concept erasure. Inspired by the success of energy-guided sampling for preservation of the condition of diffusion models, we introduce Energy-Guided Latent Optimization for Concept Erasure (EGLOCE), a training-free approach that removes unwanted concepts by re-directing noisy latent during inference. Our method employs a dual-objective framework: a repulsion energy that steers generation away from target concepts via gradient descent in latent space, and a retention energy that preserves semantic alignment to the original prompt. Combined with previous approaches that either require erroneous modified model weights or provide weak inference-time guidance, EGLOCE operates entirely at inference and enhances erasure performance, enabling plug-and-play integration. Extensive experiments demonstrate that EGLOCE improves concept removal while maintaining image quality and prompt alignment across baselines, even with adversarial attacks. To the best of our knowledge, our work is the first to establish a new paradigm for safe and controllable image generation through dual energy-based guidance during sampling.