Title: Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers

Aug 12, 2024

Joshua Nathaniel Williams, Avi Schwarzschild, J. Zico Kolter

Abstract: Recovering natural language prompts for image generation models based solely on the generated images is a difficult discrete optimization problem. In this work, we present the first head-to-head comparison of recent discrete optimization techniques for the problem of prompt inversion. We evaluate Greedy Coordinate Gradients (GCG), PEZ, Random Search, AutoDAN, and BLIP2's image captioner across various evaluation metrics related to the quality of inverted prompts and the quality of the images generated by the inverted prompts. We find that focusing on the CLIP similarity between the inverted prompts and the ground truth image acts as a poor proxy for the similarity between the ground truth image and the image generated by the inverted prompts. While the discrete optimizers effectively minimize their objectives, simply using responses from a well-trained captioner often leads to generated images that more closely resemble those produced by the original prompts.

* 9 Pages, 4 Figures
