Text-guided image editing by learning guidance scales via reinforcement learning
Assignee
QUALCOMM Incorporated
Inventors
Samuel Showalter, Risheek Garrepalli, Debasmit Das, Munawar Hayat, Fatih Murat Porikli
Abstract
Certain aspects of the present disclosure provide techniques and apparatus for improved machine learning. In an example method, a first latent tensor generated during a first iteration of processing data using a denoising backbone of a diffusion machine learning model is accessed. A first guidance scale is generated based on processing the first latent tensor using a guidance machine learning model. A second latent tensor is generated during a second iteration of processing data using the denoising backbone based on the first latent tensor and the first guidance scale, and an output from the diffusion machine learning model is generated based at least in part on the second latent tensor.
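The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. The names `toy_denoiser`, `toy_guidance_model`, `denoise_step`, and `run_diffusion` are hypothetical, the latent tensor is reduced to a flat list of floats, and the way the scale is applied (a classifier-free-guidance style blend of conditioned and unconditioned noise estimates) is an assumption that the abstract does not confirm. The guidance model here is a fixed toy function; training it by reinforcement learning, as the title indicates, is not shown.

```python
from typing import Callable, List, Tuple

Latent = List[float]


def toy_denoiser(latent: Latent, conditioned: bool) -> Latent:
    """Hypothetical stand-in for the denoising backbone: returns a noise estimate."""
    shift = 0.5 if conditioned else 0.0
    return [0.1 * x + shift for x in latent]


def toy_guidance_model(latent: Latent) -> float:
    """Hypothetical stand-in for the guidance model: maps a latent to a scalar scale."""
    mean_abs = sum(abs(x) for x in latent) / len(latent)
    return 1.0 + mean_abs


def denoise_step(
    latent: Latent,
    scale: float,
    denoiser: Callable[[Latent, bool], Latent],
    step_size: float = 0.1,
) -> Latent:
    """One backbone iteration using the previous latent and its guidance scale."""
    eps_uncond = denoiser(latent, False)
    eps_cond = denoiser(latent, True)
    # Assumed classifier-free-guidance style blend, weighted by the scale.
    guided = [u + scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]
    return [x - step_size * g for x, g in zip(latent, guided)]


def run_diffusion(
    initial: Latent,
    num_steps: int,
    denoiser: Callable[[Latent, bool], Latent] = toy_denoiser,
    guidance_model: Callable[[Latent], float] = toy_guidance_model,
) -> Tuple[Latent, List[float]]:
    """Alternate between predicting a scale from the latent and denoising with it."""
    latent = list(initial)
    scales: List[float] = []
    for _ in range(num_steps):
        scale = guidance_model(latent)  # scale from the current latent
        scales.append(scale)
        latent = denoise_step(latent, scale, denoiser)  # next latent
    return latent, scales


final_latent, used_scales = run_diffusion([1.0, -1.0], num_steps=3)
```

Because the scale is recomputed from the latent at every iteration, each step can use a different guidance strength instead of one constant chosen by hand.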
CPC Classifications
Filing Date
2024-01-25
Application No.
18422692
Claims
15