Text-guided image editing by learning guidance scales via reinforcement learning

Grant US12579713B2 Kind: B2 Mar 17, 2026

Assignee

QUALCOMM Incorporated

Inventors

Samuel Showalter, Risheek Garrepalli, Debasmit Das, Munawar Hayat, Fatih Murat Porikli

Abstract

Certain aspects of the present disclosure provide techniques and apparatus for improved machine learning. In an example method, a first latent tensor generated during a first iteration of processing data using a denoising backbone of a diffusion machine learning model is accessed. A first guidance scale is generated based on processing the first latent tensor using a guidance machine learning model. A second latent tensor is generated during a second iteration of processing data using the denoising backbone based on the first latent tensor and the first guidance scale, and an output from the diffusion machine learning model is generated based at least in part on the second latent tensor.
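
The loop described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names (denoise_backbone, guidance_model, guided_step, run_guided_denoising), the toy arithmetic inside them, the step size, and the tensor shapes are all hypothetical. The sketch also assumes the guidance scale is applied in the classifier-free guidance style (blending a conditional and an unconditional noise estimate), which the abstract does not state.

```python
import numpy as np


def denoise_backbone(latent, cond):
    """Toy stand-in for the denoising backbone (hypothetical).

    Returns a "noise estimate" that points away from the conditioning tensor.
    """
    return latent - cond


def guidance_model(latent):
    """Toy stand-in for the guidance model (hypothetical).

    Maps a latent tensor to a scalar guidance scale in the range [1, 8].
    A learned model (for example, a policy trained with reinforcement
    learning) would replace this fixed formula.
    """
    return 1.0 + 7.0 * float(np.tanh(np.mean(np.abs(latent))))


def guided_step(latent, cond, uncond, scale, step_size=0.1):
    """One denoising iteration using a given guidance scale.

    Assumption: classifier-free-guidance-style blending of the conditional
    and unconditional noise estimates.
    """
    eps_cond = denoise_backbone(latent, cond)
    eps_uncond = denoise_backbone(latent, uncond)
    eps = eps_uncond + scale * (eps_cond - eps_uncond)
    return latent - step_size * eps


def run_guided_denoising(num_steps=10, seed=0):
    """Run the loop: latent -> guidance scale -> next latent -> ... -> output."""
    rng = np.random.default_rng(seed)
    latent = rng.standard_normal((4, 4))  # initial noisy latent tensor
    cond = np.ones((4, 4))                # toy embedding of the text prompt
    uncond = np.zeros((4, 4))             # toy "empty prompt" embedding
    scales = []
    for _ in range(num_steps):
        # The scale for the next iteration is computed from the current latent.
        scale = guidance_model(latent)
        latent = guided_step(latent, cond, uncond, scale)
        scales.append(scale)
    return latent, scales
```

Calling run_guided_denoising() returns the final latent tensor together with the per-iteration guidance scales. The point of the sketch is the data flow: each scale depends on the latent tensor from the previous iteration rather than being a fixed hyperparameter.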

CPC Classifications

G06T 11/60, G06T 11/40, G06T 19/00, G06T 11/00, G06T 11/20, G06T 11/206, G06T 7/0012, G06T 2207/10081, G06T 2207/30004, G06T 11/203, G06T 2211/441, G06T 2210/28, G06T 11/001, G06T 2200/12, G06F 9/4443, G06F 3/0481, G06F 30/13, G06F 3/04845, G06F 3/04883, G06F 40/143, G06F 16/54, G06F 16/56, G06F 16/5838, G06F 16/58, G06F 16/55, G06F 40/40, G06V 20/20, G06V 10/7753, G11B 27/10, H04N 21/47217, G16H 30/20, G06N 20/20, G06N 3/045, G06N 3/047, G06N 3/08, G06Q 30/0643

Filing Date

2024-01-25

Application No.

18422692

Claims

15