Image generation model based on log-likelihood

US11995151B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11995151-B2
Application numberUS-202117445891-A
CountryUS
Kind codeB2
Filing dateAug 25, 2021
Priority dateSep 29, 2020
Publication dateMay 28, 2024
Grant dateMay 28, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method of training an image generation model. The image generation model comprises an argmax transformation configured to compute a discrete index feature indicating an index of a feature of the continuous feature vector with an extreme value. The image generation model is trained using a log-likelihood optimization. This involves obtaining a value of the index feature for the training image, sampling values of the continuous feature vector given the value of the index feature according to a stochastic inverse transformation of the argmax transformation, and determining a likelihood contribution of the argmax transformation for the log-likelihood based on a probability that the stochastic inverse transformation generates the values of the continuous feature vector given the value of the index feature.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method of training an image generation model, the image generation model being configured to generate an image from a latent feature representation, the method comprising the following steps: accessing model data representing parameters of the image generation model, and training data representing a training dataset of multiple training images; training the image generation model using a log-likelihood optimization, wherein: the training includes selecting a training image of the training images, and determining a log-likelihood of the training image being generated according to the image generation model, the image generation model includes a transformation configured to determine a discrete feature from a continuous feature vector, the transformation being an argmax transformation configured to compute a discrete index feature indicating an index of a feature of the continuous feature vector with an extreme value, and determining the log-likelihood includes obtaining a value of the index feature for the training image, sampling values of the continuous feature vector given the value of the index feature according to a stochastic inverse transformation of the argmax transformation, and determining a likelihood contribution of the argmax transformation for the log-likelihood based on a probability that the stochastic inverse transformation generates values of the continuous feature vector given the value of the index feature; and outputting the trained image generation model. 2. The method of claim 1 , wherein the stochastic inverse transformation is parameterized by parameters included in the parameters of the image generation model. 3. The method of claim 2 , wherein sampling the values of the continuous feature vector given the value of the index feature includes sampling an initial feature vector, and applying an injective transformation to the initial feature vector based on the value of the index feature to obtain the values of the continuous feature vectors, the injective transformation being defined such that the index feature indicates an index of a feature of the continuous feature vector with an extreme value. 4. The method of claim 3 , wherein applying the injective transformation includes: (i) applying a smooth thresholding function to the feature of the continuous feature vector indicated by the index feature to make the feature indicated by the index feature larger or smaller than one or more values of the continuous feature vector, and/or (ii) applying a smooth thresholding function to one or more other values of the continuous feature vector to make the one or more other values smaller or larger than the feature indicated by the index feature. 5. The method of claim 2 , wherein sampling the values of the continuous feature vector given the value of the index feature includes: sampling the value of the continuous feature vector indicated by the index feature according to a Gumbel distribution; and sampling a value of the continuous feature vector not indicated by the index feature according to a truncated Gumbel distribution based on the sampled value indicated by the index feature. 6. The method of claim 1 , wherein the index feature corresponds to a particular pixel of the training image. 7. The method of claim 1 , wherein the image generation model is configured to determine a discrete feature by computing multiple respective discrete index features using the argmax transformation and combining the multiple discrete index features. 8. A computer-implemented method of using a trained image generation model, comprising the following steps: accessing model data representing parameters of an image generation model, the image generation model including a transformation configured to determine a discrete feature from a continuous feature vector, the transformation being an argmax transformation configured to compute a discrete index feature indicating an index of a feature of the continuous feature vector with an extreme value, the image generation model having been trained on a training dataset, an inverse of the argmax transformation being approximated by a stochastic inverse transformation; and applying the image generation model to a latent feature representation to obtain a generated image, and/or using the image generation model to determine a conformance value indicating a conformance of an input image to the training dataset, the conformance value being based on a log-likelihood of the input image being generated according to the image generation model and being computed using the stochastic inverse transformation. 9. The method of claim 8 , further comprising accessing model data representing an image classifier trained on the same training dataset as the image generation model, and using the conformance value as a reliability measure of the image classifier for the input image. 10. The method of claim 9 , wherein the input image is an image of an environment of a computer-controlled system, the method further comprising obtaining the input image from a camera of the system and, at least if the conformance value indicates a sufficient reliability of the image classifier for the input image, using an output of the image classifier on the input image to control said system. 11. The method of claim 8 , further comprising applying the image generation model repeatedly to obtain multiple generated images, and using the multiple generated images as training data to train a further machine learning model. 12. The method of claim 11 , further comprising obtaining an input instance to the further machine learning model and applying the further machine learning model to the input instance to determine an output of the further machine learning model. 13. A system for training an image generation model, the image generation model being configured to generate an image from a latent feature representation, the system comprising: a data interface configured to access model data representing parameters of the image generation model, and training data representing a training dataset of multiple training images; and a processor subsystem configured to train the image generation model using a log-likelihood optimization and output the trained image generation model, wherein: the training includes selecting a training image of the training images, and determining a log-likelihood of the training image being generated according to the image generation model, the image generation model includes a transformation configured to determine a discrete feature from a continuous feature vector, the transformation being an argmax transformation configured to compute a discrete index feature indicating an index of a feature of the continuous feature vector with an extreme value, and determining the log-likelihood includes obtaining a value of the index feature for the training image, sampling values of the continuous feature vector given the value of the index feature according to a stochastic inverse transformation of the argmax transformation, and determining a likelihood contribution for the argmax transformation based on a probability that the stochastic inverse transformation generates values of the continuous feature vector given the value of the index feature. 14. A system for using a trained image generation model, comprising: a data interface configured to access model data representing parameters of an image generation model, the image generation model including a transformation configured to determine a discrete feature from a continuous feature vector, the transformation being an argmax

Assignees

Inventors

Classifications

  • Two-dimensional [2D] image generation · CPC title

  • G06F18/214Primary

    Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods · CPC title

  • Machine learning · CPC title

  • G06V20/56Primary

    exterior to a vehicle by using sensors mounted on the vehicle · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11995151B2 cover?
A computer-implemented method of training an image generation model. The image generation model comprises an argmax transformation configured to compute a discrete index feature indicating an index of a feature of the continuous feature vector with an extreme value. The image generation model is trained using a log-likelihood optimization. This involves obtaining a value of the index feature fo…
Who is the assignee on this patent?
Bosch Gmbh Robert
What technology area does this patent fall under?
Primary CPC classification G06F18/214. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 28 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).