Image classifier comprising a non-injective transformation

US12450891B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12450891-B2
Application numberUS-202117345702-A
CountryUS
Kind codeB2
Filing dateJun 11, 2021
Priority dateJul 3, 2020
Publication dateOct 21, 2025
Grant dateOct 21, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method of training an image classifier which uses any combination of labelled and/or unlabelled training images. The image classifier comprises a set of transformations between respective transformation inputs and transformation outputs. An inverse model is defined in which for a deterministic, non-injective transformation of the image classifier, its inverse is approximated by a stochastic inverse transformation. During training, for a given training image, a likelihood contribution for this transformation is determined based on a probability of its transformation inputs being generated by the stochastic inverse transformation given its transformation outputs. This likelihood contribution is used to determine a log-likelihood for the training image to be maximized (and its label, if the training image is labelled), based on which the model parameters are optimized.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method of training an image classifier, the image classifier being configured to classify an input image into a class from a set of classes, the method comprising the following steps: accessing a training dataset, the training dataset including at least one labelled training image labelled with a training class from the set of classes and at least one unlabelled training image; defining an inverse model for the image classifier, the inverse model configured to map output classes of the image classifier to input images, wherein the image classifier includes a set of transformations, the set of transformations including at least one deterministic and non-injective transformation, an inverse of the deterministic and non-injective transformation being approximated in the inverse model by a stochastic inverse transformation, the inverse model including trainable parameters, and sharing a plurality of the trainable parameters with the image classifier; training the image classifier using a log-likelihood optimization, the training including: selecting a training image from the training dataset, applying the image classifier to the training image, including applying the deterministic and non-injective transformation to transformation inputs of the deterministic and non-injective transformation to obtain transformation outputs of the deterministic and non-injective transformation, determining a likelihood contribution for the deterministic and non-injective transformation of the image classifier based on a probability that the stochastic inverse transformation of the inverse model generates the transformation inputs given the transformation outputs, when the training image is the labelled training image, using the likelihood contribution to determine a log-likelihood for the labelled training image and its label according to a joint probability distribution of input images and classes determined by the image classifier, when the training image is the unlabelled training image, using the determined likelihood contribution to determine a log-likelihood for the unlabelled training image according to a probability distribution of input images being generated by the inverse model; and optimizing parameters of the image classifier to maximize the log-likelihood for the labelled training image occurring according to the joint probability distribution and optimizing parameters of the inverse model to maximize the log-likelihood for the unlabelled training image for the unlabelled training image occurring according to the probability distribution of input images being generated by the inverse model. 2. The method of claim 1 , wherein the determining of the log-likelihood for the training image includes determining a sum of likelihood contributions for respective transformations of the set of transformations. 3. The method of claim 2 , wherein the image classifier includes a densely connected component given by a linear bijective transformation and a slicing transformation, wherein the slicing transformation is configured to select a subset of outputs of the linear bijective transformation, wherein an inverse of the slicing transformation is approximated in the inverse model by a stochastic inverse transformation configured to sample non-selected outputs for the linear bijective transformation based on the selected outputs of the linear bijective transformation. 4. The method of claim 2 , wherein the image classifier includes a coupling transformation configured to determine first and second transformation outputs given first and second transformation inputs by combining the first transformation input with a first function of the second transformation input to obtain the first transformation output, and combining the second transformation input with a second function of the first transformation output, wherein the first and second functions are convolutions. 5. The method of claim 1 , wherein the image classifier includes a max pooling transformation computing a transformation output as a maximum of multiple transformation inputs, wherein an inverse of the max pooling transformation is approximated in the inverse model by an inverse transformation configured to sample an index of a maximal transformation input and values of non-maximal transformation inputs. 6. The method of claim 1 , wherein the image classifier includes a ReLU transformation configured to compute a transformation output by mapping a transformation input from an interval to a constant, wherein an inverse of the ReLU transformation is approximated in the inverse model by an inverse transformation configured to, given a transformation output equal to the given constant, sample a transformation input from the given interval. 7. The method of claim 1 , wherein the image classifier is configured to classify the input image into the class by determining a vector of class probabilities for respective classes, and, in an output layer, determining the class from the vector, wherein an inverse of the output layer is approximated in the inverse model based on a conditional probability distribution for the vector of class probabilities given the determined class. 8. The method of claim 1 , wherein the image classifier further includes a stochastic transformation with a deterministic inverse transformation, and the method further comprises computing a likelihood contribution of the stochastic transformation based on a probability that the deterministic and non-injective transformation generates the transformation outputs of the stochastic transformation given the transformation inputs of the stochastic transformation. 9. The method of claim 1 , further comprising: obtaining an input image data using a sensor of an at least semi-autonomous vehicle; classifying the input image data using the trained image classifier, including applying the trained image classifier to classify the input image into a class from the set of classes; and controlling steering and/or braking of the vehicle based on the classification of the input image data. 10. The method of claim 1 , further comprising: obtaining a target class; applying the trained inverse model to the target class to generate, using the trained inverse model, multiple synthetic images representative of the target class; and further training the trained image classifier using the generated synthetic images. 11. The method of claim 1 , further comprising: obtaining a target class; applying the trained inverse model to the target class to generate, using the trained inverse model, multiple synthetic images representative of the target class; and training a machine learning model using the generated synthetic images. 12. A computer-implemented method of using a trained image classifier, the trained image classifier being configured to classify an input image into a class from a set of classes, the method comprising the following steps: accessing model data representing the trained image classifier, wherein the image classifier is trained by: accessing a training dataset, the training dataset including at least one labelled training image labelled with a training class from the set of classes and at least one unlabelled training image, defining an inverse model for the image classifier, the inverse model configured to map output classes of the image classifier to input images, wherein the image classifier includes a set of transformations, the set of transformations including at least one deterministic and non-injective transformation, an inverse of the deterministic and non-injective transformation being approximated in the inverse model by

Assignees

Inventors

Classifications

  • Validation; Performance evaluation · CPC title

  • Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title

  • using classification, e.g. of video objects · CPC title

  • Activation functions · CPC title

  • Multiple classes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12450891B2 cover?
A computer-implemented method of training an image classifier which uses any combination of labelled and/or unlabelled training images. The image classifier comprises a set of transformations between respective transformation inputs and transformation outputs. An inverse model is defined in which for a deterministic, non-injective transformation of the image classifier, its inverse is approxima…
Who is the assignee on this patent?
Bosch Gmbh Robert
What technology area does this patent fall under?
Primary CPC classification G06F18/2415. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 21 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).