System, method and application to convert transdermal alcohol concentration to blood or breath alcohol concentration
US-2024188886-A1 · Jun 13, 2024 · US
US12482160B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12482160-B2 |
| Application number | US-202418624960-A |
| Country | US |
| Kind code | B2 |
| Filing date | Apr 2, 2024 |
| Priority date | May 19, 2022 |
| Publication date | Nov 25, 2025 |
| Grant date | Nov 25, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.
Opening claim text (preview).
What is claimed is: 1 . A method performed by one or more computers, the method comprising: receiving a text prompt describing a scene; processing the text prompt, using a frozen text encoder neural network, to generate a set of contextual embeddings of the text prompt; and processing the contextual embeddings, using a sequence of neural networks, to generate an image depicting the scene, wherein the sequence of neural networks comprises: an initial generative neural network configured to: receive the contextual embeddings; and process the contextual embeddings to generate, as output, an initial representation of the image having an initial dimensionality; one or more subsequent generative neural networks proceeding the initial generative neural network, each subsequent generative neural network configured to: receive a respective input comprising an input representation of the image generated as output by a preceding neural network in the sequence; and process the respective input to generate, as output, a respective output representation of the image having higher dimensionality than the input representation; and a final neural network following the one or more subsequent generative neural networks, the final neural network configured to: receive an output representation of the image generated by a subsequent generative neural network in the sequence; and process the output representation of the image to generate, as output, the image depicting the scene; wherein the frozen text encoder neural network was held frozen during training of the initial generative neural network and the one or more subsequent generative neural networks, and wherein the training of the initial generative neural network comprises: obtaining a plurality of training examples that each comprise: (i) a respective training text prompt, and (ii) a respective ground truth image that depicts a scene described by the respective training text prompt; pre-computing contextual embeddings of the training text prompts for the plurality of training examples using the frozen text encoder neural network; and training the initial generative neural network on the plurality of training examples using the pre-computed contextual embeddings of the training text prompts without re-computing the contextual embeddings of the training text prompts or modifying parameter values of the frozen text encoder neural network. 2 . The method of claim 1 , wherein the image has higher dimensionality than each representation of the image. 3 . The method of claim 1 , wherein each representation of the image is a respective compressed representation of the image. 4 . The method of claim 3 , wherein each compressed representation of the image is a respective latent representation of the image. 5 . The method of claim 1 , wherein each representation of the image is a respective pixel representation of the image. 6 . The method of claim 1 , wherein the respective input of each subsequent generative neural network further comprises the contextual embeddings. 7 . The method of claim 1 , wherein the initial generative neural network and each subsequent generative neural network is a diffusion-based generative neural network. 8 . The method of claim 7 , wherein the initial diffusion-based generative neural network and each subsequent diffusion-based generative neural network is parameterized in continuous time. 9 . The method of claim 7 , wherein the initial diffusion-based generative neural network and each subsequent diffusion-based generative neural network is a denoising diffusion model. 10 . The method of claim 7 , wherein the initial diffusion-based generative neural network and each subsequent diffusion-based generative neural network has been trained using classifier-free guidance. 11 . The method of claim 7 , wherein the initial diffusion-based generative neural network and each subsequent diffusion-based generative neural network has a convolutional neural network architecture. 12 . The method of claim 11 , wherein the convolutional neural network architecture is a U-Net architecture. 13 . The method of claim 1 , wherein the initial generative neural network and each subsequent generative neural network has 400 million or more network parameters. 14 . The method of claim 1 , wherein the final neural network is a decoder neural network. 15 . The method of claim 1 , wherein the neural networks in the sequence have been trained jointly on the plurality of training examples. 16 . The method of claim 1 , wherein the initial generative neural network, or the one or more subsequent generative neural networks, or both, are diffusion-based generative neural networks that have been trained by progressive distillation. 17 . The method of claim 16 , wherein the initial generative neural network, or the one or more subsequent generative neural networks, or both, implement v-parametrization. 18 . A system comprising one or more computers and one or more storage devices storing instructions that, when executed by the one or more computers, cause the one or more computers to perform operations comprising: receiving a text prompt describing a scene; processing the text prompt, using a frozen text encoder neural network, to generate a set of contextual embeddings of the text prompt; and processing the contextual embeddings, using a sequence of neural networks, to generate an image depicting the scene, wherein the sequence of neural networks comprises: an initial generative neural network configured to: receive the contextual embeddings; and process the contextual embeddings to generate, as output, an initial representation of the image having an initial dimensionality; one or more subsequent generative neural networks proceeding the initial generative neural network, each subsequent generative neural network configured to: receive a respective input comprising an input representation of the image generated as output by a preceding neural network in the sequence; and process the respective input to generate, as output, a respective output representation of the image having higher dimensionality than the input representation; and a final neural network following the one or more subsequent generative neural networks, the final neural network configured to: receive an output representation of the image generated by a subsequent generative neural network in the sequence; and process the output representation of the image to generate, as output, the image depicting the scene; wherein the frozen text encoder neural network was held frozen during training of the initial generative neural network and the one or more subsequent generative neural networks, and wherein the training of the initial generative neural network comprises: obtaining a plurality of training examples that each comprise: (i) a respective training text prompt, and (ii) a respective ground truth image that depicts a scene described by the respective training text prompt; pre-computing contextual embeddings of the training text prompts for the plurality of training examples using the frozen text encoder neural network; and training the initial generative neural network on the plurality of training examples using the pre-computed contextual embeddings of the training text prompts without re-computing the contextual embeddings of the training text prompts or modifying parameter values of the frozen text encoder neural network. 19 . One or more non-transitory computer-rea
using neural networks · CPC title
Syntactic or structural pattern recognition, e.g. symbolic string recognition · CPC title
Denoising; Smoothing · CPC title
based on super-resolution, i.e. the output image resolution being higher than the sensor resolution · CPC title
Learning methods · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.