Image analysis by prompting of machine-learned models using chain of thought

US12346828B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12346828-B2
Application numberUS-202418967327-A
CountryUS
Kind codeB2
Filing dateDec 3, 2024
Priority dateJun 3, 2022
Publication dateJul 1, 2025
Grant dateJul 1, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An example technique for image analysis is provided. An example image analysis method includes obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response. The example image analysis method includes inputting, to a machine-learned model, the instructive sequence and an operative image processing query that comprises image data, wherein the machine-learned model is configured to process the operative query with attention over the instructive sequence. The example method can include generating, using the machine-learned model and responsive to the operative query, an operative image processing response that comprises an analysis of the image data.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for performing image analysis, the method comprising: obtaining, by a computing system comprising one or more processors, an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response; inputting, by the computing system and to a machine-learned model, the instructive sequence and an operative image processing query comprising image data, wherein the machine-learned model is configured to process the operative image processing query with attention over the instructive sequence; and generating, by the computing system, using the machine-learned model and responsive to the operative image processing query, an operative image processing response. 2. The computer-implemented method of claim 1 , comprising: generating, by the computing system, using the machine-learned model and responsive to the operative image processing query, an operative trace of intermediate states from the operative query to the operative image processing response. 3. The computer-implemented method of claim 1 , wherein the instructive sequence is prepended to the operative image processing query. 4. The computer-implemented method of claim 2 , wherein the instructive trace comprises a chain of intermediate responses to intermediate queries. 5. The computer-implemented method of claim 1 , wherein the instructive sequence comprises an input flag and an output flag. 6. The computer-implemented method of claim 1 , wherein the instructive sequence comprises a tokenized representation of a natural language. 7. The computer-implemented method of claim 1 , wherein the instructive trace comprises one or more intermediate states of one or more variables declared by a computer-executable coding language. 8. The computer-implemented method of claim 1 , wherein generating the operative response comprises: generating, by the computing system and using the machine-learned model, a plurality of operative responses; and determining, by the computing system, the operative image processing response based on a sample of the plurality of operative responses. 9. The computer-implemented method of claim 8 , wherein determining the operative image processing response comprises: determining, by the computing system, a consistency metric based on the sample of the plurality of operative responses. 10. The computer-implemented method of claim 8 , wherein the sample is based on respective probabilities associated with the plurality of operative responses. 11. The computer-implemented method of claim 9 , wherein the consistency metric comprises at least one of: a plurality vote, or a majority vote. 12. The computer-implemented method of claim 9 , wherein the consistency metric comprises a vote based on operative responses respectively associated with diverse operative traces. 13. The computer-implemented method of claim 1 , wherein the operative image processing query is a first query component and the operative image processing response is a first response component, and wherein the method comprises: inputting, by the computing system and to the machine-learned model, the instructive sequence, the first query component, the first response component, and a second query component; and generating, by the computing system, using the machine-learned model and responsive to the second query component, a second response component. 14. The computer-implemented method of claim 13 , comprising: generating, by the computing system and responsive to a target query, one or more query components. 15. The computer-implemented method of claim 13 , comprising: inputting, by the computing system and to the machine-learned model, a preliminary instructive sequence comprising a preliminary instructive query and a preliminary instructive response, wherein the preliminary instructive response comprises a plurality of preliminary instructive query components. 16. The computer-implemented method of claim 13 , wherein the first query component and the second query component are generated with a different machine-learned model other than the machine-learned model used to obtain the first response component and the second response component. 17. The computer-implemented method of claim 14 , wherein the second query component corresponds to the target query. 18. The computer-implemented method of claim 13 , comprising, for a plurality of iterations: generating, by the computing system, an updated instructive sequence based on combining one or more prior input sequences with one or more output sequences respectively corresponding thereto; inputting, by the computing system and to the machine-learned model, the updated instructive sequence and an additional query component; and generating, by the computing system, using the machine-learned model and responsive to the additional query component, an additional response component. 19. One or more memory devices storing non-transitory computer-readable instructions executable to cause one or more processors to perform operations for performing image analysis, the operations comprising: obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response; inputting, to a machine-learned model, the instructive sequence and an operative image processing query comprising image data, wherein the machine-learned model is configured to process the operative image processing query with attention over the instructive sequence; and generating, using the machine-learned model and responsive to the operative image processing query, an operative image processing response. 20. A computing system for performing image analysis, the system comprising: one or more processors; and one or more memory devices storing non-transitory computer-readable instructions that are executable to cause the one or more processors to perform operations, the operations comprising: obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response; inputting, to a machine-learned model, the instructive sequence and an operative image processing query comprising image data, wherein the machine-learned model is configured to process the operative image processing query with attention over the instructive sequence; and generating, using the machine-learned model and responsive to the operative image processing query, an operative image processing response.

Assignees

Inventors

Classifications

  • Knowledge-based neural networks; Logical representations of neural networks · CPC title

  • Transfer learning · CPC title

  • G06N5/022Primary

    Knowledge engineering; Knowledge acquisition · CPC title

  • G06N3/0455Primary

    Auto-encoder networks; Encoder-decoder networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12346828B2 cover?
An example technique for image analysis is provided. An example image analysis method includes obtaining an instructive sequence descriptive of an instructive query, an instructive response, and an instructive trace of intermediate states from the instructive query to the instructive response. The example image analysis method includes inputting, to a machine-learned model, the instructive sequ…
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06N5/022. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).