Method and system of generating customized three-dimensional images

US2024386661A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024386661-A1
Application numberUS-202318319536-A
CountryUS
Kind codeA1
Filing dateMay 18, 2023
Priority dateMay 18, 2023
Publication dateNov 21, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A data processing system includes: a processor; a memory storing executable instructions which, when executed by the processor, cause the processor, alone or in combination with other processors, to implement a client application with a user interface. The client application is configured to: receive an image depicting an object; generate a fine-tuning input to an image-generating Artificial Intelligence (AI) model to associate image data of the object with an identifier; with the fine-tuning input, fine-tune the AI model; structure a prompt for the AI model using the identifier; and obtain from the AI model a new customized image that depicts the object while preserving an appearance of the object.

First claim

Opening claim text (preview).

What is claimed is: 1 . A data processing system comprising; a processor; a memory storing executable instructions which, when executed by the processor, cause the processor, alone or in combination with other processors, to implement a client application with a user interface to: receive a two-dimensional image depicting an object; generate a fine-tuning input to an image-generating Artificial Intelligence (AI) model to associate image data of the object with an identifier; with the fine-tuning input, fine-tune the AI model; structure a prompt for the AI model using the identifier; and obtain from the AI model a new customized image that depicts the object while preserving an appearance of the object. 2 . The data processing system of claim 1 , the AI model being trained to output a three-dimensional (3D) image based on the prompt. 3 . The data processing system of claim 1 , wherein the user interface of the client application is configured to receive textual user input describing the customized image, the application to structure the prompt for the AI model based on the user input. 4 . The data processing system of claim 1 , wherein the client application is further to receive an image depicting a background in which the object is to be visualized in the customized image, wherein the fine-tuning input further comprises association of image data of the background with a second identifier. 5 . The data processing system of claim 4 , wherein the client application is further to structure the prompt for the AI model using the identifiers for both the image data of the object and background such that the customized image depicts the object and background. 6 . The data processing system of claim 4 , wherein the user interface of the client application is configured to receive textual user input describing a relationship between the object and background to be used in generating the customized image. 7 . The data processing system of claim 1 , wherein the client application is to call an instance segmentation service to produce the image depicting the object from a different image depicting the object along with other image content. 8 . The data processing system of claim 1 , wherein the client application is to obtain from the AI model a new customized image that depicts the object while preserving an appearance of the object, the customized image being a two-dimensional image, the client application to then call a service to convert the two-dimensional image into a three-dimensional image. 9 . A non-transitory computer-readable medium comprising instructions for a client application for execution by a processor, alone or in combination with other processors, the client application comprising; a user interface to receive input images and input text; a fine-tuning tool to generate a fine-tuning input to an image-generating Artificial Intelligence (AI) model to associate image data of an object with an identifier using an image of the object input through the user interface; the fine-tuning tool to implant the fine-tuning input in an output domain of the AI model; and a prompt engine to structure a prompt for the AI model using the identifier and to obtain from the AI model a new customized image that depicts the object based on the prompt while preserving an appearance of the object. 10 . The medium of claim 9 , the AI model being trained to output a three-dimensional (3D) image based on the prompt. 11 . The medium of claim 9 , wherein the user interface of the client application is configured to receive textual user input describing the customized image, the prompt engine to structure the prompt for the AI model based on the user input. 12 . The medium of claim 9 , wherein the client application is further to receive an image depicting a background in which the object is to be visualized in the customized image, wherein the fine-tuning input further comprises association of image data of the background with a second identifier. 13 . The medium of claim 12 , wherein the fine-tuning tool of the client application is further to structure the prompt for the AI model using the identifiers for both the image data of the object and background such that the customized image depicts the object and background. 14 . The medium of claim 12 , wherein the user interface of the client application is configured to receive textual user input describing a relationship between the object and background to be used generating the customized image. 15 . The medium of claim 10 , wherein the user interface includes tools for instructing the prompt engine to generate a new prompt for a new 3D image after viewing a first 3D image, the new prompt revising lighting, background or other element of the first 3D image. 16 . The medium of claim 9 , wherein the client application is to obtain from the AI model a new customized image that depicts the object while preserving an appearance of the object, the customized image being a two-dimensional image, the client application to then call a service to convert the two-dimensional image into a three-dimensional image. 17 . A method of generating a customized three-dimensional image, the method comprising; receiving an image depicting an object; generating a fine-tuning input to an image-generating Artificial Intelligence (AI) model to associate image data of the object with an identifier; with the fine-tuning input, fine-tuning the AI model; structuring a prompt for the AI model using the identifier; and obtaining from the AI model the customized three-dimensional image that depicts the object while preserving an appearance of the object. 18 . The method of claim 17 , the AI model being trained to output a three-dimensional (3D) image based on the prompt. 19 . The method of claim 17 , further comprising: receiving textual user input describing the customized image; and structuring the prompt for the AI model based on the user input. 20 . The method of claim 17 , further comprising: receiving an image depicting a background in which the object is to be visualized in the customized image; and with the fine-tuning input, specifying an association of image data of the background with a second identifier.

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Training; Learning · CPC title

  • involving the use of two or more images · CPC title

  • Semantic analysis · CPC title

  • Interaction techniques based on graphical user interfaces [GUI] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024386661A1 cover?
A data processing system includes: a processor; a memory storing executable instructions which, when executed by the processor, cause the processor, alone or in combination with other processors, to implement a client application with a user interface. The client application is configured to: receive an image depicting an object; generate a fine-tuning input to an image-generating Artificial In…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G06T17/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 21 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).