Color conditioned diffusion prior
US-2024404144-A1 · Dec 5, 2024 · US
US2024193824A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2024193824-A1 |
| Application number | US-202318472923-A |
| Country | US |
| Kind code | A1 |
| Filing date | Sep 22, 2023 |
| Priority date | Dec 9, 2022 |
| Publication date | Jun 13, 2024 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Disclosed is a method for realistic visualization of a digital human, the method including: setting a specific action of the digital human; determining a scene including the specific action of the digital human and rendering the determined scene to generate a first rendered video; capturing images constituting the first rendered video for each frame to obtain frame data; inputting each piece of the frame data of the first rendered video to two or more realistic visualization modules to obtain frame data of a second rendered video; and combining the frame data of the second rendered video to generate a realistically visualized scene.
Opening claim text (preview).
What is claimed is: 1 . A method for realistic visualization of a digital human, the method comprising: setting a specific action of the digital human; determining a scene including the specific action of the digital human and rendering the determined scene to generate a first rendered video; capturing images constituting the first rendered video for each frame to obtain frame data; inputting each piece of the frame data of the first rendered video to two or more realistic visualization modules to obtain frame data of a second rendered video; and combining the frame data of the second rendered video to generate a realistically visualized scene. 2 . The method of claim 1 , wherein the determining of the scene includes determining a scene including at least one of a posture of the digital human, a camera, lighting, a background, a viewing angle, a distance, and coordinate information. 3 . The method of claim 1 , wherein the two or more realistic visualization modules include at least one pair of realistic visualization modules connected in parallel. 4 . The method of claim 3 , wherein each of the pair of realistic visualization modules connected in parallel includes one or more realistic visualization modules connected thereto in series. 5 . The method of claim 4 , wherein the one or more realistic visualization modules connected in series to each of the pair of realistic visualization modules connected in parallel are provided in different types. 6 . The method of claim 1 , further comprising: generating identification information for the realistically visualized scene; and when the identification information for the realistically visualized scene matches identification information for a newly input third rendered video, caching the second rendered video to generate a realistically visualized video. 7 . The method of claim 6 , wherein the identification information includes at least one of scene identification information (scene_ID), action identification information (action_ID), and query identification information (query_id) including the scene identification information and the action identification information. 8 . A computing device for realistic visualization of a digital human, the computing device comprising at least one processor configured to perform an operation for realistic visualization of the digital human, wherein the at least one processor is configured to: set a specific action of the digital human; determine a scene including the specific action of the digital human and render the determined scene to generate a first rendered video; capture images constituting the first rendered video for each frame to obtain frame data; input each piece of the frame data of the first rendered video to two or more realistic visualization modules to obtain frame data of a second rendered video; and combine the frame data of the second rendered video to generate a realistically visualized scene. 9 . The computing device of claim 8 , wherein the at least one processor is configured to, when determining the scene, determine a scene including at least one of a posture of the digital human, a camera, lighting, a background, a viewing angle, a distance, and coordinate information. 10 . The computing device of claim 8 , wherein the two or more realistic visualization modules include at least one pair of realistic visualization modules connected in parallel. 11 . The computing device of claim 10 , wherein each of the pair of realistic visualization modules connected in parallel includes one or more realistic visualization modules connected thereto in series. 12 . The computing device of claim 11 , wherein the one or more realistic visualization modules connected in series to each of the pair of realistic visualization modules connected in parallel are provided in different types. 13 . The computing device of claim 8 , wherein the at least one processor is configured to: generate identification information for the realistically visualized scene; and when the identification information for the realistically visualized scene matches identification information for a newly input third rendered video, cache the second rendered video to generate a realistically visualized image. 14 . The computing device of claim 13 , wherein the identification information includes at least one of scene identification information (scene_ID), action identification information (action_ID), and query identification information (query_id) including the scene identification information and the action identification information. 15 . A method for realistic visualization of a specific region of a digital human, the method comprising: determining a scene including a specific action of the digital human and rendering the determined scene to generate a first rendered video; extracting a facial region of the digital human; performing a realistic visualization operation on the facial region using two or more realistic visualization modules connected in parallel to generate frame data of a realistically visualized facial region video; and synthesizing each piece of the frame data of the realistically visualized facial region video and each piece of frame data of the first rendered video.
Texturing; Colouring; Generation of textures or colours (retouching, inpainting or scratch removal G06T5/77) · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.