Video encoding by providing geometric proxies

US12192518B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12192518-B2
Application numberUS-202117443913-A
CountryUS
Kind codeB2
Filing dateJul 28, 2021
Priority dateSep 26, 2018
Publication dateJan 7, 2025
Grant dateJan 7, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Compressing a frame of video includes receiving a frame of a video, identifying a three dimensional (3D) object in the frame, matching the 3D object to a stored 3D object, compressing the frame of the video using a color prediction scheme based on the 3D object and the stored 3D object, and storing the compressed frame with metadata, the metadata identifying the 3D object, indicating a position of the 3D object in the frame of the video and indicating an orientation of the 3D object in the frame of the video.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving a frame of a video; identifying a three-dimensional (3D) object in the frame; matching the 3D object to a stored 3D object; generating a first 3D object proxy based on the stored 3D object; transforming the first 3D object proxy based on the 3D object identified in the frame; compressing the frame of the video using a color prediction scheme based on the transformed first 3D object proxy and a transformed second 3D object proxy based on the stored 3D object; and storing the compressed frame with metadata, the metadata identifying the 3D object, indicating a position of the 3D object in the frame of the video and indicating an orientation of the 3D object in the frame of the video. 2. The method of claim 1 , wherein the compressing of the frame of the video using the color prediction scheme based on the 3D object and the stored 3D object includes: generating the second 3D object proxy based on the stored 3D object; identifying the 3D object in a key frame of the video; transforming the second 3D object proxy based on the 3D object identified in the key frame; mapping color attributes from the 3D object to the transformed first 3D object proxy; mapping color attributes from the 3D object identified in the key frame to the transformed second 3D object proxy; and generating residuals for the 3D object based on the color attributes for the transformed first 3D object proxy and the color attributes for the transformed second 3D object proxy. 3. The method of claim 1 , wherein the compressing of the frame of the video using the color prediction scheme based on the 3D object and the stored 3D object includes: generating the second 3D object proxy based on the stored 3D object; identifying the 3D object in a key frame of the video; transforming the second 3D object proxy based on the 3D object identified in the key frame; mapping color attributes from the 3D object to the transformed first 3D object proxy; and generating residuals for the 3D object based on the color attributes for the transformed first 3D object proxy and default color attributes for the transformed second 3D object proxy. 4. The method of claim 1 , wherein the compressing of the frame of the video using the color prediction scheme based on the 3D object and the stored 3D object includes: encoding the first 3D object proxy using an auto encoder, wherein the transforming the first 3D object proxy includes transforming the encoded first 3D object proxy based on the 3D object identified in the frame; generating the second 3D object proxy based on the stored 3D object; encoding the second 3D object proxy using an autoencoder; identifying the 3D object in a key frame of the video; transforming the encoded second 3D object proxy based on the 3D object identified in the key frame; mapping color attributes from the 3D object to the transformed first 3D object proxy; mapping color attributes from the 3D object identified in the key frame to the transformed second 3D object proxy; and generating residuals for the 3D object based on the color attributes for the transformed first 3D object proxy and the color attributes for the transformed second 3D object proxy. 5. The method of claim 1 , wherein the compressing of the frame of the video using the color prediction scheme based on the 3D object and the stored 3D object includes: encoding the first 3D object proxy using an auto encoder, wherein the transforming the first 3D object proxy includes transforming the encoded first 3D object proxy based on the 3D object identified in the frame; generating the second 3D object proxy based on the stored 3D object; encoding the second 3D object proxy using an autoencoder; identifying the 3D object in a key frame of the video; transforming the encoded second 3D object proxy based on the 3D object identified in the key frame; mapping color attributes from the 3D object to the transformed first 3D object proxy; and generating residuals for the 3D object based on the color attributes for the transformed first 3D object proxy and default color attributes for the transformed second 3D object proxy. 6. The method of claim 1 , further comprising: before storing the 3D object: identifying at least one 3D object of interest associated with the video; determining a plurality of mesh attributes associated with the 3D object of interest; determining a position associated with the 3D object of interest; determining an orientation associated with the 3D object of interest; determining a plurality of color attributes associated with the 3D object of interest; and reducing a number of variables associated with the mesh attributes for the 3D object of interest using an autoencoder. 7. The method of claim 1 , wherein compressing the frame of the video includes determining position coordinates of the 3D object relative to an origin coordinate of a background 3D object in a key frame. 8. The method of claim 1 , wherein the stored 3D object includes default color attributes, and the color prediction scheme uses the default color attributes. 9. The method of claim 1 , further comprising: identifying at least one 3D object of interest associated with the video; generating at least one stored 3D object based on the at least one 3D object of interest, each of the at least one stored 3D object being defined by a mesh including a collection of points connected by faces, each point storing at least one attribute, the at least one attribute including a position coordinate for the respective point; and storing the at least one stored 3D object in association with the video. 10. A method comprising: receiving a frame of a video; identifying a three-dimensional (3D) object in the frame; matching the 3D object to a stored 3D object; generating a 3D object proxy based on the stored 3D object; transforming the 3D object proxy based on the 3D object identified in the frame; mapping color attributes from the 3D object to the transformed 3D object proxy; compressing the frame of the video using a color prediction scheme based on the transformed 3D object proxy, a second transformed 3D object proxy and the stored 3D object; and storing the frame of the video. 11. The method of claim 10 , wherein the 3D object proxy is a first 3D object proxy and the compressing of the frame of the video using the color prediction scheme based on the 3D object and the stored 3D object includes: generating the second 3D object proxy based on the stored 3D object; identifying the 3D object in a key frame of the video; transforming the second 3D object proxy as the second transformed 3D object proxy based on the 3D object identified in the key frame; mapping color attributes from the 3D object identified in the key frame to the transformed second 3D object proxy; and generating color attributes for the 3D object based on the color attributes for the transformed first 3D object proxy and the color attributes for the transformed second 3D object proxy. 12. The method of claim 10 , wherein the 3D object proxy is a first 3D object proxy and the compressing of the frame of the video using the color prediction scheme based on the 3D object and the stored 3D object includes: encoding the first 3D object proxy using an autoencoder; transforming the encoded first 3D object proxy based on metadata associated with the 3D object; generating the second 3D object proxy based on the stored 3D object; encoding the second 3D object proxy using an autoencoder; identifying the 3D object in a key frame of the video; transforming the encoded second 3D object proxy

Assignees

Inventors

Classifications

  • characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation (H04N19/635 takes precedence) · CPC title

  • the unit being a colour or a chrominance component · CPC title

  • involving spatial prediction techniques · CPC title

  • using video object coding · CPC title

  • Predictors, e.g. intraframe, interframe coding · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12192518B2 cover?
Compressing a frame of video includes receiving a frame of a video, identifying a three dimensional (3D) object in the frame, matching the 3D object to a stored 3D object, compressing the frame of the video using a color prediction scheme based on the 3D object and the stored 3D object, and storing the compressed frame with metadata, the metadata identifying the 3D object, indicating a position…
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification H04N19/597. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Jan 07 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).