Method and apparatus for video encoding and decoding

US11412229B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11412229-B2
Application number: US-202016994183-A
Country: US
Kind code: B2
Filing date: Aug 14, 2020
Priority date: Jun 20, 2018
Publication date: Aug 9, 2022
Grant date: Aug 9, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A video coding method includes obtaining an input video frame; determining down-sampling information corresponding to the input video frame in response to determining that a target processing manner corresponding to the input video frame is a down-sampling processing manner; encoding the input video frame according to the down-sampling information to obtain encoded data corresponding to the input video frame; and adding the down-sampling information to the encoded data according to a processing range corresponding to the down-sampling information. The down-sampling information is capable of identifying at least one of a down-sampling proportion and a down-sampling method for the input video frame or identifying a rule for identifying the at least one of a down-sampling proportion and a down-sampling method.
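
The last step of the abstract, adding the down-sampling information "according to a processing range", can be pictured with a minimal sketch. Everything concrete here is an assumption made for illustration: the nested-dictionary stream layout, the `DownSamplingInfo` fields, and the names `attach_info` and `empty_stream` are hypothetical, and the patent text shown on this page does not prescribe a bitstream syntax.

```python
from dataclasses import dataclass, asdict
from enum import Enum


class ProcessingRange(Enum):
    """The four ranges named in the first claim."""
    SEQUENCE = "sequence"
    GOP = "gop"
    FRAME = "frame"
    CODED_BLOCK = "coded_block"


@dataclass(frozen=True)
class DownSamplingInfo:
    proportion: float  # hypothetical encoding: 0.5 means half resolution per axis
    method: str        # hypothetical label such as "bilinear"


def empty_stream():
    """Build a toy stream: one sequence, one GOP, one frame, one coded block."""
    block = {"header": {}}
    frame = {"header": {}, "blocks": [block]}
    gop = {"header": {}, "frames": [frame]}
    return {"header": {}, "gops": [gop]}


def attach_info(stream, info, scope, gop=0, frame=0, block=0):
    """Write `info` into the header of the unit that matches `scope`."""
    if scope is ProcessingRange.SEQUENCE:
        unit = stream
    elif scope is ProcessingRange.GOP:
        unit = stream["gops"][gop]
    elif scope is ProcessingRange.FRAME:
        unit = stream["gops"][gop]["frames"][frame]
    else:  # ProcessingRange.CODED_BLOCK
        unit = stream["gops"][gop]["frames"][frame]["blocks"][block]
    unit["header"]["down_sampling"] = asdict(info)
    return stream


if __name__ == "__main__":
    info = DownSamplingInfo(proportion=0.5, method="bilinear")
    stream = attach_info(empty_stream(), info, ProcessingRange.FRAME)
    print(stream["gops"][0]["frames"][0]["header"])
```

In this sketch, information written at a wider range (for example the sequence header) would be read once and applied to every unit inside it, which is one plausible reason to signal at the widest range that fits.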

First claim

Opening claim text (preview).

What is claimed is:

1. A video coding method, comprising: obtaining an input video frame; determining down-sampling information corresponding to the input video frame in response to determining that a target processing manner corresponding to the input video frame is a down-sampling processing manner, wherein the down-sampling information includes a down-sampling method, and the down-sampling method includes a first down-sample method with a first calculation complexity and a second down-sampling method with a second calculation complexity lower than the first calculation complexity; in response to determining a real-time requirement is a first real-time requirement, employing the first down-sampling method, and in response to determining the real-time requirement is a second real-time requirement higher than the first real-time requirement, employing the second down-sampling method; encoding the input video frame according to the down-sampling information to obtain encoded data corresponding to the input video frame; obtaining a processing range corresponding to the down-sampling information, wherein the processing range includes one or more of a sequence level range, a group of pictures (GOP) level range, a frame level range, and a coded block level range, and wherein the sequence level range corresponds to a video sequence including two or more GOPs, the GOP level range corresponds to a GOP including two or more frames, the frame level range corresponds to a frame including two or more coded blocks, and the coded block level range corresponds to a coded block; and adding the down-sampling information to the encoded data corresponding to the processing range, by: determining whether the processing range is the sequence level range, the GOP level range, the frame level range, or the coded block level range; in response to determining the processing range is the sequence level range, adding the down-sampling information to header information of the video sequence; in response to determining the processing range is the GOP level range, adding the down-sampling information to header information of the GOP; in response to determining the processing range is the frame level range, adding the down-sampling information to header information of the frame; and in response to determining the processing range is the coded block level range, adding the down-sampling information to header information of the coded block.

2. The method according to claim 1, wherein the down-sampling information includes a down-sampling proportion, and the down-sampling proportion is obtained by: obtaining a current quantization parameter (QP) estimation value corresponding to the input video frame; obtaining a QP threshold corresponding to the input video frame; and determining, according to a difference between the current QP estimation value and the QP threshold, the down-sampling proportion corresponding to the input video frame.

3. The method according to claim 2, wherein the determining, according to a difference between the current QP estimation value and the QP threshold, the down-sampling proportion corresponding to the input video frame comprises: obtaining a correspondence between a candidate difference range and a candidate down-sampling proportion; determining, from the candidate difference range according to the difference between the current QP estimation value and the QP threshold, a target difference range corresponding to the difference; and determining, from the candidate down-sampling proportion according to the target difference range and the correspondence, the down-sampling proportion corresponding to the input video frame.

4. The method according to claim 2, wherein the obtaining a QP threshold corresponding to the input video frame comprises: obtaining video frame information corresponding to the input video frame; and determining, according to the video frame information, the QP threshold corresponding to the input video frame.

5. The method according to claim 1, wherein the down-sampling information includes a down-sampling method, and the down-sampling method is determined by: calculating an image content complexity corresponding to the input video frame; obtaining a correspondence between a candidate image content complexity and a candidate down-sampling method; and determining, according to the correspondence, the down-sampling method corresponding to the image content complexity.

6. The method according to claim 1, wherein the down-sampling information includes a down-sampling method, and the down-sampling method is determined by: obtaining a current coded block of the input video frame; calculating a module gradient value corresponding to the current coded block; classifying the current coded block into a target module class in a module set according to the module gradient value, candidate module classes in the module class set respectively corresponding to different module gradient value ranges; obtaining a correspondence between a candidate module class and a candidate down-sampling method; and determining, according to the correspondence between the candidate module class and the candidate down-sampling method and the target module class, the down-sampling method corresponding to the current coded block.

7. The method according to claim 1, wherein the down-sampling information includes a down-sampling proportion, and the encoded data is obtained by: processing the input video frame according to the down-sampling proportion to obtain a current frame; obtaining a reference frame corresponding to the current frame; and encoding the current frame according to the reference frame, to obtain the encoded data corresponding to the input video frame.

8. The method according to claim 7, wherein the encoding the current frame according to the reference frame, to obtain the encoded data corresponding to the input video frame comprises: obtaining a down-sampling proportion corresponding to the current frame, and processing the reference frame according to the down-sampling proportion to obtain a target reference frame; and encoding the current frame according to the target reference frame, to obtain the encoded data corresponding to the input video frame.

9. The method according to claim 1, wherein the determining down-sampling information corresponding to the input video frame comprises: obtaining a correspondence between a candidate environmental factor and candidate down-sampling information; and obtaining a current environmental factor, and determining, according to the correspondence between the candidate environmental factor and the candidate down-sampling information, the down-sampling information corresponding to the current environmental factor.

10. The method according to claim 1, wherein the down-sampling method includes a first down-sampling proportion and a second down-sampling proportion higher than the first down-sampling proportion, and the method further comprises: determining whether the input video frame is a B frame or a P frame; and in response to determining the input video frame is the B frame, employing the second down-sampling proportion; and in response to determining the input video frame is the P frame, employing the first down-sampling proportion.

11. A video coding method, comprising: obtaining encoded data corresponding to a current frame to be decoded; determining a processing range according to the encoded data, wherein the processing range includes one or more of a sequence level range, a group of pictures (GOP) level range, a frame level range, and a coded block level range, and wherein the sequence level rang
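
Two of the selection rules in the claims above lend themselves to a short illustration: the choice of down-sampling method by real-time requirement (claim 1) and the lookup of a down-sampling proportion from the difference between the QP estimate and the QP threshold (claims 2 and 3). The sketch below is not the patented implementation. The range edges, the candidate proportions, the assumption that a larger difference maps to stronger down-sampling, and the method labels `"bicubic"` and `"nearest"` are all hypothetical values chosen for the example.

```python
from bisect import bisect_right

# Hypothetical correspondence between candidate difference ranges and
# candidate proportions (fraction of each dimension that is kept):
#   diff < 5        -> 0.75
#   5 <= diff < 10  -> 0.5
#   diff >= 10      -> 0.25
RANGE_EDGES = [5, 10]
CANDIDATE_PROPORTIONS = [0.75, 0.5, 0.25]


def select_proportion(qp_estimate, qp_threshold):
    """Pick a proportion from the QP difference (claims 2 and 3)."""
    diff = qp_estimate - qp_threshold
    # bisect_right returns the index of the range that contains diff.
    target_range = bisect_right(RANGE_EDGES, diff)
    return CANDIDATE_PROPORTIONS[target_range]


def select_method(high_real_time_requirement):
    """Pick a method by real-time requirement (claim 1).

    The higher real-time requirement gets the lower-complexity method.
    The two labels are assumed stand-ins for the claim's first and
    second down-sampling methods.
    """
    return "nearest" if high_real_time_requirement else "bicubic"


if __name__ == "__main__":
    print(select_proportion(qp_estimate=35, qp_threshold=30))
    print(select_method(high_real_time_requirement=True))
```

Keeping the correspondence in two parallel lists means the ranges can be retuned without touching the lookup logic, which fits the claim language of "obtaining a correspondence" as data rather than hard-coded branches.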

Assignees

Tencent Tech Shenzhen Co Ltd

Inventors

Classifications

  • by predictive encoding · CPC title

  • H04N19/147 · Primary

    according to rate distortion criteria (rate-distortion as a criterion for motion estimation H04N19/567) · CPC title

  • in combination with predictive coding · CPC title

  • the region being a block, e.g. a macroblock · CPC title

  • Embedding additional information in the video signal during the compression process (H04N19/517, H04N19/68, H04N19/70 take precedence) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11412229B2 cover?
A video coding method includes obtaining an input video frame; determining down-sampling information corresponding to the input video frame in response to determining that a target processing manner corresponding to the input video frame is a down-sampling processing manner; encoding the input video frame according to the down-sampling information to obtain encoded data corresponding to the inp…
Who is the assignee on this patent?
Tencent Tech Shenzhen Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04N19/147. Mapped technology areas include Electricity.
When was this patent published?
Publication date: Aug 9, 2022 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).