Method and apparatus for processing a video

US10897620B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10897620-B2
Application numberUS-201916567976-A
CountryUS
Kind codeB2
Filing dateSep 11, 2019
Priority dateDec 14, 2018
Publication dateJan 19, 2021
Grant dateJan 19, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure relate to a method and apparatus for processing a video. The method can include: acquiring a to-be-processed video, and dividing the video into at least one video segment according to a scene; for a video segment of the at least one video segment, extracting a feature of the video segment; inputting the feature and a target video quality into a pre-trained encoding parameter prediction model, to obtain an encoding parameter of the video segment, the encoding parameter prediction model being used to represent a corresponding relationship between the feature of the video segment and the encoding parameter under the target video quality; and performing an encoding process on the video according to the encoding parameter corresponding to the video segment of the at least one video segment, to generate an encoded video.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for processing a video, comprising: acquiring a to-be-processed video, and dividing the video into at least one video segment according to a scene; for a video segment of the at least one video segment, extracting a feature of the video segment; inputting the feature and a target video quality into a pre-trained encoding parameter prediction model, to obtain an encoding parameter of the video segment, the encoding parameter prediction model being used to represent a corresponding relationship between the feature of the video segment and the encoding parameter under the target video quality; and performing an encoding process on the video according to the encoding parameter corresponding to the video segment of the at least one video segment, to generate an encoded video; wherein the encoding parameter prediction model is trained and obtained by: acquiring a sample video set, and annotating an encoding parameter and a video quality of a sample video in the sample video set; extracting, for the sample video in the sample video set, a feature of the sample video, and determining a complexity factor of the sample video according to the feature of the sample video; classifying the sample video in the sample video set into at least one category according to the complexity factor; and performing, for a category of the at least one category, a regression on the encoding parameter and the video quality of the sample video belonging to the category, to obtain a relationship curve between the encoding parameter and the video quality. 2. The method according to claim 1 , further comprising: determining a video quality of the encoded video; and outputting, in response to the determined video quality not matching the target video quality, predetermined information. 3. The method according to claim 1 , wherein the extracting a feature of the video segment includes: decoding the video segment, to extract a time domain feature and/or a frequency domain feature of the video segment. 4. The method according to claim 1 , wherein the annotating an encoding parameter and a video quality of a sample video in the sample video set includes: encoding, for the sample video in the sample video set, the sample video according to a predetermined constant quality value, and annotating the encoding parameter and the video quality during the encoding. 5. The method according to claim 1 , wherein the complexity factor is related to a pixel depth of the encoded sample video. 6. An apparatus for processing a video, comprising: at least one processor; and a memory storing instructions, the instructions when executed by the at least one processor, causing the at least one processor to perform operations, the operations comprising: acquiring a to-be-processed video, and divide the video into at least one video segment according to a scene; extracting, for a video segment of the at least one video segment, a feature of the video segment; inputting the feature and a target video quality into a pre-trained encoding parameter prediction model, to obtain an encoding parameter of the video segment, the encoding parameter prediction model being used to represent a corresponding relationship between the feature of the video segment and the encoding parameter under the target video quality; and performing an encoding process on the video according to the encoding parameter corresponding to the video segment of the at least one video segment, to generate an encoded video; wherein the encoding parameter prediction model is trained and obtained by: acquiring a sample video set, and annotating an encoding parameter and a video quality of a sample video in the sample video set; extracting, for the sample video in the sample video set, a feature of the sample video, and determining a complexity factor of the sample video according to the feature of the sample video; classifying the sample video in the sample video set into at least one category according to the complexity factor; and performing, for a category of the at least one category, a regression on encoding parameter and the video quality of the sample video belonging to the category, to obtain a relationship curve between the encoding parameter and the video quality. 7. The apparatus according to claim 6 , the operations further comprising: determining a video quality of the encoded video; and outputting, in response to the determined video quality not matching the target video quality, predetermined information. 8. The apparatus according to claim 6 , wherein the extracting a feature of the video segment includes: decoding the video segment, to extract a time domain feature and/or a frequency domain feature of the video segment. 9. The apparatus according to claim 6 , wherein the annotating an encoding parameter and a video quality of a sample video in the sample video set includes: encoding, for the sample video in the sample video set, the sample video according to a predetermined constant quality value, and annotating the encoding parameter and the video quality during the encoding. 10. The apparatus according to claim 6 , wherein the complexity factor is related to a pixel depth of the encoded sample video. 11. A non-transitory computer readable medium, storing a computer program, wherein the computer program, when executed by a processor, causes the processor to perform operations, the operations comprising: acquiring a to-be-processed video, and dividing the video into at least one video segment according to a scene; for a video segment of the at least one video segment, extracting a feature of the video segment; inputting the feature and a target video quality into a pre-trained encoding parameter prediction model, to obtain an encoding parameter of the video segment, the encoding parameter prediction model being used to represent a corresponding relationship between the feature of the video segment and the encoding parameter under the target video quality; and performing an encoding process on the video according to the encoding parameter corresponding to the video segment of the at least one video segment, to generate an encoded video; wherein the encoding parameter prediction model is trained and obtained by: acquiring a sample video set, and annotating an encoding parameter and a video quality of a sample video in the sample video set; extracting, for the sample video in the sample video set, a feature of the sample video, and determining a complexity factor of the sample video according to the feature of the sample video; classifying the sample video in the sample video set into at least one category according to the complexity factor; and performing, for a category of the at least one category, a regression on encoding parameter and video quality of sample video belonging to the category, to obtain a relationship curve between the encoding parameter and the video quality.

Assignees

Inventors

Classifications

  • H04N19/103Primary

    Selection of coding mode or of prediction mode · CPC title

  • Evaluation of the quality of the acquired pattern · CPC title

  • H04N19/154Primary

    Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion (use of rate-distortion criteria H04N19/147) · CPC title

  • Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes · CPC title

  • Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10897620B2 cover?
Embodiments of the present disclosure relate to a method and apparatus for processing a video. The method can include: acquiring a to-be-processed video, and dividing the video into at least one video segment according to a scene; for a video segment of the at least one video segment, extracting a feature of the video segment; inputting the feature and a target video quality into a pre-trained …
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tec
What technology area does this patent fall under?
Primary CPC classification H04N19/103. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Jan 19 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).