Techniques for predicting perceptual video quality

US10475172B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10475172-B2
Application numberUS-201816017929-A
CountryUS
Kind codeB2
Filing dateJun 25, 2018
Priority dateMay 11, 2015
Publication dateNov 12, 2019
Grant dateNov 12, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In one embodiment of the present invention, a quality trainer and quality calculator collaborate to establish a consistent perceptual quality metric via machine learning. In a training phase, the quality trainer leverages machine intelligence techniques to create a perceptual quality model that combines objective metrics to optimally track a subjective metric assigned during viewings of training videos. Subsequently, the quality calculator applies the perceptual quality model to values for the objective metrics for a target video, thereby generating a perceptual quality score for the target video. In this fashion, the perceptual quality model judiciously fuses the objective metrics for the target video based on the visual feedback processed during the training phase. Since the contribution of each objective metric to the perceptual quality score is determined based on empirical data, the perceptual quality score is a more accurate assessment of observed video quality than conventional objective metrics.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: selecting a set of objective metrics that includes an anti-noise signal-to-noise ratio; for each training video included in a set of training videos, receiving a subjective value for a perceptual video quality metric and a set of objective values for the set of objective metrics, wherein the subjective value and the set of objective values describe the training video; deriving a composite relationship based on a correlation between the subjective value, the set of objective values, and a measure of pixel motion within at least one of the set of training videos, wherein the composite relationship specifies a level of contribution for at least one of the set of objective metrics to the perceptual video quality metric; for a target video, calculating a first set of values for the set of objective metrics; and applying the composite relationship to the first set of values to generate an output value for the perceptual video quality metric. 2. The computer-implemented method of claim 1 , wherein deriving the composite relationship comprises performing one or more training operations on the data sets. 3. The computer-implemented method of claim 2 , wherein performing one or more training operations on a given data set comprises applying a support vector machine algorithm or an artificial neural network algorithm to the set of objective values included in the data set. 4. The computer-implemented method of claim 1 , further comprising: determining that a value included in the first set of values exceeds a predetermined threshold; and modifying the output value for the perceptual quality metric based on an adjustment factor. 5. The computer-implemented method of claim 1 , further comprising: computing a motion value based on pixel differences between two consecutive frames of the target video; determining that the motion value exceeds a predetermined threshold; and increasing the output value for the perceptual quality metric by a predetermined amount. 6. The computer-implemented method of claim 1 , wherein the set of objective metrics further includes at least one of a detail loss measure and a visual information fidelity. 7. The computer-implemented method of claim 1 , wherein the target video is derived from a source video, and calculating a first set of values for the set of objective metrics comprises calculating a first value for the anti-noise signal-to-noise ratio. 8. The computer-implemented method of claim 7 , wherein calculating a first value for the anti-noise signal-to-noise ratio comprises: applying a first low pass filter to the source video; applying a second low pass filter to the target video that is stronger than the first low pass filter; and performing one or more signal-to-noise ratio calculations based on the filtered source video and the filtered target video. 9. The computer-implemented method of claim 1 , wherein a first subjective value for the perceptual video quality metric is a human-observed score for the visual quality of a reconstructed video that is derived from the first training video. 10. The computer-implemented method of claim 1 , wherein a first training video included in the set of training videos includes at least one of compressed data and scaled data. 11. One or more non-transitory computer-readable media including instructions that, when executed by one or more processors, cause the one or more processors to perform the steps of: selecting a set of objective metrics that includes an anti-noise signal-to-noise ratio; for each training video included in a set of training videos, receiving a subjective value for a perceptual video quality metric and a set of objective values for the set of objective metrics, wherein the subjective value and the set of objective values describe the training video; deriving a composite relationship based on a correlation between the subjective value and the set of objective values, wherein the composite relationship specifies a level of contribution for at least one of the set of objective metrics to the perceptual video quality metric; for a target video, calculating a first set of values for the set of objective metrics; and applying the composite relationship to the first set of values to generate an output value for the perceptual video quality metric. 12. The non-transitory computer-readable medium of claim 11 , wherein deriving the composite relationship comprises performing one or more training operations on the data sets. 13. The non-transitory computer-readable medium of claim 12 , wherein performing one or more training operations on a given data set comprises applying a support vector machine algorithm or an artificial neural network algorithm to the set of objective values included in the data set. 14. The non-transitory computer-readable medium of claim 11 , further comprising: determining that a value included in the first set of values exceeds a predetermined threshold; and modifying the output value for the perceptual quality metric based on an adjustment factor. 15. The non-transitory computer-readable medium of claim 11 , further comprising: computing a motion value based on pixel differences between two consecutive frames of the target video; determining that the motion value exceeds a predetermined threshold; and increasing the output value for the perceptual quality metric by a predetermined amount. 16. The non-transitory computer-readable medium of claim 11 , wherein the set of objective metrics further includes at least one of a detail loss measure and a visual information fidelity. 17. The non-transitory computer-readable medium of claim 11 , wherein the target video is derived from a source video, and calculating a first set of values for the set of objective metrics comprises calculating a first value for the anti-noise signal-to-noise ratio. 18. The non-transitory computer-readable medium of claim 17 , wherein calculating a first value for the anti-noise signal-to-noise ratio comprises: applying a first low pass filter to the source video; applying a second low pass filter to the target video that is stronger than the first low pass filter; and performing one or more signal-to-noise ratio calculations based on the filtered source video and the filtered target video. 19. The non-transitory computer-readable medium of claim 11 , wherein a first subjective value for the perceptual video quality metric is a human-observed score for the visual quality of a reconstructed video that is derived from the first training video. 20. The non-transitory computer-readable medium of claim 11 , wherein a first training video included in the set of training videos includes at least one of compressed data and scaled data. 21. A system, comprising: one or more memories including instructions; and one or more processors that are coupled to the one or more memories and, when executing the instructions, are configured to perform the steps of: selecting a set of one or more objective metrics representing video characteristics; for each training video included in a set of training videos, receiving a subjective value for a perceptual video quality metric and a set of one or more objective values for the set of one or more objective metrics, wherein the subjective value and the set of objective values describe the training video; deriving a composite relationship based on a correlation between the subjective value and the set o

Assignees

Inventors

Classifications

  • Learning process for intelligent management, e.g. learning user preferences for recommending movies {(services using the results of monitoring in broadcast systems H04H60/61)} · CPC title

  • G06T7/0002Primary

    Inspection of images, e.g. flaw detection · CPC title

  • using neural networks, e.g. processing the feedback provided by the user · CPC title

  • G06T9/002Primary

    using neural networks · CPC title

  • Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion (use of rate-distortion criteria H04N19/147) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10475172B2 cover?
In one embodiment of the present invention, a quality trainer and quality calculator collaborate to establish a consistent perceptual quality metric via machine learning. In a training phase, the quality trainer leverages machine intelligence techniques to create a perceptual quality model that combines objective metrics to optimally track a subjective metric assigned during viewings of trainin…
Who is the assignee on this patent?
Netflix Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/0002. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 12 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).