Training A Non-Reference Video Scoring System With Full Reference Video Scores

US2019258902A1 · US · A1

Patent metadata
Publication number: US-2019258902-A1
Application number: US-201816216699-A
Country: US
Kind code: A1
Filing date: Dec 11, 2018
Priority date: Feb 16, 2018
Publication date: Aug 22, 2019
Grant date: (none listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The disclosed technology teaches training a NR VMOS score generator by generating synthetically impaired images from FR video using filters tuned to generate impaired versions and applying a FR VMOS generator to pairs of unimpaired FR images from the FR video and the impaired versions of the FR images to create ground truth scores for the impaired versions. The disclosed method also includes training by machine learning model an image evaluation classifier using the ground truth scores and the impaired versions to generate NR VMOS scores, and storing coefficients of the image evaluation classifier for use as the NR VMOS score generator. Also disclosed is generating a NR VMOS score by invoking the trained NR VMOS score generator, with stored coefficients generated by feeding the trained NR VMOS score generator with images captured from scenes in a video to be scored, and evaluating the images to generate NR VMOS scores.
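The training flow described in the abstract can be sketched in a few lines of Python. This is a minimal toy illustration, not the patent's implementation: the random-noise frames, the horizontal box blur used as the impairment filter, the PSNR-based `fr_score` standing in for a real FR VMOS generator, the single sharpness feature, and the least-squares line standing in for the SVM or CNN named in the claims are all assumptions introduced here.

```python
import json
import math
import random


def make_frame(size, seed):
    """Hypothetical stand-in for an unimpaired frame taken from FR video."""
    rng = random.Random(seed)
    return [[rng.random() for _ in range(size)] for _ in range(size)]


def impair(frame, radius):
    """Toy impairment filter: horizontal box blur (larger radius = stronger)."""
    blurred = []
    for row in frame:
        new_row = []
        for x in range(len(row)):
            window = row[max(0, x - radius):x + radius + 1]
            new_row.append(sum(window) / len(window))
        blurred.append(new_row)
    return blurred


def fr_score(reference, impaired):
    """Placeholder FR metric: PSNR mapped onto a 1..5 opinion-like scale."""
    count = len(reference) * len(reference[0])
    mse = sum((a - b) ** 2
              for ref_row, imp_row in zip(reference, impaired)
              for a, b in zip(ref_row, imp_row)) / count
    if mse == 0:
        return 5.0
    psnr = 10 * math.log10(1.0 / mse)  # pixel values lie in [0, 1]
    return max(1.0, min(5.0, 1.0 + psnr / 10.0))


def sharpness(frame):
    """Single no-reference feature: mean absolute horizontal difference."""
    total = sum(abs(row[x + 1] - row[x])
                for row in frame for x in range(len(row) - 1))
    return total / (len(frame) * (len(frame[0]) - 1))


def train(features, scores):
    """Least-squares line fit (a stand-in for the SVM/CNN in the claims)."""
    n = len(features)
    mean_x, mean_y = sum(features) / n, sum(scores) / n
    sxx = sum((x - mean_x) ** 2 for x in features)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(features, scores))
    slope = sxy / sxx
    return {"slope": slope, "intercept": mean_y - slope * mean_x}


def nr_score(frame, coeffs):
    """Score a frame without its reference, using only stored coefficients."""
    raw = coeffs["intercept"] + coeffs["slope"] * sharpness(frame)
    return max(1.0, min(5.0, raw))


# 1. Generate impaired versions and 2. label them with the FR scorer.
features, scores = [], []
for seed in range(8):
    reference = make_frame(32, seed)
    for radius in (1, 2, 4):
        impaired = impair(reference, radius)
        features.append(sharpness(impaired))
        scores.append(fr_score(reference, impaired))

# 3. Train on (impaired frame, ground truth score) pairs.
# 4. Store the coefficients; a JSON string stands in for persistent storage.
stored = json.dumps(train(features, scores))
coeffs = json.loads(stored)
```

The point of the sketch is the data flow: the reference frame is needed only while the labels are created, and `nr_score` afterwards works from the impaired frame and the stored coefficients alone.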

First claim

Opening claim text (preview).

We claim as follows:

1. A tangible non-transitory computer readable storage media impressed with computer program instructions that, when executed on a processor, cause the processor to implement a method of training a no-reference video mean opinion score (abbreviated NR VMOS) score generator, the method including: generating synthetically impaired images from full reference (abbreviated FR) video, using filters tuned to generate impaired versions of unimpaired FR images from the FR video; applying a FR video mean opinion score (abbreviated FR VMOS) generator to pairs of the unimpaired FR images and the impaired versions of the FR images to create ground truth scores for the impaired versions; training by machine learning model an image evaluation classifier using the ground truth scores and the impaired versions to generate NR VMOS scores; and storing coefficients of the image evaluation classifier for use as the NR VMOS score generator.

2. The tangible non-transitory computer readable storage media of claim 1, wherein: the unimpaired FR images from the FR video are selected from a series of scenes; and the filters tuned to generate impaired versions from the FR video approximate effects of constrained video delivery bandwidth.

3. The tangible non-transitory computer readable storage media of claim 1, further including generating 50,000 to 10,000,000 synthetically impaired images for use in the applying and the training.

4. The tangible non-transitory computer readable storage media of claim 1, further including generating 100,000 to 1,000,000 synthetically impaired images for use in the applying and the training.

5. The tangible non-transitory computer readable storage media of claim 1, wherein the machine learning model is a support vector machine (abbreviated SVM) model.

6. The tangible non-transitory computer readable storage media of claim 1, wherein the machine learning model is a convolutional neural network (abbreviated CNN) model.

7. A computer-implemented method for training a no-reference video mean opinion score (abbreviated NR VMOS) score generator, including executing on a processor the program instructions from the non-transitory computer readable storage media of claim 1, to implement the generating, applying, training and storing.

8. A computer-implemented method for training a no-reference video mean opinion score (abbreviated NR VMOS) score generator, including executing on a processor the program instructions from the non-transitory computer readable storage media of claim 2, to implement the generating, applying, training and storing.

9. A computer-implemented method for training a no-reference video mean opinion score (abbreviated NR VMOS) score generator, including executing on a processor the program instructions from the non-transitory computer readable storage media of claim 5, to implement the generating, applying, training and storing.

10. A computer-implemented method for training a no-reference video mean opinion score (abbreviated NR VMOS) score generator, including executing on a processor the program instructions from the non-transitory computer readable storage media of claim 6, to implement the generating, applying, training and storing.

11. A system for training a no-reference video mean opinion score (abbreviated NR VMOS) score generator, the system including a processor, memory coupled to the processor, and computer instructions from the non-transitory computer readable storage media of claim 1 loaded into the memory.

12. The system of claim 11, wherein: the unimpaired FR images from the FR video are selected from a series of scenes; and the filters tuned to generate impaired versions from the FR video approximate effects of constrained video delivery bandwidth.

13. The system of claim 11, wherein the machine learning model is a support vector machine (abbreviated SVM) model.

14. The system of claim 11, wherein the machine learning model is a convolutional neural network (abbreviated CNN) model.

15. A tangible non-transitory computer readable storage media impressed with computer program instructions that, when executed a processor, cause the processor to implement a method of generating a no-reference video mean opinion score (abbreviated NR VMOS) using a trained NR VMOS score generator, the method including: invoking the trained NR VMOS score generator that includes stored coefficients generated by training an image evaluation classifier using unimpaired and impaired images from a full reference (abbreviated FR) video; feeding the trained NR VMOS score generator with at least three images captured from different scenes in a video sequence to be scored; evaluating the at least three images to generate NR VMOS scores; and combining the NR VMOS scores from the least three images to generate a sequence NR VMOS score for the video sequence.

16. The tangible non-transitory computer readable storage media of claim 15, wherein the at least three images are separated by at least three seconds of video sequence between respective images.

17. The tangible non-transitory computer readable storage media of claim 15, wherein the video sequence NR VMOS score for the video sequence satisfies a predetermined correlation with standards-based FR VMOS scores.

18. A system for generating a no-reference video mean opinion score (abbreviated NR VMOS) using a trained NR VMOS score generator, the system including a processor, memory coupled to the processor, and computer instructions from the non-transitory computer readable storage media of claim 15 loaded into the memory.

19. A computer-implemented method for generating a no-reference video mean opinion score (abbreviated NR VMOS) using a trained NR VMOS score generator, including executing on a processor the program instructions from the non-transitory computer readable storage media of claim 15.
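The scoring side of claims 15 and 16 (at least three images from different scenes, at least three seconds apart, combined into one sequence score) might look like the sketch below. The function names are hypothetical, the scene start times are assumed to come from some external scene detector, and the plain arithmetic mean is an assumption, since the claims do not say how the per-image scores are combined.

```python
from statistics import mean

MIN_FRAMES = 3       # "at least three images" (claim 15)
MIN_GAP_SECONDS = 3  # "at least three seconds" between images (claim 16)


def pick_sample_times(scene_starts, min_gap=MIN_GAP_SECONDS):
    """Keep one timestamp per scene, skipping scenes that start too soon
    after the previously kept timestamp."""
    picks = []
    for start in sorted(scene_starts):
        if not picks or start - picks[-1] >= min_gap:
            picks.append(start)
    if len(picks) < MIN_FRAMES:
        raise ValueError("need at least three sufficiently separated scenes")
    return picks


def sequence_score(frame_scores):
    """Combine per-image NR VMOS scores into one score for the sequence.
    The mean is an assumed combining rule, chosen for simplicity."""
    if len(frame_scores) < MIN_FRAMES:
        raise ValueError("need at least three per-image scores")
    return mean(frame_scores)


# Usage: scene starts in seconds, then illustrative per-image scores.
times = pick_sample_times([0.0, 1.5, 4.0, 9.0, 10.0])
overall = sequence_score([4.0, 3.0, 5.0])
```

Other combining rules (a median, or a minimum to emphasize the worst scene) would fit the same interface; the sketch fixes only the sampling constraints that the claims state.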

Assignees

Spirent Communications Inc

Inventors

Classifications

  • G06T7/0002 (Primary)

    Inspection of images, e.g. flaw detection · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • Validation; Performance evaluation; Active pattern learning techniques · CPC title

  • Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019258902A1 cover?
It covers training a no-reference video mean opinion score (NR VMOS) generator: synthetically impaired images are produced from full reference (FR) video, an FR VMOS generator scores each unimpaired and impaired pair to create ground truth, and an image evaluation classifier is trained on those scores. It also covers using the trained generator to score images captured from scenes in a video. See the Abstract section for the official text.
Who is the assignee on this patent?
Spirent Communications Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/0002. Mapped technology areas include Physics.
When was this patent published?
Publication date Aug 22, 2019 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).