What technology area does this patent fall under?

Primary CPC classification G06N3/09. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Apr 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Electronic device for performing video quality assessment, and operation method of the electronic device

US12266165B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12266165-B2
Application number	US-202217824587-A
Country	US
Kind code	B2
Filing date	May 25, 2022
Priority date	May 25, 2021
Publication date	Apr 1, 2025
Grant date	Apr 1, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An electronic device is provided. The electronic device includes a memory storing one or more instructions, and a processor configured to execute the one or more instruction stored in the memory. The processor is configured to execute the one or more instructions to obtain a subjective assessment score for each of a plurality of sub-regions included in an input frame, the subjective assessment score being a Mean Opinion Score (MOS); obtain a location weight for each of the plurality of sub-regions, the location weight indicating characteristics according to a location of a display; obtain a weighted assessment score for each of the plurality of sub-regions, based on the subjective assessment score for each of the plurality of sub-regions and the location weight for each of the plurality of sub-regions; and obtain a final quality score for the entire video frame, based on the weighted assessment score for each of the plurality of sub-regions.

First claim

Opening claim text (preview).

What is claimed is: 1. An electronic device comprising: a memory storing one or more instructions; and a processor configured to execute the one or more instructions stored in the memory to: obtain a subjective assessment score for each of a plurality of sub-regions included in an input frame, the subjective assessment score being a Mean Opinion Score (MOS); obtain a location weight for each of the plurality of sub-regions based on the subjective assessment score for each of the plurality of sub-regions and a subjective assessment score for the input frame, the location weight indicating characteristics according to a location of a display; obtain a weighted assessment score for each of the plurality of sub-regions, based on the subjective assessment score for each of the plurality of sub-regions and the location weight for each of the plurality of sub-regions; and obtain a final quality score for the entire input frame, based on the weighted assessment score for each of the plurality of sub-regions. 2. The electronic device of claim 1 , wherein the processor is further configured to execute the one or more instructions to predict the subjective assessment score for each of the plurality of sub-regions included in the input frame, by using a first neural network trained to learn, from a video frame received, a subjective assessment score for each of the plurality of sub-regions included in the video frame. 3. The electronic device of claim 2 , wherein the first neural network is trained to allow the subjective assessment score for each of the plurality of sub-regions included in the input frame to be equal to a Ground Truth (GT) subjective assessment score for the entire input frame, the GT subjective assessment score being a GT MOS. 4. The electronic device of claim 2 , wherein the processor is further configured to execute the one or more instructions to predict the location weight for each of the plurality of sub-regions from the subjective assessment score for each of the plurality of sub-regions by using a second neural network, and the second neural network is trained to predict a weight corresponding to a difference between the subjective assessment score for each sub-region and a Ground Truth (GT) subjective assessment score for the entire input frame as the location weight for each sub-region, from the subjective assessment score for each of the plurality of sub-regions included in the input frame predicted through the first neural network. 5. The electronic device of claim 4 , wherein the second neural network is trained to allow a mean value of weighted assessment scores obtained by multiplying the subjective assessment score for each of the plurality of sub-regions included in the input frame by the location weight to be equal to the GT subjective assessment score for the entire input frame. 6. The electronic device of claim 1 , wherein the processor is further configured to execute the one or more instructions to obtain the location weight for each of the plurality of sub-regions from the memory. 7. The electronic device of claim 6 , wherein the location weight for each of the plurality of sub-regions is predicted through a second neural network and stored in the memory, and the second neural network is trained to predict a weight corresponding to a difference between the subjective assessment score for each sub-region and a Ground Truth (GT) subjective assessment score for the entire input frame as the location weight for each sub-region, from the subjective assessment score for each of the plurality of sub-regions included in the input frame received, and the second neural network is trained to allow a mean value of weighted assessment scores obtained by multiplying the subjective assessment score for each of the plurality of sub-regions by the location weight to be equal to the GT subjective assessment score for the entire input frame. 8. The electronic device of claim 1 , wherein the processor is further configured to execute the one or more instructions to obtain the weighted assessment score for each respective sub-region of the plurality of sub-regions by multiplying the subjective assessment score for the respective sub-region by the location weight for the respective sub-region. 9. The electronic device of claim 1 , wherein the processor is further configured to execute the one or more instructions to: obtain high-complexity information indicating a region of interest from the input frame; and obtain the final quality score for the entire input frame based on the weighted assessment score and the high-complexity information. 10. The electronic device of claim 9 , wherein the high-complexity information includes at least one of speaker identification information, semantic segmentation information, object detection information, or saliency map information. 11. A video quality assessment method performed by an electronic device, the video quality assessment method comprising: obtaining a subjective assessment score for each of a plurality of sub-regions included in an input frame, the subjective assessment score being a Mean Opinion Score (MOS); obtaining a location weight for each of the plurality of sub-regions based on the subjective assessment score for each of the plurality of sub-regions and a subjective assessment score for the input frame, the location weight indicating characteristics according to a location of a display; obtaining a weighted assessment score for each of the plurality of sub-regions, based on the subjective assessment score for each of the plurality of sub-regions and the location weight for each of the plurality of sub-regions; and obtaining a final quality score for the entire input frame, based on the weighted assessment score for each of the plurality of sub-regions. 12. The video quality assessment method of claim 11 , wherein the obtaining of the subjective assessment score for each of the plurality of sub-regions included in the input frame comprises predicting the subjective assessment score for each of the plurality of sub-regions, by using a first neural network trained to learn, from a video frame received a subjective assessment score for each of the plurality of sub-regions included in the video frame. 13. The video quality assessment method of claim 12 , wherein the first neural network is trained to allow the subjective assessment score for each of the plurality of sub-regions included in the input frame to be equal to a Ground Truth (GT) subjective assessment score for the entire input frame, the GT subjective assessment score being a GT MOS. 14. The video quality assessment method of claim 12 , wherein the obtaining of the location weight for each of the plurality of sub-regions comprises predicting the location weight for each of the plurality of sub-regions from the subjective assessment score for each of the plurality of sub-regions by using a second neural network, and the second neural network is trained to predict a weight corresponding to a difference between the subjective assessment score for each sub-region and a Ground Truth (GT) subjective assessment score for the entire input frame as the location weight for each sub-region, from the subjective assessment score for each of the plurality of sub-regions included in the input frame predicted through the first neural network. 15. The video quality assessment method of claim 14 , wherein the second neural network is trained to allow a mean value of weighted assessment scores obtained by multiplying the subjective assessment score for each of the plurality of sub-regions included in the input f

Assignees

Samsung Electronics Co Ltd

Inventors

Classifications

G06N3/0464
Convolutional networks [CNN, ConvNet] · CPC title
G06N3/09Primary
Supervised learning · CPC title
G06N3/045
Combinations of networks · CPC title
G06V10/82
using neural networks · CPC title
G06V20/40
in video content (extracting overlay text G06V20/62; video retrieval G06F16/70; processing of video elementary streams in video servers H04N21/234; processing of video elementary streams in video clients H04N21/44) · CPC title

Patent family

Related publications grouped by family.

View patent family 84229025

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12266165B2 cover?: An electronic device is provided. The electronic device includes a memory storing one or more instructions, and a processor configured to execute the one or more instruction stored in the memory. The processor is configured to execute the one or more instructions to obtain a subjective assessment score for each of a plurality of sub-regions included in an input frame, the subjective assessment …
Who is the assignee on this patent?: Samsung Electronics Co Ltd
What technology area does this patent fall under?: Primary CPC classification G06N3/09. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Apr 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Apparatuses and methods for performing artificial intelligence encoding and artificial intelligence decoding on image

Video quality measurement for virtual cameras in volumetric immersive media

Saliency-weighted video quality assessment

Method for encoding/decoding image, and device therefor

Image quality objective evaluation method based on manifold feature similarity

Mobile video quality prediction systems and methods

Frequently asked questions