Method and apparatus for processing image

US11734809B2 · US · B2

Patent metadata
Publication number: US-11734809-B2
Application number: US-202117174002-A
Country: US
Kind code: B2
Filing date: Feb 11, 2021
Priority date: Apr 23, 2020
Publication date: Aug 22, 2023
Grant date: Aug 22, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure provide a method and apparatus for processing an image, and relates to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, where the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed.
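
The abstract does not say what the quality scoring model looks like; the first claim narrows it to a monotonic neural network with a small number of hidden units. As a minimal sketch only, the Python below maps a score from an original scoring domain to a target scoring domain. The way monotonicity is enforced here (softplus-constrained positive weights with tanh activations), the class name `MonotonicScoreMapper`, and all parameter values are assumptions made for illustration, not details taken from the patent.

```python
import math


def softplus(x: float) -> float:
    """Map any real number to a positive one (stable form of log(1 + e^x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


class MonotonicScoreMapper:
    """Tiny one-input, one-output network that never decreases in its input.

    Assumption: every effective weight is softplus(raw weight), hence
    positive, and tanh is an increasing activation, so the output can only
    rise when the input score rises. The source does not specify this.
    """

    def __init__(self, raw_w1, b1, raw_w2, b2):
        assert len(raw_w1) == len(b1) == len(raw_w2)
        self.raw_w1, self.b1, self.raw_w2, self.b2 = raw_w1, b1, raw_w2, b2

    def __call__(self, score: float) -> float:
        # Hidden layer: positive weight times the score, plus a free bias.
        hidden = [math.tanh(softplus(w) * score + b)
                  for w, b in zip(self.raw_w1, self.b1)]
        # Output layer: positive weights again, plus a free offset.
        return self.b2 + sum(softplus(v) * h
                             for v, h in zip(self.raw_w2, hidden))


# Hypothetical parameters with three hidden units (a deliberately small network).
mapper = MonotonicScoreMapper(
    raw_w1=[0.5, -1.0, 2.0], b1=[0.0, 0.3, -0.7],
    raw_w2=[1.0, -0.5, 0.2], b2=2.5,
)

# Scores in a hypothetical original domain, sorted from low to high.
original_scores = [0.1, 0.4, 0.4, 0.9]
target_scores = [mapper(s) for s in original_scores]
print(target_scores)
```

Because the mapping is monotonic, the ranking of images by quality is the same before and after the conversion; only the scale of the scores changes.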

First claim

Opening claim text (preview).

What is claimed is:

1. A method for processing an image, the method comprising: acquiring a value to be processed, wherein the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, wherein the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed; wherein the value to be processed is a score of the image to be processed in an original scoring domain; and processing the value to be processed by using the quality scoring model to generate the score of the image to be processed in the target scoring domain comprises: inputting the score of the image to be processed in the original scoring domain into the quality scoring model to obtain the score of the image to be processed in the target scoring domain, wherein the quality scoring model is a monotonic neural network, and a number of hidden units in the monotonic neural network is smaller than a preset threshold.

2. The method according to claim 1, wherein the quality scoring model comprises a scoring network and a monotonic neural network; and acquiring the value to be processed comprises: inputting the image to be processed into the scoring network to obtain an initial score, output from the scoring network, of the image to be processed; and processing the value to be processed by using the quality scoring model to generate the score of the image to be processed in the target scoring domain comprises: inputting the initial score into the monotonic neural network to obtain the score of the image to be processed in the target scoring domain, wherein a number of hidden units in the monotonic neural network is smaller than a preset threshold.

3. The method according to claim 2, wherein the quality scoring model comprises at least two monotonic neural networks, and different monotonic neural networks in the at least two monotonic neural networks correspond to different scoring domains; and inputting the initial score into the monotonic neural network to obtain the score of the image to be processed in the target scoring domain comprises: inputting the initial score into the at least two monotonic neural networks to obtain a score, output from each of the at least two monotonic neural networks, of the image to be processed in a scoring domain corresponding to the monotonic neural network.

4. The method according to claim 2, wherein the method further comprises: acquiring a training sample set, wherein a training sample in the training sample set comprises a sample image and a reference score of the sample image in a specified scoring domain, and the specified scoring domain and the target scoring domain are different scoring domains; inputting the sample image into the scoring network to obtain an initial score of the sample image; inputting the initial score of the sample image into a monotonic neural network to be trained to obtain a predicted score of the sample image in the specified scoring domain; and determining a loss value of the predicted score based on the reference score and the predicted score, and training the monotonic neural network to be trained by means of the loss value to obtain a trained monotonic neural network.

5. An electronic device, comprising: one or more processors; and a memory for storing one or more programs, wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to perform operations, the operations comprising: acquiring a value to be processed, wherein the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, wherein the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed; wherein the value to be processed is a score of the image to be processed in an original scoring domain; and processing the value to be processed by using the quality scoring model to generate the score of the image to be processed in the target scoring domain comprises: inputting the score of the image to be processed in the original scoring domain into the quality scoring model to obtain the score of the image to be processed in the target scoring domain wherein the quality scoring model is a monotonic neural network and a number of hidden units in the monotonic neural network is smaller than a preset threshold.

6. The electronic device according to claim 5, wherein the quality scoring model comprises a scoring network and a monotonic neural network; and acquiring the value to be processed comprises: inputting the image to be processed into the scoring network to obtain an initial score, output from the scoring network, of the image to be processed; and processing the value to be processed by using the quality scoring model to generate the score of the image to be processed in the target scoring domain comprises: inputting the initial score into the monotonic neural network to obtain the score of the image to be processed in the target scoring domain, wherein a number of hidden units in the monotonic neural network is smaller than a preset threshold.

7. The electronic device according to claim 6, wherein the quality scoring model comprises at least two monotonic neural networks, and different monotonic neural networks in the at least two monotonic neural networks correspond to different scoring domains; and inputting the initial score into the monotonic neural network to obtain the score of the image to be processed in the target scoring domain comprises: inputting the initial score into the at least two monotonic neural networks to obtain a score, output from each of the at least two monotonic neural networks, of the image to be processed in a scoring domain corresponding to the monotonic neural network.

8. The electronic device according to claim 6, wherein the operations further comprise: acquiring a training sample set, wherein a training sample in the training sample set comprises a sample image and a reference score of the sample image in a specified scoring domain, and the specified scoring domain and the target scoring domain are different scoring domains; inputting the sample image into the scoring network to obtain an initial score of the sample image; inputting the initial score of the sample image into a monotonic neural network to be trained to obtain a predicted score of the sample image in the specified scoring domain; and determining a loss value of the predicted score based on the reference score and the predicted score, and training the monotonic neural network to be trained by means of the loss value to obtain a trained monotonic neural network.

9. A non-transitory computer-readable storage medium, storing a computer program thereon, wherein the computer program, when executed by a processor, causes the processor to perform operations, the operations comprising: acquiring a value to be processed, wherein the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, wherein the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed; wherein the value to be processed is a score of the image to be processed in an original scoring domain; and processing the value to be processed by u
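
The training steps recited in claims 4 and 8 (score each sample image, pass the initial score through the monotonic network, compare with the reference score, update by the loss) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: `scoring_network` is a hypothetical stand-in that averages pixel intensities, the loss is mean squared error, the optimizer is plain gradient descent, and the sample data, learning rate, and softplus weight constraint are all invented for the example.

```python
import math


def softplus(x):
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def sigmoid(x):
    return 0.5 * (1.0 + math.tanh(0.5 * x))  # derivative of softplus


def scoring_network(image):
    """Hypothetical stand-in: mean pixel intensity scaled to [0, 1]."""
    return sum(image) / (255.0 * len(image))


def forward(params, x):
    """Monotonic net: positive (softplus) weights and tanh hidden units."""
    raw_w, b, raw_v, b2 = params
    hidden = [math.tanh(softplus(w) * x + bj) for w, bj in zip(raw_w, b)]
    return b2 + sum(softplus(v) * h for v, h in zip(raw_v, hidden)), hidden


def mse(params, data):
    return sum((forward(params, x)[0] - t) ** 2 for x, t in data) / len(data)


def train(params, data, lr=0.01, epochs=2000):
    """Full-batch gradient descent on the MSE between predicted and reference scores."""
    raw_w, b, raw_v, b2 = list(params[0]), list(params[1]), list(params[2]), params[3]
    n, k = len(data), len(raw_w)
    for _ in range(epochs):
        g_w, g_b, g_v, g_b2 = [0.0] * k, [0.0] * k, [0.0] * k, 0.0
        for x, t in data:
            y, hidden = forward((raw_w, b, raw_v, b2), x)
            dy = 2.0 * (y - t) / n  # gradient of the loss w.r.t. this prediction
            g_b2 += dy
            for j, h in enumerate(hidden):
                dz = dy * softplus(raw_v[j]) * (1.0 - h * h)  # back through tanh
                g_v[j] += dy * sigmoid(raw_v[j]) * h
                g_b[j] += dz
                g_w[j] += dz * x * sigmoid(raw_w[j])
        raw_w = [w - lr * g for w, g in zip(raw_w, g_w)]
        b = [bj - lr * g for bj, g in zip(b, g_b)]
        raw_v = [v - lr * g for v, g in zip(raw_v, g_v)]
        b2 -= lr * g_b2
    return raw_w, b, raw_v, b2


# Invented training set: (sample image as pixel list, reference score in a 1-to-5 domain).
samples = [([10, 20, 30], 1.2), ([60, 70, 80], 1.9), ([120, 130, 140], 3.0),
           ([180, 190, 200], 4.1), ([230, 240, 250], 4.8)]
data = [(scoring_network(img), ref) for img, ref in samples]

initial = ([0.0, 1.0, 2.0], [0.0, -0.5, -1.0], [0.0, 0.0, 0.0], 0.0)
initial_loss = mse(initial, data)
trained = train(initial, data)
final_loss = mse(trained, data)
print(f"loss before: {initial_loss:.3f}, after: {final_loss:.3f}")
```

Only the small monotonic network is updated here; the scoring network stays fixed, which matches the claim wording that the loss is used to train "the monotonic neural network to be trained". Training one such head per scoring domain on the same initial scores would correspond to the multi-domain arrangement of claims 3 and 7.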

Assignees

Beijing Baidu Netcom Sci & Tech Co Ltd

Inventors

Classifications

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06T7/0002 · Primary

    Inspection of images, e.g. flaw detection · CPC title

  • Architecture, e.g. interconnection topology · CPC title

  • Learning methods · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11734809B2 cover?
Embodiments of the present disclosure provide a method and apparatus for processing an image, and relates to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to b…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T7/0002. Mapped technology areas include Physics.
When was this patent published?
Publication date: Aug 22, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).