Image recognition method and apparatus, device, and computer storage medium

US11687779B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11687779-B2
Application number: US-202117208611-A
Country: US
Kind code: B2
Filing date: Mar 22, 2021
Priority date: Jun 29, 2020
Publication date: Jun 27, 2023
Grant date: Jun 27, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An image recognition method is provided, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. An implementation includes: performing five-sense-organ recognition on a preprocessed human face image and marking positions of the human facial five sense organs in the human face image, to obtain the marked human face image; determining human face images at multiple scales of the marked human face image, inputting the human face images of multiple scales into a backbone network model, and performing feature extraction, to obtain a wrinkle feature of the human face image at each of the multiple scales; and fusing the wrinkle feature at each scale that is located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image.
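
The multi-scale step and the fusion step described in the abstract can be illustrated with a small sketch. This is not the patented implementation: `toy_backbone` is a hypothetical stand-in for the backbone network model (it only measures horizontal intensity change), nearest-neighbour resizing is one possible way to produce the scaled images, and averaging is an assumed fusion rule, since the abstract does not say how features in the same area are combined.

```python
from typing import Callable, List

Image = List[List[float]]


def resize_nearest(img: Image, scale: float) -> Image:
    """Stretch (scale > 1) or retract (scale < 1) an image by nearest-neighbour sampling."""
    h, w = len(img), len(img[0])
    new_h, new_w = max(1, round(h * scale)), max(1, round(w * scale))
    return [
        [img[min(h - 1, int(y * h / new_h))][min(w - 1, int(x * w / new_w))]
         for x in range(new_w)]
        for y in range(new_h)
    ]


def toy_backbone(img: Image) -> Image:
    """Hypothetical stand-in for the backbone network.

    Returns a per-pixel score equal to the absolute horizontal intensity
    difference, used here only as a crude proxy for a wrinkle feature map.
    """
    return [
        [abs(row[min(x + 1, len(row) - 1)] - row[x]) for x in range(len(row))]
        for row in img
    ]


def fuse_multiscale(
    img: Image,
    scales: List[float],
    backbone: Callable[[Image], Image] = toy_backbone,
) -> Image:
    """Extract a feature map at each scale and fuse features covering the same area.

    Fusion by averaging is an assumption made for this sketch.
    """
    h, w = len(img), len(img[0])
    fused = [[0.0] * w for _ in range(h)]
    for s in scales:
        feat = backbone(resize_nearest(img, s))
        fh, fw = len(feat), len(feat[0])
        for y in range(h):
            for x in range(w):
                # Map the original pixel to the same relative area in the scaled map.
                fy = min(fh - 1, int(y * fh / h))
                fx = min(fw - 1, int(x * fw / w))
                fused[y][x] += feat[fy][fx] / len(scales)
    return fused


if __name__ == "__main__":
    face = [[0.0, 0.0, 1.0, 1.0] for _ in range(4)]  # toy image with one vertical edge
    for row in fuse_multiscale(face, [0.5, 1.0, 2.0]):
        print(row)
```

The fused map has the same size as the input image, so each output value combines the evidence that every scale contributes for that location.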

First claim

Opening claim text (preview).

What is claimed is:

1. An image recognition method, comprising: performing five-sense-organ recognition on a preprocessed human face image and marking positions of the human facial five sense organs in the human face image, to obtain a marked human face image; determining human face images at multiple scales of the marked human face image, inputting the human face images of multiple scales into a backbone network model, and performing feature extraction, to obtain a wrinkle feature of the human face image at each of the multiple scales; and fusing the wrinkle feature at each scale that is located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image.

2. The method according to claim 1, further comprising: performing homogenization processing on pixels of the human face image, to obtain a homogenized human face image; and performing normalization processing on pixel variances of the homogenized human face image, to obtain the preprocessed human face image.

3. The method according to claim 1, wherein the inputting the marked human face image into a backbone network model and performing feature extraction, to obtain wrinkle features of the human face image at different scales, comprises: performing multi-scale stretching and retraction processing on the marked human face image, to obtain the human face images of multiple scales; and inputting the human face images of multiple scales into the backbone network model, to obtain a wrinkle feature of the human face image at each of the multiple scales.

4. The method according to claim 3, further comprising: inputting the human face images of multiple scales into the backbone network model, to obtain a recognition result of areas of the human facial five sense organs.

5. The method according to claim 3, wherein the fusing the wrinkle features at different scales that are located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image, comprises: ignoring the wrinkle recognition result of the human face image in a case where a face area corresponding to the wrinkle recognition result of the human face image is an area of the five sense organs that has no wrinkle.

6. An image recognition apparatus, comprising: at least one processor; and a memory communicatively connected to the at least one processor, wherein the memory stores instructions executable by the at least one processor, the instructions are executed by the at least one processor to enable the at least one processor to: perform five-sense-organ recognition on a preprocessed human face image and mark positions of the human facial five sense organs in the human face image, to obtain a marked human face image; determine human face images at different scales of the marked human face image, input the human face images of multiple scales into a backbone network model, and performing feature extraction, to obtain a wrinkle feature of the human face image at each of the multiple scales; and fuse the wrinkle feature at each scale that is located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image.

7. The apparatus according to claim 6, wherein the instructions are executed by the at least one processor to further enable the at least one processor to: perform homogenization processing on pixels of the human face image, to obtain a homogenized human face image; and perform normalization processing on pixel variances of the homogenized human face image, to obtain the preprocessed human face image.

8. The apparatus according to claim 6, wherein the instructions are executed by the at least one processor to further enable the at least one processor to: perform multi-scale stretching and retraction processing on the marked human face image, to obtain the human face images of multiple scales; and input the human face images of multiple scales into the backbone network model to obtain a wrinkle feature of the human face image at each of the multiple scales.

9. The apparatus according to claim 8, wherein the instructions are executed by the at least one processor to further enable the at least one processor to input the human face images of multiple scales into the backbone network model, to obtain a recognition result of areas of the human facial five sense organs.

10. The apparatus according to claim 8, wherein the instructions are executed by the at least one processor to further enable the at least one processor to ignore the wrinkle recognition result of the human face image in a case where a face area corresponding to the wrinkle recognition result of the human face image is an area of the five sense organs that has no wrinkle.

11. A non-transitory computer-readable storage medium for storing computer instructions, wherein the computer instructions, when executed by a computer, cause the computer to: perform five-sense-organ recognition on a preprocessed human face image and mark positions of the human facial five sense organs in the human face image, to obtain a marked human face image; determine human face images at multiple scales of the marked human face image, input the human face images of multiple scales into a backbone network model, and perform feature extraction, to obtain a wrinkle feature of the human face image at each of the multiple scales; and fuse the wrinkle feature at each scale that is located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image.

12. The non-transitory computer-readable storage medium according to claim 11, wherein the computer instructions, when executed by a computer, further cause the computer to: perform homogenization processing on pixels of the human face image, to obtain a homogenized human face image; and perform normalization processing on pixel variances of the homogenized human face image, to obtain the preprocessed human face image.

13. The non-transitory computer-readable storage medium according to claim 11, wherein the computer instructions, when executed by a computer, further cause the computer to: perform multi-scale stretching and retraction processing on the marked human face image, to obtain the human face images of multiple scales; and input the human face images of multiple scales into the backbone network model, to obtain a wrinkle feature of the human face image at each of the multiple scales.

14. The non-transitory computer-readable storage medium according to claim 13, wherein the computer instructions, when executed by a computer, further cause the computer to: input the human face images of multiple scales into the backbone network model, to obtain a recognition result of areas of the human facial five sense organs.

15. The non-transitory computer-readable storage medium according to claim 13, wherein the computer instructions, when executed by a computer, further cause the computer to: ignore the wrinkle recognition result of the human face image in a case where a face area corresponding to the wrinkle recognition result of the human face image is an area of the five sense organs that has no wrinkle.
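
The preprocessing in claims 2, 7 and 12 and the suppression step in claims 5, 10 and 15 can be sketched in a few lines. The claims do not define "homogenization" or "normalization processing on pixel variances", so this sketch assumes one plausible reading: subtract the global mean, then scale to unit variance. The box format and the function names are hypothetical, and zeroing scores inside a marked facial-feature box is only one way to "ignore" a wrinkle result there.

```python
import math
from typing import List, Tuple

Image = List[List[float]]
# Hypothetical box format: (top, left, bottom, right), bottom/right exclusive.
Box = Tuple[int, int, int, int]


def homogenize(img: Image) -> Image:
    """Assumed reading of 'homogenization': subtract the global pixel mean."""
    n = sum(len(row) for row in img)
    mean = sum(sum(row) for row in img) / n
    return [[v - mean for v in row] for row in img]


def normalize_variance(img: Image, eps: float = 1e-8) -> Image:
    """Assumed reading of 'normalization of pixel variances': scale to unit variance."""
    n = sum(len(row) for row in img)
    mean = sum(sum(row) for row in img) / n
    var = sum((v - mean) ** 2 for row in img for v in row) / n
    std = math.sqrt(var + eps)  # eps avoids division by zero on flat images
    return [[v / std for v in row] for row in img]


def suppress_feature_areas(wrinkle_map: Image, feature_boxes: List[Box]) -> Image:
    """Ignore wrinkle scores that fall inside marked facial-feature areas.

    Returns a copy; the input map is left unchanged.
    """
    out = [row[:] for row in wrinkle_map]
    height, width = len(out), len(out[0])
    for top, left, bottom, right in feature_boxes:
        for y in range(max(0, top), min(height, bottom)):
            for x in range(max(0, left), min(width, right)):
                out[y][x] = 0.0
    return out


if __name__ == "__main__":
    face = [[1.0, 2.0], [3.0, 4.0]]
    print(normalize_variance(homogenize(face)))

    scores = [[1.0] * 3 for _ in range(3)]
    # Pretend the top-left 2x2 block was marked as a facial feature with no wrinkles.
    print(suppress_feature_areas(scores, [(0, 0, 2, 2)]))
```

Keeping the suppression step separate from feature extraction means the marked positions from the five-sense-organ recognition can be reused unchanged when the fused result is filtered.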

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11687779B2 cover?
An image recognition method is provided, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. An implementation includes: performing five-sense-organ recognition on a preprocessed human face image and marking positions of the human facial five sense organs in the human face image, to obtain the marked human face image; de…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06V10/806. Mapped technology areas include Physics.
When was this patent published?
Publication date Jun 27, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).