Image recognition method and apparatus, electronic device, and readable storage medium using an update on body extraction parameter and alignment parameter

US11417095B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11417095-B2
Application numberUS-201916685526-A
CountryUS
Kind codeB2
Filing dateNov 15, 2019
Priority dateDec 14, 2017
Publication dateAug 16, 2022
Grant dateAug 16, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An image recognition method is provided. The method includes obtaining predicted locations of joints of a target person in a to-be-recognized image based on a joint prediction model, where the joint prediction model is pre-constructed by: obtaining a plurality of sample images; inputting training features of the sample images and a body model feature to a neural network and obtaining predicted locations of joints in the sample images outputted by the neural network; updating a body extraction parameter and an alignment parameter; and inputting the training features of the sample images and the body model feature to the neural network to obtain the joint prediction model.

First claim

Opening claim text (preview).

What is claimed is: 1. An image recognition method, applied to an electronic device, and comprising: obtaining a to-be-recognized feature of a to-be-recognized image, the to-be-recognized image comprising a target person; obtaining a preset body model feature of a body frame image, the preset body model feature comprising locations of joints in the body frame image; inputting the to-be-recognized feature and the preset body model feature to a pre-constructed joint prediction model, the joint prediction model is pre-constructed by: obtaining a plurality of positive sample images and a plurality of negative sample images respectively, to obtain a plurality of sample images, the positive sample image comprising a person, and the negative sample image comprising no person; obtaining training features of the sample images respectively; inputting the training features of the sample images and the preset body model feature to a neural network and obtaining predicted locations of joints in the sample images outputted by the neural network; comparing the predicted locations of the joints with real locations of the joints respectively, to obtain comparison results; updating a body extraction parameter and an alignment parameter based on the comparison results, the body extraction parameter being used to extract the person from a background environment in the sample images and the alignment parameter being used to represent respective correspondences between locations of joints in the preset body model feature and the predicted locations of the joints in the sample images; in response to determining the plurality of positive sample images reflects a sitting posture, assigning a higher weight ratio to lower body joint location data than to upper body joint location data; and inputting the training features of the sample images and the preset body model feature to the neural network to obtain the joint prediction model; and obtaining predicted locations of joints of the target person in the to-be-recognized image based on the joint prediction model. 2. The image recognition method according to claim 1 , further comprising: obtaining a body posture of the target person in the to-be-recognized image outputted by the joint prediction model, the body posture comprising the predicted locations of joints of the target person in the to-be-recognized image. 3. The image recognition method according to claim 1 , wherein updating the body extraction parameter and the alignment parameter comprises: updating the body extraction parameter in response to determining any of the predicted locations of the joints is located in a background environment area of the sample images; and updating the alignment parameter in response to determining any of the predicted locations of the joints is different from its corresponding real location. 4. The image recognition method according to claim 1 , wherein obtaining the to-be-recognized feature comprises: obtaining multi-frame video images in a video and using a frame of the video image as the to-be-recognized image, to obtain the to-be-recognized feature of the to-be-recognized image. 5. The image recognition method according to claim 4 , further comprising: obtaining tracking information of the target person based on the predicted locations of joints in the multi-frame video images. 6. The image recognition method according to claim 1 , wherein the to-be-recognized feature includes pixel values of color channels. 7. The image recognition method according to claim 1 , wherein a resolution of the to-be-recognized image is M1×N1, both M1 and N1 are positive integers, and a pixel value of a pixel in the i th row and the j th column in the to-be-recognized image is (R ij , G ij , B ij ), a matrix representation of the to-be-recognized feature of the to-be-recognized image is: [ ( R 1 , 1 , G 1 , 1 , B 1 , 1 ) … ( R 1 , N ⁢ ⁢ 1 , G 1 , N ⁢ ⁢ 1 , B 1 , N ⁢ ⁢ 1 ) ⋮ ⋱ ⋮ ( R M ⁢ ⁢ 1 ,

Assignees

Inventors

Classifications

  • G06V10/764Primary

    using classification, e.g. of video objects · CPC title

  • using neural networks · CPC title

  • G06V20/40Primary

    in video content (extracting overlay text G06V20/62; video retrieval G06F16/70; processing of video elementary streams in video servers H04N21/234; processing of video elementary streams in video clients H04N21/44) · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11417095B2 cover?
An image recognition method is provided. The method includes obtaining predicted locations of joints of a target person in a to-be-recognized image based on a joint prediction model, where the joint prediction model is pre-constructed by: obtaining a plurality of sample images; inputting training features of the sample images and a body model feature to a neural network and obtaining predicted …
Who is the assignee on this patent?
Tencent Tech Shenzhen Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06V10/764. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 16 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).