Who is the assignee on this patent?

Beijing Baidu Netcom Sci & Tech Co Ltd

What technology area does this patent fall under?

Primary CPC classification G06T7/73. Mapped technology areas include Physics.

When was this patent published?

Publication date: Dec 16, 2021 (kind code A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Method and apparatus for positioning key point, device, and storage medium

US2021390731A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2021390731-A1
Application number	US-202117201665-A
Country	US
Kind code	A1
Filing date	Mar 15, 2021
Priority date	Jun 12, 2020
Publication date	Dec 16, 2021
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and apparatus for positioning a key point, a device, and a storage medium are provided. The method may include: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.
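The flow in the abstract can be illustrated with a minimal Python sketch. It assumes the first feature map has already been reduced to a 2D grid of scores for one key point and that the offset has already been predicted from the second feature map. The names `initial_position` and `final_position`, the toy grid, and the offset values are all hypothetical and do not come from the patent.

```python
from typing import List, Tuple

Grid = List[List[float]]  # indexed [y][x]


def initial_position(score_grid: Grid) -> Tuple[int, int]:
    """Return (x, y) of the highest score; ties keep the first hit in row-major order."""
    best_x, best_y = 0, 0
    best = score_grid[0][0]
    for y, row in enumerate(score_grid):
        for x, value in enumerate(row):
            if value > best:
                best, best_x, best_y = value, x, y
    return best_x, best_y


def final_position(initial: Tuple[int, int],
                   offset: Tuple[float, float]) -> Tuple[float, float]:
    """Add the predicted offset to the coarse initial position."""
    return (initial[0] + offset[0], initial[1] + offset[1])


# Toy score grid standing in for what is derived from the first feature map.
score_grid = [
    [0.0, 0.1, 0.0, 0.0],
    [0.1, 0.9, 0.2, 0.0],
    [0.0, 0.2, 0.1, 0.0],
]
# Toy offset standing in for what is derived from the second feature map.
offset = (0.25, -0.5)

init = initial_position(score_grid)
final = final_position(init, offset)
print(init, final)  # (1, 1) (1.25, 0.5)
```

The point of the two-stage design, as the abstract describes it, is that the coarse position and the refinement come from different feature maps and are combined by simple addition.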

First claim

Opening claim text (preview).

What is claimed is:

1. A method for positioning a key point, comprising: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.

2. The method according to claim 1, wherein the extracting the first feature map and the second feature map of the to-be-positioned image comprises: inputting a to-be-positioned feature map into a main network to output an initial feature map of the to-be-positioned image; and inputting the initial feature map into a first sub-network and a second sub-network respectively to output the first feature map and the second feature map, wherein the first sub-network and the second sub-network are two different branches of the main network.

3. The method according to claim 1, wherein the determining, based on the first feature map, the initial position of the key point in the to-be-positioned image comprises: generating, based on the first feature map, a heat map of the key point in the to-be-positioned image; and determining, based on a heat value of a point on the heat map, the initial position of the key point.

4. The method according to claim 3, wherein the generating, based on the first feature map, the heat map of the key point in the to-be-positioned image comprises: performing 1×1 convolution on the first feature map to obtain the heat map, wherein channels of the heat map correspond to key points one to one.

5. The method according to claim 1, wherein the determining, based on the second feature map, the offset of the key point comprises: extracting, based on the initial position of the key point, a feature from a corresponding position of the second feature map; and performing offset regression by using the feature to obtain the offset of the key point.

6. An electronic device, comprising: one or more processors; and a storage apparatus storing one or more programs thereon, the one or more programs, when executed by the one or more processors, causing the one or more processors to perform operations comprising: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.

7. The electronic device according to claim 6, wherein the extracting the first feature map and the second feature map of the to-be-positioned image comprises: inputting a to-be-positioned feature map into a main network to output an initial feature map of the to-be-positioned image; and inputting the initial feature map into a first sub-network and a second sub-network respectively to output the first feature map and the second feature map, wherein the first sub-network and the second sub-network are two different branches of the main network.

8. The electronic device according to claim 6, wherein the determining, based on the first feature map, the initial position of the key point in the to-be-positioned image comprises: generating, based on the first feature map, a heat map of the key point in the to-be-positioned image; and determining, based on a heat value of a point on the heat map, the initial position of the key point.

9. The electronic device according to claim 8, wherein the generating, based on the first feature map, the heat map of the key point in the to-be-positioned image comprises: performing 1×1 convolution on the first feature map to obtain the heat map, wherein channels of the heat map correspond to key points one to one.

10. The electronic device according to claim 6, wherein the determining, based on the second feature map, the offset of the key point comprises: extracting, based on the initial position of the key point, a feature from a corresponding position of the second feature map; and performing offset regression by using the feature to obtain the offset of the key point.

11. A non-transitory computer readable medium, storing a computer program thereon, the computer program, when executed by a processor, causing the processor to perform operations comprising: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.

12. The non-transitory computer readable medium according to claim 11, wherein the extracting the first feature map and the second feature map of the to-be-positioned image comprises: inputting a to-be-positioned feature map into a main network to output an initial feature map of the to-be-positioned image; and inputting the initial feature map into a first sub-network and a second sub-network respectively to output the first feature map and the second feature map, wherein the first sub-network and the second sub-network are two different branches of the main network.

13. The non-transitory computer readable medium according to claim 11, wherein the determining, based on the first feature map, the initial position of the key point in the to-be-positioned image comprises: generating, based on the first feature map, a heat map of the key point in the to-be-positioned image; and determining, based on a heat value of a point on the heat map, the initial position of the key point.

14. The non-transitory computer readable medium according to claim 13, wherein the generating, based on the first feature map, the heat map of the key point in the to-be-positioned image comprises: performing 1×1 convolution on the first feature map to obtain the heat map, wherein channels of the heat map correspond to key points one to one.

15. The non-transitory computer readable medium according to claim 11, wherein the determining, based on the second feature map, the offset of the key point comprises: extracting, based on the initial position of the key point, a feature from a corresponding position of the second feature map; and performing offset regression by using the feature to obtain the offset of the key point.
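As a rough illustration of the dependent claims on heat-map generation and offset regression, the sketch below implements a 1×1 convolution as a per-pixel weighted sum over channels, picks the peak of each heat-map channel, and regresses an offset from the second feature map's feature at that position. The claims do not specify the form of the regressor, so the linear regressor here is an assumption. All names (`conv1x1`, `peak`, `regress_offset`), weights, and feature values are hypothetical toy inputs, and biases and the main network with its two branches are omitted.

```python
from typing import List, Tuple

FeatureMap = List[List[List[float]]]  # indexed [channel][y][x]


def conv1x1(feature_map: FeatureMap, weights: List[List[float]]) -> FeatureMap:
    """1x1 convolution: weights[k][c] maps input channel c to output channel k."""
    height, width = len(feature_map[0]), len(feature_map[0][0])
    return [
        [[sum(w * feature_map[c][y][x] for c, w in enumerate(kernel))
          for x in range(width)]
         for y in range(height)]
        for kernel in weights
    ]


def peak(channel: List[List[float]]) -> Tuple[int, int]:
    """Return (x, y) of the largest heat value in one heat-map channel."""
    coords = ((x, y) for y in range(len(channel)) for x in range(len(channel[0])))
    return max(coords, key=lambda p: channel[p[1]][p[0]])


def regress_offset(second_map: FeatureMap, pos: Tuple[int, int],
                   reg_weights: List[List[float]]) -> Tuple[float, float]:
    """Assumed linear regressor: read the feature vector at pos, map it to (dx, dy)."""
    x, y = pos
    feature = [channel[y][x] for channel in second_map]
    dx = sum(w * f for w, f in zip(reg_weights[0], feature))
    dy = sum(w * f for w, f in zip(reg_weights[1], feature))
    return dx, dy


# Toy first feature map: 2 channels, height 2, width 3.
first_map = [
    [[0.0, 1.0, 0.0], [0.0, 0.0, 0.0]],
    [[0.0, 0.0, 0.0], [0.0, 0.0, 2.0]],
]
# One 1x1 kernel per key point, so heat-map channels match key points one to one.
heat_weights = [[1.0, 0.0], [0.0, 1.0]]

# Toy second feature map with the same spatial size.
second_map = [
    [[0.0, 0.5, 0.0], [0.0, 0.0, -0.5]],
    [[0.0, 0.25, 0.0], [0.0, 0.0, 0.5]],
]
reg_weights = [[1.0, 0.0], [0.0, 1.0]]  # row 0 yields dx, row 1 yields dy

heat_maps = conv1x1(first_map, heat_weights)
initial = [peak(channel) for channel in heat_maps]
offsets = [regress_offset(second_map, pos, reg_weights) for pos in initial]
final = [(x + dx, y + dy) for (x, y), (dx, dy) in zip(initial, offsets)]
print(initial, final)
```

With these toy inputs the two key points start at (1, 0) and (2, 1) and end at (1.5, 0.25) and (1.5, 1.5) after the offsets are added.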

Assignees

Beijing Baidu Netcom Sci & Tech Co Ltd

Inventors

Classifications

G06V10/82
using neural networks · CPC title
G06V10/764
using classification, e.g. of video objects · CPC title
G06T7/73 (Primary)
using feature-based methods · CPC title
G06V10/462 (Primary)
Salient features, e.g. scale invariant feature transforms [SIFT] · CPC title
G06N3/045
Combinations of networks · CPC title

Patent family

Related publications grouped by family.

View patent family 72480804

External sources

Related patents

Compression of images having overlapping fields of view using machine-learned models

Method and apparatus for iteratively establishing object position

Information processing apparatus, method and non-transitory computer-readable storage medium

Method and system for unsupervised word image clustering
