Compression of images having overlapping fields of view using machine-learned models
US-11019364-B2 · May 25, 2021 · US
US2021390731A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2021390731-A1 |
| Application number | US-202117201665-A |
| Country | US |
| Kind code | A1 |
| Filing date | Mar 15, 2021 |
| Priority date | Jun 12, 2020 |
| Publication date | Dec 16, 2021 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method and apparatus for positioning a key point, a device, and a storage medium are provided. The method may include: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.
Opening claim text (preview).
What is claimed is: 1 . A method for positioning a key point, comprising: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point. 2 . The method according to claim 1 , wherein the extracting the first feature map and the second feature map of the to-be-positioned image comprises: inputting a to-be-positioned feature map into a main network to output an initial feature map of the to-be-positioned image; and inputting the initial feature map into a first sub-network and a second sub-network respectively to output the first feature map and the second feature map, wherein the first sub-network and the second sub-network are two different branches of the main network. 3 . The method according to claim 1 , wherein the determining, based on the first feature map, the initial position of the key point in the to-be-positioned image comprises: generating, based on the first feature map, a heat map of the key point in the to-be-positioned image; and determining, based on a heat value of a point on the heat map, the initial position of the key point. 4 . The method according to claim 3 , wherein the generating, based on the first feature map, the heat map of the key point in the to-be-positioned image comprises: performing 1×1 convolution on the first feature map to obtain the heat map, wherein channels of the heat map correspond to key points one to one. 5 . The method according to claim 1 , wherein the determining, based on the second feature map, the offset of the key point comprises: extracting, based on the initial position of the key point, a feature from a corresponding position of the second feature map; and performing offset regression by using the feature to obtain the offset of the key point. 6 . An electronic device, comprising: one or more processors; and a storage apparatus storing one or more programs thereon, the one or more programs, when executed by the one or more processors, causing the one or more processors to perform operations comprising: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point. 7 . The electronic device according to claim 6 , wherein the extracting the first feature map and the second feature map of the to-be-positioned image comprises: inputting a to-be-positioned feature map into a main network to output an initial feature map of the to-be-positioned image; and inputting the initial feature map into a first sub-network and a second sub-network respectively to output the first feature map and the second feature map, wherein the first sub-network and the second sub-network are two different branches of the main network. 8 . The electronic device according to claim 6 , wherein the determining, based on the first feature map, the initial position of the key point in the to-be-positioned image comprises: generating, based on the first feature map, a heat map of the key point in the to-be-positioned image; and determining, based on a heat value of a point on the heat map, the initial position of the key point. 9 . The electronic device according to claim 8 , wherein the generating, based on the first feature map, the heat map of the key point in the to-be-positioned image comprises: performing 1×1 convolution on the first feature map to obtain the heat map, wherein channels of the heat map correspond to key points one to one. 10 . The electronic device according to claim 6 , wherein the determining, based on the second feature map, the offset of the key point comprises: extracting, based on the initial position of the key point, a feature from a corresponding position of the second feature map; and performing offset regression by using the feature to obtain the offset of the key point. 11 . A non-transitory computer readable medium, storing a computer program thereon, the computer program, when executed by a processor, causing the processor to perform operations comprising: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point. 12 . The non-transitory computer readable medium according to claim 11 , wherein the extracting the first feature map and the second feature map of the to-be-positioned image comprises: inputting a to-be-positioned feature map into a main network to output an initial feature map of the to-be-positioned image; and inputting the initial feature map into a first sub-network and a second sub-network respectively to output the first feature map and the second feature map, wherein the first sub-network and the second sub-network are two different branches of the main network. 13 . The non-transitory computer readable medium according to claim 11 , wherein the determining, based on the first feature map, the initial position of the key point in the to-be-positioned image comprises: generating, based on the first feature map, a heat map of the key point in the to-be-positioned image; and determining, based on a heat value of a point on the heat map, the initial position of the key point. 14 . The non-transitory computer readable medium according to claim 13 , wherein the generating, based on the first feature map, the heat map of the key point in the to-be-positioned image comprises: performing 1×1 convolution on the first feature map to obtain the heat map, wherein channels of the heat map correspond to key points one to one. 15 . The non-transitory computer readable medium according to claim 11 , wherein the determining, based on the second feature map, the offset of the key point comprises: extracting, based on the initial position of the key point, a feature from a corresponding position of the second feature map; and performing offset regression by using the feature to obtain the offset of the key point.
using neural networks · CPC title
using classification, e.g. of video objects · CPC title
using feature-based methods · CPC title
Salient features, e.g. scale invariant feature transforms [SIFT] · CPC title
Combinations of networks · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.