Pedestrian re-identification method based on spatio-temporal joint model of residual attention mechanism and device thereof

US11468697B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11468697-B2
Application numberUS-202017121698-A
CountryUS
Kind codeB2
Filing dateDec 14, 2020
Priority dateDec 31, 2019
Publication dateOct 11, 2022
Grant dateOct 11, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The disclosure provides a pedestrian re-identification method based on a spatio-temporal joint model of a residual attention mechanism and a device thereof. The method includes: performing feature extraction for an input pedestrian with a pre-trained ResNet-50 model; constructing a residual attention mechanism network including a residual attention mechanism module, a feature sampling layer, a global average pooling layer and a local feature connection layer; calculating a feature distance by using a cosine distance and denoting the feature distance as a visual probability according to the trained residual attention mechanism network; performing modeling for a spatio-temporal probability according to camera ID and frame number information in a pedestrian tag of a training sample, and performing Laplace smoothing for a probability model; and calculating a final spatio-temporal joint probability by using the visual probability and the spatio-temporal probability to obtain a pedestrian re-identification result.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: a) performing feature extraction for an input pedestrian x with a ResNet-50 model obtained through pre-training to obtain a feature matrix denoted as f; b) constructing a residual attention mechanism network with a network structure comprising a residual attention mechanism module, a feature sampling layer, a global average pooling layer and a local feature connection layer; c) taking the feature matrix f with dimensions being H×W×C obtained in a) as an input of the residual attention mechanism network, and taking corresponding identity information y as a target output, wherein H, W, C refer to a length, a width and a channel number of a feature map, respectively; performing channel averaging for each spatial position of the feature matrix f as a spatial weight matrix according to the residual attention mechanism module; activating the spatial weight matrix by softmax to ensure that a convolution kernel learns different features, and calculating an attention mechanism map M SA to obtain a feature matrix F RSA with dimensions being H×W×C by F RSA =f*M SA +f; d) sampling the feature matrix F RSA with dimensions being H×W×C into local feature matrixes (F RSA 1 , F RSA 2 . . . , F RSA 6 ) with dimensions being H 6 × W × C by the feature sampling layer, and calculating local feature vectors (V RSA 1 , V RSA 2 . . . V RSA 6 ) by the global average pooling layer; e) connecting local features into a feature vector V RSA by the local feature connection layer, and calculating a cross entropy loss between the feature vector V RSA and the pedestrian identity y to obtain the trained residual attention mechanism network after training; f) obtaining feature vectors V RSA-α and V RSA-β corresponding to tested pedestrian images x α and x β respectively according to the trained residual attention mechanism network obtained in e), and calculating a feature distance based on a cosine distance and denoting the feature distance as a visual probability P V ; g) performing modeling for a spatio-temporal probability according to camera ID and frame number information in a pedestrian tag of a training sample, and calculating the spatio-temporal probability P ST according to the obtained spatio-temporal model; and h) calculating a final joint spatio-temporal probability using the visual probability P V obtained in f) and the spatio-temporal probability P ST obtained in g) to obtain a pedestrian re-identification result. 2. The method of claim 1 , wherein in c), the residual attention mechanism model is defined as follows: Q ⁡ ( i , j ) = ∑ t = 0 C ⁢ ⁢ f t ⁡ ( i , j ) C M SA ⁡ ( i , j ) = e Q ⁡ ( i , j ) Σ ⁡ ( i , j ) e Q ⁡ ( i , j ) F RSA t ⁡ ( i , j ) = f t ⁡ ( i , j ) ⁢ M SA ⁡ ( i , j ) + f t ⁡ ( i , j ) , wherein (i,j) refers to spatial position information, t refers to a channel serial number, f t (i,j) refers to a pixel point with the spatial position being (i,j) in a t-th channel of the feature matrix f, e refers to a base of a natural logarithm, and F RSA (i,j) refers to a pixel point with the spatial position being (i,j) in the feature matrix F RSA . 3. The method of claim 1 , wh

Assignees

Inventors

Classifications

  • G06V40/23Primary

    Recognition of whole body movements, e.g. for sport training · CPC title

  • Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames · CPC title

  • Surveillance or monitoring of activities, e.g. for recognising suspicious objects (recognising microscopic objects G06V20/69) · CPC title

  • using neural networks · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11468697B2 cover?
The disclosure provides a pedestrian re-identification method based on a spatio-temporal joint model of a residual attention mechanism and a device thereof. The method includes: performing feature extraction for an input pedestrian with a pre-trained ResNet-50 model; constructing a residual attention mechanism network including a residual attention mechanism module, a feature sampling layer, a …
Who is the assignee on this patent?
Univ Wuhan
What technology area does this patent fall under?
Primary CPC classification G06V40/23. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 11 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).