Method and apparatus for generating target re-recognition model and re-recognizing target

US11915484B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11915484-B2
Application numberUS-202117304296-A
CountryUS
Kind codeB2
Filing dateJun 17, 2021
Priority dateOct 23, 2020
Publication dateFeb 27, 2024
Grant dateFeb 27, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, an apparatus, device and a storage medium for generating a target re-recognition model are provided. The method may include: acquiring a set of labeled samples, a set of unlabeled samples and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of the unlabeled samples by using the initialization model; clustering features extracted from the set of the unlabeled samples by using a clustering algorithm; assigning, for each sample in the set of the unlabeled samples, a pseudo label to the sample according to a cluster corresponding to the sample in a feature space; and mixing a set of samples with a pseudo label and the set of the labeled samples as a set of training samples, and performing supervised training on the initialization model to obtain a target re-recognition model.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for generating a target re-recognition model, the method comprising: acquiring a set of labeled samples, a set of unlabeled samples and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of the unlabeled samples by using the initialization model; clustering features extracted from the set of the unlabeled samples by using a clustering algorithm; assigning, for each sample in the set of the unlabeled samples, a pseudo label to the each sample according to a cluster corresponding to the each sample in a feature space; and mixing a set of samples with a pseudo label and the set of the labeled samples as a set of training samples, and performing the supervised training on the initialization model to obtain a target re-recognition model, wherein mixing the set of samples with the pseudo label and the set of the labeled samples as the set of training samples, and performing the supervised training on the initialization model to obtain the target re-recognition model, comprises: performing feature extraction on each sample in the set of the labeled samples by using the initialization model; fusing, by using a graph convolutional neural network, a feature of each sample in the set of the samples with the pseudo label and a feature of each sample in the set of the labeled samples to obtain a fused feature of each sample; and training the initialization model based on the fused feature of each sample in the set of the samples with the pseudo label and the fused feature of each sample in the set of the labeled samples to obtain the target re-recognition model. 2. The method according to claim 1 , wherein the graph convolutional neural network comprises a first graph convolutional layer and a second graph convolutional layer, and wherein: the first graph convolutional layer comprises at least one sample node representing a sample and at least one proxy node representing a set of samples, and sample nodes belonging to a given set of samples are unidirectionally connected to a given proxy node, and proxy nodes are interconnected, and each proxy node performs a weighted sum on sample features of sample nodes connected to the each proxy node to obtain a proxy feature of the each proxy node, and proxy features of all proxy nodes are fused through the first graph convolutional layer to obtain output features of the proxy nodes of the first graph convolutional layer; and the second graph convolutional layer comprises at least one sample node representing a sample and at least one proxy node representing a set of samples, and sample nodes belonging to a given set of samples are bidirectionally connected to a given proxy node, and proxy nodes are interconnected, and the output features of the proxy nodes of the first graph convolutional layer are fused through the second graph convolutional layer to obtain an output feature of each sample node. 3. The method according to claim 2 , wherein the graph convolutional neural network comprises the first graph convolutional layer and at least one second graph convolutional layer, wherein an output of the first graph convolutional layer is used as an input of the second graph convolutional layer, and an output feature of each second graph convolutional layer is used as an input of a next second graph convolutional layer or an input of a classification layer of the initialization model. 4. The method according to claim 1 , wherein the method uses at least one set of labeled samples, and each of the at least one set of labeled samples is from one data source. 5. The method according to claim 1 , further comprising: acquiring a to-be-recognized monitoring image; and inputting the monitoring image into the target re-recognition model to generate a target re-recognition result. 6. An electronic device, comprising: at least one processor; and a memory storing instructions executable by the at least one processor, the instructions, when executed by the at least one processor, cause the at least one processor to perform operations comprising: acquiring a set of labeled samples, a set of unlabeled samples and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of the unlabeled samples by using the initialization model; clustering features extracted from the set of the unlabeled samples by using a clustering algorithm; assigning, for each sample in the set of the unlabeled samples, a pseudo label to the each sample according to a cluster corresponding to the each sample in a feature space; and mixing a set of samples with a pseudo label and the set of the labeled samples as a set of training samples, and performing the supervised training on the initialization model to obtain a target re-recognition model, wherein mixing the set of samples with the pseudo label and the set of the labeled samples as the set of training samples, and performing the supervised training on the initialization model to obtain the target re-recognition model, comprises: performing feature extraction on each sample in the set of the labeled samples by using the initialization model; fusing, by using a graph convolutional neural network, a feature of each sample in the set of the samples with the pseudo label and a feature of each sample in the set of the labeled samples to obtain a fused feature of each sample; and training the initialization model based on the fused feature of each sample in the set of the samples with the pseudo label and the fused feature of each sample in the set of the labeled samples to obtain the target re-recognition model. 7. The electronic device according to claim 6 , wherein the graph convolutional neural network comprises a first graph convolutional layer and a second graph convolutional layer, and wherein: the first graph convolutional layer comprises at least one sample node representing a sample and at least one proxy node representing a set of samples, and sample nodes belonging to a given set of samples are unidirectionally connected to a given proxy node, and proxy nodes are interconnected, and each proxy node performs a weighted sum on sample features of sample nodes connected to the each proxy node to obtain a proxy feature of the each proxy node, and proxy features of all proxy nodes are fused through the first graph convolutional layer to obtain output features of the proxy nodes of the first graph convolutional layer; and the second graph convolutional layer comprises at least one sample node representing a sample and at least one proxy node representing a set of samples, and sample nodes belonging to a given set of samples are bidirectionally connected to a given proxy node, and proxy nodes are interconnected, and the output features of the proxy nodes of the first graph convolutional layer are fused through the second graph convolutional layer to obtain an output feature of each sample node. 8. The electronic device according to claim 7 , wherein the graph convolutional neural network comprises the first graph convolutional layer and at least one second graph convolutional layer, wherein an output of the first graph convolutional layer is used as an input of the second graph convolutional layer, and an output feature of each second graph convolutional layer is used as an input of a next second graph convolutional layer or an input of a classification layer of the initialization model. 9. The electronic device according to claim 6 , wherein the operations use at least one set of labeled samples, and each of the at least one set of labeled samples is from one data source. 10. The electronic device according to

Assignees

Inventors

Classifications

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06V20/52Primary

    Surveillance or monitoring of activities, e.g. for recognising suspicious objects (recognising microscopic objects G06V20/69) · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11915484B2 cover?
A method, an apparatus, device and a storage medium for generating a target re-recognition model are provided. The method may include: acquiring a set of labeled samples, a set of unlabeled samples and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of the unlabeled samples by using the initialization model; clustering featur…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06V20/52. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 27 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).