Object recognition method and device, and storage medium
US-2020364863-A1 · Nov 19, 2020 · US
US2022180649A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2022180649-A1 |
| Application number | US-201917438348-A |
| Country | US |
| Kind code | A1 |
| Filing date | Jul 31, 2019 |
| Priority date | Jul 31, 2019 |
| Publication date | Jun 9, 2022 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method is described herein. The method includes the designation of a player as a profile player or a non-profile player in each camera view. In response to the player being a non-profile player, the method includes extracting features from the detected player within the bounding box and classifying the features according to a label. In response to the player being a non-profile player, the method also includes selecting a label with a highest number of votes according to a voting policy as a final label.
Opening claim text (preview).
What is claimed is: 1 . A method, comprising: detecting a player in a camera view captured by a camera; determining a player location of the player in each camera view, wherein the player location is defined by a bounding box; classifying the player as a profile player or a non-profile player based on a visibility of an identifier; in response to the player being a non-profile player: extracting features from the detected player within the bounding box; classifying a plurality of labels according to the extracted features; and selecting a label from the plurality of labels with a highest number of votes according to a voting policy as a final label. 2 . The method of claim 1 , comprising applying hard non-maximum suppression to the extracted features to obtain bounding boxes with the plurality of labels to be classified. 3 . The method of claim 1 , wherein the identifier is a jersey number worn by the player during game play. 4 . The method of claim 1 , wherein the classification of the player as a profile player or a non-profile player indicates the orientation of the player with respect to an image plane of the camera. 5 . The method of claim 1 , wherein the identifier of a non-profile player is substantially visible wherein the camera view of the identifier is used to derive the entire identifier. 6 . The method of claim 1 , wherein the identifier of each profile player is not substantially visible, wherein the camera view of the identifier cannot be used to derive the entire identifier. 7 . The method of claim 1 , wherein in response to the player being classified as a profile player, not using the camera view for jersey number recognition. 8 . The method of claim 1 , wherein in preparation for processing the extracted features by a convolutional neural network (CNN), the bounding box for the player is padded to correspond to an input size of the CNN. 9 . The method of claim 1 , wherein extracting features from the detected player within the bounding box precisely locates a candidate identifier. 10 . The method of claim 1 , wherein extracting features from the detected player within the bounding box extracts high-resolution low-level features and higher-level semantic low-resolution features. 11 . A system, comprising: a processor to: detect a player in a camera view captured by a camera; determine a player location of the player in each camera view, wherein the player location is defined by a bounding box; classify the player as a profile player or a non-profile player based on a visibility of an identifier; and in response to the player being a non-profile player: extract features from the detected player within the bounding box; classify the features according to a label; and select a label with a highest number of votes according to a voting policy as a final label. 12 . The system of claim 11 , wherein the identifier is a jersey number worn by the player during game play. 13 . The system of claim 11 , wherein the classification of the player as a profile player or a non-profile player indicates the orientation of the player with respect to an image plane of the camera. 14 . The system of claim 11 , wherein the identifier of a non-profile player is substantially visible wherein the camera view of the identifier is used to derive the entire identifier. 15 . The system of claim 11 , wherein the identifier of each profile player is not substantially visible, wherein the camera view of the identifier cannot be used to derive the entire identifier. 16 . The system of claim 11 , wherein in response to the player being classified as a profile player, not using the camera view for jersey number recognition. 17 . The system of claim 11 , wherein in preparation for processing the extracted features by a convolutional neural network (CNN), the bounding box for the player is padded to correspond to an input size of the CNN. 18 . The system of claim 11 , wherein extracting features from the detected player within the bounding box precisely locates a candidate identifier. 19 . The system of claim 11 , wherein extracting features from the detected player within the bounding box extracts high-resolution low-level features and higher-level semantic low-resolution features. 20 . The system of claim 11 , wherein hard non-maximum suppression is applied to the extracted features. 21 . At least one non-transitory computer-readable medium, comprising instructions to direct a processor to: detect a player in a camera view captured by a camera; determine a player location of the player in each camera view, wherein the player location is defined by a bounding box; classify the player as a profile player or a non-profile player based on a visibility of an identifier; in response to the player being a non-profile player: extract features from the detected player within the bounding box; classify a plurality of labels according to the extracted features; and select a label from the plurality of labels with a highest number of votes according to a voting policy as a final label. 22 . The computer readable medium of claim 21 , comprising applying hard non-maximum suppression to the extracted features to obtain bounding boxes with the plurality of labels to be classified. 23 . The computer readable medium of claim 21 , wherein the identifier is a jersey number worn by the player during game play. 24 . The computer readable medium of claim 21 , wherein the classification of the player as a profile player or a non-profile player indicates the orientation of the player with respect to an image plane of the camera. 25 . The computer readable medium of claim 21 , wherein the identifier of a non-profile player is substantially visible wherein the camera view of the identifier is used to derive the entire identifier.
related to virtual studio applications · CPC title
Scene text, e.g. street names · CPC title
of sport video content · CPC title
Determination of region of interest · CPC title
Character recognition · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.