System and method for video content analysis using depth sensing

US9247211B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9247211-B2
Application numberUS-201313744254-A
CountryUS
Kind codeB2
Filing dateJan 17, 2013
Priority dateJan 17, 2012
Publication dateJan 26, 2016
Grant dateJan 26, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and system for performing video content analysis based on two-dimensional image data and depth data are disclosed. Video content analysis may be performed on the two-dimensional image data, and then the depth data may be used along with the results of the video content analysis of the two-dimensional data for tracking and event detection.

First claim

Opening claim text (preview).

What is claimed is: 1. A video content analysis method comprising: receiving a video sequence that includes a plurality of frames, each frame including a video image; for each frame, receiving two-dimensional (2D) image data and also receiving depth data; processing the 2D image data of the video sequence to differentiate foreground data from background data and to detect one or more blobs comprised of the foreground data, the one or more blobs corresponding to one or more objects, wherein differentiating the foreground data from the background data is performed without analyzing the depth data; for each detected blob, using the depth data to determine whether at least part of the blob corresponds to at least part of a target by at least (1) mapping the blob to a set of Z-planes; (2) determining that on at least some Z-planes the blob is clustered into different blob regions corresponding to two objects separated in space; and (3) grouping the separated blob regions of the Z-planes into two physical objects by checking their spatial overlaps, wherein one of the physical objects corresponds to the target; and after it is determined that at least part of a blob corresponds to at least part of a target, tracking the target and detecting at least one event associated with the target. 2. The method of claim 1 , wherein using the depth data to determine whether at least part of each blob corresponds to at least part of a target includes: using the depth data to determine that only part of a first blob corresponds to a first target. 3. The method of claim 2 , wherein using the depth data to determine whether at least part of each blob corresponds to at least part of a target includes: using the depth data to determine that part of the first blob does not correspond to the first target. 4. The method of claim 2 , wherein using the depth data to determine whether at least part of each blob corresponds to at least part of a target includes: using the depth data to determine that part of the first blob corresponds to a second target different from the first target. 5. The method of claim 4 , wherein a first part of the first blob corresponds to the first target and a second part of the first blob corresponds to a second target, one of the first and second target occluding at least part of the other. 6. The method of claim 2 , wherein using the depth data to determine whether at least part of each blob corresponds to at least part of a target includes: using the depth data to determine that a second blob combined with part or all of the first blob correspond to a second target. 7. The method of claim 1 , wherein the 2D image data includes RGB data for each pixel in the video image. 8. The method of claim 7 , wherein only pixels of foreground data are projected onto the set of Z-planes. 9. The method of claim 1 , wherein determining whether at least part of each blob corresponds to at least part of a target is performed without analyzing depth data associated with the background data. 10. The method of claim 1 , wherein using the depth data to determine whether at least part of each blob corresponds to at least part of a target comprises: using the depth data to determine one or more of a height and a volume of the blob; and using one or more of the height and the volume of the blob to determine whether at least part of the blob corresponds to a target. 11. The method of claim 10 , wherein determining whether the blob corresponds to a target includes determining whether the blob is a person. 12. A video content analysis method comprising: receiving a video sequence that includes a plurality of frames, each frame including a video image; for each frame, receiving two-dimensional (2D) image data and also receiving depth data; processing the 2D image data of the video sequence to differentiate foreground data from background data and to detect one or more blobs comprised of the foreground data, the one or more blobs corresponding to one or more objects, wherein differentiating the foreground data from the background data is performed without analyzing the depth data; for each detected blob, using the depth data to determine whether to track at least a first part of the blob as a target; and after it is determined to track the target, detecting at least one event associated with the target, wherein determining whether to track at least the first part of the blob as a target includes: mapping the blob to a set of Z-planes; determining that on at least some Z-planes the blob is clustered into different blob regions corresponding to the first part of the blob and a second part of the blob separated in space; and grouping blob slices corresponding to the Z-planes from the first part of the blob to correspond to a physical object by checking their spatial overlaps, wherein the physical object corresponds to the target. 13. The method of claim 12 , wherein using the depth data to determine whether to track at least part of the blob as a target includes: determining that part of a first blob corresponds to a first target. 14. The method of claim 13 , wherein using the depth data to determine whether to track at least part of the blob as a target includes: determining that part of the first blob does not correspond to the first target. 15. The method of claim 13 , wherein using the depth data to determine whether to track at least part of the blob as a target includes: determining that a part of the first blob corresponds to a second target different from the first target. 16. The method of claim 15 , wherein determining that part of the first blob corresponds to the second target includes determining that the first blob corresponds to a first person and the second blob corresponds to a second person, one of the first and second person occluding at least part of the other. 17. The method of claim 13 , wherein using the depth data to determine whether to track at least part of the blob as a target includes: determining that a second blob combined with part or all of the first blob corresponds to a second target. 18. The method of claim 12 , wherein the 2D image data includes RGB data for each pixel in the video image. 19. The method of claim 18 , wherein only pixels of foreground data are projected onto the set of Z-planes. 20. The method of claim 19 , wherein determining whether to track at least part of the blob as a target is performed without analyzing depth data associated with the background data. 21. The method of claim 12 , wherein using the depth data to determine whether to track at least part of the blob as a target includes: using the depth data to determine one or more of a height and a volume of the blob; and using one or more of the height and volume of the blob to determine whether the blob corresponds to a target. 22. The method of claim 21 , wherein determining whether the blob corresponds to a target includes determining whether the blob is a person.

Assignees

Inventors

Classifications

  • A61B5/1072Primary

    measuring distances on the body, e.g. measuring length, height or thickness (A61B5/1076 takes precedence) · CPC title

  • Devices for viewing the surface of the body, e.g. camera, magnifying lens · CPC title

  • using optical or photographic means · CPC title

  • Local tracking of patients, e.g. in a hospital or private home · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9247211B2 cover?
A method and system for performing video content analysis based on two-dimensional image data and depth data are disclosed. Video content analysis may be performed on the two-dimensional image data, and then the depth data may be used along with the results of the video content analysis of the two-dimensional data for tracking and event detection.
Who is the assignee on this patent?
Avigilon Fortress Corp
What technology area does this patent fall under?
Primary CPC classification A61B5/1072. Mapped technology areas include Human Necessities.
When was this patent published?
Publication date Tue Jan 26 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).