Selecting a video frame for notification using audio/video recording and communication devices

US11069210B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11069210-B2
Application number  US-201816019909-A
Country             US
Kind code           B2
Filing date         Jun 27, 2018
Priority date       Jun 28, 2017
Publication date    Jul 20, 2021
Grant date          Jul 20, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Some embodiments provide for obtaining image data representative of a field of view of a camera as captured by the camera of an A/V recording and communication device. The image data may be analyzed and, based at least in part on the analysis, it may be determined that the image data is representative of a first facial image of a person and a second facial image of the person. From the facial images, it may be determined that the first facial image is of higher quality than the second facial image and, based on this determination, a frame may be selected that is represented by the image data and corresponds to the first facial image. A notification may be generated that includes a portion of the image data that represents the frame, and the notification may be transmitted to a client device associated with the A/V recording and communication device.
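
The selection step described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the `FaceDetection` record, the numeric `quality` score, the `build_notification` payload, and the `doorbell-1` device identifier are all hypothetical names introduced here. The abstract does not say how quality is scored or how a notification is encoded.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FaceDetection:
    """Hypothetical record: one facial image found in one video frame."""
    frame_index: int
    quality: float  # assumed score; higher means a more usable facial image


def select_best_frame(detections: List[FaceDetection]) -> Optional[int]:
    """Return the index of the frame whose facial image scores highest.

    Returns None when the image data contains no facial image.
    """
    if not detections:
        return None
    best = max(detections, key=lambda d: d.quality)
    return best.frame_index


def build_notification(frame_index: int, device_id: str) -> dict:
    """Assemble a hypothetical notification payload for the client device."""
    return {"device_id": device_id, "frame_index": frame_index}


# Two facial images of the same person; the second is of higher quality.
detections = [
    FaceDetection(frame_index=3, quality=0.42),
    FaceDetection(frame_index=7, quality=0.91),
]
best = select_best_frame(detections)
notification = build_notification(best, device_id="doorbell-1")
```

Here frame 7 is selected because its facial image carries the higher assumed score, so the notification payload references that frame rather than the first frame captured.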

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: based at least in part on detection of an object by an A/V recording and communication device, obtaining image data representative of a field of view of a camera as captured by the camera of the A/V recording and communication device; analyzing the image data; based at least in part on the analyzing the image data, determining that the image data is representative of a first facial image of a person and a second facial image of the person; determining that the first facial image is of higher quality than the second facial image; based at least in part on the first facial image being of higher quality than the second facial image, selecting a frame represented by the image data and corresponding to the first facial image; generating a first notification including a portion of the image data representing the frame; transmitting the first notification to a client device associated with the A/V recording and communication device; generating a second notification including an updated frame based on a status of the client device; and transmitting the second notification including the updated frame to the client device.

2. The method of claim 1, wherein at least one of the obtaining the image data, the analyzing the image data, the determining that the first facial image is of higher quality than the second facial image, the selecting the frame, the generating the notification, or the transmitting the notification is performed by one or more processors of the A/V recording and communication device.

3. The method of claim 1, wherein at least one of the obtaining the image data, the analyzing the image data, the determining that the first facial image is of higher quality than the second facial image, the selecting the frame, the generating the notification, or the transmitting the notification is performed by one or more processors of one or more backend devices.

4. The method of claim 3, wherein the backend device is at least one of a server, an application programming interface, or a storage device.

5. The method of claim 1, wherein the first notification is a push-notification including a visual representation of the first facial image and the frame, and wherein the push-notification is programmed such that when an input is received to the push-notification to a display of the client device, a visual representation of at least one other frame represented by the image data is displayed on the display.

6. The method of claim 5, wherein the image data is representative of video of the field of view of the camera.

7. The method of claim 1, wherein the detection of the object is by at least one of the camera or a motion sensor of the A/V recording and communication device.

8. The method of claim 1, wherein the determining that the image data is representative of the first facial image of the person and the second facial image of the person comprises: determining that the object is the person; identifying a face of the person represented by the image data; and identifying at least the frame that includes the first facial image of the person and another frame that includes the second facial image of the person.

9. The method of claim 1, wherein the determining that the first facial image is of higher quality than the second facial image comprises: determining a first portion of the face of the person in the first facial image; determining a second portion of the face of the person in the second facial image; and determining that the first portion of the face is more identifiable than the second portion of the face based on the first portion of the face being positioned closer to the camera than the second portion of the face.

10. The method of claim 1, wherein the determining that the first facial image is of higher quality than the second facial image comprises: determining a first image quality of the first facial image; determining a second image quality of the second facial image; and determining that the first image quality is greater than the second image quality based on a determination that a first resolution of the first facial image is greater than a second resolution of a second facial image.

11. The method of claim 1, wherein the image data is captured during a first time and during a motion event, the method further comprising: obtaining additional image data at a second time after the first time and during the motion event, the additional image data representative of the field of view of the camera as captured by the camera of the A/V recording and communication device; analyzing the additional image data; based at least in part on the analyzing the additional image data, determining that the additional image data is representative of a third facial image of the person; determining that the third facial image is of higher quality than the first facial image; based at least in part on the third facial image being of higher quality than the first facial image, selecting an additional frame represented by the additional image data and corresponding to the third facial image; generating an additional notification including a portion of the additional image data representing the additional frame; and transmitting the additional notification to the client device associated with the A/V recording and communication device, the additional notification is an additional push-notification including another visual representation of at least a portion of the additional frame including the third facial image.

12. An audio/video device (A/V device) comprising: a camera configured to capture image data representative of a field of view of the camera, the image data comprising a plurality of frames; a processor; a memory storing computer-readable instructions that, when executed by the processor, cause the A/V device to perform operations comprising: receiving the image data; analyzing the image data; based at least in part on the analyzing the image data, determining that a first frame by of the image data is representative of a first facial image of a person and that a second frame of the image data is representative of a second facial image of the person; determining that the first facial image is of higher quality than the second facial image; based at least in part on the first facial image being of higher quality than the second facial image, selecting the first frame for inclusion in a first image notification; and generating the first image notification including the first frame; transmitting, to a client device associated with the A/V device, the first image notification; and generating a second notification including an updated frame based on an input to the client device; and transmitting, to the client device, the second notification including the updated frame.

13. The A/V device of claim 12, wherein at least one of the receiving the image data, the analyzing the image data, the determining that the first frame of the image data is representative of the first facial image of the person, the generating the image notification, or the transmitting the image notification is performed by one or more processors of the A/V device.

14. The A/V device of claim 12, wherein at least one of the receiving the image data, the analyzing the image data, the determining that the first frame of the image data is representative of the first facial image of the person, the generating the image notification, or the transmitting the image notification is performed by one or more processors of one or more backend devices.

15. The A/V devi
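
Claims 10 and 11 together describe comparing facial images by resolution and sending a further notification when a later frame in the same motion event shows a better facial image. The sketch below illustrates that idea as a single streaming pass, which is a simplification and not the claimed method itself. The `FaceCrop` tuple layout, the use of pixel count as the resolution measure, and the function names are assumptions made for this example.

```python
from typing import Iterable, List, Tuple

# Hypothetical per-frame face crop: (frame_index, width_px, height_px)
FaceCrop = Tuple[int, int, int]


def face_resolution(crop: FaceCrop) -> int:
    """Pixel count of the face crop, used here as the assumed quality measure."""
    _, width, height = crop
    return width * height


def notifications_for_event(crops: Iterable[FaceCrop]) -> List[int]:
    """Return the frame indices that would each trigger a notification.

    The first facial image triggers a notification. A later frame triggers
    an additional notification only if its face crop has strictly higher
    resolution than the best one sent so far.
    """
    sent: List[int] = []
    best_resolution = -1
    for crop in crops:
        resolution = face_resolution(crop)
        if resolution > best_resolution:
            best_resolution = resolution
            sent.append(crop[0])
    return sent


# One motion event: the person approaches, so the last crop is the largest.
event = [(0, 40, 50), (5, 30, 40), (9, 80, 100)]
print(notifications_for_event(event))  # prints [0, 9]
```

Frame 5 is skipped because its crop is smaller than the one already sent, while frame 9 replaces frame 0 as the best available facial image.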

Assignees

Amazon Tech Inc

Inventors

Classifications

  • Evaluation of the quality of the acquired pattern · CPC title

  • Push-based network services · CPC title

  • Classification, e.g. identification · CPC title

  • Surveillance or monitoring of activities, e.g. for recognising suspicious objects (recognising microscopic objects G06V20/69) · CPC title

  • Event detection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11069210B2 cover?
Some embodiments provide for obtaining image data representative of a field of view of a camera as captured by the camera of an A/V recording and communication device. The image data may be analyzed and, based at least in part on the analysis, it may be determined that the image data is representative of a first facial image of a person and a second facial image of the person. From the facial i…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G08B13/19691. Mapped technology areas include Physics.
When was this patent published?
Publication date Jul 20, 2021 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).