Scaled perspective zoom on resource constrained devices

US11418704B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11418704-B2
Application numberUS-202016933314-A
CountryUS
Kind codeB2
Filing dateJul 20, 2020
Priority dateJun 15, 2017
Publication dateAug 16, 2022
Grant dateAug 16, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A dolly zoom effect can be applied to one or more images captured via a resource-constrained device (e.g., a mobile smartphone) by manipulating the size of a target feature while the background in the one or more images changes due to physical movement of the resource-constrained device. The target feature can be detected using facial recognition or shape detection techniques. The target feature can be resized before the size is manipulated as the background changes (e.g., changes perspective).

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: generating an image using an image sensor of a user device, generating, using a convolutional neural network, image feature areas for different image features in the image; identifying, on the user device, a target image feature from one of the image feature areas, the target image feature corresponding to a physical object depicted in the image; resizing the target image feature; generating a zoom video sequence by maintaining the scaling of the resized target image feature in the zoom video sequence without maintaining the scaling of image features areas that are not the target image feature such that the resized target image feature covers the depiction of the physical object in the zoom video sequence as the depiction of the physical object diminishes in size due to the user device moving closer to or away from the physical object; and storing the zoom video sequence on the user device. 2. The method of claim 1 , further comprising: receiving, through a touchscreen of the user device, an instruction to change the target image feature from an initial size to the size that is manipulated by increased scaling in the zoom video sequence. 3. The method of claim 1 , wherein the target image feature is identified in response to a user input received through a touchscreen of the user device. 4. The method of claim 1 , wherein generating the target image feature comprises applying, by the user device, the convolutional neural network to the image to identify the target image feature. 5. The method of claim 4 , wherein the target image feature is a human face. 6. The method of claim 4 , wherein the target image feature includes one or more segments of a human body. 7. The method of claim 4 , wherein the target image feature is a shape in the one or more images. 8. The method of claim 7 , further comprising: receiving selection of the shape through an input made through a touchscreen of the user device. 9. The method of claim 1 , further comprising: stabilizing the target image feature in the zoom video sequence such that the target image feature remains in an initial area in the zoom video sequence. 10. The method of claim 1 , further comprising: stabilizing the target image feature in the zoom video sequence such that the target image feature remains in an initial area in the zoom video sequence as the user device moves away from the physical object. 11. The method of claim 1 , wherein the image is from a live video feed generated by the image sensor of the user device. 12. The method of claim 1 , further comprising: transmitting the zoom video sequence to a network server. 13. A user device comprising: one or more processors; an image sensor; and a memory storing instructions that, when executed by the one or more processors, cause the user device to perform operations comprising: generating an image using the image sensor; generating, using a convolutional neural network, image feature areas for different image features in the image; identifying, on the user device, a target image feature from one of the image feature areas, the target image feature corresponding to a physical object depicted in the image; resizing the target image feature; generating a zoom video sequence by maintaining the scaling of the resized target image feature in the zoom video sequence without maintaining the scaling of image features areas that are not the target image feature such that the resized target image feature covers the depiction of the physical object in the zoom video sequence as the depiction of the physical object diminishes in size due to the user device moving closer to or away from the physical object; and storing the zoom video sequence. 14. The user device of claim 13 , the operations further comprising: receiving, through a touchscreen of the user device, an instruction to change the target image feature from an initial size to the size that is manipulated by increased scaling in the zoom video sequence. 15. The user device of claim 13 , wherein the target image feature is identified in response to a user input received through a touchscreen of the user device. 16. The user device of claim 13 , wherein generating the target image feature comprises applying, by the user device, the convolutional neural network to the image to identify the target image feature. 17. The user device of claim 16 , wherein the target image feature is a human face. 18. The user device of claim 16 , wherein the target image feature includes one or more segments of a human body. 19. The user device of claim 16 , wherein the target image feature is a shape in the one or more images. 20. A non-transitory machine-readable storage device embodying instructions that, when executed by a machine, cause the machine to perform operations comprising: generating an image using the image sensor; generating, using a convolutional neural network, image feature areas for different image features in the image; identifying, on the user device, a target image feature from one of the image feature areas, the target image feature corresponding to a physical object depicted in the image; resizing the target image feature; generating a zoom video sequence by maintaining the scaling of the resized target image feature in the zoom video sequence without maintaining the scaling of image features areas that are not the target image feature such that the resized target image feature covers the depiction of the physical object in the zoom video sequence as the depiction of the physical object diminishes in size due to the user device moving closer to or away from the physical object; and storing the zoom video sequence.

Assignees

Inventors

Classifications

  • H04N23/62Primary

    Control of parameters via user interfaces · CPC title

  • performed by a processor, e.g. controlling the readout of an image memory · CPC title

  • where the recognised objects include parts of the human body · CPC title

  • by using electronic viewfinders · CPC title

  • Control of means for changing angle of the field of view, e.g. optical zoom objectives or electronic zooming · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11418704B2 cover?
A dolly zoom effect can be applied to one or more images captured via a resource-constrained device (e.g., a mobile smartphone) by manipulating the size of a target feature while the background in the one or more images changes due to physical movement of the resource-constrained device. The target feature can be detected using facial recognition or shape detection techniques. The target featur…
Who is the assignee on this patent?
Snap Inc
What technology area does this patent fall under?
Primary CPC classification H04N23/62. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Aug 16 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).