Method for determining correct scanning distance using augmented reality and machine learning models

US12008724B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12008724-B2
Application number: US-202318101982-A
Country: US
Kind code: B2
Filing date: Jan 26, 2023
Priority date: Oct 23, 2018
Publication date: Jun 11, 2024
Grant date: Jun 11, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A smart device is provided with an application program for displaying a video feed received from the smart device's camera. The application can determine the coordinates for an intersection point, which is a point on the ground where the smart device is pointing at. The application can display a target on the visual representation of the intersection point. Based on whether the smart device is at an appropriate distance from the intersection point, the user interface can superimpose an indicator on the video feed received from the camera. This can inform the user whether the smart device is at an optimal scan distance from the intersection point (or an object) so that the object can be identified by a machine learning model.
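The flow the abstract describes (find where the camera is pointing on the ground, measure the distance, show an indicator) can be sketched as below. This is only an illustrative sketch, not the patented implementation: it assumes a y-up world coordinate system with the ground plane at y = 0, a camera position and viewing direction supplied by some AR framework, and hypothetical `min_dist` / `max_dist` bounds standing in for the optimal scan distance. The function names and the indicator strings are invented for the example.

```python
import math

def ground_intersection(camera_pos, forward, ground_y=0.0):
    """Return the point where the viewing ray meets the plane y = ground_y, or None.

    camera_pos and forward are (x, y, z) tuples; the y-up convention is an assumption.
    """
    px, py, pz = camera_pos
    dx, dy, dz = forward
    if abs(dy) < 1e-9:
        return None  # ray is parallel to the ground plane
    t = (ground_y - py) / dy
    if t <= 0:
        return None  # the plane lies behind the camera along this ray
    return (px + t * dx, ground_y, pz + t * dz)

def scan_indicator(camera_pos, forward, min_dist, max_dist):
    """Return a hypothetical indicator label based on distance to the intersection point."""
    point = ground_intersection(camera_pos, forward)
    if point is None:
        return "no target"
    distance = math.dist(camera_pos, point)
    if distance < min_dist:
        return "move back"
    if distance > max_dist:
        return "move closer"
    return "ok"  # within the assumed optimal scan range; an image could be captured here

if __name__ == "__main__":
    # Device held 1.5 m above the ground, pointing straight down.
    print(scan_indicator((0.0, 1.5, 0.0), (0.0, -1.0, 0.0), 1.0, 2.0))
```

A real application would take the camera pose and the detected ground plane from its AR toolkit rather than from hard-coded tuples, and would draw the target and indicator over the video feed instead of returning a string.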

First claim

Opening claim text (preview).

The invention claimed is:
1. A non-transitory computer-accessible medium having stored thereon computer-executable instructions executable by a computing hardware arrangement, wherein, when the computing hardware arrangement executes the instructions, the computing hardware arrangement is configured to perform procedures comprising: displaying, on a display, a video feed captured by a camera from a first position; determining an intersection point between a ground plane and the first position; displaying a target over a visual representation of the intersection point on the display; determining a distance between the first position and the intersection point; and capturing by the camera an image of an object in the video feed when the distance between the first position and the intersection point is within an optimal scan distance.
2. The non-transitory computer-accessible medium of claim 1, wherein the computing hardware arrangement further comprises instruction for identifying, using an object recognition model, an object in the video feed.
3. The non-transitory computer-accessible medium of claim 2, wherein the computing hardware arrangement further comprises instruction for displaying information about the identified object on the display.
4. The non-transitory computer-accessible medium of claim 1, wherein the intersection point is a point on a ground plane where a perpendicular hypothetical line extending from the first position would intersect the ground plane.
5. The non-transitory computer-accessible medium of claim 1, wherein the optimal scanning distance is determined based on a threshold percentage of a field of view being occupied by the object.
6. The non-transitory computer-accessible medium of claim 1, wherein the optimal scan distance is a learned value received from the object recognition model.
7. The non-transitory computer-accessible medium of claim 1, wherein the intersection point corresponds to a center point of a field of view of the camera located at the first position.
8. The non-transitory computer-accessible medium of claim 1, wherein the image of the object is captured automatically based upon the distance between the first position and the intersection point.
9. The non-transitory computer-accessible medium of claim 1, wherein the optimal scan distance is determined based on an altitude of the first position relative to the ground plane.
10. The non-transitory computer-accessible medium of claim 1, wherein the optical scan distance is determined based on a field of view and an angular separation of pixels of the camera located at the first position.
11. The non-transitory computer-accessible medium of claim 2, wherein the computing hardware arrangement further comprises instruction displaying a notification in the video feed upon the capture of the image of the object.
12. A method comprising: displaying, on a display, a video feed captured by a camera from a first position; determining an intersection point between a ground plane and the first position; displaying a target over a visual representation of the intersection point; determining a distance between the first position and the intersection point; and capturing by the camera an image of an object in the video feed when the distance between the first position and the intersection point is within an optimal scan distance.
13. The method of claim 12, further comprising: identifying, using an object recognition model, an object in the video feed.
14. The method of claim 13, further comprising: displaying information about the identified object on the display.
15. The method of claim 12, wherein the intersection point is a point on a ground plane where a perpendicular hypothetical line extending from the first position would intersect the ground plane.
16. The method of claim 12, wherein the optimal scanning distance is determined based on a threshold percentage of a field of view being occupied by the object.
17. The method of claim 12, wherein the optimal scan distance is a learned value received from the object recognition model.
18. The method of claim 12, wherein the intersection point corresponds to a center point of a field of view of the camera located at the first position.
19. The method of claim 12, wherein the image of the object is captured automatically based upon the distance between the first position and the intersection point.
20. The method of claim 12, wherein the optimal scan distance is determined based on an altitude of the first position relative to the ground plane.
21. The method of claim 12, wherein the optical scan distance is determined based on a field of view and an angular separation of pixels of the camera located at the first position.
22. A device comprising: a camera configured to capture a video feed from a first position; a display configured to display the video feed; and a processor, wherein the processor is configured to: display, on the display, a video feed captured by a camera from a first position; determine an intersection point between a ground plane and the first position; display a target over a visual representation of the intersection point; determine a distance between the first position and the intersection point; and capture, by the camera, an image of an object in the video feed when the distance between the first position and the intersection point is within an optimal scan distance.
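Claims 5 and 16 tie the optimal scanning distance to a threshold percentage of the field of view being occupied by the object, but the claim text gives no formula. One possible geometric reading, offered here only as a sketch, uses a simple pinhole-camera model: at distance d the camera sees a strip of width 2·d·tan(FOV/2), so an object of known width fills a given fraction of the view at a specific distance. The function name, the use of a linear (width) fraction rather than an area fraction, and the example values are all assumptions.

```python
import math

def distance_for_fov_fraction(object_width_m, fov_deg, fraction):
    """Distance at which an object of the given width spans `fraction` of the horizontal view.

    Assumes a pinhole camera with horizontal field of view fov_deg and an object
    centered in, and facing, the camera. Not taken from the patent text.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not 0 < fov_deg < 180:
        raise ValueError("fov_deg must be in (0, 180)")
    half_angle = math.radians(fov_deg) / 2
    # Visible width at distance d is 2 * d * tan(half_angle); solve for d.
    return object_width_m / (2 * fraction * math.tan(half_angle))

if __name__ == "__main__":
    # A 2 m wide object filling half of a 90-degree field of view.
    print(distance_for_fov_fraction(2.0, 90.0, 0.5))
```

Under this reading, a higher threshold fraction or a wider field of view both pull the target distance closer to the object.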

Assignees

Capital One Services, LLC

Inventors

Classifications

  • Three-dimensional [3D] objects · CPC title

  • in augmented reality scenes · CPC title

  • using classification, e.g. of video objects · CPC title

  • using hand-held instruments · CPC title

  • of vehicle lights or traffic lights · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12008724B2 cover?
A smart device is provided with an application program for displaying a video feed received from the smart device's camera. The application can determine the coordinates for an intersection point, which is a point on the ground where the smart device is pointing at. The application can display a target on the visual representation of the intersection point. Based on whether the smart device is …
Who is the assignee on this patent?
Capital One Services, LLC
What technology area does this patent fall under?
Primary CPC classification G06T19/006. Mapped technology areas include Physics.
When was this patent published?
Publication date Jun 11, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).