Live document detection in a captured video stream

US12475300B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12475300-B2
Application number: US-202418744349-A
Country: US
Kind code: B2
Filing date: Jun 14, 2024
Priority date: Jul 22, 2016
Publication date: Nov 18, 2025
Grant date: Nov 18, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure is directed toward systems and methods to quickly and accurately identify boundaries of a displayed document in a live camera image feed, and provide a document boundary indicator within the live camera image feed. For example, systems and methods described herein utilize different display document detection processes in parallel to generate and provide a document boundary indicator that accurately corresponds with a displayed document within a live camera image feed. Thus, a user of the mobile computing device can easily see whether the document identification system has correctly identified the displayed document within the camera viewfinder feed.
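
The abstract says that different detection processes run in parallel, but it does not say which processes are used or how their results are combined. The following is a minimal sketch under stated assumptions: the two detectors, the `Boundary` type, the confidence scores, and the "keep the most confident result" rule are all hypothetical illustrations, not the patented method.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple

Point = Tuple[float, float]
Frame = List[List[int]]  # grayscale pixel rows; a stand-in for a real camera frame


@dataclass
class Boundary:
    corners: Tuple[Point, Point, Point, Point]  # clockwise from top-left
    confidence: float  # 0.0 to 1.0; hypothetical score set by each detector


Detector = Callable[[Frame], Optional[Boundary]]


def bright_region_detector(frame: Frame) -> Optional[Boundary]:
    """Hypothetical detector: bounding box of pixels brighter than 128."""
    hits = [(x, y) for y, row in enumerate(frame) for x, v in enumerate(row) if v > 128]
    if not hits:
        return None
    xs = [x for x, _ in hits]
    ys = [y for _, y in hits]
    left, right, top, bottom = min(xs), max(xs), min(ys), max(ys)
    corners = ((left, top), (right, top), (right, bottom), (left, bottom))
    return Boundary(corners, confidence=0.6)


def full_frame_detector(frame: Frame) -> Optional[Boundary]:
    """Hypothetical fallback: treat the whole frame as the document."""
    h, w = len(frame), len(frame[0])
    corners = ((0, 0), (w - 1, 0), (w - 1, h - 1), (0, h - 1))
    return Boundary(corners, confidence=0.1)


def detect_boundary(frame: Frame, detectors: Sequence[Detector]) -> Optional[Boundary]:
    """Run every detector concurrently and keep the most confident result."""
    with ThreadPoolExecutor(max_workers=len(detectors)) as pool:
        results = list(pool.map(lambda detect: detect(frame), detectors))
    found = [r for r in results if r is not None]
    return max(found, key=lambda b: b.confidence, default=None)


# A tiny 5x4 frame with a bright 3x2 "document" in the middle.
frame = [
    [10, 10, 10, 10, 10],
    [10, 200, 200, 200, 10],
    [10, 200, 200, 200, 10],
    [10, 10, 10, 10, 10],
]
best = detect_boundary(frame, [bright_region_detector, full_frame_detector])
print(best.corners)
```

In a live feed, `detect_boundary` would be called once per frame and the returned corners redrawn as the boundary indicator. A real implementation would likely use an image-processing library rather than nested lists.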

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method comprising: detecting a visual representation of a physical document within a live camera feed from a camera on a mobile device; providing, for display within a graphical user interface of the mobile device, a live boundary indicator associated with the visual representation of the physical document within the live camera feed, the live boundary indicator updating based on changes to a position or an orientation of the visual representation of the physical document within the live camera feed; capturing an image of the visual representation of the physical document from the live camera feed; converting the image of the visual representation of the physical document that is in a picture file format to a document file that is in a document file format; and sending the document file to a second computing device for storage.

2. The computer-implemented method of claim 1, wherein sending the document file to the second computing device for storage comprises sending the document file to a content management system causing the content management system to send the document file to the second computing device.

3. The computer-implemented method of claim 1, wherein the mobile device and the second computing device are both associated with a user account.

4. The computer-implemented method of claim 1, wherein converting the image of the visual representation of the physical document comprises: generating an enhanced image from the image of the visual representation of the physical document; and converting the enhanced image of the visual representation of the physical document that is in a picture file format to a document file that is in a document file format the image of the visual representation of the physical document is in a picture file format; and converting the image of the visual representation of the physical document to a document file comprises converting the image from the picture file format to a document file format.

5. The computer-implemented method of claim 1, wherein the mobile device is associated with a first user account and the second computing device is associated with a second user account.

6. The computer-implemented method as recited in claim 1, further comprising providing a shaded overlay comprising a document area within a perimeter of the live boundary indicator and a non-document area outside the perimeter of the live boundary indicator, wherein the document area has a different shading compared to the non-document area.

7. The computer-implemented method as recited in claim 1, further comprising: cropping the image of the visual representation of the physical document based on the live boundary indicator; and providing a preview of the cropped image prior to converting the image of the visual representation of the physical document to the document file.

8. A non-transitory computer-readable medium storing instructions thereon that, when executed by at least one processor, cause a mobile device to: detect a visual representation of a physical document within a live camera feed from a camera on the mobile device, the mobile device associated with a user account; provide, for display within a graphical user interface of the mobile device, a live boundary indicator associated with the visual representation of the physical document within the live camera feed; capture an image of the visual representation of the physical document from the live camera feed; generate an enhanced image from the image of the visual representation of the physical document; convert the enhanced image of the visual representation of the physical document that is in a picture file format to a document file that is in a document file format; and provide the document file to a second client device associated with the user account.

9. The non-transitory computer-readable medium as recited in claim 8, wherein providing the document file to the second client device associated with the user account comprises providing the document file to a content management system that synchronizes the document file on the second client device.

10. The non-transitory computer-readable medium as recited in claim 8, wherein generating the enhanced image from the image of the visual representation of the physical document comprises cropping the image based on the live boundary indicator.

11. The non-transitory computer-readable medium as recited in claim 8, wherein: the image of the visual representation of the physical document is in a picture file format; and converting the image of the visual representation of the physical document to a document file comprises converting the image from the picture file format to a document file format.

12. The non-transitory computer-readable medium as recited in claim 8, wherein providing the document file to the second client device associated with the user account is in response to receiving a user selection of a share file option.

13. The non-transitory computer-readable medium as recited in claim 8, further comprising instructions that, when executed by the at least one processor, cause the mobile device to provide a preview of the enhanced image prior to converting the enhanced image to the document file.

14. The non-transitory computer-readable medium as recited in claim 8, further comprising instructions that, when executed by the at least one processor, cause the mobile device to provide a shaded overlay comprising a document area within a perimeter of the live boundary indicator and a non-document area outside the perimeter of the live boundary indicator, wherein the document area has a different shading compared to the non-document area.

15. A system comprising: at least one processor; and at least one non-transitory computer-readable storage medium storing instructions thereon that, when executed by the at least one processor, cause the system to: detect a visual representation of a physical document within a live camera feed from a camera on a mobile device, the mobile device associated with a user account; provide, for display within a graphical user interface of the mobile device, a live boundary indicator associated with the visual representation of the physical document within the live camera feed; capture an image of the visual representation of the physical document from the live camera feed; convert the image of the visual representation of the physical document that is in a picture file format to a document file that is in a document file format; and provide the document file to a second client device associated with the user account.

16. The system as recited in claim 15, wherein providing the document file to the second client device associated with the user account comprises providing the document file to a content management system that synchronizes the document file on the second client device.

17. The system as recited in claim 15, further comprising instructions that, when executed by the at least one processor, cause the system to: crop the image of the visual representation of the physical document based on the live boundary indicator; and provide a preview of the cropped image prior to converting the image of the visual representation of the physical document to the document file.

18. The system as recited in claim 15, further comprising instructions that, when executed by the at least one processor, cause the system to provide a shaded overlay comprising a document
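
The shaded overlay and the boundary-based crop recited in the claims can be illustrated with a small geometric sketch. It assumes the live boundary indicator is a convex quadrilateral given as four corner points. The function names, the alpha values, and the choice of an axis-aligned bounding box for cropping are hypothetical and are not taken from the patent.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def inside_convex_quad(p: Point, quad: Sequence[Point]) -> bool:
    """True if p lies inside or on the edge of a convex quadrilateral.

    The point is inside when it sits on the same side of all four edges,
    which is checked with the sign of a 2D cross product per edge.
    """
    crosses = []
    for i in range(4):
        ax, ay = quad[i]
        bx, by = quad[(i + 1) % 4]
        crosses.append((bx - ax) * (p[1] - ay) - (by - ay) * (p[0] - ax))
    return all(c >= 0 for c in crosses) or all(c <= 0 for c in crosses)


def shaded_overlay(
    width: int,
    height: int,
    quad: Sequence[Point],
    doc_alpha: float = 0.0,    # assumed: document area left unshaded
    other_alpha: float = 0.5,  # assumed: non-document area dimmed
) -> List[List[float]]:
    """Per-pixel shading strength: one value inside the quad, another outside."""
    return [
        [doc_alpha if inside_convex_quad((x, y), quad) else other_alpha
         for x in range(width)]
        for y in range(height)
    ]


def crop_box(quad: Sequence[Point]) -> Tuple[float, float, float, float]:
    """Axis-aligned box (left, top, right, bottom) enclosing the boundary."""
    xs = [x for x, _ in quad]
    ys = [y for _, y in quad]
    return (min(xs), min(ys), max(xs), max(ys))


# A slightly rotated document inside a 5x4 viewfinder.
quad = [(1, 0), (4, 1), (3, 3), (0, 2)]
overlay = shaded_overlay(5, 4, quad)
print(overlay[1][2])   # shading at a pixel inside the boundary
print(overlay[0][0])   # shading at a pixel outside the boundary
print(crop_box(quad))
```

A bounding-box crop keeps some background around a rotated page. A production scanner would more plausibly apply a perspective transform to the four corners, which this sketch does not attempt.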

Assignees

Dropbox Inc

Inventors

Classifications

  • Texturing; Colouring; Generation of textures or colours (retouching, inpainting or scratch removal G06T5/77)

  • Denoising; Smoothing

  • Dynamic range modification of images or parts thereof

  • Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components

  • Image acquisition (document image scanning and transmission H04N1/00; control of digital cameras H04N23/60)

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12475300B2 cover?
The present disclosure is directed toward systems and methods to quickly and accurately identify boundaries of a displayed document in a live camera image feed, and provide a document boundary indicator within the live camera image feed. For example, systems and methods described herein utilize different display document detection processes in parallel to generate and provide a document boundar…
Who is the assignee on this patent?
Dropbox Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/123. Mapped technology areas include Physics.
When was this patent published?
Publication date Nov 18, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).