Scanbox

US9558401B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9558401-B2
Application numberUS-201514861586-A
CountryUS
Kind codeB2
Filing dateSep 22, 2015
Priority dateSep 10, 2013
Publication dateJan 31, 2017
Grant dateJan 31, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments are provided for content item classification. In some embodiments, an image for classification is received, a compact representation for the image having values indicative of pixel values within the received image is generated, a plurality of angle measurements for possible edges of at least one potential document within the received image are determined, and the image is classified using said compact representation and said plurality of angle measurements.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for content item classification, comprising: receiving a digital image; identifying, by at least one processor, a plurality of angle measurements for possible page edges of at least one potential document within the received image, wherein identifying the plurality of angle measurements comprises: identifying, by the at least one processor, a plurality of edge candidates of the at least one potential document; and calculating an angle measurement for each edge candidate of the plurality of edge candidates based on a plurality of vectors, each vector of the plurality of vectors extending orthogonally from a selected origin to a point corresponding to a given edge candidate of the plurality of edge candidates; determining, based on the identified plurality of angle measurements, whether the image contains a document; and in response to determining that the image contains a document, classifying the image as a document-containing image. 2. The method as recited in claim 1 , further comprising converting the plurality of angle measurements to polar coordinates. 3. The method as recited in claim 2 , further comprising determining, based on the polar coordinates, a location for each page edge of the document within the received digital image. 4. The method as recited in claim 1 , wherein identifying the plurality of edge candidates of the at least one potential document comprises calculating gradient values for a plurality of points within the received digital image. 5. The method as recited in claim 4 , wherein identifying the plurality of edge candidates of the at least one potential document further comprises identifying a location for each of the plurality of edge candidates within the received digital image based on the gradient values. 6. The method as recited in claim 4 , wherein calculating the plurality of angle measurements comprises calculating polar coordinates for the plurality of points within the received image. 7. The method as recited in claim 6 , further comprising eliminating at least one of the plurality of edge candidates as a page edge for the document based on the plurality of angle measurements. 8. The method as recited in claim 6 , further comprising confirming at least one of the plurality of edge candidates as a page edge for the document based on the plurality of angle measurements. 9. A non-transitory computer-readable medium containing storing instructions that, when executed by at least one processor of a computing device, cause the computing device to: receive a digital image; identify a plurality of angle measurements for possible page edges of at least one potential document within the received image, wherein identifying the plurality of angle measurements comprises: identifying a plurality of edge candidates of the at least one potential document; and calculating an angle measurement for each edge candidate of the plurality of edge candidates based on a plurality of vectors, each vector of the plurality of vectors extending orthogonally from a selected origin to a point corresponding to a given edge candidate of the plurality of edge candidates; determine, based on the identified plurality of angle measurements, whether the image contains a document; and in response to determining that the image contains a document, classify the image as a document-containing image. 10. The non-transitory computer-readable medium of claim 9 , further comprising instructions that, when executed by the at least one processor of the computing device, cause the computing device to convert the plurality of angle measurements to polar coordinates. 11. The non-transitory computer-readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor of the computing device, cause the computing device to determine, based on the polar coordinates, a location for each page edge of the document within the received digital image. 12. The non-transitory computer-readable medium of claim 9 , wherein identifying the plurality of edge candidates of the at least one potential document comprises calculating gradient values for a plurality of points within the received digital image. 13. The non-transitory computer-readable medium of claim 12 , wherein identifying the plurality of edge candidates of the at least one potential document further comprises identifying a location for each of the plurality of edge candidates within the received digital image based on the gradient values. 14. The non-transitory computer-readable medium of claim 12 , wherein calculating the plurality of angle measurements comprises calculating polar coordinates for the plurality of points within the received image. 15. A system for content item classification, the system comprising: one or more processors; and memory containing instructions thereon that, when executed, cause the one or more processors to: receive a digital image; identify a plurality of angle measurements for possible page edges of at least one potential document within the received image, wherein identifying the plurality of angle measurements comprises: identifying a plurality of edge candidates of the at least one potential document; and calculating an angle measurement for each edge candidate of the plurality of edge candidates based on a plurality of vectors, each vector of the plurality of vectors extending orthogonally from a selected origin to a point corresponding to a given edge candidate of the plurality of edge candidates; determine, based on the identified plurality of angle measurements, whether the image contains a document; and in response to determining that the image contains a document, classify the image as a document-containing image. 16. The system of claim 15 , further comprising instructions that, when executed, cause the one or more processors to convert the plurality of angle measurements to polar coordinates. 17. The system of claim 16 , further comprising instructions that, when executed, cause the one or more processors to determine, based on the polar coordinates, a location for each page edge of the document within the received digital image. 18. The system of claim 15 , wherein identifying the plurality of edge candidates of the at least one potential document comprises calculating gradient values for a plurality of points within the received digital image. 19. The system of claim 18 , wherein identifying the plurality of edge candidates of the at least one potential document further comprises identifying a location for each of the plurality of edge candidates within the received digital image based on the gradient values. 20. The system of claim 18 , wherein calculating the plurality of angle measurements comprises calculating polar coordinates for the plurality of points within the received image.

Assignees

Inventors

Classifications

  • Classification of content, e.g. text, photographs or tables · CPC title

  • G06V10/48Primary

    by mapping characteristic values of the pattern into a parameter space, e.g. Hough transformation · CPC title

  • by compensating for image skew or non-uniform image deformations · CPC title

  • Physics · mapped topic

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9558401B2 cover?
Embodiments are provided for content item classification. In some embodiments, an image for classification is received, a compact representation for the image having values indicative of pixel values within the received image is generated, a plurality of angle measurements for possible edges of at least one potential document within the received image are determined, and the image is classified…
Who is the assignee on this patent?
Dropbox Inc
What technology area does this patent fall under?
Primary CPC classification G06V10/48. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 31 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).