Structured text and pattern matching for data loss prevention in object-specific image domain

US10482280B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10482280-B2
Application numberUS-201715419756-A
CountryUS
Kind codeB2
Filing dateJan 30, 2017
Priority dateJan 30, 2017
Publication dateNov 19, 2019
Grant dateNov 19, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Structured text and pattern matching may be performed for data loss prevention in object-specific image domain. According to some embodiments, a method may include receiving an image, identifying one or more objects in the image based on attributes of the one or more objects, and determining an object type of a first object of the one or more objects by a computing device. The method may include identifying, by the computing device, one or more specific regions of the first object for recognition based on the object type of the first object and recognizing text in the one or more specific regions of the first object. In some embodiments, the method may then include providing, by the computing device, the text recognized in the one or more specific regions of the first object to a security engine, wherein the security engine may be configured to evaluate whether the text comprises sensitive information.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving, by a computing device, an image; identifying, by the computing device, multiple physical objects represented in the image based on attributes of the multiple physical objects; determining, by the computing device, an object type of a first object of the multiple physical objects; determining, by the computing device, an object type of a second object of the multiple physical objects; identifying, by the computing device, one or more specific regions of the first object for recognition based on the object type of the first object; identifying, by the computing device, one or more specific regions of the second object for recognition based on the object type of the second object; responsive to identifying the one or more specific regions, recognizing, by the computing device, text in the one or more specific regions of the first object; recognizing, by the computing device, text in the one or more specific regions of the second object; providing, by the computing device, the text recognized in the one or more specific regions of the first object to a security engine, the security engine configured to evaluate whether the text comprises sensitive information; and providing, by the computing device, the text in the one or more specific regions of the second object to the security engine. 2. The computer-implemented method of claim 1 , further comprising: in response to the security engine determining that the text in the one or more specific regions of the first object does not comprise sensitive information, classifying, by the computing device, the first object as not including sensitive information. 3. The computer-implemented method of claim 1 , further comprising: determining, by the computing device, whether text is present in the one or more specific regions of the first object; and in response to determining that no text is present in the one or more specific regions, classifying the object as not including sensitive information. 4. The computer-implemented method of claim 1 , wherein a location of each specific region of the one or more specific regions is determined based on a known layout of the object type. 5. The computer-implemented method of claim 1 , wherein the security engine is further configured to determine whether the text comprises sensitive information based on the metadata tagged on the text. 6. The computer-implemented method of claim 1 , further comprising: determining that the image contains sensitive information based on the object type of the first object and the object type of the second object. 7. The computer-implemented method of claim 6 , wherein it is determined that the image contains sensitive information responsive to the object type of the first object being a first particular type and the object type of the second object being a second, different particular object type. 8. The computer-implemented method of claim 1 , wherein the security engine evaluates whether the text comprises sensitive information by matching the text in the one or more specific regions of the first object against a database of defined sensitive information. 9. The computer-implemented method of claim 1 , further comprising: automatically blocking the image from being electronically transferred in response to the text recognized in the one or more specific regions of the first object being evaluated as including sensitive information. 10. The computer-implemented method of claim 1 , further comprising: automatically transmitting an electronic message to an administrator in response to the text recognized in the one or more specific regions of the first object being evaluated as including sensitive information. 11. The computer-implemented method of claim 1 comprising: adjusting a rotation or a perspective of the one or more specific regions using a determined rotation or perspective of the first object in order to facilitate text recognition within the one or more specific regions of the first object. 12. The computer-implemented method of claim 1 , wherein the attributes comprise dimensions of the first object. 13. The computer-implemented method of claim 1 , wherein the object type of the first object comprises a check. 14. The computer-implemented method of claim 13 , wherein the check is identified based on attributes comprising MICR (Magnetic Ink Character Recognition) characters. 15. The computer-implemented method of claim 1 , wherein the object type of the first object comprises a photo identification card. 16. The computer-implemented method of claim 15 , wherein the photo identification card is identified based on attributes comprising an image of a face at a given location. 17. A non-transitory computer readable medium storing instructions that, when executed by a computing device having one or more processors, causes the one or more processors to perform operations comprising: receiving, by a computing device, an image; identifying, by the computing device, multiple physical objects represented in the image based on attributes of the multiple physical objects; determining, by the computing device, an object type of a first object of the multiple physical objects; determining, by the computing device, an object type of a second object of the multiple physical objects; identifying, by the computing device, one or more specific regions of the first object for recognition based on the object type of the first object; identifying, by the computing device, one or more specific regions of the second object for recognition based on the object type of the second object; responsive to identifying the one or more specific regions, recognizing, by the computing device, text in the one or more specific regions of the first object; recognizing, by the computing device, text in the one or more specific regions of the second object; providing, by the computing device, the text recognized in the one or more specific regions of the first object to a security engine, the security engine configured to evaluate whether the text comprises sensitive information; and providing, by the computing device, the text in the one or more specific regions of the second object to the security engine. 18. A computer system comprising: a receiving module programmed to receive an image, at least one processor configured to execute the receiving module; an object identification module programmed to identify multiple physical objects represented in the image based on attributes of the multiple physical objects, the at least one processor configured to execute the object identification module, the object identification module communicatively coupled to the receiving module; an object analysis module programmed to determine an object type of a first object of the multiple physical objects, determine an object type of a second object of the multiple physical objects, identify one or more specific regions of the first object for recognition based on the object type of the first object, and identify one or more specific regions of the second object for recognition based on the object type of the second object, the at least one processor configured to execute the object analysis module, the object analysis module communicatively coupled to the object identification module; and a text recognition module programmed to, responsive to identifying the one or more specific regions, recognize text in the one or more specific regions of the first object, recognize text in the

Assignees

Inventors

Classifications

  • involving the use of external additional devices, e.g. dongles or smart cards · CPC title

  • Character recognition · CPC title

  • Region-based segmentation · CPC title

  • Protecting personal data, e.g. for financial or medical purposes · CPC title

  • Still image; Photographic image · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10482280B2 cover?
Structured text and pattern matching may be performed for data loss prevention in object-specific image domain. According to some embodiments, a method may include receiving an image, identifying one or more objects in the image based on attributes of the one or more objects, and determining an object type of a first object of the one or more objects by a computing device. The method may includ…
Who is the assignee on this patent?
Symantec Corp
What technology area does this patent fall under?
Primary CPC classification G06F21/6245. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 19 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).