Document redaction with data retention

US2016239668A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016239668-A1
Application numberUS-201514622651-A
CountryUS
Kind codeA1
Filing dateFeb 13, 2015
Priority dateFeb 13, 2015
Publication dateAug 18, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for redacting an electronic document (ED) having a file format, including: obtaining a request to redact a sensitive data item in the ED; identifying a first instance and a second instance of the sensitive data item in a markup of the ED, where the second instance of the sensitive data item is not visible in a rendered version of the ED; and generating a redacted ED having the file format by: replacing the first instance of the sensitive data item and the second instance of the sensitive data item with a neutral data item, and inserting, into the markup, an encrypted version of the sensitive data item at a first location.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for redacting an electronic document (ED) having a file format, comprising: obtaining a request to redact a sensitive data item in the ED; identifying a first instance and a second instance of the sensitive data item in a markup of the ED, wherein the second instance of the sensitive data item is not visible in a rendered version of the ED; and generating a redacted ED having the file format by: replacing the first instance of the sensitive data item and the second instance of the sensitive data item with a neutral data item, and inserting, into the markup, an encrypted version of the sensitive data item at a first location. 2 . The method of claim 1 , wherein generating the redacted ED further comprises: inserting, into the markup, the encrypted version of the sensitive data item at a second location corresponding to the second instance of the sensitive data item, wherein the first location corresponds to the first instance of the sensitive data item. 3 . The method of claim 2 , further comprising: identifying a third instance of the sensitive data item in the markup of the ED, wherein generating the redacted ED further comprises: replacing the third instance of the sensitive data item with the neutral data item; and inserting, into the markup, a third encrypted instance of the sensitive data item at a third location corresponding to the third instance of the sensitive data item, wherein the first instance of the sensitive data item and the third instance of the sensitive data item are located in different files in the markup of the ED. 4 . The method of claim 1 , further comprising: identifying, during a search of the markup, a third instance of the sensitive data item; displaying a prompt based on the third instance; receiving, in response to the prompt, an instruction to not redact the third instance; and resuming the search without replacing the third instance based on the instruction. 5 . The method of claim 1 , wherein inserting the encrypted version of the sensitive data item comprises: creating an alternate content section in the markup comprising the encrypted version of the sensitive data item. 6 . The method of claim 1 , wherein inserting the encrypted version of the sensitive data item comprises: creating a content extension section in the markup comprising the encrypted version of the sensitive data item. 7 . The method of claim 1 , wherein the file format is Open Office XML (OOXML). 8 . The method of claim 1 , wherein replacing the first instance of the sensitive data item comprises: determining a size of a bounding box for the first instance of the sensitive data item; removing the first instance of the sensitive data item from the markup of the ED; and modifying the markup of the ED to include the neutral data item having the size of the bounding box. 9 . A system for redacting an electronic document (ED) having a file format, comprising: a computer processor; a user interface (UI) configured to obtain a request to redact a sensitive data item in the ED; an identification engine (IE) configured to identify a first and a second instance of the sensitive data item in a markup of the ED, wherein the second instance of the sensitive data item is not visible in a rendered version of the ED; and a redaction engine (RE), executing on the computer processor, and configured to generate a redacted ED having the file format by: replacing the first instance of the sensitive data item and the second instance of the sensitive data item with a neutral data item, and inserting, into the markup, an encrypted version of the sensitive data item at a first location. 10 . The system of claim 9 , wherein the RE is further configured to: insert, into the markup, the encrypted version of the sensitive data item at a second location corresponding to the second instance of the sensitive data item, wherein the first location corresponds to the first instance of the sensitive data item. 11 . The system of claim 10 wherein: the IE is further configured to identify a third instance of the sensitive data item in the markup of the ED, and the RE is further configured to: replace the third instance of the sensitive data item with the neutral data item; and insert, into the markup, a third instance of the sensitive data item at a third location corresponding to the third instance of the sensitive data item, wherein the first instance of the sensitive data item and the third instance of the sensitive data item are located in different files of the ED. 12 . The system of claim 9 , wherein inserting the encrypted version of the sensitive data item comprises: creating an alternate content section in the markup comprising the encrypted version of the sensitive data item. 13 . The system of claim 9 , wherein inserting the encrypted version of the sensitive data item comprises: creating a content extension section in the markup comprising the encrypted version of the sensitive data item. 14 . The system of claim 9 , wherein the file format is Open Office XML (OOXML). 15 . A non-transitory computer readable medium (CRM) storing instructions for redacting an electronic document (ED) having a file format, the instructions comprising functionality for: obtaining a request to redact a sensitive data item in the ED; identifying a first instance and a second instance of the sensitive data item in a markup of the ED, wherein the second instance of the sensitive data item is not visible in a rendered version of the ED; and generating a redacted ED having the file format by: replacing the first instance of the sensitive data item and the second instance of the sensitive data item with a neutral data item, and inserting, into the markup, an encrypted version of the sensitive data item at a first location. 16 . The non-transitory CRM of claim 15 , wherein the instructions for generating the redacted ED further comprise functionality for: inserting, into the markup, the encrypted version of the sensitive data item at a second location corresponding to the second instance of the sensitive data item, wherein the first location corresponds to the first instance of the sensitive data item. 17 . The non-transitory CRM of claim 16 , wherein the instructions for redacting the ED further comprise functionality for: identifying a third instance of the sensitive data item in the markup of the ED, wherein generating the redacted ED further comprises: replacing the third instance of the sensitive data item with the neutral data item; and inserting, into the markup, a third encrypted instance of the sensitive data item at a third location corresponding to the third instance of the sensitive data item, wherein the first instance of the sensitive data item and the third instance of the sensitive data item are located in different files in the markup of the ED. 18 . The non-transitory CRM of claim 15 , wherein the instructions for inserting the encrypted version of the sensitive data item comprise functionality for: creating an alternate content section in the markup comprising the encrypted version of the sensitive data item. 19 . The non-transitory CRM of claim 15 , wherein the instructions for inserting the encrypted version of the sensitive data item comprise functionality for: creating a content extension section in the markup comprising the encrypted version of the sensitive data item. 20 . The non-

Assignees

Inventors

Classifications

  • G06F21/602Primary

    Providing cryptographic facilities or services · CPC title

  • Physics · mapped topic

  • Physics · mapped topic

  • to a single file or object, e.g. in a secure envelope, encrypted and accessed using a key, or with access control rules appended to the object itself · CPC title

  • Editing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016239668A1 cover?
A method for redacting an electronic document (ED) having a file format, including: obtaining a request to redact a sensitive data item in the ED; identifying a first instance and a second instance of the sensitive data item in a markup of the ED, where the second instance of the sensitive data item is not visible in a rendered version of the ED; and generating a redacted ED having the file for…
Who is the assignee on this patent?
Konica Minolta Laboratory Usa Inc
What technology area does this patent fall under?
Primary CPC classification G06F21/602. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Aug 18 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).