Privacy annotation from differential analysis of snapshots

US10839103B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10839103-B2
Application numberUS-201916539231-A
CountryUS
Kind codeB2
Filing dateAug 13, 2019
Priority dateMar 23, 2017
Publication dateNov 17, 2020
Grant dateNov 17, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method is provided for preventing divulgation of sensitive data in two snapshots, taken at different times, of one or more same systems in a cloud environment. The method identifies a set of files from among file pairs. Each file pair is formed from a respective file that includes at least one difference with respect to each snapshot. The method performs a pattern reducing process that removes, from the set of files, any of the files having, as the difference, a predetermined non-sensitive difference between respective executions of a pre-determined system operation. The method performs a commonality reducing process that removes, from the set of files, any files having, as the difference, a common difference between different users. The method annotates data in remaining files in the set as potentially being the sensitive data, subsequent to the reducing processes. The two snapshots include at least one Sandbox-based image.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer-implemented method for preventing divulgation of sensitive data in two snapshots of one or more same systems in a cloud environment, the method comprising: identifying a set of files from among a plurality of file pairs, each of the plurality of file pairs being formed from a respective file that includes at least one difference with respect to each of the two snapshots, taken at different times; performing a pattern reducing process that removes, from the set of files, any of the files having, as the at least one difference, a predetermined non-sensitive difference between respective executions of a pre-determined system operation: performing a commonality reducing process that removes, from the set of files, any of the files having, as the at least one difference, a common difference between different system users; and annotating data in remaining ones of the files in the set of files as potentially being the sensitive data, subsequent to said pattern reducing and commonality reducing processes, wherein the two snapshots comprise at least one Sandbox-based image of the one or more same systems of the cloud environment, and wherein the predetermined non-sensitive difference between the respective executions of the pre-determined system operation is determined using a Sandbox host. 2. The computer-implemented method of claim 1 , further comprising: prompting the user to provide a user input indicating whether to delete the annotated data; and deleting the annotated data responsive to the user input. 3. The computer-implemented method of claim 1 , further comprising: checking annotations of the annotated data to generate an annotation checking result; and modifying a system configuration of at least one of the one or more same systems, responsive to the annotation checking result. 4. The computer-implemented method of claim 1 , wherein each of the plurality of file pairs is formed based on the respective files therein having the at least one difference there between selected from the group consisting of (i) different attributes, (ii) different hash values and (iii) a status of one of the respective files being added or deleted relative to the other one of the respective files in a given one of the file pairs. 5. The computer-implemented method of claim 1 , wherein the common difference between the different system users is determined based on image content similarity data and image relationship data. 6. The computer-implemented method of claim 5 , wherein the image content similarity data is selected from the group consisting of operating system data, distribution data, file creation data, and file update data. 7. The computer-implemented method of claim 5 , wherein the image relationship data comprises meta-data derived image history data. 8. The computer-implemented method of claim 1 , wherein the common difference between the different system users is determined using, an actual one of the one or more systems. 9. The computer-implemented method of claim 1 , wherein the commonality reducing process and the pattern reducing process are iteratively performed based on one or more iteration criterion. 10. The computer-implemented method of claim 9 , wherein the one or more iteration criterion comprise an absence of further size reduction in the remaining ones of the files in the set.

Assignees

Inventors

Classifications

  • Protecting personal data, e.g. for financial or medical purposes · CPC title

  • Assessing vulnerabilities and evaluating computer system security · CPC title

  • using diagnostics (G06F11/0703 takes precedence) · CPC title

  • involving event detection and direct action · CPC title

  • Protecting data · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10839103B2 cover?
A method is provided for preventing divulgation of sensitive data in two snapshots, taken at different times, of one or more same systems in a cloud environment. The method identifies a set of files from among file pairs. Each file pair is formed from a respective file that includes at least one difference with respect to each snapshot. The method performs a pattern reducing process that remove…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F21/6245. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 17 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).