Data classification system for hybrid clouds

US9953075B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9953075-B1
Application numberUS-201213728490-A
CountryUS
Kind codeB1
Filing dateDec 27, 2012
Priority dateDec 27, 2012
Publication dateApr 24, 2018
Grant dateApr 24, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A data classification system is associated with a hybrid cloud comprising at least one private cloud and at least one public cloud. The data classification system comprises a data set classification model classifying data sets, a cloud classification model classifying the private and public clouds of the hybrid cloud, and mapping policies each specifying a particular mapping between one or more classes of the data set classification model and one or more classes of the cloud classification model. The data classification system classifies a received data set using the data set classification model, and determines for the received data set at least one cloud of the hybrid cloud to which the received data set should be directed for further processing based at least in part on a result of the classification of the received data set, the cloud classification model and a selected one of the mapping policies.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus comprising: a data classification system associated with a hybrid cloud comprising at least one private cloud and at least one public cloud; the data classification system comprising a data set classification model classifying data sets, a cloud classification model classifying the private and public clouds of the hybrid cloud into two or more different cloud classes, and a plurality of mapping policies each specifying a particular mapping between one or more classes of the data set classification model and one or more classes of the cloud classification model; wherein the data classification system is configured to classify a received data set using the data set classification model; wherein the data classification system further comprises a reasoner configured to determine a selected one of the mapping policies for the received data set, the reasoner comprising: a semantic ontology application programming interface; a model parser; a model updater; and a semantic learner; wherein the semantic ontology application programming interface provides programming access for adjusting one or more semantic ontologies utilized by the semantic learner; wherein the model parser extracts information from at least one of the data set classification model and the cloud classification model so as to make said information available to the semantic learner for use in performing one or more reasoning operations; wherein the model updater adjusts one or more characteristics of at least one of the data set classification model, the cloud classification model and one or more of the mapping policies responsive to feedback from the semantic learner; and wherein the semantic learner is configured to identify one or more additional relationship types that are not already captured in a current version of at least one of the data set classification model and the cloud classification model and to provide those relationship types to the model updater as part of said feedback; wherein the data classification system is further configured to determine for the received data set at least one cloud of the hybrid cloud to which the received data set should be directed for further processing based at least in part on a result of the classification of the received data set, the cloud classification model and the selected mapping policy; wherein the reasoner is further configured to migrate a given data set from a first cloud associated with a first one of the cloud classes to a second cloud associated with a second one of the cloud classes responsive to a change in at least one of the data set classification model, the cloud classification model and one or more of the mapping policies; and wherein the data classification system is implemented using at least one processing device comprising a processor coupled to a memory. 2. The apparatus of claim 1 wherein the data classification system comprises: a classifier configured to classify the received data set in accordance with the data set classification model; and a mapper configured to apply the selected mapping policy for the received data set to determine said at least one cloud to which that received data set should be directed for further processing. 3. The apparatus of claim 1 wherein the data classification system is adapted for coupling between at least one enterprise storage system and the hybrid cloud. 4. The apparatus of claim 3 wherein the data classification system is configured to receive the data set and associated metadata from at least one cloud agent associated with the enterprise storage system. 5. The apparatus of claim 4 wherein the data classification system is further configured to select for the received data set a particular one of the private and public clouds in which the received data set will be stored and to inform the cloud agent regarding its selection for the received data set. 6. The apparatus of claim 2 wherein the classifier in classifying the received data set utilizes metadata associated with the received data set. 7. The apparatus of claim 6 wherein the metadata characterizes at least one of properties of the received data set and relationships between the received data set and one or more other data sets. 8. The apparatus of claim 6 wherein the metadata characterizes the received data set in accordance with at least one specified semantic ontology. 9. The apparatus of claim 1 wherein at least one of the data set classification model, the cloud classification model and the mapping policies are represented at least in part in an RDF format. 10. The apparatus of claim 1 wherein the reasoner is configured to traverse at least one of the data set classification model, the cloud classification model and the mapping policies using one or more SPARQL queries. 11. The apparatus of claim 1 wherein the cloud classification model is based at least in part on designated parameters of the private and public clouds including one or more of auditability, availability, capacity, colocation, cost, performance and security. 12. The apparatus of claim 1 wherein the data set classification model and the cloud classification model are each arranged in the form of a class hierarchy. 13. The apparatus of claim 1 wherein said at least one processing device comprises an element of a processing platform of an information processing system that implements the data classification system and the hybrid cloud. 14. The apparatus of claim 1 wherein the data classification system automatically maps the received data set to an appropriate cloud of the hybrid cloud using the results of the classification of the received data set, the cloud classification model and the selected one of the mapping policies. 15. The apparatus of claim 2 wherein the mapper combines multiple mapping policies to determine said at least one cloud to which the received data set should be directed for further processing. 16. The apparatus of claim 12 wherein the hierarchy for the cloud classification model classifies the hybrid cloud into classes that comprise a private cloud class and a public cloud class. 17. The apparatus of claim 1 wherein: the data set classification model is configured to classify the received data set based at least in part on relationships between the received data set and one or more other data sets; and the selected mapping policy for the received data set comprises at least one mapping policy relating to colocation of the received data set with the one or more other data sets based on the relationships between the received data set and the one or more other data sets. 18. The apparatus of claim 1 wherein: the data set classification model is configured to classify the received data set based at least in part on properties of the received data set; and the reasoner is configured to modify a data set class of the received data set based on relationships between the received data set and one or more other data sets. 19. A method comprising the steps of: obtaining a data set classification model classifying data sets and a cloud classification model classifying private and public clouds of a hybrid cloud into two or more different cloud classes; instantiating a plurality of mapping policies each specifying a particular mapping between one or more classes of the data set classification model and one or more classes of the cloud classification model; receiving a data set; classifying the received data set using the data set classification model; p

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9953075B1 cover?
A data classification system is associated with a hybrid cloud comprising at least one private cloud and at least one public cloud. The data classification system comprises a data set classification model classifying data sets, a cloud classification model classifying the private and public clouds of the hybrid cloud, and mapping policies each specifying a particular mapping between one or more…
Who is the assignee on this patent?
Emc Corp, Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06F17/30598. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 24 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).