Estimating asset sensitivity using information associated with users

US10984322B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10984322-B2
Application numberUS-201313939402-A
CountryUS
Kind codeB2
Filing dateJul 11, 2013
Priority dateApr 9, 2013
Publication dateApr 20, 2021
Grant dateApr 20, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Automatically estimating a sensitivity level of an information technology (IT) asset in one aspect may obtain information about an asset. Characteristics of the asset assigned based on the information may be compared with stored characteristics of known sensitive assets. A sensitivity level of the asset may be determined based on the comparing.

First claim

Opening claim text (preview).

We claim: 1. A method for automatically estimating a sensitivity level of an information technology asset, comprising: obtaining information about an asset, the asset comprising a computer network and the obtaining the information comprises identifying Internet Protocol (IP) address from which the computer network is accessed, identifying a user associated with the IP address and determining user attributes of the user and entropy of user accesses of the computer network, the obtaining performed without having to access the asset such that privacy of the computer network is preserved; assigning characteristics to the asset based on the information comprising at least the user attributes and the entropy of user accesses to the asset comprising a computer network; comparing, by a processor, the characteristics of the asset with stored characteristics of known sensitive assets; and determining, by the processor, a sensitivity level of the asset based on the comparing, the sensitivity level determined without having to access the asset and privileged knowledge of the asset, the characteristics comprising features and wherein a machine learning algorithm is trained based on the features to predict the sensitivity level, the determining performed by executing the trained machine learning algorithm, wherein the features can be uniformly used for different asset types in training the machine learning algorithm to predict a sensitivity level for a respective different asset type, wherein the method identifies sensitivity levels for multiple assets respectively, the method facilitating computer security protection by automatically identifying a target asset among the multiple assets based on a corresponding determined sensitivity for providing security protection. 2. The method of claim 1 , wherein the obtaining information about an asset comprises obtaining one or more meta-level features of the asset. 3. The method of claim 1 , wherein the obtaining information about an asset comprises one or more of identifying user features comprising an identity of a user associated with the asset and user attributes associated with the user, identifying usage features associated with the asset, or identifying external content features from published data available outside the asset, or combinations thereof. 4. The method of claim 3 , wherein the characteristics of the asset are assigned by normalizing one or more of said user features associated with the asset, said usage features associated with the asset, or said external content features from published data available outside the asset, or combinations thereof. 5. The method of claim 1 , wherein the obtaining information about an asset comprises at least one of extracting access information from access logs, or performing Internet Protocol address-to-user mapping. 6. The method of claim 1 , wherein the obtaining information about an asset comprises identifying at least one of a job role or rank in an organization, or member type. 7. The method of claim 1 , wherein the obtaining information about an asset comprises at least identifying user features, and wherein the identifying user features comprises determining likely users of an asset. 8. The method of claim 1 , wherein the obtaining information about an asset comprises at least identifying usage features, and wherein the identifying usage features comprises obtaining at least one of usage frequency, usage patterns, access times, access duration, and number of users accessing the asset. 9. The method of claim 1 , wherein the obtaining information about an asset comprises at least identifying external data content features, and wherein the identifying external data content features comprises at least obtaining source information comprising at least one of titles, summaries, user documents, user data sources, patents, publications, file names, project groups, blogs, wiki sites, user web pages, and tags of user-related content. 10. The method of claim 9 , wherein the source information is used to generate a document representation of an asset. 11. The method of claim 10 , wherein the source information is aggregated for a plurality of document representations for the asset. 12. The method of claim 11 , further comprising applying one of topic modeling and clustering to the document representations to obtain content features for the asset. 13. The method of claim 1 , wherein a machine learning algorithm performs the comparing and the determining. 14. The method of claim 13 , wherein the machine learning algorithm comprises a k nearest neighbor based algorithm, wherein the sensitivity level of the asset is determined using V ⁡ ( A ) = ∑ i = 1 k ⁢ ⅇ - d ⁡ ( A , S i ) · V ⁡ ( S i ) wherein for sensitivity of a new asset A, V(A) is then defined as a weighted average score of its k-nearest neighbors among known sensitive assets, {S 1 , . . . , S k }, and V(S i ) is a sensitivity value of S i , and e −d(A,S i ) is a weight function where d(A,S i ) is the Euclidean distance of two assets A and S i . 15. The method of claim 13 , wherein the machine learning algorithm comprises a clustering-based algorithm wherein the sensitivity level of the asset wherein the stored characteristics of known sensitive assets are clustered into subgroups and the sensitivity level of the asset is determined from using a subgroup having characteristics related to the asset. 16. The method of claim 13 , wherein the machine learming comprsies a k nearest neighbor method with at least one distance metric learning algorithm. 17. The method of claim 1 , further comprising allowing intelligently backing up of the asset based on the determined sensitivity level.

Assignees

Inventors

Classifications

  • involving long-term monitoring or reporting · CPC title

  • G06N5/02Primary

    Knowledge representation; Symbolic representation · CPC title

  • Machine learning · CPC title

  • Assessing vulnerabilities and evaluating computer system security · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10984322B2 cover?
Automatically estimating a sensitivity level of an information technology (IT) asset in one aspect may obtain information about an asset. Characteristics of the asset assigned based on the information may be compared with stored characteristics of known sensitive assets. A sensitivity level of the asset may be determined based on the comparing.
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N5/02. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 20 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).