Clustering approach for detecting DDoS botnets on the cloud from IPFix data

US10129295B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10129295-B2
Application numberUS-201615253586-A
CountryUS
Kind codeB2
Filing dateAug 31, 2016
Priority dateAug 31, 2016
Publication dateNov 13, 2018
Grant dateNov 13, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Use machine learning to train a classifier to classify entities to increase confidence with respect to an entity being part of a distributed denial of service attack. The method includes training a classifier to use a first classification method, to identify probabilities that entities from a set of entities are performing denial of service attacks. The method further includes identifying a subset of entities meeting a threshold probability of performing a denial of service attack. The method further includes using a second classification method, identifying similarity of entities in the subset of entities. The method further includes based on the similarity, classifying individual entities.

First claim

Opening claim text (preview).

What is claimed is: 1. A system configured to train and use a classifier to classify entities to determine whether the entities are part of a distributed denial of service (DDoS) attack, the system comprising: one or more hardware processors; and one or more computer-readable storage devices having stored thereon instructions that are executable by the one or more hardware processors to configure the system to perform at least the following: train a classifier to use a first classification method to identify probabilities that entities are performing denial of service attacks, the training comprising applying a captured dataset including data flow protocol information associated with known DDoS attacks; using the trained classifier, identify a subset of entities from a set of candidate entities that meet or exceed a threshold probability of performing a denial of service attack; using a second classification method, identify similarity of entities in the identified subset of entities; and based on the similarity, classify individual entities of the subset of entities as belonging to one or more similarity subgroups, each similarity subgroup comprising entities having a probability of participating in a same DDoS. 2. The system of claim 1 , wherein the second classification method clusters similar entities into similarity clusters. 3. The system of claim 2 , wherein the one or more computer-readable storage devices further have stored thereon instructions that are executable by the one or more hardware processors to configure the computer system to identify a cluster as a set of compromised entities. 4. The system of claim 1 , wherein classifying individual entities comprises identifying entities as performing denial of service. 5. The system of claim 1 , wherein the one or more computer-readable storage devices further have stored thereon instructions that are executable by the one or more hardware processors to configure the computer system to identify entities in a particular botnet based on similarity. 6. The system of claim 1 , wherein the one or more computer-readable storage devices further have stored thereon instructions that are executable by the one or more processors to configure the computer system to identify entities infected by the same means based on similarity. 7. The system of claim 6 , wherein the same means comprises the same malicious software. 8. The system of claim 6 , wherein the same means comprises the same command and control. 9. The system of claim 1 , wherein using the second classification method, to identify similarity of entities in the subset of entities comprises using the L-method. 10. The system of claim 1 , wherein using the second classification method, to identify similarity of entities in the subset of entities comprises using hierarchal clustering. 11. The system of claim 1 , wherein the one or more computer-readable storage devices further have stored thereon instructions that are executable by the one or more hardware processors to configure the computer system to correlate entity activity and wherein using hierarchal clustering is based on correlated entity activity. 12. The system of claim 1 , wherein the one or more computer-readable storage devices further have stored thereon instructions that are executable by the one or more hardware processors to configure the computer system to use available external data to identify a particular botnet. 13. A computer implemented method for training a classifier for classifying entities to determine whether the entities are part of a distributed denial of service (DDoS) attack, the method comprising: training a classifier to use a first classification method to identify probabilities that entities are performing denial of service attacks, the training comprising applying a captured dataset including data flow protocol information associated with known DDoS attacks; using the trained classifier, identifying a subset of entities from a set of candidate entities that meet or exceed a threshold probability of performing a denial of service attack; using a second classification method, identifying similarity of entities in the subset of identified entities; and based on the similarity, classifying individual entities of the subset of entities as belonging to one or more similarity subgroups, each similarity subgroup comprising entities having a probability of participating in a same DDoS. 14. The method of claim 13 , wherein the second classification method clusters similar entities into similarity clusters. 15. The method of claim 14 , further comprising identifying a cluster as a set of compromised entities. 16. The method of claim 13 , wherein classifying individual entities comprises identifying entities as performing denial of service. 17. The method of claim 13 , further comprising identifying entities in a particular botnet based on similarity. 18. The method of claim 13 , further comprising identifying entities infected by the same means based on similarity. 19. The method of claim 18 , wherein the same means comprises the same malicious software. 20. A computer system configured to use a trained classifier to classify entities to determine whether the entities are part of a distributed denial of service (DDoS) attack, the system comprising: a botnet classifier coupled to a plurality of computing entities, the botnet classifier comprising one or more computer processors, wherein the botnet classifier is configured to: capture data flow protocol information from the entities in the plurality of entities; provide the captured data flow protocol information from the entities to a trained classifier, the trained classifier having been trained by applying previously captured data including data flow protocol information associated with known DDoS attacks; the trained classifier implementing a first classification method to identify probabilities that entities are performing denial of service attacks based on the captured data flow protocol information; identify a subset of entities from a set of candidate entities that meet or exceed a threshold probability of performing a denial of service attack; use a second classification method, identify similarity of entities in the identified subset of entities; and based on the similarity, classify individual entities of the subset of entities as belonging to one or more similarity subgroups, each similarity subgroup comprising entities having a probability of participating in a same DDoS.

Assignees

Inventors

Classifications

  • Traffic logging, e.g. anomaly detection · CPC title

  • Machine learning · CPC title

  • Denial of Service · CPC title

  • Clustering; Classification · CPC title

  • Active attacks involving interception, injection, modification, spoofing of data unit addresses, e.g. hijacking, packet injection or TCP sequence number attacks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10129295B2 cover?
Use machine learning to train a classifier to classify entities to increase confidence with respect to an entity being part of a distributed denial of service attack. The method includes training a classifier to use a first classification method, to identify probabilities that entities from a set of entities are performing denial of service attacks. The method further includes identifying a sub…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification H04L63/1458. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Nov 13 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).