Dynamic application degrouping to optimize machine learning model accuracy

US10318887B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10318887-B2
Application numberUS-201615188140-A
CountryUS
Kind codeB2
Filing dateJun 21, 2016
Priority dateMar 24, 2016
Publication dateJun 11, 2019
Grant dateJun 11, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In one embodiment, a device in a network identifies a plurality of applications from observed traffic in the network. The device forms two or more application clusters from the plurality of applications. Each of the application clusters includes one or more of the applications, and wherein a particular application in the plurality of applications is included in each of the application clusters. The device generates anomaly detection models for each of the application clusters. The device tests the anomaly detection models, to determine a measure of efficacy for each of the models with respect to traffic associated with the particular application. The device selects a particular anomaly detection model to analyze the traffic associated with the particular application based on the measures of efficacy for each of the models.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: identifying, by a device in a network, a plurality of applications from observed traffic in the network; forming, by the device, two or more application clusters from the plurality of applications, wherein each of the application clusters includes one or more of the applications, and wherein a particular application in the plurality of applications is included in each of the application clusters; generating, by the device, anomaly detection models for each of the application clusters; testing, by the device, the anomaly detection models, to determine a measure of efficacy for each of the models with respect to traffic associated with the particular application; and selecting, by the device, a particular anomaly detection model to analyze the traffic associated with the particular application based on the measures of efficacy for each of the models. 2. The method as in claim 1 , wherein testing the anomaly detection models comprises: determining, for each of the anomaly detection models, the measure of efficacy of the model based in part on an estimated set of inputs that the model would deem anomalous. 3. The method as in claim 2 , wherein the selected anomaly detection model is selected based in part on an amount of overlap between the estimated sets of inputs that each of the anomaly detection models would deem anomalous. 4. The method as in claim 3 , wherein the selected anomaly detection model models the traffic associated with the particular application and traffic associated with at least one other application. 5. The method as in claim 1 , wherein testing the anomaly detection models comprises: using the anomaly detection models to detect anomalous traffic; and reporting the detected anomalous traffic to a supervisory device. 6. The method as in claim 5 , wherein the measures of efficacy comprise feedback from the supervisory device regarding the reported anomalous traffic. 7. The method as in claim 1 , further comprising: receiving, at the device, a request from a supervisory device to use a separate anomaly detection model for the traffic associated with the particular application; determining, by the device, whether the device has sufficient resource to execute a separate anomaly detection model for the traffic associated with the particular application; and notifying, by the device, the supervisory device when there are not sufficient resources on the device to execute a separate anomaly detection model for the traffic associated with the particular application. 8. The method as in claim 7 , wherein the request to use a separate anomaly detection model for the traffic associated with the particular application is sent by the supervisory device based in part on a threat intelligence index of compromise. 9. The method as in claim 1 , wherein the anomaly detection models are statistical behavioral models. 10. An apparatus, comprising: one or more network interfaces to communicate with a network; a processor coupled to the network interfaces and configured to execute one or more processes; and a memory configured to store a process executable by the processor, the process when executed configured to: identify a plurality of applications from observed traffic in the network; form two or more application clusters from the plurality of applications, wherein each of the application clusters includes one or more of the applications, and wherein a particular application in the plurality of applications is included in each of the application clusters; generate anomaly detection models for each of the application clusters; test the anomaly detection models, to determine a measure of efficacy for each of the models with respect to traffic associated with the particular application; and select a particular anomaly detection model to analyze the traffic associated with the particular application based on the measures of efficacy for each of the models. 11. The apparatus as in claim 10 , wherein the apparatus tests the anomaly detection models by: determining, for each of the anomaly detection models, the measure of efficacy of the model based in part on an estimated set of inputs that the model would deem anomalous. 12. The apparatus as in claim 11 , wherein the apparatus selects the selected anomaly detection model based in part on an amount of overlap between the estimated sets of inputs that each of the anomaly detection models would deem anomalous. 13. The apparatus as in claim 12 , wherein the selected anomaly detection model models the traffic associated with the particular application and traffic associated with at least one other application. 14. The apparatus as in claim 10 , wherein the apparatus tests the anomaly detection models by: using the anomaly detection models to detect anomalous traffic; and reporting the detected anomalous traffic to a supervisory device. 15. The apparatus as in claim 14 , wherein the measures of efficacy comprise feedback from the supervisory device regarding the reported anomalous traffic. 16. The apparatus as in claim 10 , wherein the process when executed is further operable to: receive a request from a supervisory device to use a separate anomaly detection model for the traffic associated with the particular application; determine whether the apparatus has sufficient resource to execute a separate anomaly detection model for the traffic associated with the particular application; and notify the supervisory device when there are not sufficient resources on the apparatus to execute a separate anomaly detection model for the traffic associated with the particular application. 17. The apparatus as in claim 16 , wherein the request to use a separate anomaly detection model for the traffic associated with the particular application is sent by the supervisory device based in part on a threat intelligence index of compromise. 18. The apparatus as in claim 10 , wherein the anomaly detection models are statistical behavioral models. 19. A tangible, non-transitory, computer-readable medium storing program instructions that cause a device in a network to execute a process comprising: identifying, by the, a plurality of applications from observed traffic in the network; forming, by the device, two or more application clusters from the plurality of applications, wherein each of the application clusters includes one or more of the applications, and wherein a particular application in the plurality of applications is included in each of the application clusters; generating, by the device, anomaly detection models for each of the application clusters; testing, by the device, the anomaly detection models, to determine a measure of efficacy for each of the models with respect to traffic associated with the particular application; and selecting, by the device, a particular anomaly detection model to analyze the traffic associated with the particular application based on the measures of efficacy for each of the models. 20. The tangible, non-transitory, computer-readable medium as in claim 19 , wherein the anomaly detection models are statistical behavioral models.

Assignees

Inventors

Classifications

  • Testing arrangements · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • related to network traffic · CPC title

  • using statistical or mathematical methods · CPC title

  • using software, i.e. software packages (network security related monitoring H04L63/1408) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10318887B2 cover?
In one embodiment, a device in a network identifies a plurality of applications from observed traffic in the network. The device forms two or more application clusters from the plurality of applications. Each of the application clusters includes one or more of the applications, and wherein a particular application in the plurality of applications is included in each of the application clusters.…
Who is the assignee on this patent?
Cisco Tech Inc
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 11 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).