Method and system for proactive anomaly detection in devices and networks
US-2019036795-A1 · Jan 31, 2019 · US
US11361197B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11361197-B2 |
| Application number | US-201816023110-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 29, 2018 |
| Priority date | Jun 29, 2018 |
| Publication date | Jun 14, 2022 |
| Grant date | Jun 14, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Techniques are provided for anomaly detection in time-series data using state inference and machine learning. An exemplary method comprises: obtaining detected states of a plurality of data samples in temporal data, wherein each data sample in the temporal data has a corresponding detected state; obtaining a likelihood that each of the data samples belongs to the corresponding detected state; obtaining a distribution of likelihoods of the data samples indicating a number of observations of each of a plurality of likelihood values; training, using a supervised learning technique, an anomaly detection model that, given the distribution of likelihoods and one or more anomaly thresholds, generates a quality score for each of the anomaly thresholds; and selecting at least one anomaly threshold based on the quality score, wherein the trained anomaly detection model is applied to detect anomalies in new temporal data samples using the selected at least one anomaly threshold.
Opening claim text (preview).
What is claimed is: 1. A method, comprising: obtaining one or more detected states of a plurality of data samples in temporal data, wherein each of the data samples in said temporal data has a corresponding detected state; obtaining a likelihood that each of said data samples belongs to said corresponding detected state; obtaining a distribution of the likelihoods of the data samples indicating a number of observations of each of a plurality of likelihood values; training, by at least one processing device, using a supervised learning technique, and using a training dataset comprising the distribution of the likelihoods, a plurality of anomaly thresholds, and a corresponding quality score for each anomaly threshold of the plurality of anomaly thresholds indicating a performance of detecting anomalies in the plurality of data samples using the corresponding anomaly threshold, a first machine learning model, wherein training the first machine learning model comprises adjusting one or more parameters of the first machine learning model to predict the corresponding quality score for each combination of: (i) the distribution of the likelihoods and (ii) each of the anomaly thresholds; selecting, by the at least one processing device and using the first machine learning model, at least one anomaly threshold to apply to new temporal data samples; and configuring a second machine learning model for anomaly detection using at least the selected at least one anomaly threshold, wherein the second machine learning model for anomaly detection is applied to detect anomalies in the new temporal data samples. 2. The method of claim 1 , wherein the distribution of the likelihoods is an aggregation of the likelihoods that each data sample belongs to said corresponding detected state. 3. The method of claim 1 , further comprising the step of clustering the data samples from the temporal data into a plurality of clusters using temporal information, wherein each of the plurality of clusters corresponds to one detected state. 4. The method of claim 3 , wherein the likelihood that each of said data samples belongs to said corresponding detected state is obtained from a probability distribution provided by the clustering step. 5. The method of claim 1 , wherein the likelihood comprises a log likelihood. 6. The method of claim 1 , further comprising a testing phase that evaluates a performance of the second machine learning model for anomaly detection on new labeled temporal data samples. 7. The method of claim 1 , wherein the second machine learning model for anomaly detection detects anomalies in the new temporal data samples by comparing one or more of the new temporal data samples to the selected at least one anomaly threshold. 8. A computer program product, comprising a non-transitory processor-readable storage medium having encoded therein executable code of one or more software programs, wherein the one or more software programs when executed by at least one processing device perform the following steps: obtaining one or more detected states of a plurality of data samples in temporal data, wherein each of the data samples in said temporal data has a corresponding detected state; obtaining a likelihood that each of said data samples belongs to said corresponding detected state; obtaining a distribution of the likelihoods of the data samples indicating a number of observations of each of a plurality of likelihood values; training, by at least one processing device, using a supervised learning technique, and using a training dataset comprising the distribution of the likelihoods, a plurality of anomaly thresholds, and a corresponding quality score for each anomaly threshold of the plurality of anomaly thresholds indicating a performance of detecting anomalies in the plurality of data samples using the corresponding anomaly threshold, a first machine learning model, wherein training the first machine learning model comprises adjusting one or more parameters of the first machine learning model to predict the corresponding quality score for each combination of: (i) the distribution of the likelihoods and (ii) each of the anomaly thresholds; selecting, by the at least one processing device and using the first machine learning model, at least one anomaly threshold to apply to new temporal data samples; and configuring a second machine learning model for anomaly detection using at least the selected at least one anomaly threshold, wherein the second machine learning model for anomaly detection is applied to detect anomalies in the new temporal data samples. 9. The computer program product of claim 8 , wherein the distribution of the likelihoods is an aggregation of the likelihoods that each data sample belongs to said corresponding detected state. 10. The computer program product of claim 8 , further comprising the step of clustering the data samples from the temporal data into a plurality of clusters using temporal information, wherein each of the plurality of clusters corresponds to one detected state. 11. The computer program product of claim 8 , wherein the likelihood comprises a log likelihood. 12. The computer program product of claim 8 , further comprising a testing phase that evaluates a performance of the second machine learning model for anomaly detection on new labeled temporal data samples. 13. The computer program product of claim 8 , wherein the second machine learning model for anomaly detection detects anomalies in the new temporal data samples by comparing one or more of the new temporal data samples to the selected at least one anomaly threshold. 14. An apparatus, comprising: a memory; and at least one processing device, coupled to the memory, operative to implement the following steps: obtaining one or more detected states of a plurality of data samples in temporal data, wherein each of the data samples in said temporal data has a corresponding detected state; obtaining a likelihood that each of said data samples belongs to said corresponding detected state; obtaining a distribution of the likelihoods of the data samples indicating a number of observations of each of a plurality of likelihood values; training, by at least one processing device, using a supervised learning technique, and using a training dataset comprising the distribution of the likelihoods, a plurality of anomaly thresholds, and a corresponding quality score for each anomaly threshold of the plurality of anomaly thresholds indicating a performance of detecting anomalies in the plurality of data samples using the corresponding anomaly threshold, a first machine learning model, wherein training the first machine learning model comprises adjusting one or more parameters of the first machine learning model to predict the corresponding quality score for each combination of: (i) the distribution of the likelihoods and (ii) each of the anomaly thresholds; selecting, by the at least one processing device and using the first machine learning model, at least one anomaly threshold to apply to new temporal data samples; and configuring a second machine learning model for anomaly detection using at least the selected at least one anomaly threshold, wherein the second machine learning model for anomaly detection is applied to detect anomalies in the new temporal data samples. 15. The apparatus of claim 14 , wherein the distribution of the likelihoods is an aggregation of the likelihoods that each data sample belongs to said corresponding detected state. 16. The apparatus of claim 14 , further comprising the step of clustering the data sampl
using kernel methods, e.g. support vector machines [SVM] · CPC title
Single-class perspective, e.g. one-against-all classification; Novelty detection; Outlier detection · CPC title
based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate · CPC title
Probabilistic graphical models, e.g. probabilistic networks · CPC title
Clustering techniques · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.