Machine learning-based anomaly detection for human presence verification

US11461441B2 · US · B2

Patent metadata
Publication number: US-11461441-B2
Application number: US-201916401616-A
Country: US
Kind code: B2
Filing date: May 2, 2019
Priority date: May 2, 2019
Publication date: Oct 4, 2022
Grant date: Oct 4, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are provided for machine learning-based anomaly detection in a monitored location. One method comprises obtaining data from multiple data sources associated with a monitored location for storage into a data repository; processing the data to generate substantially continuous time-series data for multiple distinct features within the data; applying the substantially continuous time-series data for the distinct features to a machine learning baseline behavioral model to obtain a probability distribution representing a behavior of the monitored location over time; and evaluating a probability score generated by the machine learning baseline behavioral model to identify an anomaly at the monitored location. The machine learning baseline behavioral model is trained, for example, to identify anomalies in correlations between the plurality of distinct features at each timestamp. A presence verification is optionally provided based on a deviation from the machine learning baseline behavioral model at the monitored location.
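As a rough illustration of the scoring idea in the abstract, the sketch below fits a two-feature Gaussian baseline and flags a timestamp whose log-density falls below a threshold. It is a minimal sketch under stated assumptions, not the patented method: the Gaussian form, the feature names (motion events and connected Wi-Fi devices), the sample values, and the threshold are all hypothetical choices made for illustration.

```python
import math
from statistics import mean

def fit_baseline(vectors):
    """Fit the mean and covariance of a 2-feature baseline from (x, y) vectors."""
    xs = [v[0] for v in vectors]
    ys = [v[1] for v in vectors]
    mx, my = mean(xs), mean(ys)
    n = len(vectors)
    sxx = sum((x - mx) ** 2 for x in xs) / (n - 1)
    syy = sum((y - my) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in vectors) / (n - 1)
    return (mx, my), (sxx, sxy, syy)

def log_density(vector, model):
    """Log of the bivariate Gaussian density at `vector` (the probability score)."""
    (mx, my), (sxx, sxy, syy) = model
    det = sxx * syy - sxy ** 2
    dx, dy = vector[0] - mx, vector[1] - my
    # Squared Mahalanobis distance via the closed-form inverse of a 2x2 covariance.
    m2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
    return -0.5 * m2 - math.log(2 * math.pi) - 0.5 * math.log(det)

def is_anomaly(vector, model, threshold):
    """Flag a timestamp whose score falls below the predefined threshold."""
    return log_density(vector, model) < threshold

# Hypothetical baseline: (motion events, connected Wi-Fi devices) per timestamp.
BASELINE = [(0, 1), (1, 1), (2, 3), (3, 3), (4, 5), (5, 5), (1, 2), (4, 4)]
THRESHOLD = -10.0  # hypothetical cut-off on the log-density

model = fit_baseline(BASELINE)
print(is_anomaly((2, 3), model, THRESHOLD))  # False: consistent with the baseline
print(is_anomaly((5, 0), model, THRESHOLD))  # True: breaks the learned correlation
```

In this toy baseline the two features rise and fall together, so high motion with no connected devices scores poorly because the pair contradicts the learned correlation between the features.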

First claim

Opening claim text (preview).

What is claimed is:

1. A method, comprising: obtaining data from a plurality of data sources associated with a monitored physical location for storage into a data repository; processing the data to generate substantially continuous time-series data for a plurality of distinct features within the data; applying the substantially continuous time-series data for the plurality of distinct features to at least one machine learning baseline behavioral model to obtain a probability distribution representing a behavior of the monitored physical location over time, wherein the at least one machine learning baseline behavioral model is trained to learn a baseline behavior comprising one or more expected times of at least one expected occupant at the monitored physical location, wherein an unexpected occupant at the monitored physical location at a given time is identified based on a deviation of the unexpected occupant at the monitored physical location at the given time from the learned one or more expected times of the at least one expected occupant at the monitored physical location in the at least one machine learning baseline behavioral model, and wherein the probability distribution comprises a multi-dimensional probability distribution representing one or more human properties, wherein the multi-dimensional probability distribution takes into account (i) a temporal pattern behavior of each of the plurality of distinct features related to the one or more human properties and (ii) temporal correlations between feature values of at least two of the plurality of distinct features, related to the one or more human properties, at each timestamp, wherein the at least one machine learning baseline behavioral model is further trained to treat a presence of a given expected occupant at the monitored physical location at a different time than the one or more expected times for the given expected occupant as a non-anomalous event; and evaluating a probability score generated by the at least one machine learning baseline behavioral model to identify an anomaly at the monitored physical location; wherein the method is performed by at least one processing device comprising a processor coupled to a memory.

2. The method of claim 1, wherein the plurality of data sources comprises one or more of sensor devices at the monitored location, physiological sensor devices for one or more humans at the monitored location, a network device associated with the monitored location and a smart appliance at the monitored location.

3. The method of claim 1, wherein the substantially continuous time-series data for the plurality of distinct features is applied to the at least one machine learning baseline behavioral model as a data vector with a value for each distinct feature for a given timestamp.

4. The method of claim 1, wherein the step of evaluating the probability score further comprises comparing the probability score to one or more predefined thresholds.

5. The method of claim 1, wherein the step of evaluating the probability score further comprises the step of evaluating the probability score for each of the distinct features.

6. The method of claim 1, wherein the processing step further comprises applying at least one function to the data to obtain a plurality of time-series counters for the plurality of distinct features within the data.

7. The method of claim 1, wherein the applying the substantially continuous time-series data for the plurality of distinct features to the at least one machine learning baseline behavioral model comprises applying a difference, between a predicted value of a given feature by a given machine learning baseline behavioral model and a measured value of the given feature, to an aggregate model.

8. The method of claim 1, wherein the temporal pattern behavior of each of the plurality of distinct features is used to identify at least one anomaly in one or more of a given distinct feature and a plurality of the distinct features.

9. The method of claim 1, wherein the at least one machine learning baseline behavioral model comprises a different machine learning model for each of the plurality of distinct features within the data and an additional aggregated machine learning model that aggregates an output of each of the different machine learning models for each of the plurality of distinct features.

10. A computer program product, comprising a tangible machine-readable storage medium having encoded therein executable code of one or more software programs, wherein the one or more software programs when executed by at least one processing device perform the following steps: obtaining data from a plurality of data sources associated with a monitored physical location for storage into a data repository; processing the data to generate substantially continuous time-series data for a plurality of distinct features within the data; applying the substantially continuous time-series data for the plurality of distinct features to at least one machine learning baseline behavioral model to obtain a probability distribution representing a behavior of the monitored physical location over time, wherein the at least one machine learning baseline behavioral model is trained to learn a baseline behavior comprising one or more expected times of at least one expected occupant at the monitored physical location, wherein an unexpected occupant at the monitored physical location at a given time is identified based on a deviation of the unexpected occupant at the monitored physical location at the given time from the learned one or more expected times of the at least one expected occupant at the monitored physical location in the at least one machine learning baseline behavioral model, and wherein the probability distribution comprises a multi-dimensional probability distribution representing one or more human properties, wherein the multi-dimensional probability distribution takes into account (i) a temporal pattern behavior of each of the plurality of distinct features related to the one or more human properties and (ii) temporal correlations between feature values of at least two of the plurality of distinct features, related to the one or more human properties, at each timestamp, wherein the at least one machine learning baseline behavioral model is further trained to treat a presence of a given expected occupant at the monitored physical location at a different time than the one or more expected times for the given expected occupant as a non-anomalous event; and evaluating a probability score generated by the at least one machine learning baseline behavioral model to identify an anomaly at the monitored physical location.

11. The computer program product of claim 10, wherein the substantially continuous time-series data for the plurality of distinct features is applied to the at least one machine learning baseline behavioral model as a data vector with a value for each distinct feature for a given timestamp.

12. The computer program product of claim 10, wherein the step of evaluating the probability score further comprises the step of evaluating the probability score for each of the distinct features.

13. The computer program product of claim 10, wherein the applying the substantially continuous time-series data for the plurality of distinct features to the at least one machine learning baseline behavioral model comprises applying a difference, between a predicted value of a given feature by a given machine learning baseline behavioral model and a measured value of the given feature, to an aggregate model.

14. The computer program product of claim 10, whe
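The arrangement recited in claims 7 and 9 (one model per feature, with prediction-versus-measurement differences fed to an aggregate model) could be sketched roughly as follows. This is only an illustrative sketch: the hour-of-day mean as the per-feature "model", the sum of squared standardized residuals as the "aggregate model", the spread floor, the feature names, and the threshold are all assumptions, and the sketch does not attempt the expected-occupant handling recited in claim 1.

```python
from collections import defaultdict
from statistics import mean, pstdev

MIN_SPREAD = 0.5  # hypothetical floor so constant features do not divide by zero

class HourlyFeatureModel:
    """Hypothetical per-feature baseline: mean and spread per hour of day."""

    def __init__(self):
        self.stats = {}

    def fit(self, samples):
        # samples: iterable of (hour_of_day, measured_value) pairs
        by_hour = defaultdict(list)
        for hour, value in samples:
            by_hour[hour].append(value)
        self.stats = {
            hour: (mean(values), max(pstdev(values), MIN_SPREAD))
            for hour, values in by_hour.items()
        }
        return self

    def residual(self, hour, value):
        # Difference between the measured value and the predicted (mean) value,
        # scaled by the spread learned for that hour.
        predicted, spread = self.stats[hour]
        return (value - predicted) / spread

def aggregate_score(models, hour, vector):
    """Stand-in aggregate model: sum of squared residuals across features."""
    return sum(models[name].residual(hour, value) ** 2
               for name, value in vector.items())

# Hypothetical history: (hour, value) samples for each feature.
history = {
    "motion_events": [(3, 0), (3, 0), (3, 1), (3, 0),
                      (19, 8), (19, 10), (19, 9), (19, 11)],
    "wifi_devices": [(3, 1), (3, 1), (3, 1), (3, 1),
                     (19, 3), (19, 4), (19, 3), (19, 4)],
}
models = {name: HourlyFeatureModel().fit(samples)
          for name, samples in history.items()}

THRESHOLD = 9.0  # hypothetical predefined threshold (compare claim 4)
vector = {"motion_events": 9, "wifi_devices": 3}  # one value per feature (claim 3)

evening_score = aggregate_score(models, 19, vector)
night_score = aggregate_score(models, 3, vector)
print(evening_score > THRESHOLD)  # False: typical evening activity
print(night_score > THRESHOLD)    # True: the same activity at 3 a.m. deviates
```

The same data vector is unremarkable at 19:00 and strongly anomalous at 03:00, which is the sense in which the baseline captures expected times rather than absolute feature values.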

Assignees

EMC IP Holding Co LLC

Inventors

Classifications

  • Classification, e.g. identification · CPC title

  • of input or preprocessed data · CPC title

  • Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands · CPC title

  • using classification, e.g. of video objects · CPC title

  • G06N20/00 · Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11461441B2 cover?
Techniques are provided for machine learning-based anomaly detection in a monitored location. One method comprises obtaining data from multiple data sources associated with a monitored location for storage into a data repository; processing the data to generate substantially continuous time-series data for multiple distinct features within the data; applying the substantially continuous time-se…
Who is the assignee on this patent?
EMC IP Holding Co LLC
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date: Oct 4, 2022 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).