Provenance-based threat detection tools and stealthy malware detection

US11423146B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11423146-B2
Application numberUS-202016991288-A
CountryUS
Kind codeB2
Filing dateAug 12, 2020
Priority dateAug 27, 2019
Publication dateAug 23, 2022
Grant dateAug 23, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for a provenance based threat detection tool that builds a provenance graph including a plurality of paths using a processor device from provenance data obtained from one or more computer systems and/or networks; samples the provenance graph to form a plurality of linear sample paths, and calculates a regularity score for each of the plurality of linear sample paths using a processor device; selects a subset of linear sample paths from the plurality of linear sample paths based on the regularity score, and embeds each of the subset of linear sample paths by converting each of the subset of linear sample paths into a numerical vector using a processor device; detects anomalies in the embedded paths to identify malicious process activities, and terminates a process related to the embedded path having the identified malicious process activities.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer implemented provenance-based threat detection method, comprising: building a provenance graph including a plurality of paths using a processor device from provenance data obtained from one or more computer systems and/or networks, wherein the provenance graph is built by collecting the provenance data using hook functions that intercept operating system calls; sampling the provenance graph to form a plurality of linear sample paths; calculating a regularity score for each of the plurality of linear sample paths using a processor device; selecting a subset of linear sample paths from the plurality of linear sample paths based on the regularity score; embedding each of the subset of linear sample paths by converting each of the subset of linear sample paths into a numerical vector using a processor device; detecting anomalies in the embedded paths to identify malicious process activities, wherein the anomalies in the embedded paths are detected using an anomaly detection model that is configured to identify malicious activity, wherein the anomaly detection model is selected from the group consisting of one-class support vector machine (OC-SVM) and Local Outlier Factor (LOF); and terminating a process related to the embedded path having the identified malicious process activities. 2. The method as recited in claim 1 , wherein selecting a subset of linear sample paths addresses a dependency explosion problem. 3. The method as recited in claim 1 , wherein the anomaly detection model is trained using a benign training data set. 4. The method as recited in claim 3 , wherein embedding each of the plurality of paths is done using graph2vec or doc2vec. 5. A non-transitory computer readable storage medium comprising a computer readable program for a computer implemented provenance-based threat detection tool, wherein the computer readable program when executed on a computer causes the computer to perform the steps of: building a provenance graph including a plurality of paths using a processor device from provenance data obtained from one or more computer systems and/or networks, wherein the provenance graph is built by collecting the provenance data using hook functions that intercept operating system calls; sampling the provenance graph to form a plurality of linear sample paths; calculating a regularity score for each of the plurality of linear sample paths using a processor device; selecting a subset of linear sample paths from the plurality of linear sample paths based on the regularity score; embedding each of the subset of linear sample paths by converting each of the subset of linear sample paths into a numerical vector using a processor device; detecting anomalies in the embedded paths to identify malicious process activities, wherein the anomalies in the embedded paths are detected using an anomaly detection model that is configured to identify malicious activity, wherein the anomaly detection model is selected from the group consisting of one-class support vector machine (OC-SVM) and Local Outlier Factor (LOF); and terminating a process related to the embedded path having the identified malicious process activities. 6. The method as recited in claim 5 , wherein selecting a subset of linear sample paths addresses a dependency explosion problem. 7. The computer readable program as recited in claim 5 , wherein the anomaly detection model is trained using a benign training data set. 8. The computer readable program as recited in claim 7 , wherein embedding each of the plurality of paths is done using graph2vec or doc2vec. 9. A system for provenance-based threat detection, comprising: a computer system including: random access memory configured to store a provenance-based threat detection tool; one or more processor devices and an operating system having a kernel, wherein one or more hook functions operating in the kernel are configured to collect provenance data using the hook functions that intercept operating system calls; and a database configured to store the provenance data collected by the one or more hook functions, wherein the provenance-based threat detection tool is configured to: build a provenance graph including a plurality of paths using the one or more processor devices from provenance data obtained from the computer systems and/or a network; sample the provenance graph to form a plurality of linear sample paths; calculate a regularity score for each of the plurality of linear sample paths using the one or more processor devices; select a subset of linear sample paths from the plurality of linear sample paths based on the regularity score; embed each of the subset of linear sample paths by converting each of the subset of linear sample paths into a numerical vector using the one or more processor devices; detect anomalies in the embedded paths to identify malicious process activities, wherein the anomalies in the embedded paths are detected using an anomaly detection model that is configured to identify malicious activity, wherein the anomaly detection model is selected from the group consisting of one-class support vector machine (OC-SVM) and Local Outlier Factor (LOF); and terminate a process related to the embedded path having the identified malicious process activities. 10. The system as recited in claim 9 , wherein selecting a subset of linear sample paths addresses a dependency explosion problem. 11. The system as recited in claim 9 , wherein the anomaly detection model is trained using a benign training data set.

Assignees

Inventors

Classifications

  • G06F21/562Primary

    Static detection · CPC title

  • G06F21/566Primary

    Dynamic detection, i.e. detection performed at run-time, e.g. emulation, suspicious activities · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

  • Graphical models, e.g. Bayesian networks · CPC title

  • based on distances to training or reference patterns · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11423146B2 cover?
Systems and methods for a provenance based threat detection tool that builds a provenance graph including a plurality of paths using a processor device from provenance data obtained from one or more computer systems and/or networks; samples the provenance graph to form a plurality of linear sample paths, and calculates a regularity score for each of the plurality of linear sample paths using a …
Who is the assignee on this patent?
Nec Lab America Inc, Nec Corp
What technology area does this patent fall under?
Primary CPC classification G06F21/562. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 23 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).