Augmented machine learning malware detection based on static and dynamic analysis

US11977633B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11977633-B2
Application numberUS-202117646130-A
CountryUS
Kind codeB2
Filing dateDec 27, 2021
Priority dateDec 27, 2021
Publication dateMay 7, 2024
Grant dateMay 7, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for malware detection uses static and dynamic analysis to augment a machine learning model. At the training step, static and dynamic features are extracted from training datasets and used to train a malware classification model. The malware classification model is used to classify unknown files based on verdicts from both static and dynamic models.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer implemented method for adjusted machine-learning malware detection based on static and dynamic analysis of a file, the method executed on a processor of a target computing system, the method comprising: classifying the file with a static analysis machine learning model based on static features extracted from the file before execution, wherein classifying the file comprises obtaining a first verdict with a rate of conformity to a class of files and a rate of a predicted dynamic feature; executing the file on the target computing system; collecting data related to file execution on the target computing system; extracting the predicted dynamic feature from the collected data; classifying the file with a dynamic analysis machine learning model based on the predicted dynamic feature, wherein classifying the file comprises obtaining a second verdict with a rate of conformity to a class of files and wherein the second verdict is adjusted with the rate of the predicted dynamic feature; and classifying the file with a malware classification machine learning model based on the first verdict of the static analysis machine learning model and the second verdict of the dynamic analysis machine learning model. 2. The method of claim 1 , further comprising the step of processing a malware classification verdict at an endpoint protection agent to detect malware, wherein the step of processing occurs after classifying the file with a malware classification machine learning model. 3. The method of claim 2 wherein the target computing system comprises a sandbox for securely executing the file. 4. The method of claim 2 , further comprising the step of performing a detection response action at the endpoint protection agent to counter the malware. 5. The method of claim 4 wherein the target computing system comprises a sandbox for securely executing the file. 6. A system for adjusted machine learning malware detection comprising: a processor coupled to a storage device configured for training and storing a plurality of machine learning models; a static analysis machine learning model configured to extract static features from a file before execution and to classify the file based on extracted static features, wherein the file classification comprises a first verdict with a rate of conformity to at least one class of files and a rate of a predicted dynamic feature; a target computing system for executing the file; collected data related to the file's execution on the target computing system; dynamic features from the collected data extracted in accordance to a predicted dynamic feature list comprising the predicted dynamic feature; a dynamic analysis machine learning model configured to classify the file based on the extracted dynamic features, wherein the file classification results in a second verdict comprising a rate of conformity to a class of files and adjusted with the rate of the predicted dynamic feature; and a malware classification machine learning model configured to classify the file based on the first verdict of the static analysis machine learning model and the second verdict of the dynamic analysis machine learning model. 7. The system of claim 6 , wherein the target computing system for running the file comprises a sandbox for securely executing the file. 8. A computer implemented method for adjusted machine-learning malware detection based on static and dynamic analysis of the file, the method executed on a processor of a target computing system, the method comprising: classifying a file with a static analysis machine learning model based on static features extracted from the file before execution, wherein the classification results in a first verdict comprising a rate of conformity to a class of files and a rate of first and second predicted dynamic features; executing the file on the target computing system; collecting data related to file execution on the target computing system; extracting dynamic features from collected data in accordance with a predicted dynamic feature list to create a first dynamic features dataset; wherein the predicted dynamic feature list comprises first and second predicted dynamic features; wherein the first predicted dynamic feature has a first weight and the second predicted dynamic feature has a second weight; extracting dynamic features from collected data to create a second dynamic features dataset; training a first dynamic analysis machine learning model using the first dynamic features dataset and training a second dynamic analysis machine learning model using the second dynamic features dataset; classifying the file with the first dynamic analysis machine learning model, wherein the classification results in a second verdict adjusted with the rate of the first predicted dynamic feature; classifying the file with the second dynamic analysis machine learning model, wherein the classification results in a third verdict adjusted with the rate of the second predicted dynamic feature; and classifying the file with a malware classification machine learning model based on the first and second and third verdicts. 9. The method of claim 8 , wherein the the step of classifying the file with a malware classification machine learning model further comprises ranking the verdicts of the first and second dynamic analysis machine learning models with the rate of the predicted dynamic feature. 10. The method of claim 8 , wherein the first verdict includes a rate of a plurality of predicted dynamic features.

Assignees

Inventors

Classifications

  • G06F21/565Primary

    by checking file integrity · CPC title

  • by executing in a restricted environment, e.g. sandbox or secure virtual machine · CPC title

  • G06F21/566Primary

    Dynamic detection, i.e. detection performed at run-time, e.g. emulation, suspicious activities · CPC title

  • Knowledge engineering; Knowledge acquisition · CPC title

  • Test or assess a computer or a system · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11977633B2 cover?
A system and method for malware detection uses static and dynamic analysis to augment a machine learning model. At the training step, static and dynamic features are extracted from training datasets and used to train a malware classification model. The malware classification model is used to classify unknown files based on verdicts from both static and dynamic models.
Who is the assignee on this patent?
Acronis Int Gmbh
What technology area does this patent fall under?
Primary CPC classification G06F21/565. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 07 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).