Automatically generating malware definitions using word-level analysis

US11222113B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-11222113-B1
Application numberUS-201916363709-A
CountryUS
Kind codeB1
Filing dateMar 25, 2019
Priority dateMar 25, 2019
Publication dateJan 11, 2022
Grant dateJan 11, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems are provided for automatically generating malware definitions and using generated malware definitions. One example method generally includes receiving information associated with a malicious application and extracting malware strings from the malicious application. The method further includes filtering the malware strings using a set of safe strings to produce filtered strings and scoring the filtered strings to produce string scores by evaluating words of the filtered strings based on word statistics of a set of known malicious words. The method further includes selecting a set of candidate strings from the filtered strings based on the string scores and generating a malware definition for the malicious application based on the set of candidate strings. The method also includes performing one or more security actions to protect against the malicious application, using the malware definition.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for automatically generating and using malware definitions, at least a portion of the method being performed by a computing device comprising one or more processors, the method comprising: receiving information associated with a malicious application; extracting malware strings from the malicious application; filtering the malware strings using a set of safe strings to produce filtered strings; scoring the filtered strings to produce string scores by evaluating words of the filtered strings based on a frequency of appearance of a set of known malicious words; selecting a set of candidate strings from the filtered strings based on the string scores; generating a malware definition for the malicious application based on the set of candidate strings; and performing one or more security actions to protect against the malicious application, using the malware definition. 2. The method of claim 1 , wherein generating the malware definition for the malicious application comprises generating and combining logical rules for enforcement by a security application. 3. The method of claim 1 , further comprising performing a false positive and false negative verification on the malware definition. 4. The method of claim 3 , further comprising publishing the malware definition to a plurality of publication targets. 5. The method of claim 1 , further comprising, prior to scoring the filtered strings, segmenting the filtered strings using a dictionary of malware terms. 6. The method of claim 1 , wherein the set of known malicious words comprises: words derived from a set of known malicious strings; and words derived from at least one existing malware definition. 7. The method of claim 1 , wherein performing the one or more security actions comprises performing malware monitoring on the computing device. 8. The method of claim 1 , wherein segmenting the filtered strings using a dictionary of malware terms comprises: reducing inflected words to their root form. 9. A computing device for automatically generating and using malware definitions, comprising: at least one processor; and a memory coupled to the processor, wherein the processor is configured to: receive information associated with a malicious application; extract malware strings from the malicious application; filter the malware strings using a set of safe strings to produce filtered strings; score the filtered strings to produce string scores by evaluating words of the filtered strings based on a frequency of appearance of a set of known malicious words; select a set of candidate strings from the filtered strings based on the string scores; generate a malware definition for the malicious application based on the set of candidate strings; and perform one or more security actions to protect against the malicious application, using the malware definition. 10. The computing device of claim 9 , wherein the processor is configured to generate the malware definition for the malicious application by generating and combining logical rules for enforcement by a security application. 11. The computing device of claim 9 , wherein the processor is further configured to perform a false positive and false negative verification on the malware definition. 12. The computing device of claim 11 wherein the processor is further configured to publish the malware definition to a plurality of publication targets. 13. The computing device of claim 9 , wherein the processor is further configured to, prior to scoring the filtered strings, segment the filtered strings using a dictionary of malware terms. 14. The computing device of claim 9 , wherein the set of known malicious words comprises: words derived from a set of known malicious strings; and words derived from at least one existing malware definition. 15. The computing device of claim 9 , wherein the processor is configured to perform the one or more security actions by performing malware monitoring. 16. The computing device of claim 9 , wherein the processor is further configured to: reduce inflected words to their root form. 17. A non-transitory computer-readable medium having instructions stored thereon which, when executed by at least one processor of a computing device, perform operations for automatically generating and using malware definitions, the operations comprising: receiving information associated with a malicious application; extracting malware strings from the malicious application; filtering the malware strings using a set of safe strings to produce filtered strings; scoring the filtered strings to produce string scores by evaluating words of the filtered strings based on a frequency of appearance of a set of known malicious words; selecting a set of candidate strings from the filtered strings based on the string scores; generating a malware definition for the malicious application based on the set of candidate strings; and performing one or more security actions to protect against the malicious application, using the malware definition. 18. The computer-readable medium of claim 17 , wherein generating the malware definition for the malicious application comprises generating and combining logical rules for enforcement by a security application. 19. The computer-readable medium of claim 17 , wherein the operations further comprise: prior to scoring the filtered strings, segmenting the filtered strings using a dictionary of malware terms; performing a false positive and false negative verification on the malware definition; and publishing the malware definition to a plurality of publication targets. 20. The computer-readable medium of claim 17 , wherein the set of known malicious words comprises: words derived from a set of known malicious strings; and words derived from at least one existing malware definition.

Assignees

Inventors

Classifications

  • eliminating virus, restoring damaged files · CPC title

  • G06F21/563Primary

    by source code analysis · CPC title

  • using dedicated hardware · CPC title

  • Test or assess software · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11222113B1 cover?
Methods and systems are provided for automatically generating malware definitions and using generated malware definitions. One example method generally includes receiving information associated with a malicious application and extracting malware strings from the malicious application. The method further includes filtering the malware strings using a set of safe strings to produce filtered strin…
Who is the assignee on this patent?
Ca Inc
What technology area does this patent fall under?
Primary CPC classification G06F21/563. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 11 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).