Method of application malware detection based on dynamic api extraction, and readable medium and apparatus for performing the method

US2020344261A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2020344261-A1
Application numberUS-201916515723-A
CountryUS
Kind codeA1
Filing dateJul 18, 2019
Priority dateApr 25, 2019
Publication dateOct 29, 2020
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided is a method of application malware detection based on dynamic Application Programming Interface (API) extraction, and a readable medium and an apparatus for performing the same. The method of application malware detection based on dynamic API extraction includes generating an API classifier which classifies an input API as malicious or benign using API used in a sample application classified as malicious application apps or benign application apps, and inputting a pre-stored target API into the API classifier to classify the target API as malicious or benign.

First claim

Opening claim text (preview).

What is claimed is: 1 . A malware detection method of a malware detection apparatus based on dynamic Application Programming Interface (API) extraction, the method comprising: extracting an API from a sample application which is classified as malicious application apps or benign application apps, and generating an API list; extracting the API from the API list according to a frequency of occurrence of each API included in the API list in the malicious application apps or the benign application apps, and generating a training dataset; learning a machine learning algorithm with the training dataset and generating an API classifier which classifies an input API as malicious or benign; and inputting a pre-stored target API into the API classifier to classify the target API as malicious or benign. 2 . The malware detection method of claim 1 , wherein the step of the extracting the API from the API list according to the frequency of occurrence of each API included in the API list in the malicious application apps or the benign application apps and generating the training dataset comprises: measuring the frequency of occurrence of each API included in the API list in the malicious application apps or the benign application apps in reply to whether the sample application from which each API included in the API list is extracted is classified as the malicious application apps or the benign application apps, respectively; calculating a Mutual Information (MI) value between the frequency of occurrence of each API included in the API list in the malicious application apps and the frequency of occurrence in the benign application apps; and extracting the API that constitutes the training dataset from the API list on the basis of the MI value of each API included in the API list. 3 . The malware detection method of claim 1 , further comprising: accessing a reference site of an Android application, and collecting feature information including a package name, a class name, an API name, and description for each API provided at the site; and storing the feature information for each API in a hierarchy to build an API database. 4 . The malware detection method of claim 3 , wherein the step of the extracting the API from the API list according to the frequency of occurrence of each API included in the API list in the malicious application apps or the benign application apps and generating the training dataset comprises: labeling the API extracted from the API list as malicious or benign; acquiring, from the API database, the feature information corresponding to the API extracted from the API list; and generating the training dataset including the malicious or benign label and the feature information for each API extracted from the API list. 5 . The malware detection method of claim 4 , wherein the step of the inputting the pre-stored target API into the API classifier to classify the target API as malicious or benign comprises: acquiring the feature information corresponding to the target API from the API database; and inputting the feature information corresponding to the target API into the API classifier to classify as malicious or benign. 6 . A non-transitory computer-readable recording medium having recorded thereon a computer program for performing the method of application malware detection based on dynamic Application Programming Interface (API) extraction according to claim 1 . 7 . An apparatus for malware detection application based on dynamic Application Programming Interface (API) extraction, the apparatus comprising: a processor and a memory; an API extraction unit which extracts an API used in a sample application classified as malicious application apps or benign application apps and generates an API list; a training dataset generation unit which extracts the API from the API list according to a frequency of occurrence of each API included in the API list in the malicious application apps or the benign application apps and generates a training dataset; and an API classifying unit which generates an API classifier that learns a machine learning algorithm with the training dataset and classifies an input API as malicious or benign, and inputs a pre-stored target API into the API classifier to classify the target API as malicious or benign. 8 . The apparatus for malware detection application based on dynamic API extraction according to claim 7 , wherein the training dataset generation unit comprises: an API counter unit which measures the frequency of occurrence of each API included in the API list in the malicious application apps or the benign application apps in reply to whether the sample application from which each API included in the API list is extracted is classified as the malicious application apps or the benign application apps, respectively; and a Mutual Information (MI) vector generation unit which calculates an MI value between the frequency of occurrence of each AP included in the API list in the malicious application apps and the frequency of occurrence in the benign application apps, and extracts the API that constitutes the training dataset from the API list on the basis of the MI value of each API included in the API list. 9 . The apparatus for malware detection application based on dynamic API extraction according to claim 7 , further comprising: an API metadata collection unit which accesses a reference site of an Android application, collects feature information including a package name, a class name, an API name and description for each API provided at the site, and stores the feature information for each API in a hierarchy to build an API database. 10 . The apparatus for malware detection application based on dynamic API extraction according to claim 9 , wherein the API classifying unit acquires the feature information corresponding to the target API from the API database, and inputs the feature information corresponding to the target API into the API classifier to classify as malicious or benign.

Assignees

Inventors

Classifications

  • service impersonation, e.g. phishing, pharming or web spoofing (detection of rogue wireless access points H04W12/12) · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • Tree-organised classifiers · CPC title

  • characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020344261A1 cover?
Provided is a method of application malware detection based on dynamic Application Programming Interface (API) extraction, and a readable medium and an apparatus for performing the same. The method of application malware detection based on dynamic API extraction includes generating an API classifier which classifies an input API as malicious or benign using API used in a sample application clas…
Who is the assignee on this patent?
Foundation Soongsil Univ Industry Cooperation
What technology area does this patent fall under?
Primary CPC classification H04L63/1483. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Oct 29 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).