Machine Learning Model Score Obfuscation Using Step Function, Position-Dependent Noise

US2020349400A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2020349400-A1
Application numberUS-201916399677-A
CountryUS
Kind codeA1
Filing dateApr 30, 2019
Priority dateApr 30, 2019
Publication dateNov 5, 2020
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An artefact is received. Features are extracted from this artefact which are, in turn, used to populate a vector. The vector is then input into a classification model to generate a score. The score is then modified using a step function so that the true score is not obfuscated. Thereafter, the modified score can be provided to a consuming application or process. Related apparatus, systems, techniques and articles are also described.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method comprising: receiving an artefact; extracting features from the artefact and populating a vector; inputting the vector into a classification model to generate a score; modifying the score using a step function; and providing the modified score to a consuming application or process. 2 . The method of claim 1 further comprising reducing features in the vector prior to the inputting into the classification model. 3 . The method of claim 2 , wherein the features are reduced using random projection matrices. 4 . The method of claim 2 , wherein the features are reduced using principal component analysis. 5 . The method of claim 1 , wherein the classification model is a machine learning model trained using a training data set and providing a continuous scale output. 6 . The method of claim 1 , wherein the classification model characterizes the artefact as being malicious or benign to access, execute, or continue to execute. 7 . The method of claim 6 further comprising: preventing access or execution of the artefact when the classification model characterizes the artefact as being malicious. 8 . The method of claim 1 , wherein the machine learning model comprises one or more of: a logistic regression model, a neural network, a concurrent neural network, a recurrent neural network, a generative adversarial network, a support vector machine, a random forest, or a Bayesian model. 9 . The method of claim 1 , wherein the step function applies position-dependent noise to the score. 10 . A system comprising: at least one data processor; and memory storing instructions which, when executed by the at least one data processor, result in operations comprising: receiving an artefact; extracting features from the artefact and populating a vector; inputting the vector into a classification model to generate a score; modifying the score using a step function; and providing the modified score to a consuming application or process. 11 . The system of claim 10 , wherein the operations further comprising: reducing features in the vector prior to the inputting into the classification model. 12 . The system of claim 11 , wherein the features are reduced using random projection matrices. 13 . The system of claim 11 , wherein the features are reduced using principal component analysis. 14 . The system of claim 10 , wherein the classification model is a machine learning model trained using a training data set and providing a continuous scale output. 15 . The system of claim 10 , wherein the classification model characterizes the artefact as being malicious or benign to access, execute, or continue to execute. 16 . The system of claim 15 , wherein the operations further comprise: preventing access or execution of the artefact when the classification model characterizes the artefact as being malicious. 17 . The system of claim 10 , wherein the machine learning model comprises one or more of: a logistic regression model, a neural network, a concurrent neural network, a recurrent neural network, a generative adversarial network, a support vector machine, a random forest, or a Bayesian model. 18 . The system of claim 10 , wherein the step function applies position-dependent noise to the score. 19 . A computer-implemented method comprising: receiving a file; extracting features from the file and populating a vector; inputting the vector into a classification model to generate a score, the classification model being a machine learning model trained to characterize a likelihood of the file as being malicious; obfuscating the score using a step function; and providing the modified score to a consuming application or process. 20 . The method of claim 19 further comprising: preventing access or execution of the artefact when the classification model characterizes the artefact as being malicious.

Assignees

Inventors

Classifications

  • G06F21/562Primary

    Static detection · CPC title

  • based on the proximity to a decision surface, e.g. support vector machines · CPC title

  • based on approximation criteria, e.g. principal component analysis · CPC title

  • Classification techniques · CPC title

  • Generative networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020349400A1 cover?
An artefact is received. Features are extracted from this artefact which are, in turn, used to populate a vector. The vector is then input into a classification model to generate a score. The score is then modified using a step function so that the true score is not obfuscated. Thereafter, the modified score can be provided to a consuming application or process. Related apparatus, systems, tech…
Who is the assignee on this patent?
Cylance Inc
What technology area does this patent fall under?
Primary CPC classification G06F21/562. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 05 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).