Methods and apparatus to visualize machine learning based malware classification

US12483590B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12483590-B2
Application numberUS-202217714023-A
CountryUS
Kind codeB2
Filing dateApr 5, 2022
Priority dateApr 5, 2021
Publication dateNov 25, 2025
Grant dateNov 25, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, apparatus, systems, and articles of manufacture are disclosed. An example apparatus includes at least one memory, instructions, and processor circuitry to execute the instructions. The processor circuitry executes the instructions to identify a test data distribution, generate a first visualization of the identified test data distribution, select a visualization type for a machine learning model, generate a second visualization including an indication of features extracted from the test data by the machine learning model, and generate a third visualization of results of inference performed by the machine learning model, the inference performed on the test data.

First claim

Opening claim text (preview).

What is claimed is: 1 . An apparatus comprising: at least one memory; machine-readable instructions; and processor circuitry to execute the machine-readable instructions to: identify a distribution associated with test data; generate a first visualization of the identified distribution; generate a recommendation to adjust a machine learning model, the recommendation based on a classification error, the recommendation to be displayed with the first visualization, the recommendation to include an instruction to reduce a malware classification error of the machine learning model; select a visualization type for the machine learning model; generate a second visualization including an indication of features extracted from the test data by the machine learning model, the second visualization generated using gradient weighted class activation mapping t-distributed stochastic neighbor embedding feature projections; and generate a third visualization of results of an inference performed by the machine learning model, the inference performed on the test data, wherein the inference is at least one of a classification of a sample as malware or a classification of the sample as benign. 2 . The apparatus of claim 1 , wherein the processor circuitry is to generate a recommendation to improve inference accuracy based on a comparison of at least two of: the first visualization, the second visualization, or the third visualization. 3 . The apparatus of claim 1 , wherein the processor circuitry is to generate a fourth visualization of a machine learning pipeline, the fourth visualization including the first visualization, the second visualization, and the third visualization. 4 . The apparatus of claim 1 , wherein at least one of the first visualization, the second visualization, and the third visualization includes an indication that a data is an outlier data. 5 . The apparatus of claim 1 , wherein the third visualization includes a receiver operating characteristic curve and an indication of a partial area under the receiver operating characteristic curve. 6 . A non-transitory computer readable medium comprising instructions which, when executed, cause processor circuitry to: identify a distribution associated with test data; generate a first visualization of the identified distribution; generate a recommendation to adjust a machine learning model, the recommendation based on a classification error, the recommendation to be displayed with the first visualization, the recommendation to include an instruction to reduce a malware classification error of the machine learning model; select a visualization type for the machine learning model; generate a second visualization including an indication of features extracted from the test data by the machine learning model, the second visualization generated using t-distributed stochastic neighbor embedding feature projections; and generate a third visualization of results of an inference performed by the machine learning model, the inference performed on the test data, wherein the inference is at least one of a classification of a sample as malware or a classification of the sample as benign. 7 . The non-transitory computer readable medium of claim 6 , wherein the instructions, when executed, cause the processor circuitry to generate a recommendation to improve inference accuracy based on a comparison of at least two of: the first visualization, the second visualization, or the third visualization. 8 . The non-transitory computer readable medium of claim 6 , wherein the instructions, when executed, cause the processor circuitry to generate a fourth visualization of a machine learning pipeline, the fourth visualization including the first visualization, the second visualization, and the third visualization. 9 . The non-transitory computer readable medium of claim 6 , wherein at least one of the first visualization, the second visualization, and the third visualization includes an indication that a data is an outlier data. 10 . The non-transitory computer readable medium of claim 6 , wherein the third visualization includes a receiver operating characteristic curve and an indication of a partial area under the receiver operating characteristic curve. 11 . A method for error analysis in a machine learning pipeline, the method comprising: identifying, by executing an instruction with processor circuitry, a distribution associated with test data; generating, by executing an instruction with the processor circuitry, a first visualization of the identified distribution; generating, by executing an instruction with the processor circuitry, a recommendation to adjust a machine learning model, the recommendation based on a classification error, the recommendation to be displayed with the first visualization, the recommendation to include an instruction to reduce a malware classification error of the machine learning model; selecting, by executing an instruction with processor circuitry, a visualization type for the machine learning model; generating, by executing an instruction with processor circuitry, a second visualization including an indication of features extracted from the test data by the machine learning model, the second visualization generated using t-distributed stochastic neighbor embedding feature projections; and generating, by executing an instruction with processor circuitry, a third visualization of results of an inference performed by the machine learning model, the inference performed on the test data, wherein the inference is at least one of a classification of a sample as malware or a classification of the sample as benign. 12 . The method of claim 11 , further including generating a recommendation to improve inference accuracy based on a comparison of at least two of: the first visualization, the second visualization, and the third visualization. 13 . The method of claim 11 , further including generating a fourth visualization of the machine learning pipeline, the fourth visualization including the first visualization, the second visualization, and the third visualization. 14 . The method of claim 11 , wherein at least one of the first visualization, the second visualization, and the third visualization includes an indication that a data is an outlier data. 15 . The method of claim 11 , wherein the third visualization includes a receiver operating characteristic curve and an indication of a partial area under the receiver operating characteristic curve.

Assignees

Inventors

Classifications

  • by monitoring network traffic (monitoring network traffic per se H04L43/00) · CPC title

  • Filtering policies (mail message filtering H04L51/212) · CPC title

  • H04L41/16Primary

    using machine learning or artificial intelligence · CPC title

  • for managing network security; network security policies in general (filtering policies H04L63/0227) · CPC title

  • H04L63/145Primary

    the attack involving the propagation of malware through the network, e.g. viruses, trojans or worms · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12483590B2 cover?
Methods, apparatus, systems, and articles of manufacture are disclosed. An example apparatus includes at least one memory, instructions, and processor circuitry to execute the instructions. The processor circuitry executes the instructions to identify a test data distribution, generate a first visualization of the identified test data distribution, select a visualization type for a machine lear…
Who is the assignee on this patent?
Mcafee Llc
What technology area does this patent fall under?
Primary CPC classification H04L41/16. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Nov 25 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).