System and method for assigning performance defects to software processing components

US12505377B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12505377-B2
Application numberUS-202117236892-A
CountryUS
Kind codeB2
Filing dateApr 21, 2021
Priority dateApr 21, 2021
Publication dateDec 23, 2025
Grant dateDec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure relate to a method, a system, and a non-transitory machine-readable medium for assigning performance defects to software processing components. Provided is a method that receives performance data of a set of performance runs of a software application across a set of software processing components of a file software system, where the performance data comprises multiple labeled performance features that are associated with the set of software processing components; receives, from a software processing component expert, a user-selection of a subset of labeled performance features of the several labeled performance features; and trains a machine learning (ML) model to determine whether one or more of the set of software processing components is associated with performance regressions of the software application, using the subset of labeled performance features of the performance data as training data.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method comprising: performing a set of performance runs of a software application across a set of software processing components of a file system, wherein the set of performance runs comprises a performance regression; receiving performance data of the set of performance runs, wherein the performance data comprises a plurality of labeled performance features that are associated with the set of software processing components; receiving, from a user device, a user-selection of a subset of labeled performance features of the plurality of labeled performance features via a user interface displayed on the user device, wherein the user interface includes the plurality of labeled performance features; and training a machine learning (ML) model to determine whether one or more of the set of software processing components of the file system is associated with performance defects of the software application, using the subset of labeled performance features of the performance data as training data, and assign the one or more of the set of software processing components of the file system to one or more users when the ML model determines that the one or more of the set of software processing components of the file system is associated with the performance defects. 2 . The method of claim 1 , wherein at least one of the set of performance runs of the software application is a performance defect in which one or more software processing components of the set of software processing components performed below a performance threshold. 3 . The method of claim 2 , wherein the subset of labeled performance features are associated with a subset of the one or more software processing components that are responsible for the performance defect. 4 . The method of claim 1 further comprising generating synthetic performance data using the subset of labeled performance features, wherein the training data further comprises the synthetic performance data. 5 . The method of claim 4 , wherein the user-selection of the subset of labeled performance features reduces the performance data used as training data by removing data associated with the unselected labeled performance features, wherein the synthetic performance data includes more data of the subset of labeled performance features than the reduced performance data. 6 . The method of claim 4 further comprising identifying labeled performance features of the subset from a performance run of the set of performance runs that is not a performance defect of the software application, wherein generating the synthetic performance data comprises varying each of the identified labeled performance features from the performance run within a predefined range. 7 . The method of claim 1 , wherein the ML model is one of a decision tree, a random forest, a recurrent neural network (RNN), a long short term memory (LSTM) neural network, and a multi-head attention. 8 . The method of claim 1 , wherein at least one labeled performance feature indicates a usage of a resource of an electronic device by a software processing component during a performance run of the software application. 9 . A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations, the operations comprising: performing a set of performance runs of a software application across a set of software processing components of a file system, wherein the set of performance runs comprises a performance regression; receiving performance data of the set of performance runs, wherein the performance data comprises a plurality of labeled performance features that are associated with the set of software processing components; receiving, from a user device, a user-selection of a subset of the labeled performance features of the plurality of labeled performance features via a user interface displayed on the user device, wherein the user interface includes the plurality of labeled performance features; and training a machine learning (ML) model to determine whether one or more of the set of software processing components of the file system is associated with performance defects of the software application, using the subset of labeled performance features of the performance data as training data, and assign the one or more of the set of software processing components of the file system to one or more users when the ML model determines that the one or more of the set of software processing components of the file system is associated with the performance defects. 10 . The non-transitory machine-readable medium of claim 9 , wherein at least one of the set of performance runs of the software application is a performance defect in which one or more software processing components of the set of software processing components performed below a performance threshold. 11 . The non-transitory machine-readable medium of claim 10 , wherein the subset of labeled performance features are associated with a subset of the one or more software processing components that are responsible for the performance defect. 12 . The non-transitory machine-readable medium of claim 9 , wherein the operations further comprise generating synthetic performance data using the subset of labeled performance features, wherein the training data further comprises the synthetic performance data. 13 . The non-transitory machine-readable medium of claim 12 , wherein the user-selection of the subset of labeled performance features reduces the performance data used as training data by removing data associated with the unselected labeled performance features, wherein the synthetic performance data includes more data of the subset of labeled performance features than the reduced performance data. 14 . The non-transitory machine-readable medium of claim 12 , wherein the operations further comprise: identifying labeled performance features of the subset from a performance run of the set of performance runs that is not a performance defect of the software application, wherein generating the synthetic performance data comprises varying each of the identified labeled performance features from the performance run within a predefined range. 15 . The non-transitory machine-readable medium of claim 9 , wherein the ML model is one of a decision tree, a random forest, a recurrent neural network (RNN), a long short term memory (LSTM) neural network, and a multi-head attention. 16 . The non-transitory machine-readable medium of claim 9 , wherein at least one labeled performance feature indicates a usage of a resource of an electronic device by a software processing component during a performance run of the software application. 17 . A method comprising: receiving performance data of a performance defect of a software application across a set of software processing components of a file system; and determining, using a machine learning (ML) model that has an input based on the performance data of the performance defect, a software processing component of the set of software processing components that has a performance defect, wherein the performance data includes a plurality of labeled performance features that are associated with the set of software processing components, wherein the ML model has been trained, using a subset of the labeled performance features that have been user-selected, via a user interface displayed on a user device, from the plurality of labeled performance features as training data, to determine whether one or

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • by runtime analysis (performance monitoring G06F11/3466) · CPC title

  • Details of monitoring file system events, e.g. by the use of hooks, filter drivers, logs · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • Performance evaluation by modeling · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12505377B2 cover?
Embodiments of the present disclosure relate to a method, a system, and a non-transitory machine-readable medium for assigning performance defects to software processing components. Provided is a method that receives performance data of a set of performance runs of a software application across a set of software processing components of a file software system, where the performance data compris…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).