Allocation of shared computing resources using source code feature extraction and machine learning

US11567807B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11567807-B2
Application numberUS-201815941434-A
CountryUS
Kind codeB2
Filing dateMar 30, 2018
Priority dateMar 30, 2018
Publication dateJan 31, 2023
Grant dateJan 31, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are provided for allocation of shared computing resources using source code feature extraction and machine learning techniques. An exemplary method comprises obtaining source code for execution in a shared computing environment; extracting a plurality of discriminative features from the source code; obtaining a trained machine learning model; and generating a prediction of an allocation of one or more resources of the shared computing environment needed to satisfy one or more service level agreement requirements for the source code. The generated prediction is optionally adjusted using a statistical analysis of an error curve, based on one or more error boundaries obtained by the trained machine learning model. The trained machine learning model can be trained using a set of discriminative features extracted from training source code and corresponding measurements of metrics of the service level agreement requirements obtained by executing the training source code on a plurality of the resources of the shared computing environment.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: obtaining source code for execution in a shared computing environment; extracting, using at least one processing device, a plurality of discriminative features from the source code; obtaining, using the at least one processing device, at least one trained machine learning model; and applying, using the at least one processing device, the plurality of extracted discriminative features to the at least one trained machine learning model, wherein the at least one trained machine learning model generates a prediction of an allocation comprising a required amount of one or more resources of the shared computing environment needed to satisfy one or more service level agreement requirements for the source code to be executed in the shared computing environment, and wherein the at least one trained machine learning model is trained using a set of discriminative features extracted from training source code and corresponding measurements of one or more metrics of the service level agreement requirements obtained by executing the training source code on a plurality of the resources of the shared computing environment. 2. The method of claim 1 , further comprising generating one or more error boundaries of the predicted allocation of the one or more resources based on a standard deviation of the at least one machine learning model. 3. The method of claim 1 , wherein the extracting the plurality of discriminative features from the source code employs one or more of natural language processing techniques, pattern-based techniques and worst-case execution time techniques. 4. The method of claim 3 , wherein the natural language processing techniques extract one or more of the discriminative features from the source code based on a term frequency metric. 5. The method of claim 3 , wherein the pattern-based techniques extract one or more of the discriminative features from the source code by identifying one or more patterns in function calls of the source code. 6. The method of claim 3 , wherein the worst-case execution time techniques extract one or more of the discriminative features from the source code by obtaining statistics regarding execution times for each type of task in the source code. 7. The method of claim 1 , further comprising estimating an amount of time required to process the source code until a completion of the execution of the source code. 8. A system, comprising: a memory; and at least one processing device, coupled to the memory, operative to implement the following steps: obtaining source code for execution in a shared computing environment; extracting, using at least one processing device, a plurality of discriminative features from the source code; obtaining, using the at least one processing device, at least one trained machine learning model; and applying, using the at least one processing device, the plurality of extracted discriminative features to the at least one trained machine learning model, wherein the at least one trained machine learning model generates a prediction of an allocation comprising a required amount of one or more resources of the shared computing environment needed to satisfy one or more service level agreement requirements for the source code to be executed in the shared computing environment, and wherein the at least one trained machine learning model is trained using a set of discriminative features extracted from training source code and corresponding measurements of one or more metrics of the service level agreement requirements obtained by executing the training source code on a plurality of the resources of the shared computing environment. 9. The system of claim 8 , further comprising the step of generating one or more error boundaries of the predicted allocation of the one or more resources based on a standard deviation of the at least one machine learning model. 10. The system of claim 8 , wherein the step of extracting the plurality of discriminative features from the source code employs one or more of natural language processing techniques, pattern-based techniques and worst-case execution time techniques. 11. The system of claim 10 , wherein the natural language processing techniques extract one or more of the discriminative features from the source code based on a term frequency metric. 12. The system of claim 10 , wherein the pattern-based techniques extract one or more of the discriminative features from the source code by identifying one or more patterns in function calls of the source code. 13. The system of claim 10 , wherein the worst-case execution time techniques extract one or more of the discriminative features from the source code by obtaining statistics regarding execution times for each type of task in the source code. 14. The system of claim 8 , further comprising estimating an amount of time required to process the source code until a completion of the execution of the source code. 15. A computer program product, comprising a tangible machine-readable storage medium having encoded therein executable code of one or more software programs, wherein the one or more software programs when executed by at least one processing device perform the following steps: obtaining source code for execution in a shared computing environment; extracting, using at least one processing device, a plurality of discriminative features from the source code; obtaining, using the at least one processing device, at least one trained machine learning model; and applying, using the at least one processing device, the plurality of extracted discriminative features to the at least one trained machine learning model, wherein the at least one trained machine learning model generates a prediction of an allocation comprising a required amount of one or more resources of the shared computing environment needed to satisfy one or more service level agreement requirements for the source code to be executed in the shared computing environment, and wherein the at least one trained machine learning model is trained using a set of discriminative features extracted from training source code and corresponding measurements of one or more metrics of the service level agreement requirements obtained by executing the training source code on a plurality of the resources of the shared computing environment. 16. The computer program product of claim 15 , further comprising the step of generating one or more error boundaries of the predicted allocation of the one or more resources based on a standard deviation of the at least one machine learning model. 17. The computer program product of claim 15 , wherein the step of extracting the plurality of discriminative features from the source code employs one or more of natural language processing techniques, pattern-based techniques and worst-case execution time techniques. 18. The computer program product of claim 17 , wherein the natural language processing techniques extract one or more of the discriminative features from the source code based on a term frequency metric. 19. The computer program product of claim 17 , wherein the pattern-based techniques extract one or more of the discriminative features from the source code by identifying one or more patterns in function calls of the source code. 20. The computer program product of claim 15 , further comprising estimating an amount of time required to process the source code until a completion of the execution of the source code.

Assignees

Inventors

Classifications

  • Dependency analysis; Data or control flow analysis · CPC title

  • for supporting services specification, e.g. SLA · CPC title

  • Machine learning · CPC title

  • Processing or translation of natural language (natural language analysis G06F40/20; semantic analysis G06F40/30) · CPC title

  • Electricity · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11567807B2 cover?
Techniques are provided for allocation of shared computing resources using source code feature extraction and machine learning techniques. An exemplary method comprises obtaining source code for execution in a shared computing environment; extracting a plurality of discriminative features from the source code; obtaining a trained machine learning model; and generating a prediction of an allocat…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06F9/5077. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 31 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).