Server and cloud computing resource optimization method thereof for cloud big data computing architecture

US10460241B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10460241-B2
Application numberUS-201615372348-A
CountryUS
Kind codeB2
Filing dateDec 7, 2016
Priority dateNov 23, 2016
Publication dateOct 29, 2019
Grant dateOct 29, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A server and a cloud computing resource optimization method thereof for big data cloud computing architecture are provided. The server runs a dynamic scaling system to perform the following operations: receiving a task message; executing a profiling procedure to generate a profile based on an to-be-executed task recorded in the task message; executing a classifying procedure to determine a task classification of the to-be-executed task; executing a prediction procedure to obtain a plurality of predicted execution times corresponding to a plurality of computing node numbers, a computing node type and a system parameter of the to-be-executed task; executing an optimization procedure to determine a practical computing node number of the to-be-executed task; and transmitting an optimization output message to a management server to make the management server allocate at least one data computing system to execute a program file of the to-be-executed task.

First claim

Opening claim text (preview).

What is claimed is: 1. A server for big data cloud computing architecture, comprising: a transceiver connected to a network; and a processor electrically connected to the transceiver, being configured to run a dynamic scaling system to execute the following operations: receiving a task message via the transceiver, the task message recording a user-defined attribute, a program file and a plurality of data files of a to-be-executed task, the program file and the data files being stored in a data storage system which runs in a data storage server assembly in the big data cloud computing architecture; executing a profiling procedure that comprises the following steps of: sampling the data files recorded in the task message to choose a plurality of sampled data files and to generate an advance execution message which records the sampled data files and the program file of the to-be-executed task; transmitting the advance execution message to a big data computing server assembly of the big data cloud computing architecture via the transceiver so that a sampled data computing system running in the big data computing server assembly executes the program file on the sampled data files and generates a profile according to an execution result of the program file; and receiving the profile from the big data computing server assembly via the transceiver; executing a classifying procedure that comprises the following step of: based on a classification model, comparing the profile with the classification model to determine a task classification of the to-be-executed task; executing a prediction procedure that comprises the following steps of: based on a computing node type and a system parameter recorded in a prediction sub-model corresponding to the task classification in a prediction model, assigning the computing node type and the system parameter to the to-be-executed task; and based on the prediction sub-model, generating an execution time prediction curve of the to-be-executed task according to the task classification and the profile to obtain a plurality of predicted execution times of the to-be-executed task corresponding to a plurality of computing node numbers, the computing node numbers corresponding to the predicted execution times in one-to-one correspondence; executing an optimization procedure that comprises the following steps of: determining whether there is at least one additional task; when the at least one additional task exists, determining a practical computing node number of the to-be-executed task according to the user-defined attribute and the predicted execution times of the to-be-executed task, an additional user-defined attribute and a plurality of additional predicted execution times of each of the at least one additional task, and a maximum computing resource of the big data computing server assembly; generating an optimization output message, which records the program file, the data files, the practical computing node number, the computing node type and the system parameter of the to-be-executed task; and transmitting the optimization output message to a management server of the big data cloud computing architecture via the transceiver so that a management system run by the management server allocates at least one data computing system running in the big data computing server assembly to execute the program file on the data files of the to-be-executed task according to the optimization output message, wherein the number of the at least one data computing system is equal to the practical computing node number. 2. The server of claim 1 , wherein the processor further executes a monitoring procedure that comprises the following step of: updating the execution time prediction curve of the to-be-executed task when the at least one data computing system is executing the program file of the to-be-executed task. 3. The server of claim 1 , wherein the processor further determines a file size of the data files of the to-be-executed task before executing the profiling procedure, and executes the profiling procedure if the file size of the data files is greater than a threshold. 4. The server of claim 1 , wherein the user-defined attribute comprises a task execution priority value, a task deadline time, a minimum computing node number and a maximum computing node number, and the optimization procedure further comprises the following step of: calculating a task weight value of the to-be-executed task according to the following formula: V = EP × D D - WT where V is the task weight value, EP is the task execution priority value, D is the task deadline time of the to-be-executed task, and WT is an execution waiting time. 5. The server of claim 4 , wherein the processor determines the practical computing node number of the to-be-executed task in the optimization procedure according to the task weight value and the predicted execution times of the to-be-executed task, an additional task weight value and the additional predicted execution times of each of the at least one additional task, and the maximum computing resource of the big data computing server assembly. 6. The server of claim 5 , wherein the optimization procedure further comprises the following steps of: calculating a plurality of ratios between the task weight value of the to-be-executed task and each of the predicted execution times; calculating a plurality of additional ratios between the additional task weight value and each of the additional predicted execution times for each of the at least one additional task; calculating a plurality of sum values between any of the ratios and any of the additional ratios of each of the at least one additional task; selecting the maximum sum value from among the sum values; and setting the practical computing node number to the computing node number which corresponds to the predicted execution time of the ratio contributing to the maximum sum value. 7. The server of claim 6 , wherein the optimization output message further records an additional practical computing node number of each of the at least one additional task, and the optimization procedure further comprises the following step of: for each of the at least one additional task, setting the additional practical computing node number to the additional computing node number which corresponds to the additional predicted execution time of the additional ratio contributing to the maximum sum value. 8. The server of claim 1 , wherein the data storage server assembly further runs a dynamic scaling data storage system, and the dynamic scaling data storage system is configured to store the sampled data files, the profile, the classification model and the prediction model of the to-be-executed task. 9. The server of claim 1 , wherein the profile of the to-be-executed task comprises program file parse data, a sampled task log and a system resource usage record. 10. The server of claim 1 , wherein the classification model has a plurality of standard task classifications, and the processor further uses a clustering algorithm to modify the standard task classifications according to a plurality of historical profiles of a plurality of already executed tasks. 11. The server of claim 1 , further comprising storage configured to store the classification model and the prediction m

Assignees

Inventors

Classifications

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • in which an application is distributed across nodes in the network (software deployment G06F8/60; multiprogramming arrangements G06F9/46) · CPC title

  • G06F9/5027Primary

    the resource being a machine, e.g. CPUs, Servers, Terminals · CPC title

  • G06N5/02Primary

    Knowledge representation; Symbolic representation · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10460241B2 cover?
A server and a cloud computing resource optimization method thereof for big data cloud computing architecture are provided. The server runs a dynamic scaling system to perform the following operations: receiving a task message; executing a profiling procedure to generate a profile based on an to-be-executed task recorded in the task message; executing a classifying procedure to determine a task…
Who is the assignee on this patent?
Inst Information Ind
What technology area does this patent fall under?
Primary CPC classification G06F9/5027. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 29 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).