What is claimed is:
1. A computer program product deployed in storage servers in a cloud computing environment for determining a health status of the storage servers, wherein the storage servers provide cloud services including access to customer data to customers of the cloud computing environment, wherein the cloud computing environment includes a management system implementing a management program, wherein the computer program product comprises a computer readable storage medium having program code executed in each storage server of the storage servers to perform operations, the operations comprising:
receiving an error message for a resource indicating that the resource needs to be taken offline;
indicating the resource as unavailable;
calculating an availability score as a function of a number of available resources at the storage server; and
transmitting information on the availability score to the management program to use to determine whether to migrate at least one of the cloud services and the customer data from a source storage server of the storage servers from which the availability score is received to at least one other of the storage servers in the cloud computing environment.
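By way of illustration only, the four operations recited in claim 1 might be sketched as follows. The class name `HealthMonitor`, the `send` callback standing in for transmission to the management program, and the message layout are hypothetical. The ratio used for the score is one possible function (the one recited in claim 2); claim 1 itself only requires that the score be a function of the number of available resources.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class HealthMonitor:
    """Hypothetical per-server monitor; all names here are illustrative."""

    # Resource name -> True while the resource is available.
    resources: Dict[str, bool]
    # Stand-in for transmitting to the management program (assumption).
    send: Callable[[dict], None]
    server_id: str = "server-0"

    def availability_score(self) -> float:
        """Available resources divided by total resources (assumed function)."""
        if not self.resources:
            raise ValueError("no resources registered")
        available = sum(1 for ok in self.resources.values() if ok)
        return available / len(self.resources)

    def on_error(self, resource: str) -> float:
        """Handle an error message saying `resource` must be taken offline."""
        if resource not in self.resources:
            raise KeyError(resource)
        # Indicate the resource as unavailable.
        self.resources[resource] = False
        # Calculate the availability score after the update.
        score = self.availability_score()
        # Transmit information on the score to the management program.
        self.send({"server": self.server_id, "availability_score": score})
        return score
```

In this sketch a server with four resources that loses one would report a score of 0.75; whether that value warrants migration is left to the management program.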
2. The computer program product of claim 1, wherein the function calculates the availability score by dividing the number of available resources by a total number of resources.
3. The computer program product of claim 2, wherein the operations further comprise:
indicating one less available resource in response to the error message before calculating the availability score.
4. The computer program product of claim 2, wherein the function for calculating the availability score additionally considers a number of recovery events resulting from Input/Output (I/O) requests and a total number of allowed recovery events at the storage server.
5. The computer program product of claim 4, wherein the function calculates the availability score by combining a percentage of the number of the available storage server resources to the total number of resources and a percentage of the number of recovery events divided by the total number of allowed recovery events.
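As a non-limiting illustration of claims 4 and 5, the two percentages could be combined as shown below. The claims do not state how the percentages are combined, so the weighted average, the `resource_weight` parameter, and the inversion of the recovery percentage (so that a higher score means a healthier server) are all assumptions of this sketch.

```python
def combined_availability_score(
    available: int,
    total: int,
    recovery_events: int,
    allowed_recovery_events: int,
    resource_weight: float = 0.5,  # hypothetical weighting, not from the claims
) -> float:
    """Combine a resource percentage and a recovery-event percentage (0-100)."""
    if total <= 0 or allowed_recovery_events <= 0:
        raise ValueError("totals must be positive")
    if not 0.0 <= resource_weight <= 1.0:
        raise ValueError("resource_weight must be in [0, 1]")

    # Percentage of available resources to the total number of resources.
    resource_pct = 100.0 * available / total
    # Percentage of recovery events to the total allowed recovery events.
    recovery_pct = 100.0 * recovery_events / allowed_recovery_events

    # Assumption: more recovery events should lower the score, so use the
    # remaining headroom, clamped at zero if the allowance is exceeded.
    recovery_headroom = max(0.0, 100.0 - recovery_pct)

    return resource_weight * resource_pct + (1.0 - resource_weight) * recovery_headroom
```

With 8 of 10 resources available and 5 of 20 allowed recovery events consumed, the sketch weights 80% and 75% equally.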
6. The computer program product of claim 1, wherein the available storage server resources used in calculating the availability score comprise different resource types.
7. The computer program product of claim 1, wherein the operations further comprise:
determining, from the availability score, a severity level associated with the availability score, wherein the transmitted information on the availability score comprises the determined severity level, and wherein the at least one of the cloud services of the source storage server is migrated in response to the transmitted determined severity level comprising a highest severity level.
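For illustration only, the severity determination of claim 7 and the resulting migration decision might look like the following. The three severity levels, the threshold values, and the function names are hypothetical; the claim requires only that a severity level be determined from the score and that migration occur when the transmitted level is the highest one.

```python
from enum import IntEnum


class Severity(IntEnum):
    """Hypothetical severity levels; a larger value is more severe."""

    LOW = 1
    MEDIUM = 2
    HIGH = 3


def severity_for(score: float, medium_below: float = 0.9, high_below: float = 0.7) -> Severity:
    """Map an availability score in [0, 1] to a severity level.

    The thresholds are illustrative assumptions, not values from the claims.
    """
    if score < high_below:
        return Severity.HIGH
    if score < medium_below:
        return Severity.MEDIUM
    return Severity.LOW


def should_migrate(severity: Severity) -> bool:
    """Management-side check: migrate only at the highest severity level."""
    return severity == max(Severity)
```

Under these assumed thresholds, a score of 0.5 maps to the highest level and would trigger migration, while a score of 0.8 would be reported without migration.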
8. A cloud computing infrastructure for providing cloud services including access to customer data to customers at computing nodes of the cloud computing infrastructure, comprising:
a management system including a management program; and
a plurality of storage servers providing the cloud services including access to customer data in the cloud computing infrastructure, wherein each storage server includes a computer readable storage medium having program instructions embodied therein that when executed by a processor perform operations, the operations comprising:
receiving an error message for a resource indicating that the resource needs to be taken offline;
indicating the resource as unavailable;
calculating an availability score as a function of a number of available resources at the storage server; and
transmitting information on the availability score to the management program to use to determine whether to migrate at least one of the cloud services and the customer data from a source storage server of the storage servers from which the availability score is received to at least one other of the storage servers in the cloud computing infrastructure.
9. The cloud computing infrastructure of claim 8, wherein the function calculates the availability score by dividing the number of available resources by a total number of resources.
10. The cloud computing infrastructure of claim 9, wherein the operations further comprise:
indicating one less available resource in response to the error message before calculating the availability score.
11. The cloud computing infrastructure of claim 9, wherein the function for calculating the availability score additionally considers a number of recovery events resulting from Input/Output (I/O) requests and a total number of allowed recovery events at the storage server.
12. The cloud computing infrastructure of claim 11, wherein the function calculates the availability score by combining a percentage of the number of the available storage server resources to the total number of the resources and a percentage of the number of recovery events divided by the total number of allowed recovery events.
13. The cloud computing infrastructure of claim 8, wherein the available storage server resources used in calculating the availability score comprise different resource types.
14. The cloud computing infrastructure of claim 8, wherein the operations further comprise:
determining, from the availability score, a severity level associated with the availability score, wherein the transmitted information on the availability score comprises the determined severity level, and wherein the at least one of the cloud services of the source storage server is migrated in response to the transmitted determined severity level comprising a highest severity level.
15. A method for determining a health status of storage servers in a cloud computing environment in which the storage servers provide cloud services including access to customer data to customers of the cloud computing environment, comprising:
deploying a health status monitor program in the storage servers to perform operations in the storage servers, wherein the storage servers provide the cloud services including access to computational services and data to customers of the cloud computing environment, and wherein the health status monitor program performs in each storage server of the storage servers in which it is deployed:
receiving an error message for a resource indicating that the resource needs to be taken offline;
indicating the resource as unavailable;
calculating an availability score as a function of a number of available resources at the storage server; and
transmitting information on the availability score to a management program to use to determine whether to migrate at least one of the cloud services and the customer data from a source storage server of the storage servers from which the availability score is received to at least one other of the storage servers in the cloud computing environment.
16. The method of claim 15, wherein the function calculates the availability score by dividing the number of available resources by a total number of resources.
17. The method of claim 16, wherein the operations further comprise:
indicating one less available resource in response to the error message before calculating the availability score.
18. The method of claim 16, wherein the function for calculating the availability score additionally considers a number of recovery events resulting from Input/Output (I/O) requests and a total number of allowed recovery events at the storage server.
19. The method of claim 18, wherein the function calculates the availability score by combining a perc