Use Azure Batch to run large-scale parallel and high-performance computing (HPC) batch jobs efficiently in Azure.

Azure Batch does the following:

  • Creates and manages a pool of compute nodes (virtual machines).
  • Installs the applications you want to run.
  • Schedules jobs to run on the nodes.

There is no cluster or job scheduler software to install, manage, or scale.

Use the OpsRamp Azure public cloud integration to discover Azure Batch accounts and collect metrics from them.

Setup

To set up the OpsRamp Azure integration and discover the Azure service, go to Azure Integration Discovery Profile and select Batch Accounts.

Metrics

| OpsRamp Metric | Description | Metric Display Name | Unit | Aggregation Type |
|---|---|---|---|---|
| azure_core_count | Total number of dedicated cores in the batch account. | Dedicated Core Count | Count | Total |
| azure_total_node_count | Total number of dedicated nodes in the batch account. | Dedicated Node Count | Count | Total |
| azure_low_priority_core_count | Total number of low priority cores in the batch account. | LowPriority Core Count | Count | Total |
| azure_total_low_priority_node_count | Total number of low priority nodes in the batch account. | Low Priority Node Count | Count | Total |
| azure_creating_node_count | Number of nodes being created. | Creating Node Count | Count | Total |
| azure_starting_node_count | Number of nodes starting. | Starting Node Count | Count | Total |
| azure_waiting_for_start_task_node_count | Number of nodes waiting for the Start Task to complete. | Waiting For Start Task Node Count | Count | Total |
| azure_start_task_failed_node_count | Number of nodes where the Start Task has failed. | Start Task Failed Node Count | Count | Total |
| azure_idle_node_count | Number of idle nodes. | Idle Node Count | Count | Total |
| azure_offline_node_count | Number of offline nodes. | Offline Node Count | Count | Total |
| azure_rebooting_node_count | Number of rebooting nodes. | Rebooting Node Count | Count | Total |
| azure_remaining_node_count | Number of reimaging nodes. | Reimaging Node Count | Count | Total |
| azure_running_node_count | Number of running nodes. | Running Node Count | Count | Total |
| azure_leaving_pool_node_count | Number of nodes leaving the pool. | Leaving Pool Node Count | Count | Total |
| azure_unusable_node_count | Number of unusable nodes. | Unusable Node Count | Count | Total |
| azure_preempted_node_count | Number of preempted nodes. | Preempted Node Count | Count | Total |
| azure_task_start_event | Number of tasks that have started. | Task Start Events | Count | Total |
| azure_task_complete_event | Total number of tasks that have completed. | Task Complete Events | Count | Total |
| azure_task_fail_event | Total number of tasks that have completed in a failed state. | Task Fail Events | Count | Total |
| azure_pool_create_event | Total number of pools that have been created. | Pool Create Events | Count | Total |
| azure_pool_resize_start_event | Total number of pool resizes that have started. | Pool Resize Start Events | Count | Total |
| azure_pool_resize_complete_event | Total number of pool resizes that have completed. | Pool Resize Complete Events | Count | Total |
| azure_pool_delete_start_event | Total number of pool deletes that have started. | Pool Delete Start Events | Count | Total |
| azure_pool_delete_complete_event | Total number of pool deletes that have completed. | Pool Delete Complete Events | Count | Total |
| azure_batchaccount_job_delete_complete_event | Total number of jobs that have been successfully deleted. | Job Delete Complete Events | Count | Total |
| azure_batchaccount_job_delete_start_event | Total number of jobs that have been requested to be deleted. | Job Delete Start Events | Count | Total |
| azure_batchaccount_job_disable_complete_event | Total number of jobs that have been successfully disabled. | Job Disable Complete Events | Count | Total |
| azure_batchaccount_job_disable_start_event | Total number of jobs that have been requested to be disabled. | Job Disable Start Events | Count | Total |
| azure_batchaccount_job_start_event | Total number of jobs that have been successfully started. | Job Start Events | Count | Total |
| azure_batchaccount_job_terminate_complete_event | Total number of jobs that have been successfully terminated. | Job Terminate Complete Events | Count | Total |
| azure_batchaccount_job_terminate_start_event | Total number of jobs that have been requested to be terminated. | Job Terminate Start Events | Count | Total |
| cloud_instance_state | Status/State | Status/State | n/a | n/a |
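
As an illustration of how the metrics listed above might be combined once collected, the following minimal Python sketch derives two health indicators from a set of sampled values. The metric names come from the list above; the sample values, the function names, and the choice of which node states count as "unhealthy" are assumptions made for this example and are not part of OpsRamp or Azure. It also assumes that failed tasks are counted among completed tasks, as the metric descriptions suggest.

```python
# Hypothetical sample values keyed by OpsRamp metric name (not real data).
sample = {
    "azure_task_complete_event": 200,
    "azure_task_fail_event": 5,
    "azure_unusable_node_count": 1,
    "azure_start_task_failed_node_count": 2,
    "azure_offline_node_count": 0,
}

# Assumption for this sketch: these node states are treated as unhealthy.
UNHEALTHY_NODE_METRICS = (
    "azure_unusable_node_count",
    "azure_start_task_failed_node_count",
    "azure_offline_node_count",
)


def task_failure_ratio(metrics):
    """Return the fraction of completed tasks that ended in a failed state.

    Returns 0.0 when no tasks have completed, to avoid dividing by zero.
    """
    completed = metrics.get("azure_task_complete_event", 0)
    failed = metrics.get("azure_task_fail_event", 0)
    return failed / completed if completed else 0.0


def unhealthy_node_count(metrics):
    """Sum the node counts for the states listed in UNHEALTHY_NODE_METRICS."""
    return sum(metrics.get(name, 0) for name in UNHEALTHY_NODE_METRICS)


if __name__ == "__main__":
    print(f"Task failure ratio: {task_failure_ratio(sample):.3f}")
    print(f"Unhealthy nodes: {unhealthy_node_count(sample)}")
```

Missing metrics default to zero here, so the helpers still return a value when only part of the metric set has been collected.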

Event support

  • Supported
  • Configurable in OpsRamp Azure Integration Discovery Profile.

External reference