Use Azure Batch to run large-scale parallel and high-performance computing (HPC) batch jobs efficiently in Azure.

Azure Batch does the following:

  • Creates and manages a pool of compute nodes (virtual machines).
  • Installs the applications you want to run.
  • Schedules jobs to run on the nodes.

There is no cluster or job scheduler software to install, manage, or scale.

Use the OpsRamp Azure public cloud integration to discover and collect metrics against the Azure service.

Setup

To set up the OpsRamp Azure integration and discover the Azure service, go to Azure Integration Discovery Profile and select Batch Accounts

Metrics

OpsRamp MetricMetric Display NameUnitAggregation TypeDescription
azure_core_countDedicated Core CountCountTotalTotal number of dedicated cores in the batch account.
azure_total_node_countDedicated Node CountCountTotalTotal number of dedicated nodes in the batch account.
azure_low_priority_core_countLowPriority Core CountCountTotalTotal number of low priority cores in the batch account.
azure_total_low_priority_node_countLow Priority Node CountCountTotalTotal number of low priority nodes in the batch account.
azure_creating_node_countCreating Node CountCountTotalNumber of nodes being created.
azure_starting_node_countStarting Node CountCountTotalNumber of nodes starting.
azure_waiting_for_start_task_node_countWaitng For Start Task Node CountCountTotalNumber of nodes waiting for the Start Task to complete.
azure_start_task_failed_node_countStart Task Failed Node CountCountTotalNumber of nodes where the Start Task has failed.
azure_idle_node_countIdle Node CountCountTotalNumber of idle nodes.
azure_offline_node_countOffline Node CountCountTotalNumber of offline nodes.
azure_rebooting_node_countRebooting Node CountCountTotalNumber of rebooting nodes.
azure_remaining_node_countReimaging Node CountCountTotalNumber of reimaging nodes.
azure_running_node_countRunning Node CountCountTotalNumber of running nodes.
azure_leaving_pool_node_countLeaving Pool Node CountCountTotalNumber of nodes leaving the pool.
azure_unusable_node_countUnusable Node CountCountTotalNumber of unusable nodes.
azure_preempted_node_countPreempted Node CountCountTotalNumber of preempted nodes.
azure_task_start_eventTask Start EventsCountTotalNumber of tasks that have started.
azure_task_complete_eventTask Complete EventsCountTotalTotal number of tasks that have completed.
azure_task_fail_eventTask Fail EventsCountTotalTotal number of tasks that have completed in a failed state.
azure_pool_create_eventPool Create EventsCountTotalTotal number of pools that have been created.
azure_pool_resize_start_eventPool Resize Start EventsCountTotalTotal number of pool resizes that have started.
azure_pool_resize_complete_eventPool Resize Complete EventsCountTotalTotal number of pool resizes that have completed.
azure_pool_delete_start_eventPool Delete Start EventsCountTotalTotal number of pool deletes that have started.
azure_pool_delete_complete_eventPool Delete Complete EventsCountTotalTotal number of pool deletes that have completed.
azure_batchaccount_job_delete_complete_eventJob Delete Complete EventsCountTotalTotal number of jobs that have been successfully deleted.
azure_batchaccount_job_delete_start_eventJob Delete Start EventsCountTotalTotal number of jobs that have been requested to be deleted.
azure_batchaccount_job_disable_complete_eventJob Disable Complete EventsCountTotalTotal number of jobs that have been successfully disabled.
azure_batchaccount_job_disable_start_eventJob Disable Start EventsCountTotalTotal number of jobs that have been requested to be disabled.
azure_batchaccount_job_start_eventJob Start EventsCountTotalTotal number of jobs that have been successfully started.
azure_batchaccount_job_terminate_complete_eventJob Terminate Complete EventsCountTotalTotal number of jobs that have been successfully terminated.
azure_batchaccount_job_terminate_start_eventJob Terminate Start EventsCountTotalTotal number of jobs that have been requested to be terminated.
cloud_instance_stateStatus/Staten/an/aStatus/State

Event support

  • Supported
  • Configurable in OpsRamp Azure Integration Discovery Profile.

External reference