Azure Batch Accounts

Introduction

Use Azure Batch to run large-scale parallel and high-performance computing (HPC) batch jobs efficiently in Azure.

Azure Batch does the following:

  • Creates and manages a pool of compute nodes (virtual machines).
  • Installs the applications you want to run.
  • Schedules jobs to run on the nodes.

There is no cluster or job scheduler software to install, manage, or scale.

Setup

To set up the OpsRamp Azure integration and discover the Azure service, go to Azure Integration Discovery Profile and select Batch Accounts.

Metrics

| OpsRamp Metric | Metric Display Name | Unit | Aggregation Type | Description |
|---|---|---|---|---|
| azure.core.count | Dedicated Core Count | Count | Total | Total number of dedicated cores in the batch account. |
| azure.total.node.count | Dedicated Node Count | Count | Total | Total number of dedicated nodes in the batch account. |
| azure.low.priority.core.count | LowPriority Core Count | Count | Total | Total number of low priority cores in the batch account. |
| azure.total.low.priority.node.count | Low Priority Node Count | Count | Total | Total number of low priority nodes in the batch account. |
| azure.creating.node.count | Creating Node Count | Count | Total | Number of nodes being created. |
| azure.starting.node.count | Starting Node Count | Count | Total | Number of nodes starting. |
| azure.waiting.for.start.task.node.count | Waiting For Start Task Node Count | Count | Total | Number of nodes waiting for the Start Task to complete. |
| azure.start.task.failed.node.count | Start Task Failed Node Count | Count | Total | Number of nodes where the Start Task has failed. |
| azure.idle.node.count | Idle Node Count | Count | Total | Number of idle nodes. |
| azure.offline.node.count | Offline Node Count | Count | Total | Number of offline nodes. |
| azure.rebooting.node.count | Rebooting Node Count | Count | Total | Number of rebooting nodes. |
| azure.remaining.node.count | Reimaging Node Count | Count | Total | Number of reimaging nodes. |
| azure.running.node.count | Running Node Count | Count | Total | Number of running nodes. |
| azure.leaving.pool.node.count | Leaving Pool Node Count | Count | Total | Number of nodes leaving the pool. |
| azure.unusable.node.count | Unusable Node Count | Count | Total | Number of unusable nodes. |
| azure.preempted.node.count | Preempted Node Count | Count | Total | Number of preempted nodes. |
| azure.task.start.event | Task Start Events | Count | Total | Number of tasks that have started. |
| azure.task.complete.event | Task Complete Events | Count | Total | Total number of tasks that have completed. |
| azure.task.fail.event | Task Fail Events | Count | Total | Total number of tasks that have completed in a failed state. |
| azure.pool.create.event | Pool Create Events | Count | Total | Total number of pools that have been created. |
| azure.pool.resize.start.event | Pool Resize Start Events | Count | Total | Total number of pool resizes that have started. |
| azure.pool.resize.complete.event | Pool Resize Complete Events | Count | Total | Total number of pool resizes that have completed. |
| azure.pool.delete.start.event | Pool Delete Start Events | Count | Total | Total number of pool deletes that have started. |
| azure.pool.delete.complete.event | Pool Delete Complete Events | Count | Total | Total number of pool deletes that have completed. |
| azure.batchaccount.job.delete.complete.event | Job Delete Complete Events | Count | Total | Total number of jobs that have been successfully deleted. |
| azure.batchaccount.job.delete.start.event | Job Delete Start Events | Count | Total | Total number of jobs that have been requested to be deleted. |
| azure.batchaccount.job.disable.complete.event | Job Disable Complete Events | Count | Total | Total number of jobs that have been successfully disabled. |
| azure.batchaccount.job.disable.start.event | Job Disable Start Events | Count | Total | Total number of jobs that have been requested to be disabled. |
| azure.batchaccount.job.start.event | Job Start Events | Count | Total | Total number of jobs that have been successfully started. |
| azure.batchaccount.job.terminate.complete.event | Job Terminate Complete Events | Count | Total | Total number of jobs that have been successfully terminated. |
| azure.batchaccount.job.terminate.start.event | Job Terminate Start Events | Count | Total | Total number of jobs that have been requested to be terminated. |
| cloud.instance.state | Status/State | n/a | n/a | Status/State |
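
As an illustration of how these counters might be combined, the following minimal Python sketch derives two health indicators from one set of metric samples. It is not part of OpsRamp or Azure: the function names, the `samples` dictionary, and the choice of which node states count as "unhealthy" are all assumptions made for this example. Only the metric names come from the table above. The sketch also assumes that failed tasks are included in the completed-task count, as the metric descriptions suggest.

```python
from typing import Mapping

# Metric names are taken from the table above. Grouping these four
# node states as "unhealthy" is an assumption made for this sketch.
UNHEALTHY_NODE_METRICS = (
    "azure.start.task.failed.node.count",
    "azure.offline.node.count",
    "azure.unusable.node.count",
    "azure.preempted.node.count",
)


def task_failure_ratio(samples: Mapping[str, float]) -> float:
    """Return failed tasks as a fraction of completed tasks.

    Assumes azure.task.fail.event is a subset of azure.task.complete.event.
    Returns 0.0 when no tasks have completed, to avoid dividing by zero.
    """
    completed = samples.get("azure.task.complete.event", 0.0)
    failed = samples.get("azure.task.fail.event", 0.0)
    return failed / completed if completed > 0 else 0.0


def unhealthy_node_count(samples: Mapping[str, float]) -> float:
    """Sum the node-state counters that this sketch treats as unhealthy."""
    return sum(samples.get(name, 0.0) for name in UNHEALTHY_NODE_METRICS)


# Hypothetical sample values for one collection interval (not real data).
example_samples = {
    "azure.task.complete.event": 200,
    "azure.task.fail.event": 5,
    "azure.offline.node.count": 1,
    "azure.unusable.node.count": 2,
}

print("task failure ratio:", task_failure_ratio(example_samples))
print("unhealthy nodes:", unhealthy_node_count(example_samples))
```

Metrics missing from the sample set are treated as zero, so the helpers still work when only some Batch metrics are being collected.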

Event support

  • Supported
  • Configurable in OpsRamp Azure Integration Discovery Profile.

External reference