Introduction
HPE Compute Ops Management (COM) is a monitoring and management service that automates and streamlines management of HPE and 3rd party servers from data center to edge.
COM provides a RESTful API that enables customers to manage their devices programmatically or through a command line. The API allows users to invoke any operation or task that is available through the UI web interface.
The event integration between COM and OpsRamp uses webhooks for real-time event forwarding and centralized observability. Events are instantly pushed from COM to OpsRamp, enabling customers to manage alerts effectively and benefit from a more unified experience.
This integration provides centralized visibility, empowering users to manage alerts and incidents through actions like alert suppression, acknowledgment, incident creation and attachment, process automation, correlation, and escalation.
Note
This integration is available only within the Client scope.Intended Audience
COM customers who would want to view all software and hardware events at a centralized location and take additional actions such as suppress, acknowledge, run automation, create incident on the incoming alerts.
Use Case: Centralized Event Management
- The customer has a centralized event management function – the Command Center, which serves as the first point of triage for events from compute, network, and storage devices.
- A dedicated Compute Team manages compute devices.
- Hardware (HW) health-related events from HPE servers should be triaged by the Command Center.
- The Compute Team should have the ability to drill down into triaged events when needed.
Desired Customer Experience
- User installs COM and OpsRamp in the same workspace.
- From OpsRamp, the user instantiates and configures the COM integration.
- During configuration, the user specifies whether certain COM event types should flow into OpsRamp.
- COM-generated HW health events appear in OpsRamp’s alert console.
- The user can click on these alerts to take appropriate actions such as acknowledging, suppressing, create incidents, or running automation.