Collector Type: Agent

Category: Application Monitors

Application Name: MesosMaster

Global Template Name: Mesos Master Monitoring Template

Parameters

NamesDescriptionDefault Value
Host IP AddressThe host on which Monitd is running.127.0.0.1
PortThe port on which Mesos is running.8080
UsernameThe username of the server, if authentication is enabled.NA
PasswordThe password of the server, if authentication is enabled.NA

Note: All field attributes are mandatory. Use default values wherever applicable.

Collected Metrics

Metric NameDisplay Name
mesos.framework.cpuMesos Framework CPU
mesos.framework.memMesos Framework Memory
mesos.framework.diskMesos Framework Disk
marathon.appsMarathon Applications Count
marathon.deploymentsMarathon Deployments
marathon.backoffFactorMarathon Backoff Factor
marathon.backoffSecondsMarathon Backoff Seconds
marathon.cpusMarathon CPUs
marathon.diskMarathon DISK
marathon.instancesMarathon Instances
marathon.memMarathon Memory
marathon.taskRateLimitMarathon Task Rate Limit
marathon.tasksRunningMarathon Task Running
marathon.tasksStagedMarathon Task Staged
marathon.tasksHealthyMarathon Tasks Healthy
marathon.tasksUnhealthyMarathon Tasks Unhealthy
marathon.queue.sizeMarathon Queue Size
marathon.queue.countMarathon Queue Count
marathon.queue.delayMarathon Queue Delay
marathon.queue.offers.processedMarathon Queue Offer Processed
marathon.queue.offers.unusedMarathon Queue Offers Unused
marathon.queue.offers.reject.lastMarathon Queue Offers Reject Last
marathon.queue.offers.reject.launchMarathon Queue Offers Reject Launch
mesos.registrar.registry_size_bytesMesos Registrar registry_size_bytes
mesos.registrar.state_store_ms.p90Registrar state_store_ms.p90
mesos.registrar.state_store_ms.p99Registrar state_store_ms.p99
mesos.registrar.queued_operationsMesos Registrar queued_operations
mesos.registrar.state_store_ms.p999Registrar state_store_ms.p999
mesos.registrar.state_store_ms.p95Registrar state_store_ms.p95
mesos.registrar.state_store_ms.p9999Registrar state_store_ms.p9999
mesos.invalid_status_update_acknowledgementsNumber of invalid status update acknowledgements
mesos.registrar.state_store_ms.p50Registrar state_store_ms.p50
mesos.stats.electedElected as master
mesos.registrar.log.recoveredRegistrar log recovered
mesos.master.countRegistry write count
mesos.role.diskMesos Role Disk
mesos.role.cpuMesos Role CPU
mesos.role.memMesos Role Memory
mesos.cluster.slave_registrationsSlave registrations
mesos.cluster.mem_percentAllocated memory percent
mesos.cluster.tasks_errorInvalid tasks
mesos.cluster.disk_totalDisk space total
mesos.cluster.tasks_finishedTasks finished
mesos.cluster.tasks_killedTasks killed
mesos.cluster.slave_shutdowns_scheduledSlave shutdowns scheduled
mesos.cluster.frameworks_activeFrameworks active
mesos.cluster.frameworks_connectedFrameworks connected
mesos.cluster.slaves_inactiveAgents inactive
mesos.cluster.slaves_unreachableAgents unreachable
mesos.cluster.gpus_usedNumber of GPUs used
mesos.cluster.mem_totalMemory total
mesos.cluster.frameworks_inactiveFrameworks inactive
mesos.cluster.event_queue_http_requestsEvent queue HTTP requests
mesos.cluster.tasks_startingTasks starting
mesos.cluster.slave_removalsSlave removals
mesos.cluster.cpus_totalCPUs total
mesos.cluster.tasks_stagingTasks staging
mesos.cluster.mem_usedAllocated memory
mesos.cluster.slaves_activeAgents active
mesos.cluster.gpus_totalGPUs total
mesos.cluster.disk_percentAllocated disk space percent
mesos.cluster.frameworks_disconnectedFrameworks disconnected
mesos.cluster.invalid_status_updatesInvalid status updates
mesos.cluster.valid_framework_to_executor_messagesValid framework to executor messages
mesos.cluster.tasks_failedFailed tasks
mesos.cluster.tasks_lostTasks lost
mesos.cluster.event_queue_messagesEvent queue messages
mesos.cluster.slave_reregistrationsSlave reregistrations
mesos.cluster.slaves_connectedAgents connected
mesos.cluster.valid_status_update_acknowledgementsValid status update acknowledgement messages
mesos.cluster.slave_shutdowns_canceledSlave shutdowns canceled
mesos.cluster.slaves_disconnectedAgents disconnected
mesos.cluster.cpus_usedNumber of CPUs used
mesos.cluster.outstanding_offersOutstanding resource offers
mesos.cluster.disk_usedAllocated disk space
mesos.cluster.dropped_messagesDropped messages
mesos.cluster.invalid_framework_to_executor_messagesInvalid framework to executor messages
mesos.cluster.gpus_percentAllocated GPUs percent
mesos.cluster.tasks_runningTasks running
mesos.cluster.cpus_percentAllocated CPUs percent
mesos.cluster.event_queue_dispatchesDispatches in the event queue
mesos.cluster.valid_status_updatesValid status updates
dcos.health.admin.router.agentAdmin router agent service health
dcos.health.log.agentAgent Log service health
dcos.health.marathonMarathon service health
dcos.health.telegrafTelegraf service health
dcos.health.admin.router.masterAdmin router master service health
dcos.health.checks.api.socketChecks API socket health
dcos.health.checks.timerChecks Timer service health
dcos.health.historyHistory service health
mdcos.health.log.master.socketMaster Log socket health
dcos.health.net.watchdogNet Watchdog service health
dcos.health.gcDocker GC
dcos.health.resolv.timerGenerate resolv.conf Timer service health
dcos.health.mesos.masterMesos Master service health
dcos.health.authenticationAuthentication service health
dcos.health.gc.timerDocker GC Timer
dcos.health.diagnostics.agentDiagnostics Agent service health
dcos.health.jobsJobs service health
dcos.health.netNet service health
dcos.health.rexrayREX_Ray service health
dcos.health.diagnostics.agent.socketDiagnostics Agent socket health
dcos.health.logrotate.agentAgent Logrotate service health
dcos.health.logrotate.masterMaster Logrotate service health
dcos.health.logrotate.master.timerLogrotate Timer
dcos.health.mesos.agent.publicMesos Public Agent service health
dcos.health.poststart.checksPoststart Checks service health
dcos.health.signalSignal service health
dcos.health.signal.timerSignal Timer service health
dcos.health.resolvGenerate resolv.conf service health
dcos.health.telegraf.socketTelegraf socket health
dcos.health.log.masterMaster Log service health
dcos.health.exhibitorExhibitor service health
dcos.health.checks.apiChecks API service health
dcos.health.component.package.managerComponent Package Manager (Pkgpanda) service health
dcos.health.log.agent.socketAgent Log socket health
dcos.health.package.managerPackage Manager service health
dcos.health.logrotate.agent.timerLogrotate Timer
dcos.health.mesos.agentMesos Agent service health
dcos.health.mesos.dnsMesos DNS service health