Windows Service is stopped for at least one Resource Server

Description

Alert is true if the Windows Service is stopped for at least one agent server, web background processing server, worker manager server or analytics server.

Alert Details

Alert ID: d14f3e28-fa2b-4df8-8b02-4d35d45bb800

Tags: Each tag should follow "key:value" format.

  • FeatureDomain:Host Infrastructure
  • PageType:Dashboard
  • PageID:cd200ee0-1e61-4645-8220-83ce82914a71
  • CreatedBy:Relativity
  • ResolutionText:For any Resource Server where the Windows Service is stopped, go to that Server page and click 'Restart Service'
  • Resolution

Metric Details

Metric Name: relsvr.windows_service.running

Metric Attributes:

Attribute NameDescriptionValue
labels.nameName of Service
labels.startup_mode
labels.stateaccessible/inaccessible

Rule details

Alert Condition Description: Alert triggers on at least one windows service stopped for at least 90 seconds.

NameValueDescription
Rule TypeElastic Query
Data Viewmetrics-*
Filter Queryrelsvr.windows_service.running : 0Windows service is stopped
GroupCountnumber of Windows service is stopped
Threshold> 0Count greter than 0, alert triggers
Time Window90 secVerified data for last 90 sec
Frequency30 secChecks for each 30 seconds

Requires User Intervention

  • Yes: alert immediately
    • Min time before the alert is active/inactive: 90 seconds

Kibana dashboard link

  • Host Heartbeat alert should not be in active state.
  • If windows service is associated with any resource server, then "One or more Resource Servers are inactive" alert should fire.
Return to top of the page
Feedback