HTTP Health Check - at least one application endpoint failed

Description

The alert is active when HTTP request to application returns status code other than 200.

Alert Details

Alert ID: bfec0546-118b-453e-a058-e64038639084

Tags: Each tag should follow "key:value" format.

  • FeatureDomain:Service Host
  • PageType:Saved Search
  • PageID:5ab1db6f-84b8-49ee-ad56-43c025ace303
  • CreatedBy:Relativity
  • ResolutionText:Restart the 'kCura Service Host Manager' Windows service
  • Resolution

Metric/Log/Trace Details

Metric Name: NOT numeric_labels.http_status_code : 200 AND labels.http_url : imaging%20health%20check%20service AND httpcheck.status: *

Metric Attributes:

Attribute Name Description Value
labels.http_url HTTP request endpoint Example:https://emttest:8990/Kepler/relativity.imaging.services.interfaces.private.healthcheck.iimagingHealthCheckModule1/imaging%20health%20check%20service/getenvironmentstatusasync
httpcheck.status HTTP request status 0/1
numeric_labels.http_status_code Status code of HTTP request 200/404/500
labels.http_status_class HTTP request stages 1XX/2XX/3XX/4XX/5XX

Rule details

Alert Condition Description: Alert triggers on when HTTP request to application end point returns status code other than 200.

Name Value Description
Rule Type Elastic Query
Data View metrics-*
Filter Query NOT numeric_labels.http_status_code : 200 AND labels.http_url : imaging%20health%20check%20service AND httpcheck.status: * HTTP request of application is inaccessible
Threshold > 0 Count greater than 0, alert triggers
Time Window 5 min Verified data for last 5 min
Frequency 1 min Checks for every 1 min

Requires User Intervention

  • Yes: alert immediately
    • Min time before the alert is active/inactive: 5 minutes

Saved Search link:5ab1db6f-84b8-49ee-ad56-43c025ace303

One or more Resource Servers are inactive.