8b899465-f542-46a1-9ea3-90c8db0acad2
Kepler service failed to start after three attempts
Description
This alert is triggered when a Kepler Service has failed to start after three attempts.
Impact When Active
-
When the Service Host starts up, it creates a Kepler Service for each associated application.
Each Kepler Service process includes an automatic retry mechanism that attempts to restart the service if it fails during initialization. The Service Host will attempt up to three restarts. If all three attempts fail, it will stop retrying, and no further automatic restart will occur.
-
Any functionality associated with the affected application will be unavailable.
-
API calls targeting the failed Kepler Service will not be processed.
-
In other words, there will be issues with running any Relativity application functionality
or making any API calls to the Kepler Service that is not running.
How To Resolve
- Go to the Backend of the Relativity instance.
- Verify the 'kCura Service Host Manager' Windows service is running.
- If any one application not working, restart the 'kCura Service Host Manager' Windows service.
- Wait for 6 min so that all applications/services are deleted and recreated.
- Refresh Kibana browser and the alert will be recovered.
Alert Details
Alert Condition Details
Name |
Value |
Description |
Rule Type |
Elastic Query |
|
Data View |
Logs-* |
|
Filter Query |
labels.event_type :"health_check" AND labels.status_state: "unhealthy" AND labels.event_source :"relsvr.servicehost" AND labels.name:"relsvr.servicehost.startup_service.failure" |
|
Threshold |
> 0 |
Count greater than 0, alert triggers |
Time Window |
5 min |
Verified data for last 5 min |
Frequency |
1 min |
Checks for every 1 min |