On-the-Fly Conversion (P1) - Queue Backlog Threshold Exceeded

Description

This alert is triggered when the message backlog in the On-the-Fly Conversion (P1) ConversionAgent and ConversionCompleteAgent queues exceeds 50 messages for more than 5 minutes, indicating potential processing delays.

Impact When Active

  • A backlog in the On-the-Fly Conversion queue leads to slower or failed document rendering in the viewer, delaying review workflows.
  • Risk of further backlog accumulation if not addressed.

Resolution Guidance

  1. In RabbitMQ, locate the affected queues:
    • conversions_RP*_ConversionAgent_Priority1*
    • conversionresponses_ConversionCompleteAgent_Priority1* Confirm the message count is greater than 50 (the threshold) and observe whether the count is decreasing or increasing/static.
  2. Queues to check: ConversionAgent / ConversionCompleteAgent.
  3. Check the message count in each queue.
  4. Observe the trend:
    • Decreasing: Agents are actively processing messages.
    • Increasing or static: Agents may be stalled or underperforming.
  5. Check the status of the ConversionAgent / ConversionCompleteAgent in Relativity:
    • Log into Relativity
    • Go to the Agents tab
    • Filter by name containing 'conversion'
  6. Ensure RabbitMQ is running and healthy.
  7. After remediation, monitor the backlog until it falls below the threshold.

You can also try..

If the queue continues to remain backlogged. You can try following:

  • Restart the kCura services on the Conversion Agent servers.
  • Review system resources (CPU, memory, disk) on Conversion Agent and RabbitMQ hosts.

Alert Details

This alert monitors the message count of the On-the-Fly Conversion (P1) Conversion Agent and Conversion Complete Agent queues. It triggers if the backlog exceeds 50 messages for more than 5 minutes.

Alert Condition Details

NameValueDescription
Rule TypeMetric threshold
Data Viewmetrics-*
Filter Query((labels.rabbitmq_queue_name: conversions_RP*_ConversionAgent_Priority1* AND NOT labels.rabbitmq_queue_name: conversions_RP-_ConversionAgent_Priority1) OR labels.rabbitmq_queue_name: conversionresponses_ConversionCompleteAgent_Priority1* )Target both P1 conversion queues
AggregationmaxHighest queue depth within the time window
Group bylabels.rabbitmq_queue_nameEvaluate per queue
Threshold>= 50Alert when queue depth exceeds 50
Time Window5 minutesEvaluate the last 5 minutes
Frequency2 minutesCheck every 2 minutes

Alert Metric Details

Metric Name: rabbitmq.message.current

Metric Description: The alert is true when a queue’s message count remains above 50 for the last 5 minutes.

Metric Attributes:

Attribute NameDescription
labels.rabbitmq_queue_nameThe name of the RabbitMQ queue
rabbitmq.message.currentCurrent number of messages in the queue
Return to top of the page
Feedback