Datahopper Production Manual

Alerts

job_metrics

The job metrics goroutine has not successfully updated its job cache for some time.

If there are Task Scheduler alerts, resolve those first.

Otherwise, you should check the logs to try to diagnose what's failing.

bot_coverage_metrics

The bot coverage metrics goroutine has not successfully completed a cycle for some time. You should check the logs to try to diagnose what's failing.

swarming_task_metrics

The Swarming task metrics goroutine has not successfully queried for Swarming tasks for some time. You should check the logs to try to diagnose what's failing.

event_metrics

The event metrics goroutine has not successfully updated metrics based on event data for some time. You should check the logs to try to diagnose what's failing. Double-check the instance name to verify which log stream to investigate.

swarming_bot_metrics

The Swarming bot metrics goroutine has not successfully queried for Swarming bots for some time. See the alert for which pool and server is failing. You should check the logs to try to diagnose what's failing.