General information about the Cluster Telemetry is available in the design doc. The maintenance doc details how to maintain CT's different components.
This alert indicates there are many tasks in the queue. There are several possibilities:
RunOnGCE
is false
) requested in a short period of time, it may take a while to complete all tasks."TsStarted": 0
(ignoring “scheduled in the future” tasks). CT normally picks up tasks in < 1m, so if a task is not started, that could mean that the CT poller is down (see below) or that something is wrong with the CT framework possibly related to a recent push.SwarmingLogs
link shown in the “Task Details.” If build_chromium
has been running for > 1h, something is probably wrong.