blob: b7e7a70ad78580249bd50f5d4ad01cbfe6e52e45 [file] [view]
Monitoring (Grains, Prober and Alert Server)
============================================
The monitoring server runs InfluxDB to accept and manage timeseries data and
uses Grafana to construct dashboards for that data. InfluxDB has a module to
make it compatible with Graphite/Carbon, which we used to use to store
timeseries data before InfluxDB. Our servers still upload metrics using this
Graphite/Carbon API, so you'll see mentions of Graphite or Carbon here and
there.
Logs for all applications are served from skiamonitor.com:10115 which is
restricted to internal IPs only.
Full Server Setup
=================
Do once
-------
$ ./vm_create_instance.sh
$ ./vm_setup_instance.sh
Make sure to 'set daemon 2' in /etc/monit/monitrc so that monit
runs every 2 seconds.
Make sure to log in InfluxDB at port 10117 and create the 'graphite' and
'grafana' databases. Username and Password should also be set according to
valentine.
Once that is done then set the Metadata for the instance using
cloud.google.com/console, see below:
Do on update
------------
$ ./vm_push_update.sh
Notes
-----
To SSH into the instance:
gcutil --project=google.com:skia-buildbots ssh --ssh_user=default skia-monitoring
If you need to modify the constants for the vm_XXX.sh scripts they are
specified in vm_config.sh.