Yihau Chen 5e9bc2312b bump solana-transaction to 3.0.2, solana-hash to 3.1.0, solana-sha256-hasher to 3.1.0 (#9145) vor 1 Tag
..
src 5af914802e Move solana-logger back to agave as agave-logger (#8622) vor 3 Wochen
.gitignore 2db28cae41 Add solana-watchtower program vor 6 Jahren
Cargo.toml 5e9bc2312b bump solana-transaction to 3.0.2, solana-hash to 3.1.0, solana-sha256-hasher to 3.1.0 (#9145) vor 1 Tag
README.md a2d7eadfae Support multiple RPC URLs in watchtower (#4748) vor 8 Monaten

README.md

The agave-watchtower program is used to monitor the health of a cluster. It periodically polls the cluster over an RPC API to confirm that the transaction count is advancing, new blockhashes are available, and no validators are delinquent. Results are reported as InfluxDB metrics, with an optional push notification on sanity failure.

If you only care about the health of several specific validators, the --validator-identity command-line argument can be used to restrict failure notifications to issues only affecting that set of validators.

User can provide either 1 or 3 RPC URLs for the cluster via the --url or --urls command-line arguments respectively. 2 URLs are not accepted because it's not enough to have redundnacy, and more than 3 URLs are not accepted because there's little benefit from having more than 3. If 3 URLs are provided, at least 2 of them have to confirm health of a cluster.

Metrics

watchtower-sanity

On every iteration this data point will be emitted indicating the overall result using a boolean ok field.

watchtower-sanity-failure

On failure this data point contains details about the specific test that failed via the following fields:

  • test: name of the sanity test that failed
  • err: exact sanity failure message