2021-06-25 17:21:30 +01:00
|
|
|
storj.io/storj/private/lifecycle."slow_shutdown" Event
|
|
|
|
storj.io/storj/private/lifecycle."unexpected_shutdown" Event
|
2021-04-22 18:32:42 +01:00
|
|
|
storj.io/storj/satellite/accounting."bucket_bytes" IntVal
|
|
|
|
storj.io/storj/satellite/accounting."bucket_objects" IntVal
|
|
|
|
storj.io/storj/satellite/accounting."bucket_segments" IntVal
|
|
|
|
storj.io/storj/satellite/accounting."total_bytes" IntVal
|
|
|
|
storj.io/storj/satellite/accounting."total_objects" IntVal
|
|
|
|
storj.io/storj/satellite/accounting."total_segments" IntVal
|
2021-01-25 20:07:24 +00:00
|
|
|
storj.io/storj/satellite/accounting/tally."nodetallies.totalsum" IntVal
|
2019-11-19 16:30:28 +00:00
|
|
|
storj.io/storj/satellite/audit."audit_contained_nodes" IntVal
|
|
|
|
storj.io/storj/satellite/audit."audit_contained_nodes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."audit_contained_percentage" FloatVal
|
|
|
|
storj.io/storj/satellite/audit."audit_fail_nodes" IntVal
|
|
|
|
storj.io/storj/satellite/audit."audit_fail_nodes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."audit_failed_percentage" FloatVal
|
satellite/audit: better handling of piece fetch errors
We have an alert on `not_enough_shares_for_audit` which fires too
frequently. Every time so far, it has been because of a network blip of
some nature on the satellite side.
Satellite operators are expected to have other means in place for
alerting on network problems and fixing them, so it's not necessary for
the audit framework to act in that way.
Instead, in this change, we add three new metrics,
`audit_not_enough_nodes_online`, `audit_not_enough_shares_acquired`, and
`audit_suspected_network_problem`. When an audit fails, and emits
`not_enough_shares_for_audit`, we will now determine whether it looks
like we are having network problems (most errors are connection
failures, possibly also some successful connections which subsequently
time out) or whether something else has happened.
After this is deployed, we can remove the alert on
`not_enough_shares_for_audit` and add new alerts on
`audit_not_enough_nodes_online` and `audit_not_enough_shares_acquired`.
`audit_suspected_network_problem` does not need an alert.
Refs: https://github.com/storj/storj/issues/4669
Change-Id: Ibb256bc19d2578904f71f5229111ac98e5212fcb
2022-09-27 00:47:03 +01:00
|
|
|
storj.io/storj/satellite/audit."audit_not_enough_nodes_online" Counter
|
|
|
|
storj.io/storj/satellite/audit."audit_not_enough_shares_acquired" Counter
|
2019-11-19 16:30:28 +00:00
|
|
|
storj.io/storj/satellite/audit."audit_offline_nodes" IntVal
|
|
|
|
storj.io/storj/satellite/audit."audit_offline_nodes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."audit_offline_percentage" FloatVal
|
|
|
|
storj.io/storj/satellite/audit."audit_success_nodes" IntVal
|
|
|
|
storj.io/storj/satellite/audit."audit_success_nodes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."audit_successful_percentage" FloatVal
|
satellite/audit: better handling of piece fetch errors
We have an alert on `not_enough_shares_for_audit` which fires too
frequently. Every time so far, it has been because of a network blip of
some nature on the satellite side.
Satellite operators are expected to have other means in place for
alerting on network problems and fixing them, so it's not necessary for
the audit framework to act in that way.
Instead, in this change, we add three new metrics,
`audit_not_enough_nodes_online`, `audit_not_enough_shares_acquired`, and
`audit_suspected_network_problem`. When an audit fails, and emits
`not_enough_shares_for_audit`, we will now determine whether it looks
like we are having network problems (most errors are connection
failures, possibly also some successful connections which subsequently
time out) or whether something else has happened.
After this is deployed, we can remove the alert on
`not_enough_shares_for_audit` and add new alerts on
`audit_not_enough_nodes_online` and `audit_not_enough_shares_acquired`.
`audit_suspected_network_problem` does not need an alert.
Refs: https://github.com/storj/storj/issues/4669
Change-Id: Ibb256bc19d2578904f71f5229111ac98e5212fcb
2022-09-27 00:47:03 +01:00
|
|
|
storj.io/storj/satellite/audit."audit_suspected_network_problem" Counter
|
2019-11-19 16:30:28 +00:00
|
|
|
storj.io/storj/satellite/audit."audit_total_nodes" IntVal
|
|
|
|
storj.io/storj/satellite/audit."audit_total_nodes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."audit_total_pointer_nodes" IntVal
|
|
|
|
storj.io/storj/satellite/audit."audit_total_pointer_nodes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."audit_unknown_nodes" IntVal
|
|
|
|
storj.io/storj/satellite/audit."audit_unknown_nodes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."audit_unknown_percentage" FloatVal
|
|
|
|
storj.io/storj/satellite/audit."audited_percentage" FloatVal
|
2021-08-16 21:11:45 +01:00
|
|
|
storj.io/storj/satellite/audit."could_not_verify_audit_shares" Counter
|
satellite/audit: better handling of piece fetch errors
We have an alert on `not_enough_shares_for_audit` which fires too
frequently. Every time so far, it has been because of a network blip of
some nature on the satellite side.
Satellite operators are expected to have other means in place for
alerting on network problems and fixing them, so it's not necessary for
the audit framework to act in that way.
Instead, in this change, we add three new metrics,
`audit_not_enough_nodes_online`, `audit_not_enough_shares_acquired`, and
`audit_suspected_network_problem`. When an audit fails, and emits
`not_enough_shares_for_audit`, we will now determine whether it looks
like we are having network problems (most errors are connection
failures, possibly also some successful connections which subsequently
time out) or whether something else has happened.
After this is deployed, we can remove the alert on
`not_enough_shares_for_audit` and add new alerts on
`audit_not_enough_nodes_online` and `audit_not_enough_shares_acquired`.
`audit_suspected_network_problem` does not need an alert.
Refs: https://github.com/storj/storj/issues/4669
Change-Id: Ibb256bc19d2578904f71f5229111ac98e5212fcb
2022-09-27 00:47:03 +01:00
|
|
|
storj.io/storj/satellite/audit."not_enough_shares_for_audit" Counter
|
2019-11-19 16:30:28 +00:00
|
|
|
storj.io/storj/satellite/audit."reverify_contained_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."reverify_fails_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."reverify_offlines_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."reverify_successes_global" Meter
|
|
|
|
storj.io/storj/satellite/audit."reverify_unknown_global" Meter
|
2020-02-26 21:19:58 +00:00
|
|
|
storj.io/storj/satellite/audit."verify_shares_downloaded_successfully" IntVal
|
2022-05-24 19:18:52 +01:00
|
|
|
storj.io/storj/satellite/console."create_user_attempt" Counter
|
|
|
|
storj.io/storj/satellite/console."create_user_captcha_error" Counter
|
|
|
|
storj.io/storj/satellite/console."create_user_captcha_unsuccessful" Counter
|
|
|
|
storj.io/storj/satellite/console."create_user_duplicate_unverified" Counter
|
|
|
|
storj.io/storj/satellite/console."create_user_duplicate_verified" Counter
|
|
|
|
storj.io/storj/satellite/console."create_user_success" Counter
|
|
|
|
storj.io/storj/satellite/console."login_attempt" Counter
|
|
|
|
storj.io/storj/satellite/console."login_email_invalid" Counter
|
|
|
|
storj.io/storj/satellite/console."login_email_unverified" Counter
|
|
|
|
storj.io/storj/satellite/console."login_failed" Counter
|
|
|
|
storj.io/storj/satellite/console."login_invalid_password" Counter
|
|
|
|
storj.io/storj/satellite/console."login_locked_out" Counter
|
|
|
|
storj.io/storj/satellite/console."login_lockout_initiated" Counter
|
|
|
|
storj.io/storj/satellite/console."login_lockout_reinitiated" Counter
|
|
|
|
storj.io/storj/satellite/console."login_mfa_conflict" Counter
|
|
|
|
storj.io/storj/satellite/console."login_mfa_missing" Counter
|
|
|
|
storj.io/storj/satellite/console."login_mfa_passcode_failure" Counter
|
|
|
|
storj.io/storj/satellite/console."login_mfa_passcode_success" Counter
|
|
|
|
storj.io/storj/satellite/console."login_mfa_recovery_failure" Counter
|
|
|
|
storj.io/storj/satellite/console."login_mfa_recovery_success" Counter
|
|
|
|
storj.io/storj/satellite/console."login_success" Counter
|
2022-07-21 14:31:38 +01:00
|
|
|
storj.io/storj/satellite/console."login_user_captcha_error" Counter
|
|
|
|
storj.io/storj/satellite/console."login_user_captcha_unsuccessful" Counter
|
2022-05-24 19:18:52 +01:00
|
|
|
storj.io/storj/satellite/console."login_user_failed_count" IntVal
|
2020-12-02 22:17:59 +00:00
|
|
|
storj.io/storj/satellite/contact."failed_dial" Event
|
|
|
|
storj.io/storj/satellite/contact."failed_ping_node" Event
|
2019-11-08 18:57:51 +00:00
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_fail_max_failures_percentage" Meter
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_fail_validation" Meter
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_final_bytes_transferred" IntVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_final_pieces_failed" IntVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_final_pieces_succeess" IntVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_init_node_age_seconds" FloatVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_init_node_audit_success_count" IntVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_init_node_audit_total_count" IntVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_init_node_piece_count" IntVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_success" Meter
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_successful_pieces_transfer_ratio" IntVal
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_transfer_piece_fail" Meter
|
|
|
|
storj.io/storj/satellite/gracefulexit."graceful_exit_transfer_piece_success" Meter
|
2022-10-26 08:02:13 +01:00
|
|
|
storj.io/storj/satellite/metabase/rangedloop."rangedloop_error" Event
|
2021-07-29 09:28:23 +01:00
|
|
|
storj.io/storj/satellite/metabase/segmentloop."segmentloop_error" Event
|
2021-05-28 10:42:55 +01:00
|
|
|
storj.io/storj/satellite/metabase/segmentloop."segmentsProcessed" IntVal
|
|
|
|
storj.io/storj/satellite/metabase/segmentloop.*Service.RunOnce Task
|
2020-01-30 17:43:37 +00:00
|
|
|
storj.io/storj/satellite/metainfo."metainfo_rate_limit_exceeded" Event
|
2020-12-30 17:03:31 +00:00
|
|
|
storj.io/storj/satellite/metainfo/piecedeletion."delete_batch_size" IntVal
|
|
|
|
storj.io/storj/satellite/metainfo/piecedeletion."deletion_pieces_unhandled_count" IntVal
|
2021-07-30 16:31:20 +01:00
|
|
|
storj.io/storj/satellite/metrics."total_inline_bytes" IntVal
|
|
|
|
storj.io/storj/satellite/metrics."total_inline_segments" IntVal
|
|
|
|
storj.io/storj/satellite/metrics."total_remote_bytes" IntVal
|
|
|
|
storj.io/storj/satellite/metrics."total_remote_segments" IntVal
|
2019-12-04 21:24:36 +00:00
|
|
|
storj.io/storj/satellite/orders."download_failed_not_enough_pieces_uplink" Meter
|
2020-10-21 23:02:54 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."checker_injured_segment_health" FloatVal
|
2019-10-15 18:00:14 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."checker_segment_age" IntVal
|
2020-10-21 23:02:54 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."checker_segment_health" FloatVal
|
2019-10-15 18:00:14 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."checker_segment_healthy_count" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."checker_segment_time_until_irreparable" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."checker_segment_total_count" IntVal
|
2020-11-10 14:49:19 +00:00
|
|
|
storj.io/storj/satellite/repair/checker."checker_segments_below_min_req" Counter
|
2020-09-09 21:52:22 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."healthy_segments_removed_from_queue" IntVal
|
2020-05-22 20:54:05 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."new_remote_segments_needing_repair" IntVal
|
2019-10-15 18:00:14 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."remote_files_checked" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_files_lost" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_checked" IntVal
|
2020-03-10 13:59:29 +00:00
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_failed_to_check" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_healthy_percentage" FloatVal
|
2019-10-15 18:00:14 +01:00
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_lost" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_needing_repair" IntVal
|
2020-02-24 23:06:52 +00:00
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_over_threshold_1" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_over_threshold_2" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_over_threshold_3" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_over_threshold_4" IntVal
|
|
|
|
storj.io/storj/satellite/repair/checker."remote_segments_over_threshold_5" IntVal
|
2019-12-04 21:24:36 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."download_failed_not_enough_pieces_repair" Meter
|
2020-01-29 22:44:19 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."healthy_ratio_after_repair" FloatVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."healthy_ratio_before_repair" FloatVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_attempts" Meter
|
2021-07-20 15:41:38 +01:00
|
|
|
storj.io/storj/satellite/repair/repairer."repair_bytes_downloaded" Meter
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_bytes_uploaded" Meter
|
2020-01-29 22:44:19 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."repair_failed" Meter
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_nodes_unavailable" Meter
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_partial" Meter
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_segment_pieces_canceled" IntVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_segment_pieces_failed" IntVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_segment_pieces_successful" IntVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_segment_pieces_total" IntVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_segment_size" IntVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."repair_success" Meter
|
2022-09-19 22:16:48 +01:00
|
|
|
storj.io/storj/satellite/repair/repairer."repair_suspected_network_problem" Meter
|
2020-02-24 20:13:12 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."repair_too_many_nodes_failed" Meter
|
2020-01-29 22:44:19 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."repair_unnecessary" Meter
|
2020-11-10 14:49:19 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."repairer_segments_below_min_req" Counter
|
2020-02-24 20:13:12 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."segment_deleted_before_repair" Meter
|
2020-01-29 22:44:19 +00:00
|
|
|
storj.io/storj/satellite/repair/repairer."segment_repair_count" IntVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."segment_time_until_repair" IntVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."time_for_repair" FloatVal
|
|
|
|
storj.io/storj/satellite/repair/repairer."time_since_checker_queue" FloatVal
|
2020-08-04 21:28:46 +01:00
|
|
|
storj.io/storj/satellite/satellitedb."audit_online_score" FloatVal
|
2019-10-15 18:00:14 +01:00
|
|
|
storj.io/storj/satellite/satellitedb."audit_reputation_alpha" FloatVal
|
|
|
|
storj.io/storj/satellite/satellitedb."audit_reputation_beta" FloatVal
|
2020-10-15 17:00:08 +01:00
|
|
|
storj.io/storj/satellite/satellitedb."bad_audit_dqs" Meter
|
|
|
|
storj.io/storj/satellite/satellitedb."offline_dqs" Meter
|
2020-03-09 15:35:54 +00:00
|
|
|
storj.io/storj/satellite/satellitedb."unknown_audit_reputation_alpha" FloatVal
|
|
|
|
storj.io/storj/satellite/satellitedb."unknown_audit_reputation_beta" FloatVal
|
2020-10-15 17:00:08 +01:00
|
|
|
storj.io/storj/satellite/satellitedb."unknown_suspension_dqs" Meter
|
2019-11-26 18:54:30 +00:00
|
|
|
storj.io/storj/storage/filestore."open_file_in_trash" Meter
|
2019-11-19 21:11:31 +00:00
|
|
|
storj.io/storj/storagenode/contact."satellite_contact_request" Meter
|
|
|
|
storj.io/storj/storagenode/gracefulexit."satellite_gracefulexit_request" Meter
|
2020-06-09 21:58:48 +01:00
|
|
|
storj.io/storj/storagenode/piecestore/usedserials."delete_random_serial" Meter
|