storj/satellite
paul cannon 7f1cad6faf satellite/repair: better handling of piece fetch errors
We have an alert on `repair_too_many_nodes_failed` which fires too
frequently. Every time so far, it has been because of a network blip of
some nature on the satellite side.

Satellite operators are expected to have other means in place for
alerting on network problems and fixing them, so it's not necessary for
the repair framework to act in that way.

Instead, in this change, we change the way that
`repair_too_many_nodes_failed` works. When a repair fails, we collect
piece fetch errors by type and determine from them whether it looks like
we are having network problems (most errors are connection failures,
possibly also some successful connections which subsequently time out)
or whether something else has happened.

We will now only emit `repair_too_many_nodes_failed` when the outcome
does not look like a network failure. In the network failure case, we
will instead emit `repair_suspected_network_problem`.

Refs: https://github.com/storj/storj/issues/4669

Change-Id: I49df98da5df9c606b95ad08a2bdfec8092fba926
2022-09-23 09:35:06 +00:00
..
accounting satellite/accounting: use single call to get project limits 2022-09-13 21:50:52 +00:00
admin satellite/admin: add Burst field to Update Project Limits params 2022-08-17 10:40:28 -04:00
analytics satellite/analytics: Re-added user click events in Access Management flow (#5151) 2022-09-06 09:55:33 -07:00
attribution {cmd/satellite/reports, satellite/attribution}: type and variable name adjustments 2022-04-26 20:12:38 +00:00
audit satellite/repair: move over audit.Pieces 2022-09-22 16:43:03 +00:00
buckets satellite/metainfo: propagate geofencing between buckets and stream id 2021-11-24 08:05:05 +00:00
compensation all: reformat comments as required by gofmt 1.19 2022-08-10 18:24:55 +00:00
console web/satellite: add captcha verification to forgot password form 2022-09-22 15:20:04 +00:00
contact satellite/contact: swap net.IP.IsPrivateIP with isPrivateIP 2022-06-13 01:01:44 +02:00
gc satellite/gc/bloomfilter: use int64 to count pieces 2022-09-22 09:39:53 +00:00
geoip satellite/geoip: update node check-in to associate a country code 2021-11-10 16:44:41 +01:00
gracefulexit satellite: fix segment loop observers metrics 2022-08-10 14:13:16 +00:00
inspector {satellite/metabase, satellite/metainfo, satellite/inspector} : Use metabase.GetObjectLastCommitted instead metabase.GetObjectExactVersion 2022-09-08 07:27:22 +00:00
internalpb satellite/internalpb: new protobuf for retain filters 2022-08-30 12:37:40 +02:00
mailservice satellite/consoleweb: fix flaky TestAuth tests 2022-08-04 19:06:07 +00:00
metabase satellite/metabase: don't count objects for loop 2022-09-22 16:02:08 +00:00
metainfo satellite/metainfo: close project while testing 2022-09-23 07:45:38 +00:00
metrics satellite: fix segment loop observers metrics 2022-08-10 14:13:16 +00:00
nodeapiversion satellite/nodeapiversion: new table for tracking node api usage 2020-07-09 15:02:25 +00:00
nodeselection/uploadselection satellite/repairer: handle excluded countries 2022-03-14 10:59:36 -04:00
nodestats satellite: return interval_end_time in DailyStorageUsage endpoint 2022-07-27 18:24:27 +00:00
oidc satellite/oidc: include nodeURL in oidc configuration 2022-09-13 13:46:30 +00:00
orders satellite/{orders,overlay}: use cache for downloads 2022-07-12 11:04:34 +00:00
overlay satellite/gc/bloomfilter: use int64 to count pieces 2022-09-22 09:39:53 +00:00
payments satellite/payments/stripecoinpayments: update invoicing to use credit note 2022-09-16 13:40:38 +00:00
repair satellite/repair: better handling of piece fetch errors 2022-09-23 09:35:06 +00:00
reputation satellite/reputation: reconfigure lambda and alpha 2022-08-17 18:52:53 +00:00
revocation satellite/satellitedb: move tests to their domains 2021-02-19 17:29:15 +02:00
rewards satellite/rewards: adding SeaweedFS to partners list (#4230) 2021-10-19 21:30:31 +02:00
satellitedb satellite/gc/bloomfilter: use int64 to count pieces 2022-09-22 09:39:53 +00:00
snopayouts all: fix error naming 2021-04-29 15:38:21 +03:00
admin.go satellite/payments/stripecoinpayments: storjscan invoice generation 2022-08-03 13:24:26 -07:00
api.go satellitedb: add token balance to API endpoint 2022-08-31 22:55:16 +00:00
configlock_test.go all: fix linter complaints 2020-10-13 15:59:01 +03:00
core.go satellite/payments/storjscan: add billing update to chore 2022-08-16 11:18:15 -07:00
gc-bf.go satellite/gc/bloomfilter: add service to collect bloom filters 2022-09-02 08:10:46 +00:00
gc.go satellite/gc/sender: new service to send retain filters 2022-09-20 11:49:40 +00:00
peer.go satellite/gc/sender: new service to send retain filters 2022-09-20 11:49:40 +00:00
repairer.go satellite/reputation: offset write times by random, not by satelliteID 2022-08-03 21:14:06 +00:00