storj

Author	SHA1	Message	Date
paul cannon	72189330fd	satellite/gracefulexit: revamp graceful exit Currently, graceful exit is a complicated subsystem that keeps a queue of all pieces expected to be on a node, and asks the node to transfer those pieces to other nodes one by one. The complexity of the system has, unfortunately, led to numerous bugs and unexpected behaviors. We have decided to remove this entire subsystem and restructure graceful exit as follows: * Nodes will signal their intent to exit gracefully * The satellite will not send any new pieces to gracefully exiting nodes * Pieces on gracefully exiting nodes will be considered by the repair subsystem as "retrievable but unhealthy". They will be repaired off of the exiting node as needed. * After one month (with an appropriately high online score), the node will be considered exited, and held amounts for the node will be released. The repair worker will continue to fetch pieces from the node as long as the node stays online. * If, at the end of the month, a node's online score is below a certain threshold, its graceful exit will fail. Refs: https://github.com/storj/storj/issues/6042 Change-Id: I52d4e07a4198e9cb2adf5e6cee2cb64d6f9f426b	2023-09-27 08:40:01 +00:00
Ivan Fraixedes	7fb86617fc	satellite/satellitedb: Use CRDB AS OF SYSTEM & batch for GE Use the 'AS OF SYSTEM TIME' Cockroach DB clause for the Graceful Exit (a.k.a GE) queries that count the delete the GE queue items of nodes which have already exited the network. Split the subquery used for deleting all the transfer queue items of nodes which has exited when CRDB is used and batch the queries because CRDB struggles when executing in a single query unlike Postgres. The new test which has been added to this commit to verify the CRDB batch logic for deleting all the transfer queue items of the exited nodes has raised that the Enqueue method has to run in baches when CRDB is used otherwise CRDB has return the error "driver: bad connection" when a big a amount of items are passed to be enqueued. This error didn't happen with the current test implementation it was with an initial one that it was creating a big amount of exited nodes and transfer queue items for those nodes. Change-Id: I6a099cdbc515a240596bc93141fea3182c2e50a9	2021-05-07 13:09:19 -04:00
Natalie Villasana	856db68fd9	satellite/gracefulexit: extend GE data cleanup to include exit_progress The new 'consistency ge-cleanup-orphaned-data' cli command deleted orphaned transfer queue items, but not entries in the graceful_exit_progress table. This will delete orphaned entries from the exit progress table too. Change-Id: I5f927aac1f258490678deaf179be92ccfe10fcd8	2021-03-01 15:52:43 +00:00
Ivan Fraixedes	076804eac9	cmd/satellite: Add command for GE data cleanup Add a command to the satellite for cleaning up the Graceful Exit (a.k.a GE) transfer queue items of nodes that have exited. The commit adds to the GE satellite DB a couple of new methods, and its corresponding test, for performing the operations of the new command. Change-Id: I29a572a59689d63b24990ac13c52e76d65aaa917	2021-02-01 17:30:58 +00:00
Ethan Adams	f90ea10a4a	Allow for DB application names per process. (#3983 )	2020-12-04 11:24:39 +01:00
Egon Elbre	9b2e00a38b	satellite: pass ctx into satellitedb.Open Opening a database requires ctx, this is first step to passing ctx to the appropriate level. Change-Id: Ic303e69f868ef3449ae36377937a29670cf635e2	2020-10-29 06:38:37 +00:00
Egon Elbre	080ba47a06	all: fix dots Change-Id: I6a419c62700c568254ff67ae5b73efed2fc98aa2	2020-07-16 14:58:28 +00:00
Egon Elbre	11a44cdd88	all: don't depend on gogo/proto directly Change-Id: I8822dea0d1b7b99e0b828e0373a0308a42dde2be	2020-04-08 17:32:15 +00:00
Jeff Wendling	77fd41a02e	satellite: add an expiring lru cache around api keys Change-Id: I995429c66affd33da59b091f28f09ca122070b5e	2020-01-09 22:13:41 -07:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Ethan Adams	9420fa9fc5	satellite/gracefulexit: Add graceful exit completed/failed receipt verification to satellite CLI (#3679 )	2019-12-03 17:09:39 -05:00
Maximillian von Briesen	abb567f6ae	cmd/satellite: add graceful exit reports command to satellite CLI (#3300 ) * update lock file and add comment * add created at and bytes transferred * cleanup * rename db func to GetGracefulExitNodesByTimeFrame * fix flag * split into two overlay functions * := to = * fix test * add node not found error class * fix overlay test * suggested test changes * review suggestions * get exit status from overlay.Get() * check rows.Err * fix panic when ExitFinishedAt is nil * fix comments in cmdGracefulExit	2019-10-22 21:06:01 -04:00

12 Commits