storj

Author	SHA1	Message	Date
paul cannon	a06735c1b6	satellite/gracefulexit: add missing test cases These test cases are the parts of the testplan for the Graceful Exit Revamp which are automateable but not yet automated. I'm not entirely sure why we have to reject graceful exit from nodes that are suspended, but implementing that was probably easier than convincing everybody that it's not necessary. Refs: https://github.com/storj/storj/issues/6369 Change-Id: I0261b37f7e010d72d84332cde5dd8689f7c41580	2023-10-03 17:35:20 +00:00
paul cannon	72189330fd	satellite/gracefulexit: revamp graceful exit Currently, graceful exit is a complicated subsystem that keeps a queue of all pieces expected to be on a node, and asks the node to transfer those pieces to other nodes one by one. The complexity of the system has, unfortunately, led to numerous bugs and unexpected behaviors. We have decided to remove this entire subsystem and restructure graceful exit as follows: * Nodes will signal their intent to exit gracefully * The satellite will not send any new pieces to gracefully exiting nodes * Pieces on gracefully exiting nodes will be considered by the repair subsystem as "retrievable but unhealthy". They will be repaired off of the exiting node as needed. * After one month (with an appropriately high online score), the node will be considered exited, and held amounts for the node will be released. The repair worker will continue to fetch pieces from the node as long as the node stays online. * If, at the end of the month, a node's online score is below a certain threshold, its graceful exit will fail. Refs: https://github.com/storj/storj/issues/6042 Change-Id: I52d4e07a4198e9cb2adf5e6cee2cb64d6f9f426b	2023-09-27 08:40:01 +00:00
Michal Niewrzal	6ac5bf0d7c	satellite/gracefulexit: remove segments loop parts We are switching completely to ranged loop. https://github.com/storj/storj/issues/5368 Change-Id: Ia3e2d7879d91f7f5ffa99b8e8f108380e3b39f31	2023-04-24 15:00:26 +00:00
Márton Elek	9fbad53bbe	satellite/gracefulexit: extend GE tests using multple hash algo Change-Id: I0b7accd5c0ae9ebb7adc7e5b8545bab639d70d36	2023-04-19 11:05:23 +02:00
Egon Elbre	f5020de57c	storagenode/blobstore: move blob store logic The blobstore implementation is entirely related to storagenode, so the rightful place is together with the storagenode implementation. Fixes https://github.com/storj/storj/issues/5754 Change-Id: Ie6637b0262cf37af6c3e558556c7604d9dc3613d	2023-04-05 18:06:20 +00:00
Cameron	74ddfab810	satellite/overlay: insert DQ event into node events in overlay.DisqualifyNode Also, return node email from overlaycache db DisqualifyNode to be used in node events insertion Change-Id: I41534cf01351c1690c3966a8055c5fe6fcf0d6a6	2022-11-04 15:18:31 +00:00
Egon Elbre	cf92220c20	{satellite,storagenode}/gracefulexit: simplify limiter usage Change-Id: Ied7091fe5355b96d327e3f893c5bdd4946a9e6af	2022-08-04 08:18:15 +00:00
Yaroslav Vorobiov	3f47d19aa6	satellite/overlay: add disqualification reason Add disqualification reason to NodeDossier. Extend DB.DisqualifyNode with disqualification reason. Extend reputation Service.TestDisqualifyNode with disqualification reason. Change-Id: I8611b6340c7f42ac1bb8bd0fd7f0648ad650ab2d	2022-04-20 13:29:31 +00:00
Fadila Khadar	c00ecae75c	satellite/gracefulexit: stop using gracefulexit_transfer_queue Remove the logic associated to the old transfer queue. A new transfer queue (gracefulexit_segment_transfer_queue) has been created for migration to segmentLoop. Transfers from the old queue were not moved to the new queue. Instead, it was still used for nodes which have initiated graceful exit before migration. There is no such node left, so we can remove all this logic. In a next step, we will drop the table. Change-Id: I3aa9bc29b76065d34b57a73f6e9c9d0297587f54	2021-09-14 11:52:34 +00:00
Michał Niewrzał	c258f4bbac	private/testplanet: move Metabase outside Metainfo for satellite At some point we moved metabase package outside Metainfo but we didn't do that for satellite structure. This change refactors only tests. When uplink will be adjusted we can remove old entries in Metainfo struct. Change-Id: I2b66ed29f539b0ec0f490cad42c72840e0351bcb	2021-09-09 07:15:51 +00:00
Fadila Khadar	c4202b9451	satellite/gracefulexit: use graceful_exit_segment_transfer_queue For being able to use the segment metainfo loop, graceful exit transfers have to include the segment stream_id/position instead of the path. For this, we created a new table graceful_exit_segment_transfer_queue that will replace the graceful_exit_transfer_queue. The table has been created in a previous migration and made accessible through graceful exit db in another one. This changes makes graceful exit enqueue transfer items for new exiting nodes in the new table. Change-Id: I7bd00de13e749be521d63ef3b80c168df66b9433	2021-07-21 14:02:20 +00:00
Fadila Khadar	b0d98b1c1a	satellite/gracefulexit: allow use of graceful_exit_segment_transfer_queue For being able to use the segment metainfo loop, graceful exit transfers have to include the segment stream_id/position instead of the path. For this, we created a new table graceful_exit_segment_transfer_queue that will replace the graceful_exit_transfer_queue. The table has been created in a previous migration. This change gives access to this table. Graceful Exit doesn't use the table yet, this will be done in a next change. Change-Id: I6c09cff4cc45f0529813a8898ddb2d14aadb2cb8	2021-07-21 12:34:44 +00:00
Michał Niewrzał	7944df20d6	storj: use multipart API Change-Id: I10b401434e3e77468d12ecd225b41689568fd197	2021-04-26 13:15:09 +00:00
Egon Elbre	267506bb20	satellite/metabase: move package one level higher metabase has become a central concept and it's more suitable for it to be directly nested under satellite rather than being part of metainfo. metainfo is going to be the "endpoint" logic for handling requests. Change-Id: I53770d6761ac1e9a1283b5aa68f471b21e784198	2021-04-21 15:54:22 +03:00
Kaloyan Raev	035c393da0	satellite: update tests to pass etag.Reader to multipart.PutObjectPart Change-Id: Ibe99357945ae7a91f5b5d4f87b83d425c9fa84a5	2021-03-29 13:18:11 +00:00
Fadila Khadar	5dd76522af	gracefulexit: use GetSegmentByLocation instead of GetObjectLatestVersion This enables the transfer of pieces from an on-going multipart upload. Tests are also modified to take into account pending multipart uploads. See https://storjlabs.atlassian.net/browse/PG-161 Change-Id: I35d433c44dd6e618667e5e8f9f998ef867b9f1ad	2021-02-16 10:49:36 +00:00
Michal Niewrzal	18825d1e0b	satellite/{metainfo,gracefulexit}: fix failing tests Change-Id: I3428ea601255c36a316732c9f75135d6e5fa4d79	2020-12-21 12:22:32 +00:00
Michal Niewrzal	b3aa28cc02	satellite/gracefulexit: migrate to metabase Change-Id: I8be9cc68894124427e4a30d7631126b3afb1f281	2020-12-18 10:57:39 +00:00
Kaloyan Raev	53b7fd7b00	satellite/{audit,gracefulexit}: remove logic for PieceHashesVerified We now have the piece hashes verified for all segments on all production satellites. We can remove the code that handles the case where piece hashes are not verified. This would make easier the migration of services from PointerDB to the new metabase. For consistency, PieceHashesVerified is still set to true in PointerDB for new segments. Change-Id: Idf0ccce4c8d01ae812f11e8384a7221d90d4c183	2020-11-24 11:09:48 +02:00
Moby von Briesen	db6bc6503d	satellite/metainfo: Update metainfo RS config to more easily support multiple RS schemes. Make metainfo.RSConfig a valid pflag config value. This allows us to configure the RSConfig as a string like k/m/o/n-shareSize, which makes having multiple supported RS schemes easier in the future. RS-related config values that are no longer needed have been removed (MinTotalThreshold, MaxTotalThreshold, MaxBufferMem, Verify). Change-Id: I0178ae467dcf4375c504e7202f31443d627c15e1	2020-11-09 22:16:13 +00:00
Egon Elbre	76f4619a9c	{satellite,storagenode}/gracefulexit: ensure client is closed Change-Id: I576a955a5578caf7fcbee832beca28cef2b0c83e	2020-10-27 23:27:07 +02:00
Egon Elbre	9adde49e1a	satellite/gracefulexit: ensure test doesn't timeout on failure Change-Id: Id004f8a075592ffc19b12a9d666058b60cb7724d	2020-10-26 21:16:48 +02:00
paul cannon	76d4977b6a	storagenode/gracefulexit: logic moved from worker to service Change-Id: I8b12606a96b712050bf40d587664fb1b2c578fbc	2020-10-22 23:19:30 +00:00
Michal Niewrzal	aa47e70f03	satellite/metainfo: use metabase.SegmentKey with metainfo.Service Instead of using string or []byte we will be using dedicated type SegmentKey. Change-Id: I6ca8039f0741f6f9837c69a6d070228ed10f2220	2020-09-03 15:11:32 +00:00
paul cannon	7395dd1e6e	storagenode/gracefulexit: revalidate existing pieces ..before they are transferred to another node and submitted to the satellite as successful piece transfers, because if we submit an invalid signature, the node will be marked as a cheater and disqualified immediately. These signatures should have been validated when the piece was originally stored, but bitrot does happen and needn't be cause for an immediate DQ. Change-Id: I8b0ebd5812ea8a2e60766005b7251fbb74ef7857	2020-05-28 09:50:14 -05:00
Egon Elbre	94b2b315f7	storagenode/trust: refactor GetAddress to GetNodeURL Most places now need the NodeURL rather than the ID and Address separately. This simplifies code in multiple places. Change-Id: I52621d8ca52296a8b5bf7afbc1001cf8bfb44239	2020-05-20 11:05:15 +00:00
Egon Elbre	ed627144ed	all: use DialNodeURL throughout the codebase Change-Id: Iaf9ae3aeef7305c937f2660c929744db2d88776c	2020-05-20 10:36:30 +00:00
Egon Elbre	ec589a8289	all: fix comments about grpc Change-Id: Id830fbe2d44f083c88765561b6c07c5689afe5bd	2020-05-11 13:05:34 +03:00
Egon Elbre	bcd93ee375	private/testplanet: add StopNodeAndUpdate This was commonly used and code with it can be simplified. Change-Id: I2f2b91f7de54269aee6ef027f97f9e8a7d222e39	2020-05-08 13:02:19 +00:00
Egon Elbre	c630cf2490	storagenode/pieces: implement buffering for writing Currently uploads can cause a lot of IOPS, reduce this by introducing a in-memory buffer on-top of the file. Change-Id: I5f4e3e01c0a36258271d180b922107de447bcb59	2020-05-04 06:01:32 +00:00
Egon Elbre	11a44cdd88	all: don't depend on gogo/proto directly Change-Id: I8822dea0d1b7b99e0b828e0373a0308a42dde2be	2020-04-08 17:32:15 +00:00
Egon Elbre	e1a443b04a	private/testplanet: allow modifying created database Instead of providing the database from outside to testplanet create it inside and then allow wrapping and modifying it. This is more convenient to use. Change-Id: I9b8f69e6e0a19ff984b4e2bfe927c9100c77bc6c	2020-03-27 19:14:48 +00:00
Egon Elbre	e8f18a2cfe	private/testplanet: expose storagenode and satellite Config Change-Id: I80fe7ed8ef7356948879afcc6ecb984c5d1a6b9d	2020-03-27 17:01:25 +02:00
Yingrong Zhao	b7b19289d1	bump storj.io/common to latest Change-Id: I16e337660ce8e1ef332cc842dbf4cfa067b9b98b	2020-03-25 09:08:40 -04:00
Michal Niewrzal	426c8eb31a	private/testplanet: add DeleteBucket method for uplink New method added to be able to delete easily bucket during tests. Change-Id: Iaae89618cc676ddbbbd4b0df2eeacd143ea6f3c2	2020-02-11 15:58:13 +00:00
Michal Niewrzal	6502454947	satellite/metainfo: move RS configuration to satellite With this change RS configuration will be set on satellite. Uplink with get RS values with BeginObject request and will use it. For backward compatibility and to avoid super large change redundancy scheme stored with bucket is not touched. This can be done in future. Change-Id: Ia5f76fc10c37e2c44e4f7b8754f28eafe1f97eff	2020-01-22 09:33:53 +00:00
Yingrong Zhao	76ee8a1b4c	satellite: remove UptimeReputation configs from codebase With the new storage node downtime tracking feature, we need remove current uptime reputation configs: UptimeReputationAlpha, UptimeReputationBeta, and UptimeReputationDQ. This is the first step of removing the uptime reputation columns from satellitedb Change-Id: Ie8fab13295dbf545e33aeda0c4306cda4ba54e36	2020-01-08 18:54:15 +00:00
Natalie Ventura Villasana	1cb0f80a8d	satellite/gracefulexit: dq node on exit fail Disqualifies a node when the node fails to complete a graceful exit. Adds a new DisqualifyNode method to the overlay cache, since there wasn't an existing method to disqualify a node but do nothing else to its stats. Adds checks to existing tests to make sure that a storage node that fails a graceful exit is marked as disqualified in the overlay cache. https: //storjlabs.atlassian.net/browse/V3-3342 Change-Id: I4d554a519ab59db31ad3b8e28764c8683a6e3888	2020-01-06 19:16:26 -05:00
Egon Elbre	2680bae88c	private/testplanet: remove dependency to uplink Remove direct dependency on uplink.RSConfig, this simplifies moving the config file without introducing weird dependencies. Change-Id: I7fd2a145401e0205d7047631df9d2810241efeec	2020-01-02 09:40:46 +00:00
Natalie Ventura Villasana	aa3e183c2e	satellite/gracefulexit: add ge eligibility check Adds check to see if storage nodes are eligible to initiate graceful exit, by checking their CreatedAt date and seeing if their "age" is greater than the new config value: NodeMinAgeInMonths The default for this value is 6 months for now. https://storjlabs.atlassian.net/browse/V3-3357 Change-Id: Ib807ab8987ddb5a38a27a83886490f73fe8c5816	2019-12-31 09:31:58 -05:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Egon Elbre	d55288cf68	pkg/rpc: replace methods with direct calls to pb Change-Id: I8bd015d8d316a2c12c1daceca1d9fd257f6f57bc	2019-12-22 17:12:43 +02:00
Ethan	b959ccbae6	satellite/gracefulexit: Use proper rpc status codes for disqualified nodes and too many connections Change-Id: I41380026175e7678c7cd3d44211de8eb86ce4d0f	2019-12-20 19:05:28 +00:00
Egon Elbre	afe05edff2	{storagenode,satellite}/gracefulexit: ensure workers finish their work Fixes a data race caused by not waiting for workers to finish before shutting down. Currently this ended up failing logging because it was closed when test tried to write to it. Change-Id: I074045cd83bbf49e658f51353aa7901e9a5d074b	2019-12-17 17:21:52 +02:00
Egon Elbre	ee6c1cac8a	private: rename internal to private (#3573 )	2019-11-14 21:46:15 +02:00
Natalie Villasana	1a9757a7f2	satellite/gracefulexit: add count for order limits sent from satellite to exiting node (#3544 )	2019-11-13 09:54:50 -05:00
Yingrong Zhao	6331f839ae	satellite/gracefulexit: not allow disqualified node to graceful exit (#3493 )	2019-11-07 12:19:34 -05:00
Ethan Adams	f3dccb56b1	satellite/gracefulexit: Check if pointer has been overwritten or deleted before sending transfer message. (#3481 )	2019-11-07 11:13:05 -05:00
Egon Elbre	cc032d3151	satellite/metainfo: fix some uses of metainfo.Delete (#3513 ) * satellite/metainfo: rename Delete to UnsynchronizedDelete * fix deletes * make db private * fix typos * also verify on commit object	2019-11-06 18:02:14 +01:00
Ethan Adams	2eb0cc56fe	satellite/gracefulexit: Check if node already has a piece in the pointer (#3434 )	2019-11-05 14:13:45 -05:00

1 2

66 Commits