storj

Author	SHA1	Message	Date
Michal Niewrzal	6cc2052f47	satellite: fix segment loop observers metrics We made optimization for segment loop observers to avoid heavy monkit initialization on each call. It was applied to very often executed methods. Unfortunately we used wrong monkit method to track function times. Instead mon.Task we used mon.Func(). https://github.com/spacemonkeygo/monkit#how-it-works Change-Id: I9ca454dbd828c6b43ba09ca75c341991d2fd73a8	2022-08-10 14:13:16 +00:00
Egon Elbre	cf92220c20	{satellite,storagenode}/gracefulexit: simplify limiter usage Change-Id: Ied7091fe5355b96d327e3f893c5bdd4946a9e6af	2022-08-04 08:18:15 +00:00
Michał Niewrzał	7a2d2a36ca	satellite: use more optimal monkit call for loop observers methods Recently we applied this optimization to metrics observer and time used by its method dropped from 12m to 3m for us1 (220m segments). It looks that it make sense to apply the same code to all observers. Change-Id: I05898aaacbd9bcdf21babc7be9955da1db57bdf2	2022-05-20 11:03:41 +00:00
Yaroslav Vorobiov	3f47d19aa6	satellite/overlay: add disqualification reason Add disqualification reason to NodeDossier. Extend DB.DisqualifyNode with disqualification reason. Extend reputation Service.TestDisqualifyNode with disqualification reason. Change-Id: I8611b6340c7f42ac1bb8bd0fd7f0648ad650ab2d	2022-04-20 13:29:31 +00:00
paul cannon	3f3f028c88	satellite/gracefulexit: don't mark GE done when it's not done When an api server is processing a graceful exit (node is connected and getting lists of pieces to transfer), and the api server is shut down, it was incorrectly marking all pending graceful exits as complete. The GE then either passed or failed depending on the ratio of successfully transferred pieces to unsuccessful pieces. In at least one case, _no_ pieces were transferred at all before the GE was marked a success. Change-Id: I62cfab54a2296572c2e654eb460b62f772b7a60b	2022-04-19 14:00:29 +00:00
Michał Niewrzał	623cb16b6e	satellite/gracefulexit: test GE with copies Test case to verify if server-side copy doesn't affect GE in any negative way. Fixes https://github.com/storj/storj/issues/4699 Change-Id: I8c385767cca61499d46d9cb8de7318c56e5d7397	2022-04-12 15:59:14 +00:00
Fadila Khadar	c00ecae75c	satellite/gracefulexit: stop using gracefulexit_transfer_queue Remove the logic associated to the old transfer queue. A new transfer queue (gracefulexit_segment_transfer_queue) has been created for migration to segmentLoop. Transfers from the old queue were not moved to the new queue. Instead, it was still used for nodes which have initiated graceful exit before migration. There is no such node left, so we can remove all this logic. In a next step, we will drop the table. Change-Id: I3aa9bc29b76065d34b57a73f6e9c9d0297587f54	2021-09-14 11:52:34 +00:00
Michał Niewrzał	c258f4bbac	private/testplanet: move Metabase outside Metainfo for satellite At some point we moved metabase package outside Metainfo but we didn't do that for satellite structure. This change refactors only tests. When uplink will be adjusted we can remove old entries in Metainfo struct. Change-Id: I2b66ed29f539b0ec0f490cad42c72840e0351bcb	2021-09-09 07:15:51 +00:00
Clement Sam	f06e7c5f60	segment/{metabase,repair}: add dedicated methods on metabase.Pieces This change adds dedicated methods on metabase.Pieces to be able to add, remove pieces and also to check duplicates. Change-Id: I21aaeff40c017c2ebe1cc85a864ae546754769cc	2021-08-03 15:12:03 +00:00
Yingrong Zhao	e91574cee1	satellite/{reputation, gracefulexit}: use reputation store in gracefulexit With the effort to move audit related data into reputation store, this PR updates gracefulexit endpoint to use reputation service to get a node's audit score Change-Id: Iad93ea689ad67ff9c57c7be16687e21e715fab7a	2021-07-28 13:21:41 -04:00
Fadila Khadar	6d60d412f0	satellite/gracefulexit: use segment loop Join the segment loop instead of the metainfo loop, to iterate only over segments. Change-Id: I06259d363b98d4e191f2bf2d82c9b47255ee484a	2021-07-21 15:12:25 +00:00
Fadila Khadar	c4202b9451	satellite/gracefulexit: use graceful_exit_segment_transfer_queue For being able to use the segment metainfo loop, graceful exit transfers have to include the segment stream_id/position instead of the path. For this, we created a new table graceful_exit_segment_transfer_queue that will replace the graceful_exit_transfer_queue. The table has been created in a previous migration and made accessible through graceful exit db in another one. This changes makes graceful exit enqueue transfer items for new exiting nodes in the new table. Change-Id: I7bd00de13e749be521d63ef3b80c168df66b9433	2021-07-21 14:02:20 +00:00
Fadila Khadar	b0d98b1c1a	satellite/gracefulexit: allow use of graceful_exit_segment_transfer_queue For being able to use the segment metainfo loop, graceful exit transfers have to include the segment stream_id/position instead of the path. For this, we created a new table graceful_exit_segment_transfer_queue that will replace the graceful_exit_transfer_queue. The table has been created in a previous migration. This change gives access to this table. Graceful Exit doesn't use the table yet, this will be done in a next change. Change-Id: I6c09cff4cc45f0529813a8898ddb2d14aadb2cb8	2021-07-21 12:34:44 +00:00
Egon Elbre	59e3b586e7	satellite/{gracefulexit,overlay}: enable as of system time queries Change-Id: I2af5eb0e8a51fca7893ce07b78b5633be71dfef8	2021-06-22 11:50:50 +00:00
JT Olio	6949dc0bac	satellite/metaloop: missing monitoring on observers Change-Id: I630fbb0448c8d08b426486b3e49abfbca03332a6	2021-06-15 13:39:13 +00:00
JT Olio	da9ca0c650	testplanet/satellite: reduce the number of places default values need to be configured Satellites set their configuration values to default values using cfgstruct, however, it turns out our tests don't test these values at all! Instead, they have a completely separate definition system that is easy to forget about. As is to be expected, these values have drifted, and it appears in a few cases test planet is testing unreasonable values that we won't see in production, or perhaps worse, features enabled in production were missed and weren't enabled in testplanet. This change makes it so all values are configured the same, systematic way, so it's easy to see when test values are different than dev values or release values, and it's less hard to forget to enable features in testplanet. In terms of reviewing, this change should be actually fairly easy to review, considering private/testplanet/satellite.go keeps the current config system and the new one and confirms that they result in identical configurations, so you can be certain that nothing was missed and the config is all correct. You can also check the config lock to see what actual config values changed. Change-Id: I6715d0794887f577e21742afcf56fd2b9d12170e	2021-06-01 22:14:17 +00:00
Egon Elbre	10372afbe4	ci: fix lint errors Change-Id: Ib5893440807811f77175ccd347aa3f8ca9cccbdf	2021-05-17 13:37:31 +00:00
Egon Elbre	910eec8eee	satellite/metainfo: remove MetabaseDB interface Currently the interface is not useful. When we need to vary the implementation for testing purposes we can introduce a local interface for the service/chore that needs it, rather than using the large api. Unfortunately, this requires adding a cleanup callback for tests, there might be a better solution to this problem. Change-Id: I079fe4dbe297b0ae08c10081a1cea4dfbc277682	2021-05-13 13:22:14 +00:00
Ivan Fraixedes	7fb86617fc	satellite/satellitedb: Use CRDB AS OF SYSTEM & batch for GE Use the 'AS OF SYSTEM TIME' Cockroach DB clause for the Graceful Exit (a.k.a GE) queries that count the delete the GE queue items of nodes which have already exited the network. Split the subquery used for deleting all the transfer queue items of nodes which has exited when CRDB is used and batch the queries because CRDB struggles when executing in a single query unlike Postgres. The new test which has been added to this commit to verify the CRDB batch logic for deleting all the transfer queue items of the exited nodes has raised that the Enqueue method has to run in baches when CRDB is used otherwise CRDB has return the error "driver: bad connection" when a big a amount of items are passed to be enqueued. This error didn't happen with the current test implementation it was with an initial one that it was creating a big amount of exited nodes and transfer queue items for those nodes. Change-Id: I6a099cdbc515a240596bc93141fea3182c2e50a9	2021-05-07 13:09:19 -04:00
Michał Niewrzał	7944df20d6	storj: use multipart API Change-Id: I10b401434e3e77468d12ecd225b41689568fd197	2021-04-26 13:15:09 +00:00
Egon Elbre	4c9ed64f75	satellite/metabase/metaloop: move loop under metabase Currently the loop handling is heavily related to the metabase rather than metainfo. metainfo over time has become related to the "public API" for accessing the metabase data. Currently updates monkit.lock, because monkit monitoring does not handle ScopeNamed correctly. Needs a followup change to monitoring check. Change-Id: Ie50519991d718dfb872ec9a0176a82e732c97584	2021-04-22 12:58:09 +03:00
Ivan Fraixedes	2537bbf543	satellite/gracefulexit: Try avoiding randomly test failure The test function fails randomly in the CI when runs with CRDB. There isn't currently an explanation why the expectation of number of nodes which exited 4 minutes ago reports 4 nodes rather than 5 and the only clue that we have now to see if it gets remedied is to give 2 minutes rather than 1 to the node that exited close to the time passed function which makes the test to randomly fail. Change-Id: I3a731e3eb7f19caebdf29713150727f2cf3e0e0a	2021-04-21 17:40:07 +02:00
Egon Elbre	267506bb20	satellite/metabase: move package one level higher metabase has become a central concept and it's more suitable for it to be directly nested under satellite rather than being part of metainfo. metainfo is going to be the "endpoint" logic for handling requests. Change-Id: I53770d6761ac1e9a1283b5aa68f471b21e784198	2021-04-21 15:54:22 +03:00
Fadila Khadar	bde367ae73	satellite/gc: check on bloom filter creation date Check that the bloom filter creation date is earlier than the metainfo loop system time used for db scanning. Change-Id: Ib0f47c124f5651deae0fd7e7996abcdcaac98fb4	2021-04-14 16:40:37 +00:00
Kaloyan Raev	035c393da0	satellite: update tests to pass etag.Reader to multipart.PutObjectPart Change-Id: Ibe99357945ae7a91f5b5d4f87b83d425c9fa84a5	2021-03-29 13:18:11 +00:00
Egon Elbre	86e698f572	pb: use *UnimplementedServer to avoid breaking API changes Change-Id: I99a34eeb37ac4453411f273511710562a519f57a	2021-03-29 12:26:10 +03:00
Egon Elbre	f19ef4afe5	satellite/metainfo/metaloop: move loop to a separate package Change-Id: I94c931a27c1af6062185ec62688624ec02050f11	2021-03-23 15:37:34 +00:00
Michał Niewrzał	27ae0d1f15	satellite/metainfo/metabase: add NewRedundancy parameter for UpdateSegmentPieces method At some point we might try to change original segment RS values and set Pieces according to the new values. This change adds add NewRedundancy parameter for UpdateSegmentPieces method to give ability to do that. As a part of change NewPieces are validated against NewRedundancy. Change-Id: I8ea531c9060b5cd283d3bf4f6e4c320099dd5576	2021-03-22 08:12:56 +00:00
Michał Niewrzał	67e26aafcd	Merge remote-tracking branch 'origin/main' into multipart-upload Change-Id: I9b183323cb470185be22f7c648bb76917d2e6fca	2021-03-10 08:53:38 +01:00
Michał Niewrzał	c51ea68ad3	satellite/metainfo/metabase: reduce number of fields for LoopSegmentEntry For metainfo loop we need only some of Segment fields. By removing some of them we will reduce memory consumption during loop. Change-Id: I4af8baab58f7de8ddf5e142380180bb70b1b442d	2021-03-02 15:04:54 +01:00
Natalie Villasana	856db68fd9	satellite/gracefulexit: extend GE data cleanup to include exit_progress The new 'consistency ge-cleanup-orphaned-data' cli command deleted orphaned transfer queue items, but not entries in the graceful_exit_progress table. This will delete orphaned entries from the exit progress table too. Change-Id: I5f927aac1f258490678deaf179be92ccfe10fcd8	2021-03-01 15:52:43 +00:00
Michał Niewrzał	d995fb497f	Merge remote-tracking branch 'origin/main' into multipart-upload Change-Id: I367da03351ab80f7343332420490dde9282aa47a	2021-02-23 12:31:31 +01:00
Egon Elbre	630718c392	satellite/gracefulexit: fix test DeleteAllFinishedTransferQueueItems Using random piece num may generate the exact same key or piecenum. Instead use fixed piecenum and key. Change-Id: I54b7bc1a6698149bf99608dd46501ea963cec084	2021-02-19 18:15:22 +02:00
Egon Elbre	1137620baf	satellite/satellitedb: move tests to their domains Testing interfaces is slightly clearer when it's in the package needing the database rather than each individual implementation. Change-Id: I10334c214a205f7e510b939b4359a2214c4e060a	2021-02-19 17:29:15 +02:00
Fadila Khadar	5dd76522af	gracefulexit: use GetSegmentByLocation instead of GetObjectLatestVersion This enables the transfer of pieces from an on-going multipart upload. Tests are also modified to take into account pending multipart uploads. See https://storjlabs.atlassian.net/browse/PG-161 Change-Id: I35d433c44dd6e618667e5e8f9f998ef867b9f1ad	2021-02-16 10:49:36 +00:00
Kaloyan Raev	6f3d0c4ad5	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum satellite/repair/repair_test.go satellite/repair/repairer/segments.go Change-Id: Ie51a56878bee84ad9f2d31135f984881a882e906	2021-02-02 19:19:04 +02:00
Ivan Fraixedes	076804eac9	cmd/satellite: Add command for GE data cleanup Add a command to the satellite for cleaning up the Graceful Exit (a.k.a GE) transfer queue items of nodes that have exited. The commit adds to the GE satellite DB a couple of new methods, and its corresponding test, for performing the operations of the new command. Change-Id: I29a572a59689d63b24990ac13c52e76d65aaa917	2021-02-01 17:30:58 +00:00
Kaloyan Raev	c24ada7114	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum Change-Id: Icf7c029e9d800e5f6a9fdd208c36f28e05468690	2021-01-20 17:35:57 +02:00
Ivan Fraixedes	678b07b314	satellite: Fix typos & code formatting Fix some typos in the doc comments and readdress some code formatting applied automatically. Change-Id: I605b4eff2e7c6c58227ecf16be4c1d26f5322eb6	2021-01-15 16:40:26 +01:00
Kaloyan Raev	bafc6af992	ci: remove workaround for failing tests Change-Id: I3eb673fae6c81bee17d7437cb870d5f5ba6978d5	2020-12-21 18:07:40 +02:00
Michal Niewrzal	18825d1e0b	satellite/{metainfo,gracefulexit}: fix failing tests Change-Id: I3428ea601255c36a316732c9f75135d6e5fa4d79	2020-12-21 12:22:32 +00:00
Michal Niewrzal	b3aa28cc02	satellite/gracefulexit: migrate to metabase Change-Id: I8be9cc68894124427e4a30d7631126b3afb1f281	2020-12-18 10:57:39 +00:00
Stefan Benten	494bd5db81	all: golangci-lint v1.33.0 fixes (#3985 )	2020-12-05 17:01:42 +01:00
Kaloyan Raev	53b7fd7b00	satellite/{audit,gracefulexit}: remove logic for PieceHashesVerified We now have the piece hashes verified for all segments on all production satellites. We can remove the code that handles the case where piece hashes are not verified. This would make easier the migration of services from PointerDB to the new metabase. For consistency, PieceHashesVerified is still set to true in PointerDB for new segments. Change-Id: Idf0ccce4c8d01ae812f11e8384a7221d90d4c183	2020-11-24 11:09:48 +02:00
Moby von Briesen	db6bc6503d	satellite/metainfo: Update metainfo RS config to more easily support multiple RS schemes. Make metainfo.RSConfig a valid pflag config value. This allows us to configure the RSConfig as a string like k/m/o/n-shareSize, which makes having multiple supported RS schemes easier in the future. RS-related config values that are no longer needed have been removed (MinTotalThreshold, MaxTotalThreshold, MaxBufferMem, Verify). Change-Id: I0178ae467dcf4375c504e7202f31443d627c15e1	2020-11-09 22:16:13 +00:00
Egon Elbre	76f4619a9c	{satellite,storagenode}/gracefulexit: ensure client is closed Change-Id: I576a955a5578caf7fcbee832beca28cef2b0c83e	2020-10-27 23:27:07 +02:00
Kaloyan Raev	92a2be2abd	satellite/metainfo: get away from using pb.Pointer in Metainfo Loop As part of the Metainfo Refactoring, we need to make the Metainfo Loop working with both the current PointerDB and the new Metabase. Thus, the Metainfo Loop should pass to the Observer interface more specific Object and Segment types instead of pb.Pointer. After this change, there are still a couple of use cases that require access to the pb.Pointer (hence we have it as a field in the metainfo.Segment type): 1. Expired Deletion Service 2. Repair Service It would require additional refactoring in these two services before we are able to clean this. Change-Id: Ib3eb6b7507ed89d5ba745ffbb6b37524ef10ed9f	2020-10-27 13:06:47 +00:00
Egon Elbre	9adde49e1a	satellite/gracefulexit: ensure test doesn't timeout on failure Change-Id: Id004f8a075592ffc19b12a9d666058b60cb7724d	2020-10-26 21:16:48 +02:00
paul cannon	76d4977b6a	storagenode/gracefulexit: logic moved from worker to service Change-Id: I8b12606a96b712050bf40d587664fb1b2c578fbc	2020-10-22 23:19:30 +00:00
Egon Elbre	0bdb952269	all: use keyed special comment Change-Id: I57f6af053382c638026b64c5ff77b169bd3c6c8b	2020-10-13 15:13:41 +03:00

1 2 3

145 Commits