storj

Author	SHA1	Message	Date
paul cannon	915f3952af	satellite/repair: repair pieces on the same last_net We avoid putting more than one piece of a segment on the same /24 network (or /64 for ipv6). However, it is possible for multiple pieces of the same segment to move to the same network over time. Nodes can change addresses, or segments could be uploaded with dev settings, etc. We will call such pieces "clumped", as they are clumped into the same net, and are much more likely to be lost or preserved together. This change teaches the repair checker to recognize segments which have clumped pieces, and put them in the repair queue. It also teaches the repair worker to repair such segments (treating clumped pieces as "retrievable but unhealthy"; i.e., they will be replaced on new nodes if possible). Refs: https://github.com/storj/storj/issues/5391 Change-Id: Iaa9e339fee8f80f4ad39895438e9f18606338908	2023-04-06 17:34:25 +00:00
Michal Niewrzal	16b7901fde	satellite/metabase: add piece size calculation to segment This code is essentially replacement for eestream.CalcPieceSize. To call eestream.CalcPieceSize we need eestream.RedundancyStrategy which is not trivial to get as it requires infectious.FEC. For example infectious.FEC creation is visible on GE loop observer CPU profile because we were doing this for each segment in DB. New method was added to storj.Redundancy and here we are just wiring it with metabase Segment. BenchmarkSegmentPieceSize BenchmarkSegmentPieceSize/eestream.CalcPieceSize BenchmarkSegmentPieceSize/eestream.CalcPieceSize-8 5822 189189 ns/op 9776 B/op 8 allocs/op BenchmarkSegmentPieceSize/segment.PieceSize BenchmarkSegmentPieceSize/segment.PieceSize-8 94721329 11.49 ns/op 0 B/op 0 allocs/op Change-Id: I5a8b4237aedd1424c54ed0af448061a236b00295	2023-02-22 11:04:02 +00:00
Márton Elek	8f8e97de23	satellite/metainfo: support desired node number for download object/segment This modification introduces support for the new "desired node" field of download segment/object. This can be used to request more nodes than the suggested minimum. It can be used to achieve better performance in exchange for using more bandwidth (more parallel downloads). Change-Id: Ia167d6979e6d70a597c85070a4ccd1c3a573e406	2023-02-13 13:57:48 +00:00
Márton Elek	ca6e3a9e88	satellite/orders: create mock based unit test Most of our (~integration) tests are based on the testplanet runner. However, running testplanet for each test makes the testing process slow. It seems better to use real unit tests (without a db dependency) when possible. This patch makes a small modification to make it possible to test orders.Service with a real unit test. As the existing unit test of `service.go` is isolated with the `_test` package name, it's moved to an `_integration_test.go` file to make room for the unit test. Change-Id: Ia69f26a34e2c48d230d8d36c2040dd02a60455a6	2023-02-13 13:24:30 +00:00
JT Olio	686faeedbd	satellite/overlay: return noise info with selected nodes we have two more fields in the database (noise_proto and noise_public_key) that now need to go into pb.NodeAddress when returning AddressedOrderLimits. the only real complication is making sure type conversions between database types and NodeURLs and so on don't lose this new pb.NodeAddress field (NoiseInfo). otherwise this is a relatively straightforward commit Change-Id: I45b59d7b2d3ae21c2e6eb95497f07cd388d454b3	2023-02-02 15:46:27 +00:00
Andrew Harding	abd0ad92dc	satellite/metainfo: RetryBeginSegmentPieces RPC implementation Part of: https://github.com/storj/uplink/issues/120 Change-Id: I2a2873455f7498ffd31f50ade16c173fe1d18157	2023-01-27 15:04:59 +00:00
Michal Niewrzal	a2a9dafa33	satellite/orders: don't store allocated bandwidth in bucket_bandwidth_rollups table We have performance problems with updating bucket_bandwidth_rollups. To improve the situation we can stop storing allocated bandwidth in this table. This should reduce the large number of updates which are coming from metainfo endpoints, repair workers and audit. The next step will be to drop the `allocated` column completely from bucket_bandwidth_rollups. Allocated GET bandwidth is all we need and we are keeping it in the bucket_bandwidth_rollups table. Change-Id: Ifdd26a89ba8262acbca6d794a6c02883ad0c0c9b	2023-01-12 13:21:02 +00:00
Clement Sam	3378215adf	satellite/orders: decrease order expiration time to 24hours Closes https://github.com/storj/storj/issues/5202 Change-Id: I55d1a84c46dd610eeb00dd79df8f4f7e699499a0	2022-11-21 14:52:32 +00:00
paul cannon	c54c45c9c7	satellite/audit: new ReverifyPiece implementation ReverifyPiece() is not currently hooked up to anything, but is planned to take the place of audit.(*Verifier).Reverify(). ReverifyPiece() works by downloading one piece in its entirety, rather than pulling an entire stripe across many nodes. Change-Id: Ie2c680f4d3c3b65273a72466a3f9f55c115b0311	2022-10-27 16:06:21 +00:00
Michal Niewrzal	a97cd97789	satellite/orders: remove unused service dependency Orders service doesn't need buckets service anymore. Change-Id: I27853cda87e82b528f53667e4b4866801f7bfb62	2022-09-28 08:56:36 +00:00
Egon Elbre	51d4e5c275	satellite/{orders,overlay}: use cache for downloads Use DownloadSelectionCache to avoid querying database for every download. This change only addresses downloads from users. The download selection cache is not currently used for audit and repair. Change-Id: I96a49e121dac0b4204f97592a63131edabd73fb5	2022-07-12 11:04:34 +00:00
Fadila Khadar	29fd36a20e	satellite/repairer: handle excluded countries For nodes in excluded areas, we don't necessarily want to remove them from the pointer, but we do want to increase the number of pieces in the segment in case those excluded area nodes go down. To do that, we increase the number of pieces repaired by the number of pieces in excluded areas. Change-Id: I0424f1bcd7e93f33eb3eeeec79dbada3b3ea1f3a	2022-03-14 10:59:36 -04:00
Yingrong Zhao	1f8f7ebf06	satellite/{audit, reputation}: fix potential nodes reputation status inconsistency The original design had a flaw which can potentially cause a discrepancy in a node's reputation status between the reputations table and the nodes table. If a failure (network issue, db failure, satellite failure, etc.) happens between the update to the reputations table and the update to the nodes table, data can be out of sync. This PR tries to fix the above issue by passing through the node's reputation from the beginning of an audit/repair (this data is from the nodes table) to the next update in the reputation service. If the updated reputation status from the service is different from the existing node status, the service will try to update the nodes table. In the case of a failure, the service will be able to try updating the nodes table again since it can see the discrepancy in the data. This will allow both tables to be in sync eventually. Change-Id: Ic22130b4503a594b7177237b18f7e68305c2f122	2022-01-06 21:05:59 +00:00
Michał Niewrzał	1ed5db1467	satellite/metainfo: simplifying limits code It's a very simple change to reduce code duplication. Change-Id: Ia135232e3aefd094f76c6988e82e297be028e174	2021-09-28 06:22:13 +00:00
Cameron Ayer	26f839a445	satellite/repair/repairer: if not enough nodes for repair order limits, increment metric and log as irreparable segment Change-Id: I4bd46f28d64278c8d463e885ad221aafb6ce7cf3	2021-08-27 13:42:28 +00:00
Cameron Ayer	24e02b6352	satellite/{audit,orders}: if not enough nodes for audit order limits, increment metric and wrap error with ErrNotEnoughShares Increment a metric so we can get alerts. Wrap the error so we can search the logs for it. Change-Id: I3827aa306c431009828014d9d9afff8dfc057ee6	2021-08-26 20:14:05 +00:00
Clement Sam	d73b9fff9a	satellite/orders: set the expirationDate in CreatePutRepairOrderLimits In the past ExpirationDate was available inside CreatePutRepairOrderLimits but this was removed since the metabase segment was missing the ExpiresAt field. Now ExpiresAt field is available in the metabase segment and can be set correctly while executing NewSignerRepairPut. Change-Id: I068c07492ab27bde2c44477bbd32c5872edd024a	2021-07-27 12:44:40 +00:00
Cameron Ayer	449c873681	satellite/repair/repairer: attempt repair GETs using nodes' last IP and port first Sometimes we see timeouts from DNS lookups when trying to do repair GETs. Solution: try using node's last IP and port first. If we can't connect, retry with DNS lookup. Change-Id: I59e223aebb436118779fb18378f6e09d072f12be	2021-07-21 13:13:06 +00:00
Michał Niewrzał	70e6cdfd06	satellite/audit: move to segmentloop Change-Id: I10e63a1e4b6b62f5cd3098f5922ad3de1ec5af51	2021-06-28 11:32:00 +00:00
JT Olio	da9ca0c650	testplanet/satellite: reduce the number of places default values need to be configured Satellites set their configuration values to default values using cfgstruct, however, it turns out our tests don't test these values at all! Instead, they have a completely separate definition system that is easy to forget about. As is to be expected, these values have drifted, and it appears in a few cases test planet is testing unreasonable values that we won't see in production, or perhaps worse, features enabled in production were missed and weren't enabled in testplanet. This change makes it so all values are configured the same, systematic way, so it's easy to see when test values are different than dev values or release values, and it's less hard to forget to enable features in testplanet. In terms of reviewing, this change should be actually fairly easy to review, considering private/testplanet/satellite.go keeps the current config system and the new one and confirms that they result in identical configurations, so you can be certain that nothing was missed and the config is all correct. You can also check the config lock to see what actual config values changed. Change-Id: I6715d0794887f577e21742afcf56fd2b9d12170e	2021-06-01 22:14:17 +00:00
Egon Elbre	10a0216af5	satellite/metainfo: use range for specifying download limit Previously the object range was not used for calculating order limit. This meant that even if you were downloading only a small range it would account bandwidth based on the full segment. This doesn't fully address the accounting since the lazy segment downloads do not send their requested range nor requested limit. Change-Id: Ic811e570c889be87bac4293547d6537a255078da	2021-06-01 09:36:55 +00:00
Egon Elbre	267506bb20	satellite/metabase: move package one level higher metabase has become a central concept and it's more suitable for it to be directly nested under satellite rather than being part of metainfo. metainfo is going to be the "endpoint" logic for handling requests. Change-Id: I53770d6761ac1e9a1283b5aa68f471b21e784198	2021-04-21 15:54:22 +03:00
Michał Niewrzał	67e26aafcd	Merge remote-tracking branch 'origin/main' into multipart-upload Change-Id: I9b183323cb470185be22f7c648bb76917d2e6fca	2021-03-10 08:53:38 +01:00
Natalie Villasana	c290e5ac9a	satellite/orders: decrease FlushBatchSize default to 1000 The previous default FlushBatchSize of 10000 was causing major slow down in select and insert statements on bucket_bandwidth_rollups. We saw on the saltlake satellite that a FlushBatchSize of 1000 helped reduce contention and query latency. Change-Id: Ib95e73482219bc5aedc11925b1849fa5999774ba	2021-03-02 14:00:48 +00:00
Kaloyan Raev	6f3d0c4ad5	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum satellite/repair/repair_test.go satellite/repair/repairer/segments.go Change-Id: Ie51a56878bee84ad9f2d31135f984881a882e906	2021-02-02 19:19:04 +02:00
Ivan Fraixedes	d93944c57b	satellite/orders: Delete unused methods & DB tables Delete satellite order methods and DB tables which aren't used anymore after we refactored the orders to stick bucket information in the orders' encrypted metadata. There are also configuration parameters and a satellite chore that aren't needed anymore after the orders refactoring. Change-Id: Ida3682b95921df70792284b42c96d2508bf8ca9c	2021-02-01 18:01:29 +00:00
Michał Niewrzał	3c13aae61e	satellite/metainfo: remove unused method CreateGetOrderLimits is not used anymore because we have CreateGetOrderLimits2. We need to remove old method and fix name of second. Change-Id: I59148b8d28fc9dbab7d452c884319125a02745d1	2021-01-21 17:00:13 +01:00
Michał Niewrzał	ec88d21a3c	Merge 'main' branch. Change-Id: I6e8162d1a6caf75e89c9f9c9f9522730aebf83ae	2021-01-11 10:26:58 +01:00
Jeff Wendling	2d2359667d	satellite/orders: remove unused satelliteAddress field Change-Id: I58091769472688433c48becc8dfc9029bddd87aa	2021-01-08 12:25:39 -05:00
Egon Elbre	51731db121	satellite/orders: use smaller encrypted metadata Avoid using project uuid string representation, because it uses more bandwidth. This reduces the encrypted metadata size from 118 -> 97 bytes. Change-Id: Ic53a81b83acc065f24f28cd404f9c0b1fe592594	2021-01-08 16:40:31 +00:00
Michal Niewrzal	9a8959d429	Merge 'master' branch Change-Id: Iba69ea73ca4d3f1cd4ae94243eaaae033c5324e8	2020-12-22 14:55:57 +01:00
Kaloyan Raev	5934969dd6	satellite/orders: remove obsolete CreateDeleteOrderLimits method Not used anywhere. Change-Id: I878635d2d533ad4b06ba0d07a94908105546cb82	2020-12-22 13:15:11 +02:00
Jessica Grebenschikov	d961437889	satellite/orders: remove the config IncludeEncryptedMetadata Since the Satellite now requires the order encryption functionality (since serial_number table is deprecated) to properly function, we can remove the config flag to turn on/off the feature. Change-Id: Ie973f72a9a05a81cef9e53dc9c99d22c940c2488	2020-12-18 10:39:29 -08:00
Jessica Grebenschikov	97a5e6c814	satellite/orders: stop inserting/reading from serial_numbers table This PR contains the minimum changes needed to stop inserting into the serial_numbers table. This is the first step in completely deprecating that table. The next step is to create another PR to remove the expiredSerial chore, fix more tests, and remove any other methods on the serial_number table. Change-Id: I5f12a56ebf3fa4d1a1976141d2911f25a98d2cc3	2020-12-18 08:35:13 -08:00
Kaloyan Raev	9aa61245d0	satellite/audits: migrate to metabase Change-Id: I480c941820c5b0bd3af0539d92b548189211acb2	2020-12-17 14:38:48 +02:00
Michal Niewrzal	8d3ea9c251	satellite/repair/repairer: implement SegmentRepairer with metabase Change-Id: I647c625e00a626c44e812602ad9bc3e85a7b602c	2020-12-17 10:47:21 +00:00
Michal Niewrzal	218bbeaffa	Merge 'master' branch Change-Id: Ica5c25607a951076dd9f77e35e308062f71ce3f0	2020-12-07 15:05:52 +01:00
Jessica Grebenschikov	b261110352	satellite/orders: get bucketID from encrypted metadata in order instead of serial_numbers table We want to stop using the serial_numbers table in satelliteDB. One of the last places using the serial_numbers table is when storagenodes settle orders: we look up the bucket name and project ID from the serial number in the serial_numbers table. Now that we have support for adding encrypted metadata to the OrderLimit, this PR makes use of that and now attempts to read the project ID and bucket name from the encrypted orderLimit metadata instead of from the serial_numbers table. For backwards compatibility and to ensure no errors, we will still fall back to the old way of getting that info from the serial_numbers table, but this will be removed in the next release as long as there are no errors. All processes that create orderLimits must have an orders.encryption-keys set. The services that create orderLimits (and thus need to encrypt the order metadata) are the satellite api process, the repair process, audit service (core process), and graceful exit (core process). Only the satellite api process decrypts the order metadata when storagenodes settle orders. This means that the same encryption key needs to be provided in the config for the satellite api process, repair process, and the core process like so: orders.include-encrypted-metadata=true orders.encryption-keys="<encryptionKeyID>=<encryptionKey>" Change-Id: Ie2c037971713d6fbf69d697bfad7f8b672eedd66	2020-12-01 15:29:32 +00:00
Michal Niewrzal	3fe16f4003	satellite/metainfo: upload/download with metabase This change is adjusting metainfo endpoint to use metabase for uploading and downloading remote objects. Inline segments will be added later. Change-Id: I109d45bf644cd48096c47361043ebd8dfeaea0f3	2020-11-11 12:13:52 +00:00
Michal Niewrzal	7dde184cb5	Merge 'master' branch Change-Id: I6070089128a150a4dd501bbc62a1f8b394aa643e	2020-11-10 11:58:59 +00:00
paul cannon	8616fc146d	satellite/orders: send IPs for graceful exit Storage nodes undergoing Graceful Exit have up to now been receiving hostnames for all other storage nodes they need to contact when transferring pieces. This adds up to a lot of DNS lookups, which apparently overwhelm some home routers. There does not seem to be any need for us to send hostnames for graceful exit as opposed to IP addresses; we already use IP addresses (as given by the last_ip_port column in the nodes table) for all the GET and PUT orders we send out. This change causes IP addresses to be used instead. I started trying to construct a test to ensure that the behavior changed, but it was rabbit-holing, so I've begun to feel that maybe this change doesn't require one; it is a very simple change, and very much of the same nature as what we already do for IPs in CreateGetOrderLimits and CreatePutOrderLimits (and others). Change-Id: Ib2b5ffe7a9310e9cdbe7464450cc7c934fa229a1	2020-11-04 00:17:20 +00:00
Jessica Grebenschikov	f5880f6833	satellite/orders: rollout phase3 of SettlementWithWindow endpoint Change-Id: Id19fae4f444c83157ce58c933a18be1898430ad0	2020-10-26 14:56:28 +00:00
paul cannon	360ab17869	satellite/audit: use LastIPAndPort preferentially This preserves the last_ip_and_port field from node lookups through CreateAuditOrderLimits() and CreateAuditOrderLimit(), so that later calls to (Verifier).GetShare() can try to use that IP and port. If a connection to the given IP and port cannot be made, or the connection cannot be verified and secured with the target node identity, an attempt is made to connect to the original node address instead. A similar change is not necessary to the other CreateOrderLimits functions, because they already replace node addresses with the cached IP and port as appropriate. We might want to consider making a similar change to CreateGetRepairOrderLimits(), though. The audit situation is unique because the ramifications are especially powerful when we get the address wrong. Failing a single audit can have a heavy cost to a storage node. We need to make extra effort in order to avoid imposing that cost unfairly. Situation 1: If an audit fails because the repair worker failed to make a DNS query (which might well be the fault on the satellite side), and we have last_ip_and_port information available for the target node, it would be unfair not to try connecting to that last_ip_and_port address. Situation 2: If a node has changed addresses recently and the operator correctly changed its DNS entry, but we don't bother querying DNS, it would be unfair to penalize the node for our failure to connect to it. So the audit worker must try both last_ip_and_port _and_ the node address as supplied by the SNO. We elect here to try last_ip_and_port first, on the grounds that (a) it is expected to work in the large majority of cases, (b) there should not be any security concerns with connecting to an out-of-date address, and (c) avoiding DNS queries on the satellite side helps alleviate satellite operational load. Change-Id: I9bf6c6c79866d879adecac6144a6c346f4f61200	2020-10-21 13:34:40 +00:00
Jessica Grebenschikov	205c39d404	satellite/orders: upgrade to phase 2 rollout ordersWithWindow We are moving an error into rejectErr since its preventing storage nodes from being able to settle other orders. Change-Id: I3ac97c340e491b127f5e0024c5e8bd9f4df8d5c3	2020-10-15 21:20:19 +00:00
Egon Elbre	0bdb952269	all: use keyed special comment Change-Id: I57f6af053382c638026b64c5ff77b169bd3c6c8b	2020-10-13 15:13:41 +03:00
Jeff Wendling	0f0faf0a9f	satellite/orders: do a better job limiting concurrent requests Doing it at the ProcessOrders level was insufficient: the endpoints make multiple database calls. It was a misguided attempt to only have one spot enter the semaphore. By putting it in the endpoint we can not only be sure that the concurrency is correctly limited but it can be configurable easily. Change-Id: I937149dd077adf9eb87fce52a1a17dc0afe96f64	2020-10-09 16:27:15 -04:00
Egon Elbre	dc48197bd8	satellite/orders: add bucket id to order limit Change-Id: I9019ec77d692e62ac17b67a1da71dc3535cde50c	2020-09-03 10:50:11 +03:00
Egon Elbre	61b17f1214	satellite/orders: add encryption keys flag to Service Change-Id: Ie96e75bc96241b799d04654ef5e05b82e6a899bb	2020-09-02 05:02:14 +00:00
Egon Elbre	3ca405aa97	satellite/orders: use metabase types as arguments Change-Id: I7ddaad207c20572a5ea762667531770a56fd54ef	2020-08-28 15:52:37 +03:00
Egon Elbre	b4c8e219c7	satellite/orders: calculate order expiration inside signer Change-Id: I07f79eeb1ab41b061a1f3146f684bd21291cffb0	2020-08-18 13:21:16 +03:00

120 Commits