storj

Author	SHA1	Message	Date
Ivan Fraixedes	d93944c57b	satellite/orders: Delete unused methods & DB tables Delete satellite order methods and DB tables which aren't used anymore after we have done a refactoring on the orders to stuck bucket information in the orders' encrypted metadata. There are also configuration parameters and a satellite chore that aren't needed anymore after the orders refactoring. Change-Id: Ida3682b95921df70792284b42c96d2508bf8ca9c	2021-02-01 18:01:29 +00:00
Jeff Wendling	2d2359667d	satellite/orders: remove unused satelliteAddress field Change-Id: I58091769472688433c48becc8dfc9029bddd87aa	2021-01-08 12:25:39 -05:00
Egon Elbre	51731db121	satellite/orders: use smaller encrypted metadata Avoid using project uuid string representation, because it uses more bandwidth. This reduces the encrypted metadata size from 118 -> 97 bytes. Change-Id: Ic53a81b83acc065f24f28cd404f9c0b1fe592594	2021-01-08 16:40:31 +00:00
Jessica Grebenschikov	d961437889	satellite/orders: remove the config IncludeEncryptedMetadata Since the Satellite now requires the order encryption functionality (since serial_number table is deprecated) to properly function, we can remove the config flag to turn on/off the feature. Change-Id: Ie973f72a9a05a81cef9e53dc9c99d22c940c2488	2020-12-18 10:39:29 -08:00
Jessica Grebenschikov	97a5e6c814	satellite/orders: stop inserting/reading from serial_numbers table This PR contains the minimum changes needed to stop inserting into the serial_numbers table. This is the first step in completely deprecating that table. The next step is to create another PR to remove the expiredSerial chore, fix more tests, and remove any other methods on the serial_number table. Change-Id: I5f12a56ebf3fa4d1a1976141d2911f25a98d2cc3	2020-12-18 08:35:13 -08:00
Jessica Grebenschikov	b261110352	satellite/orders: get bucketID from encrypted metadata in order instead of serial_numbers table We want to stop using the serial_numbers table in satelliteDB. One of the last places using the serial_numbers table is when storagenodes settle orders, we look up the bucket name and project ID from the serial number from the serial_numbers table. Now that we have support to add encrypted metadata into the OrderLimit, this PR makes use of that and now attempts to read the project ID and bucket name from the encrypted orderLimit metadata instead of from the serial_numbers table. For backwards compatibility and to ensure no errors, we will still fallback to the old way of getting that info from the serial_numbers table, but this will be removed in the next release as long as there are no errors. All processes that create orderLimits must have an orders.encryption-keys set. The services that create orderLimits (and thus need to encrypt the order metadata) are the satellite apiProcess, the repair process, audit service (core process), and graceful exit (core process). Only the satellite api process decrypts the order metadata when storagenodes settle orders. This means that the same encryption key needs to be provided in the config for the satellite api process, repair process, and the core process like so: orders.include-encrypted-metadata=true orders.encryption-keys="<"encryptionKeyID>=<encryptionKey>" Change-Id: Ie2c037971713d6fbf69d697bfad7f8b672eedd66	2020-12-01 15:29:32 +00:00
paul cannon	8616fc146d	satellite/orders: send IPs for graceful exit Storage nodes undergoing Graceful Exit have up to now been receiving hostnames for all other storage nodes they need to contact when transferring pieces. This adds up to a lot of DNS lookups, which apparently overwhelm some home routers. There does not seem to be any need for us to send hostnames for graceful exit as opposed to IP addresses; we already use IP addresses (as given by the last_ip_port column in the nodes table) for all the GET and PUT orders we send out. This change causes IP addresses to be used instead. I started trying to construct a test to ensure that the behavior changed, but it was rabbit-holing, so I've begun to feel that maybe this change doesn't require one; it is a very simple change, and very much of the same nature as what we already do for IPs in CreateGetOrderLimits and CreatePutOrderLimits (and others). Change-Id: Ib2b5ffe7a9310e9cdbe7464450cc7c934fa229a1	2020-11-04 00:17:20 +00:00
Jessica Grebenschikov	f5880f6833	satellite/orders: rollout phase3 of SettlementWithWindow endpoint Change-Id: Id19fae4f444c83157ce58c933a18be1898430ad0	2020-10-26 14:56:28 +00:00
paul cannon	360ab17869	satellite/audit: use LastIPAndPort preferentially This preserves the last_ip_and_port field from node lookups through CreateAuditOrderLimits() and CreateAuditOrderLimit(), so that later calls to (Verifier).GetShare() can try to use that IP and port. If a connection to the given IP and port cannot be made, or the connection cannot be verified and secured with the target node identity, an attempt is made to connect to the original node address instead. A similar change is not necessary to the other CreateOrderLimits functions, because they already replace node addresses with the cached IP and port as appropriate. We might want to consider making a similar change to CreateGetRepairOrderLimits(), though. The audit situation is unique because the ramifications are especially powerful when we get the address wrong. Failing a single audit can have a heavy cost to a storage node. We need to make extra effort in order to avoid imposing that cost unfairly. Situation 1: If an audit fails because the repair worker failed to make a DNS query (which might well be the fault on the satellite side), and we have last_ip_and_port information available for the target node, it would be unfair not to try connecting to that last_ip_and_port address. Situation 2: If a node has changed addresses recently and the operator correctly changed its DNS entry, but we don't bother querying DNS, it would be unfair to penalize the node for our failure to connect to it. So the audit worker must try both last_ip_and_port _and_ the node address as supplied by the SNO. We elect here to try last_ip_and_port first, on the grounds that (a) it is expected to work in the large majority of cases, and (b) there should not be any security concerns with connecting to an out-or-date address, and (c) avoiding DNS queries on the satellite side helps alleviate satellite operational load. Change-Id: I9bf6c6c79866d879adecac6144a6c346f4f61200	2020-10-21 13:34:40 +00:00
Jessica Grebenschikov	205c39d404	satellite/orders: upgrade to phase 2 rollout ordersWithWindow We are moving an error into rejectErr since its preventing storage nodes from being able to settle other orders. Change-Id: I3ac97c340e491b127f5e0024c5e8bd9f4df8d5c3	2020-10-15 21:20:19 +00:00
Egon Elbre	0bdb952269	all: use keyed special comment Change-Id: I57f6af053382c638026b64c5ff77b169bd3c6c8b	2020-10-13 15:13:41 +03:00
Jeff Wendling	0f0faf0a9f	satellite/orders: do a better job limiting concurrent requests Doing it at the ProcessOrders level was insufficient: the endpoints make multiple database calls. It was a misguided attempt to only have one spot enter the semaphore. By putting it in the endpoint we can not only be sure that the concurrency is correctly limited but it can be configurable easily. Change-Id: I937149dd077adf9eb87fce52a1a17dc0afe96f64	2020-10-09 16:27:15 -04:00
Egon Elbre	dc48197bd8	satellite/orders: add bucket id to order limit Change-Id: I9019ec77d692e62ac17b67a1da71dc3535cde50c	2020-09-03 10:50:11 +03:00
Egon Elbre	61b17f1214	satellite/orders: add encryption keys flag to Service Change-Id: Ie96e75bc96241b799d04654ef5e05b82e6a899bb	2020-09-02 05:02:14 +00:00
Egon Elbre	3ca405aa97	satellite/orders: use metabase types as arguments Change-Id: I7ddaad207c20572a5ea762667531770a56fd54ef	2020-08-28 15:52:37 +03:00
Egon Elbre	b4c8e219c7	satellite/orders: calculate order expiration inside signer Change-Id: I07f79eeb1ab41b061a1f3146f684bd21291cffb0	2020-08-18 13:21:16 +03:00
Egon Elbre	189ab07846	satellite/orders: use Signer in CreateGetOrderLimits Change-Id: Icb7ed4f1af1dabbbb68cb6f6e1f86d93a9b5faa3	2020-08-18 13:20:00 +03:00
Egon Elbre	cd5e99ea6b	satellite/orders: Signer for simplifying signing logic Create a separate struct for signing order limits. Change-Id: I8f8f5245040efa8c03138512be9248d4834f3f36	2020-08-18 13:19:16 +03:00
Qweder93	01bb2bd17d	satellite/audit: verifier checks if node made sucess GE before auditing Change-Id: Ia6cde4e9fcf11020a5301d38065f7159f276eb80	2020-08-17 23:37:57 +03:00
Jeff Wendling	85a74b47e7	satellite/orders: 3-phase rollout This adds a config flag orders.window-endpoint-rollout-phase that can take on the values phase1, phase2 or phase3. In phase1, the current orders endpoint continues to work as usual, and the windowed orders endpoint uses the same backend as the current one (but also does a bit extra). In phase2, the current orders endpoint is disabled and the windowed orders endpoint continues to use the same backend. In phase3, the current orders endpoint is still disabled and the windowed orders endpoint uses the new backend that requires much less database traffic and state. The intention is to deploy in phase1, roll out code to nodes to have them use the windowed endpoint, switch to phase2, wait a couple days for all existing orders to expire, then switch to phase3. Additionally, it fixes a bug where a node could submit a bunch of orders and rack up charges for a bucket. Change-Id: Ifdc10e09ae1645159cbec7ace687dcb2d594c76d	2020-08-03 17:01:42 +00:00
Egon Elbre	36ed939b89	satellite/orders: add buckets db to service We need to add bucket UUID into the order limit, hence we need access to the buckets table. Change-Id: I348ce1f709c9fcdec5c4034acaab59805b33da9f	2020-07-24 17:36:49 +03:00
Egon Elbre	44f9193404	satellite/orders: make optimal threshold multiplier into an argument It feels weird having a repairer configuration part of order services. Let's have a single source of truth for it. Change-Id: I24f7c897aec80f3293f8af24876cbb6733d85a0b	2020-07-24 16:35:59 +03:00
Egon Elbre	ba4c3d9986	satellite/orders: remove unused node status logging flag Change-Id: I24da78a11cc5d3d88cdf6aca85c4238e4086e59c	2020-07-24 16:35:59 +03:00
Egon Elbre	080ba47a06	all: fix dots Change-Id: I6a419c62700c568254ff67ae5b73efed2fc98aa2	2020-07-16 14:58:28 +00:00
Egon Elbre	df8cf8f58a	satellite/orders: delete unused code Change-Id: I431c8cc2f23e538c676d6f742fb1faef7cc1d73e	2020-06-25 16:48:26 +03:00
Egon Elbre	f6301612ac	satellite/orders: use serial SerialNumber By ensuring that they have less randomness it means that they can be compressed better. Using a timestamp should be a good improvement here. Change-Id: Ic4dabb53335a744ff1c332dd279f37ae2cd79357	2020-05-15 11:33:53 +00:00
Egon Elbre	1024bf9ce1	all: simplify uuid usage Instead of uuid.Parse, use uuid.FromString. This removes a bunch of pointer management logic. Change-Id: Id25bd174eb43c71d00b450158a198abafd8958f2	2020-04-02 13:45:19 +00:00
Egon Elbre	0a69da4ff1	all: switch to storj.io/common/uuid Change-Id: I178a0a8dac691e57bce317b91411292fb3c40c9f	2020-03-31 19:16:41 +03:00
Egon Elbre	439aba922a	satellite/overlay: reduce overhead of GetNodes Instead of filtering on the client side it's better to filter on the database side. Change-Id: I845fbbe5ed28c2ffdb0b8a3f789b59c094fd1069	2020-03-30 18:36:23 +03:00
Egon Elbre	cb781d66c7	satellite/overlay: optimize FindStorageNodes Reduce the number of fields returned from the query. Benchmark results in `satellite/overlay`: benchstat before.txt after2.txt name old time/op new time/op delta SelectStorageNodes-32 7.85ms ± 1% 6.27ms ± 1% -20.18% (p=0.002 n=10+4) SelectNewStorageNodes-32 8.21ms ± 1% 6.61ms ± 0% -19.53% (p=0.002 n=10+4) SelectStorageNodesExclusion-32 17.2ms ± 1% 15.9ms ± 1% -7.55% (p=0.002 n=10+4) SelectNewStorageNodesExclusion-32 17.8ms ± 2% 16.1ms ± 0% -9.38% (p=0.002 n=10+4) FindStorageNodes-32 48.4ms ± 1% 45.1ms ± 0% -6.69% (p=0.002 n=10+4) FindStorageNodesExclusion-32 79.2ms ± 1% 76.1ms ± 1% -3.89% (p=0.002 n=10+4) Benchmark results from `satellite/overlay` after making them parallel: benchstat before-parallel.txt after2-parallel.txt name old time/op new time/op delta SelectStorageNodes-32 548µs ± 1% 353µs ± 1% -35.60% (p=0.029 n=4+4) SelectNewStorageNodes-32 562µs ± 0% 368µs ± 0% -34.51% (p=0.029 n=4+4) SelectStorageNodesExclusion-32 1.02ms ± 1% 0.84ms ± 0% -18.08% (p=0.029 n=4+4) SelectNewStorageNodesExclusion-32 1.03ms ± 1% 0.86ms ± 2% -16.22% (p=0.029 n=4+4) FindStorageNodes-32 3.11ms ± 0% 2.79ms ± 1% -10.27% (p=0.029 n=4+4) FindStorageNodesExclusion-32 4.75ms ± 0% 4.43ms ± 1% -6.56% (p=0.029 n=4+4) Change-Id: I1d85e2764eb270f4c2b1998303ccfc1179d65b26	2020-03-30 18:36:23 +03:00
Ethan	bdbf764b86	satellite/orders;overlay: Consolidate order limit storage node lookups into 1 query. https: //storjlabs.atlassian.net/browse/SM-449 Change-Id: Idc62cc2978fba67cf48f7c98b27b0f996f9c58ac	2020-03-16 23:15:47 +00:00
Jess G	39cb821196	satellite/overlay: rm combinedcache, fix IP naming to be network (#3798 ) * rn combinedcache, rm dns node lookup Change-Id: I239f07211764b097d851230d8c81900a47756e9e * excludeIPs -> excludedNetworks Change-Id: Ifa6f44ab17457cdd5aff4cd5694296867c18b179 * use lowercase var name Change-Id: I825aad2b718c71f455e747be18f8cabd02aabe55 * update Getnetwork name Change-Id: I002a1b7bc6b4ef40159c0cd2b0ef209f80a9c503 * fix comments Change-Id: Ibddf5b9ffa9d685af6c392d893db063ef18e45fa * update comments with ipv6 Change-Id: I31758b7d4979e7c27d014668f4fb532ad838cda2 Co-authored-by: Stefan Benten <mail@stefan-benten.de>	2020-03-12 11:37:57 -07:00
Jessica Grebenschikov	803e2930f4	satellite: use IP for all uplink operations, use hostname for audit and repairs My understanding is that the nodes table has the following fields: - `address` field which can be a hostname or an IP - `last_net` field that is the /24 subnet of the IP resolved from the address This PR does the following: 1) add back the `last_ip` field to the nodes table 2) for uplink operations remove the calls that the satellite makes to `lookupNodeAddress` (which makes the DNS calls to resolve the IP from the hostname) and instead use the data stored in the nodes table `last_ip` field. This means that the IP that the satellite sends to the uplink for the storage nodes could be approx 1 hr stale. In the short term this is fine, next we will be adding changes so that the storage node pushes any IP changes to the satellite in real time. 3) use the address field for repair and audit since we want them to still make DNS calls to confirm the IP is up to date 4) try to reduce confusion about hostname, ip, subnet, and address in the code base Change-Id: I96ce0d8bb78303f82483d0701bc79544b74057ac	2020-03-11 09:11:40 -07:00
Jessica Grebenschikov	2af71f3460	satellite/orders: add monkit to looking up node addr Change-Id: Ia0eb0ffc343879a6ef9827d46e936e1fbc2e198a	2020-03-04 23:15:18 +00:00
Egon Elbre	5342dd9fe6	go.mod: update uplink Change-Id: I867a6a1eef8aa5d60bb676e5112b98c4192ce811	2020-02-21 16:08:12 +02:00
Simon Guindon	961944f24d	satellite/orders: Resolve storage node addresses to IP addresses. This change resolves all the storage node addresses to their IP addresses before giving them to the uplink so that the uplink doesn't have to resolve a hundred hosts and can immediately connect to improve uplink performance. Change-Id: Idb834351e0fece409d74c8a1c29b0b8c9b09c9ff	2020-02-11 18:44:45 +02:00
Jessica Grebenschikov	a1948ed338	satellite/orders: add old method for CreateGetOrderLimitsOld to maintain compatibility with old versions of the uplink Change-Id: I7ce1f4fbc6217f1d340cf778c4b010d40961b3f0	2020-01-28 18:54:24 -05:00
Jessica Grebenschikov	54dbaaece2	satellite/orders: create as many orderLimits as needed to download a file Change-Id: I2a39483d35037d9940913c035a78a93ea692ce9f	2020-01-28 20:04:11 +00:00
littleskunk	a6c6440ab7	satellite/order: decrease expire time from 7 days to 2 days For the last few month we had no issues with order submission. I would call it stable and now it is time to risk a lower expire time. This will increase the database performance on the satellite and it will reduce the delay for billing. The long term goal is 6h but for that step we need to change graceful exit first. At the moment storage nodes would get disuqlaified for not transfering alle pieces in less than 6 hours. Change-Id: I421a2c2421c5374c4e706e2338f1c2161fedc14c	2020-01-24 23:37:39 +00:00
Jeff Wendling	26e33e7e07	satellite/gracefulexit: make orders with right bucket id and action paths are organized as follows: project_id/segment_index/bucket_name/encrypted_key so by picking parts[0] and parts[1], we were using the segment index instead of the bucket name, causing bandwidth to be accounted for incorrectly. additionally, we were using the PUT action instead of the PUT_GRACEFUL_EXIT action, causing the data to be charged incorrectly. we use PUT_REPAIR for now because nodes won't accept uploads with PUT_GRACEFUL_EXIT and our tables need migrations to handle rollups with it. Change-Id: Ife2aff541222bac930c35df8fcf76e8bac5d60b2	2020-01-24 19:27:38 +00:00
stefanbenten	f4097d518c	satellite: reduce logging of node status Change-Id: I6618cf4bf31b856acd7a28b54011a943c03ab22a	2020-01-18 17:47:59 +00:00
Jeff Wendling	78c6d5bb32	satellite/satellitedb: reported_serials table for processing orders this commit introduces the reported_serials table. its purpose is to allow for blind writes into it as nodes report in so that we have minimal contention. in order to continue to accurately account for used bandwidth, though, we cannot immediately add the settled amount. if we did, we would have to give up on blind writes. the table's primary key is structured precisely so that we can quickly find expired orders and so that we maximally benefit from rocksdb path prefix compression. we do this by rounding the expires at time forward to the next day, effectively giving us storagenode petnames for free. and since there's no secondary index or foreign key constraints, this design should use significantly less space than the current used_serials table while also reducing contention. after inserting the orders into the table, we have a chore that periodically consumes all of the expired orders in it and inserts them into the existing rollups tables. this is as if we changed the nodes to report as the order expired rather than as soon as possible, so the belief in correctness of the refactor is higher. since we are able to process large batches of orders (typically a day's worth), we can use the code to maximally batch inserts into the rollup tables to make inserts as friendly as possible to cockroach. Change-Id: I25d609ca2679b8331979184f16c6d46d4f74c1a6	2020-01-15 19:21:21 -07:00
Isaac Hess	4950d7106a	satellite/orders: Add write cache for bw rollups Change-Id: I8ba454cb2ab4742cafd6ed09120e4240874831fc	2020-01-13 22:40:51 +00:00
littleskunk	bcc23f6869	Satellite/orders: remove allocated bandwith from storagenode_bandwidth_rollups When an uplink requests an upload or download from the satellite we are trackig the allocated bandwidth twice. The value in bucket_bandwidth_rollups is used for project limits but the value in storagenode_bandwidth_rollups is not used at all. We can increase the performance by removing it. Uplinks will get a faster response from the satellite. Change-Id: Icccd41f94107ef34668f30f99bf5f728c384b07e	2020-01-12 16:20:47 +01:00
Egon Elbre	082ec81714	uplink: move to storj.io/uplink (#3746 )	2020-01-08 15:40:19 +02:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Yingrong Zhao	7af42e3c10	satellite/metainfo, satellite/repair, uplink/eestream: add metric for download failed due to not enough pieces available (#3665 )	2019-12-04 16:24:36 -05:00
Ethan Adams	3e0d12354a	storagenode/gracefulexit: Implement storage node graceful exit worker - part 1 (#3322 )	2019-10-22 16:42:21 -04:00
Ethan Adams	a1275746b4	satellite/gracefulexit: Implement the 'process' endpoint on the satellite (#3223 )	2019-10-11 17:18:05 -04:00
Egon Elbre	a801fab66a	all: add archview annotations (#2964 )	2019-09-10 16:24:16 +03:00

1 2

86 Commits