storj

Author	SHA1	Message	Date
Jeff Wendling	4cbd4d52a9	satellite/orders: only hold the orders semaphore during database calls holding it during node i/o means slow nodes can hold up order processing for everyone else. this dramatically increases the amount of tiem spent handling orders. Change-Id: Iec999b7ed0817c921a0fd039097a75bdd3c70ea2	2020-10-10 15:40:50 -04:00
Jeff Wendling	0f0faf0a9f	satellite/orders: do a better job limiting concurrent requests Doing it at the ProcessOrders level was insufficient: the endpoints make multiple database calls. It was a misguided attempt to only have one spot enter the semaphore. By putting it in the endpoint we can not only be sure that the concurrency is correctly limited but it can be configurable easily. Change-Id: I937149dd077adf9eb87fce52a1a17dc0afe96f64	2020-10-09 16:27:15 -04:00
Jeff Wendling	1fecaed7df	satellite/orders: don't version check old endpoint nodes are submitting using both the legacy and windowed endpoints and thus having their legacy submissions rejected. it is legal to use both the legacy and windowed endpoints in phase1 since they use the same backend. the legacy endpoint is disabled in phase2 and phase3. therefore, if we wait an order expiration period (2 days) after we determine enough nodes have started using the windowed endpoint, we can be sure that any orders they did have to submit with the legacy endpoint will have expired. Change-Id: I4418a881bf8bb9377efaef4c651e6103a5dc6ed0	2020-09-09 10:23:48 -04:00
Egon Elbre	dc48197bd8	satellite/orders: add bucket id to order limit Change-Id: I9019ec77d692e62ac17b67a1da71dc3535cde50c	2020-09-03 10:50:11 +03:00
Egon Elbre	61b17f1214	satellite/orders: add encryption keys flag to Service Change-Id: Ie96e75bc96241b799d04654ef5e05b82e6a899bb	2020-09-02 05:02:14 +00:00
Egon Elbre	c86c732fc0	satellite: simplify tests satellite.DB.Console().Projects().GetAll database query can be replaced with planet.Uplinks[0].Projects[0].ID Change-Id: I73b82b91afb2dde7b690917345b798f9d81f6831	2020-08-28 22:28:04 +00:00
Egon Elbre	3ca405aa97	satellite/orders: use metabase types as arguments Change-Id: I7ddaad207c20572a5ea762667531770a56fd54ef	2020-08-28 15:52:37 +03:00
Jeff Wendling	91698207cf	storagenode: live tracking of order window usage This change accomplishes multiple things: 1. Instead of having a max in flight time, which means we effectively have a minimum bandwidth for uploads and downloads, we keep track of what windows have active requests happening in them. 2. We don't double check when we save the order to see if it is too old: by then, it's too late. A malicious uplink could just submit orders outside of the grace window and receive all the data, but the node would just not commit it, so the uplink gets free traffic. Because the endpoints also check for the order being too old, this would be a very tight race that depends on knowledge of the node system clock, but best to not have the race exist. Instead, we piggy back off of the in flight tracking and do the check when we start to handle the order, and commit at the end. 3. Change the functions that send orders and list unsent orders to accept a time at which that operation is happening. This way, in tests, we can pretend we're listing or sending far into the future after the windows are available to send, rather than exposing test functions to modify internal state about the grace period to get the desired effect. This brings tests closer to actual usage in production. 4. Change the calculation for if an order is allowed to be enqueued due to the grace period to just look at the order creation time, rather than some computation involving the window it will be in. In this way, you can easily answer the question of "will this order be accepted?" by asking "is it older than X?" where X is the grace period. 5. Increases the frequency we check to send up orders to once every 5 minutes instead of once every hour because we already have hour-long buffering due to the windows. This decreases the maximum latency that an order will be reported back to the satellite by 55 minutes. Change-Id: Ie08b90d139d45ee89b82347e191a2f8db1b88036	2020-08-19 19:42:33 +00:00
Moby von Briesen	708cb48aa6	storagenode/orders: implement orders filestore on storagenode * Add all new orders to the orders filestore instead of the database. * Submit orders from the filestore to the new satellite SettleWindow endpoint. The orders filestore will eventually replace the orders DB completely. For now, we will still be checking the orders DB and submitting those orders if they exist. In a later release, we will completely remove the orders DB, but we need both the DB and filestore for the transitionary period. Change-Id: Iac8780fd5ab770296181bbd313e1d335f072d4dc	2020-08-19 15:00:35 +00:00
Egon Elbre	b4c8e219c7	satellite/orders: calculate order expiration inside signer Change-Id: I07f79eeb1ab41b061a1f3146f684bd21291cffb0	2020-08-18 13:21:16 +03:00
Egon Elbre	189ab07846	satellite/orders: use Signer in CreateGetOrderLimits Change-Id: Icb7ed4f1af1dabbbb68cb6f6e1f86d93a9b5faa3	2020-08-18 13:20:00 +03:00
Egon Elbre	cd5e99ea6b	satellite/orders: Signer for simplifying signing logic Create a separate struct for signing order limits. Change-Id: I8f8f5245040efa8c03138512be9248d4834f3f36	2020-08-18 13:19:16 +03:00
Qweder93	01bb2bd17d	satellite/audit: verifier checks if node made sucess GE before auditing Change-Id: Ia6cde4e9fcf11020a5301d38065f7159f276eb80	2020-08-17 23:37:57 +03:00
Egon Elbre	94a09ce20b	all: add missing dots Change-Id: I93b86c9fb3398c5d3c9121b8859dad1c615fa23a	2020-08-11 17:50:01 +03:00
Jeff Wendling	85a74b47e7	satellite/orders: 3-phase rollout This adds a config flag orders.window-endpoint-rollout-phase that can take on the values phase1, phase2 or phase3. In phase1, the current orders endpoint continues to work as usual, and the windowed orders endpoint uses the same backend as the current one (but also does a bit extra). In phase2, the current orders endpoint is disabled and the windowed orders endpoint continues to use the same backend. In phase3, the current orders endpoint is still disabled and the windowed orders endpoint uses the new backend that requires much less database traffic and state. The intention is to deploy in phase1, roll out code to nodes to have them use the windowed endpoint, switch to phase2, wait a couple days for all existing orders to expire, then switch to phase3. Additionally, it fixes a bug where a node could submit a bunch of orders and rack up charges for a bucket. Change-Id: Ifdc10e09ae1645159cbec7ace687dcb2d594c76d	2020-08-03 17:01:42 +00:00
Egon Elbre	36ed939b89	satellite/orders: add buckets db to service We need to add bucket UUID into the order limit, hence we need access to the buckets table. Change-Id: I348ce1f709c9fcdec5c4034acaab59805b33da9f	2020-07-24 17:36:49 +03:00
Egon Elbre	44f9193404	satellite/orders: make optimal threshold multiplier into an argument It feels weird having a repairer configuration part of order services. Let's have a single source of truth for it. Change-Id: I24f7c897aec80f3293f8af24876cbb6733d85a0b	2020-07-24 16:35:59 +03:00
Egon Elbre	ba4c3d9986	satellite/orders: remove unused node status logging flag Change-Id: I24da78a11cc5d3d88cdf6aca85c4238e4086e59c	2020-07-24 16:35:59 +03:00
Egon Elbre	b84923558b	satellite: fix scoping, formatting Change-Id: I21ef9edc2d449d75ad74891df7f966fb150d80fd	2020-07-16 19:13:14 +03:00
Egon Elbre	080ba47a06	all: fix dots Change-Id: I6a419c62700c568254ff67ae5b73efed2fc98aa2	2020-07-16 14:58:28 +00:00
Jeff Wendling	1944d734ef	satellite/orders: check and enforce node api version Change-Id: Ibdeb1a85dfed8b534bfed32a7cdaae5c3dc8b420	2020-07-16 10:38:12 +00:00
Jeff Wendling	3a8766936b	satellite/orders: remove race condition in new endpoint tests the flush batch size was set to 1 which means that a flush was async scheduled after the first write. the explicit trigger wait was then always flushing nothing, and the test would only pass if the async flush was scheduled before the read. remove that async flush and pause the flush loop so that we are in full control of when the flushes happen so there are no races. the tests are still disabled but that's because the endpoint is still disabled. Change-Id: I2b7b07fd5525388c30be8efbf4af7105087228da	2020-07-14 15:12:33 -04:00
stefanbenten	257855b5de	all: replace == comparison with errors.Is Change-Id: I05d9a369c7c6f144b94a4c524e8aea18eb9cb714	2020-07-14 15:50:25 +00:00
Jessica Grebenschikov	1f1e3f0604	satellite/orders: disable settlementwithwindow, skip flaky tests Change-Id: Ia60d7e0f2d383919650cdc736ba4569bb26ff2d7	2020-07-14 07:17:18 -07:00
Jessica Grebenschikov	8abb907010	satellite/orders: add settle orders with window Why: We need a way to cut down on database traffic due to bandwidth measurement and tracking. What: This changeset is the Satellite side of settling orders in 1 hr windows. See design doc for more details: https://review.dev.storj.io/c/storj/storj/+/1732 Change-Id: I2e1c151e2e65516ebe1b7f47b7c5f83a3a220b31	2020-07-13 15:41:29 -07:00
Egon Elbre	06a3510ae4	satellite/orders: add EncryptionKey encryption Add encryption using nacl/secretbox. Change-Id: I0add31a9fd2359be1bf9d3e43b6c4b9ff3b6fb03	2020-07-13 16:10:19 +00:00
Egon Elbre	5bdcd86fa7	ci: test benchmarks This runs each benchmark for one iteration to ensure that they are valid. Unfortunately, it does not give any useful metrics as output. Change-Id: I68940398c8dd849aed656bd12656f48d5df10128	2020-07-10 13:26:49 +00:00
Jessica Grebenschikov	41497569cd	satellite/orders: add settled amount to rollups write cache https://storjlabs.atlassian.net/browse/SM-1109 Change-Id: Ic5859b141c1384157b33df0d7fb6c8b43cc8a6b1	2020-07-07 14:46:06 +00:00
Egon Elbre	735dc6e163	satellite/orders: add encryption keys This adds EncryptionKey definition that can be used as a flag. These order.EncryptionKey-s will be used to encrypt data in order limits. This helps to avoid storing lots of transient data in the main database. This code doesn't yet contain encryption itself. Change-Id: I2efae102a89b851d33342a0106f8d8b3f35119bb	2020-06-30 15:03:14 +00:00
Egon Elbre	df8cf8f58a	satellite/orders: delete unused code Change-Id: I431c8cc2f23e538c676d6f742fb1faef7cc1d73e	2020-06-25 16:48:26 +03:00
Egon Elbre	f6301612ac	satellite/orders: use serial SerialNumber By ensuring that they have less randomness it means that they can be compressed better. Using a timestamp should be a good improvement here. Change-Id: Ic4dabb53335a744ff1c332dd279f37ae2cd79357	2020-05-15 11:33:53 +00:00
Egon Elbre	7d29f2e0d3	all: remove drpc wrappers Change-Id: I45016f7d2a771dc00776196c1f531f3343e93b40	2020-05-11 08:20:34 +03:00
Egon Elbre	e6d5ce6b77	all: remove grpc It seems everyone has migrated to drpc. Change-Id: Ica6b2d0bdef68c6603083f2963458843eca71e9e	2020-05-10 06:36:09 +00:00
Egon Elbre	9052085f70	private/testplanet: simplify uplink usage Change-Id: I3e488dc296f1094ce95e6d6597ca6d3f8da90a76	2020-04-16 16:45:55 +00:00
Jeff Wendling	a409bd5dec	satellite/orders: check for expired orders first there are a subset of storagenodes hammering the satellite with expired orders. if we check for expiration first, we don't have to do a bunch of pointless signature verification. since a && b is equal to b && a, we can order these checks in any way we want and have it still be correct. Change-Id: I6ffc8025c8b0d54949a1daf5f5ea1fed9e213372	2020-04-02 12:35:11 -06:00
Egon Elbre	6492b13d81	all: remove old uuid Change-Id: I3a137f73456f010c37d3933dbe12cbbb840b809f	2020-04-02 19:30:36 +03:00
Egon Elbre	1024bf9ce1	all: simplify uuid usage Instead of uuid.Parse, use uuid.FromString. This removes a bunch of pointer management logic. Change-Id: Id25bd174eb43c71d00b450158a198abafd8958f2	2020-04-02 13:45:19 +00:00
Egon Elbre	0a69da4ff1	all: switch to storj.io/common/uuid Change-Id: I178a0a8dac691e57bce317b91411292fb3c40c9f	2020-03-31 19:16:41 +03:00
Egon Elbre	439aba922a	satellite/overlay: reduce overhead of GetNodes Instead of filtering on the client side it's better to filter on the database side. Change-Id: I845fbbe5ed28c2ffdb0b8a3f789b59c094fd1069	2020-03-30 18:36:23 +03:00
Egon Elbre	cb781d66c7	satellite/overlay: optimize FindStorageNodes Reduce the number of fields returned from the query. Benchmark results in `satellite/overlay`: benchstat before.txt after2.txt name old time/op new time/op delta SelectStorageNodes-32 7.85ms ± 1% 6.27ms ± 1% -20.18% (p=0.002 n=10+4) SelectNewStorageNodes-32 8.21ms ± 1% 6.61ms ± 0% -19.53% (p=0.002 n=10+4) SelectStorageNodesExclusion-32 17.2ms ± 1% 15.9ms ± 1% -7.55% (p=0.002 n=10+4) SelectNewStorageNodesExclusion-32 17.8ms ± 2% 16.1ms ± 0% -9.38% (p=0.002 n=10+4) FindStorageNodes-32 48.4ms ± 1% 45.1ms ± 0% -6.69% (p=0.002 n=10+4) FindStorageNodesExclusion-32 79.2ms ± 1% 76.1ms ± 1% -3.89% (p=0.002 n=10+4) Benchmark results from `satellite/overlay` after making them parallel: benchstat before-parallel.txt after2-parallel.txt name old time/op new time/op delta SelectStorageNodes-32 548µs ± 1% 353µs ± 1% -35.60% (p=0.029 n=4+4) SelectNewStorageNodes-32 562µs ± 0% 368µs ± 0% -34.51% (p=0.029 n=4+4) SelectStorageNodesExclusion-32 1.02ms ± 1% 0.84ms ± 0% -18.08% (p=0.029 n=4+4) SelectNewStorageNodesExclusion-32 1.03ms ± 1% 0.86ms ± 2% -16.22% (p=0.029 n=4+4) FindStorageNodes-32 3.11ms ± 0% 2.79ms ± 1% -10.27% (p=0.029 n=4+4) FindStorageNodesExclusion-32 4.75ms ± 0% 4.43ms ± 1% -6.56% (p=0.029 n=4+4) Change-Id: I1d85e2764eb270f4c2b1998303ccfc1179d65b26	2020-03-30 18:36:23 +03:00
JT Olio	5511827662	satellite/orders: don't log expired order limits we still need to come up with a better plan to get storage nodes to stop doing this, but in the meantime, we know this is happening, just stop logging it and keep some stats instead. Change-Id: Icb6bcba275e0e955c54b1a90da2b37219fff2349	2020-03-26 22:31:10 -06:00
Jeff Wendling	115f4559e5	satellite/orders: more efficient processing of orders by doing an indexed anti-join we're able to reduce the time to select the pending orders by over 10x on postgres. this should help us process pending orders much more quickly. it probably won't do as good a job on cockroach because it does not do an indexed anti-join and instead does a hash join after scanning the entire consumed serials table. we should either remove orders entirely or try to make that more efficient when necessary. Change-Id: I8ca0535acd21c51e74955b24c9b86d20e4f2ff9c	2020-03-18 09:03:30 +00:00
Jeff Wendling	7baa59753a	satellite/orders: add tests for double sending the same order Change-Id: If2fa7f035257df3b04f506f81aa8b2e0916f5033	2020-03-17 14:18:03 +00:00
Ethan	bdbf764b86	satellite/orders;overlay: Consolidate order limit storage node lookups into 1 query. https: //storjlabs.atlassian.net/browse/SM-449 Change-Id: Idc62cc2978fba67cf48f7c98b27b0f996f9c58ac	2020-03-16 23:15:47 +00:00
Bill Thorp	94c11c5212	satellite: remove some unnecessary UTC() calls Fixes some easy cases of extraneous UTC() calls Change-Id: I3f4c287ae622a455b9a492a8892a699e0710ca9a	2020-03-13 13:49:44 +00:00
Jess G	39cb821196	satellite/overlay: rm combinedcache, fix IP naming to be network (#3798 ) * rn combinedcache, rm dns node lookup Change-Id: I239f07211764b097d851230d8c81900a47756e9e * excludeIPs -> excludedNetworks Change-Id: Ifa6f44ab17457cdd5aff4cd5694296867c18b179 * use lowercase var name Change-Id: I825aad2b718c71f455e747be18f8cabd02aabe55 * update Getnetwork name Change-Id: I002a1b7bc6b4ef40159c0cd2b0ef209f80a9c503 * fix comments Change-Id: Ibddf5b9ffa9d685af6c392d893db063ef18e45fa * update comments with ipv6 Change-Id: I31758b7d4979e7c27d014668f4fb532ad838cda2 Co-authored-by: Stefan Benten <mail@stefan-benten.de>	2020-03-12 11:37:57 -07:00
Jessica Grebenschikov	803e2930f4	satellite: use IP for all uplink operations, use hostname for audit and repairs My understanding is that the nodes table has the following fields: - `address` field which can be a hostname or an IP - `last_net` field that is the /24 subnet of the IP resolved from the address This PR does the following: 1) add back the `last_ip` field to the nodes table 2) for uplink operations remove the calls that the satellite makes to `lookupNodeAddress` (which makes the DNS calls to resolve the IP from the hostname) and instead use the data stored in the nodes table `last_ip` field. This means that the IP that the satellite sends to the uplink for the storage nodes could be approx 1 hr stale. In the short term this is fine, next we will be adding changes so that the storage node pushes any IP changes to the satellite in real time. 3) use the address field for repair and audit since we want them to still make DNS calls to confirm the IP is up to date 4) try to reduce confusion about hostname, ip, subnet, and address in the code base Change-Id: I96ce0d8bb78303f82483d0701bc79544b74057ac	2020-03-11 09:11:40 -07:00
Jessica Grebenschikov	2af71f3460	satellite/orders: add monkit to looking up node addr Change-Id: Ia0eb0ffc343879a6ef9827d46e936e1fbc2e198a	2020-03-04 23:15:18 +00:00
Fadila Khadar	5c9becb9be	satellite/orders: billing partial download Submit an order limit with a high amount but the order has a low amount of traffic. Make sure the order amount is used for billing. Change-Id: I6b6ae26e9b8896f4a3acf530b2f48510b6df89cc	2020-03-04 17:12:50 +00:00
Egon Elbre	64330c55b3	all: use pbgrpc common/pb moved grpc to a separate package common/pb/pbgrpc. This updates this repository to use it. Change-Id: I2de2a190688871cf9cb61f7ea511f8a01e264e4e	2020-02-26 21:27:47 +02:00
Jeff Wendling	f671eb2beb	satellite/satellitedb: use queue for orders to get back fast billing This change adds two new tables to process orders as fast as we used to but in an asynchronous manner and with hopefully less storage usage. This should help scale on cockroach, but limits us to one worker. It lays the groundwork for the order processing pipeline to be queue rather than database driven. For more details, see the added fast billing changes blueprint. It also fixes the orders db so that all the timestamps that are passed to columns that do not contain a time zone are converted to UTC at the last possible opportunity, making it less likely to use the APIs incorrectly. We really should migrate to include timezones on all of our timestamp columns. Change-Id: Ibfda8e7a3d5972b7798fb61b31ff56419c64ea35	2020-02-24 17:07:07 +00:00
Egon Elbre	5342dd9fe6	go.mod: update uplink Change-Id: I867a6a1eef8aa5d60bb676e5112b98c4192ce811	2020-02-21 16:08:12 +02:00
Ivan Fraixedes	1a84a00cc9	satellite/orders: Fix doc comments Enhance the documentation of the UseSerialNumber method (interface and implementation) and add several missing dots in doc comments of the methods of the same interface and implementation. Change-Id: I792cd344f0d2542e060fa2ec288b71231cae69de	2020-02-18 13:03:23 +01:00
Simon Guindon	961944f24d	satellite/orders: Resolve storage node addresses to IP addresses. This change resolves all the storage node addresses to their IP addresses before giving them to the uplink so that the uplink doesn't have to resolve a hundred hosts and can immediately connect to improve uplink performance. Change-Id: Idb834351e0fece409d74c8a1c29b0b8c9b09c9ff	2020-02-11 18:44:45 +02:00
Jeff Wendling	7999d24f81	all: use monkit v3 this commit updates our monkit dependency to the v3 version where it outputs in an influx style. this makes discovery much easier as many tools are built to look at it this way. graphite and rothko will suffer some due to no longer being a tree based on dots. hopefully time will exist to update rothko to index based on the new metric format. it adds an influx output for the statreceiver so that we can write to influxdb v1 or v2 directly. Change-Id: Iae9f9494a6d29cfbd1f932a5e71a891b490415ff	2020-02-05 23:53:17 +00:00
Jessica Grebenschikov	a1948ed338	satellite/orders: add old method for CreateGetOrderLimitsOld to maintain compatibility with old versions of the uplink Change-Id: I7ce1f4fbc6217f1d340cf778c4b010d40961b3f0	2020-01-28 18:54:24 -05:00
Jessica Grebenschikov	54dbaaece2	satellite/orders: create as many orderLimits as needed to download a file Change-Id: I2a39483d35037d9940913c035a78a93ea692ce9f	2020-01-28 20:04:11 +00:00
paul cannon	a0a94a9ac7	satellite/satellitedb: insert into reported_serials w/ arrays Change-Id: Icb682de09ded3e3159e3590594dcf13f2e7f40f0	2020-01-24 18:36:21 -06:00
littleskunk	a6c6440ab7	satellite/order: decrease expire time from 7 days to 2 days For the last few month we had no issues with order submission. I would call it stable and now it is time to risk a lower expire time. This will increase the database performance on the satellite and it will reduce the delay for billing. The long term goal is 6h but for that step we need to change graceful exit first. At the moment storage nodes would get disuqlaified for not transfering alle pieces in less than 6 hours. Change-Id: I421a2c2421c5374c4e706e2338f1c2161fedc14c	2020-01-24 23:37:39 +00:00
Jeff Wendling	26e33e7e07	satellite/gracefulexit: make orders with right bucket id and action paths are organized as follows: project_id/segment_index/bucket_name/encrypted_key so by picking parts[0] and parts[1], we were using the segment index instead of the bucket name, causing bandwidth to be accounted for incorrectly. additionally, we were using the PUT action instead of the PUT_GRACEFUL_EXIT action, causing the data to be charged incorrectly. we use PUT_REPAIR for now because nodes won't accept uploads with PUT_GRACEFUL_EXIT and our tables need migrations to handle rollups with it. Change-Id: Ife2aff541222bac930c35df8fcf76e8bac5d60b2	2020-01-24 19:27:38 +00:00
Cameron Ayer	494fead7af	satellitedb/orders: fix comma bug in SQL stmt Change-Id: Ibc6024eeeb5aa4de3909c0cec2d01ac0a01c809f	2020-01-24 13:58:32 -05:00
Jeff Wendling	665ed3b6b1	satellite/satellitedb: fix issue with shared memory on range for bucket rollups A uuid.UUID is an array of bytes, and slicing it refers to the underlying value, much like taking the address. Because range in Go reuses the same value for every loop iteration, this means that later iterations would overwrite earlier stored project ids. We fix that by making a copy of the value before slicing it for every loop iteration. Change-Id: Iae3f11138d11a176ce360bd5af2244307c74fdad	2020-01-23 21:57:02 -07:00
Isaac Hess	40a890639d	satellite/orders: Flush all pending bandwidth rollup writes on shutdown Currently we risk losing pending bandwidth rollup writes even on a clean shutdown. This change ensures that all pending writes are actually written to the db when shutting down the satellite. Change-Id: Ideab62fa9808937d3dce9585c52405d8c8a0e703	2020-01-23 08:12:41 -07:00
Isaac Hess	960e103082	satellite/orders: Rename orders_write_cache to rollups_write_cache Change-Id: Icffca37e40bb8b2927b38d97728575321c2ad90c	2020-01-23 08:12:41 -07:00
Isaac Hess	0548c3f6bf	satellite/orders: RollupsWriteCache has a single method to reset cache Change-Id: I3ae18115dccd7ac8369313bd96951b9da6464cf3	2020-01-23 08:12:41 -07:00
Michal Niewrzal	6502454947	satellite/metainfo: move RS configuration to satellite With this change RS configuration will be set on satellite. Uplink with get RS values with BeginObject request and will use it. For backward compatibility and to avoid super large change redundancy scheme stored with bucket is not touched. This can be done in future. Change-Id: Ia5f76fc10c37e2c44e4f7b8754f28eafe1f97eff	2020-01-22 09:33:53 +00:00
Egon Elbre	f3b4bf2b7c	satellite/satellitedb/satellitedbtest: pass ctx as an argument ctx is created in most tests, instead pass in as argument to reduce code duplication. Change-Id: I466c51c008392001129c8b007c9d6b3619935ac4	2020-01-20 16:35:42 +02:00
stefanbenten	f4097d518c	satellite: reduce logging of node status Change-Id: I6618cf4bf31b856acd7a28b54011a943c03ab22a	2020-01-18 17:47:59 +00:00
Jessica Grebenschikov	955abd9293	satellite/satellitedb/orders: add multi row upserts to process orders Change-Id: I00d8b55ee74b443fb328bd3a4378308cefa368e4	2020-01-16 23:51:46 +00:00
Jeff Wendling	696d98a232	satellite/satellitedb: fix nitpicks and timestamp issue found in review warning: databases migrated to version 77 before this commit is merged must be manually re-migrated. this should not be a problem for anything but staging databases. Change-Id: Ie1631c48379472352014183ee43f1465e22200f7	2020-01-16 21:22:38 +00:00
Jeff Wendling	f42851b1ab	satellite/satellitedb: remove the big honkin mutex no longer necessary/desired with reported_serials. Change-Id: I69b5c535488eb5f98b250d73a7c8e6deaed0254e	2020-01-15 19:24:35 -07:00
Jeff Wendling	78c6d5bb32	satellite/satellitedb: reported_serials table for processing orders this commit introduces the reported_serials table. its purpose is to allow for blind writes into it as nodes report in so that we have minimal contention. in order to continue to accurately account for used bandwidth, though, we cannot immediately add the settled amount. if we did, we would have to give up on blind writes. the table's primary key is structured precisely so that we can quickly find expired orders and so that we maximally benefit from rocksdb path prefix compression. we do this by rounding the expires at time forward to the next day, effectively giving us storagenode petnames for free. and since there's no secondary index or foreign key constraints, this design should use significantly less space than the current used_serials table while also reducing contention. after inserting the orders into the table, we have a chore that periodically consumes all of the expired orders in it and inserts them into the existing rollups tables. this is as if we changed the nodes to report as the order expired rather than as soon as possible, so the belief in correctness of the refactor is higher. since we are able to process large batches of orders (typically a day's worth), we can use the code to maximally batch inserts into the rollup tables to make inserts as friendly as possible to cockroach. Change-Id: I25d609ca2679b8331979184f16c6d46d4f74c1a6	2020-01-15 19:21:21 -07:00
Jeff Wendling	3b99f03780	satellite/orders: add monitoring to bucket bandwidth cache operations Change-Id: Ib14303fc9f97a133410e2d6e2cf532e468b3dcee	2020-01-13 17:36:40 -07:00
Isaac Hess	4950d7106a	satellite/orders: Add write cache for bw rollups Change-Id: I8ba454cb2ab4742cafd6ed09120e4240874831fc	2020-01-13 22:40:51 +00:00
Jeff Wendling	71ec0ad374	satellite/satellitedb: add big honkin mutex to ProcessOrders the hope is that it is mostly interfering with itself, so this will make it not do that (well, N api servers, but hopefully that's not enough to cause it to have issues). Change-Id: Ifd0c9e6617457785ab25fe5b714d8556cdc8e2d3	2020-01-13 11:33:12 -07:00
littleskunk	bcc23f6869	Satellite/orders: remove allocated bandwith from storagenode_bandwidth_rollups When an uplink requests an upload or download from the satellite we are trackig the allocated bandwidth twice. The value in bucket_bandwidth_rollups is used for project limits but the value in storagenode_bandwidth_rollups is not used at all. We can increase the performance by removing it. Uplinks will get a faster response from the satellite. Change-Id: Icccd41f94107ef34668f30f99bf5f728c384b07e	2020-01-12 16:20:47 +01:00
Egon Elbre	082ec81714	uplink: move to storj.io/uplink (#3746 )	2020-01-08 15:40:19 +02:00
Egon Elbre	2680bae88c	private/testplanet: remove dependency to uplink Remove direct dependency on uplink.RSConfig, this simplifies moving the config file without introducing weird dependencies. Change-Id: I7fd2a145401e0205d7047631df9d2810241efeec	2020-01-02 09:40:46 +00:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Yingrong Zhao	7af42e3c10	satellite/metainfo, satellite/repair, uplink/eestream: add metric for download failed due to not enough pieces available (#3665 )	2019-12-04 16:24:36 -05:00
Egon Elbre	ee6c1cac8a	private: rename internal to private (#3573 )	2019-11-14 21:46:15 +02:00
Ethan Adams	3e0d12354a	storagenode/gracefulexit: Implement storage node graceful exit worker - part 1 (#3322 )	2019-10-22 16:42:21 -04:00
Egon Elbre	3c438f31bd	satellite/satellitedb: remove sqlite support (#3296 )	2019-10-19 00:27:57 +03:00
Ethan Adams	a1275746b4	satellite/gracefulexit: Implement the 'process' endpoint on the satellite (#3223 )	2019-10-11 17:18:05 -04:00
Jeff Wendling	098cbc9c67	all: use pkg/rpc instead of pkg/transport all of the packages and tests work with both grpc and drpc. we'll probably need to do some jenkins pipelines to run the tests with drpc as well. most of the changes are really due to a bit of cleanup of the pkg/transport.Client api into an rpc.Dialer in the spirit of a net.Dialer. now that we don't need observers, we can pass around stateless configuration to everything rather than stateful things that issue observations. it also adds a DialAddressID for the case where we don't have a pb.Node, but we do have an address and want to assert some ID. this happened pretty frequently, and now there's no more weird contortions creating custom tls options, etc. a lot of the other changes are being consistent/using the abstractions in the rpc package to do rpc style things like finding peer information, or checking status codes. Change-Id: Ief62875e21d80a21b3c56a5a37f45887679f9412	2019-09-25 15:37:06 -06:00
Jeff Wendling	0dcbd3dc08	bootstrap/satellite/certificate/storagenode: register drpc services Change-Id: Id29f14b76a8c9cb2be31001b9a7a4356a4bda183	2019-09-12 15:09:46 -06:00
Natalie Villasana	aa3567187e	satellite/audit: worker now verifies and reverifies (#2965 )	2019-09-11 18:37:01 -04:00
Egon Elbre	a801fab66a	all: add archview annotations (#2964 )	2019-09-10 16:24:16 +03:00
ethanadams	4ede12a2ab	satellite/orders: Fix for V3-2529: Release v0.19.0 storage nodes can't submit orders, duplicate key value violates unique constraint (#2900 ) * V3-2529: Add DB savepoint to fix issue with postgres. Add test force a rejected order Co-Authored-By: Ivan Fraixedes <ivan@fraixed.es> * Update satellite/satellitedb/orders.go	2019-08-29 11:14:10 -04:00
Cameron	599324c364	satellite/dbcleanup: delete expired serials from satellite (#2867 ) Creates a new chore, dbcleanup, which can be used for routine deletion of items from the satellite database and adds functionality for deletion of expired serial numbers	2019-08-27 13:12:38 -04:00
Cameron	3d9441999a	storagenode/orders: add archive cleanup to orders service (#2821 ) This PR introduces functionality for routine deletion of archived orders. The user may specify an interval at which to run archive cleanup and a TTL for archived items. During each cleanup, all items that have reached the TTL are deleted This archive cleanup job is combined with the order sender into a new combined orders service	2019-08-22 10:33:14 -04:00
Egon Elbre	2d69d47655	all: fix Error.New formatting (#2840 )	2019-08-21 19:30:29 +03:00
ethanadams	1a69ec8318	satellite/orders: document protocol and fix typos (#2813 ) * Addressing comments from PR 2762 * Rebuild of orders.pb.go after comments added to proto file * run update-satellite-config-lock for spelling fix.	2019-08-19 09:36:11 -04:00
Ivan Fraixedes	e47b8ed131	storagenode: No FATAL error when unsent orders aren't found (#2801 ) * pkg/process: Fatal show complete error information Change the general process execution function to not using the sugared logger for outputting the full error information. Delete some unreachable code because Zap logger Fatal method calls exit 1 internally. * storagenode/storagenodedb: Add info to error Add more information to an error returned due to some data inconsistency. * storagenode/orders: Don't use sugared logger Don't use sugar logger and provide better contextualized error messages in settle method. * storagenode/orders: Add some log fields to error msgs Add some relevant log fields to some logged errors of the sender settle method. * satellite/orders: Remove always nil error from debug Remove an error which as logged in debug level which was always nil and makes the logic that used this variable clear. * storagenode/orders: Don't return error Archiving unsent Don't stop the process which archive unsent orders if some of them aren't found the DB because it cause the Storage Node to stop with a fatal error.	2019-08-16 16:53:22 +02:00
ethanadams	8df683a265	Update satellite settlement endpoint to batch order processing into transactions. (#2762 ) Update satellite settlement endpoint to batch order processing into transactions	2019-08-15 15:05:43 -04:00
Egon Elbre	48211daa9d	uplink/piecestore: handle Download errors better (#2771 )	2019-08-14 12:02:58 +03:00
aligeti	32f95a14fd	satellite/certdb: remove certdb that was used to store uplink certificates (#2760 ) * satellitedb/certDB: refactors of the node certificate storage DB table The existing implementation doesnt allow to store the complete certificate chain of uplinkIDs or storagenodeIDs, so the current table is dropped and new table will be added which addresses the storage and retrieval of certificates pkg/identity: fixes spelling mistakes that I missed on PR#2754 Fixes V3-1992/V3-2388	2019-08-12 10:41:34 -04:00
Egon Elbre	c8edeb0257	satellite/overlay: rename overlay.Cache to overlay.Service (#2717 )	2019-08-06 19:35:59 +03:00
Egon Elbre	5d0816430f	rename all the things (#2531 ) * rename pkg/linksharing to linksharing * rename pkg/httpserver to linksharing/httpserver * rename pkg/eestream to uplink/eestream * rename pkg/stream to uplink/stream * rename pkg/metainfo/kvmetainfo to uplink/metainfo/kvmetainfo * rename pkg/auth/signing to pkg/signing * rename pkg/storage to uplink/storage * rename pkg/accounting to satellite/accounting * rename pkg/audit to satellite/audit * rename pkg/certdb to satellite/certdb * rename pkg/discovery to satellite/discovery * rename pkg/overlay to satellite/overlay * rename pkg/datarepair to satellite/repair	2019-07-28 08:55:36 +03:00
Ivan Fraixedes	f420b29d35	[V3-1927] Repairer uploads to max threshold instead of success… (#2423 ) * pkg/datarepair: Add test to check num upload pieces Add a new test for ensuring the number of pieces that the repair process upload when a segment is injured. * satellite/orders: Don't create "put order limits" over total Repair must not create "put order limits" more than the total count. * pkg/datarepair: Update upload repair pieces test Update the test which checks the number of pieces which are uploaded during a repair for using the same excess over the success threshold value than the implementation. * satellites/orders: Limit repair put order for not being total Limit the number of put orders to be used by repair for only uploading pieces to a % excess over the successful threshold. * pkg/datarepair: Change DataRepair test to pass again Make some changes in the DataRepair test to make pass again after the repair upload repaired pieces only until a % excess over success threshold. Also update the steps description of the DataRepair test after it has been changed, to match on what's now, besides to leave it more generic for avoiding having to update it on minimal future refactorings. * satellite: Make repair excess optimal threshold configurable Add a new configuration parameter to the satellite for being able to configure the percentage excess over the optimal threshold, used for determining how many pieces should be repaired/uploaded, rather than having the value hard coded. * repairer: Add configurable param to segments/repairer Add a new parameters to the segment/repairer to calculate the maximum number of excess nodes, based on the optimal threshold, that repaired pieces can be uploaded. This new parameter has been added for not returning more nodes than the number of upload orders for data repair satellite service calculate for repairing pieces. * pkg/storage/ec: Update log message in clien.Repair * satellite: Update configuration lock file	2019-07-12 00:44:47 +02:00

1 2 3 4

188 Commits