storj

Author	SHA1	Message	Date
Cameron Ayer	02613407ae	satellite/satellitedb: only suspend node if not already suspended Whenever the node's reputation is updated, if its unknown audit reputation is below the suspension threshold, its suspension field is set to the current time. This could overwrite the previous "suspendedAt" value, resulting in a node that never reaches the end of its suspension. Also log whenever a node is disqualified or its suspension status changes. Change-Id: I5e8c8f1c46f66d79cb279b5b16a84fe03f533deb	2020-04-10 09:37:37 +00:00
Egon Elbre	d86cce202c	satellite/satellitedb: use arrays for arguments in node selection This simplifies the code and makes queries faster: name old time/op new time/op delta SelectStorageNodes-32 7.72ms ± 6% 7.22ms ± 3% -6.44% (p=0.016 n=5+5) SelectNewStorageNodes-32 7.75ms ± 2% 7.37ms ± 1% -4.89% (p=0.008 n=5+5) SelectStorageNodesExclusion-32 16.9ms ± 0% 16.6ms ± 0% -2.15% (p=0.008 n=5+5) SelectNewStorageNodesExclusion-32 17.2ms ± 0% 16.6ms ± 2% -3.69% (p=0.008 n=5+5) FindStorageNodes-32 45.5ms ± 0% 45.1ms ± 1% ~ (p=0.056 n=5+5) FindStorageNodesExclusion-32 77.4ms ± 0% 75.9ms ± 0% -1.91% (p=0.008 n=5+5) Change-Id: I38f77f6282b9738e8416113d42c6acb46c03da7b	2020-04-09 21:16:10 +03:00
Egon Elbre	ccf4f9ed2d	satellite/satellitedb: node selection code cleanup Reduce the number of non-methods to reduce funcs in the namespace also combine a func to slightly condense the code more. Change-Id: Ifbe728eb8c8ca4c981df648decd259c2097b6b40	2020-04-09 20:41:29 +03:00
Natalie Villasana	cf80b3caf3	satellite/overlay: combine SelectStorageNodes and SelectNewStorageNodes (#3831)	2020-04-09 11:19:44 -04:00
Egon Elbre	11a44cdd88	all: don't depend on gogo/proto directly Change-Id: I8822dea0d1b7b99e0b828e0373a0308a42dde2be	2020-04-08 17:32:15 +00:00
Egon Elbre	cf26951a5b	satellite/satellitedb/pbold: remove dead code Change-Id: I7464773c20b8f99a601ca9cc4bee804f1ac14cf9	2020-04-08 15:22:31 +03:00
Jeff Wendling	2ded64ba2c	satellite/compensation: more fixes to get prod running smoothly Change-Id: I13a76d9d49222fb10796415a015f224d4084fde3	2020-04-07 10:10:27 +00:00
Jennifer Johnson	1547e791a3	satellitedb: remove free_bandwidth column from nodes table Change-Id: I9d1d3de9216c6533c1042ef473631721a011d086	2020-04-06 09:30:28 +00:00
Egon Elbre	9200efc61f	satellite/satellitedb: fix selecting a nullable string Change-Id: I59e645966e09da586512c69101691b47055c1e5a	2020-04-03 21:30:20 +03:00
Egon Elbre	6492b13d81	all: remove old uuid Change-Id: I3a137f73456f010c37d3933dbe12cbbb840b809f	2020-04-02 19:30:36 +03:00
Egon Elbre	8f73fb7a32	all: simplify uuid usage uuid.UUID implements driver.Value so it can be directly used as a scannable result. Replace uses of dbutil.BytesToUUID with uuid.FromBytes. Change-Id: I51a670185ceb3cc2199d5aa2b76bc3fc191ca8fe	2020-04-02 05:48:58 +00:00
Egon Elbre	a416b03941	satellite/accounting: fix TestProjectBandwidthTotal Test was inserting for past 4 days, however the test was summing up for the current month. Change-Id: I509afdc6a76b314a6bb90652ab70cd2c2bab1288	2020-04-01 11:50:18 +03:00
Egon Elbre	0a69da4ff1	all: switch to storj.io/common/uuid Change-Id: I178a0a8dac691e57bce317b91411292fb3c40c9f	2020-03-31 19:16:41 +03:00
Qweder93	dc32f1da55	storagenode/cache/heldamount added, errNoRows ignored Change-Id: If6b675e622d6c1324c0893c43cca93dc5323cd78	2020-03-31 11:35:58 +00:00
Jeff Wendling	e2ff2ce672	satellite: compensation package and commands Change-Id: I7fd6399837e45ff48e5f3d47a95192a01d58e125	2020-03-30 14:08:14 -06:00
Jennifer Johnson	d77f3b8786	satellitedb/migrate: set vetted_at backfill to now.day Change-Id: Ib2b12be43dbd3f3705b1891bc703ae15abb75e09	2020-03-30 16:50:23 +00:00
Egon Elbre	439aba922a	satellite/overlay: reduce overhead of GetNodes Instead of filtering on the client side it's better to filter on the database side. Change-Id: I845fbbe5ed28c2ffdb0b8a3f789b59c094fd1069	2020-03-30 18:36:23 +03:00
Egon Elbre	cb781d66c7	satellite/overlay: optimize FindStorageNodes Reduce the number of fields returned from the query. Benchmark results in `satellite/overlay`: benchstat before.txt after2.txt name old time/op new time/op delta SelectStorageNodes-32 7.85ms ± 1% 6.27ms ± 1% -20.18% (p=0.002 n=10+4) SelectNewStorageNodes-32 8.21ms ± 1% 6.61ms ± 0% -19.53% (p=0.002 n=10+4) SelectStorageNodesExclusion-32 17.2ms ± 1% 15.9ms ± 1% -7.55% (p=0.002 n=10+4) SelectNewStorageNodesExclusion-32 17.8ms ± 2% 16.1ms ± 0% -9.38% (p=0.002 n=10+4) FindStorageNodes-32 48.4ms ± 1% 45.1ms ± 0% -6.69% (p=0.002 n=10+4) FindStorageNodesExclusion-32 79.2ms ± 1% 76.1ms ± 1% -3.89% (p=0.002 n=10+4) Benchmark results from `satellite/overlay` after making them parallel: benchstat before-parallel.txt after2-parallel.txt name old time/op new time/op delta SelectStorageNodes-32 548µs ± 1% 353µs ± 1% -35.60% (p=0.029 n=4+4) SelectNewStorageNodes-32 562µs ± 0% 368µs ± 0% -34.51% (p=0.029 n=4+4) SelectStorageNodesExclusion-32 1.02ms ± 1% 0.84ms ± 0% -18.08% (p=0.029 n=4+4) SelectNewStorageNodesExclusion-32 1.03ms ± 1% 0.86ms ± 2% -16.22% (p=0.029 n=4+4) FindStorageNodes-32 3.11ms ± 0% 2.79ms ± 1% -10.27% (p=0.029 n=4+4) FindStorageNodesExclusion-32 4.75ms ± 0% 4.43ms ± 1% -6.56% (p=0.029 n=4+4) Change-Id: I1d85e2764eb270f4c2b1998303ccfc1179d65b26	2020-03-30 18:36:23 +03:00
Egon Elbre	e1a443b04a	private/testplanet: allow modifying created database Instead of providing the database from outside to testplanet create it inside and then allow wrapping and modifying it. This is more convenient to use. Change-Id: I9b8f69e6e0a19ff984b4e2bfe927c9100c77bc6c	2020-03-27 19:14:48 +00:00
Ethan	df462d7265	satellite/accounting: Add index on bucket_bandwidth_rollups to minimize full table scans https://storjlabs.atlassian.net/browse/SM-545 Change-Id: I5599a72a991d70236f17beca027e9bc032777177	2020-03-26 19:53:50 +00:00
Jeff Wendling	97e980cd8a	private/dbutil: add database name to configure as a tag storagenodes have like 10 or more databases. without this tag they all get sent as the same value, stomping on each other. Change-Id: Ib12019684d6ea8f2a5b83df584056dfa79e3c4b3	2020-03-26 16:50:15 +00:00
Jennifer Johnson	b75cbc8e24	satellite,storagenode: remove references to free bandwidth Change-Id: I42a6597544804fa9235e89ec656ebc365eb522e5	2020-03-25 22:28:34 +00:00
Michal Niewrzal	fdf40a7526	storj: remove `storj/private/version` package which was moved to `storj/private` repo Change-Id: I81c3f5b9d5e4fe7bca760999eb045ee9734e5e2e	2020-03-24 14:31:33 +00:00
Jessica Grebenschikov	aeab599d21	satellitedb: removed unused id on storagenode_storage_tallies table, add index on node_id The goal of this change is to improve the storagenode_storage_tallies table by removing the unneeded id column that is not being used but only taking up space, and also to add an index on a different column that needs it. Removing and adding a column seems simple, but ended up being more complicated because of some cockroachdb limitations. The cockroachdb limitations when trying to remove a column from a table and create a new primary key are: 1. only allows primary key creation at table creation time (docs: https://www.cockroachlabs.com/docs/stable/primary-key.html) 2. table drop or rename is performed async and cannot be done in a transaction (issue: https://github.com/cockroachdb/cockroach/issues/12123, https://github.com/cockroachdb/cockroach/issues/22868) To address these differences between cockroachdb and Postgres, this PR performs different migrations for the two databases. The Postgres migration is straightforward and what you would expect, but the cockroach migration has two main changes: 1. To change a primary key, use the recommended process from the cockroachdb docs to create a new table with the new primary key you want and then migrate the data. 2. In order to do 1, we needed to do the new table renaming in a separate transaction from the data migration. Ref: SM-65 Change-Id: Idc9aee3ab57aa4d5570e3d2980afea853cd966bf	2020-03-20 14:39:44 -07:00
Jennifer Johnson	9b78473c0c	satellitedb: adds vetted_at nullable timestamp to nodes table Change-Id: I42d5a396b4eecbad26b683c6aee51e043d2ff034	2020-03-20 01:37:28 +00:00
Qweder93	0df586c3a8	satellitedb/heldamount updated, tests added + storagenode console updated Change-Id: I10f568a426d0fc42069d025de2accbef5b26dc0c	2020-03-19 15:37:45 +02:00
Jeff Wendling	115f4559e5	satellite/orders: more efficient processing of orders by doing an indexed anti-join we're able to reduce the time to select the pending orders by over 10x on postgres. this should help us process pending orders much more quickly. it probably won't do as good a job on cockroach because it does not do an indexed anti-join and instead does a hash join after scanning the entire consumed serials table. we should either remove orders entirely or try to make that more efficient when necessary. Change-Id: I8ca0535acd21c51e74955b24c9b86d20e4f2ff9c	2020-03-18 09:03:30 +00:00
Moby von Briesen	2f991b6c56	satellite/{overlay, satellitedb}: account for `suspended` field in overlay cache Make sure that suspended nodes are treated appropriately by the overlay cache. This means we should expect the following behavior: * suspended nodes (vetted or not) should not be selected for uploading new segments * suspended nodes should be treated by the checker and repairer as "unhealthy", and should be removed upon successful repair This commit also removes unused overlay functionality. Fixes a bug with commit `8b72181a1f` where the audit reporter was automatically suspending nodes regardless of audit outcome (see test added). Tests: * updates repair tests to ensure that a suspended node is treated as unhealthy and will be removed from the pointer on successful repair * updates overlay tests for KnownUnreliableOrOffline and KnownReliable to expect suspended nodes to be considered "unreliable" * adds satellitedb test that ensures overlay.SelectStorageNodes and overlay.SelectNewStorageNodes do not include suspended nodes * adds audit reporter test to ensure that different audit outcomes result in the correct suspended/disqualified states Change-Id: I40dba67278c8e8d2ce0bcec5e0a5cb6e4ce2f561	2020-03-17 17:14:56 +00:00
Michal Niewrzal	81afbcc12e	satellite/metainfo: check bucket existence on upload and listing Initial change for checking bucket existence on satellite side for requests like BeginObject and ListObjects. This is simple implementation that is just checking bucket in DB but should be improved in future to avoid DB calls as much as possible. Part of https://storjlabs.atlassian.net/browse/USR-365 Change-Id: I9076acddc44d7dbfa7612a1c24a007de01621583	2020-03-17 15:43:22 +00:00
Jeff Wendling	7baa59753a	satellite/orders: add tests for double sending the same order Change-Id: If2fa7f035257df3b04f506f81aa8b2e0916f5033	2020-03-17 14:18:03 +00:00
Ethan	bdbf764b86	satellite/orders;overlay: Consolidate order limit storage node lookups into 1 query. https://storjlabs.atlassian.net/browse/SM-449 Change-Id: Idc62cc2978fba67cf48f7c98b27b0f996f9c58ac	2020-03-16 23:15:47 +00:00
Moby von Briesen	8b72181a1f	satellite/{audit,overlay,satellitedb}: implement unknown audit reputation and suspension * change overlay.UpdateStats to allow a third audit outcome. Now it can handle successful, failed, and unknown audits. * when "unknown audit reputation" (unknownAuditAlpha/(unknownAuditAlpha+unknownAuditBeta)) falls below the DQ threshold, put node into suspension. * when unknown audit reputation goes above the DQ threshold, remove node from suspension. * record unknown audits from audit reporter. * add basic tests around unknown audits and suspension. Change-Id: I125f06f3af52e8a29ba48dc19361821a9ff1daa1	2020-03-16 20:29:26 +00:00
Stefan Benten	52590197c2	satellite/payments: More Cleanup and Satellite command to ensure we have stripe customers (#3805)	2020-03-16 20:34:15 +01:00
Qweder93	9f84261c36	storagenode/cache heldamount added Change-Id: I7fc807789de63e8a9b8ca2018fd73bdb9e01ad0d	2020-03-16 00:28:35 +02:00
Qweder93	94c4d1e737	satellite/satellitedb/heldamount added, endpoint added Change-Id: Ife8402b89f631f65ebb5cdf5ca02e99aa9b0b3ff	2020-03-13 18:15:52 +00:00
Jeff Wendling	41887883f3	satellite/satellitedb: check indexes on migration Change-Id: I5ba7ae2b512d77c70405ce332158f12128e27eed	2020-03-13 10:45:22 +00:00
Jess G	39cb821196	satellite/overlay: rm combinedcache, fix IP naming to be network (#3798) * rm combinedcache, rm dns node lookup Change-Id: I239f07211764b097d851230d8c81900a47756e9e * excludeIPs -> excludedNetworks Change-Id: Ifa6f44ab17457cdd5aff4cd5694296867c18b179 * use lowercase var name Change-Id: I825aad2b718c71f455e747be18f8cabd02aabe55 * update Getnetwork name Change-Id: I002a1b7bc6b4ef40159c0cd2b0ef209f80a9c503 * fix comments Change-Id: Ibddf5b9ffa9d685af6c392d893db063ef18e45fa * update comments with ipv6 Change-Id: I31758b7d4979e7c27d014668f4fb532ad838cda2 Co-authored-by: Stefan Benten <mail@stefan-benten.de>	2020-03-12 11:37:57 -07:00
Jessica Grebenschikov	803e2930f4	satellite: use IP for all uplink operations, use hostname for audit and repairs My understanding is that the nodes table has the following fields: - `address` field which can be a hostname or an IP - `last_net` field that is the /24 subnet of the IP resolved from the address This PR does the following: 1) add back the `last_ip` field to the nodes table 2) for uplink operations remove the calls that the satellite makes to `lookupNodeAddress` (which makes the DNS calls to resolve the IP from the hostname) and instead use the data stored in the nodes table `last_ip` field. This means that the IP that the satellite sends to the uplink for the storage nodes could be approx 1 hr stale. In the short term this is fine, next we will be adding changes so that the storage node pushes any IP changes to the satellite in real time. 3) use the address field for repair and audit since we want them to still make DNS calls to confirm the IP is up to date 4) try to reduce confusion about hostname, ip, subnet, and address in the code base Change-Id: I96ce0d8bb78303f82483d0701bc79544b74057ac	2020-03-11 09:11:40 -07:00
Moby von Briesen	1baf1bd249	satellite/satellitedb: Add index on num_healthy_pieces column in injuredsegments table We missed this in the migration that added the num_healthy_pieces column. It exists in dbx, but not on the actual satellite table. Change-Id: If16b5ec2325d56406250298531b3285215188bf3	2020-03-10 16:59:35 +00:00
paul cannon	79553059cb	satellite/repair: put irreparable segments in irreparableDB Previously, we were simply discarding rows from the repair queue when they couldn't be repaired (either because the overlay said too many nodes were down, or because we failed to download enough pieces). Now, such segments will be put into the irreparableDB for further and (hopefully) more focused attention. This change also better differentiates some error cases from Repair() for monitoring purposes. Change-Id: I82a52a6da50c948ddd651048e2a39cb4b1e6df5c	2020-03-09 21:45:16 +00:00
Egon Elbre	0675413f7a	satellite/satellitedb: increase migrate test timeout Change-Id: I789ea22ad463a6c31737e959ec54941b66830188	2020-03-09 14:30:50 +02:00
Bill Thorp	e99e675fb1	satellite/satellitedb: use time zones with all timestamps The migration was broken into one migration per table to reduce table locking and reduce the chances of failure due to SQL timeouts. Of the 14 fields that lacked time zones, only the 3 named `interval_start` seemed to have non-UTC data in them. These fields are fixed in the migration by removing the +00 and adding AT TIME ZONE current_setting('TIMEZONE'). Fields with good data are migrated by adding AT TIME ZONE 'UTC'. Note that postgres's timezone() is different than cockroach's timezone() so AT TIME ZONE is used. https://storjlabs.atlassian.net/browse/SM-104 Change-Id: I410f2f1d7c11b143f17844347f37e6f4b1e70fce	2020-03-05 21:11:25 +00:00
Jennifer Johnson	1c1750e6be	removes bandwidth limiting On satellite, remove all references to free_bandwidth column in nodes table. On storage node, remove references to AllocatedBandwidth and MinimumBandwidth and mark as deprecated. Protobuf message, NodeCapacity, is left intact for backwards compatibility. Once this is released to all satellites, we can drop the column from the DB. Change-Id: I2ff6c6537fc9008a0c5588e951afea58ede85838	2020-03-04 14:04:00 +00:00
Egon Elbre	5f2ca0338b	satellite/satellitedb: fix err and close order Change-Id: Ied927275853c4cf4a8ccb500048d50545f6c6efe	2020-03-04 09:05:22 +00:00
Moby von Briesen	f495544c56	satellite/satellitedb/dbx: add fields to node table for placing nodes into suspended mode for too many unknown-error audits Change-Id: Iac9a619e5c08377de87ffdf4acdd0155027f5eb3	2020-03-03 03:30:59 +00:00
Jeff Wendling	1db087cfba	satellite/satellitedb: migration to create tables for compensation these tables are used in future commits with respect to the new storagenode payments code. if we create them now, it will make backfilling them with historical data easier. Change-Id: I3c08c9770ec5b2baa38b4f2fd18c2f07746a61c2	2020-02-27 17:34:50 +00:00
Moby von Briesen	4e5a7f13c7	satellite/repair/queue: Prioritize selection of items off repair queue by segment health Add a column to the repair queue table in the satellite db for healthy piece count. When an item is selected from the repair queue, the least durable segment that has not been attempted in the past hour should be selected first. This prevents our repairer from getting stuck doing work on segments that are close to the repair threshold while allowing segments that are more unhealthy to degrade further. The migration also clears the repair queue so that the migration runs quickly and we can properly account for segment health in future repair work. We do not select items off the repair queue that have been attempted in the past six hours. This was changed from one hour to allow us time to try a wider variety of segments when the repair queue is very large. Change-Id: Iaf183f1e5fd45cd792a52e3563a3e43a2b9f410b	2020-02-26 09:54:16 -05:00
Jeff Wendling	f671eb2beb	satellite/satellitedb: use queue for orders to get back fast billing This change adds two new tables to process orders as fast as we used to but in an asynchronous manner and with hopefully less storage usage. This should help scale on cockroach, but limits us to one worker. It lays the groundwork for the order processing pipeline to be queue rather than database driven. For more details, see the added fast billing changes blueprint. It also fixes the orders db so that all the timestamps that are passed to columns that do not contain a time zone are converted to UTC at the last possible opportunity, making it less likely to use the APIs incorrectly. We really should migrate to include timezones on all of our timestamp columns. Change-Id: Ibfda8e7a3d5972b7798fb61b31ff56419c64ea35	2020-02-24 17:07:07 +00:00
Qweder93	dca6fcbe28	satellite/payments/stripecoinpayments: credits added to invoice calculations Change-Id: I6d3f5244a46f8945d2703af39ced333940db34e9	2020-02-24 16:48:27 +00:00
Yaroslav Vorobiov	f185adcf7c	satellite/payments: fix projects list pagination Change-Id: I342e69a17be34a503c1e0cef18ee009f1921fcd4	2020-02-21 19:37:11 +02:00


476 Commits