storj

Author	SHA1	Message	Date
Jeff Wendling	376547c33c	satellite/compensation: smaller txns for RecordPeriod cockroach is having problems with huge transactions and having them complete before timeouts or whatever, so do smaller transactions. because we can have partial recording of payments which are not unique, we have to do a thing where we read and check if it already exists before writing. this is not concurrency safe. Change-Id: Ia7d59499a43ce6d70cb2a23754edbdd1b643ef1a	2021-03-02 20:14:25 +00:00
Cameron Ayer	aeac6264cd	sallite/satellitedb: add metric stray_nodes_dq_count Add metric so we can see how many nodes are DQd due to this. Change-Id: Ie4bdd1375fb9bd948af14fed9a2962b783b6a526	2021-03-01 21:06:36 +00:00
Natalie Villasana	856db68fd9	satellite/gracefulexit: extend GE data cleanup to include exit_progress The new 'consistency ge-cleanup-orphaned-data' cli command deleted orphaned transfer queue items, but not entries in the graceful_exit_progress table. This will delete orphaned entries from the exit progress table too. Change-Id: I5f927aac1f258490678deaf179be92ccfe10fcd8	2021-03-01 15:52:43 +00:00
Cameron Ayer	411a7ad0bc	satellite/satellitedb: drop uptime_reputation_alpha and uptime_reputation_beta from nodes table Change-Id: Ib46e783bf1a5c036394b4cac281382d0380bb1be	2021-03-01 15:30:51 +00:00
Michał Niewrzał	1af9400a23	satellite/satellitedb/dbx: remove unused methods Turns out that many methods generated with dbx where not used at all. Lets remove them. As a next step we can think about dropping tables like: * user_credit * offer Change-Id: Id6cda81a701348db2a6b8b26daa22ae9c4f87cb4	2021-02-25 13:50:50 +00:00
Michał Niewrzał	d995fb497f	Merge remote-tracking branch 'origin/main' into multipart-upload Change-Id: I367da03351ab80f7343332420490dde9282aa47a	2021-02-23 12:31:31 +01:00
Egon Elbre	0178ec7771	satellite/satellitedb: clearer testdata version parsing Change-Id: I93a23625b30472e826b962dab31ff3e1081b1b8f	2021-02-23 10:22:19 +02:00
Egon Elbre	32afbd0d15	satellite/satellitedb: faster test database setup Pregenerate the database schema we should use for most tests. Currently, Cockroach is slow with regards to migration and it's better if it happens in as few transactions as possible. This reduces test time from ~21min to ~15min. Change-Id: Ife8117053e6b9ecf3c93fe63677edf15d4d7c254	2021-02-22 21:13:00 +02:00
Cameron Ayer	549033f2e6	satellite/satellitedb: don't include DQd and exited nodes in DQStrayNodes Don't update DQ time of already DQd nodes. Don't DQ nodes who exited. Change-Id: I4528a9ba9f8e278987165ad337a9b34dadb9788b	2021-02-19 15:12:30 -05:00
Egon Elbre	1137620baf	satellite/satellitedb: move tests to their domains Testing interfaces is slightly clearer when it's in the package needing the database rather than each individual implementation. Change-Id: I10334c214a205f7e510b939b4359a2214c4e060a	2021-02-19 17:29:15 +02:00
Egon Elbre	1cb6376daa	satellite/metainfo: remove BucketsDB.ListAllBuckets The ListAllBuckets implementation was buggy, remove it altogether. Change-Id: Id457ba5f4d793156af3fc2071f74ce1be17ba804	2021-02-19 10:59:41 +02:00
JT Olio	b2ed7edd30	cmd/satellite: restore-trash parallel workers Change-Id: Ic7466b21c20bda334e7ba4268a494e96b6528ac1	2021-02-18 19:11:19 +02:00
JT Olio	3ae3389ddc	cmd/satellite: restore-trash command Change-Id: I80fc932c12147692d49cde277784871ac611fcad	2021-02-18 09:19:22 -07:00
Michał Niewrzał	12402eb729	Merge remote-tracking branch 'origin/main' into multipart-upload Change-Id: I38adf8218c1415c7ea1910f8bd6bed13544b0f03	2021-02-17 08:50:38 +01:00
Malcolm Bouzi	4b2e46a0c9	satellite/satellitedb: add employee size column to users Change-Id: I21f5904331f0ceb92f494729c22a52c256a69163	2021-02-12 09:15:15 -05:00
Egon Elbre	63c7f8b7fc	satellitedb/satellitedbtest: creating a database shouldn't auto-migrate Some tests need to control the migration progress manually. Change-Id: I776c69b6d56dc35c7cb88688c4b827d6bba4b7ac	2021-02-11 14:21:49 +02:00
Michał Niewrzał	908a96ae30	Merge remote-tracking branch 'origin/main' into multipart-upload Change-Id: I075aaff42ca3f5dc538356cedfccd5939c75e791	2021-02-11 11:48:23 +01:00
Yaroslav Vorobiov	966535e9de	{storagenode,satellite}/nodeoperator: add wallet features Change-Id: Iac7eb40a52b8fddcc573aebaad2e3a30a10cded9	2021-02-08 22:09:45 +02:00
Yingrong Zhao	3b49d3cddf	satellite: remove referral program related code This PR removes all back-end related referral program code including the marketing portal. We will have a separate PR for front-end code and database migration to drop `offers` and `usercredits` table Change-Id: If59f952cddfe0558a7dc03a0eac7cc1081517f88	2021-02-08 13:52:50 +00:00
Jeff Wendling	e114cfe86d	satellite/satellitedb: fix broken query Change-Id: I6d412a673d75264bf9751c6f15b1fb0ab94e1394	2021-02-04 16:00:51 -05:00
Malcolm Bouzi	db3a3088f9	satellite/satellitedb: add professional user fields to db interface (#4034 )	2021-02-04 10:00:15 -05:00
Michał Niewrzał	9a60011774	Merge remote-tracking branch 'origin/main' into multipart-upload Change-Id: Ia90f29be432e207c4125f7f955c912978eabe59a	2021-02-04 09:38:08 +01:00
JT Olio	c8e42df2f8	satellitedb: reorder migrations 140-144 These changes are independently tracked on https://github.com/storj/storj/tree/jt/migration-reorder The point of this is to make the distributed column migration, needed for SNO invoice generation, the very next one, so we can release it as a point release. Change-Id: I26e1c03629c4f079b9ad12485e2b71a715d82b3b	2021-02-03 18:28:42 +00:00
Ethan	9506b67ca2	satellite/projectaccounting: Improve performance of ProjectAccounting.getBuckets Limit bucket name lookup to date range of the calling methods since we only need distinct bucket names for that time period. Adds new index and removes an index specific to project ID since it is no longer needed. Change-Id: Ic07bbfb1c32280e0c0e39f8da020b284e1e5d974	2021-02-03 14:05:12 +00:00
Cameron Ayer	a17934cb51	satellite/satellitedb: remove reference to uptime counts Change-Id: I26ac540b720a8ba5d6ca44526900228352dcaf4e	2021-02-02 14:51:27 -05:00
Jeff Wendling	759bdd6794	satellite/compensation: add total-paid and total-distributed to invoices Change-Id: Id4414867917cbf8aad77795f764d6381e88d9a34	2021-02-02 18:14:31 +00:00
Kaloyan Raev	6f3d0c4ad5	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum satellite/repair/repair_test.go satellite/repair/repairer/segments.go Change-Id: Ie51a56878bee84ad9f2d31135f984881a882e906	2021-02-02 19:19:04 +02:00
Ivan Fraixedes	cc0d88f9c3	satellite/satellitedb: Fix GE flaky test Fix an issue due to copy-paste problem that made that the Graceful Exit test to be flaky. The test uses a time created at the beginning of the test for avoiding to get undeterministic time differences due to the fact of the response time variation by the DB queries, however some part of the test were using a current time rather than this base time, so they have been addressed. Change-Id: I4786f06209e041269875c07798a44c2850478438	2021-02-02 13:24:42 +01:00
Ivan Fraixedes	d93944c57b	satellite/orders: Delete unused methods & DB tables Delete satellite order methods and DB tables which aren't used anymore after we have done a refactoring on the orders to stuck bucket information in the orders' encrypted metadata. There are also configuration parameters and a satellite chore that aren't needed anymore after the orders refactoring. Change-Id: Ida3682b95921df70792284b42c96d2508bf8ca9c	2021-02-01 18:01:29 +00:00
Ivan Fraixedes	076804eac9	cmd/satellite: Add command for GE data cleanup Add a command to the satellite for cleaning up the Graceful Exit (a.k.a GE) transfer queue items of nodes that have exited. The commit adds to the GE satellite DB a couple of new methods, and its corresponding test, for performing the operations of the new command. Change-Id: I29a572a59689d63b24990ac13c52e76d65aaa917	2021-02-01 17:30:58 +00:00
Jeff Wendling	1cf3d89a56	satellite/satellitedb: add distributed column and migration using redash i manually checked that the only times the sum of the payments does not match the paid column is for 2020-12 and if it does not match then there are no payments. Change-Id: I71ce0571de7e38e21548d7d6757b25abc3bfa781	2021-02-01 16:33:14 +00:00
Natalie Villasana	91bd4191dd	satellite/accounting: add rollup archiver chore The rollup archiver chore moves bucket bandwidth rollups and storagenode rollups that are older than a given duration to two new archive tables. Change-Id: I1626a3742ad4271bc744fbcefa6355a29d49c6a5	2021-02-01 09:29:54 -05:00
Kaloyan Raev	d0612199f0	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum satellite/metainfo/config.go satellite/metainfo/metainfo_test.go Change-Id: I95cf3c1d020a7918795b5eec63f36112fdb86749	2021-02-01 14:32:12 +02:00
Egon Elbre	54e01d37f9	satellite/overlay: add DownloadSelectionCache Change-Id: Ic0779280172325f8d03f55a2e9673722f72bdd44	2021-01-29 16:47:06 +02:00
Kaloyan Raev	4d32bdaefb	satellite/satellitedb: drop bucket_metainfos_name_project_id_key index This index is obsolete and duplicates a similiar (project_id, name) index on the same table. Moreover, it might confuse CockroachDB which of the two index to use, which may might affect DB performance. Change-Id: If8d1df8347714942cea9dca82864ba5f4973bed3	2021-01-28 09:06:22 +02:00
Jeff Wendling	ca86820b8b	satellite/snopayouts: use dbx + some refactorings Change-Id: I8f3973d2377f071bcea2f61e0fc21d913ffa7ea8	2021-01-27 17:59:16 +00:00
Jeff Wendling	66e15fb7f1	satellite/compensation: remove ytd paid amounts they aren't right and we aren't using them. Change-Id: I5ca024e38d055696696886278863e941b5bc51bf	2021-01-27 17:31:01 +00:00
Malcolm Bouzi	24d60384c5	satellite/satellitedb: add columns for professional users (#4028 ) Co-authored-by: Egon Elbre <egonelbre@gmail.com>	2021-01-26 11:38:53 -05:00
Isaac Hess	c92bda7e75	tally monkit: change location to monitor piecesize When we observed the value for total piecesizes stored in the network, we were doing it after converting them to byte-hours, rather than using the actual piece sizes. This fixes that issue. Change-Id: I1564d21b519f70eb59f298d97dbd777baf127723	2021-01-26 15:37:02 +00:00
Moby von Briesen	0a48071854	satellite/console: Add pagination fields for ListProjectsByOwnerID Add ProjectsCursor type for pagination Add PageCount, CurrentPage, and TotalCount ProjectsPage This allows us to mimic the logic of GetBucketTotals and the implementation of BucketUsages in graphql for the new ProjectsByOwnerID functionality. Change-Id: I4e1613859085db65971b44fcacd9813d9ddad8eb	2021-01-20 16:15:29 +00:00
Kaloyan Raev	c24ada7114	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum Change-Id: Icf7c029e9d800e5f6a9fdd208c36f28e05468690	2021-01-20 17:35:57 +02:00
Cameron Ayer	d14607a5f7	satellite/{contact,nodestats,overlay,satellitedb}: remove references to total_uptime_count and uptime_success_count columns Change-Id: I1f92022909bc564e9b1e31bf937fdfe7c16554de	2021-01-19 15:43:02 -05:00
Cameron Ayer	75d828200c	private,satellite: add chore to dq stray nodes Full scope: private/testplanet,satellite/{overlay,satellitedb} Description: In most cases, downtime tracking with audits will eventually lead to DQ for nodes who are unresponsive. However, if a stray node has no pieces, it will not be audited and will thus never be disqualified. This chore will check for nodes who have not successfully been contacted in some set time and DQ them. There are some new flags for toggling DQ of stray nodes and the timeframes for running the chore and how long nodes can go without contact. Change-Id: Ic9d41fdbf214736798925e728245180fb3c55615	2021-01-19 14:21:56 -05:00
Qweder93	6ba8f6c8a9	storanode, satellite: payout renamed to payouts, expected estimation payouts added, console api for audits reworked Change-Id: I4aa5e99bffaa87d0a800a429a4c83aa498ad4b7b	2021-01-18 10:56:03 +00:00
Ivan Fraixedes	678b07b314	satellite: Fix typos & code formatting Fix some typos in the doc comments and readdress some code formatting applied automatically. Change-Id: I605b4eff2e7c6c58227ecf16be4c1d26f5322eb6	2021-01-15 16:40:26 +01:00
Moby von Briesen	c24f84914c	satellite/console: Add ability to list projects by owner ID Listing projects by owner ID also includes the number of members in each project. Change-Id: I53a09674b60c199ef378943851bb0f164e92e4e2	2021-01-15 14:22:22 +00:00
Kaloyan Raev	6dff40f5c5	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum satellite/metainfo/metainfo.go Change-Id: Ib5c49f3c911c58319855a171f9ce73657da976d9	2021-01-14 14:33:59 +02:00
Cameron Ayer	0184d33e96	satellite/satellitedb: set default 0 on uptime columns This is the first step in the removal of uptime columns on the nodes table. These columns are no longer used: uptime_success_count total_uptime_count uptime_reputation_alpha uptime_reputation_beta In order to avoid breaking backwards compatibility, we need to remove all references to these columns before removing the columns themselves from the database. However, since uptime_success_count and total_uptime_count are NOT NULLABLE, we can't remove them from the insert statements in the overlay. So we can't remove the columns because of the references, and we can't remove the references because the columns can't be null. What a pickle. To remedy this, we will set a default on the columns. Then we should be able to remove them from the insert statements Change-Id: I75f6c56fb7897835bbf29869f86f39de1d9dd345	2021-01-12 17:44:37 +00:00
Cameron Ayer	0403e99a5b	satellite/{overlay,satellitedb}: remove unused methods for old downtime tracking GetSuccessfulNodeNotCheckedInSince and GetOfflineNodesLimited are overlay methods which were only used by the previous downtime tracking system which has been removed. These methods should also be removed. Change-Id: Idb829d742e1f987e095604423fff656fe581183e	2021-01-11 15:21:28 +00:00
Michał Niewrzał	ec88d21a3c	Merge 'main' branch. Change-Id: I6e8162d1a6caf75e89c9f9c9f9522730aebf83ae	2021-01-11 10:26:58 +01:00
Moby von Briesen	6e2ef3b9ee	Revert "satellite/satellitedb: Do not consider nodes with offline_suspended as reputable." This reverts commit `e24262c2c9`. Change-Id: I287deb2e52d03bbd698ed055f0f216b0b5bf2798	2021-01-04 14:28:37 +00:00
Michał Niewrzał	ad3e3a38c5	Merge 'main' branch Change-Id: Ia0db1b1f9ef3e0671d3f2208881b0abc3064e200	2021-01-04 12:13:45 +01:00
Moby von Briesen	825dc71227	satellite/{overlay, satellitedb}: Refactor audit history * Separate audit history interface into its own file in the overlay package * Add overlay.AuditHistory struct so that internalpb.AuditHistory is only used from within the database layer * Add overlay.GetAuditHistory function for features that will require access to detailed audit history information * Do not return full audit history from UpdateAuditHistory - callers to that function only need to know the online score and whether a full tracking period has been completed * Move audit history tests out of satellite/satellitedb, since they are independent of database implementation Change-Id: I35b0c4ac23bbaabd80624f8a9631c3cb1a1f33bd	2020-12-29 18:50:22 +00:00
Moby von Briesen	85ae13f11d	satellite/satellitedb: Drop nodes_offline_times table. Now that the deprecated downtime tracking service is removed (`3fc76f4ffe`), we can safely remove the nodes_offline_times table. Change-Id: Ia7c6efe32ba104dff5a830af5f2beee3337eefe5	2020-12-29 18:17:50 +00:00
Moby von Briesen	e24262c2c9	satellite/satellitedb: Do not consider nodes with offline_suspended as reputable. Nodes which are offline_suspended will no longer be considered for new uploads. The current threshold that enters a node into offline suspension is 0.6. Disqualification for offline suspension is still disabled. Change-Id: I0da9abf47167dd5bf6bb21e0bc2186e003e38d1a	2020-12-29 17:59:09 +00:00
Ethan Adams	6070018021	satellite/overlay: use AS OF SYSTEM TIME with Cockroach Query nodes table using AS OF SYSTEM TIME '-10s' (by default) when on CRDB to alleviate contention on the nodes table and minimize CRDB retries. Queries for standard uploads are already cached, and node lookups for graceful exit uploads has retry logic so it isn't necessary for the nodes returned to be current.	2020-12-22 21:07:07 +02:00
Michal Niewrzal	9a8959d429	Merge 'master' branch Change-Id: Iba69ea73ca4d3f1cd4ae94243eaaae033c5324e8	2020-12-22 14:55:57 +01:00
Ethan Adams	563197c628	satellite/overlay: Add index on nodes table (#4012 ) satellite/accounting: Add index for project_id on bucket_storage_tallies	2020-12-21 12:48:48 -05:00
Ethan Adams	9b52283570	satellite/accounting: Add index for project_id on bucket_storage_tallies (#4010 ) Change-Id: I47ab2d1e24f94307c3383c497cffe2a150fa8ab7	2020-12-21 11:42:00 -05:00
Ethan Adams	6e501898c3	satellite/accounting: Performance improvements to getNodeIds used by GetBandwidthSince (#4009 )	2020-12-21 16:37:01 +01:00
Jessica Grebenschikov	da0327c9b7	satellite/dbcleanup: remove expired serial chore Change-Id: Ib71d41eb6679d6435e5bc10b6244dac66380a74e	2020-12-18 09:36:28 -08:00
Michal Niewrzal	2111740236	Merge 'master' branch Change-Id: Ib73af0ff3ce0e9a1547b0b9fc55bf88704f6f394	2020-12-18 09:13:24 +01:00
Cameron Ayer	28eaae66af	satellite/satellitedb: drop num_healthy_pieces column from injuredsegments This column is no longer used as it has been replaced by the segment_health column. Change-Id: I6b4df89cd4f994d8418976f88e8c5f57615f8115	2020-12-17 20:17:08 +00:00
Michal Niewrzal	70ba4deea9	satellite/repair/checker: adjust irreparable part of repair checker Change-Id: I0732104a97ba18a5359de3966cd692677a0ff790	2020-12-17 14:11:22 +00:00
Kaloyan Raev	9aa61245d0	satellite/audits: migrate to metabase Change-Id: I480c941820c5b0bd3af0539d92b548189211acb2	2020-12-17 14:38:48 +02:00
Michal Niewrzal	57f374af24	Merge 'master' branch Change-Id: Idf6b10ea7ca94e4d232e6a3b6a38ef2e646ba197	2020-12-15 08:26:53 +01:00
Stefan Benten	9fe477899b	satellite/satellitedb: add lint ignore rule to support staticcheck 2020.2 staticcheck 2020.2 is not liking our dbx files, so we need to ignore them. Change-Id: I6becc3619bb088473f9776d0878ce240d4935936	2020-12-14 21:16:31 +00:00
Jessica Grebenschikov	0649d2b930	satellite/repair: improve contention for injuredsegments table on CRDB We migrated satelliteDB off of Postgres and over to CockroachDB (crdb), but there was way too high contention for the injuredsegments table so we had to rollback to Postgres for the repair queue. A couple things contributed to this problem: 1) crdb doesn't support `FOR UPDATE SKIP LOCKED` 2) the original crdb Select query was doing 2 full table scans and not using any indexes 3) the SLC Satellite (where we were doing the migration) was running 48 repair worker processes, each of which run up to 5 goroutines which all are trying to select out of the repair queue and this was causing a ton of contention. The changes in this PR should help to reduce that contention and improve performance on CRDB. The changes include: 1) Use an update/set query instead of select/update to capitalize on the new `UPDATE` implicit row locking ability in CRDB. - Details: As of CRDB v20.2.2, there is implicit row locking with update/set queries (contention reduction and performance gains are described in this blog post: https://www.cockroachlabs.com/blog/when-and-why-to-use-select-for-update-in-cockroachdb/). 2) Remove the `ORDER BY` clause since this was causing a full table scan and also prevented the use of the row locking capability. - While long term it is very important to `ORDER BY segment_health`, the change here is only suppose to be a temporary bandaid to get us migrated over to CRDB quickly. Since segment_health has been set to infinity for some time now (re: https://review.dev.storj.io/c/storj/storj/+/3224), it seems like it might be ok to continue not making use of this for the short term. However, long term this needs to be fixed with a redesign of the repair workers, possible in the trusted delegated repair design (https://review.dev.storj.io/c/storj/storj/+/2602) or something similar to what is recommended here on how to implement a queue on CRDB https://dev.to/ajwerner/quick-and-easy-exactly-once-distributed-work-queues-using-serializable-transactions-jdp, or migrate to rabbit MQ priority queue or something similar.. This PRs improved query uses the index to avoid full scans and also locks the row its going to update and CRDB retries for us if there are any lock errors. Change-Id: Id29faad2186627872fbeb0f31536c4f55f860f23	2020-12-10 09:51:26 -08:00
Michal Niewrzal	b3acc1101a	Merge 'master' branch Change-Id: Iee99400c7095770e61cde94b3b2c8eb0ddec463d	2020-12-10 15:42:52 +01:00
Michal Niewrzal	c2a97aeb14	satellite/satellitedb: add ListAllBuckets method We need to be able to list all buckets in DB without knowing project ID. This method will be used to list buckets for metainfo loop implementation based on metabase. Change-Id: Iac75af0eee4f31e80a15577575a8249cbca787b2	2020-12-10 14:19:27 +00:00
Michal Niewrzal	218bbeaffa	Merge 'master' branch Change-Id: Ica5c25607a951076dd9f77e35e308062f71ce3f0	2020-12-07 15:05:52 +01:00
Stefan Benten	494bd5db81	all: golangci-lint v1.33.0 fixes (#3985 )	2020-12-05 17:01:42 +01:00
Ethan Adams	f90ea10a4a	Allow for DB application names per process. (#3983 )	2020-12-04 11:24:39 +01:00
Moby von Briesen	3fc76f4ffe	satellite/downtime: Remove deprecated downtime tracking service. We are no longer planning on implementing downtime penalization using the method described in docs/blueprints/archive/storage-node-downtime-tracking-deprecated.md. Now, we are implementing the design described in docs/blueprints/storage-node-downtime-tracking-with-audits.md. This change removes the downtime estimation chores from the satellite core as well as the package satellite/downtime. A future change will remove the database table. Change-Id: I1a1d3cf9dceeba36255d25243294865b89925518	2020-12-02 15:16:13 -05:00
JT Olio	1728c3a992	satellite/dbx: standardize on assignment Change-Id: I8f87bc8391e765e4480b0590d92d3601248e1f93	2020-12-01 16:10:18 +00:00
JT Olio	70b91aac54	satellitedb: remove cruft caused by https://review.dev.storj.io/c/storj/storj/+/3223 Change-Id: I198bb2f869cc7177b9ecafdd8932bbf2b58be5b8	2020-12-01 00:16:26 +00:00
Michal Niewrzal	5a7bc9657d	Merge 'master' branch Change-Id: If583132a821274dc4b78cf5f72b853ba8460c619	2020-11-30 12:57:22 +01:00
Egon Elbre	f456d7ce03	satellite: remove implementation detail from DB interface Which database access and how it internally does migrations is an implementation detail and does not belong in the requirements interface. Change-Id: Ia4a6994f39470063a96a8e5f3a1bd27aa79fe5cd	2020-11-30 13:29:20 +02:00
Egon Elbre	28ea63be92	satellite/repair: avoid TestDBAccess Change-Id: I34adb58cd67fba5917032f2f328d75b1c4afdbbf	2020-11-30 13:29:08 +02:00
JT Olio	71e11b27f3	satellite/dbx: only retry with cockroach Change-Id: Id3630c26dbfda36dcbece2849e2353d5ab2882af	2020-11-29 18:10:07 -07:00
JT Olio	bd23d12bb9	satellite/dbx: add cockroach retries for other QueryContext operations Change-Id: Ia30fbba55c926892702fa96fb9dd01b75347d351	2020-11-29 18:09:56 -07:00
JT Olio	ea2f39ca7f	satellite/dbx: add retries for QueryRowContext-based operations Change-Id: Ie2527b673dd4ce5250cf5c0cbf8f14921262f665	2020-11-29 18:09:46 -07:00
JT Olio	d3b0691bbd	satellite/dbx: import dbx templates these are unchanged from storj.io/dbx. we're importing them because in a later commit we will change them, and it'd be nice to see that diff as a separate commit. Change-Id: I8315130ed6bab397bd65b9a1a90c29d130b8c02d	2020-11-29 18:09:33 -07:00
JT Olio	5d8a67a4f7	satellitedb: retry GetBandwidthSince on cockroach Change-Id: I2bf20f3a19e7f3af97630d8a679410feba70661e	2020-11-29 16:36:15 -07:00
Ethan	5dc013d3bd	satellite/overlay: Add retry to all selects in overlaycache Change-Id: I0356d71a35701f8e0ca04a34b2bb2aea666c1394	2020-11-29 16:46:57 -05:00
JT Olio	6bce907cb0	satellite: try to stream rollups to aggregation function to use less memory this change tries really hard to never have all of the storage node rollups in memory at the same time, up until the rollups are actually getting summed together. Change-Id: If67f49e7d71106798d996a6850b3e48671bd9e18	2020-11-29 10:26:32 -07:00
JT Olio	6aae21541f	satellitedb: do saverollup in batches Change-Id: I78278a192cba60541eee2986f54a88d5a479bd3e	2020-11-28 19:26:46 -07:00
JT Olio	0ba516d405	satellite: support pointing db components at different databases the immediate need is to be able to move the repair queue back out of cockroach if we can't save it. Change-Id: If26001a4e6804f6bb8713b4aee7e4fd6254dc326	2020-11-28 18:39:16 +00:00
Michal Niewrzal	efaba85c73	Merge 'master' branch Change-Id: I3520b3e327732929f5167b07a15ddb92d26cae1b	2020-11-24 10:03:20 +01:00
Egon Elbre	55d5e1fd7d	satellite/orders: ensure that expired deletion doesn't stall Add checks to ensure that when somebody uses empty options, the deletion doesn't loop infinitely. Change-Id: I1738fb1e7e1f8efbbb954c491cb6489f7bcdc2db	2020-11-23 14:52:40 +02:00
Ethan	2b92bba563	satellite/satellitedb/orders: Handle serial_numbers deletes in smaller increments on CRDB CRDB doesn't like large deletes. While testing in the POC environment we found that deletes on the serial_numbers table could take hours. This change limits deletes to 1000 at a time (configurable) to avoid blocking other queries. Change-Id: I08455e25db1574579dd4d7b7125a08e9c913dff1	2020-11-20 13:44:52 +00:00
Moby von Briesen	a8b66dce17	satellite/accounting: account for old orders that can be submitted in satellite rollup With the new phase 3 order submission, orders can be added to the storage and bandwidth rollup tables at timestamps before the most recent rollup was run. This change shifts the start time of each new rollup window to account for any unexpired orders that might have been added since the previous rollup. A satellitedb migration is necessary to allow upserts in the accounting_rollups table when entries with identical node_ids and start_times are inserted. Change-Id: Ib3022081f4d6be60cfec8430b45867ad3c01da63	2020-11-18 14:46:00 -05:00
Moby von Briesen	0ec685b173	satellite/{satellitedb, repair/{queue, checker}}: Use new column "segmentHealth" instead of "numHealthy" in injured segments queue We plan to add support for a new Reed-Solomon scheme soon, but our repair queue orders segments by least number of healthy pieces first. With a second RS scheme, fewer healthy pieces will not necessarily correlate to lower health. This change just adds the new column in a migration. A separate change will add the new health function. Right now, since we only support one RS scheme, behavior will not change. Number of healthy pieces is being inserted as "segment health" until the new health function is merged. Segment health is calculated with a new priority function created in commit `3e5640359`. In order to use the function, a new config value is added, called NodeFailureRate, representing the approximate probability of any individual node going down in the duration of one checker run. Change-Id: I51c4202203faf52528d923befbe886dbf86d02f2	2020-11-16 21:18:09 +00:00
Michal Niewrzal	7c384c8293	Merge 'master' branch Change-Id: I1eefd5a56449e577820977d61fa4a22bdd4fc230	2020-11-16 10:02:54 +01:00
Jessica Grebenschikov	f558cc825e	satellite/orders: add storagenode_bw_phase2 table and dont delete tallies for longer It turns out we need to make 2 more changes in order for the new order submission phase 3 to get deployed. This PR makes 2 changes: 1) when the rollup service deletes tallies, we now keep tallies around until orders expire (vs 1 day like before). 2) the reported rollup chore will now write the storagenode_bandwidth_rollups to a new table _phase2 as an intermediary step so it doesn't conflict with phase 3 order settlement. These changes need to be deployed for 2 days before we can turn on phase 3 of the new orders settlement workflow. Change-Id: Iafbff577ba7d55f8f17b7db857311b2ce799de60	2020-11-13 17:15:24 +00:00
Michal Niewrzal	7dde184cb5	Merge 'master' branch Change-Id: I6070089128a150a4dd501bbc62a1f8b394aa643e	2020-11-10 11:58:59 +00:00
Cameron Ayer	dc67ce74c9	satellite: remove IsUp field from overlay.UpdateRequest With the new overlay.AuditOutcome type for offline audits, the IsUp field is redundant. If AuditOutcome != AuditOffline, then the node is online. In addition to removing the field itself, other changes needed to be made regarding the relationship between 'uptime' and 'audits'. Previously, uptime and audit outcome were completely separated. For example, it was possible to update a node's stats to give it a successful/failed/unknown audit while simultaneously indicating that the node was offline by setting IsUp to false. This is no longer possible under this changeset. Some test which did this have been changed slightly in order to pass. Also add new benchmarks for UpdateStats and BatchUpdateStats with different audit outcomes. Change-Id: I998892d615850b1f138dc62f9b050f720ea0926b	2020-11-02 15:34:17 -05:00
Egon Elbre	7183dca6cb	all: fix defers in loop defer should not be called in a loop. Change-Id: Ifa5a25a56402814b974bcdfb0c2fce56df8e7e59	2020-11-02 15:06:38 +02:00
Egon Elbre	716068a1e0	Merge branch 'master'. Change-Id: Ic14325edc291573582dce0cea3e04991a820b48b	2020-11-02 13:02:01 +02:00
Egon Elbre	11338e9beb	satellite/internalpb: move audithistory.pb Change-Id: I8eee84d49ed90459168ddaf04ae57f790c2a22c4	2020-10-30 15:30:11 +02:00
Egon Elbre	7ce372c686	satellite/internalpb: add inspectors Change-Id: Ib688e43d05135c0c31ae95df533f1e4535ea396a	2020-10-30 13:28:17 +02:00
Egon Elbre	004e610d0f	satellite/internalpb: move datarepair.pb to internal Change-Id: If901d9ff4e5ee6715b963eeeb46513a602a44b3d	2020-10-30 13:28:14 +02:00
Kaloyan Raev	b8c6fb764c	satellite/metainfo: add metabase to metainfo service Change-Id: Ie3ff238b138d8a57d99e32b13f7a71aa624d53e3	2020-10-30 12:49:47 +02:00
Egon Elbre	caefde6b32	private/{dbutil,tagsql}: pass ctx to database opening Database opening usually dial and hence we should pass ctx to them. Change-Id: Iaa2875981570d83e65be3710f841cf30349f807b	2020-10-29 10:51:29 +00:00
Egon Elbre	e3985799a1	storage/{cockroachkv,postgreskv}: add ctx to opening Database opening usually dial and hence we should pass ctx to them. Change-Id: Iecf41241aaa94d54506cbc80b0e53449848d8819	2020-10-29 10:49:08 +00:00
Egon Elbre	9b2e00a38b	satellite: pass ctx into satellitedb.Open Opening a database requires ctx, this is first step to passing ctx to the appropriate level. Change-Id: Ic303e69f868ef3449ae36377937a29670cf635e2	2020-10-29 06:38:37 +00:00
Cameron Ayer	bb7be23115	satellite/{audit,overlay,satellitedb}: enable reporting offline audits - Remove flag for switching off offline audit reporting. - Change the overlay method used from UpdateUptime to BatchUpdateStats, as this is where the new online scoring is done. - Add a new overlay.AuditOutcome type: AuditOffline. Since we now use the same method to record offline audits as success, failure, and unknown, we need to distinguish offline audits from the rest. Change-Id: Iadcfe10cf13466fa1a1c2dc542db8994a6423355	2020-10-27 10:44:46 +00:00
Ethan	9a29ec5b3e	Add index to graceful_exit_transfer_queue table This fixes a slow query that was taking up to 4 seconds in production SELECT node_id, path, piece_num, root_piece_id, durability_ratio, queued_at, requested_at, last_failed_at, last_failed_code, failed_count, finished_at, order_limit_send_count FROM graceful_exit_transfer_queue WHERE node_id = '[redacted]' AND finished_at is NULL AND last_failed_at is NULL ORDER BY durability_ratio asc, queued_at asc LIMIT 300 OFFSET 0; Change-Id: Ib89743ca35f1d8d0a1456b20fa08c683ebdc1549	2020-10-26 14:47:48 +00:00
Moby von Briesen	7c3afe164b	satellite/overlay: uncomment dq for offline and disable with feature flag Change-Id: Ib39e2be32e880b822a94eddfb81af99a38843a27	2020-10-16 12:55:16 +00:00
Yaroslav Vorobiov	139a7ee959	private/migrate: add ablity to create dbs during migration Use tagsql.DB pointer as step database, to propagate changes back and forth between actual database and migration. Adds CreateDB operation to the migration step to be able to create new dbs before executing migration action. Adjusts storagenode database migration to use inner tagsql.DB pointer of each database as step.DB. Adjusts satellite dabase migration, adds proxy migrationDB field to satellite db that wraps itself as tagsql.DB, pointer of which is used as step.DB. Change-Id: Ifed4de5b01a356cf7b37db64d2eaeb7b61982c5c	2020-10-15 15:28:04 +03:00
Stefan Benten	0b43b93259	satellite/satellitedb: make limits per default NULL This change completes the column migration of `5f6fccc6e8` and `2f648fd981`. It resets every users project limits who are below or equal to our current production defaults. Change-Id: Ie041d08bb67b62844f6023190fc00bc2dad5b1cb	2020-10-14 20:28:16 +00:00
Egon Elbre	2268cc1df3	all: fix linter complaints Change-Id: Ia01404dbb6bdd19a146fa10ff7302e08f87a8c95	2020-10-13 15:59:01 +03:00
Egon Elbre	0bdb952269	all: use keyed special comment Change-Id: I57f6af053382c638026b64c5ff77b169bd3c6c8b	2020-10-13 15:13:41 +03:00
Jeff Wendling	0f0faf0a9f	satellite/orders: do a better job limiting concurrent requests Doing it at the ProcessOrders level was insufficient: the endpoints make multiple database calls. It was a misguided attempt to only have one spot enter the semaphore. By putting it in the endpoint we can not only be sure that the concurrency is correctly limited but it can be configurable easily. Change-Id: I937149dd077adf9eb87fce52a1a17dc0afe96f64	2020-10-09 16:27:15 -04:00
Jeff Wendling	7c303208ff	satellite/satellitedb: emergency temporary order processing semaphore we have thundering herds of order submissions that take all of the database connections causing temporary periodic outages. limit the amount of concurrent order processing to 2. Change-Id: If3f86cdbd21085a4414c2ff17d9ef6d8839a6c2b	2020-10-08 19:16:47 +00:00
Cameron Ayer	b39a99bae6	satellite/{overlay,satellitedb}: always show node's real online score Previously if a node did not have audit history data for each of the windows over the tracking period, we would give them the benefit of the doubt and set their score to 1. This was to prevent nodes from being suspended right out the gate. We need a minimum amount of data to evaluate them. However, a node who is actually failing at being online will have no idea until they have received enough audits and we suspend them. Instead, we will always use their real score, but use a flag to determine whether they are eligible for suspension/dq. Change-Id: I382218f12e8770f95d4bcddcf101ef348940cadf	2020-10-02 12:28:11 -04:00
Cameron Ayer	c2525ba2b5	satellite/{repair,satellitedb}: clean up healthy segments from repair queue at end of checker iteration Repair workers prioritize the most unhealthy segments. This has the consequence that when we finally begin to reach the end of the queue, a good portion of the remaining segments are healthy again as their nodes have come back online. This makes it appear that there are more injured segments than there actually are. solution: Any time the checker observes an injured segment it inserts it into the repair queue or updates it if it already exists. Therefore, we can determine which segments are no longer injured if they were not inserted or updated by the last checker iteration. To do this we add a new column to the injured segments table, updated_at, which is set to the current time when a segment is inserted or updated. At the end of the checker iteration, we can delete any items where updated_at < checker start. Change-Id: I76a98487a4a845fab2fbc677638a732a95057a94	2020-09-29 20:38:22 +00:00
Egon Elbre	c23a8e3b81	go.mod: update pgx to v4.9.0 Fix query to use TextArray instead of VarcharArray. Fix queries to use the correct type. Change-Id: Ibb7e55adba277d05778118d81ca697470e72c374	2020-09-29 19:03:08 +00:00
Egon Elbre	2d27bc8787	satellite/satellitedb: separate cockroach for migration tests Currently Cockroach migration test is the most heavy with regards to schema changes. This causes other tests to time out. This adds an alternate cockroach instance that is used for migration tests. Change-Id: I01fe9313527ff002f0bb0914dd52c3645b8eaf6d	2020-09-29 09:31:33 +00:00
Jessica Grebenschikov	4a2c66fa06	satellite/accounting: add cache for getting project storage and bw limits This PR adds the following items: 1) an in-memory read-only cache thats stores project limit info for projectIDs This cache is stored in-memory since this is expected to be a small amount of data. In this implementation we are only storing in the cache projects that have been accessed. Currently for the largest Satellite (eu-west) there is about 4500 total projects. So storing the storage limit (int64) and the bandwidth limit (int64), this would end up being about 200kb (including the 32 byte project ID) if all 4500 projectIDs were in the cache. So this all fits in memory for the time being. At some point it may not as usage grows, but that seems years out. The cache is a read only cache. When requests come in to upload/download a file, we will read from the cache what the current limits are for that project. If the cache does not contain the projectID, it will get the info from the database (satellitedb project table), then add it to the cache. The only time the values in the cache are modified is when either a) the project ID is not in the cache, or b) the item in the cache has expired (default 10mins), then the data gets refreshed out of the database. This occurs by default every 10 mins. This means that if we update the usage limits in the database, that change might not show up in the cache for 10 mins which mean it will not be reflected to limit end users uploading/downloading files for that time period.. Change-Id: I3fd7056cf963676009834fcbcf9c4a0922ca4a8f	2020-09-25 16:28:49 +00:00
Stefan Benten	38108828ac	satellite/satellitedb: enable multiple projects existing users Change-Id: I2ef77182d5464d72574698c8abfbbfdbda3f5a9e	2020-09-23 18:17:38 +02:00
Stefan Benten	5f6fccc6e8	satellite/satellitedb: makes limits nullable change backwards compatible Our current endpoints bail on us, if the column data is null. Thus we need to take the intermediate step and set the default to a fixed value and reset those with the following release. It sets the default column value to our current config values of 50GB for storage and bandwidth and 100 buckets, while still enabling the field to be nullable. All 0 values are migrated to be the default as well to ensure they can keep using their projects, as with the original change, 0 actually means 0. Change-Id: I797be80ce2d2105091599dc1b3fc76f74336b66b	2020-09-23 17:54:42 +02:00
Stefan Benten	2f648fd981	satellite: make limits be nullable Currently we have no way to actually set one of the following limits to 0 (meaning not usable): - maxBuckets - usageLimit - bandwidthLimit With having the field nullable, NULL corresponds to the global default, 0 now actually 0 and a set value determines a custom limit. Change-Id: I92bb77529dcbd0881ae8368921be9d246eb0919e	2020-09-21 19:34:19 +00:00
Qweder93	8182fdad0b	storagenode: heldamount renamed to payouts, renamed some methods and structs to more meaningful names. grouped estimated payout with pathouts satellite: heldamount renamed to SNOpayouts. Change-Id: I244b4d2454e0621f4b8e22d3c0d3e602c0bbcb02	2020-09-16 14:57:35 +00:00
Cameron Ayer	e7c34a053d	satellite/satellitedb: add column and index "updated_at" to injuredsegments Change-Id: I59e9bb2077885f09e17795375fe98ed31bd83d54	2020-09-14 12:53:04 -04:00
Michal Niewrzal	27a9d14e2a	satellite/repair: use metabase.SegmentKey type in repair package Another change which is a part of refactoring to replace path parameter (string/[]byte) with key paramter (metabase.SegmentKey) Change-Id: I617878442442e5d59bbe5c995f913c3c93c16928	2020-09-08 19:35:20 +00:00
Jennifer Johnson	4e2413a99d	satellite/satellitedb: uses vetted_at field to select for reputable nodes Additionally, this PR changes NewNodeFraction devDefault and testplanet config from 0.05 to 1. This is because many tests relied on selecting nodes that were reputable based on audit and uptime counts of 0, in effect, selecting new nodes as reputable ones. However, since reputation is now indicated by a vetted_at db field that is explicitly set rather than implied by audit and uptime counts, it would be more complicated to try to update all of the nodes' reputations before selecting nodes for tests. Now we just allow all test nodes to be new if needed. Change-Id: Ib9531be77408662315b948fd029cee925ed2ca1d	2020-09-04 16:45:32 +00:00
Michal Niewrzal	8649a00557	satellite/gracefulexit: replace `Path []byte` to `Key metabaseSegmentKey` TransferQueueItem We are unifying which name (and type) we are using for value we are using to point to segment. We want to use `key` instead of `path`. Dedicated type `metabase.SegmentKey` was created for this purposes also. This change is doing refactoring around gracefulexit. Change-Id: I90d51ff087b206179e61d5f1bc95f4709d76f917	2020-09-04 11:09:48 +00:00
Egon Elbre	dc48197bd8	satellite/orders: add bucket id to order limit Change-Id: I9019ec77d692e62ac17b67a1da71dc3535cde50c	2020-09-03 10:50:11 +03:00
Michal Niewrzal	0604a672c1	satellite/metainfo: use metabase in loop Change-Id: I1bb0c6fe0a762895fde950690b06f7dd9d77e178	2020-09-01 10:06:16 +00:00
Moby von Briesen	2d01dd9732	satellite/satellitedb: Add online_score column to nodes table Add online score used for the new audit history offline tracking system to the nodes table. This allows us easy access to the node's online score for the storagenode dashboard as well as for data analysis. Change-Id: Ie99be1192e5236862a5b3dbed2e5ef03b9169410	2020-08-31 15:07:07 +00:00
Moby von Briesen	60a95d0dc9	satellite/{satellitedb,overlay}: Enable offline suspension and review period When a node's audit history "online score" passes below a configured threshold, the node goes into "offline suspension" mode and begins a review period, where the operator is given an opportunity to bring their node back online. After the review period passes, offline suspension is turned off for the node. In the future, if a node still has a bad online score at the end of the review period, it will be disqualified. This is disabled right now. In the future, if a node is in offline suspension, it will be treated as "unhealthy". Right now, there are no consequences for being in offline suspension. Minor changes: * Moves AuditHistoryConfig out of UpdateStats/BatchUpdateStats args and into UpdateRequest. * Adds "now" argument to UpdateStats/BatchUpdateStats args for easy testing. * Changes formatting strings inside buildUpdateStatement to use specific types. Change-Id: I032b60298840fc16e6ef831da750f2d57619a397	2020-08-28 16:35:48 +00:00
Bill Thorp	729079965f	satellite/satellitedb : remove migation steps 69-102 Jenkins has been failing a lot lately due to test timeouts with CockroachDB. TestMigrateCockroach previously took around 5 minutes, now it takes 2. Why 103? I couldn't get 100 to work due to an error w/ NOT NULL and PKs. Change-Id: Iec95d4e25f9d6cd36920e7f43272c486a17fa879	2020-08-27 07:36:05 +00:00
Moby von Briesen	959cd5cd83	satellite/satellitedb: Update audit history from overlay.UpdateStats and overlay.BatchUpdateStats Change-Id: Ib530b61895ca4a8b12ba022c408a416b237b56d7	2020-08-20 22:46:28 +00:00
Moby von Briesen	5f0477ebe9	satellite/{overlay,satellitedb}: Create database functionality for updating audit history Add a function to the overlay cache called UpdateAuditHistory, which allows us to add online or offline audits to a particular node's audit history, and get that node's "online score" for the configured tracking period. The next step will be to use UpdateAuditHistory from inside BatchUpdateStats/UpdateStats, so that audit history is actually updated when nodes get audited, and we can suspend nodes based on their online score. Change-Id: I2289105e6961e68e829a987ff756b0e576fab120	2020-08-20 17:34:27 +00:00
Egon Elbre	94a09ce20b	all: add missing dots Change-Id: I93b86c9fb3398c5d3c9121b8859dad1c615fa23a	2020-08-11 17:50:01 +03:00
Ethan	ab1d0f097d	satellite/storageusage: Group accounting rollups at_rest_total by day When investigating a gap in storage usage data in the SN dashboard, I noticed that there were 2 entries in the accounting_rollups table on the date of the gap. This change accounts for multiple entries in the accounting_rollups table for a given day. Change-Id: Ibf2b5d0455117cb0417163e8fcfb7e509d594171	2020-08-10 15:03:15 +00:00
Kaloyan Raev	7552ff26ec	satellite/db: drop project_invoice_stamps table It's an obsolete table from earlier state of Stripe invoices implementation. No code is currently using it. It is confirmed that this table is currently empty across all satellites. Change-Id: I12d2756578faf8418ea8f3b09088e885694b8925	2020-08-10 13:22:10 +00:00
Kaloyan Raev	edfd3d7661	satellite/payments: delete `credits` and `credits_spendings` db tables Jira: https://storjlabs.atlassian.net/browse/USR-822 This the last step of dropping these 2 db tables. It also deletes all code associate with them. Change-Id: I8be840dc2a7be255cf6308c9434b729fe4d9391e	2020-07-30 12:19:57 +03:00
Egon Elbre	36ed939b89	satellite/orders: add buckets db to service We need to add bucket UUID into the order limit, hence we need access to the buckets table. Change-Id: I348ce1f709c9fcdec5c4034acaab59805b33da9f	2020-07-24 17:36:49 +03:00
Ethan	cfca021839	satellite/accounting: Add chore to cleanup old project bandwidth rollups data Removes old project_bandwidth_rollups records that are no longer used. Uses a retain months configuration to determine how many months to save. Current month cannot be removed. Tests retainMonths=-1, 0, 2 Change-Id: Ia4be2546cdb28802427acf41ecd85ad66df3e62c	2020-07-22 18:56:49 +00:00
Bill Thorp	65408db6e0	satellite/satellitedb: Coinpayments repeat insert bug fix I introduced a bug with https://review.dev.storj.io/c/storj/storj/+/2216 Because the log change allowed insert to be called multiple times. This changes the insert logic to do nothing if the PK already exists. Change-Id: I90d192a0f6619bfbb360ea104066f00a3348f6dd	2020-07-20 20:21:35 +00:00
Isaac Hess	67a292d135	satellite/satellitedb: Monitor node tallies We are adding a monkit evaluation for the total sum of data stored on the nodes before it is inserted into the database. This will give us a time-series history of total data stored so we can see it change over time. Change-Id: I41145a2d7a09c8e63b42ae578bd081035b60e529	2020-07-17 10:21:42 -06:00
Egon Elbre	d8dcae3075	all: fix error checking Change-Id: Ia0da1bbd6ce695139922f94096c2419281905e32	2020-07-16 19:13:14 +03:00
Egon Elbre	e70da5cd4e	all: fix comments Change-Id: I2d2307e3fab87de47a72b3595d051e2c95ff4f8a	2020-07-16 19:13:14 +03:00
Egon Elbre	080ba47a06	all: fix dots Change-Id: I6a419c62700c568254ff67ae5b73efed2fc98aa2	2020-07-16 14:58:28 +00:00
stefanbenten	9ace375ee0	satellite/{console,satellitedb}: change project limiting based on new users field This change switches the backend logic to use the new DB column on the users table to restrict project creation. Furthermore it back fills the existing limits from registration tokens to the new column to ensure no users are reset to the new default. UI is updated to reflect ability to create several projects Change-Id: Ie29157430ae6b065411ca4c4557c9f1be69cdc4f	2020-07-16 10:57:47 +00:00
stefanbenten	0209a2095f	satellite/{console,satellitedb}: add project_limit column to users table Change-Id: I603f085f17ca5b413dd1c6837c2081f9e7e791a1	2020-07-15 17:27:31 +00:00
stefanbenten	2c2d284f3d	satellite/admin: add bucket limit handling endpoint Change-Id: I4b199277cff30f11f4a9fff3b0ac4017b694f2e8	2020-07-15 17:27:23 +00:00
Jennifer Johnson	784a156eea	satellite: prevents uplink from creating a bucket once it exceeds the max bucket allocation. Change-Id: I4b3822ed723c03dbbc0df136b2201027e19ba0cd	2020-07-15 17:27:05 +00:00
stefanbenten	257855b5de	all: replace == comparison with errors.Is Change-Id: I05d9a369c7c6f144b94a4c524e8aea18eb9cb714	2020-07-14 15:50:25 +00:00
stefanbenten	0a32ba0e6b	satellite/admin: add project rename functionality Change-Id: I4c0f42d4c2c26859279f247f94cef97a8ff630a9	2020-07-14 11:36:49 +00:00
stefanbenten	f768302c91	satellite/admin: harden project deletion requirements Change-Id: Ia7ea469f87469b16e464dc22af24b98a6ef1873d	2020-07-14 11:36:29 +00:00
Jessica Grebenschikov	8abb907010	satellite/orders: add settle orders with window Why: We need a way to cut down on database traffic due to bandwidth measurement and tracking. What: This changeset is the Satellite side of settling orders in 1 hr windows. See design doc for more details: https://review.dev.storj.io/c/storj/storj/+/1732 Change-Id: I2e1c151e2e65516ebe1b7f47b7c5f83a3a220b31	2020-07-13 15:41:29 -07:00
paul cannon	bbdb351e5e	all: use jackc/pgx in place of lib/pq What: Use the github.com/jackc/pgx postgresql driver in place of github.com/lib/pq. Why: github.com/lib/pq has some problems with error handling and context cancellations (i.e. it might even issue queries or DML statements more than once! see https://github.com/lib/pq/issues/939). The github.com/jackx/pgx library appears not to have these problems, and also appears to be better engineered and implemented (in particular, it doesn't use "exceptions by panic"). It should also give us some performance improvements in some cases, and even more so if we can use it directly instead of going through the database/sql layer. Change-Id: Ia696d220f340a097dee9550a312d37de14ed2044	2020-07-13 15:54:41 +00:00
Egon Elbre	9dc9cd8a17	tests: allow STORJ_TEST_POSTGRES STORJ_POSTGRES_TEST naming was not consistent with STORJ_SIM_POSTGRES. This allows to use STORJ_TEST_POSTGRES for clarity, it still has a fallback to STORJ_POSTGRES_TEST. Change-Id: I6f294c66c80fcfd6750fea2a89795f3b7f5dd691	2020-07-10 16:43:49 +03:00
Jeff Wendling	885ef70c58	satellite/nodeapiversion: new table for tracking node api usage This system tracks an abstract "api version" from nodes based on their usage, allowing us to have latching behavior where if a node ever uses a new api, it can be blocked from using the old api. This is better than using self-reported semver version information because the node cannot lie, there's no confusion about what semver version implies which features, no questions about dev and ci environments, and no dependencies between reporting the version and using the new api. Change-Id: Ifeced5c9ae8e0a16102d79635e176a7d3bdd8ed4	2020-07-09 15:02:25 +00:00
Isaac Hess	fd740295ec	satellite/satellitedb: Add comment to revocation Change-Id: I1b65b7e46439c4788835ea5bfd4df3d32a713b44	2020-07-06 21:51:35 +00:00
Bill Thorp	00ae5ebbab	satellite/payemnts: Credit coin payments earlier Apply the coin payments when CoinPayments.net recieves the funds Instead of the when STORJ gets them from CoinPayments.net Based on 7/1/20 User Growth standup guidance by JG Relates to: https://storjlabs.atlassian.net/browse/USR-801 Change-Id: I174ca23a585010f39464c45525e1dfe0179b7c1a	2020-07-06 13:24:26 +00:00
Cameron Ayer	e3088d9ad5	satellite/satellitedb: add new DB table audit_histories Change-Id: I5f854514994cab9a68cf978f2dabfb588df695f5	2020-07-01 21:14:35 +00:00
Qweder93	b639ec08d4	satellite/heldamount: payments added, endpoind for payments added Change-Id: Ia2b9580bc353ef614680230c6f82c5bf6ded49c4	2020-07-01 18:15:01 +03:00
Cameron Ayer	cadb435d25	{satellite/audit, private/testplanet}: remove ErrAlreadyExists, run 2 audit workers in testplanet Since we increased the number of concurrent audit workers to two, there are going to be instances of a single node being audited simultaneously for different segments. If the node times out for both, we will try to write them both to the pending audits table, and the second will return an error since the path is not the same as what already exists. Since with concurrent workers this is expected, we will log the occurrence rather than return an error. Since the release default audit concurrency is 2, update testplanet default to run with concurrent workers as well. Change-Id: I4e657693fa3e825713a219af3835ae287bb062cb	2020-06-30 18:00:07 +00:00
Egon Elbre	d91cf5f4de	satellite/satellitedb: add missing SeparateTx Change-Id: I3ba5a4e0632a1e0e5e77c30e515953eadf05bc45	2020-06-26 12:27:05 +03:00
Egon Elbre	13a5854535	satellite/satellitedb: clarify test migration merging Use a field to distinguish migration steps that need to use a different transaction from previous steps. This is clearer than using a func. Change-Id: I2147369d05413f3e8ddb50c71a46ab1ba3ab5114	2020-06-25 14:32:45 +00:00
Cameron Ayer	3b4b5f45c7	satellite: replace references to Suspended with UnknownAuditSuspended Change-Id: I3d2d00c95954c0546ad077702617895f262926ef	2020-06-23 14:19:22 +00:00
Isaac Hess	2d727bb14e	satellite: Check macaroon revocation When a request comes in on the satellite api and we validate the macaroon, we now also check if any of the macaroon's tails have been revoked. Change-Id: I80ce4312602baf431cfa1b1285f79bed88bb4497	2020-06-22 13:50:07 -06:00
Egon Elbre	f68e7b3fde	satellite/overlay: replace pb.InfoResponse pb.InfoResponse wasn't used for protocol buffer communication, but instead as a satellite type. Change-Id: I755619f2deec5b76c4fe488591b7d8c1b9fcdafb	2020-06-16 15:16:55 +03:00
Cameron Ayer	0885ba5646	satellite/satellitedb: add new columns for offline suspension add new columns `offline_suspended` and `under_review` to nodes table. `unknown_audit_suspended` is a new column which will replace `suspended` Change-Id: I22ddeb338ea0ff63f14332a7ebd0f3e9e4c06cdc	2020-06-15 04:00:20 +00:00
paul cannon	7b8e91ff28	satellite/satellitedb: no orders for exited nodes We should not be sending any type of orders to nodes that have completed graceful exit with the current satellite. In particular, we should not be trying to audit them, because that would be silly. Change-Id: Ie2153e5739914ab696feefcdef28545ed70f84e4	2020-06-13 13:49:33 +00:00
Egon Elbre	1ed5a1bac5	satellite/satellitedb/satellitedbtest: skip omitted database The first implementation missed some changes. Change-Id: I7ae696175e0a9ea46954970ba8547638a05ed5a9	2020-06-11 13:28:16 +00:00
Cameron Ayer	bad299b541	satellite/satellitedb: serialize UpdateStats and BatchUpdateStats transactions Since we increased the number of audit workers from 1 to 2, we need to make sure concurrent updates do not trample each other. We can do this by serializing the transactions. Change-Id: If1b2f71cabe3c779c12ffa33c0c3271778ac3ae0	2020-06-10 17:11:28 +00:00
Egon Elbre	36c461bd59	private/tagsql: track proper closing of rows and statements This ensures that rows are closed to avoid leaks. Also verifies that Err() is called, to ensure that no error is left behind. Change-Id: Idd1bec9bf479f40021da67b2c80ce83033149469	2020-06-05 18:25:43 +00:00
Egon Elbre	34db4a80fd	ci: fix staticcheck failures Change-Id: I176fb24214755a1940a0a1a4e9cc8e39f184870b	2020-06-05 13:15:34 +00:00
Michal Niewrzal	2b2efcc662	satellite/payments/stripecoinpayments: move Coupons expiration date sorting directly to listing method Change-Id: I58d8a6ea1feba9ff2d19f21a1dbc87bfb8b49801	2020-06-04 09:47:42 +00:00
Jeff Wendling	254b42ff65	satellite/satellitedb: fix leaked rows from repairQueue.Insert Change-Id: If5e62c49770f591ebe3f4d2dd4dd2658c229a022	2020-06-03 14:31:21 -06:00
Michal Niewrzal	b20ced9519	satellite/satellitedb: drop project_id column from coupons table This is last part of https://storjlabs.atlassian.net/browse/USR-818 Change-Id: I053d11b37df962c12e46645bae2fc2dad49c9755	2020-06-03 14:56:41 +00:00
Cameron Ayer	6a60e1e96b	satellite/satellitedb: inclusive interval_start in GetAllocatedBandwidthTotal The DB query in GetAllocatedBandwidthTotal uses an exclusive range: 'WHERE interval_start > ?' The value that is used for this condition is the first day of current the month, 00:00:00 UTC. By using the exclusive '>', we exclude the entire first hour of the month from the result set. Change-Id: I3ed300f5230c7514dc9495a85e8166213cd0842e	2020-06-02 13:06:45 -04:00
Jeff Wendling	2b3545c0c3	satellite/satellitedb: use delete returning to query pending_serial_queue this way we don't have to do 2 steps, and by using the ctid, postgres is going to do two very efficient prefix scans. Change-Id: Ia9d0546cdf0a1af67ceec9cd508d336a5fdcbdb9	2020-06-01 15:43:33 -06:00
Jeff Wendling	44433f38be	satellite/satellitedb: remove ORDER BY when reading from queue also remove the continuation support from the queue, otherwise we may end up sequential scanning the entire table to get a few rows at the end. then, in the core, instead of looping both to get a big enough batch inside of the queue, as well as outside of it to ensure we consume the whole queue, just get a single batch at a time. also, make the queue size configurable because we'll need to do some tuning in production. Change-Id: If1a997c6012898056ace89366a847c4cb141a025	2020-06-01 18:31:14 +00:00
Yingrong Zhao	163c027a6d	satellite/satellitedb: remove monkit trace from convertDBNode In jaeger, it shows that this function gets called repetitively in a single request. Most of the time, it's less than 1ms. Therefore, it doesn't add much value in our trace but create noises. Change-Id: I20234f36bbcf0fc22f91e5e1a5634c0cad577ed0	2020-06-01 17:58:43 +00:00
Michal Niewrzal	a9f6489663	satellite/payments/stripecoinpayments: remove ProjectID from Coupon struct This change is removing ProjectID from code. Next change will be about dropping this colum from DB table. Change-Id: Idb949e2829e2c304a2b6b011259c7cc7667082e1	2020-06-01 11:37:20 +00:00
Egon Elbre	07050eea26	all: use common/storj Change-Id: Id1e36d52f9807b5ffbb72ce73f4b60cb21b68a78	2020-05-29 11:57:32 +03:00
Jeff Wendling	1e065fb450	satellite: migration to fix bad imported payment history the initial calculations for the historical values of comp_at_rest were wrong. because our historical data only included total amounts as well as compensation for bandwidth, the at rest value was calculated as at_rest = total - bandwidth unfortunately, that calculation did not take surge pricing into account correctly. the at rest and bandwidth values do not include surge pricing, but the total that was used did. so what we actually calculated was no_surge_at_rest = surge_total - no_surge_bandwidth which will create a value that is too large. this migration fixes the calculation for imports that are old enough and of a non-negligable difference. Change-Id: I61eb0b670510f6d7fb8fc3de39ba79150fac10eb	2020-05-28 12:59:08 -06:00
Michal Niewrzal	75b3db5426	satellite/payments/stripecoinpayments: test invoice user with more than 1 project https://storjlabs.atlassian.net/browse/USR-291 Change-Id: I98286e40254e8868de9eb675a9c9a8cd0bf5f3b1	2020-05-27 09:12:23 +00:00
Moby von Briesen	290c006a10	satellite/repair/{checker,queue}: add metric for new segments added to repair queue * add monkit stat new_remote_segments_needing_repair, which reports the number of new unhealthy segments in the repair queue since the previous checker iteration Change-Id: I2f10266006fdd6406ece50f4759b91382059dcc3	2020-05-27 06:23:47 +00:00
Jeff Wendling	074649835b	satellite/satellitedb: add some docs and improve some snapshots This attempts to add a README.md to help create consistent migrations that maximize our test coverage and do not include unnecessary statements. It also adds a feature to have an `-- OLD DATA --` section as well as a `-- NEW DATA --` section so that we can fix mistakes made in previous snapshots (like a row that was forgotten to be added when a table was created) without editing them going forward. Change-Id: I28a786f8ef163cae1de1bb08f61af1e1104b0a88	2020-05-22 21:27:36 +00:00
Jennifer Johnson	03e5f922c3	satellite/overlay: updates node with a vetted_at timestamp if they meet the vetting criteria What: As soon as a node passes the vetting criteria (total_audit_count and total_uptime_count are greater than the configured thresholds), we set vetted_at to the current timestamp. Why: We may want to use this timestamp in future development to select new vs vetted nodes. It also allows flexibility in node vetting experiments and allows for better metrics around vetting times. Please describe the tests: satellitedb_test: TestUpdateStats and TestBatchUpdateStats make sure vetted_at is set appropriately Please describe the performance impact: This change does add extra logic to BatchUpdateStats and UpdateStats and commits another variable to the db (vetted_at), but this should be negligible. Change-Id: I3de804549b5f1bc359da4935bc859758ceac261d	2020-05-20 16:30:26 -04:00
Egon Elbre	5d016425f1	satellite/{contact,downtime,overlay}: use NodeURL Change-Id: I555a479a89e0ddbf0499898bdbc8574282cd6846	2020-05-20 11:09:05 +00:00
Stefan Benten	0a26c4af9a	satellite/admin: add coupon deletion (#3893 )	2020-05-19 15:49:44 +03:00
Stefan Benten	671aca56b0	satellite/admin: add coupon creation and listing (#3891 )	2020-05-19 12:36:13 +02:00
Kaloyan Raev	49571f1a23	satellite/payments: all invoice commands require period To avoid including multiple months in a single invoice, we need all inspector's invoice commands to run in for specific period. See https://storjlabs.atlassian.net/browse/USR-725 Change-Id: I3637dc189234f02350daca8d897c21765762ea55	2020-05-14 11:50:19 +00:00
Jeff Wendling	6352d46100	satellite/satellitedb: do better ::date conversions There is a subtle problem when one does a cast with `::date`. Observe: teststorj=# set timezone = 'US/Eastern'; SET teststorj=# select (timestamp with time zone '2020-02-01 00:00:00+00')::date; date ------------ 2020-01-31 (1 row) teststorj=# set timezone = 'UTC'; SET teststorj=# select (timestamp with time zone '2020-02-01 00:00:00+00')::date; date ------------ 2020-02-01 (1 row) In order to correctly determine the date a timestamp is in, one has to explicitly pick the time zone that the date truncation should use otherwise postgres will use whatever setting the client has. These tests were failing for me locally, because I run my postgres in the US/Eastern time zone to try to tickle these bugs out. So it should be `(x at time zone 'UTC')::date` instead of just `x::date`. Change-Id: I4e9e32d4b53abc6165a4d0474f4702f8b9f801c7	2020-05-13 15:58:07 +00:00
Egon Elbre	0e3be60b79	satellite/satellitedb: simplify migrate step Change-Id: Ie4574144fb6ddd057d5fca740702c59fbdb2c5e4	2020-05-12 18:27:07 +03:00
Stefan Benten	e23bd806b4	satellite/accounting: separate usage and bandwidth limit (#3878 )	2020-05-12 15:01:15 +02:00
Michal Niewrzal	22fbe804e3	satellite/accounting: test if project bandwidth limits reset with billing cycle https://storjlabs.atlassian.net/browse/USR-287 Change-Id: I4dc5f6342417b6af3384da32d3d2ed8592904406	2020-05-11 15:11:53 +00:00
Moby von Briesen	8f60cfc4fb	satellite/overlay: Add flag for enabling/disabling disqualification from suspension mode Add a flag that allows us to easily switch disqualification from suspension mode on or off. A node will only be disqualified from suspension mode if it has been suspended for longer than the grace period AND the SuspensionDQEnabled flag is true. Change-Id: I9e67caa727183cd52ab2042b0a370a1bcaebe792	2020-05-04 17:25:09 +00:00
Ethan	acf53bea4d	satellite/orders;accounting: Add monthly project download bandwidth rollup See https://storjlabs.atlassian.net/browse/SM-776 Change-Id: Ifd5cccea43c556fd59822d17344f399cfe9a7164	2020-05-04 15:49:57 +00:00
Egon Elbre	8928399d02	all: rename CreateTables to MigrateToLatest CreateTables hasn't been quite true for a while now, rename to MigrateToLatest to be clearer in it's behavior. Change-Id: Ida48e95122a5d9b7a814e922d3698e00024a2ba7	2020-04-30 07:21:17 +00:00
Jessica Grebenschikov	6a6427526b	satellite/overlay: remove old updateaddress method The UpdateAddress method use to be used when storage node's checked in with the Satellite, but once the contact service was created this method was no longer used. This PR finally removes it. Change-Id: Ib3f83c8003269671d97d54f21ee69665fa663f24	2020-04-30 06:41:48 +00:00
Moby von Briesen	de366537a8	satellite/satellitedb/overlaycache: fix behavior around gracefully exited nodes Sometimes nodes who have gracefully exited will still be holding pieces according to the satellite. This has some unintended side effects currently, such as nodes getting disqualified after having successfully exited. * When the audit reporter attempts to update node stats, do not update stats (alpha, beta, suspension, disqualification) if the node has finished graceful exit (audit/reporter_test.go TestGracefullyExitedNotUpdated) * Treat gracefully exited nodes as "not reputable" so that the repairer and checker do not count them as healthy (overlay/statdb_test.go TestKnownUnreliableOrOffline, repair/repair_test.go TestRepairGracefullyExited) Change-Id: I1920d60dd35de5b2385a9b06989397628a2f1272	2020-04-28 23:58:43 +00:00
Egon Elbre	85c45cd56f	private/dbutil/pgtest: support multiple databases for testing Currently Cockroach isn't performant for concurrent database setup and tear-down. Instead of a single instance allow setting multiple potential connection strings and let the tests pick one connection string randomly. This improves test duration by ~10 minutes. While we are at significantly changing how pgtest works, introduce helper PickPostgres and PickCockroach for selecting the database to reduce code duplications in multiple places. Change-Id: I8ad171d5c4c8a4fc081ec2ae9bdd0cc948a80619	2020-04-28 21:55:49 +03:00
Natalie Villasana	6f84be133a	satellite/metainfo: add MigrateToLatest to PointerDB In cases like the segment reaper script connecting to the metainfodb, we don't want a db migration to happen automatically when we call metainfo.NewStore. This adds MigrateToLatest method for postgreskv and cockroackv, and calls MigrateToLatest in places where NewStore used to create tables. Change-Id: I682d0f26d609af0601dfdb32a24866cdf5d32a7e	2020-04-28 17:26:35 +00:00
Egon Elbre	ef913be234	satellite/satellitedb/satellitedbtest: don't use subtest naming A/B indicates that B is a subtest of A, however in this case they represent a configuration of the test, not a subtest. Change-Id: I64eed5d5bcb12759e54fe4b5373f8e88488e50f7	2020-04-27 19:32:09 +03:00
Ivan Fraixedes	03871d17c3	satellite/satellitedb: Update ticket ref Update a reference to a ticket in a code comment. Change-Id: Ib82220e94527482c5ca1a58d8614b919d1113ab5	2020-04-27 08:50:41 +00:00
Stefan Benten	d73630fd4a	satellite/satellitedb: Ensure we just return bucket usage for buckets that exist (#3863 )	2020-04-24 22:25:16 +02:00
Moby von Briesen	720e26d235	satellite/satellitedb/overlaycache: update unknown alpha/beta values properly Update unknown_audit_reputation_alpha and unknown_audit_reputation_beta. Add test to verify that BatchUpdateStats properly modifies unknown audit alpha/beta Change-Id: I0d5f9cac96a99f64905cf575b772402db0756a9d	2020-04-23 10:40:53 -04:00
Moby von Briesen	72b93f3120	satellite/satellitedb: disqualify suspended nodes when the grace period passes If a node is suspended and receives an unknown or failing audit, disqualify them if the grace period (default 1w in production) has passed. Migrate the nodes table so any node that is currently suspended gets unsuspended when the satellite starts up. Change-Id: I7b81c68026f823417faa0bf5e5cb5e67c7156b82	2020-04-22 15:45:00 -04:00
Ethan Adams	60e07f0a8b	Revert "satellite/accounting: Remove unnecessary index bucket_bandwidth_rollups_project_id_action_interval_index" This reverts commit `105dc7acc6`. Reason for revert: Recent changes to the Postgres query plan seems to want to use this index now. Reverting until we have time to analyze what's happening. Change-Id: I74b4b5a8f15c3850d8a958a29f51dbc80e7c282c	2020-04-22 14:49:04 +00:00
Qweder93	805e328c47	storagenode/heldamount payments removed Change-Id: I87cc04f43d182a4190a571ef417be85d02db9d34	2020-04-21 17:15:31 +00:00
Ethan	105dc7acc6	satellite/accounting: Remove unnecessary index bucket_bandwidth_rollups_project_id_action_interval_index See https://storjlabs.atlassian.net/browse/SM-738 Change-Id: I9ba3cc3fbff9f13fc0b95d25feee5a19e5a5c486	2020-04-21 16:43:09 +00:00
Qweder93	6e3585e394	satellite/heldamount/endpoint : GetAllPaystubs added Change-Id: Ic8cdd9db8b2a68796f9579c7fed2d49d9054bd64	2020-04-19 19:21:54 +03:00
Ethan	4cd86ff780	satellite/accounting: Add index on bucket_bandwidth_rollups for action, interval_start, and project_id See https://storjlabs.atlassian.net/browse/SM-551 for details Change-Id: I104c4e87d5aef500cc4a3893817763808f76c484	2020-04-17 19:14:45 +00:00
Jess G	5ea1602ca5	satellite/overlay: add selected node cache (#3846 ) * init implementation cache Change-Id: Ia54a1943e0707a77189bc5f4a9aaa8339c98d99a * one query to init cache Change-Id: I7c04b3ae104b553ae23fca372351a4328f632c66 * add monit tracking of cache Change-Id: I7d209e12c8f32d43708b23bf2126c5d5098e0a07 * add first test Change-Id: I0646a9349d457a9eb3920f7cd2d62fb72ffc3ab5 * add staleness to cache Change-Id: If002329bfdd53a4b200ad14dbd2ffc8b280aedb8 * add init test Change-Id: I3a3d0aa74cfac1d125fa93cb749316ed2a74d5b1 * fix comment Change-Id: I73353d00ccf0952b38c0f8ef7d1755c15cbfe9d9 * mv to nodeselection pkg Change-Id: I62487f768296c7a7b597fa398a4c42daf6e9c5b7 * add state to cache Change-Id: I081e77ec0e16706faee1a267de9a7fa643d6ac11 * add refresh concurrent test Change-Id: Idcba72508291099f280edc65355273c0acc3d3ce * add a few more tests Change-Id: I9422e9eaa22bf01c11f14bdb892ebcf7b3e5e5fb * fix tests, add min version to select allnodes Change-Id: I926f41d568951ad4ff70c6d4ceb87abb1e3e5009 * update comments Change-Id: I6ffe33e245ca65fb523c880cd72e63ce35776eb9 * fixes and rm Init Change-Id: Ifbe09b668978b5d9af09ca38cb080d02a2154cf4 * fix format Change-Id: I03cc217e28dc1839190c5c6dbdbb602c132a5a38	2020-04-14 13:50:02 -07:00
Moby von Briesen	d7794a4851	satellite/overlay: hardcode default values for audit alpha/beta Alpha=1 and beta=0 are the expected first values for any alpha/beta reputation system we are using in the codebase. So we are removing the configurability of these values. Change-Id: Ic61861b8ea5047fa1438ea6609b1d0048bf0abc3	2020-04-14 19:12:40 +00:00
Cameron Ayer	02613407ae	satellite/satellitedb: only suspend node if not already suspended Whenever the node's reputation is updated, if its unknown audit reputation is below the suspension threshold, its suspension field is set to the current time. This could overwrite the previous "suspendedAt" value resulting a node that never reaches the end of its suspension. Also log whenever a node is disqualified or its suspension status changes Change-Id: I5e8c8f1c46f66d79cb279b5b16a84fe03f533deb	2020-04-10 09:37:37 +00:00
Egon Elbre	d86cce202c	satellite/satellitedb: use arrays for arguments in node selection This simplifies the code and makes queries faster: name old time/op new time/op delta SelectStorageNodes-32 7.72ms ± 6% 7.22ms ± 3% -6.44% (p=0.016 n=5+5) SelectNewStorageNodes-32 7.75ms ± 2% 7.37ms ± 1% -4.89% (p=0.008 n=5+5) SelectStorageNodesExclusion-32 16.9ms ± 0% 16.6ms ± 0% -2.15% (p=0.008 n=5+5) SelectNewStorageNodesExclusion-32 17.2ms ± 0% 16.6ms ± 2% -3.69% (p=0.008 n=5+5) FindStorageNodes-32 45.5ms ± 0% 45.1ms ± 1% ~ (p=0.056 n=5+5) FindStorageNodesExclusion-32 77.4ms ± 0% 75.9ms ± 0% -1.91% (p=0.008 n=5+5) Change-Id: I38f77f6282b9738e8416113d42c6acb46c03da7b	2020-04-09 21:16:10 +03:00
Egon Elbre	ccf4f9ed2d	satellite/satellitedb: node selection code cleanup Reduce the number of non-methods to reduce funcs in the namespace also combine a func to slightly condense the code more. Change-Id: Ifbe728eb8c8ca4c981df648decd259c2097b6b40	2020-04-09 20:41:29 +03:00
Natalie Villasana	cf80b3caf3	satellite/overlay: combine SelectStorageNodes and SelectNewStorageNodes (#3831 )	2020-04-09 11:19:44 -04:00
Egon Elbre	11a44cdd88	all: don't depend on gogo/proto directly Change-Id: I8822dea0d1b7b99e0b828e0373a0308a42dde2be	2020-04-08 17:32:15 +00:00
Egon Elbre	cf26951a5b	satellite/satellitedb/pbold: remove dead code Change-Id: I7464773c20b8f99a601ca9cc4bee804f1ac14cf9	2020-04-08 15:22:31 +03:00
Jeff Wendling	2ded64ba2c	satellite/compensation: more fixes to get prod running smoothly Change-Id: I13a76d9d49222fb10796415a015f224d4084fde3	2020-04-07 10:10:27 +00:00
Jennifer Johnson	1547e791a3	satellitedb: remove free_bandwidth column from nodes table Change-Id: I9d1d3de9216c6533c1042ef473631721a011d086	2020-04-06 09:30:28 +00:00
Egon Elbre	9200efc61f	satellite/satellitedb: fix selecting a nullable string Change-Id: I59e645966e09da586512c69101691b47055c1e5a	2020-04-03 21:30:20 +03:00
Egon Elbre	6492b13d81	all: remove old uuid Change-Id: I3a137f73456f010c37d3933dbe12cbbb840b809f	2020-04-02 19:30:36 +03:00
Egon Elbre	8f73fb7a32	all: simplify uuid usage uuid.UUID implements driver.Value so it can be directly used as a scannable result. Replace uses of dbutil.BytesToUUID with uuid.FromBytes. Change-Id: I51a670185ceb3cc2199d5aa2b76bc3fc191ca8fe	2020-04-02 05:48:58 +00:00
Egon Elbre	a416b03941	satellite/accounting: fix TestProjectBandwidthTotal Test was inserting for past 4 days, however the test was summing up for the current month. Change-Id: I509afdc6a76b314a6bb90652ab70cd2c2bab1288	2020-04-01 11:50:18 +03:00
Egon Elbre	0a69da4ff1	all: switch to storj.io/common/uuid Change-Id: I178a0a8dac691e57bce317b91411292fb3c40c9f	2020-03-31 19:16:41 +03:00
Qweder93	dc32f1da55	storagenode/cache/heldamount added, errNoRows ignored Change-Id: If6b675e622d6c1324c0893c43cca93dc5323cd78	2020-03-31 11:35:58 +00:00
Jeff Wendling	e2ff2ce672	satellite: compensation package and commands Change-Id: I7fd6399837e45ff48e5f3d47a95192a01d58e125	2020-03-30 14:08:14 -06:00
Jennifer Johnson	d77f3b8786	satellitedb/migrate: set vetted_at backfill to now.day Change-Id: Ib2b12be43dbd3f3705b1891bc703ae15abb75e09	2020-03-30 16:50:23 +00:00
Egon Elbre	439aba922a	satellite/overlay: reduce overhead of GetNodes Instead of filtering on the client side it's better to filter on the database side. Change-Id: I845fbbe5ed28c2ffdb0b8a3f789b59c094fd1069	2020-03-30 18:36:23 +03:00
Egon Elbre	cb781d66c7	satellite/overlay: optimize FindStorageNodes Reduce the number of fields returned from the query. Benchmark results in `satellite/overlay`: benchstat before.txt after2.txt name old time/op new time/op delta SelectStorageNodes-32 7.85ms ± 1% 6.27ms ± 1% -20.18% (p=0.002 n=10+4) SelectNewStorageNodes-32 8.21ms ± 1% 6.61ms ± 0% -19.53% (p=0.002 n=10+4) SelectStorageNodesExclusion-32 17.2ms ± 1% 15.9ms ± 1% -7.55% (p=0.002 n=10+4) SelectNewStorageNodesExclusion-32 17.8ms ± 2% 16.1ms ± 0% -9.38% (p=0.002 n=10+4) FindStorageNodes-32 48.4ms ± 1% 45.1ms ± 0% -6.69% (p=0.002 n=10+4) FindStorageNodesExclusion-32 79.2ms ± 1% 76.1ms ± 1% -3.89% (p=0.002 n=10+4) Benchmark results from `satellite/overlay` after making them parallel: benchstat before-parallel.txt after2-parallel.txt name old time/op new time/op delta SelectStorageNodes-32 548µs ± 1% 353µs ± 1% -35.60% (p=0.029 n=4+4) SelectNewStorageNodes-32 562µs ± 0% 368µs ± 0% -34.51% (p=0.029 n=4+4) SelectStorageNodesExclusion-32 1.02ms ± 1% 0.84ms ± 0% -18.08% (p=0.029 n=4+4) SelectNewStorageNodesExclusion-32 1.03ms ± 1% 0.86ms ± 2% -16.22% (p=0.029 n=4+4) FindStorageNodes-32 3.11ms ± 0% 2.79ms ± 1% -10.27% (p=0.029 n=4+4) FindStorageNodesExclusion-32 4.75ms ± 0% 4.43ms ± 1% -6.56% (p=0.029 n=4+4) Change-Id: I1d85e2764eb270f4c2b1998303ccfc1179d65b26	2020-03-30 18:36:23 +03:00
Egon Elbre	e1a443b04a	private/testplanet: allow modifying created database Instead of providing the database from outside to testplanet create it inside and then allow wrapping and modifying it. This is more convenient to use. Change-Id: I9b8f69e6e0a19ff984b4e2bfe927c9100c77bc6c	2020-03-27 19:14:48 +00:00
Ethan	df462d7265	satellite/accounting: Add index on bucket_bandwidth_rollups to minimize full table scans https://storjlabs.atlassian.net/browse/SM-545 Change-Id: I5599a72a991d70236f17beca027e9bc032777177	2020-03-26 19:53:50 +00:00
Jeff Wendling	97e980cd8a	private/dbutil: add database name to configure as a tag storagenodes have like 10 or more databases. without this tag they all get sent as the same value, stomping on each other. Change-Id: Ib12019684d6ea8f2a5b83df584056dfa79e3c4b3	2020-03-26 16:50:15 +00:00
Jennifer Johnson	b75cbc8e24	satellite,storagenode: remove references to free bandwidth Change-Id: I42a6597544804fa9235e89ec656ebc365eb522e5	2020-03-25 22:28:34 +00:00
Michal Niewrzal	fdf40a7526	storj: remove `storj/private/version` package which was moved to `storj/private` repo Change-Id: I81c3f5b9d5e4fe7bca760999eb045ee9734e5e2e	2020-03-24 14:31:33 +00:00
Jessica Grebenschikov	aeab599d21	satellitedb: removed unused id on storagenode_storage_tallies table, add index on node_id The goal of this change is to improve the storagenode_storage_tallies table by removing the unneeded id column that is not being used but only taking up space, and also to add an index on a different column that needs it. Removing and adding a column seems simple, but ended up being more complicated because of some cockroachdb limitations. The cockroachdb limitation when trying to remove a column from a table and create a new primary key are: 1. only allows primary key creation at table creation time (docs: https://www.cockroachlabs.com/docs/stable/primary-key.html) 2. table drop or rename is performed async and cannot be done in a transaction (issue: https://github.com/cockroachdb/cockroach/issues/12123, https://github.com/cockroachdb/cockroach/issues/22868) To address these differences between cockroachdb and Postgres, this PR performs different migrations for the two database. The Postgres migration is straight forward and what you would expect, but the cockroach migration has two main changes: 1. To change a primary key, use the recommended process from the cockroachdb docs to create a new table with the new primary key you want and then migrate the data. 2. In order to do 1, we needed to do the new table renaming in a separate transaction from the data migration. Ref: SM-65 Change-Id: Idc9aee3ab57aa4d5570e3d2980afea853cd966bf	2020-03-20 14:39:44 -07:00
Jennifer Johnson	9b78473c0c	satellitedb: adds vetted_at nullable timestamp to nodes table Change-Id: I42d5a396b4eecbad26b683c6aee51e043d2ff034	2020-03-20 01:37:28 +00:00
Qweder93	0df586c3a8	satellitedb/heldamount updated, tests added + storagenode console updated Change-Id: I10f568a426d0fc42069d025de2accbef5b26dc0c	2020-03-19 15:37:45 +02:00
Jeff Wendling	115f4559e5	satellite/orders: more efficient processing of orders by doing an indexed anti-join we're able to reduce the time to select the pending orders by over 10x on postgres. this should help us process pending orders much more quickly. it probably won't do as good a job on cockroach because it does not do an indexed anti-join and instead does a hash join after scanning the entire consumed serials table. we should either remove orders entirely or try to make that more efficient when necessary. Change-Id: I8ca0535acd21c51e74955b24c9b86d20e4f2ff9c	2020-03-18 09:03:30 +00:00
Moby von Briesen	2f991b6c56	satellite/{overlay, satellitedb}: account for `suspended` field in overlay cache Make sure that suspended nodes are treated appropriately by the overlay cache. This means we should expect the following behavior: * suspended nodes (vetted or not) should not be selected for uploading new segments * suspended nodes should be treated by the checker and repairer as "unhealthy", and should be removed upon successful repair This commit also removes unused overlay functionality. Fixes a bug with commit `8b72181a1f` where the audit reporter was automatically suspending nodes regardless of audit outcome (see test added). Tests: * updates repair tests to ensure that a suspended node is treated as unhealthy and will be removed from the pointer on successful repair * updates overlay tests for KnownUnreliableOrOffline and KnownReliable to expect suspended nodes to be considered "unreliable" * adds satellitedb test that ensures overlay.SelectStorageNodes and overlay.SelectNewStorageNodes do not include suspended nodes * adds audit reporter test to ensure that different audit outcomes result in the correct suspended/disqualified states Change-Id: I40dba67278c8e8d2ce0bcec5e0a5cb6e4ce2f561	2020-03-17 17:14:56 +00:00
Michal Niewrzal	81afbcc12e	satellite/metainfo: check bucket existence on upload and listing Initial change for checking bucket existence on satellite side for requests like BeginObject and ListObjects. This is simple implementation that is just checking bucket in DB but should be improved in future to avoid DB calls as much as possible. Part of https://storjlabs.atlassian.net/browse/USR-365 Change-Id: I9076acddc44d7dbfa7612a1c24a007de01621583	2020-03-17 15:43:22 +00:00
Jeff Wendling	7baa59753a	satellite/orders: add tests for double sending the same order Change-Id: If2fa7f035257df3b04f506f81aa8b2e0916f5033	2020-03-17 14:18:03 +00:00
Ethan	bdbf764b86	satellite/orders;overlay: Consolidate order limit storage node lookups into 1 query. https: //storjlabs.atlassian.net/browse/SM-449 Change-Id: Idc62cc2978fba67cf48f7c98b27b0f996f9c58ac	2020-03-16 23:15:47 +00:00
Moby von Briesen	8b72181a1f	satellite/{audit,overlay,satellitedb}: implement unknown audit reputation and suspension * change overlay.UpdateStats to allow a third audit outcome. Now it can handle successful, failed, and unknown audits. * when "unknown audit reputation" (unknownAuditAlpha/(unknownAuditAlpha+unknownAuditBeta)) falls below the DQ threshold, put node into suspension. * when unknown audit reputation goes above the DQ threshold, remove node from suspension. * record unknown audits from audit reporter. * add basic tests around unknown audits and suspension. Change-Id: I125f06f3af52e8a29ba48dc19361821a9ff1daa1	2020-03-16 20:29:26 +00:00
Stefan Benten	52590197c2	satellite/payments: More Cleanup and Satellite command to ensure we have stripe customers (#3805 )	2020-03-16 20:34:15 +01:00
Qweder93	9f84261c36	storagenode/cache heldamount added Change-Id: I7fc807789de63e8a9b8ca2018fd73bdb9e01ad0d	2020-03-16 00:28:35 +02:00
Qweder93	94c4d1e737	satellite/satellitedb/heldamount added, endpoint added Change-Id: Ife8402b89f631f65ebb5cdf5ca02e99aa9b0b3ff	2020-03-13 18:15:52 +00:00
Jeff Wendling	41887883f3	satellite/satellitedb: check indexes on migration Change-Id: I5ba7ae2b512d77c70405ce332158f12128e27eed	2020-03-13 10:45:22 +00:00

... 3 4 5 6 7 ...

890 Commits