storj

Author	SHA1	Message	Date
Kaloyan Raev	6dff40f5c5	Merge remote-tracking branch 'origin/main' into multipart-upload Conflicts: go.mod go.sum satellite/metainfo/metainfo.go Change-Id: Ib5c49f3c911c58319855a171f9ce73657da976d9	2021-01-14 14:33:59 +02:00
Cameron Ayer	0184d33e96	satellite/satellitedb: set default 0 on uptime columns This is the first step in the removal of uptime columns on the nodes table. These columns are no longer used: uptime_success_count total_uptime_count uptime_reputation_alpha uptime_reputation_beta In order to avoid breaking backwards compatibility, we need to remove all references to these columns before removing the columns themselves from the database. However, since uptime_success_count and total_uptime_count are NOT NULLABLE, we can't remove them from the insert statements in the overlay. So we can't remove the columns because of the references, and we can't remove the references because the columns can't be null. What a pickle. To remedy this, we will set a default on the columns. Then we should be able to remove them from the insert statements Change-Id: I75f6c56fb7897835bbf29869f86f39de1d9dd345	2021-01-12 17:44:37 +00:00
Cameron Ayer	0403e99a5b	satellite/{overlay,satellitedb}: remove unused methods for old downtime tracking GetSuccessfulNodeNotCheckedInSince and GetOfflineNodesLimited are overlay methods which were only used by the previous downtime tracking system which has been removed. These methods should also be removed. Change-Id: Idb829d742e1f987e095604423fff656fe581183e	2021-01-11 15:21:28 +00:00
Michał Niewrzał	ec88d21a3c	Merge 'main' branch. Change-Id: I6e8162d1a6caf75e89c9f9c9f9522730aebf83ae	2021-01-11 10:26:58 +01:00
Moby von Briesen	6e2ef3b9ee	Revert "satellite/satellitedb: Do not consider nodes with offline_suspended as reputable." This reverts commit `e24262c2c9`. Change-Id: I287deb2e52d03bbd698ed055f0f216b0b5bf2798	2021-01-04 14:28:37 +00:00
Michał Niewrzał	ad3e3a38c5	Merge 'main' branch Change-Id: Ia0db1b1f9ef3e0671d3f2208881b0abc3064e200	2021-01-04 12:13:45 +01:00
Moby von Briesen	825dc71227	satellite/{overlay, satellitedb}: Refactor audit history * Separate audit history interface into its own file in the overlay package * Add overlay.AuditHistory struct so that internalpb.AuditHistory is only used from within the database layer * Add overlay.GetAuditHistory function for features that will require access to detailed audit history information * Do not return full audit history from UpdateAuditHistory - callers to that function only need to know the online score and whether a full tracking period has been completed * Move audit history tests out of satellite/satellitedb, since they are independent of database implementation Change-Id: I35b0c4ac23bbaabd80624f8a9631c3cb1a1f33bd	2020-12-29 18:50:22 +00:00
Moby von Briesen	85ae13f11d	satellite/satellitedb: Drop nodes_offline_times table. Now that the deprecated downtime tracking service is removed (`3fc76f4ffe`), we can safely remove the nodes_offline_times table. Change-Id: Ia7c6efe32ba104dff5a830af5f2beee3337eefe5	2020-12-29 18:17:50 +00:00
Moby von Briesen	e24262c2c9	satellite/satellitedb: Do not consider nodes with offline_suspended as reputable. Nodes which are offline_suspended will no longer be considered for new uploads. The current threshold that enters a node into offline suspension is 0.6. Disqualification for offline suspension is still disabled. Change-Id: I0da9abf47167dd5bf6bb21e0bc2186e003e38d1a	2020-12-29 17:59:09 +00:00
Ethan Adams	6070018021	satellite/overlay: use AS OF SYSTEM TIME with Cockroach Query nodes table using AS OF SYSTEM TIME '-10s' (by default) when on CRDB to alleviate contention on the nodes table and minimize CRDB retries. Queries for standard uploads are already cached, and node lookups for graceful exit uploads has retry logic so it isn't necessary for the nodes returned to be current.	2020-12-22 21:07:07 +02:00
Michal Niewrzal	9a8959d429	Merge 'master' branch Change-Id: Iba69ea73ca4d3f1cd4ae94243eaaae033c5324e8	2020-12-22 14:55:57 +01:00
Ethan Adams	563197c628	satellite/overlay: Add index on nodes table (#4012 ) satellite/accounting: Add index for project_id on bucket_storage_tallies	2020-12-21 12:48:48 -05:00
Ethan Adams	9b52283570	satellite/accounting: Add index for project_id on bucket_storage_tallies (#4010 ) Change-Id: I47ab2d1e24f94307c3383c497cffe2a150fa8ab7	2020-12-21 11:42:00 -05:00
Ethan Adams	6e501898c3	satellite/accounting: Performance improvements to getNodeIds used by GetBandwidthSince (#4009 )	2020-12-21 16:37:01 +01:00
Jessica Grebenschikov	da0327c9b7	satellite/dbcleanup: remove expired serial chore Change-Id: Ib71d41eb6679d6435e5bc10b6244dac66380a74e	2020-12-18 09:36:28 -08:00
Michal Niewrzal	2111740236	Merge 'master' branch Change-Id: Ib73af0ff3ce0e9a1547b0b9fc55bf88704f6f394	2020-12-18 09:13:24 +01:00
Cameron Ayer	28eaae66af	satellite/satellitedb: drop num_healthy_pieces column from injuredsegments This column is no longer used as it has been replaced by the segment_health column. Change-Id: I6b4df89cd4f994d8418976f88e8c5f57615f8115	2020-12-17 20:17:08 +00:00
Michal Niewrzal	70ba4deea9	satellite/repair/checker: adjust irreparable part of repair checker Change-Id: I0732104a97ba18a5359de3966cd692677a0ff790	2020-12-17 14:11:22 +00:00
Kaloyan Raev	9aa61245d0	satellite/audits: migrate to metabase Change-Id: I480c941820c5b0bd3af0539d92b548189211acb2	2020-12-17 14:38:48 +02:00
Michal Niewrzal	57f374af24	Merge 'master' branch Change-Id: Idf6b10ea7ca94e4d232e6a3b6a38ef2e646ba197	2020-12-15 08:26:53 +01:00
Stefan Benten	9fe477899b	satellite/satellitedb: add lint ignore rule to support staticcheck 2020.2 staticcheck 2020.2 is not liking our dbx files, so we need to ignore them. Change-Id: I6becc3619bb088473f9776d0878ce240d4935936	2020-12-14 21:16:31 +00:00
Jessica Grebenschikov	0649d2b930	satellite/repair: improve contention for injuredsegments table on CRDB We migrated satelliteDB off of Postgres and over to CockroachDB (crdb), but there was way too high contention for the injuredsegments table so we had to rollback to Postgres for the repair queue. A couple things contributed to this problem: 1) crdb doesn't support `FOR UPDATE SKIP LOCKED` 2) the original crdb Select query was doing 2 full table scans and not using any indexes 3) the SLC Satellite (where we were doing the migration) was running 48 repair worker processes, each of which run up to 5 goroutines which all are trying to select out of the repair queue and this was causing a ton of contention. The changes in this PR should help to reduce that contention and improve performance on CRDB. The changes include: 1) Use an update/set query instead of select/update to capitalize on the new `UPDATE` implicit row locking ability in CRDB. - Details: As of CRDB v20.2.2, there is implicit row locking with update/set queries (contention reduction and performance gains are described in this blog post: https://www.cockroachlabs.com/blog/when-and-why-to-use-select-for-update-in-cockroachdb/). 2) Remove the `ORDER BY` clause since this was causing a full table scan and also prevented the use of the row locking capability. - While long term it is very important to `ORDER BY segment_health`, the change here is only suppose to be a temporary bandaid to get us migrated over to CRDB quickly. Since segment_health has been set to infinity for some time now (re: https://review.dev.storj.io/c/storj/storj/+/3224), it seems like it might be ok to continue not making use of this for the short term. However, long term this needs to be fixed with a redesign of the repair workers, possible in the trusted delegated repair design (https://review.dev.storj.io/c/storj/storj/+/2602) or something similar to what is recommended here on how to implement a queue on CRDB https://dev.to/ajwerner/quick-and-easy-exactly-once-distributed-work-queues-using-serializable-transactions-jdp, or migrate to rabbit MQ priority queue or something similar.. This PRs improved query uses the index to avoid full scans and also locks the row its going to update and CRDB retries for us if there are any lock errors. Change-Id: Id29faad2186627872fbeb0f31536c4f55f860f23	2020-12-10 09:51:26 -08:00
Michal Niewrzal	b3acc1101a	Merge 'master' branch Change-Id: Iee99400c7095770e61cde94b3b2c8eb0ddec463d	2020-12-10 15:42:52 +01:00
Michal Niewrzal	c2a97aeb14	satellite/satellitedb: add ListAllBuckets method We need to be able to list all buckets in DB without knowing project ID. This method will be used to list buckets for metainfo loop implementation based on metabase. Change-Id: Iac75af0eee4f31e80a15577575a8249cbca787b2	2020-12-10 14:19:27 +00:00
Michal Niewrzal	218bbeaffa	Merge 'master' branch Change-Id: Ica5c25607a951076dd9f77e35e308062f71ce3f0	2020-12-07 15:05:52 +01:00
Stefan Benten	494bd5db81	all: golangci-lint v1.33.0 fixes (#3985 )	2020-12-05 17:01:42 +01:00
Ethan Adams	f90ea10a4a	Allow for DB application names per process. (#3983 )	2020-12-04 11:24:39 +01:00
Moby von Briesen	3fc76f4ffe	satellite/downtime: Remove deprecated downtime tracking service. We are no longer planning on implementing downtime penalization using the method described in docs/blueprints/archive/storage-node-downtime-tracking-deprecated.md. Now, we are implementing the design described in docs/blueprints/storage-node-downtime-tracking-with-audits.md. This change removes the downtime estimation chores from the satellite core as well as the package satellite/downtime. A future change will remove the database table. Change-Id: I1a1d3cf9dceeba36255d25243294865b89925518	2020-12-02 15:16:13 -05:00
JT Olio	1728c3a992	satellite/dbx: standardize on assignment Change-Id: I8f87bc8391e765e4480b0590d92d3601248e1f93	2020-12-01 16:10:18 +00:00
JT Olio	70b91aac54	satellitedb: remove cruft caused by https://review.dev.storj.io/c/storj/storj/+/3223 Change-Id: I198bb2f869cc7177b9ecafdd8932bbf2b58be5b8	2020-12-01 00:16:26 +00:00
Michal Niewrzal	5a7bc9657d	Merge 'master' branch Change-Id: If583132a821274dc4b78cf5f72b853ba8460c619	2020-11-30 12:57:22 +01:00
Egon Elbre	f456d7ce03	satellite: remove implementation detail from DB interface Which database access and how it internally does migrations is an implementation detail and does not belong in the requirements interface. Change-Id: Ia4a6994f39470063a96a8e5f3a1bd27aa79fe5cd	2020-11-30 13:29:20 +02:00
Egon Elbre	28ea63be92	satellite/repair: avoid TestDBAccess Change-Id: I34adb58cd67fba5917032f2f328d75b1c4afdbbf	2020-11-30 13:29:08 +02:00
JT Olio	71e11b27f3	satellite/dbx: only retry with cockroach Change-Id: Id3630c26dbfda36dcbece2849e2353d5ab2882af	2020-11-29 18:10:07 -07:00
JT Olio	bd23d12bb9	satellite/dbx: add cockroach retries for other QueryContext operations Change-Id: Ia30fbba55c926892702fa96fb9dd01b75347d351	2020-11-29 18:09:56 -07:00
JT Olio	ea2f39ca7f	satellite/dbx: add retries for QueryRowContext-based operations Change-Id: Ie2527b673dd4ce5250cf5c0cbf8f14921262f665	2020-11-29 18:09:46 -07:00
JT Olio	d3b0691bbd	satellite/dbx: import dbx templates these are unchanged from storj.io/dbx. we're importing them because in a later commit we will change them, and it'd be nice to see that diff as a separate commit. Change-Id: I8315130ed6bab397bd65b9a1a90c29d130b8c02d	2020-11-29 18:09:33 -07:00
JT Olio	5d8a67a4f7	satellitedb: retry GetBandwidthSince on cockroach Change-Id: I2bf20f3a19e7f3af97630d8a679410feba70661e	2020-11-29 16:36:15 -07:00
Ethan	5dc013d3bd	satellite/overlay: Add retry to all selects in overlaycache Change-Id: I0356d71a35701f8e0ca04a34b2bb2aea666c1394	2020-11-29 16:46:57 -05:00
JT Olio	6bce907cb0	satellite: try to stream rollups to aggregation function to use less memory this change tries really hard to never have all of the storage node rollups in memory at the same time, up until the rollups are actually getting summed together. Change-Id: If67f49e7d71106798d996a6850b3e48671bd9e18	2020-11-29 10:26:32 -07:00
JT Olio	6aae21541f	satellitedb: do saverollup in batches Change-Id: I78278a192cba60541eee2986f54a88d5a479bd3e	2020-11-28 19:26:46 -07:00
JT Olio	0ba516d405	satellite: support pointing db components at different databases the immediate need is to be able to move the repair queue back out of cockroach if we can't save it. Change-Id: If26001a4e6804f6bb8713b4aee7e4fd6254dc326	2020-11-28 18:39:16 +00:00
Michal Niewrzal	efaba85c73	Merge 'master' branch Change-Id: I3520b3e327732929f5167b07a15ddb92d26cae1b	2020-11-24 10:03:20 +01:00
Egon Elbre	55d5e1fd7d	satellite/orders: ensure that expired deletion doesn't stall Add checks to ensure that when somebody uses empty options, the deletion doesn't loop infinitely. Change-Id: I1738fb1e7e1f8efbbb954c491cb6489f7bcdc2db	2020-11-23 14:52:40 +02:00
Ethan	2b92bba563	satellite/satellitedb/orders: Handle serial_numbers deletes in smaller increments on CRDB CRDB doesn't like large deletes. While testing in the POC environment we found that deletes on the serial_numbers table could take hours. This change limits deletes to 1000 at a time (configurable) to avoid blocking other queries. Change-Id: I08455e25db1574579dd4d7b7125a08e9c913dff1	2020-11-20 13:44:52 +00:00
Moby von Briesen	a8b66dce17	satellite/accounting: account for old orders that can be submitted in satellite rollup With the new phase 3 order submission, orders can be added to the storage and bandwidth rollup tables at timestamps before the most recent rollup was run. This change shifts the start time of each new rollup window to account for any unexpired orders that might have been added since the previous rollup. A satellitedb migration is necessary to allow upserts in the accounting_rollups table when entries with identical node_ids and start_times are inserted. Change-Id: Ib3022081f4d6be60cfec8430b45867ad3c01da63	2020-11-18 14:46:00 -05:00
Moby von Briesen	0ec685b173	satellite/{satellitedb, repair/{queue, checker}}: Use new column "segmentHealth" instead of "numHealthy" in injured segments queue We plan to add support for a new Reed-Solomon scheme soon, but our repair queue orders segments by least number of healthy pieces first. With a second RS scheme, fewer healthy pieces will not necessarily correlate to lower health. This change just adds the new column in a migration. A separate change will add the new health function. Right now, since we only support one RS scheme, behavior will not change. Number of healthy pieces is being inserted as "segment health" until the new health function is merged. Segment health is calculated with a new priority function created in commit `3e5640359`. In order to use the function, a new config value is added, called NodeFailureRate, representing the approximate probability of any individual node going down in the duration of one checker run. Change-Id: I51c4202203faf52528d923befbe886dbf86d02f2	2020-11-16 21:18:09 +00:00
Michal Niewrzal	7c384c8293	Merge 'master' branch Change-Id: I1eefd5a56449e577820977d61fa4a22bdd4fc230	2020-11-16 10:02:54 +01:00
Jessica Grebenschikov	f558cc825e	satellite/orders: add storagenode_bw_phase2 table and dont delete tallies for longer It turns out we need to make 2 more changes in order for the new order submission phase 3 to get deployed. This PR makes 2 changes: 1) when the rollup service deletes tallies, we now keep tallies around until orders expire (vs 1 day like before). 2) the reported rollup chore will now write the storagenode_bandwidth_rollups to a new table _phase2 as an intermediary step so it doesn't conflict with phase 3 order settlement. These changes need to be deployed for 2 days before we can turn on phase 3 of the new orders settlement workflow. Change-Id: Iafbff577ba7d55f8f17b7db857311b2ce799de60	2020-11-13 17:15:24 +00:00
Michal Niewrzal	7dde184cb5	Merge 'master' branch Change-Id: I6070089128a150a4dd501bbc62a1f8b394aa643e	2020-11-10 11:58:59 +00:00
Cameron Ayer	dc67ce74c9	satellite: remove IsUp field from overlay.UpdateRequest With the new overlay.AuditOutcome type for offline audits, the IsUp field is redundant. If AuditOutcome != AuditOffline, then the node is online. In addition to removing the field itself, other changes needed to be made regarding the relationship between 'uptime' and 'audits'. Previously, uptime and audit outcome were completely separated. For example, it was possible to update a node's stats to give it a successful/failed/unknown audit while simultaneously indicating that the node was offline by setting IsUp to false. This is no longer possible under this changeset. Some test which did this have been changed slightly in order to pass. Also add new benchmarks for UpdateStats and BatchUpdateStats with different audit outcomes. Change-Id: I998892d615850b1f138dc62f9b050f720ea0926b	2020-11-02 15:34:17 -05:00
Egon Elbre	7183dca6cb	all: fix defers in loop defer should not be called in a loop. Change-Id: Ifa5a25a56402814b974bcdfb0c2fce56df8e7e59	2020-11-02 15:06:38 +02:00
Egon Elbre	716068a1e0	Merge branch 'master'. Change-Id: Ic14325edc291573582dce0cea3e04991a820b48b	2020-11-02 13:02:01 +02:00
Egon Elbre	11338e9beb	satellite/internalpb: move audithistory.pb Change-Id: I8eee84d49ed90459168ddaf04ae57f790c2a22c4	2020-10-30 15:30:11 +02:00
Egon Elbre	7ce372c686	satellite/internalpb: add inspectors Change-Id: Ib688e43d05135c0c31ae95df533f1e4535ea396a	2020-10-30 13:28:17 +02:00
Egon Elbre	004e610d0f	satellite/internalpb: move datarepair.pb to internal Change-Id: If901d9ff4e5ee6715b963eeeb46513a602a44b3d	2020-10-30 13:28:14 +02:00
Kaloyan Raev	b8c6fb764c	satellite/metainfo: add metabase to metainfo service Change-Id: Ie3ff238b138d8a57d99e32b13f7a71aa624d53e3	2020-10-30 12:49:47 +02:00
Egon Elbre	caefde6b32	private/{dbutil,tagsql}: pass ctx to database opening Database opening usually dial and hence we should pass ctx to them. Change-Id: Iaa2875981570d83e65be3710f841cf30349f807b	2020-10-29 10:51:29 +00:00
Egon Elbre	e3985799a1	storage/{cockroachkv,postgreskv}: add ctx to opening Database opening usually dial and hence we should pass ctx to them. Change-Id: Iecf41241aaa94d54506cbc80b0e53449848d8819	2020-10-29 10:49:08 +00:00
Egon Elbre	9b2e00a38b	satellite: pass ctx into satellitedb.Open Opening a database requires ctx, this is first step to passing ctx to the appropriate level. Change-Id: Ic303e69f868ef3449ae36377937a29670cf635e2	2020-10-29 06:38:37 +00:00
Cameron Ayer	bb7be23115	satellite/{audit,overlay,satellitedb}: enable reporting offline audits - Remove flag for switching off offline audit reporting. - Change the overlay method used from UpdateUptime to BatchUpdateStats, as this is where the new online scoring is done. - Add a new overlay.AuditOutcome type: AuditOffline. Since we now use the same method to record offline audits as success, failure, and unknown, we need to distinguish offline audits from the rest. Change-Id: Iadcfe10cf13466fa1a1c2dc542db8994a6423355	2020-10-27 10:44:46 +00:00
Ethan	9a29ec5b3e	Add index to graceful_exit_transfer_queue table This fixes a slow query that was taking up to 4 seconds in production SELECT node_id, path, piece_num, root_piece_id, durability_ratio, queued_at, requested_at, last_failed_at, last_failed_code, failed_count, finished_at, order_limit_send_count FROM graceful_exit_transfer_queue WHERE node_id = '[redacted]' AND finished_at is NULL AND last_failed_at is NULL ORDER BY durability_ratio asc, queued_at asc LIMIT 300 OFFSET 0; Change-Id: Ib89743ca35f1d8d0a1456b20fa08c683ebdc1549	2020-10-26 14:47:48 +00:00
Moby von Briesen	7c3afe164b	satellite/overlay: uncomment dq for offline and disable with feature flag Change-Id: Ib39e2be32e880b822a94eddfb81af99a38843a27	2020-10-16 12:55:16 +00:00
Yaroslav Vorobiov	139a7ee959	private/migrate: add ablity to create dbs during migration Use tagsql.DB pointer as step database, to propagate changes back and forth between actual database and migration. Adds CreateDB operation to the migration step to be able to create new dbs before executing migration action. Adjusts storagenode database migration to use inner tagsql.DB pointer of each database as step.DB. Adjusts satellite dabase migration, adds proxy migrationDB field to satellite db that wraps itself as tagsql.DB, pointer of which is used as step.DB. Change-Id: Ifed4de5b01a356cf7b37db64d2eaeb7b61982c5c	2020-10-15 15:28:04 +03:00
Stefan Benten	0b43b93259	satellite/satellitedb: make limits per default NULL This change completes the column migration of `5f6fccc6e8` and `2f648fd981`. It resets every users project limits who are below or equal to our current production defaults. Change-Id: Ie041d08bb67b62844f6023190fc00bc2dad5b1cb	2020-10-14 20:28:16 +00:00
Egon Elbre	2268cc1df3	all: fix linter complaints Change-Id: Ia01404dbb6bdd19a146fa10ff7302e08f87a8c95	2020-10-13 15:59:01 +03:00
Egon Elbre	0bdb952269	all: use keyed special comment Change-Id: I57f6af053382c638026b64c5ff77b169bd3c6c8b	2020-10-13 15:13:41 +03:00
Jeff Wendling	0f0faf0a9f	satellite/orders: do a better job limiting concurrent requests Doing it at the ProcessOrders level was insufficient: the endpoints make multiple database calls. It was a misguided attempt to only have one spot enter the semaphore. By putting it in the endpoint we can not only be sure that the concurrency is correctly limited but it can be configurable easily. Change-Id: I937149dd077adf9eb87fce52a1a17dc0afe96f64	2020-10-09 16:27:15 -04:00
Jeff Wendling	7c303208ff	satellite/satellitedb: emergency temporary order processing semaphore we have thundering herds of order submissions that take all of the database connections causing temporary periodic outages. limit the amount of concurrent order processing to 2. Change-Id: If3f86cdbd21085a4414c2ff17d9ef6d8839a6c2b	2020-10-08 19:16:47 +00:00
Cameron Ayer	b39a99bae6	satellite/{overlay,satellitedb}: always show node's real online score Previously if a node did not have audit history data for each of the windows over the tracking period, we would give them the benefit of the doubt and set their score to 1. This was to prevent nodes from being suspended right out the gate. We need a minimum amount of data to evaluate them. However, a node who is actually failing at being online will have no idea until they have received enough audits and we suspend them. Instead, we will always use their real score, but use a flag to determine whether they are eligible for suspension/dq. Change-Id: I382218f12e8770f95d4bcddcf101ef348940cadf	2020-10-02 12:28:11 -04:00
Cameron Ayer	c2525ba2b5	satellite/{repair,satellitedb}: clean up healthy segments from repair queue at end of checker iteration Repair workers prioritize the most unhealthy segments. This has the consequence that when we finally begin to reach the end of the queue, a good portion of the remaining segments are healthy again as their nodes have come back online. This makes it appear that there are more injured segments than there actually are. solution: Any time the checker observes an injured segment it inserts it into the repair queue or updates it if it already exists. Therefore, we can determine which segments are no longer injured if they were not inserted or updated by the last checker iteration. To do this we add a new column to the injured segments table, updated_at, which is set to the current time when a segment is inserted or updated. At the end of the checker iteration, we can delete any items where updated_at < checker start. Change-Id: I76a98487a4a845fab2fbc677638a732a95057a94	2020-09-29 20:38:22 +00:00
Egon Elbre	c23a8e3b81	go.mod: update pgx to v4.9.0 Fix query to use TextArray instead of VarcharArray. Fix queries to use the correct type. Change-Id: Ibb7e55adba277d05778118d81ca697470e72c374	2020-09-29 19:03:08 +00:00
Egon Elbre	2d27bc8787	satellite/satellitedb: separate cockroach for migration tests Currently Cockroach migration test is the most heavy with regards to schema changes. This causes other tests to time out. This adds an alternate cockroach instance that is used for migration tests. Change-Id: I01fe9313527ff002f0bb0914dd52c3645b8eaf6d	2020-09-29 09:31:33 +00:00
Jessica Grebenschikov	4a2c66fa06	satellite/accounting: add cache for getting project storage and bw limits This PR adds the following items: 1) an in-memory read-only cache thats stores project limit info for projectIDs This cache is stored in-memory since this is expected to be a small amount of data. In this implementation we are only storing in the cache projects that have been accessed. Currently for the largest Satellite (eu-west) there is about 4500 total projects. So storing the storage limit (int64) and the bandwidth limit (int64), this would end up being about 200kb (including the 32 byte project ID) if all 4500 projectIDs were in the cache. So this all fits in memory for the time being. At some point it may not as usage grows, but that seems years out. The cache is a read only cache. When requests come in to upload/download a file, we will read from the cache what the current limits are for that project. If the cache does not contain the projectID, it will get the info from the database (satellitedb project table), then add it to the cache. The only time the values in the cache are modified is when either a) the project ID is not in the cache, or b) the item in the cache has expired (default 10mins), then the data gets refreshed out of the database. This occurs by default every 10 mins. This means that if we update the usage limits in the database, that change might not show up in the cache for 10 mins which mean it will not be reflected to limit end users uploading/downloading files for that time period.. Change-Id: I3fd7056cf963676009834fcbcf9c4a0922ca4a8f	2020-09-25 16:28:49 +00:00
Stefan Benten	38108828ac	satellite/satellitedb: enable multiple projects existing users Change-Id: I2ef77182d5464d72574698c8abfbbfdbda3f5a9e	2020-09-23 18:17:38 +02:00
Stefan Benten	5f6fccc6e8	satellite/satellitedb: makes limits nullable change backwards compatible Our current endpoints bail on us, if the column data is null. Thus we need to take the intermediate step and set the default to a fixed value and reset those with the following release. It sets the default column value to our current config values of 50GB for storage and bandwidth and 100 buckets, while still enabling the field to be nullable. All 0 values are migrated to be the default as well to ensure they can keep using their projects, as with the original change, 0 actually means 0. Change-Id: I797be80ce2d2105091599dc1b3fc76f74336b66b	2020-09-23 17:54:42 +02:00
Stefan Benten	2f648fd981	satellite: make limits be nullable Currently we have no way to actually set one of the following limits to 0 (meaning not usable): - maxBuckets - usageLimit - bandwidthLimit With having the field nullable, NULL corresponds to the global default, 0 now actually 0 and a set value determines a custom limit. Change-Id: I92bb77529dcbd0881ae8368921be9d246eb0919e	2020-09-21 19:34:19 +00:00
Qweder93	8182fdad0b	storagenode: heldamount renamed to payouts, renamed some methods and structs to more meaningful names. grouped estimated payout with pathouts satellite: heldamount renamed to SNOpayouts. Change-Id: I244b4d2454e0621f4b8e22d3c0d3e602c0bbcb02	2020-09-16 14:57:35 +00:00
Cameron Ayer	e7c34a053d	satellite/satellitedb: add column and index "updated_at" to injuredsegments Change-Id: I59e9bb2077885f09e17795375fe98ed31bd83d54	2020-09-14 12:53:04 -04:00
Michal Niewrzal	27a9d14e2a	satellite/repair: use metabase.SegmentKey type in repair package Another change which is a part of refactoring to replace path parameter (string/[]byte) with key paramter (metabase.SegmentKey) Change-Id: I617878442442e5d59bbe5c995f913c3c93c16928	2020-09-08 19:35:20 +00:00
Jennifer Johnson	4e2413a99d	satellite/satellitedb: uses vetted_at field to select for reputable nodes Additionally, this PR changes NewNodeFraction devDefault and testplanet config from 0.05 to 1. This is because many tests relied on selecting nodes that were reputable based on audit and uptime counts of 0, in effect, selecting new nodes as reputable ones. However, since reputation is now indicated by a vetted_at db field that is explicitly set rather than implied by audit and uptime counts, it would be more complicated to try to update all of the nodes' reputations before selecting nodes for tests. Now we just allow all test nodes to be new if needed. Change-Id: Ib9531be77408662315b948fd029cee925ed2ca1d	2020-09-04 16:45:32 +00:00
Michal Niewrzal	8649a00557	satellite/gracefulexit: replace `Path []byte` to `Key metabaseSegmentKey` TransferQueueItem We are unifying which name (and type) we are using for value we are using to point to segment. We want to use `key` instead of `path`. Dedicated type `metabase.SegmentKey` was created for this purposes also. This change is doing refactoring around gracefulexit. Change-Id: I90d51ff087b206179e61d5f1bc95f4709d76f917	2020-09-04 11:09:48 +00:00
Egon Elbre	dc48197bd8	satellite/orders: add bucket id to order limit Change-Id: I9019ec77d692e62ac17b67a1da71dc3535cde50c	2020-09-03 10:50:11 +03:00
Michal Niewrzal	0604a672c1	satellite/metainfo: use metabase in loop Change-Id: I1bb0c6fe0a762895fde950690b06f7dd9d77e178	2020-09-01 10:06:16 +00:00
Moby von Briesen	2d01dd9732	satellite/satellitedb: Add online_score column to nodes table Add online score used for the new audit history offline tracking system to the nodes table. This allows us easy access to the node's online score for the storagenode dashboard as well as for data analysis. Change-Id: Ie99be1192e5236862a5b3dbed2e5ef03b9169410	2020-08-31 15:07:07 +00:00
Moby von Briesen	60a95d0dc9	satellite/{satellitedb,overlay}: Enable offline suspension and review period When a node's audit history "online score" passes below a configured threshold, the node goes into "offline suspension" mode and begins a review period, where the operator is given an opportunity to bring their node back online. After the review period passes, offline suspension is turned off for the node. In the future, if a node still has a bad online score at the end of the review period, it will be disqualified. This is disabled right now. In the future, if a node is in offline suspension, it will be treated as "unhealthy". Right now, there are no consequences for being in offline suspension. Minor changes: * Moves AuditHistoryConfig out of UpdateStats/BatchUpdateStats args and into UpdateRequest. * Adds "now" argument to UpdateStats/BatchUpdateStats args for easy testing. * Changes formatting strings inside buildUpdateStatement to use specific types. Change-Id: I032b60298840fc16e6ef831da750f2d57619a397	2020-08-28 16:35:48 +00:00
Bill Thorp	729079965f	satellite/satellitedb : remove migation steps 69-102 Jenkins has been failing a lot lately due to test timeouts with CockroachDB. TestMigrateCockroach previously took around 5 minutes, now it takes 2. Why 103? I couldn't get 100 to work due to an error w/ NOT NULL and PKs. Change-Id: Iec95d4e25f9d6cd36920e7f43272c486a17fa879	2020-08-27 07:36:05 +00:00
Moby von Briesen	959cd5cd83	satellite/satellitedb: Update audit history from overlay.UpdateStats and overlay.BatchUpdateStats Change-Id: Ib530b61895ca4a8b12ba022c408a416b237b56d7	2020-08-20 22:46:28 +00:00
Moby von Briesen	5f0477ebe9	satellite/{overlay,satellitedb}: Create database functionality for updating audit history Add a function to the overlay cache called UpdateAuditHistory, which allows us to add online or offline audits to a particular node's audit history, and get that node's "online score" for the configured tracking period. The next step will be to use UpdateAuditHistory from inside BatchUpdateStats/UpdateStats, so that audit history is actually updated when nodes get audited, and we can suspend nodes based on their online score. Change-Id: I2289105e6961e68e829a987ff756b0e576fab120	2020-08-20 17:34:27 +00:00
Egon Elbre	94a09ce20b	all: add missing dots Change-Id: I93b86c9fb3398c5d3c9121b8859dad1c615fa23a	2020-08-11 17:50:01 +03:00
Ethan	ab1d0f097d	satellite/storageusage: Group accounting rollups at_rest_total by day When investigating a gap in storage usage data in the SN dashboard, I noticed that there were 2 entries in the accounting_rollups table on the date of the gap. This change accounts for multiple entries in the accounting_rollups table for a given day. Change-Id: Ibf2b5d0455117cb0417163e8fcfb7e509d594171	2020-08-10 15:03:15 +00:00
Kaloyan Raev	7552ff26ec	satellite/db: drop project_invoice_stamps table It's an obsolete table from earlier state of Stripe invoices implementation. No code is currently using it. It is confirmed that this table is currently empty across all satellites. Change-Id: I12d2756578faf8418ea8f3b09088e885694b8925	2020-08-10 13:22:10 +00:00
Kaloyan Raev	edfd3d7661	satellite/payments: delete `credits` and `credits_spendings` db tables Jira: https://storjlabs.atlassian.net/browse/USR-822 This the last step of dropping these 2 db tables. It also deletes all code associate with them. Change-Id: I8be840dc2a7be255cf6308c9434b729fe4d9391e	2020-07-30 12:19:57 +03:00
Egon Elbre	36ed939b89	satellite/orders: add buckets db to service We need to add bucket UUID into the order limit, hence we need access to the buckets table. Change-Id: I348ce1f709c9fcdec5c4034acaab59805b33da9f	2020-07-24 17:36:49 +03:00
Ethan	cfca021839	satellite/accounting: Add chore to cleanup old project bandwidth rollups data Removes old project_bandwidth_rollups records that are no longer used. Uses a retain months configuration to determine how many months to save. Current month cannot be removed. Tests retainMonths=-1, 0, 2 Change-Id: Ia4be2546cdb28802427acf41ecd85ad66df3e62c	2020-07-22 18:56:49 +00:00
Bill Thorp	65408db6e0	satellite/satellitedb: Coinpayments repeat insert bug fix I introduced a bug with https://review.dev.storj.io/c/storj/storj/+/2216 Because the log change allowed insert to be called multiple times. This changes the insert logic to do nothing if the PK already exists. Change-Id: I90d192a0f6619bfbb360ea104066f00a3348f6dd	2020-07-20 20:21:35 +00:00
Isaac Hess	67a292d135	satellite/satellitedb: Monitor node tallies We are adding a monkit evaluation for the total sum of data stored on the nodes before it is inserted into the database. This will give us a time-series history of total data stored so we can see it change over time. Change-Id: I41145a2d7a09c8e63b42ae578bd081035b60e529	2020-07-17 10:21:42 -06:00
Egon Elbre	d8dcae3075	all: fix error checking Change-Id: Ia0da1bbd6ce695139922f94096c2419281905e32	2020-07-16 19:13:14 +03:00
Egon Elbre	e70da5cd4e	all: fix comments Change-Id: I2d2307e3fab87de47a72b3595d051e2c95ff4f8a	2020-07-16 19:13:14 +03:00
Egon Elbre	080ba47a06	all: fix dots Change-Id: I6a419c62700c568254ff67ae5b73efed2fc98aa2	2020-07-16 14:58:28 +00:00
stefanbenten	9ace375ee0	satellite/{console,satellitedb}: change project limiting based on new users field This change switches the backend logic to use the new DB column on the users table to restrict project creation. Furthermore it back fills the existing limits from registration tokens to the new column to ensure no users are reset to the new default. UI is updated to reflect ability to create several projects Change-Id: Ie29157430ae6b065411ca4c4557c9f1be69cdc4f	2020-07-16 10:57:47 +00:00
stefanbenten	0209a2095f	satellite/{console,satellitedb}: add project_limit column to users table Change-Id: I603f085f17ca5b413dd1c6837c2081f9e7e791a1	2020-07-15 17:27:31 +00:00
stefanbenten	2c2d284f3d	satellite/admin: add bucket limit handling endpoint Change-Id: I4b199277cff30f11f4a9fff3b0ac4017b694f2e8	2020-07-15 17:27:23 +00:00
Jennifer Johnson	784a156eea	satellite: prevents uplink from creating a bucket once it exceeds the max bucket allocation. Change-Id: I4b3822ed723c03dbbc0df136b2201027e19ba0cd	2020-07-15 17:27:05 +00:00
stefanbenten	257855b5de	all: replace == comparison with errors.Is Change-Id: I05d9a369c7c6f144b94a4c524e8aea18eb9cb714	2020-07-14 15:50:25 +00:00
stefanbenten	0a32ba0e6b	satellite/admin: add project rename functionality Change-Id: I4c0f42d4c2c26859279f247f94cef97a8ff630a9	2020-07-14 11:36:49 +00:00
stefanbenten	f768302c91	satellite/admin: harden project deletion requirements Change-Id: Ia7ea469f87469b16e464dc22af24b98a6ef1873d	2020-07-14 11:36:29 +00:00
Jessica Grebenschikov	8abb907010	satellite/orders: add settle orders with window Why: We need a way to cut down on database traffic due to bandwidth measurement and tracking. What: This changeset is the Satellite side of settling orders in 1 hr windows. See design doc for more details: https://review.dev.storj.io/c/storj/storj/+/1732 Change-Id: I2e1c151e2e65516ebe1b7f47b7c5f83a3a220b31	2020-07-13 15:41:29 -07:00
paul cannon	bbdb351e5e	all: use jackc/pgx in place of lib/pq What: Use the github.com/jackc/pgx postgresql driver in place of github.com/lib/pq. Why: github.com/lib/pq has some problems with error handling and context cancellations (i.e. it might even issue queries or DML statements more than once! see https://github.com/lib/pq/issues/939). The github.com/jackx/pgx library appears not to have these problems, and also appears to be better engineered and implemented (in particular, it doesn't use "exceptions by panic"). It should also give us some performance improvements in some cases, and even more so if we can use it directly instead of going through the database/sql layer. Change-Id: Ia696d220f340a097dee9550a312d37de14ed2044	2020-07-13 15:54:41 +00:00
Egon Elbre	9dc9cd8a17	tests: allow STORJ_TEST_POSTGRES STORJ_POSTGRES_TEST naming was not consistent with STORJ_SIM_POSTGRES. This allows to use STORJ_TEST_POSTGRES for clarity, it still has a fallback to STORJ_POSTGRES_TEST. Change-Id: I6f294c66c80fcfd6750fea2a89795f3b7f5dd691	2020-07-10 16:43:49 +03:00
Jeff Wendling	885ef70c58	satellite/nodeapiversion: new table for tracking node api usage This system tracks an abstract "api version" from nodes based on their usage, allowing us to have latching behavior where if a node ever uses a new api, it can be blocked from using the old api. This is better than using self-reported semver version information because the node cannot lie, there's no confusion about what semver version implies which features, no questions about dev and ci environments, and no dependencies between reporting the version and using the new api. Change-Id: Ifeced5c9ae8e0a16102d79635e176a7d3bdd8ed4	2020-07-09 15:02:25 +00:00
Isaac Hess	fd740295ec	satellite/satellitedb: Add comment to revocation Change-Id: I1b65b7e46439c4788835ea5bfd4df3d32a713b44	2020-07-06 21:51:35 +00:00
Bill Thorp	00ae5ebbab	satellite/payemnts: Credit coin payments earlier Apply the coin payments when CoinPayments.net recieves the funds Instead of the when STORJ gets them from CoinPayments.net Based on 7/1/20 User Growth standup guidance by JG Relates to: https://storjlabs.atlassian.net/browse/USR-801 Change-Id: I174ca23a585010f39464c45525e1dfe0179b7c1a	2020-07-06 13:24:26 +00:00
Cameron Ayer	e3088d9ad5	satellite/satellitedb: add new DB table audit_histories Change-Id: I5f854514994cab9a68cf978f2dabfb588df695f5	2020-07-01 21:14:35 +00:00
Qweder93	b639ec08d4	satellite/heldamount: payments added, endpoind for payments added Change-Id: Ia2b9580bc353ef614680230c6f82c5bf6ded49c4	2020-07-01 18:15:01 +03:00
Cameron Ayer	cadb435d25	{satellite/audit, private/testplanet}: remove ErrAlreadyExists, run 2 audit workers in testplanet Since we increased the number of concurrent audit workers to two, there are going to be instances of a single node being audited simultaneously for different segments. If the node times out for both, we will try to write them both to the pending audits table, and the second will return an error since the path is not the same as what already exists. Since with concurrent workers this is expected, we will log the occurrence rather than return an error. Since the release default audit concurrency is 2, update testplanet default to run with concurrent workers as well. Change-Id: I4e657693fa3e825713a219af3835ae287bb062cb	2020-06-30 18:00:07 +00:00
Egon Elbre	d91cf5f4de	satellite/satellitedb: add missing SeparateTx Change-Id: I3ba5a4e0632a1e0e5e77c30e515953eadf05bc45	2020-06-26 12:27:05 +03:00
Egon Elbre	13a5854535	satellite/satellitedb: clarify test migration merging Use a field to distinguish migration steps that need to use a different transaction from previous steps. This is clearer than using a func. Change-Id: I2147369d05413f3e8ddb50c71a46ab1ba3ab5114	2020-06-25 14:32:45 +00:00
Cameron Ayer	3b4b5f45c7	satellite: replace references to Suspended with UnknownAuditSuspended Change-Id: I3d2d00c95954c0546ad077702617895f262926ef	2020-06-23 14:19:22 +00:00
Isaac Hess	2d727bb14e	satellite: Check macaroon revocation When a request comes in on the satellite api and we validate the macaroon, we now also check if any of the macaroon's tails have been revoked. Change-Id: I80ce4312602baf431cfa1b1285f79bed88bb4497	2020-06-22 13:50:07 -06:00
Egon Elbre	f68e7b3fde	satellite/overlay: replace pb.InfoResponse pb.InfoResponse wasn't used for protocol buffer communication, but instead as a satellite type. Change-Id: I755619f2deec5b76c4fe488591b7d8c1b9fcdafb	2020-06-16 15:16:55 +03:00
Cameron Ayer	0885ba5646	satellite/satellitedb: add new columns for offline suspension add new columns `offline_suspended` and `under_review` to nodes table. `unknown_audit_suspended` is a new column which will replace `suspended` Change-Id: I22ddeb338ea0ff63f14332a7ebd0f3e9e4c06cdc	2020-06-15 04:00:20 +00:00
paul cannon	7b8e91ff28	satellite/satellitedb: no orders for exited nodes We should not be sending any type of orders to nodes that have completed graceful exit with the current satellite. In particular, we should not be trying to audit them, because that would be silly. Change-Id: Ie2153e5739914ab696feefcdef28545ed70f84e4	2020-06-13 13:49:33 +00:00
Egon Elbre	1ed5a1bac5	satellite/satellitedb/satellitedbtest: skip omitted database The first implementation missed some changes. Change-Id: I7ae696175e0a9ea46954970ba8547638a05ed5a9	2020-06-11 13:28:16 +00:00
Cameron Ayer	bad299b541	satellite/satellitedb: serialize UpdateStats and BatchUpdateStats transactions Since we increased the number of audit workers from 1 to 2, we need to make sure concurrent updates do not trample each other. We can do this by serializing the transactions. Change-Id: If1b2f71cabe3c779c12ffa33c0c3271778ac3ae0	2020-06-10 17:11:28 +00:00
Egon Elbre	36c461bd59	private/tagsql: track proper closing of rows and statements This ensures that rows are closed to avoid leaks. Also verifies that Err() is called, to ensure that no error is left behind. Change-Id: Idd1bec9bf479f40021da67b2c80ce83033149469	2020-06-05 18:25:43 +00:00
Egon Elbre	34db4a80fd	ci: fix staticcheck failures Change-Id: I176fb24214755a1940a0a1a4e9cc8e39f184870b	2020-06-05 13:15:34 +00:00
Michal Niewrzal	2b2efcc662	satellite/payments/stripecoinpayments: move Coupons expiration date sorting directly to listing method Change-Id: I58d8a6ea1feba9ff2d19f21a1dbc87bfb8b49801	2020-06-04 09:47:42 +00:00
Jeff Wendling	254b42ff65	satellite/satellitedb: fix leaked rows from repairQueue.Insert Change-Id: If5e62c49770f591ebe3f4d2dd4dd2658c229a022	2020-06-03 14:31:21 -06:00
Michal Niewrzal	b20ced9519	satellite/satellitedb: drop project_id column from coupons table This is last part of https://storjlabs.atlassian.net/browse/USR-818 Change-Id: I053d11b37df962c12e46645bae2fc2dad49c9755	2020-06-03 14:56:41 +00:00
Cameron Ayer	6a60e1e96b	satellite/satellitedb: inclusive interval_start in GetAllocatedBandwidthTotal The DB query in GetAllocatedBandwidthTotal uses an exclusive range: 'WHERE interval_start > ?' The value that is used for this condition is the first day of current the month, 00:00:00 UTC. By using the exclusive '>', we exclude the entire first hour of the month from the result set. Change-Id: I3ed300f5230c7514dc9495a85e8166213cd0842e	2020-06-02 13:06:45 -04:00
Jeff Wendling	2b3545c0c3	satellite/satellitedb: use delete returning to query pending_serial_queue this way we don't have to do 2 steps, and by using the ctid, postgres is going to do two very efficient prefix scans. Change-Id: Ia9d0546cdf0a1af67ceec9cd508d336a5fdcbdb9	2020-06-01 15:43:33 -06:00
Jeff Wendling	44433f38be	satellite/satellitedb: remove ORDER BY when reading from queue also remove the continuation support from the queue, otherwise we may end up sequential scanning the entire table to get a few rows at the end. then, in the core, instead of looping both to get a big enough batch inside of the queue, as well as outside of it to ensure we consume the whole queue, just get a single batch at a time. also, make the queue size configurable because we'll need to do some tuning in production. Change-Id: If1a997c6012898056ace89366a847c4cb141a025	2020-06-01 18:31:14 +00:00
Yingrong Zhao	163c027a6d	satellite/satellitedb: remove monkit trace from convertDBNode In jaeger, it shows that this function gets called repetitively in a single request. Most of the time, it's less than 1ms. Therefore, it doesn't add much value in our trace but create noises. Change-Id: I20234f36bbcf0fc22f91e5e1a5634c0cad577ed0	2020-06-01 17:58:43 +00:00
Michal Niewrzal	a9f6489663	satellite/payments/stripecoinpayments: remove ProjectID from Coupon struct This change is removing ProjectID from code. Next change will be about dropping this colum from DB table. Change-Id: Idb949e2829e2c304a2b6b011259c7cc7667082e1	2020-06-01 11:37:20 +00:00
Egon Elbre	07050eea26	all: use common/storj Change-Id: Id1e36d52f9807b5ffbb72ce73f4b60cb21b68a78	2020-05-29 11:57:32 +03:00
Jeff Wendling	1e065fb450	satellite: migration to fix bad imported payment history the initial calculations for the historical values of comp_at_rest were wrong. because our historical data only included total amounts as well as compensation for bandwidth, the at rest value was calculated as at_rest = total - bandwidth unfortunately, that calculation did not take surge pricing into account correctly. the at rest and bandwidth values do not include surge pricing, but the total that was used did. so what we actually calculated was no_surge_at_rest = surge_total - no_surge_bandwidth which will create a value that is too large. this migration fixes the calculation for imports that are old enough and of a non-negligable difference. Change-Id: I61eb0b670510f6d7fb8fc3de39ba79150fac10eb	2020-05-28 12:59:08 -06:00
Michal Niewrzal	75b3db5426	satellite/payments/stripecoinpayments: test invoice user with more than 1 project https://storjlabs.atlassian.net/browse/USR-291 Change-Id: I98286e40254e8868de9eb675a9c9a8cd0bf5f3b1	2020-05-27 09:12:23 +00:00
Moby von Briesen	290c006a10	satellite/repair/{checker,queue}: add metric for new segments added to repair queue * add monkit stat new_remote_segments_needing_repair, which reports the number of new unhealthy segments in the repair queue since the previous checker iteration Change-Id: I2f10266006fdd6406ece50f4759b91382059dcc3	2020-05-27 06:23:47 +00:00
Jeff Wendling	074649835b	satellite/satellitedb: add some docs and improve some snapshots This attempts to add a README.md to help create consistent migrations that maximize our test coverage and do not include unnecessary statements. It also adds a feature to have an `-- OLD DATA --` section as well as a `-- NEW DATA --` section so that we can fix mistakes made in previous snapshots (like a row that was forgotten to be added when a table was created) without editing them going forward. Change-Id: I28a786f8ef163cae1de1bb08f61af1e1104b0a88	2020-05-22 21:27:36 +00:00
Jennifer Johnson	03e5f922c3	satellite/overlay: updates node with a vetted_at timestamp if they meet the vetting criteria What: As soon as a node passes the vetting criteria (total_audit_count and total_uptime_count are greater than the configured thresholds), we set vetted_at to the current timestamp. Why: We may want to use this timestamp in future development to select new vs vetted nodes. It also allows flexibility in node vetting experiments and allows for better metrics around vetting times. Please describe the tests: satellitedb_test: TestUpdateStats and TestBatchUpdateStats make sure vetted_at is set appropriately Please describe the performance impact: This change does add extra logic to BatchUpdateStats and UpdateStats and commits another variable to the db (vetted_at), but this should be negligible. Change-Id: I3de804549b5f1bc359da4935bc859758ceac261d	2020-05-20 16:30:26 -04:00
Egon Elbre	5d016425f1	satellite/{contact,downtime,overlay}: use NodeURL Change-Id: I555a479a89e0ddbf0499898bdbc8574282cd6846	2020-05-20 11:09:05 +00:00
Stefan Benten	0a26c4af9a	satellite/admin: add coupon deletion (#3893 )	2020-05-19 15:49:44 +03:00
Stefan Benten	671aca56b0	satellite/admin: add coupon creation and listing (#3891 )	2020-05-19 12:36:13 +02:00
Kaloyan Raev	49571f1a23	satellite/payments: all invoice commands require period To avoid including multiple months in a single invoice, we need all inspector's invoice commands to run in for specific period. See https://storjlabs.atlassian.net/browse/USR-725 Change-Id: I3637dc189234f02350daca8d897c21765762ea55	2020-05-14 11:50:19 +00:00
Jeff Wendling	6352d46100	satellite/satellitedb: do better ::date conversions There is a subtle problem when one does a cast with `::date`. Observe: teststorj=# set timezone = 'US/Eastern'; SET teststorj=# select (timestamp with time zone '2020-02-01 00:00:00+00')::date; date ------------ 2020-01-31 (1 row) teststorj=# set timezone = 'UTC'; SET teststorj=# select (timestamp with time zone '2020-02-01 00:00:00+00')::date; date ------------ 2020-02-01 (1 row) In order to correctly determine the date a timestamp is in, one has to explicitly pick the time zone that the date truncation should use otherwise postgres will use whatever setting the client has. These tests were failing for me locally, because I run my postgres in the US/Eastern time zone to try to tickle these bugs out. So it should be `(x at time zone 'UTC')::date` instead of just `x::date`. Change-Id: I4e9e32d4b53abc6165a4d0474f4702f8b9f801c7	2020-05-13 15:58:07 +00:00
Egon Elbre	0e3be60b79	satellite/satellitedb: simplify migrate step Change-Id: Ie4574144fb6ddd057d5fca740702c59fbdb2c5e4	2020-05-12 18:27:07 +03:00
Stefan Benten	e23bd806b4	satellite/accounting: separate usage and bandwidth limit (#3878 )	2020-05-12 15:01:15 +02:00
Michal Niewrzal	22fbe804e3	satellite/accounting: test if project bandwidth limits reset with billing cycle https://storjlabs.atlassian.net/browse/USR-287 Change-Id: I4dc5f6342417b6af3384da32d3d2ed8592904406	2020-05-11 15:11:53 +00:00
Moby von Briesen	8f60cfc4fb	satellite/overlay: Add flag for enabling/disabling disqualification from suspension mode Add a flag that allows us to easily switch disqualification from suspension mode on or off. A node will only be disqualified from suspension mode if it has been suspended for longer than the grace period AND the SuspensionDQEnabled flag is true. Change-Id: I9e67caa727183cd52ab2042b0a370a1bcaebe792	2020-05-04 17:25:09 +00:00
Ethan	acf53bea4d	satellite/orders;accounting: Add monthly project download bandwidth rollup See https://storjlabs.atlassian.net/browse/SM-776 Change-Id: Ifd5cccea43c556fd59822d17344f399cfe9a7164	2020-05-04 15:49:57 +00:00
Egon Elbre	8928399d02	all: rename CreateTables to MigrateToLatest CreateTables hasn't been quite true for a while now, rename to MigrateToLatest to be clearer in it's behavior. Change-Id: Ida48e95122a5d9b7a814e922d3698e00024a2ba7	2020-04-30 07:21:17 +00:00
Jessica Grebenschikov	6a6427526b	satellite/overlay: remove old updateaddress method The UpdateAddress method use to be used when storage node's checked in with the Satellite, but once the contact service was created this method was no longer used. This PR finally removes it. Change-Id: Ib3f83c8003269671d97d54f21ee69665fa663f24	2020-04-30 06:41:48 +00:00
Moby von Briesen	de366537a8	satellite/satellitedb/overlaycache: fix behavior around gracefully exited nodes Sometimes nodes who have gracefully exited will still be holding pieces according to the satellite. This has some unintended side effects currently, such as nodes getting disqualified after having successfully exited. * When the audit reporter attempts to update node stats, do not update stats (alpha, beta, suspension, disqualification) if the node has finished graceful exit (audit/reporter_test.go TestGracefullyExitedNotUpdated) * Treat gracefully exited nodes as "not reputable" so that the repairer and checker do not count them as healthy (overlay/statdb_test.go TestKnownUnreliableOrOffline, repair/repair_test.go TestRepairGracefullyExited) Change-Id: I1920d60dd35de5b2385a9b06989397628a2f1272	2020-04-28 23:58:43 +00:00
Egon Elbre	85c45cd56f	private/dbutil/pgtest: support multiple databases for testing Currently Cockroach isn't performant for concurrent database setup and tear-down. Instead of a single instance allow setting multiple potential connection strings and let the tests pick one connection string randomly. This improves test duration by ~10 minutes. While we are at significantly changing how pgtest works, introduce helper PickPostgres and PickCockroach for selecting the database to reduce code duplications in multiple places. Change-Id: I8ad171d5c4c8a4fc081ec2ae9bdd0cc948a80619	2020-04-28 21:55:49 +03:00
Natalie Villasana	6f84be133a	satellite/metainfo: add MigrateToLatest to PointerDB In cases like the segment reaper script connecting to the metainfodb, we don't want a db migration to happen automatically when we call metainfo.NewStore. This adds MigrateToLatest method for postgreskv and cockroackv, and calls MigrateToLatest in places where NewStore used to create tables. Change-Id: I682d0f26d609af0601dfdb32a24866cdf5d32a7e	2020-04-28 17:26:35 +00:00
Egon Elbre	ef913be234	satellite/satellitedb/satellitedbtest: don't use subtest naming A/B indicates that B is a subtest of A, however in this case they represent a configuration of the test, not a subtest. Change-Id: I64eed5d5bcb12759e54fe4b5373f8e88488e50f7	2020-04-27 19:32:09 +03:00
Ivan Fraixedes	03871d17c3	satellite/satellitedb: Update ticket ref Update a reference to a ticket in a code comment. Change-Id: Ib82220e94527482c5ca1a58d8614b919d1113ab5	2020-04-27 08:50:41 +00:00
Stefan Benten	d73630fd4a	satellite/satellitedb: Ensure we just return bucket usage for buckets that exist (#3863 )	2020-04-24 22:25:16 +02:00
Moby von Briesen	720e26d235	satellite/satellitedb/overlaycache: update unknown alpha/beta values properly Update unknown_audit_reputation_alpha and unknown_audit_reputation_beta. Add test to verify that BatchUpdateStats properly modifies unknown audit alpha/beta Change-Id: I0d5f9cac96a99f64905cf575b772402db0756a9d	2020-04-23 10:40:53 -04:00
Moby von Briesen	72b93f3120	satellite/satellitedb: disqualify suspended nodes when the grace period passes If a node is suspended and receives an unknown or failing audit, disqualify them if the grace period (default 1w in production) has passed. Migrate the nodes table so any node that is currently suspended gets unsuspended when the satellite starts up. Change-Id: I7b81c68026f823417faa0bf5e5cb5e67c7156b82	2020-04-22 15:45:00 -04:00
Ethan Adams	60e07f0a8b	Revert "satellite/accounting: Remove unnecessary index bucket_bandwidth_rollups_project_id_action_interval_index" This reverts commit `105dc7acc6`. Reason for revert: Recent changes to the Postgres query plan seems to want to use this index now. Reverting until we have time to analyze what's happening. Change-Id: I74b4b5a8f15c3850d8a958a29f51dbc80e7c282c	2020-04-22 14:49:04 +00:00
Qweder93	805e328c47	storagenode/heldamount payments removed Change-Id: I87cc04f43d182a4190a571ef417be85d02db9d34	2020-04-21 17:15:31 +00:00
Ethan	105dc7acc6	satellite/accounting: Remove unnecessary index bucket_bandwidth_rollups_project_id_action_interval_index See https://storjlabs.atlassian.net/browse/SM-738 Change-Id: I9ba3cc3fbff9f13fc0b95d25feee5a19e5a5c486	2020-04-21 16:43:09 +00:00
Qweder93	6e3585e394	satellite/heldamount/endpoint : GetAllPaystubs added Change-Id: Ic8cdd9db8b2a68796f9579c7fed2d49d9054bd64	2020-04-19 19:21:54 +03:00
Ethan	4cd86ff780	satellite/accounting: Add index on bucket_bandwidth_rollups for action, interval_start, and project_id See https://storjlabs.atlassian.net/browse/SM-551 for details Change-Id: I104c4e87d5aef500cc4a3893817763808f76c484	2020-04-17 19:14:45 +00:00
Jess G	5ea1602ca5	satellite/overlay: add selected node cache (#3846 ) * init implementation cache Change-Id: Ia54a1943e0707a77189bc5f4a9aaa8339c98d99a * one query to init cache Change-Id: I7c04b3ae104b553ae23fca372351a4328f632c66 * add monit tracking of cache Change-Id: I7d209e12c8f32d43708b23bf2126c5d5098e0a07 * add first test Change-Id: I0646a9349d457a9eb3920f7cd2d62fb72ffc3ab5 * add staleness to cache Change-Id: If002329bfdd53a4b200ad14dbd2ffc8b280aedb8 * add init test Change-Id: I3a3d0aa74cfac1d125fa93cb749316ed2a74d5b1 * fix comment Change-Id: I73353d00ccf0952b38c0f8ef7d1755c15cbfe9d9 * mv to nodeselection pkg Change-Id: I62487f768296c7a7b597fa398a4c42daf6e9c5b7 * add state to cache Change-Id: I081e77ec0e16706faee1a267de9a7fa643d6ac11 * add refresh concurrent test Change-Id: Idcba72508291099f280edc65355273c0acc3d3ce * add a few more tests Change-Id: I9422e9eaa22bf01c11f14bdb892ebcf7b3e5e5fb * fix tests, add min version to select allnodes Change-Id: I926f41d568951ad4ff70c6d4ceb87abb1e3e5009 * update comments Change-Id: I6ffe33e245ca65fb523c880cd72e63ce35776eb9 * fixes and rm Init Change-Id: Ifbe09b668978b5d9af09ca38cb080d02a2154cf4 * fix format Change-Id: I03cc217e28dc1839190c5c6dbdbb602c132a5a38	2020-04-14 13:50:02 -07:00
Moby von Briesen	d7794a4851	satellite/overlay: hardcode default values for audit alpha/beta Alpha=1 and beta=0 are the expected first values for any alpha/beta reputation system we are using in the codebase. So we are removing the configurability of these values. Change-Id: Ic61861b8ea5047fa1438ea6609b1d0048bf0abc3	2020-04-14 19:12:40 +00:00
Cameron Ayer	02613407ae	satellite/satellitedb: only suspend node if not already suspended Whenever the node's reputation is updated, if its unknown audit reputation is below the suspension threshold, its suspension field is set to the current time. This could overwrite the previous "suspendedAt" value resulting a node that never reaches the end of its suspension. Also log whenever a node is disqualified or its suspension status changes Change-Id: I5e8c8f1c46f66d79cb279b5b16a84fe03f533deb	2020-04-10 09:37:37 +00:00
Egon Elbre	d86cce202c	satellite/satellitedb: use arrays for arguments in node selection This simplifies the code and makes queries faster: name old time/op new time/op delta SelectStorageNodes-32 7.72ms ± 6% 7.22ms ± 3% -6.44% (p=0.016 n=5+5) SelectNewStorageNodes-32 7.75ms ± 2% 7.37ms ± 1% -4.89% (p=0.008 n=5+5) SelectStorageNodesExclusion-32 16.9ms ± 0% 16.6ms ± 0% -2.15% (p=0.008 n=5+5) SelectNewStorageNodesExclusion-32 17.2ms ± 0% 16.6ms ± 2% -3.69% (p=0.008 n=5+5) FindStorageNodes-32 45.5ms ± 0% 45.1ms ± 1% ~ (p=0.056 n=5+5) FindStorageNodesExclusion-32 77.4ms ± 0% 75.9ms ± 0% -1.91% (p=0.008 n=5+5) Change-Id: I38f77f6282b9738e8416113d42c6acb46c03da7b	2020-04-09 21:16:10 +03:00
Egon Elbre	ccf4f9ed2d	satellite/satellitedb: node selection code cleanup Reduce the number of non-methods to reduce funcs in the namespace also combine a func to slightly condense the code more. Change-Id: Ifbe728eb8c8ca4c981df648decd259c2097b6b40	2020-04-09 20:41:29 +03:00
Natalie Villasana	cf80b3caf3	satellite/overlay: combine SelectStorageNodes and SelectNewStorageNodes (#3831 )	2020-04-09 11:19:44 -04:00
Egon Elbre	11a44cdd88	all: don't depend on gogo/proto directly Change-Id: I8822dea0d1b7b99e0b828e0373a0308a42dde2be	2020-04-08 17:32:15 +00:00
Egon Elbre	cf26951a5b	satellite/satellitedb/pbold: remove dead code Change-Id: I7464773c20b8f99a601ca9cc4bee804f1ac14cf9	2020-04-08 15:22:31 +03:00
Jeff Wendling	2ded64ba2c	satellite/compensation: more fixes to get prod running smoothly Change-Id: I13a76d9d49222fb10796415a015f224d4084fde3	2020-04-07 10:10:27 +00:00
Jennifer Johnson	1547e791a3	satellitedb: remove free_bandwidth column from nodes table Change-Id: I9d1d3de9216c6533c1042ef473631721a011d086	2020-04-06 09:30:28 +00:00
Egon Elbre	9200efc61f	satellite/satellitedb: fix selecting a nullable string Change-Id: I59e645966e09da586512c69101691b47055c1e5a	2020-04-03 21:30:20 +03:00
Egon Elbre	6492b13d81	all: remove old uuid Change-Id: I3a137f73456f010c37d3933dbe12cbbb840b809f	2020-04-02 19:30:36 +03:00
Egon Elbre	8f73fb7a32	all: simplify uuid usage uuid.UUID implements driver.Value so it can be directly used as a scannable result. Replace uses of dbutil.BytesToUUID with uuid.FromBytes. Change-Id: I51a670185ceb3cc2199d5aa2b76bc3fc191ca8fe	2020-04-02 05:48:58 +00:00
Egon Elbre	a416b03941	satellite/accounting: fix TestProjectBandwidthTotal Test was inserting for past 4 days, however the test was summing up for the current month. Change-Id: I509afdc6a76b314a6bb90652ab70cd2c2bab1288	2020-04-01 11:50:18 +03:00
Egon Elbre	0a69da4ff1	all: switch to storj.io/common/uuid Change-Id: I178a0a8dac691e57bce317b91411292fb3c40c9f	2020-03-31 19:16:41 +03:00
Qweder93	dc32f1da55	storagenode/cache/heldamount added, errNoRows ignored Change-Id: If6b675e622d6c1324c0893c43cca93dc5323cd78	2020-03-31 11:35:58 +00:00
Jeff Wendling	e2ff2ce672	satellite: compensation package and commands Change-Id: I7fd6399837e45ff48e5f3d47a95192a01d58e125	2020-03-30 14:08:14 -06:00
Jennifer Johnson	d77f3b8786	satellitedb/migrate: set vetted_at backfill to now.day Change-Id: Ib2b12be43dbd3f3705b1891bc703ae15abb75e09	2020-03-30 16:50:23 +00:00
Egon Elbre	439aba922a	satellite/overlay: reduce overhead of GetNodes Instead of filtering on the client side it's better to filter on the database side. Change-Id: I845fbbe5ed28c2ffdb0b8a3f789b59c094fd1069	2020-03-30 18:36:23 +03:00
Egon Elbre	cb781d66c7	satellite/overlay: optimize FindStorageNodes Reduce the number of fields returned from the query. Benchmark results in `satellite/overlay`: benchstat before.txt after2.txt name old time/op new time/op delta SelectStorageNodes-32 7.85ms ± 1% 6.27ms ± 1% -20.18% (p=0.002 n=10+4) SelectNewStorageNodes-32 8.21ms ± 1% 6.61ms ± 0% -19.53% (p=0.002 n=10+4) SelectStorageNodesExclusion-32 17.2ms ± 1% 15.9ms ± 1% -7.55% (p=0.002 n=10+4) SelectNewStorageNodesExclusion-32 17.8ms ± 2% 16.1ms ± 0% -9.38% (p=0.002 n=10+4) FindStorageNodes-32 48.4ms ± 1% 45.1ms ± 0% -6.69% (p=0.002 n=10+4) FindStorageNodesExclusion-32 79.2ms ± 1% 76.1ms ± 1% -3.89% (p=0.002 n=10+4) Benchmark results from `satellite/overlay` after making them parallel: benchstat before-parallel.txt after2-parallel.txt name old time/op new time/op delta SelectStorageNodes-32 548µs ± 1% 353µs ± 1% -35.60% (p=0.029 n=4+4) SelectNewStorageNodes-32 562µs ± 0% 368µs ± 0% -34.51% (p=0.029 n=4+4) SelectStorageNodesExclusion-32 1.02ms ± 1% 0.84ms ± 0% -18.08% (p=0.029 n=4+4) SelectNewStorageNodesExclusion-32 1.03ms ± 1% 0.86ms ± 2% -16.22% (p=0.029 n=4+4) FindStorageNodes-32 3.11ms ± 0% 2.79ms ± 1% -10.27% (p=0.029 n=4+4) FindStorageNodesExclusion-32 4.75ms ± 0% 4.43ms ± 1% -6.56% (p=0.029 n=4+4) Change-Id: I1d85e2764eb270f4c2b1998303ccfc1179d65b26	2020-03-30 18:36:23 +03:00
Egon Elbre	e1a443b04a	private/testplanet: allow modifying created database Instead of providing the database from outside to testplanet create it inside and then allow wrapping and modifying it. This is more convenient to use. Change-Id: I9b8f69e6e0a19ff984b4e2bfe927c9100c77bc6c	2020-03-27 19:14:48 +00:00
Ethan	df462d7265	satellite/accounting: Add index on bucket_bandwidth_rollups to minimize full table scans https://storjlabs.atlassian.net/browse/SM-545 Change-Id: I5599a72a991d70236f17beca027e9bc032777177	2020-03-26 19:53:50 +00:00
Jeff Wendling	97e980cd8a	private/dbutil: add database name to configure as a tag storagenodes have like 10 or more databases. without this tag they all get sent as the same value, stomping on each other. Change-Id: Ib12019684d6ea8f2a5b83df584056dfa79e3c4b3	2020-03-26 16:50:15 +00:00
Jennifer Johnson	b75cbc8e24	satellite,storagenode: remove references to free bandwidth Change-Id: I42a6597544804fa9235e89ec656ebc365eb522e5	2020-03-25 22:28:34 +00:00
Michal Niewrzal	fdf40a7526	storj: remove `storj/private/version` package which was moved to `storj/private` repo Change-Id: I81c3f5b9d5e4fe7bca760999eb045ee9734e5e2e	2020-03-24 14:31:33 +00:00
Jessica Grebenschikov	aeab599d21	satellitedb: removed unused id on storagenode_storage_tallies table, add index on node_id The goal of this change is to improve the storagenode_storage_tallies table by removing the unneeded id column that is not being used but only taking up space, and also to add an index on a different column that needs it. Removing and adding a column seems simple, but ended up being more complicated because of some cockroachdb limitations. The cockroachdb limitation when trying to remove a column from a table and create a new primary key are: 1. only allows primary key creation at table creation time (docs: https://www.cockroachlabs.com/docs/stable/primary-key.html) 2. table drop or rename is performed async and cannot be done in a transaction (issue: https://github.com/cockroachdb/cockroach/issues/12123, https://github.com/cockroachdb/cockroach/issues/22868) To address these differences between cockroachdb and Postgres, this PR performs different migrations for the two database. The Postgres migration is straight forward and what you would expect, but the cockroach migration has two main changes: 1. To change a primary key, use the recommended process from the cockroachdb docs to create a new table with the new primary key you want and then migrate the data. 2. In order to do 1, we needed to do the new table renaming in a separate transaction from the data migration. Ref: SM-65 Change-Id: Idc9aee3ab57aa4d5570e3d2980afea853cd966bf	2020-03-20 14:39:44 -07:00
Jennifer Johnson	9b78473c0c	satellitedb: adds vetted_at nullable timestamp to nodes table Change-Id: I42d5a396b4eecbad26b683c6aee51e043d2ff034	2020-03-20 01:37:28 +00:00
Qweder93	0df586c3a8	satellitedb/heldamount updated, tests added + storagenode console updated Change-Id: I10f568a426d0fc42069d025de2accbef5b26dc0c	2020-03-19 15:37:45 +02:00
Jeff Wendling	115f4559e5	satellite/orders: more efficient processing of orders by doing an indexed anti-join we're able to reduce the time to select the pending orders by over 10x on postgres. this should help us process pending orders much more quickly. it probably won't do as good a job on cockroach because it does not do an indexed anti-join and instead does a hash join after scanning the entire consumed serials table. we should either remove orders entirely or try to make that more efficient when necessary. Change-Id: I8ca0535acd21c51e74955b24c9b86d20e4f2ff9c	2020-03-18 09:03:30 +00:00
Moby von Briesen	2f991b6c56	satellite/{overlay, satellitedb}: account for `suspended` field in overlay cache Make sure that suspended nodes are treated appropriately by the overlay cache. This means we should expect the following behavior: * suspended nodes (vetted or not) should not be selected for uploading new segments * suspended nodes should be treated by the checker and repairer as "unhealthy", and should be removed upon successful repair This commit also removes unused overlay functionality. Fixes a bug with commit `8b72181a1f` where the audit reporter was automatically suspending nodes regardless of audit outcome (see test added). Tests: * updates repair tests to ensure that a suspended node is treated as unhealthy and will be removed from the pointer on successful repair * updates overlay tests for KnownUnreliableOrOffline and KnownReliable to expect suspended nodes to be considered "unreliable" * adds satellitedb test that ensures overlay.SelectStorageNodes and overlay.SelectNewStorageNodes do not include suspended nodes * adds audit reporter test to ensure that different audit outcomes result in the correct suspended/disqualified states Change-Id: I40dba67278c8e8d2ce0bcec5e0a5cb6e4ce2f561	2020-03-17 17:14:56 +00:00
Michal Niewrzal	81afbcc12e	satellite/metainfo: check bucket existence on upload and listing Initial change for checking bucket existence on satellite side for requests like BeginObject and ListObjects. This is simple implementation that is just checking bucket in DB but should be improved in future to avoid DB calls as much as possible. Part of https://storjlabs.atlassian.net/browse/USR-365 Change-Id: I9076acddc44d7dbfa7612a1c24a007de01621583	2020-03-17 15:43:22 +00:00
Jeff Wendling	7baa59753a	satellite/orders: add tests for double sending the same order Change-Id: If2fa7f035257df3b04f506f81aa8b2e0916f5033	2020-03-17 14:18:03 +00:00
Ethan	bdbf764b86	satellite/orders;overlay: Consolidate order limit storage node lookups into 1 query. https: //storjlabs.atlassian.net/browse/SM-449 Change-Id: Idc62cc2978fba67cf48f7c98b27b0f996f9c58ac	2020-03-16 23:15:47 +00:00
Moby von Briesen	8b72181a1f	satellite/{audit,overlay,satellitedb}: implement unknown audit reputation and suspension * change overlay.UpdateStats to allow a third audit outcome. Now it can handle successful, failed, and unknown audits. * when "unknown audit reputation" (unknownAuditAlpha/(unknownAuditAlpha+unknownAuditBeta)) falls below the DQ threshold, put node into suspension. * when unknown audit reputation goes above the DQ threshold, remove node from suspension. * record unknown audits from audit reporter. * add basic tests around unknown audits and suspension. Change-Id: I125f06f3af52e8a29ba48dc19361821a9ff1daa1	2020-03-16 20:29:26 +00:00
Stefan Benten	52590197c2	satellite/payments: More Cleanup and Satellite command to ensure we have stripe customers (#3805 )	2020-03-16 20:34:15 +01:00
Qweder93	9f84261c36	storagenode/cache heldamount added Change-Id: I7fc807789de63e8a9b8ca2018fd73bdb9e01ad0d	2020-03-16 00:28:35 +02:00
Qweder93	94c4d1e737	satellite/satellitedb/heldamount added, endpoint added Change-Id: Ife8402b89f631f65ebb5cdf5ca02e99aa9b0b3ff	2020-03-13 18:15:52 +00:00
Jeff Wendling	41887883f3	satellite/satellitedb: check indexes on migration Change-Id: I5ba7ae2b512d77c70405ce332158f12128e27eed	2020-03-13 10:45:22 +00:00
Jess G	39cb821196	satellite/overlay: rm combinedcache, fix IP naming to be network (#3798 ) * rn combinedcache, rm dns node lookup Change-Id: I239f07211764b097d851230d8c81900a47756e9e * excludeIPs -> excludedNetworks Change-Id: Ifa6f44ab17457cdd5aff4cd5694296867c18b179 * use lowercase var name Change-Id: I825aad2b718c71f455e747be18f8cabd02aabe55 * update Getnetwork name Change-Id: I002a1b7bc6b4ef40159c0cd2b0ef209f80a9c503 * fix comments Change-Id: Ibddf5b9ffa9d685af6c392d893db063ef18e45fa * update comments with ipv6 Change-Id: I31758b7d4979e7c27d014668f4fb532ad838cda2 Co-authored-by: Stefan Benten <mail@stefan-benten.de>	2020-03-12 11:37:57 -07:00
Jessica Grebenschikov	803e2930f4	satellite: use IP for all uplink operations, use hostname for audit and repairs My understanding is that the nodes table has the following fields: - `address` field which can be a hostname or an IP - `last_net` field that is the /24 subnet of the IP resolved from the address This PR does the following: 1) add back the `last_ip` field to the nodes table 2) for uplink operations remove the calls that the satellite makes to `lookupNodeAddress` (which makes the DNS calls to resolve the IP from the hostname) and instead use the data stored in the nodes table `last_ip` field. This means that the IP that the satellite sends to the uplink for the storage nodes could be approx 1 hr stale. In the short term this is fine, next we will be adding changes so that the storage node pushes any IP changes to the satellite in real time. 3) use the address field for repair and audit since we want them to still make DNS calls to confirm the IP is up to date 4) try to reduce confusion about hostname, ip, subnet, and address in the code base Change-Id: I96ce0d8bb78303f82483d0701bc79544b74057ac	2020-03-11 09:11:40 -07:00
Moby von Briesen	1baf1bd249	satellite/satellitedb: Add index on num_healthy_pieces column in injuredsegments table We missed this in the migration that added the num_healthy_pieces column. It exists in dbx, but not on the actual satellite table. Change-Id: If16b5ec2325d56406250298531b3285215188bf3	2020-03-10 16:59:35 +00:00
paul cannon	79553059cb	satellite/repair: put irreparable segments in irreparableDB Previously, we were simply discarding rows from the repair queue when they couldn't be repaired (either because the overlay said too many nodes were down, or because we failed to download enough pieces). Now, such segments will be put into the irreparableDB for further and (hopefully) more focused attention. This change also better differentiates some error cases from Repair() for monitoring purposes. Change-Id: I82a52a6da50c948ddd651048e2a39cb4b1e6df5c	2020-03-09 21:45:16 +00:00
Egon Elbre	0675413f7a	satellite/satellitedb: increase migrate test timeout Change-Id: I789ea22ad463a6c31737e959ec54941b66830188	2020-03-09 14:30:50 +02:00
Bill Thorp	e99e675fb1	satellite/satellitedb: use time zones with all timestamps The migration was broken into one migration per table to reduce table locking and reduce the chances of failure due to SQL timeouts. Of the 14 fields that lacked time zones, only the 3 named 'interval_start` seemed to have non-UTC data in them. These fields are fixed in the migration by removing the +00 and adding AT TIME ZONE current_setting('TIMEZONE') Field with good data are migrated by adding AT TIME ZONE 'UTC' Note that postgres's timezone() is different than cockroach's timezone() so AT TIME ZONE is used. https://storjlabs.atlassian.net/browse/SM-104 Change-Id: I410f2f1d7c11b143f17844347f37e6f4b1e70fce	2020-03-05 21:11:25 +00:00
Jennifer Johnson	1c1750e6be	removes bandwidth limiting On satellite, remove all references to free_bandwidth column in nodes table. On storage node, remove references to AllocatedBandwidth and MinimumBandwidth and mark as deprecated. Protobuf message, NodeCapacity, is left intact for backwards compatibility. Once this is released to all satellites, we can drop the column from the DB. Change-Id: I2ff6c6537fc9008a0c5588e951afea58ede85838	2020-03-04 14:04:00 +00:00
Egon Elbre	5f2ca0338b	satellite/satellitedb: fix err and close order Change-Id: Ied927275853c4cf4a8ccb500048d50545f6c6efe	2020-03-04 09:05:22 +00:00
Moby von Briesen	f495544c56	satellite/satellitedb/dbx: add fields to node table for placing nodes into suspended mode for too many unknown-error audits Change-Id: Iac9a619e5c08377de87ffdf4acdd0155027f5eb3	2020-03-03 03:30:59 +00:00
Jeff Wendling	1db087cfba	satellite/satellitedb: migration to create tables for compensation these tables are used in future commits with respect to the new storagenode payments code. if we create them now, it will make backfilling them with historical data easier. Change-Id: I3c08c9770ec5b2baa38b4f2fd18c2f07746a61c2	2020-02-27 17:34:50 +00:00
Moby von Briesen	4e5a7f13c7	satellite/repair/queue: Prioritize selection of items off repair queue by segment health Add a column to the repair queue table in the satellite db for healthy piece count. When an item is selected from the repair queue, the least durable segment that has not been attempted in the past hour should be selected first. This prevents our repairer from getting stuck doing work on segments that are close to the repair threshold while allowing segments that are more unhealthy to degrade further. The migration also clears the repair queue so that the migration runs quickly and we can properly account for segment health in future repair work. We do not select items off the repair queue that have been attempted in the past six hours. This was changed from on hour to allow us time to try a wider variety of segments when the repair queue is very large. Change-Id: Iaf183f1e5fd45cd792a52e3563a3e43a2b9f410b	2020-02-26 09:54:16 -05:00
Jeff Wendling	f671eb2beb	satellite/satellitedb: use queue for orders to get back fast billing This change adds two new tables to process orders as fast as we used to but in an asynchronous manner and with hopefully less storage usage. This should help scale on cockroach, but limits us to one worker. It lays the groundwork for the order processing pipeline to be queue rather than database driven. For more details, see the added fast billing changes blueprint. It also fixes the orders db so that all the timestamps that are passed to columns that do not contain a time zone are converted to UTC at the last possible opportunity, making it less likely to use the APIs incorrectly. We really should migrate to include timezones on all of our timestamp columns. Change-Id: Ibfda8e7a3d5972b7798fb61b31ff56419c64ea35	2020-02-24 17:07:07 +00:00
Qweder93	dca6fcbe28	satellite/payments/stripecoinpayments: credits added to invoice calculations Change-Id: I6d3f5244a46f8945d2703af39ced333940db34e9	2020-02-24 16:48:27 +00:00
Yaroslav Vorobiov	f185adcf7c	satellite/payments: fix projects list pagination Change-Id: I342e69a17be34a503c1e0cef18ee009f1921fcd4	2020-02-21 19:37:11 +02:00
Ivan Fraixedes	1a84a00cc9	satellite/orders: Fix doc comments Enhance the documentation of the UseSerialNumber method (interface and implementation) and add several missing dots in doc comments of the methods of the same interface and implementation. Change-Id: I792cd344f0d2542e060fa2ec288b71231cae69de	2020-02-18 13:03:23 +01:00
Michal Niewrzal	dbe8428f9f	satelite/metainfo: return NotFound on delete non existing bucket Change-Id: I7f466b5f824eab7b5146c2792f40cb2bcd7976a5	2020-02-18 09:05:30 +00:00
Egon Elbre	892b190db6	satellite/admin: add project limit modification and authorization token Change-Id: If9a7214a940b8544f8023c2cd82da21f19d3f521	2020-02-17 07:56:16 +00:00
Yaroslav Vorobiov	827da1ae2b	satellite/payments: fail when trying to consume consumed transactions Change-Id: Ibb2528079ec917b7611b87a02972fb771937a025	2020-02-13 19:52:55 +00:00
Jeff Wendling	2d2f5e1a7f	satellite/satellitedb/dbx: remove typo in dbx file and format it Change-Id: I756315d6228ac9edd35cad8b496d36ecf2b5d26f	2020-02-11 14:15:13 -07:00
Yaroslav Vorobiov	bd9cebda5b	satellite/payments: fix transaction list pagination Change-Id: I533f637e5cb12b47d7f7248f8bf7de93bd8be000	2020-02-11 16:22:53 +02:00
Egon Elbre	ccd8b7f107	satellite/satellitedb: add benchmark for satellitedb setup and close Change-Id: Ifb561f2eb81e439ea7cfa2ca2dad6b15aa50417e	2020-02-11 13:30:23 +00:00
Yaroslav Vorobiov	984ed26737	satellite/payments: fix invoice project records pagination Change-Id: I68de69de78256280a6bbf0b744963b9c8c813007	2020-02-11 14:31:55 +02:00
Qweder93	dc075eaa96	satellite/payments : deposit bonuses (credits) added Change-Id: Ib151bbb9b02d655fa619c53bfbc04ed6f3bb39e0	2020-02-11 11:11:42 +00:00
Cameron Ayer	75355547c2	satellite/satellitedb: don't include GET_AUDIT and GET_REPAIR with chargeable BW In the methods we use to retrieve a user's chargeable BW, we were summing GET, GET_AUDIT, and GET_REPAIR. We only want to charge for GET Change-Id: Icead7695494b22c7c835482cf8b1512a980d59f1	2020-02-07 12:02:44 +00:00
Jeff Wendling	7999d24f81	all: use monkit v3 this commit updates our monkit dependency to the v3 version where it outputs in an influx style. this makes discovery much easier as many tools are built to look at it this way. graphite and rothko will suffer some due to no longer being a tree based on dots. hopefully time will exist to update rothko to index based on the new metric format. it adds an influx output for the statreceiver so that we can write to influxdb v1 or v2 directly. Change-Id: Iae9f9494a6d29cfbd1f932a5e71a891b490415ff	2020-02-05 23:53:17 +00:00
Jeff Wendling	d20db90cff	private/dbutil/txutil: create new transactions for retries it was noticed that if you had a long lived transaction A that was blocking some other transaction B and A was being aborted due to retriable errors, then transaction B was never given priority. this was due to using savepoints to do lightweight retries. this behavior was problematic becaue we had some queries blocked for over 16 hours, so this commit addresses the issue with two prongs: 1. bound the amount of time we will retry a transaction 2. create new transactions when a retry is needed the first ensures that we never wait for 16 hours, and the value chosen is 10 minutes. that should be long enough for an ample amount of retries for small queries, and huge queries probably shouldn't be retried, even if possible: it's more preferrable to find a way to make them smaller. the second ensures that even in the case of retries, queries that are blocked on the aborted transaction gain priority to run. between those two changes, the maximum stall time due to retries should be bounded to around 10 minutes. Change-Id: Icf898501ef505a89738820a3fae2580988f9f5f4	2020-02-01 18:34:28 +00:00
Egon Elbre	97d360afd1	satellite/satellitedb: use correct type Array was using a smaller type integer. Change-Id: I025d61b6cea9869efa0b4ac1d24265356491f6dc	2020-01-31 13:00:14 -05:00
crawter	e549e32976	satellite/payments: fix promotional coupons Change-Id: Ib8b7e38f2cb07085655448264f281fd7fc7867dd	2020-01-29 16:40:43 +02:00
Jennifer Johnson	2209924d41	satellite/satellitedb: use arrays and batch inserts for SaveTallies query Cockroachdb is more performant with multi-row inserts Change-Id: Ie1ce2a9da0be1df4e66e72fc9cae49cbd95023f3	2020-01-27 16:54:20 -05:00
Egon Elbre	227e03dea1	satellite/satellitedb: insert using arrays Using dynamic query strings is error prone, prefer arrays. Change-Id: I303fbf21c6a54795bd9f399371943b5c51e6f863	2020-01-27 21:27:28 +00:00
Jeff Wendling	d09bd4a749	satellite/satellitedb/dbx: regenerate with paged composite key fixes before dbx would generate a compilcated blob of conditions that encoded a row comparison, which only optimized to an index seek on cockroachdb. this means that sqlite and postgres both had quadratic behavior on paged queries of this form. instead, use the implicit row construction feature supported in all of the databases to do paged support so that they all optimize well. Change-Id: Iac8703929ba2a59ee3ffa619b916d12663422887	2020-01-27 12:43:16 -07:00
paul cannon	a0a94a9ac7	satellite/satellitedb: insert into reported_serials w/ arrays Change-Id: Icb682de09ded3e3159e3590594dcf13f2e7f40f0	2020-01-24 18:36:21 -06:00
littleskunk	90cf78e6f2	satellite/coinpayments: fix migration The old migration was not working. It was updateding pending (status 0) and failed (status -1) to completed (status 100). Change-Id: I808ff3cc692fe6c698ce26a8b411b134e67b752b	2020-01-25 00:12:35 +00:00
Cameron Ayer	494fead7af	satellitedb/orders: fix comma bug in SQL stmt Change-Id: Ibc6024eeeb5aa4de3909c0cec2d01ac0a01c809f	2020-01-24 13:58:32 -05:00
Jeff Wendling	665ed3b6b1	satellite/satellitedb: fix issue with shared memory on range for bucket rollups A uuid.UUID is an array of bytes, and slicing it refers to the underlying value, much like taking the address. Because range in Go reuses the same value for every loop iteration, this means that later iterations would overwrite earlier stored project ids. We fix that by making a copy of the value before slicing it for every loop iteration. Change-Id: Iae3f11138d11a176ce360bd5af2244307c74fdad	2020-01-23 21:57:02 -07:00
ccase	a9e4c6f66d	satellite/satellitedb/dbx: Remove bashism from gen.sh Change-Id: Ia698edae99d7ff0c73fa457b4a3c0a7b5f0bbec5	2020-01-23 17:09:07 -05:00
Egon Elbre	76fdb5d863	storage: add configurable lookup limits Currently storage tests were tied to the default lookup limit. By increasing the limits, the tests will take longer and sometimes cause a large number of goroutines to be started. This change adds configurable lookup limit to all storage backends. Also remove boltdb.NewShared, since it's not used any more. Change-Id: I1a052f149da471246fac5745da133c3cfc27582e	2020-01-22 21:35:56 +02:00
Egon Elbre	fc2766eefc	private/testplanet: flatten migration for running tests Currently Cockroach DB setup takes a significant amount of time. This flattens the database setup into a single query, which improves the test time significantly. The migration tests still test each migration separately. Change-Id: Iaca16f34a6af3926fa2b5ebf618f939fd59460b3	2020-01-22 15:09:11 +00:00
Jeff Wendling	75314a4364	satellite/satellitedb: fix roundToNextDay to handle timezones appropriately Since incoming times may be in any time zone, and we want the output to be in UTC and for them to have 00:00:00 hours, minutes and seconds we first convert the incoming timestamp to UTC before doing the truncate to the day and adding a day. Because the old code always returned a timestamp that was in the future, this is just for efficiency. Change-Id: Ie692d47bca8691e73852c822d5c56cf8773d99b4	2020-01-21 21:02:16 +00:00
Ethan	21a5d70a83	satellite/metainfo: Rate limiting - API requests Limits how many times metainfo APIs can be called per second by project ID. If limit is exceeded, the API will return Unauthorized/Too Many requests. Limit per second and the size of the limiter cache per project are configurable, as well as whether the limiter is enabled. Tests added/updated for the new rate_limit field in projects table. Tests added for exceeding limits and disableing limiter. Change-Id: Ic8ad102de3b690a475809d4f684156d5715f20fa	2020-01-21 14:25:04 +00:00
Egon Elbre	c1c878efcf	all: fix import groupings check-imports was broken and didn't complain about things. Change-Id: I38adafd16b4aba86f0eb4f53427b4393f9a6c710	2020-01-20 17:47:44 +00:00
Egon Elbre	f3b4bf2b7c	satellite/satellitedb/satellitedbtest: pass ctx as an argument ctx is created in most tests, instead pass in as argument to reduce code duplication. Change-Id: I466c51c008392001129c8b007c9d6b3619935ac4	2020-01-20 16:35:42 +02:00
Egon Elbre	1279eeae39	private/tagsql,storage: fixes to context cancellation Replace all the remaining uses of sql.DB with tagsql.DB to fix issues with context cancellation. Introduce tagsql.Open which helps to get rid of all tagsql.Wrap-s. Use tagsql in cockroachkv and postgreskv. Change-Id: I8946d203341cb85a25976896fc7881e1f704e779	2020-01-20 15:44:39 +02:00
Egon Elbre	ba2fce814c	satellite/satellitedb: better coupons query Change-Id: Iaf180b99c57443550418b46dfd8300f921e93bec	2020-01-20 15:05:10 +02:00
crawter	c4cbc6ff2f	satellite/payments: promotional coupons generation functional added Change-Id: Ie0df256503114ca377d81bf7c8b26cc90a1f5b26	2020-01-20 11:01:55 +00:00
Egon Elbre	cf7b22c466	satellite/satellitedb: add missing err check Change-Id: I502838f78f1871315597b488602c0f1112612981	2020-01-19 19:24:12 +00:00
Egon Elbre	c207cd08fc	satellite/satellitedb: gracefulexit, add missing Errs check Change-Id: Iba4ba84fd57b3a0a0d15f13006566076045d6c11	2020-01-19 15:24:12 +00:00
Egon Elbre	1abfe42142	satellite: use tagsql Change-Id: I2170dee409fb0c2fe85913ddd36e7811a3b853ed	2020-01-19 14:39:16 +02:00
Egon Elbre	59d06644b9	private/migrate: switch to tagsql Also added temporary types withRebind and withTagTx, which will be later removed. Currently they help to avoid changing the whole codebase at the same time. Change-Id: I7f07ba8f4709a23a463bfa67464628665a05808f	2020-01-19 14:39:16 +02:00
Yaroslav	d8368d0b30	satellite/payments: coinpayments add completed status, treat received status as pending, add balance for completed transactions only Change-Id: I20494bdddfda6d4f37ba2c5b6f7955cd29a6d798	2020-01-17 17:26:34 +00:00
Jessica Grebenschikov	955abd9293	satellite/satellitedb/orders: add multi row upserts to process orders Change-Id: I00d8b55ee74b443fb328bd3a4378308cefa368e4	2020-01-16 23:51:46 +00:00
Isaac Hess	cd48dc369a	satellite/satellitedb: Remove unused indexes Change-Id: I875b94574eacf9d2df537bcf1f42f30e0bf60ab9	2020-01-16 16:06:21 -07:00
Jeff Wendling	47bb7a7a86	satellite/satellitedb/dbx: regenerate with default support Change-Id: I0dab34f27af913795ef95ef92173844c2f53b29b	2020-01-16 22:13:57 +00:00
Jeff Wendling	696d98a232	satellite/satellitedb: fix nitpicks and timestamp issue found in review warning: databases migrated to version 77 before this commit is merged must be manually re-migrated. this should not be a problem for anything but staging databases. Change-Id: Ie1631c48379472352014183ee43f1465e22200f7	2020-01-16 21:22:38 +00:00
Egon Elbre	7d79aab14e	satellite/satellitedb: fixes to row handling Change-Id: I48fae692bcca152143a12f333296c42471538850	2020-01-16 17:07:26 +02:00
Jeff Wendling	78c6d5bb32	satellite/satellitedb: reported_serials table for processing orders this commit introduces the reported_serials table. its purpose is to allow for blind writes into it as nodes report in so that we have minimal contention. in order to continue to accurately account for used bandwidth, though, we cannot immediately add the settled amount. if we did, we would have to give up on blind writes. the table's primary key is structured precisely so that we can quickly find expired orders and so that we maximally benefit from rocksdb path prefix compression. we do this by rounding the expires at time forward to the next day, effectively giving us storagenode petnames for free. and since there's no secondary index or foreign key constraints, this design should use significantly less space than the current used_serials table while also reducing contention. after inserting the orders into the table, we have a chore that periodically consumes all of the expired orders in it and inserts them into the existing rollups tables. this is as if we changed the nodes to report as the order expired rather than as soon as possible, so the belief in correctness of the refactor is higher. since we are able to process large batches of orders (typically a day's worth), we can use the code to maximally batch inserts into the rollup tables to make inserts as friendly as possible to cockroach. Change-Id: I25d609ca2679b8331979184f16c6d46d4f74c1a6	2020-01-15 19:21:21 -07:00
Jeff Wendling	9da16b1d9e	satellite/satellitedb/dbx: name the package dbx everyone was importing it as dbx anyway. why should it be named satellitedb? so yeah just pass the "-p dbx" flag. Change-Id: I5efa669f4f00f196b38a9acd0d402009475a936f	2020-01-15 15:16:39 -07:00
Egon Elbre	64fb2d3d2f	Revert "dbutil: statically require all databases accesses to use contexts" This reverts commit `8e242cd012`. Revert because lib/pq has known issues with context cancellation. These issues need to be resolved before these changes can be merged. Change-Id: I160af51dbc2d67c5449aafa406a403e5367bb555	2020-01-15 07:28:00 +00:00
JT Olio	8e242cd012	dbutil: statically require all databases accesses to use contexts this will allow for some nice runtime analysis down the road. also, this allows for wrapping database handles in a way that can interact with these contexts requires https://review.dev.storj.io/c/storj/dbx/+/514 Change-Id: Ib087b7cd73296dd2c1e0331314da34d861f61d2b	2020-01-14 18:20:47 -05:00
crawter	a57ce18f58	satellite/payments: coupons, coupons usage, invoice generation with pricing model applied Change-Id: Ic5d5a2fc116388647efe46896cfccc2038c77537	2020-01-14 12:45:00 +00:00
Jeff Wendling	3b99f03780	satellite/orders: add monitoring to bucket bandwidth cache operations Change-Id: Ib14303fc9f97a133410e2d6e2cf532e468b3dcee	2020-01-13 17:36:40 -07:00
Isaac Hess	4950d7106a	satellite/orders: Add write cache for bw rollups Change-Id: I8ba454cb2ab4742cafd6ed09120e4240874831fc	2020-01-13 22:40:51 +00:00
Egon Elbre	ff267168c5	private/migrate: add ctx argument Change-Id: I3d65912d89261386413c494c7ed1576fed4dcaf4	2020-01-13 15:52:26 +02:00
Egon Elbre	24958bd7d3	satellite: add ctx to DB.CreateTables Change-Id: I9ecad624cf5a7fc9c86bb91c68f96a3a4efd2e92	2020-01-13 15:31:09 +02:00
Egon Elbre	0835b9024c	private/dbutil/pgutil: add ctx argument Change-Id: Icfd56ca8c1f831ad56c0195a0b883e8f0618daaf	2020-01-13 15:27:06 +02:00
littleskunk	bcc23f6869	Satellite/orders: remove allocated bandwith from storagenode_bandwidth_rollups When an uplink requests an upload or download from the satellite we are trackig the allocated bandwidth twice. The value in bucket_bandwidth_rollups is used for project limits but the value in storagenode_bandwidth_rollups is not used at all. We can increase the performance by removing it. Uplinks will get a faster response from the satellite. Change-Id: Icccd41f94107ef34668f30f99bf5f728c384b07e	2020-01-12 16:20:47 +01:00
Jeff Wendling	4aef0e3823	satellite/satellitedb: only reject orders if row not found any database error doesn't mean the order wasn't found. for example in cockroach it may say that the transaction is aborted. then what? maybe we get big old row level deadlocks like we've observed? so instead explicitly check for ErrNoRows to reject the order and bail out otherwise. the surrounding logic will give it a retry. Change-Id: I6e1f8f6e6a6def3e45b44f5088cbdc158e1098e4	2020-01-10 19:05:54 -07:00
Jeff Wendling	77fd41a02e	satellite: add an expiring lru cache around api keys Change-Id: I995429c66affd33da59b091f28f09ca122070b5e	2020-01-09 22:13:41 -07:00
Egon Elbre	d3d75a597f	satellite,storage: clean global ctx usage in tests Change-Id: I89ea5c95fc6895518b464f8eb6a4c74c6ae37651	2020-01-09 10:37:21 +00:00
Yingrong Zhao	76ee8a1b4c	satellite: remove UptimeReputation configs from codebase With the new storage node downtime tracking feature, we need remove current uptime reputation configs: UptimeReputationAlpha, UptimeReputationBeta, and UptimeReputationDQ. This is the first step of removing the uptime reputation columns from satellitedb Change-Id: Ie8fab13295dbf545e33aeda0c4306cda4ba54e36	2020-01-08 18:54:15 +00:00
Egon Elbre	082ec81714	uplink: move to storj.io/uplink (#3746 )	2020-01-08 15:40:19 +02:00
Jeff Wendling	c740b82e66	satellitedb/dbx: remove sed usage for bash script turns out portable sed is hard: it has to work with both linux and bsd sed, etc. instead, use a really really basic bash script and a temporary file. this should be much less likely to cause issues on a wide range of machines. Change-Id: Ia759789fb52aa1ee3361426bb6c02ed4eac3d23a	2020-01-08 01:40:24 +00:00
paul cannon	6b21334c47	satellite/satellitedb: use txutil.ExecuteInTx in dbx WithTx() Change-Id: I42ec21fdf117c661b3e1687a04014650c3a6ab97	2020-01-07 17:00:08 -06:00
paul cannon	0c5e381434	satellite/console: use transaction helpers in consoledb Transactions in our code that might need to work against CockroachDB need to be retried in the event of a retryable error. The transaction helper functions in dbutil do that automatically. I am changing this code to use those helpers instead. I also fleshed out consoledb_test.go to do actual inserts and gets to make sure things were working correctly. Change-Id: I089bf4c776d15dc8578080e26760bd6dff4beec9	2020-01-07 17:59:10 +00:00
paul cannon	22b6e9220a	satellite/satellitedb: use transaction helpers in irreparabledb Transactions in our code that might need to work against CockroachDB need to be retried in the event of a retryable error. The transaction helper functions in dbutil do that automatically. I am changing this code to use those helpers instead. Change-Id: I22b850ce5859fa07d13bf475be5140e6bde95b8a	2020-01-07 17:40:09 +00:00
paul cannon	723ed23298	satellite/satellitedb: use transaction helpers in orders Transactions in our code that might need to work against CockroachDB need to be retried in the event of a retryable error. The WithTx helper functions in dbutil and dbx do that automatically. I am changing this code to use those helpers instead. Change-Id: Iaf492af35471931125f2b7365aa4338f44154881	2020-01-07 16:31:47 +00:00
Natalie Ventura Villasana	1cb0f80a8d	satellite/gracefulexit: dq node on exit fail Disqualifies a node when the node fails to complete a graceful exit. Adds a new DisqualifyNode method to the overlay cache, since there wasn't an existing method to disqualify a node but do nothing else to its stats. Adds checks to existing tests to make sure that a storage node that fails a graceful exit is marked as disqualified in the overlay cache. https: //storjlabs.atlassian.net/browse/V3-3342 Change-Id: I4d554a519ab59db31ad3b8e28764c8683a6e3888	2020-01-06 19:16:26 -05:00
paul cannon	4a26fb5bd5	satellite/satellitedb: don't use crdb.ExecuteTx with postgres crdb.ExecuteTx is great, but I don't think it will work right with PostgreSQL. It works by way of cockroach savepoints, which allows it to react to retryable errors, whereas tx.Commit() doesn't. But I don't think PostgreSQL savepoints work exactly the same way. I'm not 100% sure, but it doesn't seem worth the risk. So, I'm switching one case here to use the new dbutil.WithTx instead, which will use crdb.ExecuteTx if appropriate. The other case doesn't need a transaction at all. Change-Id: I39283f3b5d8d47596db7aff5048bb74597e5918f	2020-01-06 23:51:35 +00:00
paul cannon	f3aee1b758	satellite/satellitedb: use transaction helpers in containment Transactions in our code that might need to work against CockroachDB need to be retried in the event of a retryable error. The transaction helper functions in dbutil do that automatically. I am changing this code to use those helpers instead. Change-Id: I660540885a0784fae844cf99376d1537e208fa69	2020-01-06 23:07:38 +00:00
Moby von Briesen	6c2e4cc0cd	satellite/overlay: Return NodeLastContact instead of a node dossier from overlay.GetOfflineNodesLimited We only care about node ID, address, and last contact success/failure from the downtime service, so the overlay should only return these values for the downtime-specific queries. Change-Id: I08a6ecfdd2a12b82cae62e87d6adeab53975bfce	2020-01-06 17:12:30 -05:00
paul cannon	4203e25c54	satellite/satellitedb: use transaction helpers in overlaycache Transactions in our code that might need to work against CockroachDB need to be retried in the event of a retryable error. The transaction helper functions in dbutil do that automatically. I am changing this code to use those helpers instead. Change-Id: Icd3da71448a84c582c6afdc6b52d1f345fe9469f	2020-01-06 21:42:57 +00:00
paul cannon	b072e16ff7	satellite/satellitedb: use transaction helpers in peeridentities Transactions in our code that might need to work against CockroachDB need to be retried in the event of a retryable error. The transaction helper functions in dbutil do that automatically. I am changing this code to use those helpers instead. Change-Id: Ibaadd2c8540ba5c8cccd6ecbf529017ab98b78ca	2020-01-06 20:42:15 +00:00
paul cannon	eb81879d47	satellite/satellitedb: use transaction helpers in usercredits Transactions in our code that might need to work against CockroachDB need to be retried in the event of a retryable error. The transaction helper functions in dbutil do that automatically. I am changing this code to use those helpers instead. Change-Id: Id24906f5f3ae83245dabb218e1f70e0bcb3b417a	2020-01-06 20:40:45 +00:00
paul cannon	a33734bee7	satellite/satellitedb/dbx: add cockroach driver type Change-Id: I7a0da6e066c67a521fc1b23b085ab8554eee0d4c	2020-01-06 18:01:03 +00:00
Ethan	05b406e992	satellite:{downtime,overlay}: Implement offline node detection chore https://storjlabs.atlassian.net/browse/V3-3398 Change-Id: I598c3bad819026377d1d113c099dc9bba8b02742	2020-01-03 17:10:03 +00:00
Ethan	8859c36234	satellite/{downtime,contact}: Add CheckNodeAvailability for use within the downtime tracking chores. https://storjlabs.atlassian.net/browse/V3-2545 Change-Id: I1dd54a0c77cb4905bb1f350beeb82c6f7700ee70	2020-01-02 18:24:11 +00:00
Moby von Briesen	ff74b44c5f	satellite/overlay: Add ability for overlay to get offline nodes ordered by last checked time This is required for the downtime tracking service: https://storjlabs.atlassian.net/browse/V3-2545 Change-Id: I286cdc07d802393948eb10c25c45ba78cc3ceafc	2020-01-02 16:39:38 +00:00
Egon Elbre	3528c56c6f	satellite/satellitedb/satellitedbtest: skip unconfigured db Change-Id: Ib6ea58208ef19410146845e82eb08724888e85ae	2020-01-02 10:51:59 +00:00
Moby von Briesen	bb3baf5a4e	satellite/satellitedb: Add nodes_offline_times table for downtime tracking Change-Id: If6b80fe0a20d88cedacaf4b76b75aa21d0af2465	2019-12-30 15:45:02 -05:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Egon Elbre	acb4435a67	satellite/satellitedb: improve Cockroach migrate test Load schemas in parallel instead of one-by-one. Optimizes from 2m30s to 1m15s. Change-Id: I0bf6381a0ae99b44271fe55d4ee658683064c097	2019-12-21 10:58:43 +00:00
Yingrong Zhao	6e71591b9b	satellitedb;storagenodedb: remove unnecessary use of DB transactions in graceful exit Change-Id: Ief0a28c6750c130896b48bfebfbea7fb3caa810f	2019-12-20 21:24:38 +00:00
Jeff Wendling	ffbc43d170	satellite/satellitedb/dbx: monitor the database calls Change-Id: I9fbdc4a35fedfc7a1b4e1b630d2a3664b3218e67	2019-12-18 21:57:36 +00:00
Kaloyan Raev	5ee1a00857	satellite/overlay: filter reliable nodes from a list Adds the KnownReliable method to Overlay Service that filters all nodes from the given list to be only reliable nodes (online and qualified). The method return []*pb.Node of reliable nodes. The pb.Node values are ready for dialing. The first use case is when deleting an object to efficiently dial all reliable nodes holding a piece of that object and send them a delete request. Change-Id: I13e0a8666f3807c5c31ef1a1087476018a5d3acb	2019-12-17 21:20:08 +00:00
Cameron Ayer	a4f9865b47	satellite: adds and enables cockroachdb compatibility for tests Change-Id: I85a3ad8c3b9d7e15ea8675b6c55af0002933db57	2019-12-16 22:29:25 +00:00
paul cannon	2f7465c294	private/dbutil: register "cockroach" as sql.DB driver this will allow us to inspect the type of `db.Driver()` on *sql.DB connections to correctly differentiate between pg and crdb conns. as a bonus, this moves all concerns about when to replace "cockroach://" with "postgres://" out of view, letting the thin shim "driver" take care of that. Change-Id: Ib24103ab7c508231e681f89a7321b623e4e125e9	2019-12-16 19:10:00 +00:00

... 4 5 6 7 8 ...

894 Commits