storj

Author	SHA1	Message	Date
Cameron Ayer	0155c21b44	private/testplanet, storagenode/{monitor,pieces}: write storage dir verification file on run and verify on loop On run, write the storage directory verification file. Every time the node runs it will write the file even if it already exists. The reason we do this is because if the verification file is missing, the SN doesn't know whether it is an incorrect directory, or it simply hasn't written the file yet, and we want to keep nodes running without needing operator intervention. Once this change has been a part of the minimum version for several releases, we will move the file creation from the run command to the setup command. Run will only verify its existence. Change-Id: Ib7d20e78e711c63817db0ab3036a50af0e8f49cb	2020-08-19 19:12:21 +00:00
Cameron Ayer	586e6f2f13	private/testblobs, storage, storage/filestore: add storage dir verification to filestore Sometimes SNOs fail to properly configure or lose connection to their storage directory which can result in DQ. This causes unnecessary repair and is unfortunate for all parties. This change introduces the creation of a special file in the storage directory at runtime containing the node ID. While the storage node runs, it periodically verifies that it can find said file with the correct contents in the correct location. If not, the node will shut down with an error message. This change will solve the issue of nodes losing access to the storage directory, but it will not solve the issue of nodes pointing to the wrong directory, as the identifying file is created each time the node starts up. After this change has been the minimum version for a few releases, we will remove the creation of the directory-identifying file from the storage node run command and add it to the setup command. Change-Id: Ib7b10e96ac07373219835e39239e93957e7667a4	2020-08-19 17:18:14 +00:00
Yingrong Zhao	14ad7a4f1c	satellite/metainfo: add limiter for objectdeletion and piecedeletion services This PR adds a limiter on the amount of concurrent objects deletion can be handled so we don't run out of memory. Change-Id: Id2ce368af6f86845fcdfd34cb2f5e460efe9b272	2020-08-19 16:08:29 +00:00
Moby von Briesen	708cb48aa6	storagenode/orders: implement orders filestore on storagenode * Add all new orders to the orders filestore instead of the database. * Submit orders from the filestore to the new satellite SettleWindow endpoint. The orders filestore will eventually replace the orders DB completely. For now, we will still be checking the orders DB and submitting those orders if they exist. In a later release, we will completely remove the orders DB, but we need both the DB and filestore for the transitionary period. Change-Id: Iac8780fd5ab770296181bbd313e1d335f072d4dc	2020-08-19 15:00:35 +00:00
Ivan Fraixedes	7f8df74070	private/testplanet: Use config with name set when empty In testplanet Run function we create a new configuration variable on each t.Run for setting the value to the config name field when it's empty, however the new copy of the configuration was not used. Change-Id: I9da34e743f9648850c96556eab0349e742db3aac	2020-08-19 13:12:10 +02:00
Egon Elbre	94a09ce20b	all: add missing dots Change-Id: I93b86c9fb3398c5d3c9121b8859dad1c615fa23a	2020-08-11 17:50:01 +03:00
Michal Niewrzal	88dcc93f3c	satellite/metainfo: use user PartnerID for bucket attribution Change-Id: I20f1bd432333f9b37ca8fb457c349eff94ffb392	2020-08-06 13:14:07 +00:00
Moby von Briesen	e02adfe5e9	satellite/overlay/config.go: Add AuditHistoryConfig to overlay Adds AuditHistory{WindowSize, TrackingPeriod, GracePeriod, OfflineThreshold}. These values will be used to track offline audits over time, and to suspend/disqualify nodes for being offline for too long. Change-Id: I05f7dbc3c034bdc53c4fbd7719c71a44f37ec6a5	2020-08-04 18:18:56 +00:00
Jeff Wendling	85a74b47e7	satellite/orders: 3-phase rollout This adds a config flag orders.window-endpoint-rollout-phase that can take on the values phase1, phase2 or phase3. In phase1, the current orders endpoint continues to work as usual, and the windowed orders endpoint uses the same backend as the current one (but also does a bit extra). In phase2, the current orders endpoint is disabled and the windowed orders endpoint continues to use the same backend. In phase3, the current orders endpoint is still disabled and the windowed orders endpoint uses the new backend that requires much less database traffic and state. The intention is to deploy in phase1, roll out code to nodes to have them use the windowed endpoint, switch to phase2, wait a couple days for all existing orders to expire, then switch to phase3. Additionally, it fixes a bug where a node could submit a bunch of orders and rack up charges for a bucket. Change-Id: Ifdc10e09ae1645159cbec7ace687dcb2d594c76d	2020-08-03 17:01:42 +00:00
Rafael Gomes	935f44ddb7	satellite/metainfo: Add Delete Service config Change-Id: I0a6e3ce1adfe1488eb23da9dda92877af1834599	2020-08-03 14:28:02 +00:00
Michal Niewrzal	20184d3604	satellite/metainfo: move TestAttributionReport to attribution tests Additionally test was simplified by adding ability to set user agent for testplanet uplink. Change-Id: I82942c2280562b5118a42aa8e1e0f53092f8dbe1	2020-07-30 19:18:15 +00:00
Bill Thorp	b265b7f555	satellite/console: make paywall optional Add a config so that some percent of users require credit cards / account balances in order to create a project or have a promotional coupon applied UI was updated to match needed paywall status At this point we decided not to use a field to store if a user is in an A/B test, and instead just use math to see if they're in a test. We decided to use MD5 (because its in Postgres too) and User UUID for that math. Change-Id: I0fcd80707dc29afc668632d078e1b5a7a24f3bb3	2020-07-28 10:57:49 +00:00
Qweder93	92efffb48a	storagenode/version: notification flow now based on cursor, chore_test added, versioncontrol added to reconfigure. Change-Id: I70713def8d585228270ec5a8c586ecc5b4d510c4	2020-07-23 14:13:24 +00:00
Ethan	cfca021839	satellite/accounting: Add chore to cleanup old project bandwidth rollups data Removes old project_bandwidth_rollups records that are no longer used. Uses a retain months configuration to determine how many months to save. Current month cannot be removed. Tests retainMonths=-1, 0, 2 Change-Id: Ia4be2546cdb28802427acf41ecd85ad66df3e62c	2020-07-22 18:56:49 +00:00
paul cannon	fd7bfc94fe	private/dbutil: don't sort column names in an index The order in which column names appear in an index should be deterministic (for both our sqlite and postgresql code). Also, the order is very relevant as to whether a given schema is correct. Change-Id: I227ea057fcd9c3e967dd241a7e1c787d1bc4baa1	2020-07-17 10:07:01 +00:00
Egon Elbre	b84923558b	satellite: fix scoping, formatting Change-Id: I21ef9edc2d449d75ad74891df7f966fb150d80fd	2020-07-16 19:13:14 +03:00
Egon Elbre	e70da5cd4e	all: fix comments Change-Id: I2d2307e3fab87de47a72b3595d051e2c95ff4f8a	2020-07-16 19:13:14 +03:00
Egon Elbre	080ba47a06	all: fix dots Change-Id: I6a419c62700c568254ff67ae5b73efed2fc98aa2	2020-07-16 14:58:28 +00:00
stefanbenten	9ace375ee0	satellite/{console,satellitedb}: change project limiting based on new users field This change switches the backend logic to use the new DB column on the users table to restrict project creation. Furthermore it back fills the existing limits from registration tokens to the new column to ensure no users are reset to the new default. UI is updated to reflect ability to create several projects Change-Id: Ie29157430ae6b065411ca4c4557c9f1be69cdc4f	2020-07-16 10:57:47 +00:00
Jennifer Johnson	784a156eea	satellite: prevents uplink from creating a bucket once it exceeds the max bucket allocation. Change-Id: I4b3822ed723c03dbbc0df136b2201027e19ba0cd	2020-07-15 17:27:05 +00:00
stefanbenten	257855b5de	all: replace == comparison with errors.Is Change-Id: I05d9a369c7c6f144b94a4c524e8aea18eb9cb714	2020-07-14 15:50:25 +00:00
stefanbenten	1149417615	satellite/admin: cleanup parameter handling We passed in revocationDB and metainfoDB for no reason. Lets remove it from the dependency list to further reduce the footprint. Change-Id: Ic0317bb92670fbd305d4a8b0ed1cb82858e2f6d3	2020-07-14 13:53:09 +02:00
Jessica Grebenschikov	8abb907010	satellite/orders: add settle orders with window Why: We need a way to cut down on database traffic due to bandwidth measurement and tracking. What: This changeset is the Satellite side of settling orders in 1 hr windows. See design doc for more details: https://review.dev.storj.io/c/storj/storj/+/1732 Change-Id: I2e1c151e2e65516ebe1b7f47b7c5f83a3a220b31	2020-07-13 15:41:29 -07:00
paul cannon	bbdb351e5e	all: use jackc/pgx in place of lib/pq What: Use the github.com/jackc/pgx postgresql driver in place of github.com/lib/pq. Why: github.com/lib/pq has some problems with error handling and context cancellations (i.e. it might even issue queries or DML statements more than once! see https://github.com/lib/pq/issues/939). The github.com/jackx/pgx library appears not to have these problems, and also appears to be better engineered and implemented (in particular, it doesn't use "exceptions by panic"). It should also give us some performance improvements in some cases, and even more so if we can use it directly instead of going through the database/sql layer. Change-Id: Ia696d220f340a097dee9550a312d37de14ed2044	2020-07-13 15:54:41 +00:00
Egon Elbre	9dc9cd8a17	tests: allow STORJ_TEST_POSTGRES STORJ_POSTGRES_TEST naming was not consistent with STORJ_SIM_POSTGRES. This allows to use STORJ_TEST_POSTGRES for clarity, it still has a fallback to STORJ_POSTGRES_TEST. Change-Id: I6f294c66c80fcfd6750fea2a89795f3b7f5dd691	2020-07-10 16:43:49 +03:00
Egon Elbre	4869cfc9a4	satellite/vouchers: remove deprecated endpoint Change-Id: I0a754217d9424253e448126face6594bc143f412	2020-07-10 12:38:46 +00:00
Stefan Benten	9dbd511396	private/dbutil: reduce db connection defaults (#3920 )	2020-07-08 19:59:42 +02:00
Qweder93	0521435e08	storagenode/gracefulexit: added deletion of all files left in storage/blobs/satellite after successful GE https://storjlabs.atlassian.net/browse/SG-368 Change-Id: I29a978fe0d0153aedf2be91dc7f45b4ef386d447	2020-07-08 14:38:31 +03:00
Bill Thorp	4a98c9514c	private/date: fix MonthsCountSince build issue Change-Id: I58a70ea85f966dece4b3c75f54cfaa5238f9ecd9	2020-06-30 17:47:18 -04:00
Cameron Ayer	cadb435d25	{satellite/audit, private/testplanet}: remove ErrAlreadyExists, run 2 audit workers in testplanet Since we increased the number of concurrent audit workers to two, there are going to be instances of a single node being audited simultaneously for different segments. If the node times out for both, we will try to write them both to the pending audits table, and the second will return an error since the path is not the same as what already exists. Since with concurrent workers this is expected, we will log the occurrence rather than return an error. Since the release default audit concurrency is 2, update testplanet default to run with concurrent workers as well. Change-Id: I4e657693fa3e825713a219af3835ae287bb062cb	2020-06-30 18:00:07 +00:00
Egon Elbre	13a5854535	satellite/satellitedb: clarify test migration merging Use a field to distinguish migration steps that need to use a different transaction from previous steps. This is clearer than using a func. Change-Id: I2147369d05413f3e8ddb50c71a46ab1ba3ab5114	2020-06-25 14:32:45 +00:00
Rafael Gomes	958ea1b9df	satellite/accounting: add download limit cache Change-Id: I722930cab8bd5d240f4878dc6997e9bc7637311f	2020-06-12 16:33:46 -03:00
Egon Elbre	1ed5a1bac5	satellite/satellitedb/satellitedbtest: skip omitted database The first implementation missed some changes. Change-Id: I7ae696175e0a9ea46954970ba8547638a05ed5a9	2020-06-11 13:28:16 +00:00
Ivan Fraixedes	dc5502cb81	private: Prepare pkg for enabling gosec Prepare package for enabling gosec linter. Change-Id: I0cce91d83969385f95e5bf82269d6c23629e04a0	2020-06-11 12:00:52 +00:00
Egon Elbre	1c30efd3a1	private/testplanet: allow setting "omit" as database to reduce output Change-Id: I7af90fdefe2ff2df1340aa2b17f40806d889ca18	2020-06-09 12:41:58 +03:00
Egon Elbre	36c461bd59	private/tagsql: track proper closing of rows and statements This ensures that rows are closed to avoid leaks. Also verifies that Err() is called, to ensure that no error is left behind. Change-Id: Idd1bec9bf479f40021da67b2c80ce83033149469	2020-06-05 18:25:43 +00:00
Egon Elbre	10f8b5492c	Revert "private/tagsql: add finalizer based leak checks during dev" This reverts commit `c6310b34d2`. The change was causing data-races that are hard to deal with. Change-Id: I0d29d85af70dce7ee2e967b9d7854719b32cf216	2020-06-05 17:52:46 +03:00
Yaroslav Vorobiov	09ca382abf	storagenode/db: preflight improve index discovery Change-Id: I876b321f6cd4e91dfced87aa4d39f2cf9a8e63d0	2020-06-05 14:03:25 +03:00
Jeff Wendling	c6310b34d2	private/tagsql: add finalizer based leak checks during dev what would win? thousands of man-hours spent trying to make the best, most bug-free code possible, or one leaky boi? this way we hopefully reduce the number of times we deadlock everything by forgetting a single rows.Close. Change-Id: I191727bbb75f74f5f4d0664e9e7b6ccf46c931f5	2020-06-03 15:06:58 -06:00
Moby von Briesen	b82d04e618	satellite/metainfo: limit size of uplink-provided metadata to 2KiB Change-Id: Id44a46046ddb4a12102525531f4502fcff2b6252	2020-06-01 16:51:29 -04:00
Qweder93	89c9672ce0	storagenode/piecestore: available storage check added in Upload Change-Id: I71e9e5f335d4320d5de8b374fe747fec43179f78	2020-06-01 16:55:22 +00:00
Michal Niewrzal	21518bcc92	private/testuplink: move tests to uplink Tests will be deleted from storj repo and added to uplink. Change-Id: I298d852325c8eb0df07df38fd7e1345623addd8d	2020-06-01 12:29:21 +02:00
Ethan	b1bb665c78	satellite/metainfo: Handle "server is not accepting clients" error during CRDB node rejoins https: //storjlabs.atlassian.net/browse/SM-1035 Change-Id: I27243b0d8fc3250916c86ceb915f973cbf80f656	2020-05-29 16:21:56 +00:00
Moby von Briesen	dc57640d9c	storagenode/piecestore: switch usedserials db for in-memory usedserials store Part 2 of moving usedserials in memory * Drop usedserials table in storagenodedb * Use in-memory usedserials store in place of db for order limit verification * Update order limit grace period to be only one hour - this means uplinks must send their order limits to storagenodes within an hour of receiving them Change-Id: I37a0e1d2ca6cb80854a3ef495af2d1d1f92e9f03	2020-05-28 12:52:52 -04:00
Michal Niewrzal	84892631c8	private/testplanet: remove old libuplink from testplanet Change-Id: Ib1553f84d0b3ae12a5b00382f0f53357b6a273e2	2020-05-28 13:50:23 +00:00
Qweder93	8db848791f	storagenode/console: added estimated payout for current month and estimated pay stub for previous month (until there's real data in satellite's table) + heldback percentage rate for previous month. Change-Id: I9346f6d22ed6fbb7e5346b102fc898467678f384	2020-05-27 14:51:23 +03:00
Natalie Villasana	8bd4d7b43e	storage/cockroachkv: add check if retry is needed during iteration This changeset replaces https://review.dev.storj.io/c/storj/storj/+/1839 which did the same thing but Nat couldn't figure out how to fix conflicting files the correct gerrity way. Change-Id: If05a8902aca986ea9f6c9168a90b31beebab839a	2020-05-26 14:32:06 -04:00
Jeff Wendling	074649835b	satellite/satellitedb: add some docs and improve some snapshots This attempts to add a README.md to help create consistent migrations that maximize our test coverage and do not include unnecessary statements. It also adds a feature to have an `-- OLD DATA --` section as well as a `-- NEW DATA --` section so that we can fix mistakes made in previous snapshots (like a row that was forgotten to be added when a table was created) without editing them going forward. Change-Id: I28a786f8ef163cae1de1bb08f61af1e1104b0a88	2020-05-22 21:27:36 +00:00
Michal Niewrzal	5c10964040	satellite/payments/stripecoinpayments: add test for listing issues while invoice generation https://review.dev.storj.io/c/storj/storj/+/1853 https://review.dev.storj.io/c/storj/storj/+/1882 Change-Id: Ie71363b819866dd60dbe7117b42cfa8348479310	2020-05-22 17:24:16 +00:00
Michal Niewrzal	3d332de228	private/testplanet: remove StorageNodeCount from testplanet uplink definion Small cleanup. Change-Id: Icabdf1433c36cd0e9f8e10a67975e98391024e14	2020-05-21 14:51:58 +00:00
Egon Elbre	bef84a5f9d	storagenode: remove dependency to overlay.NodeDossier This is the last dependency from storage node to satellite. Change-Id: I12f7abb91e84f823ba5af126c6e2979519838612	2020-05-21 08:37:13 +03:00
Egon Elbre	b42778c42e	private/testplanet: remove some additional Local-s Change-Id: I49701c41efb92efca27cc18d0a3f6d6b44d3cf8b	2020-05-21 08:37:13 +03:00
Natalie Villasana	2514d6328d	dbutil/cockroachutil: add monkit to QueryContext This will help us keep track of crdb errors in influx. Change-Id: I997596aa4eb9a2b9b81305d123c3452ecdf5dde5	2020-05-20 14:56:25 -04:00
Bill Thorp	f43cb1688d	private/tagsql: verify SQL connection with ping Use ping to make sure the database connection is valid Change-Id: I5217e28e186f487266c8f4a1d39cce0070dc1465	2020-05-20 13:12:16 +00:00
Michal Niewrzal	fe6a6f063f	private/testplanet: cleanup predefined data generation Use Console service to create user and project instead direct DB modifications. Change-Id: Ib0074b38313b3dc43b7d8d63ab2775d29028fb7b	2020-05-20 12:38:43 +00:00
Egon Elbre	941d10cbc3	private/testplanet: remove Peer.Local() Currently storagenode depends on overlay.NodeDossier, this is the first step in removing it. Change-Id: I034a3f1601835f8349bd41752455022e19bcc707	2020-05-20 11:05:34 +00:00
Egon Elbre	ed627144ed	all: use DialNodeURL throughout the codebase Change-Id: Iaf9ae3aeef7305c937f2660c929744db2d88776c	2020-05-20 10:36:30 +00:00
Michal Niewrzal	705e82ea99	private/testplanet: add AddUser and AddProject to satellite functionality We want to start adding more complex test cases for billing/invoices and we need more handy tooling to be able do this easily. Change-Id: Ib22ac6b4ba9ee77cc91c88b0cfd2d2efc15657df	2020-05-19 13:02:04 +00:00
Michal Niewrzal	ac375d37bc	satellite/payments: remove mockpayments and add Stripe client mock instead Change-Id: If3496f6abc16da90d2b43fa0c5be356847a39507	2020-05-19 09:35:37 +02:00
Natalie Villasana	8d87a6efc9	cockroachutil/driver: handle retryable errors returned from Next This will only work if retryable errors are returned on the first call to Next. Otherwise if they're returned later, we will need deeper changes at the application code level throughout the codebase 😬👎 Change-Id: I46d795a13670f66b7f085605ba1b779f69c339c3	2020-05-15 14:49:43 -04:00
littleskunk	ef2671927d	storagenode/piecestore: move queue size defaults (#3881 )	2020-05-15 19:10:26 +02:00
Ethan	159df8b2e4	Add logging listener for retrieving and setting log levels See https://storjlabs.atlassian.net/browse/SM-752 These changes allow us to change the log level at runtime through a handler off of the debug endpoint. Examples of changing the log level on storj-sim To get the current level for the satellite api process: curl -XGET 'http://127.0.0.1:10009/logging' --header 'Content-Type: text/plain' To change the log level: curl -XPUT 'http://127.0.0.1:10009/logging' --header 'Content-Type: text/plain' --data-raw '{"level":"error"}' Change-Id: I05d164b290929fa06b6d78c01075ee41f8238044	2020-05-12 16:38:06 -04:00
igor gaidaienko	1eab5e2980	satellite/console: Increase default webUI rate limit to 5 Previous limit is annoying for normal users Change-Id: I7cb783e0b2515f415b2a055d5e811efab3810654	2020-05-12 16:12:17 +00:00
Stefan Benten	e23bd806b4	satellite/accounting: separate usage and bandwidth limit (#3878 )	2020-05-12 15:01:15 +02:00
Egon Elbre	e6d5ce6b77	all: remove grpc It seems everyone has migrated to drpc. Change-Id: Ica6b2d0bdef68c6603083f2963458843eca71e9e	2020-05-10 06:36:09 +00:00
Egon Elbre	bcd93ee375	private/testplanet: add StopNodeAndUpdate This was commonly used and code with it can be simplified. Change-Id: I2f2b91f7de54269aee6ef027f97f9e8a7d222e39	2020-05-08 13:02:19 +00:00
Egon Elbre	90d859fbb8	private/testplanet: use drpc piecestore mock for testing Change-Id: Ia3f93f3c8b6584fb92f5d29025b7f0691120430e	2020-05-07 10:54:49 +03:00
Egon Elbre	c5452a87ec	private/testplanet: use drpc referral manager server Change-Id: I9e9e9a724c78c98859dd3e29416d766d8ffdca63	2020-05-07 07:03:11 +00:00
Egon Elbre	d98b8f6e23	satellite/metainfo,storage: use different limit for metainfo loop Change-Id: I5ef7233930679b977b33f7b3e1dda45c907dcfad	2020-05-05 10:37:20 +00:00
Moby von Briesen	8f60cfc4fb	satellite/overlay: Add flag for enabling/disabling disqualification from suspension mode Add a flag that allows us to easily switch disqualification from suspension mode on or off. A node will only be disqualified from suspension mode if it has been suspended for longer than the grace period AND the SuspensionDQEnabled flag is true. Change-Id: I9e67caa727183cd52ab2042b0a370a1bcaebe792	2020-05-04 17:25:09 +00:00
Egon Elbre	c630cf2490	storagenode/pieces: implement buffering for writing Currently uploads can cause a lot of IOPS, reduce this by introducing a in-memory buffer on-top of the file. Change-Id: I5f4e3e01c0a36258271d180b922107de447bcb59	2020-05-04 06:01:32 +00:00
Qweder93	0dfbdae614	private/date: MonthsCountSince removed, being unused anymore Change-Id: I666d29b91bc1283c1abb7c3a70b15417c1289f59	2020-04-30 15:14:17 +00:00
Qweder93	f54a4960a8	private/date: TestMonthCountSince temporary fix Change-Id: Ifc6f590f9fcdbd15ed84766cfe7a5809aa8a33f8	2020-04-30 12:52:44 +00:00
Egon Elbre	8928399d02	all: rename CreateTables to MigrateToLatest CreateTables hasn't been quite true for a while now, rename to MigrateToLatest to be clearer in it's behavior. Change-Id: Ida48e95122a5d9b7a814e922d3698e00024a2ba7	2020-04-30 07:21:17 +00:00
Egon Elbre	51f69d53dc	private/testplanet: fix closablePeer Peers require that Run finishes before calling Close. Cancel only signals peer to start closing however it does not wait for it to complete. Change-Id: If4b3778f4fc86402363ed3b555db11e1189e6200	2020-04-29 17:57:39 +00:00
Isaac Hess	237d9da477	storagenode/pieces: Deleter can handle multiple tests Before the deleter would close its done channel once, so if additional tests shared a storagenode, even if not in parallel, the later waits would not work properly. This fixes that problem. Change-Id: I7dcacf6699cef7c2c2948ba0f4369ef520601bf5	2020-04-29 11:26:56 -06:00
Isaac Hess	baccfd36b1	private/testplanet: Mark sn peer deleter test mode When running testplanet tests, mark storagenode peer PieceDeleter as in testing mode so that you don't have to do it on each test. Change-Id: I2592e02c63f8bcc9152ecf436bac4e798b08bccf	2020-04-28 15:57:29 -06:00
Egon Elbre	85c45cd56f	private/dbutil/pgtest: support multiple databases for testing Currently Cockroach isn't performant for concurrent database setup and tear-down. Instead of a single instance allow setting multiple potential connection strings and let the tests pick one connection string randomly. This improves test duration by ~10 minutes. While we are at significantly changing how pgtest works, introduce helper PickPostgres and PickCockroach for selecting the database to reduce code duplications in multiple places. Change-Id: I8ad171d5c4c8a4fc081ec2ae9bdd0cc948a80619	2020-04-28 21:55:49 +03:00
Bill Thorp	849326efee	satellite/console: cleanup rate limiter Changed == to >= JIC, removed TODOs after being convinced by Isaac Change-Id: Ibe8e5aafb3accfd3abb153bc315ebad223d55d15	2020-04-28 13:26:23 +00:00
Egon Elbre	ef913be234	satellite/satellitedb/satellitedbtest: don't use subtest naming A/B indicates that B is a subtest of A, however in this case they represent a configuration of the test, not a subtest. Change-Id: I64eed5d5bcb12759e54fe4b5373f8e88488e50f7	2020-04-27 19:32:09 +03:00
Bill Thorp	341aecfe0f	satellite/console: add rate limiter to login, register, password recovery Added a per IP rate limiter to the console web. Cleaned up password check to leak less bcyrpt info. Change-Id: I3c882978bd8de3ee9428cb6434a41ab2fc405fb2	2020-04-24 17:15:49 +00:00
Jess G	825226c98e	satellite/overlay: use node selection cache for uploads (#3859 ) * satellite/overlay: use node selection cache for uploads Change-Id: Ibd16cccee979d0544f2f4a01749af9f36f02a6ad * fix config lock Change-Id: Idd307e4dee8ab92749f1ec3f996419ea0af829fd * start fixing tests Change-Id: I207d373a3b2a2d9312c9e72fe9bd0b01e06ad6cf * fix test, add some more Change-Id: I82b99c2004fca2510965f9b389f87dd4474bc722 * change config name Change-Id: I0c0f7fc726b2565dc3828cb723f5459a940f2a0b * add benchmarks Change-Id: I05fa25bff8d5b65f94d918556855b95163d002e9 * revert bench to put in different PR Change-Id: I0f6942296895594768f19614bd7b2e3b9b106ade * add staleness to benchmark Change-Id: Ia80a310623d5a342afa6d835402170b531b0f870 * add cache config to testplanet Change-Id: I39abdab8cc442694da543115a9e470b2a8a25dff * have repair select old way Change-Id: I25a938457d7d1bcf89fd15130cb6b0ac19585252 * lower testplante config time Change-Id: Ib56a2ed086c06bc6061388d15a10a2526a663af7 * fix test Change-Id: I3868e9cacde2dfbf9c407afab04dc5fc2f286f69	2020-04-24 09:11:04 -07:00
Ivan Fraixedes	a0692d0db8	private/migrate: enhance docs in some funcs Enhance the doc comment in some migration methods. Change-Id: I3d91f7e01f24670fe3d972bd3b022b8a47251bdc	2020-04-23 13:06:06 +02:00
Moby von Briesen	72b93f3120	satellite/satellitedb: disqualify suspended nodes when the grace period passes If a node is suspended and receives an unknown or failing audit, disqualify them if the grace period (default 1w in production) has passed. Migrate the nodes table so any node that is currently suspended gets unsuspended when the satellite starts up. Change-Id: I7b81c68026f823417faa0bf5e5cb5e67c7156b82	2020-04-22 15:45:00 -04:00
Moby von Briesen	178aa8b5e0	satellite/{metainfo,repair}: Delete expired segments from metainfo * Delete expired segments in expired segments service using metainfo loop * Add test to verify expired segments service deletes expired segments * Ignore expired segments in checker observer * Modify checker tests to verify that expired segments are ignored * Ignore expired segments in segment repairer and drop from repair queue * Add repair test to verify that a segment that expires after being added to the repair queue is ignored and dropped from the repair queue Change-Id: Ib2b0934db525fef58325583d2a7ca859b88ea60d	2020-04-22 13:02:31 +00:00
Egon Elbre	e655e160dc	private/testuplink: delete delete ecclient.Delete is a deprecated func that shouldn't be used anymore. Change-Id: Ica4d17e334220311c99cea28f1d0e2d854d72896	2020-04-21 13:56:40 +00:00
Michal Niewrzal	c021b35879	private/testplanet: migrate testplanet to new libuplink Replace most of old libuplink usages in testplanet. 100% migration will be possible when we will be able to implement UploadWithClientConfig with new libuplink. Change-Id: I432d7d4917c7b67d46a058abd0a2a6a13f565ac4	2020-04-20 12:43:34 +00:00
Egon Elbre	9052085f70	private/testplanet: simplify uplink usage Change-Id: I3e488dc296f1094ce95e6d6597ca6d3f8da90a76	2020-04-16 16:45:55 +00:00
Jess G	75b9a5971e	satellite: update log levels (#3851 ) * satellite: update log levels Change-Id: I86bc32e042d742af6dbc469a294291a2e667e81f * log version on start up for every service Change-Id: Ic128bb9c5ac52d4dc6d6c4cb3059fbad73f5d3de * Use monkit for tracking failed ip resolutions Change-Id: Ia5aa71d315515e0c5f62c98d9d115ef984cd50c2 * fix compile errors Change-Id: Ia33c8b6e34e780bd1115120dc347a439d99e83bf * add request limit value to storage node rpc err Change-Id: I1ad6706a60237928e29da300d96a1bafa94156e5 * we cant track storage node ids in monkit metrics so lets use logging to track that for expired orders Change-Id: I1cc1d240b29019ae2f8c774792765df3cbeac887 * fix build errs Change-Id: I6d0ffe058e9a38b7ed031c85a29440f3d68e8d47	2020-04-15 12:32:22 -07:00
Kaloyan Raev	a2ce836761	remove sugar logging Change-Id: I6b6ca9704837cb3f5f5449ba7f55661487814d9f	2020-04-15 12:37:47 +00:00
Moby von Briesen	d7794a4851	satellite/overlay: hardcode default values for audit alpha/beta Alpha=1 and beta=0 are the expected first values for any alpha/beta reputation system we are using in the codebase. So we are removing the configurability of these values. Change-Id: Ic61861b8ea5047fa1438ea6609b1d0048bf0abc3	2020-04-14 19:12:40 +00:00
Qweder93	743b3fb226	storagenode/nodestats: add pricing model, storagenode/cache: add paystub history storing Change-Id: I9bc104a1407c8f286a964c796656d89b122bf752	2020-04-14 19:04:00 +03:00
Cameron Ayer	3ee6c14f54	satellite/downtime: add concurrency to downtime estimation We want to increase our throughput for downtime estimation. This commit adds the ability to reach out to multiple nodes concurrently for downtime estimation. The number of concurrent routines is determined by a new config flag, EstimationConcurrencyLimit. It also increases the default EstimationBatchSize to 1000. Change-Id: I800ce7ec1035885afa194c3c3f64eedd4f6f61eb	2020-04-14 14:39:13 +00:00
Jeff Wendling	e33da90879	private/dbutil/cockroachutil: stop checking for jackc/pgx we do not use that driver, and removing the case from the type assertion reduces the satellite binary size by 5%. Change-Id: I1c1b5e1e57dc4a98415103cfddd4f8c091588573	2020-04-10 07:19:02 +00:00
Jeff Wendling	d658a6a6ec	private/dbutil/txutil: fix logic in transaction retries before this change, any transaction that took longer than 5 minutes even if it succeeded, would get a retry error included in the result. try to make the logic more clear and add comments for the reader. Change-Id: Ib84a89a33907a24426ecf52c90404be0e0dfa307	2020-04-09 13:58:53 +00:00
Egon Elbre	a4c554f2ed	satellite/admin: support user query by email This adds new endpoint /api/user/{user-email} which allows to get the projects where the user is a member. It also moves existing endpoint: /project/{projectid}/limit -> /api/project/{projectid}/limit To avoid future conflicts for displaying pages. Change-Id: I5efe3e1c8f79894c136f92ed815f635a34ba6f98	2020-04-06 18:32:25 +00:00
Cameron Ayer	42be4bdc0f	satellite/contact: add timeout to PingBack method Change-Id: I2ec2f82e2e10d8be16f82e9de13ce42358e47c98	2020-04-04 18:26:30 +00:00
Egon Elbre	6492b13d81	all: remove old uuid Change-Id: I3a137f73456f010c37d3933dbe12cbbb840b809f	2020-04-02 19:30:36 +03:00
Michal Niewrzal	c178a08cb8	satellite/metainfo: add max segment size and max inline size to BeginObject response We want to control inline segment size and segment size on satellite side. We need to return such information to uplink like with redundancy scheme. Change-Id: If04b0a45a2757a01c0cc046432c115f475e9323c	2020-04-02 12:41:28 +00:00
Egon Elbre	8f73fb7a32	all: simplify uuid usage uuid.UUID implements driver.Value so it can be directly used as a scannable result. Replace uses of dbutil.BytesToUUID with uuid.FromBytes. Change-Id: I51a670185ceb3cc2199d5aa2b76bc3fc191ca8fe	2020-04-02 05:48:58 +00:00
Egon Elbre	644df8dcdc	private/version: minimal fix for tag-release.sh Previous split to a storj.io/private repository broke tag-release.sh script. This is the minimal temporary fix to make things work. This links the build information to specified variables and sets them inline. This approach, of course, is very fragile. Change-Id: I73db2305e6c304146e5a14b13f1d917881a7455c	2020-04-01 13:46:45 +00:00
Jeff Wendling	9bd0bd0c24	private/currency: add strictcsv support to microunit Change-Id: Iad2f6a07f189f2faa1d13bdb82dfa320921f6938	2020-03-31 14:57:04 -06:00
Egon Elbre	0a69da4ff1	all: switch to storj.io/common/uuid Change-Id: I178a0a8dac691e57bce317b91411292fb3c40c9f	2020-03-31 19:16:41 +03:00
Jeff Wendling	e2ff2ce672	satellite: compensation package and commands Change-Id: I7fd6399837e45ff48e5f3d47a95192a01d58e125	2020-03-30 14:08:14 -06:00
Egon Elbre	e1a443b04a	private/testplanet: allow modifying created database Instead of providing the database from outside to testplanet create it inside and then allow wrapping and modifying it. This is more convenient to use. Change-Id: I9b8f69e6e0a19ff984b4e2bfe927c9100c77bc6c	2020-03-27 19:14:48 +00:00
Moby von Briesen	a933bcc99a	satellite/repair/repairer/ec.go: add option for downloading pieces onto disk instead of in memory during repair Add flag to satellite repairer, "InMemoryRepair" that allows the satellite to decide whether to download the entire segment being repaired into memory (this is what the satellite already does), or to download it into temporary files on disk that will be read from in the upload phase of repair. This should help with handling high repair traffic on satellites that cannot afford to spend 64mb of memory per repair worker. Updates tests to test repair for both in memory and to disk. Change-Id: Iddf591e165621497c98533d45bfea3c28b08a194	2020-03-27 16:41:00 +00:00
Egon Elbre	e8f18a2cfe	private/testplanet: expose storagenode and satellite Config Change-Id: I80fe7ed8ef7356948879afcc6ecb984c5d1a6b9d	2020-03-27 17:01:25 +02:00
Natalie Villasana	8e0ca0e6f5	satellite/gc: update release default for gc to run separately (#3830 )	2020-03-26 14:44:18 -04:00
Jeff Wendling	97e980cd8a	private/dbutil: add database name to configure as a tag storagenodes have like 10 or more databases. without this tag they all get sent as the same value, stomping on each other. Change-Id: Ib12019684d6ea8f2a5b83df584056dfa79e3c4b3	2020-03-26 16:50:15 +00:00
Yingrong Zhao	b7b19289d1	bump storj.io/common to latest Change-Id: I16e337660ce8e1ef332cc842dbf4cfa067b9b98b	2020-03-25 09:08:40 -04:00
Yingrong Zhao	a731472496	bump storj.io/common to latest and storj.io/drpc to v0.0.11 Change-Id: I7a6e823b441eeff4621dfdf2d6577be76c9761c8	2020-03-24 15:17:10 -04:00
Michal Niewrzal	2a482e8bc4	private/version/checker: remove code which was moved to `storj/private/debug` package Change-Id: I44dfecd6ab875fb33851a22cf10b3064da9bfd65	2020-03-24 17:07:33 +00:00
Michal Niewrzal	fdf40a7526	storj: remove `storj/private/version` package which was moved to `storj/private` repo Change-Id: I81c3f5b9d5e4fe7bca760999eb045ee9734e5e2e	2020-03-24 14:31:33 +00:00
Michal Niewrzal	f0aeda3091	storj: remove from `storj/pkg` packages moved to `storj/private` repo * debug * traces * cfgstruct * process Package `storj/private/version` will be removed as a separate change. Change-Id: Iadc40faa782e6225513b28218952f02d9c240a9f	2020-03-24 09:56:29 +01:00
Egon Elbre	6a7571f73e	cmd/s3-benchmark: move to storj.io/benchmark Change-Id: Idca2b836bdf876ca28eb5cabc9bfae1d576e4a3e	2020-03-23 19:09:42 +02:00
Egon Elbre	1b6ab173a8	private/context2: moved to storj.io/common/context2 Change-Id: Ic1dd1ed645ff3e1057c9b2b143e2c3ddf29d678e	2020-03-20 14:39:46 +00:00
Jennifer Johnson	699b635e5d	satellite/overlay: rename newNodePercentage to newNodeFraction Change-Id: Ie66de91f88183b44de0773589e83e4ade9aa997a	2020-03-19 20:09:32 +00:00
Qweder93	0df586c3a8	satellitedb/heldamount updated, tests added + storagenode console updated Change-Id: I10f568a426d0fc42069d025de2accbef5b26dc0c	2020-03-19 15:37:45 +02:00
Kaloyan Raev	10b032e484	libuplink: return deleted bucket/object (step 4) Switch back to the original DeleteBucket and DeleteObject methods. Next step: remove the DeleteBucketReturnDeleted and DeleteObjectReturnDeleted from storj.io/uplink. Change-Id: I273a305326d411e51ce354ce72fcc6ecadf4dd5f	2020-03-19 13:32:07 +02:00
Jessica Grebenschikov	5142874144	satellite/gc: move garbage collection to its own process Change-Id: I7235aa83f7c641e31c62ba9d42192b2232dca4a5	2020-03-18 16:44:01 +00:00
Egon Elbre	09e0f3de63	satellite/metainfo/piecedeletion: add Service Change-Id: Id7e32ed569701fa0be66f9527c43a67052994570	2020-03-18 14:50:08 +00:00
Bill Thorp	94c11c5212	satellite: remove some unnecessary UTC() calls Fixes some easy cases of extraneous UTC() calls Change-Id: I3f4c287ae622a455b9a492a8892a699e0710ca9a	2020-03-13 13:49:44 +00:00
Jeff Wendling	41887883f3	satellite/satellitedb: check indexes on migration Change-Id: I5ba7ae2b512d77c70405ce332158f12128e27eed	2020-03-13 10:45:22 +00:00
JT Olio	051569c69f	satellite: enable open registration (and add flag that disables it) SM-441 Change-Id: I47bfedb312089f6d2bfbab013bd74ad4b8aa5f5e	2020-03-11 03:53:34 +01:00
paul cannon	79553059cb	satellite/repair: put irreparable segments in irreparableDB Previously, we were simply discarding rows from the repair queue when they couldn't be repaired (either because the overlay said too many nodes were down, or because we failed to download enough pieces). Now, such segments will be put into the irreparableDB for further and (hopefully) more focused attention. This change also better differentiates some error cases from Repair() for monitoring purposes. Change-Id: I82a52a6da50c948ddd651048e2a39cb4b1e6df5c	2020-03-09 21:45:16 +00:00
Michal Niewrzal	d7b5df70d3	cmd/uplink: remove unused flag New API has limited number of options to configure at the moment. We should remove unused flags from Uplink CLI and add if needed in the future. Change-Id: Icf3f3dadd43cb61a3b408b02d0762aef34425dbf	2020-03-09 13:44:46 +00:00
Michal Niewrzal	c20cf25f35	cmd: migrate uplink CLI to new API Change-Id: I8f8fcc8dd9a68aac18fd79c4071696fb54853a60	2020-03-09 13:26:29 +00:00
Egon Elbre	f4d5d89b68	private/testplanet: add WaitForStorageNodeEndpoints After calling uplink.Upload it is not guaranteed that the storage node has yet saved all the orders since it happens asynchronously. Hence we need a separate func to wait for them to complete. Change-Id: I0c34b3ea6c98dbcf37f80493c0e10a8bdbbb2aaf	2020-03-05 10:33:56 +00:00
Jennifer Johnson	1c1750e6be	removes bandwidth limiting On satellite, remove all references to free_bandwidth column in nodes table. On storage node, remove references to AllocatedBandwidth and MinimumBandwidth and mark as deprecated. Protobuf message, NodeCapacity, is left intact for backwards compatibility. Once this is released to all satellites, we can drop the column from the DB. Change-Id: I2ff6c6537fc9008a0c5588e951afea58ede85838	2020-03-04 14:04:00 +00:00
Cameron Ayer	7244a6a84e	storagenode/{contact, piecestore}: implement low disk notification with cooldown When a storagenode begins to run low on capacity, we want to notify the satellite before completely running out of space. To achieve this, at the end of an upload request, the SN checks if its available space has fallen below a certain threshold. If so, trigger a notification to the satellites. The new NotifyLowDisk method on the monitor chore is implemented using the common/syn2.Cooldown type, which allows us to execute contact only once within a given timeframe; avoiding hammering the satellites with requests. This PR contains changes to the storagenode/contact package, namely moving methods involving the actual satellite communication out of Chore and into Service. This allows us to ping satellites from the monitor chore Change-Id: I668455748cdc6741291b61130d8ef9feece86458	2020-03-03 10:45:37 -05:00
Michal Niewrzal	d384e48ad7	private/testplanet: set rollout seed to avoid warnings in logs Each test log is starting with warnings like this: "rollout config error: empty seed {"binary": "Identity"}". Make no sense to print them and pollute output. Change-Id: Ib50e28d09d8b259106d3b79d8f1262954a7aed63	2020-03-03 12:58:54 +00:00
Egon Elbre	decb2ec69a	private/processgroup: moved to storj.io/common/processgroup Change-Id: I1ec0bb440dda757d8f9a6f564a0084dde2f9cc84	2020-03-03 10:50:33 +00:00
Jeff Wendling	443aa08a06	private/dbutil/txutil: remove the individual retry events Change-Id: I63d06e57d7e6723b4d00d51f77c46345a11c4671	2020-03-03 08:38:19 +00:00
Qweder93	484ec7463a	storagenode: notifications on outdated software version Change-Id: If19b075c78a7b2c441e11b783c3c09fed55060c7	2020-03-02 16:48:02 +00:00
Egon Elbre	1f7c3be8f9	private/testplanet: add option to run testplanet databases non-parallel NonParallel running is needed for gateway tests, because minio unfortunately relies on global state. Change-Id: If730db2ab86d10f4d02e1ac3128f758e9c18cdff	2020-02-27 15:49:22 +02:00
Egon Elbre	f85606b5a7	private/grpctlsopts: grpc related tlsopts This moves grpc related tlsopts methods to private/grpctlsopts. This allows to remove grpc dependency from tlsopts. Change-Id: I25090b82b1e7a0633417ad600f8587b0c30ace73	2020-02-26 22:46:06 +02:00
Egon Elbre	64330c55b3	all: use pbgrpc common/pb moved grpc to a separate package common/pb/pbgrpc. This updates this repository to use it. Change-Id: I2de2a190688871cf9cb61f7ea511f8a01e264e4e	2020-02-26 21:27:47 +02:00
Egon Elbre	9752d01884	private/prompt: remove dependency to go-prompt Change-Id: Ida8ef731ce806cec076343dc77d72a3b0d7736b4	2020-02-25 13:09:41 +02:00
paul cannon	92d86fa044	satellite/repair: fix repair concurrency This new repair timeout (configured as TotalTimeout) will include both the time to download pieces and the time to upload pieces, as well as the time to pop the segment from the repair queue. This is a move from Github PR #3645. Change-Id: I47d618f57285845d8473fcd285f7d9be9b4318c8	2020-02-24 19:57:09 +00:00
Cameron Ayer	f22bddf122	{storagenode/contact, private/testplanet}: remove ErrFailureToStart and panic in testplanet.Start Change-Id: I252e8c9407400af7bda95a7657c8154660c3c801	2020-02-24 18:24:23 +00:00
Egon Elbre	e30f7b35b6	cmd/gateway: use a separate repository Change-Id: Idbb0b2b6cf0e60c6d5d91218c24524d72285cf26	2020-02-24 10:03:03 +02:00
Yingrong Zhao	5011e78311	storagenode/piecestore: remove unused DeletePiece endpoint With commit: `3331b443e7`, satellite will start calling `DeletePieces`. Therefore, we can remove the old endpoint once the above commit is deployed with all satellites Change-Id: I0124bc00a7cb808d119eb59f8fcd7fadf68158bb	2020-02-21 21:03:49 +00:00
Egon Elbre	5342dd9fe6	go.mod: update uplink Change-Id: I867a6a1eef8aa5d60bb676e5112b98c4192ce811	2020-02-21 16:08:12 +02:00
Egon Elbre	fd5611fb5e	private/testplanet: ensure server is closed in test Change-Id: I12eafadfb1794cd84a288e39740f703919a9ddc6	2020-02-21 10:10:51 +02:00
Yingrong Zhao	77f67a8086	satellite/metainfo: add timeout for delete request Change-Id: I9cad6d7ea185fc2c0ed4e58b42e4e3a78178a79f	2020-02-20 09:10:16 +00:00
Cameron Ayer	3e70a893dd	storagenode/{piecestore, contact}: report capacity to satellites if below specific threshold Curently, storage nodes only report their capacity to satellites once per hour. If a node fills up, it will fail all uploads until the next contact cycle begins. With these changes, at the end of an upload we check whether the MinimumDiskSpace threshold has been passed. If so, trigger the monitor chore to update the node's capacity, then trigger the contact chore to report the new capacity to the satellites Change-Id: Ie6aadaade1e2c12c87e03f8ff9059a50121380a0	2020-02-18 15:42:48 -05:00
Jeff Wendling	948589d38b	private/dbutil/txutil: include details about retry attempts in error Change-Id: I978ae44c4890df31185ec6077c9fb3a2b2fce8f1	2020-02-17 14:18:13 +00:00
Egon Elbre	892b190db6	satellite/admin: add project limit modification and authorization token Change-Id: If9a7214a940b8544f8023c2cd82da21f19d3f521	2020-02-17 07:56:16 +00:00
Michal Niewrzal	cea4c25f53	mod: bump common and uplink version Change-Id: Ia063d33c087dd91a46c008e154b078f11fa21527	2020-02-12 14:33:54 +00:00
Egon Elbre	dbf46c4aa7	satellite/admin: administrative endpoint Admin server allows creating basic REST and html API-s for different administrative tasks. Change-Id: I3dc1786abe1c87350eed60ec90e48130f44e63cf	2020-02-12 12:12:50 +02:00
Cameron Ayer	33d696b096	storage/redis/redisserver: simplify redisserver creation Change-Id: I881576a7881db671b5abeeca7120a022987cc47f	2020-02-11 19:11:57 +00:00
Cameron Ayer	b22bf16b35	satellite/overlay: add config flag for node selection free disk requirement Currently SNs report their free disk space once per hour. If a node becomes full, it has to wait until the next contact cycle begins to report; all the while receiving and failing upload requests. By increasing the minimum required disk space, we can give the storage nodes more time to report their space before the completely fill up. This change goes hand-in-hand with another change we want to implement: trigger capacity report on SN immediately upon falling below threshold. Change-Id: I12f778286c6c3f582438b0e2949765ac43325e27	2020-02-11 18:08:25 +00:00
Egon Elbre	429f08b4f0	satellite: add Admin peer This peer will contain our administrative panels. It's completely separated from our other satellite processes because it allows better control for restricting access to it. Change-Id: Ifca473bee82ff6c680b346918ba32b835a7a6847	2020-02-11 16:15:33 +00:00
Michal Niewrzal	426c8eb31a	private/testplanet: add DeleteBucket method for uplink New method added to be able to delete easily bucket during tests. Change-Id: Iaae89618cc676ddbbbd4b0df2eeacd143ea6f3c2	2020-02-11 15:58:13 +00:00
Jeff Wendling	99c3ba5bbf	testplanet: log stack trace for error during creation Change-Id: Ifcd2cba4195413a7213ba4d113c43f9fb3cbc3e5	2020-02-10 21:59:20 +00:00
Jeff Wendling	7999d24f81	all: use monkit v3 this commit updates our monkit dependency to the v3 version where it outputs in an influx style. this makes discovery much easier as many tools are built to look at it this way. graphite and rothko will suffer some due to no longer being a tree based on dots. hopefully time will exist to update rothko to index based on the new metric format. it adds an influx output for the statreceiver so that we can write to influxdb v1 or v2 directly. Change-Id: Iae9f9494a6d29cfbd1f932a5e71a891b490415ff	2020-02-05 23:53:17 +00:00
Jeff Wendling	d20db90cff	private/dbutil/txutil: create new transactions for retries it was noticed that if you had a long lived transaction A that was blocking some other transaction B and A was being aborted due to retriable errors, then transaction B was never given priority. this was due to using savepoints to do lightweight retries. this behavior was problematic becaue we had some queries blocked for over 16 hours, so this commit addresses the issue with two prongs: 1. bound the amount of time we will retry a transaction 2. create new transactions when a retry is needed the first ensures that we never wait for 16 hours, and the value chosen is 10 minutes. that should be long enough for an ample amount of retries for small queries, and huge queries probably shouldn't be retried, even if possible: it's more preferrable to find a way to make them smaller. the second ensures that even in the case of retries, queries that are blocked on the aborted transaction gain priority to run. between those two changes, the maximum stall time due to retries should be bounded to around 10 minutes. Change-Id: Icf898501ef505a89738820a3fae2580988f9f5f4	2020-02-01 18:34:28 +00:00
Michal Niewrzal	a181e0b627	libuplink: adjust tests to changes in encryption store We move PathCipher to encryption.Store and we need to adjust storj/uplink for those changes. Uplink repo is also using libuplink to run tests so we need first adjust storj/storj libuplink and later storj/uplink. Change-Id: I84f23e6bad18ac139f72c19939dc526f9f46d88b	2020-01-30 22:00:24 +00:00
Egon Elbre	f237d70098	storagenode,satellite: use pkg/debug Use debug.Server in storage node and satellite for customizing debug server. Change-Id: I7979412376d028cadf29656d838ab94f18e2aa99	2020-01-29 16:30:31 -05:00
Ethan	149273c63f	satellite/metainfo: add cache expiration for project level rate limiting Allow rate limit project cache to expire so we can make project level rate limit changes without restarting the satellite process. Change-Id: I159ea22edff5de7cbfcd13bfe70898dcef770e42	2020-01-29 16:14:10 +00:00
Egon Elbre	e319660f7a	private/lifecycle: implement Group lifecycle.Group implements controlling multiple items such that their startup and close works. Change-Id: Idb4f4a6c3a1f07cdcf44d3147a6c959686df0007	2020-01-29 00:37:33 +00:00
paul cannon	5a1838bc28	private/dbutil: retry single statements on cockroachdb This ought to make it so that all single statements (Exec- or Query-) on a CockroachDB backend will get retried as necessary. As there is no need for savepoints to be allocated or released in this case, there is no round-trip overhead except when statements actually do need to be retried. Change-Id: Ibd7f1725ff727477c456cb309120d080f3cd7099	2020-01-24 09:01:47 +00:00
Isaac Hess	2f77ce48f0	private/testplanet: Add databases to testplanet.databases near creation We now close databases in testplanet in reverse order, knowing that some caches and other objects need to close prior to the underlying db. Some dbs were not being added near the list of closeable databases near their creation, causing an issue with shutdown order. Change-Id: I23391f4d77649030493e47bd7169002a72b3bf7a	2020-01-23 15:30:52 -07:00
Jeff Wendling	16bb374deb	storagenode/piecestore: add large timeouts to read/write operations this is to help protect against intentional or unintentional slowloris style problems where a client keeps a tcp connection alive but never sends any data. because grpc is great, we have to spawn a separate goroutine for every read/write to the stream so that we can return from the server handler to cancel it if necessary. yep. really. additionally, we update the rpcstatus package to do some stack trace capture and add a Wrap method for the times where we want to just use the existing error. also fixes a number of TODOs where we attach status codes to the returned errors in the endpoints. Change-Id: Id8bb8ff84aa34e0f711b0cf9bce3908b36a1d3c1	2020-01-23 19:20:49 +00:00
Egon Elbre	89a148047d	private/testplanet: shutdown databases in reverse order Since we have caches on top of databases and they are included in the databases list, we need to shut them down in-reverse order to avoid issues with flushing to a closed database. Change-Id: I3f23a527a2a5425638b1a7e2cab84741f019d493	2020-01-23 18:55:57 +00:00
paul cannon	fd84fa6316	private/dbutil: rollback pending transactions on panic We don't do a lot of panicking in our main code, so hopefully this won't matter much, but we /do/ call panic a lot in our tests (t.Fatal, require.NoError, etc). And when that happens, we need pending transactions to be aborted or we can get into a deadlock situation when something else tries to /Close/ that connection. Change-Id: Idaf0d543ac95afea34f9b2393d1187f5322e9f0f	2020-01-23 16:30:19 +00:00
Isaac Hess	40a890639d	satellite/orders: Flush all pending bandwidth rollup writes on shutdown Currently we risk losing pending bandwidth rollup writes even on a clean shutdown. This change ensures that all pending writes are actually written to the db when shutting down the satellite. Change-Id: Ideab62fa9808937d3dce9585c52405d8c8a0e703	2020-01-23 08:12:41 -07:00
Egon Elbre	c6f94ce9e4	satellite/metainfo: remove support for boltdb based pointerDB By previous changes we can now remove testplanet.New and also remove metainfo boltdb support. Change-Id: I5bdfbbbb45967492728e705b34b2fedb4f28c381	2020-01-23 13:54:00 +02:00
Egon Elbre	5a4745eddb	all: remove usages of testplanet.New Ensure that tests use testplanet.Run, so we always require running against all database backends. Change-Id: I6b0209e6a4912cf3328bd35b2c31bb8598930acb	2020-01-22 22:42:57 +02:00
Jeff Wendling	3b86917cc9	private/dbutil/pgutil: faster cockroach constraint finding Change-Id: Ia100b9ef7d2d59dfad0389feb8f2e7c47c2c4c9b	2020-01-22 15:47:04 +00:00
Egon Elbre	fc2766eefc	private/testplanet: flatten migration for running tests Currently Cockroach DB setup takes a significant amount of time. This flattens the database setup into a single query, which improves the test time significantly. The migration tests still test each migration separately. Change-Id: Iaca16f34a6af3926fa2b5ebf618f939fd59460b3	2020-01-22 15:09:11 +00:00
Egon Elbre	8b3db70329	private/testplanet: increase metainfo rate limit Rate limit was causing tests to fail due to making too many request. Change-Id: Iafbc97b4880b6d98c86045b28ca7583d27f51720	2020-01-22 13:57:38 +00:00
Michal Niewrzal	6502454947	satellite/metainfo: move RS configuration to satellite With this change RS configuration will be set on satellite. Uplink with get RS values with BeginObject request and will use it. For backward compatibility and to avoid super large change redundancy scheme stored with bucket is not touched. This can be done in future. Change-Id: Ia5f76fc10c37e2c44e4f7b8754f28eafe1f97eff	2020-01-22 09:33:53 +00:00
Ethan	21a5d70a83	satellite/metainfo: Rate limiting - API requests Limits how many times metainfo APIs can be called per second by project ID. If limit is exceeded, the API will return Unauthorized/Too Many requests. Limit per second and the size of the limiter cache per project are configurable, as well as whether the limiter is enabled. Tests added/updated for the new rate_limit field in projects table. Tests added for exceeding limits and disableing limiter. Change-Id: Ic8ad102de3b690a475809d4f684156d5715f20fa	2020-01-21 14:25:04 +00:00
Michal Niewrzal	86f194769f	uplink: adjust to changes in storj/uplink This change is adjusting code base to changes in storj/uplink. https://review.dev.storj.io/c/storj/uplink/+/643 Change-Id: Ieca87f9f5983e391bf4b4fec8b9d5491fd32bfa1	2020-01-20 22:06:19 +00:00
Egon Elbre	c1c878efcf	all: fix import groupings check-imports was broken and didn't complain about things. Change-Id: I38adafd16b4aba86f0eb4f53427b4393f9a6c710	2020-01-20 17:47:44 +00:00
Egon Elbre	1279eeae39	private/tagsql,storage: fixes to context cancellation Replace all the remaining uses of sql.DB with tagsql.DB to fix issues with context cancellation. Introduce tagsql.Open which helps to get rid of all tagsql.Wrap-s. Use tagsql in cockroachkv and postgreskv. Change-Id: I8946d203341cb85a25976896fc7881e1f704e779	2020-01-20 15:44:39 +02:00
Egon Elbre	10d932fd65	lib/uplinkc: fix test flakiness by setting MaxTimeSkew Not having a skew caused an issue where: 1. Uplink calls "begin segment", where segment isn't committed to the database. 2. Uplink stores piece X to the storage node A with timestamp 1. 3. Satellite runs garbage collection with timestamp 2. 4. Satellite sends retain request to storage node A with timestamp 2. 5. Storage node A deletes piece X, because 1 < 2. 6. Uplink calls "commit segment" with storage node A in it. 7. Download of segment fails, because A doesn't have piece X. In production this is not an issue since the MaxTimeSkew is 72h by default. Change-Id: Id87ca3ddc44103dcd85d031b1367168c014b8e7b	2020-01-20 12:44:42 +00:00
Egon Elbre	ee0293c212	private/dbutil/sqliteutil: add missing err check Change-Id: Ie18c76d0e6d02a5c55e2d6503437b8a07b47a64e	2020-01-19 19:24:58 +00:00
Egon Elbre	1abfe42142	satellite: use tagsql Change-Id: I2170dee409fb0c2fe85913ddd36e7811a3b853ed	2020-01-19 14:39:16 +02:00
Egon Elbre	25b76fe63f	storagenode/storagenodedb: use tagsql Change-Id: Iba3b34a97b982deb4f72ce55517a294f249b6b55	2020-01-19 14:39:16 +02:00
Egon Elbre	59d06644b9	private/migrate: switch to tagsql Also added temporary types withRebind and withTagTx, which will be later removed. Currently they help to avoid changing the whole codebase at the same time. Change-Id: I7f07ba8f4709a23a463bfa67464628665a05808f	2020-01-19 14:39:16 +02:00
Egon Elbre	5fd833b108	private/dbutil: remove basic Query dbschema.Query is used only for testing and sqlite, so this won't cause us problems in production. Change-Id: Ib296a7daf161a9d3de23a7dfdc4f505d47ac4a37	2020-01-19 14:39:16 +02:00
stefanbenten	f4097d518c	satellite: reduce logging of node status Change-Id: I6618cf4bf31b856acd7a28b54011a943c03ab22a	2020-01-18 17:47:59 +00:00
Moby von Briesen	273eb66fae	cmd/storagenode,storagenode/preflight: add config flag to disable storagenode database preflight check. Disable preflight database check by default, and have the option to enable it. This will allow us to enable it once it is definitely working. Also change the name of the config flag for preflight time sync. Change-Id: Ie2e20f9e25dcb38794eafa7e1505e7c6ff287c99	2020-01-17 17:53:17 +00:00
Egon Elbre	5d80e22af9	private/tagsql: implement wrapper for sql.DB Wrapper adds tracing and fixes context usage issues. Change-Id: Ie6f7650eac87e2a2b64b760198498ba5857ad535	2020-01-17 13:52:12 +00:00
Cameron Ayer	4424697d7f	satellite/accounting: refactor live accounting to hold current estimated totals live accounting used to be a cache to store writes before they are picked up during the tally iteration, after which the cache is cleared. This created a window in which users could potentially exceed the storage limit. This PR refactors live accounting to hold current estimations of space used per project. This should also reduce DB load since we no longer need to query the satellite DB when checking space used for limiting. The mechanism by which the new live accounting system works is as follows: During the upload of any segment, the size of that segment is added to its respective project total in live accounting. At the beginning of the tally iteration we record the current values in live accounting as `initialLiveTotals`. At the end of the tally iteration we again record the current totals in live accounting as `latestLiveTotals`. The metainfo loop observer in tally allows us to get the project totals from what it observed in metainfo DB which are stored in `tallyProjectTotals`. However, for any particular segment uploaded during the metainfo loop, the observer may or may not have seen it. Thus, we take half of the difference between `latestLiveTotals` and `initialLiveTotals`, and add that to the total that was found during tally and set that as the new live accounting total. Initially, live accounting was storing the total stored amount across all nodes rather than the segment size, which is inconsistent with how we record amounts stored in the project accounting DB, so we have refactored live accounting to record segment size Change-Id: Ie48bfdef453428fcdc180b2d781a69d58fd927fb	2020-01-16 10:26:49 -05:00
Jeff Wendling	78c6d5bb32	satellite/satellitedb: reported_serials table for processing orders this commit introduces the reported_serials table. its purpose is to allow for blind writes into it as nodes report in so that we have minimal contention. in order to continue to accurately account for used bandwidth, though, we cannot immediately add the settled amount. if we did, we would have to give up on blind writes. the table's primary key is structured precisely so that we can quickly find expired orders and so that we maximally benefit from rocksdb path prefix compression. we do this by rounding the expires at time forward to the next day, effectively giving us storagenode petnames for free. and since there's no secondary index or foreign key constraints, this design should use significantly less space than the current used_serials table while also reducing contention. after inserting the orders into the table, we have a chore that periodically consumes all of the expired orders in it and inserts them into the existing rollups tables. this is as if we changed the nodes to report as the order expired rather than as soon as possible, so the belief in correctness of the refactor is higher. since we are able to process large batches of orders (typically a day's worth), we can use the code to maximally batch inserts into the rollup tables to make inserts as friendly as possible to cockroach. Change-Id: I25d609ca2679b8331979184f16c6d46d4f74c1a6	2020-01-15 19:21:21 -07:00
Yingrong Zhao	db8aee0806	satellite/contact; storagenode/preflight: add clock check on startup for storagenode add config preflight.enabled-local-time Change-Id: I7b942c9bee063aae409ee6721ae9d079dff0144f	2020-01-15 15:35:26 +00:00
Egon Elbre	08f63614be	private/context2: add WithoutCancellation Change-Id: I38557c16f41b8983886f256353cc6afb7634d9e6	2020-01-15 14:23:46 +02:00
Egon Elbre	64fb2d3d2f	Revert "dbutil: statically require all databases accesses to use contexts" This reverts commit `8e242cd012`. Revert because lib/pq has known issues with context cancellation. These issues need to be resolved before these changes can be merged. Change-Id: I160af51dbc2d67c5449aafa406a403e5367bb555	2020-01-15 07:28:00 +00:00
JT Olio	c01cbe0130	satellitedb: save out all db-touching traces Change-Id: Ib1e192221f9da813fd9cbb55f620a047b82c9523	2020-01-14 18:47:45 -05:00
JT Olio	8e242cd012	dbutil: statically require all databases accesses to use contexts this will allow for some nice runtime analysis down the road. also, this allows for wrapping database handles in a way that can interact with these contexts requires https://review.dev.storj.io/c/storj/dbx/+/514 Change-Id: Ib087b7cd73296dd2c1e0331314da34d861f61d2b	2020-01-14 18:20:47 -05:00
Egon Elbre	64f056bee4	private/dbutil/sqlutil: use context in queries Change-Id: Icb92daa483d13e6d57013f3917571d476126bfd2	2020-01-14 20:27:09 +00:00
Egon Elbre	df9e53ea0b	private: ensure we don't eat the underlying error When error is formatted using %v it's not possible to check whether the error was caused by a context cancellation. Change-Id: I164d1c83cdf5e7e6eacf082145b5c6a47078d041	2020-01-14 20:26:51 +00:00
Egon Elbre	cd4ff0722e	private/testplanet: use defaultInterval Change-Id: Ife2810be46faaaf8cd51b193a859a88fff894a0e	2020-01-14 16:07:36 +00:00
Isaac Hess	4950d7106a	satellite/orders: Add write cache for bw rollups Change-Id: I8ba454cb2ab4742cafd6ed09120e4240874831fc	2020-01-13 22:40:51 +00:00
Egon Elbre	b9740f0c0a	storage/cockroachkv: add ctx argument Change-Id: Ib6c29f44722b0354afcd499a0e567f04aef7eb28	2020-01-13 15:57:47 +02:00
Egon Elbre	ff267168c5	private/migrate: add ctx argument Change-Id: I3d65912d89261386413c494c7ed1576fed4dcaf4	2020-01-13 15:52:26 +02:00
Egon Elbre	24958bd7d3	satellite: add ctx to DB.CreateTables Change-Id: I9ecad624cf5a7fc9c86bb91c68f96a3a4efd2e92	2020-01-13 15:31:09 +02:00

... 2 3 4 5 6 ...

410 Commits