storj

Author	SHA1	Message	Date
Egon Elbre	0bdb952269	all: use keyed special comment Change-Id: I57f6af053382c638026b64c5ff77b169bd3c6c8b	2020-10-13 15:13:41 +03:00
Jeff Wendling	0f0faf0a9f	satellite/orders: do a better job limiting concurrent requests Doing it at the ProcessOrders level was insufficient: the endpoints make multiple database calls. It was a misguided attempt to only have one spot enter the semaphore. By putting it in the endpoint we can not only be sure that the concurrency is correctly limited but it can be configurable easily. Change-Id: I937149dd077adf9eb87fce52a1a17dc0afe96f64	2020-10-09 16:27:15 -04:00
Jeff Wendling	7c303208ff	satellite/satellitedb: emergency temporary order processing semaphore we have thundering herds of order submissions that take all of the database connections causing temporary periodic outages. limit the amount of concurrent order processing to 2. Change-Id: If3f86cdbd21085a4414c2ff17d9ef6d8839a6c2b	2020-10-08 19:16:47 +00:00
Cameron Ayer	b39a99bae6	satellite/{overlay,satellitedb}: always show node's real online score Previously if a node did not have audit history data for each of the windows over the tracking period, we would give them the benefit of the doubt and set their score to 1. This was to prevent nodes from being suspended right out the gate. We need a minimum amount of data to evaluate them. However, a node who is actually failing at being online will have no idea until they have received enough audits and we suspend them. Instead, we will always use their real score, but use a flag to determine whether they are eligible for suspension/dq. Change-Id: I382218f12e8770f95d4bcddcf101ef348940cadf	2020-10-02 12:28:11 -04:00
Cameron Ayer	c2525ba2b5	satellite/{repair,satellitedb}: clean up healthy segments from repair queue at end of checker iteration Repair workers prioritize the most unhealthy segments. This has the consequence that when we finally begin to reach the end of the queue, a good portion of the remaining segments are healthy again as their nodes have come back online. This makes it appear that there are more injured segments than there actually are. solution: Any time the checker observes an injured segment it inserts it into the repair queue or updates it if it already exists. Therefore, we can determine which segments are no longer injured if they were not inserted or updated by the last checker iteration. To do this we add a new column to the injured segments table, updated_at, which is set to the current time when a segment is inserted or updated. At the end of the checker iteration, we can delete any items where updated_at < checker start. Change-Id: I76a98487a4a845fab2fbc677638a732a95057a94	2020-09-29 20:38:22 +00:00
Egon Elbre	c23a8e3b81	go.mod: update pgx to v4.9.0 Fix query to use TextArray instead of VarcharArray. Fix queries to use the correct type. Change-Id: Ibb7e55adba277d05778118d81ca697470e72c374	2020-09-29 19:03:08 +00:00
Egon Elbre	2d27bc8787	satellite/satellitedb: separate cockroach for migration tests Currently Cockroach migration test is the most heavy with regards to schema changes. This causes other tests to time out. This adds an alternate cockroach instance that is used for migration tests. Change-Id: I01fe9313527ff002f0bb0914dd52c3645b8eaf6d	2020-09-29 09:31:33 +00:00
Jessica Grebenschikov	4a2c66fa06	satellite/accounting: add cache for getting project storage and bw limits This PR adds the following items: 1) an in-memory read-only cache that stores project limit info for projectIDs This cache is stored in-memory since this is expected to be a small amount of data. In this implementation we are only storing in the cache projects that have been accessed. Currently for the largest Satellite (eu-west) there are about 4500 total projects. So storing the storage limit (int64) and the bandwidth limit (int64), this would end up being about 200kb (including the 32 byte project ID) if all 4500 projectIDs were in the cache. So this all fits in memory for the time being. At some point it may not as usage grows, but that seems years out. The cache is a read only cache. When requests come in to upload/download a file, we will read from the cache what the current limits are for that project. If the cache does not contain the projectID, it will get the info from the database (satellitedb project table), then add it to the cache. The only time the values in the cache are modified is when either a) the project ID is not in the cache, or b) the item in the cache has expired (default 10mins), then the data gets refreshed out of the database. This occurs by default every 10 mins. This means that if we update the usage limits in the database, that change might not show up in the cache for 10 mins which means it will not be reflected to limit end users uploading/downloading files for that time period. Change-Id: I3fd7056cf963676009834fcbcf9c4a0922ca4a8f	2020-09-25 16:28:49 +00:00
Stefan Benten	38108828ac	satellite/satellitedb: enable multiple projects existing users Change-Id: I2ef77182d5464d72574698c8abfbbfdbda3f5a9e	2020-09-23 18:17:38 +02:00
Stefan Benten	5f6fccc6e8	satellite/satellitedb: makes limits nullable change backwards compatible Our current endpoints bail on us, if the column data is null. Thus we need to take the intermediate step and set the default to a fixed value and reset those with the following release. It sets the default column value to our current config values of 50GB for storage and bandwidth and 100 buckets, while still enabling the field to be nullable. All 0 values are migrated to be the default as well to ensure they can keep using their projects, as with the original change, 0 actually means 0. Change-Id: I797be80ce2d2105091599dc1b3fc76f74336b66b	2020-09-23 17:54:42 +02:00
Stefan Benten	2f648fd981	satellite: make limits be nullable Currently we have no way to actually set one of the following limits to 0 (meaning not usable): - maxBuckets - usageLimit - bandwidthLimit With having the field nullable, NULL corresponds to the global default, 0 now actually 0 and a set value determines a custom limit. Change-Id: I92bb77529dcbd0881ae8368921be9d246eb0919e	2020-09-21 19:34:19 +00:00
Qweder93	8182fdad0b	storagenode: heldamount renamed to payouts, renamed some methods and structs to more meaningful names. grouped estimated payout with payouts satellite: heldamount renamed to SNOpayouts. Change-Id: I244b4d2454e0621f4b8e22d3c0d3e602c0bbcb02	2020-09-16 14:57:35 +00:00
Cameron Ayer	e7c34a053d	satellite/satellitedb: add column and index "updated_at" to injuredsegments Change-Id: I59e9bb2077885f09e17795375fe98ed31bd83d54	2020-09-14 12:53:04 -04:00
Michal Niewrzal	27a9d14e2a	satellite/repair: use metabase.SegmentKey type in repair package Another change which is a part of refactoring to replace path parameter (string/[]byte) with key parameter (metabase.SegmentKey) Change-Id: I617878442442e5d59bbe5c995f913c3c93c16928	2020-09-08 19:35:20 +00:00
Jennifer Johnson	4e2413a99d	satellite/satellitedb: uses vetted_at field to select for reputable nodes Additionally, this PR changes NewNodeFraction devDefault and testplanet config from 0.05 to 1. This is because many tests relied on selecting nodes that were reputable based on audit and uptime counts of 0, in effect, selecting new nodes as reputable ones. However, since reputation is now indicated by a vetted_at db field that is explicitly set rather than implied by audit and uptime counts, it would be more complicated to try to update all of the nodes' reputations before selecting nodes for tests. Now we just allow all test nodes to be new if needed. Change-Id: Ib9531be77408662315b948fd029cee925ed2ca1d	2020-09-04 16:45:32 +00:00
Michal Niewrzal	8649a00557	satellite/gracefulexit: replace `Path []byte` with `Key metabase.SegmentKey` in TransferQueueItem We are unifying which name (and type) we use for the value that points to a segment. We want to use `key` instead of `path`. Dedicated type `metabase.SegmentKey` was created for this purpose also. This change is doing refactoring around gracefulexit. Change-Id: I90d51ff087b206179e61d5f1bc95f4709d76f917	2020-09-04 11:09:48 +00:00
Egon Elbre	dc48197bd8	satellite/orders: add bucket id to order limit Change-Id: I9019ec77d692e62ac17b67a1da71dc3535cde50c	2020-09-03 10:50:11 +03:00
Michal Niewrzal	0604a672c1	satellite/metainfo: use metabase in loop Change-Id: I1bb0c6fe0a762895fde950690b06f7dd9d77e178	2020-09-01 10:06:16 +00:00
Moby von Briesen	2d01dd9732	satellite/satellitedb: Add online_score column to nodes table Add online score used for the new audit history offline tracking system to the nodes table. This allows us easy access to the node's online score for the storagenode dashboard as well as for data analysis. Change-Id: Ie99be1192e5236862a5b3dbed2e5ef03b9169410	2020-08-31 15:07:07 +00:00
Moby von Briesen	60a95d0dc9	satellite/{satellitedb,overlay}: Enable offline suspension and review period When a node's audit history "online score" passes below a configured threshold, the node goes into "offline suspension" mode and begins a review period, where the operator is given an opportunity to bring their node back online. After the review period passes, offline suspension is turned off for the node. In the future, if a node still has a bad online score at the end of the review period, it will be disqualified. This is disabled right now. In the future, if a node is in offline suspension, it will be treated as "unhealthy". Right now, there are no consequences for being in offline suspension. Minor changes: * Moves AuditHistoryConfig out of UpdateStats/BatchUpdateStats args and into UpdateRequest. * Adds "now" argument to UpdateStats/BatchUpdateStats args for easy testing. * Changes formatting strings inside buildUpdateStatement to use specific types. Change-Id: I032b60298840fc16e6ef831da750f2d57619a397	2020-08-28 16:35:48 +00:00
Bill Thorp	729079965f	satellite/satellitedb: remove migration steps 69-102 Jenkins has been failing a lot lately due to test timeouts with CockroachDB. TestMigrateCockroach previously took around 5 minutes, now it takes 2. Why 103? I couldn't get 100 to work due to an error w/ NOT NULL and PKs. Change-Id: Iec95d4e25f9d6cd36920e7f43272c486a17fa879	2020-08-27 07:36:05 +00:00
Moby von Briesen	959cd5cd83	satellite/satellitedb: Update audit history from overlay.UpdateStats and overlay.BatchUpdateStats Change-Id: Ib530b61895ca4a8b12ba022c408a416b237b56d7	2020-08-20 22:46:28 +00:00
Moby von Briesen	5f0477ebe9	satellite/{overlay,satellitedb}: Create database functionality for updating audit history Add a function to the overlay cache called UpdateAuditHistory, which allows us to add online or offline audits to a particular node's audit history, and get that node's "online score" for the configured tracking period. The next step will be to use UpdateAuditHistory from inside BatchUpdateStats/UpdateStats, so that audit history is actually updated when nodes get audited, and we can suspend nodes based on their online score. Change-Id: I2289105e6961e68e829a987ff756b0e576fab120	2020-08-20 17:34:27 +00:00
Egon Elbre	94a09ce20b	all: add missing dots Change-Id: I93b86c9fb3398c5d3c9121b8859dad1c615fa23a	2020-08-11 17:50:01 +03:00
Ethan	ab1d0f097d	satellite/storageusage: Group accounting rollups at_rest_total by day When investigating a gap in storage usage data in the SN dashboard, I noticed that there were 2 entries in the accounting_rollups table on the date of the gap. This change accounts for multiple entries in the accounting_rollups table for a given day. Change-Id: Ibf2b5d0455117cb0417163e8fcfb7e509d594171	2020-08-10 15:03:15 +00:00
Kaloyan Raev	7552ff26ec	satellite/db: drop project_invoice_stamps table It's an obsolete table from earlier state of Stripe invoices implementation. No code is currently using it. It is confirmed that this table is currently empty across all satellites. Change-Id: I12d2756578faf8418ea8f3b09088e885694b8925	2020-08-10 13:22:10 +00:00
Kaloyan Raev	edfd3d7661	satellite/payments: delete `credits` and `credits_spendings` db tables Jira: https://storjlabs.atlassian.net/browse/USR-822 This is the last step of dropping these 2 db tables. It also deletes all code associated with them. Change-Id: I8be840dc2a7be255cf6308c9434b729fe4d9391e	2020-07-30 12:19:57 +03:00
Egon Elbre	36ed939b89	satellite/orders: add buckets db to service We need to add bucket UUID into the order limit, hence we need access to the buckets table. Change-Id: I348ce1f709c9fcdec5c4034acaab59805b33da9f	2020-07-24 17:36:49 +03:00
Ethan	cfca021839	satellite/accounting: Add chore to cleanup old project bandwidth rollups data Removes old project_bandwidth_rollups records that are no longer used. Uses a retain months configuration to determine how many months to save. Current month cannot be removed. Tests retainMonths=-1, 0, 2 Change-Id: Ia4be2546cdb28802427acf41ecd85ad66df3e62c	2020-07-22 18:56:49 +00:00
Bill Thorp	65408db6e0	satellite/satellitedb: Coinpayments repeat insert bug fix I introduced a bug with https://review.dev.storj.io/c/storj/storj/+/2216 Because the log change allowed insert to be called multiple times. This changes the insert logic to do nothing if the PK already exists. Change-Id: I90d192a0f6619bfbb360ea104066f00a3348f6dd	2020-07-20 20:21:35 +00:00
Isaac Hess	67a292d135	satellite/satellitedb: Monitor node tallies We are adding a monkit evaluation for the total sum of data stored on the nodes before it is inserted into the database. This will give us a time-series history of total data stored so we can see it change over time. Change-Id: I41145a2d7a09c8e63b42ae578bd081035b60e529	2020-07-17 10:21:42 -06:00
Egon Elbre	d8dcae3075	all: fix error checking Change-Id: Ia0da1bbd6ce695139922f94096c2419281905e32	2020-07-16 19:13:14 +03:00
Egon Elbre	e70da5cd4e	all: fix comments Change-Id: I2d2307e3fab87de47a72b3595d051e2c95ff4f8a	2020-07-16 19:13:14 +03:00
Egon Elbre	080ba47a06	all: fix dots Change-Id: I6a419c62700c568254ff67ae5b73efed2fc98aa2	2020-07-16 14:58:28 +00:00
stefanbenten	9ace375ee0	satellite/{console,satellitedb}: change project limiting based on new users field This change switches the backend logic to use the new DB column on the users table to restrict project creation. Furthermore it back fills the existing limits from registration tokens to the new column to ensure no users are reset to the new default. UI is updated to reflect ability to create several projects Change-Id: Ie29157430ae6b065411ca4c4557c9f1be69cdc4f	2020-07-16 10:57:47 +00:00
stefanbenten	0209a2095f	satellite/{console,satellitedb}: add project_limit column to users table Change-Id: I603f085f17ca5b413dd1c6837c2081f9e7e791a1	2020-07-15 17:27:31 +00:00
stefanbenten	2c2d284f3d	satellite/admin: add bucket limit handling endpoint Change-Id: I4b199277cff30f11f4a9fff3b0ac4017b694f2e8	2020-07-15 17:27:23 +00:00
Jennifer Johnson	784a156eea	satellite: prevents uplink from creating a bucket once it exceeds the max bucket allocation. Change-Id: I4b3822ed723c03dbbc0df136b2201027e19ba0cd	2020-07-15 17:27:05 +00:00
stefanbenten	257855b5de	all: replace == comparison with errors.Is Change-Id: I05d9a369c7c6f144b94a4c524e8aea18eb9cb714	2020-07-14 15:50:25 +00:00
stefanbenten	0a32ba0e6b	satellite/admin: add project rename functionality Change-Id: I4c0f42d4c2c26859279f247f94cef97a8ff630a9	2020-07-14 11:36:49 +00:00
stefanbenten	f768302c91	satellite/admin: harden project deletion requirements Change-Id: Ia7ea469f87469b16e464dc22af24b98a6ef1873d	2020-07-14 11:36:29 +00:00
Jessica Grebenschikov	8abb907010	satellite/orders: add settle orders with window Why: We need a way to cut down on database traffic due to bandwidth measurement and tracking. What: This changeset is the Satellite side of settling orders in 1 hr windows. See design doc for more details: https://review.dev.storj.io/c/storj/storj/+/1732 Change-Id: I2e1c151e2e65516ebe1b7f47b7c5f83a3a220b31	2020-07-13 15:41:29 -07:00
paul cannon	bbdb351e5e	all: use jackc/pgx in place of lib/pq What: Use the github.com/jackc/pgx postgresql driver in place of github.com/lib/pq. Why: github.com/lib/pq has some problems with error handling and context cancellations (i.e. it might even issue queries or DML statements more than once! see https://github.com/lib/pq/issues/939). The github.com/jackc/pgx library appears not to have these problems, and also appears to be better engineered and implemented (in particular, it doesn't use "exceptions by panic"). It should also give us some performance improvements in some cases, and even more so if we can use it directly instead of going through the database/sql layer. Change-Id: Ia696d220f340a097dee9550a312d37de14ed2044	2020-07-13 15:54:41 +00:00
Egon Elbre	9dc9cd8a17	tests: allow STORJ_TEST_POSTGRES STORJ_POSTGRES_TEST naming was not consistent with STORJ_SIM_POSTGRES. This allows to use STORJ_TEST_POSTGRES for clarity, it still has a fallback to STORJ_POSTGRES_TEST. Change-Id: I6f294c66c80fcfd6750fea2a89795f3b7f5dd691	2020-07-10 16:43:49 +03:00
Jeff Wendling	885ef70c58	satellite/nodeapiversion: new table for tracking node api usage This system tracks an abstract "api version" from nodes based on their usage, allowing us to have latching behavior where if a node ever uses a new api, it can be blocked from using the old api. This is better than using self-reported semver version information because the node cannot lie, there's no confusion about what semver version implies which features, no questions about dev and ci environments, and no dependencies between reporting the version and using the new api. Change-Id: Ifeced5c9ae8e0a16102d79635e176a7d3bdd8ed4	2020-07-09 15:02:25 +00:00
Isaac Hess	fd740295ec	satellite/satellitedb: Add comment to revocation Change-Id: I1b65b7e46439c4788835ea5bfd4df3d32a713b44	2020-07-06 21:51:35 +00:00
Bill Thorp	00ae5ebbab	satellite/payments: Credit coin payments earlier Apply the coin payments when CoinPayments.net receives the funds instead of when STORJ gets them from CoinPayments.net Based on 7/1/20 User Growth standup guidance by JG Relates to: https://storjlabs.atlassian.net/browse/USR-801 Change-Id: I174ca23a585010f39464c45525e1dfe0179b7c1a	2020-07-06 13:24:26 +00:00
Cameron Ayer	e3088d9ad5	satellite/satellitedb: add new DB table audit_histories Change-Id: I5f854514994cab9a68cf978f2dabfb588df695f5	2020-07-01 21:14:35 +00:00
Qweder93	b639ec08d4	satellite/heldamount: payments added, endpoint for payments added Change-Id: Ia2b9580bc353ef614680230c6f82c5bf6ded49c4	2020-07-01 18:15:01 +03:00
Cameron Ayer	cadb435d25	{satellite/audit, private/testplanet}: remove ErrAlreadyExists, run 2 audit workers in testplanet Since we increased the number of concurrent audit workers to two, there are going to be instances of a single node being audited simultaneously for different segments. If the node times out for both, we will try to write them both to the pending audits table, and the second will return an error since the path is not the same as what already exists. Since with concurrent workers this is expected, we will log the occurrence rather than return an error. Since the release default audit concurrency is 2, update testplanet default to run with concurrent workers as well. Change-Id: I4e657693fa3e825713a219af3835ae287bb062cb	2020-06-30 18:00:07 +00:00
Egon Elbre	d91cf5f4de	satellite/satellitedb: add missing SeparateTx Change-Id: I3ba5a4e0632a1e0e5e77c30e515953eadf05bc45	2020-06-26 12:27:05 +03:00
Egon Elbre	13a5854535	satellite/satellitedb: clarify test migration merging Use a field to distinguish migration steps that need to use a different transaction from previous steps. This is clearer than using a func. Change-Id: I2147369d05413f3e8ddb50c71a46ab1ba3ab5114	2020-06-25 14:32:45 +00:00
Cameron Ayer	3b4b5f45c7	satellite: replace references to Suspended with UnknownAuditSuspended Change-Id: I3d2d00c95954c0546ad077702617895f262926ef	2020-06-23 14:19:22 +00:00
Isaac Hess	2d727bb14e	satellite: Check macaroon revocation When a request comes in on the satellite api and we validate the macaroon, we now also check if any of the macaroon's tails have been revoked. Change-Id: I80ce4312602baf431cfa1b1285f79bed88bb4497	2020-06-22 13:50:07 -06:00
Egon Elbre	f68e7b3fde	satellite/overlay: replace pb.InfoResponse pb.InfoResponse wasn't used for protocol buffer communication, but instead as a satellite type. Change-Id: I755619f2deec5b76c4fe488591b7d8c1b9fcdafb	2020-06-16 15:16:55 +03:00
Cameron Ayer	0885ba5646	satellite/satellitedb: add new columns for offline suspension add new columns `offline_suspended` and `under_review` to nodes table. `unknown_audit_suspended` is a new column which will replace `suspended` Change-Id: I22ddeb338ea0ff63f14332a7ebd0f3e9e4c06cdc	2020-06-15 04:00:20 +00:00
paul cannon	7b8e91ff28	satellite/satellitedb: no orders for exited nodes We should not be sending any type of orders to nodes that have completed graceful exit with the current satellite. In particular, we should not be trying to audit them, because that would be silly. Change-Id: Ie2153e5739914ab696feefcdef28545ed70f84e4	2020-06-13 13:49:33 +00:00
Egon Elbre	1ed5a1bac5	satellite/satellitedb/satellitedbtest: skip omitted database The first implementation missed some changes. Change-Id: I7ae696175e0a9ea46954970ba8547638a05ed5a9	2020-06-11 13:28:16 +00:00
Cameron Ayer	bad299b541	satellite/satellitedb: serialize UpdateStats and BatchUpdateStats transactions Since we increased the number of audit workers from 1 to 2, we need to make sure concurrent updates do not trample each other. We can do this by serializing the transactions. Change-Id: If1b2f71cabe3c779c12ffa33c0c3271778ac3ae0	2020-06-10 17:11:28 +00:00
Egon Elbre	36c461bd59	private/tagsql: track proper closing of rows and statements This ensures that rows are closed to avoid leaks. Also verifies that Err() is called, to ensure that no error is left behind. Change-Id: Idd1bec9bf479f40021da67b2c80ce83033149469	2020-06-05 18:25:43 +00:00
Egon Elbre	34db4a80fd	ci: fix staticcheck failures Change-Id: I176fb24214755a1940a0a1a4e9cc8e39f184870b	2020-06-05 13:15:34 +00:00
Michal Niewrzal	2b2efcc662	satellite/payments/stripecoinpayments: move Coupons expiration date sorting directly to listing method Change-Id: I58d8a6ea1feba9ff2d19f21a1dbc87bfb8b49801	2020-06-04 09:47:42 +00:00
Jeff Wendling	254b42ff65	satellite/satellitedb: fix leaked rows from repairQueue.Insert Change-Id: If5e62c49770f591ebe3f4d2dd4dd2658c229a022	2020-06-03 14:31:21 -06:00
Michal Niewrzal	b20ced9519	satellite/satellitedb: drop project_id column from coupons table This is last part of https://storjlabs.atlassian.net/browse/USR-818 Change-Id: I053d11b37df962c12e46645bae2fc2dad49c9755	2020-06-03 14:56:41 +00:00
Cameron Ayer	6a60e1e96b	satellite/satellitedb: inclusive interval_start in GetAllocatedBandwidthTotal The DB query in GetAllocatedBandwidthTotal uses an exclusive range: 'WHERE interval_start > ?' The value that is used for this condition is the first day of the current month, 00:00:00 UTC. By using the exclusive '>', we exclude the entire first hour of the month from the result set. Change-Id: I3ed300f5230c7514dc9495a85e8166213cd0842e	2020-06-02 13:06:45 -04:00
Jeff Wendling	2b3545c0c3	satellite/satellitedb: use delete returning to query pending_serial_queue this way we don't have to do 2 steps, and by using the ctid, postgres is going to do two very efficient prefix scans. Change-Id: Ia9d0546cdf0a1af67ceec9cd508d336a5fdcbdb9	2020-06-01 15:43:33 -06:00
Jeff Wendling	44433f38be	satellite/satellitedb: remove ORDER BY when reading from queue also remove the continuation support from the queue, otherwise we may end up sequential scanning the entire table to get a few rows at the end. then, in the core, instead of looping both to get a big enough batch inside of the queue, as well as outside of it to ensure we consume the whole queue, just get a single batch at a time. also, make the queue size configurable because we'll need to do some tuning in production. Change-Id: If1a997c6012898056ace89366a847c4cb141a025	2020-06-01 18:31:14 +00:00
Yingrong Zhao	163c027a6d	satellite/satellitedb: remove monkit trace from convertDBNode In jaeger, it shows that this function gets called repeatedly in a single request. Most of the time, it's less than 1ms. Therefore, it doesn't add much value in our trace but creates noise. Change-Id: I20234f36bbcf0fc22f91e5e1a5634c0cad577ed0	2020-06-01 17:58:43 +00:00
Michal Niewrzal	a9f6489663	satellite/payments/stripecoinpayments: remove ProjectID from Coupon struct This change is removing ProjectID from code. Next change will be about dropping this column from DB table. Change-Id: Idb949e2829e2c304a2b6b011259c7cc7667082e1	2020-06-01 11:37:20 +00:00
Egon Elbre	07050eea26	all: use common/storj Change-Id: Id1e36d52f9807b5ffbb72ce73f4b60cb21b68a78	2020-05-29 11:57:32 +03:00
Jeff Wendling	1e065fb450	satellite: migration to fix bad imported payment history the initial calculations for the historical values of comp_at_rest were wrong. because our historical data only included total amounts as well as compensation for bandwidth, the at rest value was calculated as at_rest = total - bandwidth unfortunately, that calculation did not take surge pricing into account correctly. the at rest and bandwidth values do not include surge pricing, but the total that was used did. so what we actually calculated was no_surge_at_rest = surge_total - no_surge_bandwidth which will create a value that is too large. this migration fixes the calculation for imports that are old enough and of a non-negligible difference. Change-Id: I61eb0b670510f6d7fb8fc3de39ba79150fac10eb	2020-05-28 12:59:08 -06:00
Michal Niewrzal	75b3db5426	satellite/payments/stripecoinpayments: test invoice user with more than 1 project https://storjlabs.atlassian.net/browse/USR-291 Change-Id: I98286e40254e8868de9eb675a9c9a8cd0bf5f3b1	2020-05-27 09:12:23 +00:00
Moby von Briesen	290c006a10	satellite/repair/{checker,queue}: add metric for new segments added to repair queue * add monkit stat new_remote_segments_needing_repair, which reports the number of new unhealthy segments in the repair queue since the previous checker iteration Change-Id: I2f10266006fdd6406ece50f4759b91382059dcc3	2020-05-27 06:23:47 +00:00
Jeff Wendling	074649835b	satellite/satellitedb: add some docs and improve some snapshots This attempts to add a README.md to help create consistent migrations that maximize our test coverage and do not include unnecessary statements. It also adds a feature to have an `-- OLD DATA --` section as well as a `-- NEW DATA --` section so that we can fix mistakes made in previous snapshots (like a row that was forgotten to be added when a table was created) without editing them going forward. Change-Id: I28a786f8ef163cae1de1bb08f61af1e1104b0a88	2020-05-22 21:27:36 +00:00
Jennifer Johnson	03e5f922c3	satellite/overlay: updates node with a vetted_at timestamp if they meet the vetting criteria What: As soon as a node passes the vetting criteria (total_audit_count and total_uptime_count are greater than the configured thresholds), we set vetted_at to the current timestamp. Why: We may want to use this timestamp in future development to select new vs vetted nodes. It also allows flexibility in node vetting experiments and allows for better metrics around vetting times. Please describe the tests: satellitedb_test: TestUpdateStats and TestBatchUpdateStats make sure vetted_at is set appropriately Please describe the performance impact: This change does add extra logic to BatchUpdateStats and UpdateStats and commits another variable to the db (vetted_at), but this should be negligible. Change-Id: I3de804549b5f1bc359da4935bc859758ceac261d	2020-05-20 16:30:26 -04:00
Egon Elbre	5d016425f1	satellite/{contact,downtime,overlay}: use NodeURL Change-Id: I555a479a89e0ddbf0499898bdbc8574282cd6846	2020-05-20 11:09:05 +00:00
Stefan Benten	0a26c4af9a	satellite/admin: add coupon deletion (#3893)	2020-05-19 15:49:44 +03:00
Stefan Benten	671aca56b0	satellite/admin: add coupon creation and listing (#3891)	2020-05-19 12:36:13 +02:00
Kaloyan Raev	49571f1a23	satellite/payments: all invoice commands require period To avoid including multiple months in a single invoice, we need all inspector's invoice commands to run in for specific period. See https://storjlabs.atlassian.net/browse/USR-725 Change-Id: I3637dc189234f02350daca8d897c21765762ea55	2020-05-14 11:50:19 +00:00
Jeff Wendling	6352d46100	satellite/satellitedb: do better ::date conversions There is a subtle problem when one does a cast with `::date`. Observe: teststorj=# set timezone = 'US/Eastern'; SET teststorj=# select (timestamp with time zone '2020-02-01 00:00:00+00')::date; date ------------ 2020-01-31 (1 row) teststorj=# set timezone = 'UTC'; SET teststorj=# select (timestamp with time zone '2020-02-01 00:00:00+00')::date; date ------------ 2020-02-01 (1 row) In order to correctly determine the date a timestamp is in, one has to explicitly pick the time zone that the date truncation should use otherwise postgres will use whatever setting the client has. These tests were failing for me locally, because I run my postgres in the US/Eastern time zone to try to tickle these bugs out. So it should be `(x at time zone 'UTC')::date` instead of just `x::date`. Change-Id: I4e9e32d4b53abc6165a4d0474f4702f8b9f801c7	2020-05-13 15:58:07 +00:00
Egon Elbre	0e3be60b79	satellite/satellitedb: simplify migrate step Change-Id: Ie4574144fb6ddd057d5fca740702c59fbdb2c5e4	2020-05-12 18:27:07 +03:00
Stefan Benten	e23bd806b4	satellite/accounting: separate usage and bandwidth limit (#3878)	2020-05-12 15:01:15 +02:00
Michal Niewrzal	22fbe804e3	satellite/accounting: test if project bandwidth limits reset with billing cycle https://storjlabs.atlassian.net/browse/USR-287 Change-Id: I4dc5f6342417b6af3384da32d3d2ed8592904406	2020-05-11 15:11:53 +00:00
Moby von Briesen	8f60cfc4fb	satellite/overlay: Add flag for enabling/disabling disqualification from suspension mode Add a flag that allows us to easily switch disqualification from suspension mode on or off. A node will only be disqualified from suspension mode if it has been suspended for longer than the grace period AND the SuspensionDQEnabled flag is true. Change-Id: I9e67caa727183cd52ab2042b0a370a1bcaebe792	2020-05-04 17:25:09 +00:00
Ethan	acf53bea4d	satellite/orders;accounting: Add monthly project download bandwidth rollup See https://storjlabs.atlassian.net/browse/SM-776 Change-Id: Ifd5cccea43c556fd59822d17344f399cfe9a7164	2020-05-04 15:49:57 +00:00
Egon Elbre	8928399d02	all: rename CreateTables to MigrateToLatest CreateTables hasn't been quite true for a while now, rename to MigrateToLatest to be clearer in its behavior. Change-Id: Ida48e95122a5d9b7a814e922d3698e00024a2ba7	2020-04-30 07:21:17 +00:00
Jessica Grebenschikov	6a6427526b	satellite/overlay: remove old updateaddress method The UpdateAddress method used to be called when storage nodes checked in with the Satellite, but once the contact service was created this method was no longer used. This PR finally removes it. Change-Id: Ib3f83c8003269671d97d54f21ee69665fa663f24	2020-04-30 06:41:48 +00:00
Moby von Briesen	de366537a8	satellite/satellitedb/overlaycache: fix behavior around gracefully exited nodes Sometimes nodes who have gracefully exited will still be holding pieces according to the satellite. This has some unintended side effects currently, such as nodes getting disqualified after having successfully exited. * When the audit reporter attempts to update node stats, do not update stats (alpha, beta, suspension, disqualification) if the node has finished graceful exit (audit/reporter_test.go TestGracefullyExitedNotUpdated) * Treat gracefully exited nodes as "not reputable" so that the repairer and checker do not count them as healthy (overlay/statdb_test.go TestKnownUnreliableOrOffline, repair/repair_test.go TestRepairGracefullyExited) Change-Id: I1920d60dd35de5b2385a9b06989397628a2f1272	2020-04-28 23:58:43 +00:00
Egon Elbre	85c45cd56f	private/dbutil/pgtest: support multiple databases for testing Currently Cockroach isn't performant for concurrent database setup and tear-down. Instead of a single instance allow setting multiple potential connection strings and let the tests pick one connection string randomly. This improves test duration by ~10 minutes. While we are at significantly changing how pgtest works, introduce helper PickPostgres and PickCockroach for selecting the database to reduce code duplications in multiple places. Change-Id: I8ad171d5c4c8a4fc081ec2ae9bdd0cc948a80619	2020-04-28 21:55:49 +03:00
Natalie Villasana	6f84be133a	satellite/metainfo: add MigrateToLatest to PointerDB In cases like the segment reaper script connecting to the metainfodb, we don't want a db migration to happen automatically when we call metainfo.NewStore. This adds MigrateToLatest method for postgreskv and cockroachkv, and calls MigrateToLatest in places where NewStore used to create tables. Change-Id: I682d0f26d609af0601dfdb32a24866cdf5d32a7e	2020-04-28 17:26:35 +00:00
Egon Elbre	ef913be234	satellite/satellitedb/satellitedbtest: don't use subtest naming A/B indicates that B is a subtest of A, however in this case they represent a configuration of the test, not a subtest. Change-Id: I64eed5d5bcb12759e54fe4b5373f8e88488e50f7	2020-04-27 19:32:09 +03:00
Ivan Fraixedes	03871d17c3	satellite/satellitedb: Update ticket ref Update a reference to a ticket in a code comment. Change-Id: Ib82220e94527482c5ca1a58d8614b919d1113ab5	2020-04-27 08:50:41 +00:00
Stefan Benten	d73630fd4a	satellite/satellitedb: Ensure we just return bucket usage for buckets that exist (#3863)	2020-04-24 22:25:16 +02:00
Moby von Briesen	720e26d235	satellite/satellitedb/overlaycache: update unknown alpha/beta values properly Update unknown_audit_reputation_alpha and unknown_audit_reputation_beta. Add test to verify that BatchUpdateStats properly modifies unknown audit alpha/beta Change-Id: I0d5f9cac96a99f64905cf575b772402db0756a9d	2020-04-23 10:40:53 -04:00
Moby von Briesen	72b93f3120	satellite/satellitedb: disqualify suspended nodes when the grace period passes If a node is suspended and receives an unknown or failing audit, disqualify them if the grace period (default 1w in production) has passed. Migrate the nodes table so any node that is currently suspended gets unsuspended when the satellite starts up. Change-Id: I7b81c68026f823417faa0bf5e5cb5e67c7156b82	2020-04-22 15:45:00 -04:00
Ethan Adams	60e07f0a8b	Revert "satellite/accounting: Remove unnecessary index bucket_bandwidth_rollups_project_id_action_interval_index" This reverts commit `105dc7acc6`. Reason for revert: Recent changes to the Postgres query plan seem to want to use this index now. Reverting until we have time to analyze what's happening. Change-Id: I74b4b5a8f15c3850d8a958a29f51dbc80e7c282c	2020-04-22 14:49:04 +00:00
Qweder93	805e328c47	storagenode/heldamount payments removed Change-Id: I87cc04f43d182a4190a571ef417be85d02db9d34	2020-04-21 17:15:31 +00:00
Ethan	105dc7acc6	satellite/accounting: Remove unnecessary index bucket_bandwidth_rollups_project_id_action_interval_index See https://storjlabs.atlassian.net/browse/SM-738 Change-Id: I9ba3cc3fbff9f13fc0b95d25feee5a19e5a5c486	2020-04-21 16:43:09 +00:00
Qweder93	6e3585e394	satellite/heldamount/endpoint : GetAllPaystubs added Change-Id: Ic8cdd9db8b2a68796f9579c7fed2d49d9054bd64	2020-04-19 19:21:54 +03:00
Ethan	4cd86ff780	satellite/accounting: Add index on bucket_bandwidth_rollups for action, interval_start, and project_id See https://storjlabs.atlassian.net/browse/SM-551 for details Change-Id: I104c4e87d5aef500cc4a3893817763808f76c484	2020-04-17 19:14:45 +00:00
Jess G	5ea1602ca5	satellite/overlay: add selected node cache (#3846) * init implementation cache Change-Id: Ia54a1943e0707a77189bc5f4a9aaa8339c98d99a * one query to init cache Change-Id: I7c04b3ae104b553ae23fca372351a4328f632c66 * add monit tracking of cache Change-Id: I7d209e12c8f32d43708b23bf2126c5d5098e0a07 * add first test Change-Id: I0646a9349d457a9eb3920f7cd2d62fb72ffc3ab5 * add staleness to cache Change-Id: If002329bfdd53a4b200ad14dbd2ffc8b280aedb8 * add init test Change-Id: I3a3d0aa74cfac1d125fa93cb749316ed2a74d5b1 * fix comment Change-Id: I73353d00ccf0952b38c0f8ef7d1755c15cbfe9d9 * mv to nodeselection pkg Change-Id: I62487f768296c7a7b597fa398a4c42daf6e9c5b7 * add state to cache Change-Id: I081e77ec0e16706faee1a267de9a7fa643d6ac11 * add refresh concurrent test Change-Id: Idcba72508291099f280edc65355273c0acc3d3ce * add a few more tests Change-Id: I9422e9eaa22bf01c11f14bdb892ebcf7b3e5e5fb * fix tests, add min version to select allnodes Change-Id: I926f41d568951ad4ff70c6d4ceb87abb1e3e5009 * update comments Change-Id: I6ffe33e245ca65fb523c880cd72e63ce35776eb9 * fixes and rm Init Change-Id: Ifbe09b668978b5d9af09ca38cb080d02a2154cf4 * fix format Change-Id: I03cc217e28dc1839190c5c6dbdbb602c132a5a38	2020-04-14 13:50:02 -07:00
Moby von Briesen	d7794a4851	satellite/overlay: hardcode default values for audit alpha/beta Alpha=1 and beta=0 are the expected first values for any alpha/beta reputation system we are using in the codebase. So we are removing the configurability of these values. Change-Id: Ic61861b8ea5047fa1438ea6609b1d0048bf0abc3	2020-04-14 19:12:40 +00:00
Cameron Ayer	02613407ae	satellite/satellitedb: only suspend node if not already suspended Whenever the node's reputation is updated, if its unknown audit reputation is below the suspension threshold, its suspension field is set to the current time. This could overwrite the previous "suspendedAt" value, resulting in a node that never reaches the end of its suspension. Also log whenever a node is disqualified or its suspension status changes. Change-Id: I5e8c8f1c46f66d79cb279b5b16a84fe03f533deb	2020-04-10 09:37:37 +00:00
Egon Elbre	d86cce202c	satellite/satellitedb: use arrays for arguments in node selection This simplifies the code and makes queries faster: name old time/op new time/op delta SelectStorageNodes-32 7.72ms ± 6% 7.22ms ± 3% -6.44% (p=0.016 n=5+5) SelectNewStorageNodes-32 7.75ms ± 2% 7.37ms ± 1% -4.89% (p=0.008 n=5+5) SelectStorageNodesExclusion-32 16.9ms ± 0% 16.6ms ± 0% -2.15% (p=0.008 n=5+5) SelectNewStorageNodesExclusion-32 17.2ms ± 0% 16.6ms ± 2% -3.69% (p=0.008 n=5+5) FindStorageNodes-32 45.5ms ± 0% 45.1ms ± 1% ~ (p=0.056 n=5+5) FindStorageNodesExclusion-32 77.4ms ± 0% 75.9ms ± 0% -1.91% (p=0.008 n=5+5) Change-Id: I38f77f6282b9738e8416113d42c6acb46c03da7b	2020-04-09 21:16:10 +03:00
Egon Elbre	ccf4f9ed2d	satellite/satellitedb: node selection code cleanup Reduce the number of non-methods to reduce funcs in the namespace also combine a func to slightly condense the code more. Change-Id: Ifbe728eb8c8ca4c981df648decd259c2097b6b40	2020-04-09 20:41:29 +03:00
Natalie Villasana	cf80b3caf3	satellite/overlay: combine SelectStorageNodes and SelectNewStorageNodes (#3831)	2020-04-09 11:19:44 -04:00
Egon Elbre	11a44cdd88	all: don't depend on gogo/proto directly Change-Id: I8822dea0d1b7b99e0b828e0373a0308a42dde2be	2020-04-08 17:32:15 +00:00
Egon Elbre	cf26951a5b	satellite/satellitedb/pbold: remove dead code Change-Id: I7464773c20b8f99a601ca9cc4bee804f1ac14cf9	2020-04-08 15:22:31 +03:00
Jeff Wendling	2ded64ba2c	satellite/compensation: more fixes to get prod running smoothly Change-Id: I13a76d9d49222fb10796415a015f224d4084fde3	2020-04-07 10:10:27 +00:00
Jennifer Johnson	1547e791a3	satellitedb: remove free_bandwidth column from nodes table Change-Id: I9d1d3de9216c6533c1042ef473631721a011d086	2020-04-06 09:30:28 +00:00
Egon Elbre	9200efc61f	satellite/satellitedb: fix selecting a nullable string Change-Id: I59e645966e09da586512c69101691b47055c1e5a	2020-04-03 21:30:20 +03:00
Egon Elbre	6492b13d81	all: remove old uuid Change-Id: I3a137f73456f010c37d3933dbe12cbbb840b809f	2020-04-02 19:30:36 +03:00
Egon Elbre	8f73fb7a32	all: simplify uuid usage uuid.UUID implements driver.Value so it can be directly used as a scannable result. Replace uses of dbutil.BytesToUUID with uuid.FromBytes. Change-Id: I51a670185ceb3cc2199d5aa2b76bc3fc191ca8fe	2020-04-02 05:48:58 +00:00
Egon Elbre	a416b03941	satellite/accounting: fix TestProjectBandwidthTotal Test was inserting for past 4 days, however the test was summing up for the current month. Change-Id: I509afdc6a76b314a6bb90652ab70cd2c2bab1288	2020-04-01 11:50:18 +03:00
Egon Elbre	0a69da4ff1	all: switch to storj.io/common/uuid Change-Id: I178a0a8dac691e57bce317b91411292fb3c40c9f	2020-03-31 19:16:41 +03:00
Qweder93	dc32f1da55	storagenode/cache/heldamount added, errNoRows ignored Change-Id: If6b675e622d6c1324c0893c43cca93dc5323cd78	2020-03-31 11:35:58 +00:00
Jeff Wendling	e2ff2ce672	satellite: compensation package and commands Change-Id: I7fd6399837e45ff48e5f3d47a95192a01d58e125	2020-03-30 14:08:14 -06:00
Jennifer Johnson	d77f3b8786	satellitedb/migrate: set vetted_at backfill to now.day Change-Id: Ib2b12be43dbd3f3705b1891bc703ae15abb75e09	2020-03-30 16:50:23 +00:00
Egon Elbre	439aba922a	satellite/overlay: reduce overhead of GetNodes Instead of filtering on the client side it's better to filter on the database side. Change-Id: I845fbbe5ed28c2ffdb0b8a3f789b59c094fd1069	2020-03-30 18:36:23 +03:00
Egon Elbre	cb781d66c7	satellite/overlay: optimize FindStorageNodes Reduce the number of fields returned from the query. Benchmark results in `satellite/overlay`: benchstat before.txt after2.txt name old time/op new time/op delta SelectStorageNodes-32 7.85ms ± 1% 6.27ms ± 1% -20.18% (p=0.002 n=10+4) SelectNewStorageNodes-32 8.21ms ± 1% 6.61ms ± 0% -19.53% (p=0.002 n=10+4) SelectStorageNodesExclusion-32 17.2ms ± 1% 15.9ms ± 1% -7.55% (p=0.002 n=10+4) SelectNewStorageNodesExclusion-32 17.8ms ± 2% 16.1ms ± 0% -9.38% (p=0.002 n=10+4) FindStorageNodes-32 48.4ms ± 1% 45.1ms ± 0% -6.69% (p=0.002 n=10+4) FindStorageNodesExclusion-32 79.2ms ± 1% 76.1ms ± 1% -3.89% (p=0.002 n=10+4) Benchmark results from `satellite/overlay` after making them parallel: benchstat before-parallel.txt after2-parallel.txt name old time/op new time/op delta SelectStorageNodes-32 548µs ± 1% 353µs ± 1% -35.60% (p=0.029 n=4+4) SelectNewStorageNodes-32 562µs ± 0% 368µs ± 0% -34.51% (p=0.029 n=4+4) SelectStorageNodesExclusion-32 1.02ms ± 1% 0.84ms ± 0% -18.08% (p=0.029 n=4+4) SelectNewStorageNodesExclusion-32 1.03ms ± 1% 0.86ms ± 2% -16.22% (p=0.029 n=4+4) FindStorageNodes-32 3.11ms ± 0% 2.79ms ± 1% -10.27% (p=0.029 n=4+4) FindStorageNodesExclusion-32 4.75ms ± 0% 4.43ms ± 1% -6.56% (p=0.029 n=4+4) Change-Id: I1d85e2764eb270f4c2b1998303ccfc1179d65b26	2020-03-30 18:36:23 +03:00
Egon Elbre	e1a443b04a	private/testplanet: allow modifying created database Instead of providing the database from outside to testplanet create it inside and then allow wrapping and modifying it. This is more convenient to use. Change-Id: I9b8f69e6e0a19ff984b4e2bfe927c9100c77bc6c	2020-03-27 19:14:48 +00:00
Ethan	df462d7265	satellite/accounting: Add index on bucket_bandwidth_rollups to minimize full table scans https://storjlabs.atlassian.net/browse/SM-545 Change-Id: I5599a72a991d70236f17beca027e9bc032777177	2020-03-26 19:53:50 +00:00
Jeff Wendling	97e980cd8a	private/dbutil: add database name to configure as a tag storagenodes have like 10 or more databases. without this tag they all get sent as the same value, stomping on each other. Change-Id: Ib12019684d6ea8f2a5b83df584056dfa79e3c4b3	2020-03-26 16:50:15 +00:00
Jennifer Johnson	b75cbc8e24	satellite,storagenode: remove references to free bandwidth Change-Id: I42a6597544804fa9235e89ec656ebc365eb522e5	2020-03-25 22:28:34 +00:00
Michal Niewrzal	fdf40a7526	storj: remove `storj/private/version` package which was moved to `storj/private` repo Change-Id: I81c3f5b9d5e4fe7bca760999eb045ee9734e5e2e	2020-03-24 14:31:33 +00:00
Jessica Grebenschikov	aeab599d21	satellitedb: removed unused id on storagenode_storage_tallies table, add index on node_id The goal of this change is to improve the storagenode_storage_tallies table by removing the unneeded id column that is not being used but only taking up space, and also to add an index on a different column that needs it. Removing and adding a column seems simple, but ended up being more complicated because of some cockroachdb limitations. The cockroachdb limitations when trying to remove a column from a table and create a new primary key are: 1. only allows primary key creation at table creation time (docs: https://www.cockroachlabs.com/docs/stable/primary-key.html) 2. table drop or rename is performed async and cannot be done in a transaction (issue: https://github.com/cockroachdb/cockroach/issues/12123, https://github.com/cockroachdb/cockroach/issues/22868) To address these differences between cockroachdb and Postgres, this PR performs different migrations for the two databases. The Postgres migration is straightforward and what you would expect, but the cockroach migration has two main changes: 1. To change a primary key, use the recommended process from the cockroachdb docs to create a new table with the new primary key you want and then migrate the data. 2. In order to do 1, we needed to do the new table renaming in a separate transaction from the data migration. Ref: SM-65 Change-Id: Idc9aee3ab57aa4d5570e3d2980afea853cd966bf	2020-03-20 14:39:44 -07:00
Jennifer Johnson	9b78473c0c	satellitedb: adds vetted_at nullable timestamp to nodes table Change-Id: I42d5a396b4eecbad26b683c6aee51e043d2ff034	2020-03-20 01:37:28 +00:00
Qweder93	0df586c3a8	satellitedb/heldamount updated, tests added + storagenode console updated Change-Id: I10f568a426d0fc42069d025de2accbef5b26dc0c	2020-03-19 15:37:45 +02:00
Jeff Wendling	115f4559e5	satellite/orders: more efficient processing of orders by doing an indexed anti-join we're able to reduce the time to select the pending orders by over 10x on postgres. this should help us process pending orders much more quickly. it probably won't do as good a job on cockroach because it does not do an indexed anti-join and instead does a hash join after scanning the entire consumed serials table. we should either remove orders entirely or try to make that more efficient when necessary. Change-Id: I8ca0535acd21c51e74955b24c9b86d20e4f2ff9c	2020-03-18 09:03:30 +00:00
Moby von Briesen	2f991b6c56	satellite/{overlay, satellitedb}: account for `suspended` field in overlay cache Make sure that suspended nodes are treated appropriately by the overlay cache. This means we should expect the following behavior: * suspended nodes (vetted or not) should not be selected for uploading new segments * suspended nodes should be treated by the checker and repairer as "unhealthy", and should be removed upon successful repair This commit also removes unused overlay functionality. Fixes a bug with commit `8b72181a1f` where the audit reporter was automatically suspending nodes regardless of audit outcome (see test added). Tests: * updates repair tests to ensure that a suspended node is treated as unhealthy and will be removed from the pointer on successful repair * updates overlay tests for KnownUnreliableOrOffline and KnownReliable to expect suspended nodes to be considered "unreliable" * adds satellitedb test that ensures overlay.SelectStorageNodes and overlay.SelectNewStorageNodes do not include suspended nodes * adds audit reporter test to ensure that different audit outcomes result in the correct suspended/disqualified states Change-Id: I40dba67278c8e8d2ce0bcec5e0a5cb6e4ce2f561	2020-03-17 17:14:56 +00:00
Michal Niewrzal	81afbcc12e	satellite/metainfo: check bucket existence on upload and listing Initial change for checking bucket existence on satellite side for requests like BeginObject and ListObjects. This is simple implementation that is just checking bucket in DB but should be improved in future to avoid DB calls as much as possible. Part of https://storjlabs.atlassian.net/browse/USR-365 Change-Id: I9076acddc44d7dbfa7612a1c24a007de01621583	2020-03-17 15:43:22 +00:00
Jeff Wendling	7baa59753a	satellite/orders: add tests for double sending the same order Change-Id: If2fa7f035257df3b04f506f81aa8b2e0916f5033	2020-03-17 14:18:03 +00:00
Ethan	bdbf764b86	satellite/orders;overlay: Consolidate order limit storage node lookups into 1 query. https://storjlabs.atlassian.net/browse/SM-449 Change-Id: Idc62cc2978fba67cf48f7c98b27b0f996f9c58ac	2020-03-16 23:15:47 +00:00
Moby von Briesen	8b72181a1f	satellite/{audit,overlay,satellitedb}: implement unknown audit reputation and suspension * change overlay.UpdateStats to allow a third audit outcome. Now it can handle successful, failed, and unknown audits. * when "unknown audit reputation" (unknownAuditAlpha/(unknownAuditAlpha+unknownAuditBeta)) falls below the DQ threshold, put node into suspension. * when unknown audit reputation goes above the DQ threshold, remove node from suspension. * record unknown audits from audit reporter. * add basic tests around unknown audits and suspension. Change-Id: I125f06f3af52e8a29ba48dc19361821a9ff1daa1	2020-03-16 20:29:26 +00:00
Stefan Benten	52590197c2	satellite/payments: More Cleanup and Satellite command to ensure we have stripe customers (#3805)	2020-03-16 20:34:15 +01:00
Qweder93	9f84261c36	storagenode/cache heldamount added Change-Id: I7fc807789de63e8a9b8ca2018fd73bdb9e01ad0d	2020-03-16 00:28:35 +02:00
Qweder93	94c4d1e737	satellite/satellitedb/heldamount added, endpoint added Change-Id: Ife8402b89f631f65ebb5cdf5ca02e99aa9b0b3ff	2020-03-13 18:15:52 +00:00
Jeff Wendling	41887883f3	satellite/satellitedb: check indexes on migration Change-Id: I5ba7ae2b512d77c70405ce332158f12128e27eed	2020-03-13 10:45:22 +00:00
Jess G	39cb821196	satellite/overlay: rm combinedcache, fix IP naming to be network (#3798) * rn combinedcache, rm dns node lookup Change-Id: I239f07211764b097d851230d8c81900a47756e9e * excludeIPs -> excludedNetworks Change-Id: Ifa6f44ab17457cdd5aff4cd5694296867c18b179 * use lowercase var name Change-Id: I825aad2b718c71f455e747be18f8cabd02aabe55 * update Getnetwork name Change-Id: I002a1b7bc6b4ef40159c0cd2b0ef209f80a9c503 * fix comments Change-Id: Ibddf5b9ffa9d685af6c392d893db063ef18e45fa * update comments with ipv6 Change-Id: I31758b7d4979e7c27d014668f4fb532ad838cda2 Co-authored-by: Stefan Benten <mail@stefan-benten.de>	2020-03-12 11:37:57 -07:00
Jessica Grebenschikov	803e2930f4	satellite: use IP for all uplink operations, use hostname for audit and repairs My understanding is that the nodes table has the following fields: - `address` field which can be a hostname or an IP - `last_net` field that is the /24 subnet of the IP resolved from the address This PR does the following: 1) add back the `last_ip` field to the nodes table 2) for uplink operations remove the calls that the satellite makes to `lookupNodeAddress` (which makes the DNS calls to resolve the IP from the hostname) and instead use the data stored in the nodes table `last_ip` field. This means that the IP that the satellite sends to the uplink for the storage nodes could be approx 1 hr stale. In the short term this is fine, next we will be adding changes so that the storage node pushes any IP changes to the satellite in real time. 3) use the address field for repair and audit since we want them to still make DNS calls to confirm the IP is up to date 4) try to reduce confusion about hostname, ip, subnet, and address in the code base Change-Id: I96ce0d8bb78303f82483d0701bc79544b74057ac	2020-03-11 09:11:40 -07:00
Moby von Briesen	1baf1bd249	satellite/satellitedb: Add index on num_healthy_pieces column in injuredsegments table We missed this in the migration that added the num_healthy_pieces column. It exists in dbx, but not on the actual satellite table. Change-Id: If16b5ec2325d56406250298531b3285215188bf3	2020-03-10 16:59:35 +00:00
paul cannon	79553059cb	satellite/repair: put irreparable segments in irreparableDB Previously, we were simply discarding rows from the repair queue when they couldn't be repaired (either because the overlay said too many nodes were down, or because we failed to download enough pieces). Now, such segments will be put into the irreparableDB for further and (hopefully) more focused attention. This change also better differentiates some error cases from Repair() for monitoring purposes. Change-Id: I82a52a6da50c948ddd651048e2a39cb4b1e6df5c	2020-03-09 21:45:16 +00:00
Egon Elbre	0675413f7a	satellite/satellitedb: increase migrate test timeout Change-Id: I789ea22ad463a6c31737e959ec54941b66830188	2020-03-09 14:30:50 +02:00
Bill Thorp	e99e675fb1	satellite/satellitedb: use time zones with all timestamps The migration was broken into one migration per table to reduce table locking and reduce the chances of failure due to SQL timeouts. Of the 14 fields that lacked time zones, only the 3 named `interval_start` seemed to have non-UTC data in them. These fields are fixed in the migration by removing the +00 and adding AT TIME ZONE current_setting('TIMEZONE'). Fields with good data are migrated by adding AT TIME ZONE 'UTC'. Note that postgres's timezone() is different than cockroach's timezone() so AT TIME ZONE is used. https://storjlabs.atlassian.net/browse/SM-104 Change-Id: I410f2f1d7c11b143f17844347f37e6f4b1e70fce	2020-03-05 21:11:25 +00:00
Jennifer Johnson	1c1750e6be	removes bandwidth limiting On satellite, remove all references to free_bandwidth column in nodes table. On storage node, remove references to AllocatedBandwidth and MinimumBandwidth and mark as deprecated. Protobuf message, NodeCapacity, is left intact for backwards compatibility. Once this is released to all satellites, we can drop the column from the DB. Change-Id: I2ff6c6537fc9008a0c5588e951afea58ede85838	2020-03-04 14:04:00 +00:00
Egon Elbre	5f2ca0338b	satellite/satellitedb: fix err and close order Change-Id: Ied927275853c4cf4a8ccb500048d50545f6c6efe	2020-03-04 09:05:22 +00:00
Moby von Briesen	f495544c56	satellite/satellitedb/dbx: add fields to node table for placing nodes into suspended mode for too many unknown-error audits Change-Id: Iac9a619e5c08377de87ffdf4acdd0155027f5eb3	2020-03-03 03:30:59 +00:00
Jeff Wendling	1db087cfba	satellite/satellitedb: migration to create tables for compensation these tables are used in future commits with respect to the new storagenode payments code. if we create them now, it will make backfilling them with historical data easier. Change-Id: I3c08c9770ec5b2baa38b4f2fd18c2f07746a61c2	2020-02-27 17:34:50 +00:00
Moby von Briesen	4e5a7f13c7	satellite/repair/queue: Prioritize selection of items off repair queue by segment health Add a column to the repair queue table in the satellite db for healthy piece count. When an item is selected from the repair queue, the least durable segment that has not been attempted in the past hour should be selected first. This prevents our repairer from getting stuck doing work on segments that are close to the repair threshold while allowing segments that are more unhealthy to degrade further. The migration also clears the repair queue so that the migration runs quickly and we can properly account for segment health in future repair work. We do not select items off the repair queue that have been attempted in the past six hours. This was changed from one hour to allow us time to try a wider variety of segments when the repair queue is very large. Change-Id: Iaf183f1e5fd45cd792a52e3563a3e43a2b9f410b	2020-02-26 09:54:16 -05:00
Jeff Wendling	f671eb2beb	satellite/satellitedb: use queue for orders to get back fast billing This change adds two new tables to process orders as fast as we used to but in an asynchronous manner and with hopefully less storage usage. This should help scale on cockroach, but limits us to one worker. It lays the groundwork for the order processing pipeline to be queue rather than database driven. For more details, see the added fast billing changes blueprint. It also fixes the orders db so that all the timestamps that are passed to columns that do not contain a time zone are converted to UTC at the last possible opportunity, making it less likely to use the APIs incorrectly. We really should migrate to include timezones on all of our timestamp columns. Change-Id: Ibfda8e7a3d5972b7798fb61b31ff56419c64ea35	2020-02-24 17:07:07 +00:00

678 Commits