storj

Author	SHA1	Message	Date
JT Olio	0ba516d405	satellite: support pointing db components at different databases the immediate need is to be able to move the repair queue back out of cockroach if we can't save it. Change-Id: If26001a4e6804f6bb8713b4aee7e4fd6254dc326	2020-11-28 18:39:16 +00:00
Moby von Briesen	75f0f713a3	satellite/repair/checker/checker.go: Use number of healthy pieces instead of SegmentHealth for injured segments queue. We did not test the SegmentHealth function with actual production values, and it turns out that values such as 52 healthy, 35 minimum result in +Inf segment health - so pretty much all segments put into the repair queue have the same health, which means we effectively aren't sorting by health. This change inserts numHealthy as segment health into the database so the segments are ordered as they were before. We need to refine the SegmentHealth function before we can support multi RS. Change-Id: Ief19bbfee3594c5dfe94ca606bc930f05f85ff74	2020-11-28 12:16:32 -05:00
Ivan Fraixedes	7eb3b2d6d0	satellite/gc: Init map with an aprox size Because the PieceTracker receives a piece count per nodes which is an approximation of the number of nodes that they are going to be reported by the metainfo loop so we can use as a good guess of the map's size and initialized with it. Change-Id: I644db40926c03e4c457457fb41d2ec1da059cea6	2020-11-27 10:44:19 +01:00
Ivan Fraixedes	319d2cad11	satellite: Fix typo in a comment Change-Id: I151b824e868db1cc1e8b8e8af9f35b027db1e6ff	2020-11-26 15:44:49 +01:00
Michal Niewrzal	8ceef9f357	satellite/metainfo: temporary disable one assertion in test This is need to merge https://review.dev.storj.io/c/storj/uplink/+/3208 , after that this code will be back. Change-Id: If9f2f1db95c7a1bba64a41c45a39bd3096a519e7	2020-11-25 13:21:41 +00:00
Egon Elbre	3792e2921c	satellite/accounting/tally: make test less fragile MetadataSize can slightly vary and checking for exact value makes difficult to change what's being encoded in metadata. Change-Id: I5f1ade41bc26d115e6743367ee35cf1ba74795c9	2020-11-25 13:33:24 +02:00
Kaloyan Raev	53b7fd7b00	satellite/{audit,gracefulexit}: remove logic for PieceHashesVerified We now have the piece hashes verified for all segments on all production satellites. We can remove the code that handles the case where piece hashes are not verified. This would make easier the migration of services from PointerDB to the new metabase. For consistency, PieceHashesVerified is still set to true in PointerDB for new segments. Change-Id: Idf0ccce4c8d01ae812f11e8384a7221d90d4c183	2020-11-24 11:09:48 +02:00
Egon Elbre	9de1617db0	satellite/orders: ensure encryption keys handles set twice Currently flag parsing seems to call Set twice, which causes problems with encryption keys. We can clear for every set for now. Change-Id: Id5c695b4020194ac1c50a2da9c7d2a896cb9216f	2020-11-23 19:47:22 +00:00
Moby von Briesen	575f50df84	satellite/repair: Update repair override config to support multiple RS schemes. Rather than having a single repair override value, we will now support repair override values based on a particular segment's RS scheme. The new format for RS override values is "k/o/n-override,k/o/n-override..." Change-Id: Ieb422638446ef3a9357d59b2d279ee941367604d	2020-11-23 18:01:15 +00:00
Egon Elbre	55d5e1fd7d	satellite/orders: ensure that expired deletion doesn't stall Add checks to ensure that when somebody uses empty options, the deletion doesn't loop infinitely. Change-Id: I1738fb1e7e1f8efbbb954c491cb6489f7bcdc2db	2020-11-23 14:52:40 +02:00
Jessica Grebenschikov	5beb2f5737	satellite/orders: add factory function to encryption key Change-Id: I9a1020c63e4ebc6d73683cf1749366e9b9f20f07	2020-11-20 11:40:15 -08:00
Ethan	2b92bba563	satellite/satellitedb/orders: Handle serial_numbers deletes in smaller increments on CRDB CRDB doesn't like large deletes. While testing in the POC environment we found that deletes on the serial_numbers table could take hours. This change limits deletes to 1000 at a time (configurable) to avoid blocking other queries. Change-Id: I08455e25db1574579dd4d7b7125a08e9c913dff1	2020-11-20 13:44:52 +00:00
Moby von Briesen	a8b66dce17	satellite/accounting: account for old orders that can be submitted in satellite rollup With the new phase 3 order submission, orders can be added to the storage and bandwidth rollup tables at timestamps before the most recent rollup was run. This change shifts the start time of each new rollup window to account for any unexpired orders that might have been added since the previous rollup. A satellitedb migration is necessary to allow upserts in the accounting_rollups table when entries with identical node_ids and start_times are inserted. Change-Id: Ib3022081f4d6be60cfec8430b45867ad3c01da63	2020-11-18 14:46:00 -05:00
Egon Elbre	aeb801604e	{satellite,storagenode}/orders: fix flaky tests Before manipulating order information on storagenodes we need to wait for the orders to propagate to the database. Some of that happens async with uplink. Change-Id: Iaacfd7db0909ab5d2831d06388e5fb27b6d4778f	2020-11-18 13:44:02 +00:00
paul cannon	2b59640f18	cmd/satellite: ignore Canceled in exit from repair worker Firstly, this changes the repair functionality to return Canceled errors when a repair is canceled during the Get phase. Previously, because we do not track individual errors per piece, this would just show up as a failure to download enough pieces to repair the segment, which would cause the segment to be added to the IrreparableDB, which is entirely unhelpful. Then, ignore Canceled errors in the return value of the repair worker. Apparently, when the worker returns an error, that makes Cobra exit the program with a nonzero exit code, which causes some piece of our deployment automation to freak out and page people. And when we ask the repair worker to shut down, "canceled" errors are what we _expect_, not an error case. Change-Id: Ia3eb1c60a8d6ec5d09e7cef55dea523be28e8435	2020-11-17 21:37:59 +00:00
Moby von Briesen	0ec685b173	satellite/{satellitedb, repair/{queue, checker}}: Use new column "segmentHealth" instead of "numHealthy" in injured segments queue We plan to add support for a new Reed-Solomon scheme soon, but our repair queue orders segments by least number of healthy pieces first. With a second RS scheme, fewer healthy pieces will not necessarily correlate to lower health. This change just adds the new column in a migration. A separate change will add the new health function. Right now, since we only support one RS scheme, behavior will not change. Number of healthy pieces is being inserted as "segment health" until the new health function is merged. Segment health is calculated with a new priority function created in commit `3e5640359`. In order to use the function, a new config value is added, called NodeFailureRate, representing the approximate probability of any individual node going down in the duration of one checker run. Change-Id: I51c4202203faf52528d923befbe886dbf86d02f2	2020-11-16 21:18:09 +00:00
VitaliiShpital	51a712f9e8	satellite/console: get all bucket names endpoint and service method WHAT: new endpoint for fetching all bucket names WHY: used by new access grant flow Change-Id: I356a3381359665fd2726120139b34b1e611fe3c4	2020-11-16 17:51:40 +02:00
Jessica Grebenschikov	f558cc825e	satellite/orders: add storagenode_bw_phase2 table and dont delete tallies for longer It turns out we need to make 2 more changes in order for the new order submission phase 3 to get deployed. This PR makes 2 changes: 1) when the rollup service deletes tallies, we now keep tallies around until orders expire (vs 1 day like before). 2) the reported rollup chore will now write the storagenode_bandwidth_rollups to a new table _phase2 as an intermediary step so it doesn't conflict with phase 3 order settlement. These changes need to be deployed for 2 days before we can turn on phase 3 of the new orders settlement workflow. Change-Id: Iafbff577ba7d55f8f17b7db857311b2ce799de60	2020-11-13 17:15:24 +00:00
Yaroslav Vorobiov	1b4bfbb9d2	multinode/console: nodes addition and removal Change-Id: I60c685953a8d0e24f78b1414c34a28d4b87863b0	2020-11-12 20:26:08 +02:00
Jessica Grebenschikov	226e13e616	satellite/cosole: add tests for wasm access code Change-Id: I78f71b2f0bef03b6e87cd7d79ccaef5f45393b55	2020-11-12 08:03:36 -08:00
paul cannon	3e56403599	satellite/repair: add a repair health function This will be used to rank segments in need of repair for attention by the repair workers. Change-Id: I5b70650cec933696b4c6d73bb7efb97e3efdf24a	2020-11-11 18:48:51 +00:00
Jeff Wendling	31533ed1a1	satellite/console/wasm: remove storj.io/uplink deependency Change-Id: Iee95389e4ba24618e31aff7be44d05377b2e2419	2020-11-11 16:51:14 +00:00
Cameron Ayer	da9f1f0611	satellite/repair: add monkit counter for segments below minimum required The current monkit reporting for "remote_segments_lost" is not usable for triggering alerts, as it has reported no data. To allow alerting, two new metrics "checker_segments_below_min_req" and "repairer_segments_below_min_req" will increment by zero on each segment unless it is below the minimum required piece count. The two metrics report what is found by the checker and the repairer respectively. Change-Id: I98a68bb189eaf68a833d25cf5db9e68df535b9d7	2020-11-11 12:48:23 +00:00
Yingrong Zhao	2ce3170bb4	satellite/console/wasm: expose method to add caveats in the browser This PR does the following three things: 1. Defines a high-level interface for this wasm package - All return value from this package will be wrapped with an result object that contains a value field and an error field 2. Exposes two new functions to allow users to add permissions for a given API key - newPermission() - setAPIKeyPermission() 3. Adds API documentation for the newly added API functions Change-Id: Id995189702b369bba18fa344bef4ddfb0f3f1f44	2020-11-10 20:10:53 +00:00
Brandon Iglesias	3ba52b25a9	satellite/rewards: update partners to include MAXN	2020-11-10 14:08:32 +02:00
Moby von Briesen	db6bc6503d	satellite/metainfo: Update metainfo RS config to more easily support multiple RS schemes. Make metainfo.RSConfig a valid pflag config value. This allows us to configure the RSConfig as a string like k/m/o/n-shareSize, which makes having multiple supported RS schemes easier in the future. RS-related config values that are no longer needed have been removed (MinTotalThreshold, MaxTotalThreshold, MaxBufferMem, Verify). Change-Id: I0178ae467dcf4375c504e7202f31443d627c15e1	2020-11-09 22:16:13 +00:00
Cameron Ayer	d63b7658e8	satellite/repair: fix lastSeenSegmentKey bug in IrreparableProcess A change was made to use a metabase.SegmentKey (a byte slice alias) as the last seen item to iterate through the irreparable DB in a for loop. However, this SegmentKey was not initialized, thus it was nil. This caused the DB query to return nothing, and healthy segments could not be cleaned out of the irreparable DB. Change-Id: Idb30d6fef6113a30a27158d548f62c7443e65a81	2020-11-09 14:48:15 +00:00
VitaliiShpital	f8c3848c78	satellite/console: change user's email endpoint/feature WHAT: change user's email endpoint and appropriate service method was implemented WHY: make it possible to change user's email for temporary filezilla account Change-Id: Ieea41bf49819a42b5f433e8dfaeec24c6d5ddc9f	2020-11-06 11:54:07 +00:00
jessicagreben	c4c29e370a	wasm: add webassembly code for creating access grant in console web UI Change-Id: I3c6d9afc660f3d959d6138db84341e9460b877a1	2020-11-04 12:08:30 -08:00
Ivan Fraixedes	2dffaebc6f	satellite/accounting: Fix and enhance code doc comments Fix and enhance the source code documentation comments for the satellite/accounting packaged. Change-Id: I965742cf378e8b6b80d18bc84a4ff76e9af1e8b7	2020-11-04 09:50:48 +00:00
paul cannon	8616fc146d	satellite/orders: send IPs for graceful exit Storage nodes undergoing Graceful Exit have up to now been receiving hostnames for all other storage nodes they need to contact when transferring pieces. This adds up to a lot of DNS lookups, which apparently overwhelm some home routers. There does not seem to be any need for us to send hostnames for graceful exit as opposed to IP addresses; we already use IP addresses (as given by the last_ip_port column in the nodes table) for all the GET and PUT orders we send out. This change causes IP addresses to be used instead. I started trying to construct a test to ensure that the behavior changed, but it was rabbit-holing, so I've begun to feel that maybe this change doesn't require one; it is a very simple change, and very much of the same nature as what we already do for IPs in CreateGetOrderLimits and CreatePutOrderLimits (and others). Change-Id: Ib2b5ffe7a9310e9cdbe7464450cc7c934fa229a1	2020-11-04 00:17:20 +00:00
Cameron Ayer	dc67ce74c9	satellite: remove IsUp field from overlay.UpdateRequest With the new overlay.AuditOutcome type for offline audits, the IsUp field is redundant. If AuditOutcome != AuditOffline, then the node is online. In addition to removing the field itself, other changes needed to be made regarding the relationship between 'uptime' and 'audits'. Previously, uptime and audit outcome were completely separated. For example, it was possible to update a node's stats to give it a successful/failed/unknown audit while simultaneously indicating that the node was offline by setting IsUp to false. This is no longer possible under this changeset. Some test which did this have been changed slightly in order to pass. Also add new benchmarks for UpdateStats and BatchUpdateStats with different audit outcomes. Change-Id: I998892d615850b1f138dc62f9b050f720ea0926b	2020-11-02 15:34:17 -05:00
Egon Elbre	7183dca6cb	all: fix defers in loop defer should not be called in a loop. Change-Id: Ifa5a25a56402814b974bcdfb0c2fce56df8e7e59	2020-11-02 15:06:38 +02:00
Egon Elbre	fd8e697ab2	{satellite,storagenode}/internalpb: use specific package name Ensure we don't register types with the same name into protobuf. Change-Id: I53d025863fff8c91a067ca5819befa87eb5e35bb	2020-10-30 17:31:08 +02:00
Michal Niewrzal	0205f0d807	satellite/metainfo: fix usage of types from internalpb After moving SatStreamID and SatSegmentID from common I missed changing some methods in metainfo endpoint. This change is a fix for that. Change-Id: I34e121fce47371ee4cfd92cce03809520b68859f	2020-10-30 16:03:45 +02:00
Egon Elbre	77c4f99fa0	satellite/internalpb: move delegated_repair.proto Change-Id: If4f37c52b151e09cf35d2145b463ef1e9ab529ae	2020-10-30 15:31:32 +02:00
Egon Elbre	11338e9beb	satellite/internalpb: move audithistory.pb Change-Id: I8eee84d49ed90459168ddaf04ae57f790c2a22c4	2020-10-30 15:30:11 +02:00
Egon Elbre	7ce372c686	satellite/internalpb: add inspectors Change-Id: Ib688e43d05135c0c31ae95df533f1e4535ea396a	2020-10-30 13:28:17 +02:00
Egon Elbre	004e610d0f	satellite/internalpb: move datarepair.pb to internal Change-Id: If901d9ff4e5ee6715b963eeeb46513a602a44b3d	2020-10-30 13:28:14 +02:00
Michal Niewrzal	8f26f66da0	internalpb: move satellite specific protobuf types storj/storj We have some types that are only valid for satellite usage. Such types are SatStreamID and SatSegmentID. This change moves those types to storj/storj and adds basic infrastructure for generating code. Change-Id: I1e643844f947ce06b13e51ff16b7e671267cea64	2020-10-30 08:49:16 +00:00
Egon Elbre	e0dca4042d	all: add pprof labels for debugger By using pprof.Labels debugger is able to show service/peer names in goroutine names. Change-Id: I5f55253470f7cc7e556f8e8b87f746394e41675f	2020-10-29 15:10:07 +00:00
Egon Elbre	caefde6b32	private/{dbutil,tagsql}: pass ctx to database opening Database opening usually dial and hence we should pass ctx to them. Change-Id: Iaa2875981570d83e65be3710f841cf30349f807b	2020-10-29 10:51:29 +00:00
Egon Elbre	e3985799a1	storage/{cockroachkv,postgreskv}: add ctx to opening Database opening usually dial and hence we should pass ctx to them. Change-Id: Iecf41241aaa94d54506cbc80b0e53449848d8819	2020-10-29 10:49:08 +00:00
Egon Elbre	9b2e00a38b	satellite: pass ctx into satellitedb.Open Opening a database requires ctx, this is first step to passing ctx to the appropriate level. Change-Id: Ic303e69f868ef3449ae36377937a29670cf635e2	2020-10-29 06:38:37 +00:00
littleskunk	ed1f6d7973	satellite/config: move repair override from config to default (#3958 ) Co-authored-by: Igor <38665104+ihaid@users.noreply.github.com>	2020-10-28 17:24:39 +02:00
Michal Niewrzal	cb1fea87f8	satellite/metainfo: mark unused methods as 'not implemented' Some of metainfo endpoint methods are not used but we still have implementation there. This change removes unused code and returns unimplemented error for those methods. Change-Id: I74e75e0caff76a4f5d119ee989b687b4e9d6e6f9	2020-10-28 12:42:47 +00:00
Michal Niewrzal	1adb497a71	satellite/metainfo: remove unused code This change removed unused 'createRequests' struct. As far I remember it was used to help validating old metainfo beginObject/commitObject flow. Change-Id: I0f139b9934196d73f26eafa347ba5605722f3a55	2020-10-28 12:40:14 +01:00
Egon Elbre	76f4619a9c	{satellite,storagenode}/gracefulexit: ensure client is closed Change-Id: I576a955a5578caf7fcbee832beca28cef2b0c83e	2020-10-27 23:27:07 +02:00
Kaloyan Raev	92a2be2abd	satellite/metainfo: get away from using pb.Pointer in Metainfo Loop As part of the Metainfo Refactoring, we need to make the Metainfo Loop working with both the current PointerDB and the new Metabase. Thus, the Metainfo Loop should pass to the Observer interface more specific Object and Segment types instead of pb.Pointer. After this change, there are still a couple of use cases that require access to the pb.Pointer (hence we have it as a field in the metainfo.Segment type): 1. Expired Deletion Service 2. Repair Service It would require additional refactoring in these two services before we are able to clean this. Change-Id: Ib3eb6b7507ed89d5ba745ffbb6b37524ef10ed9f	2020-10-27 13:06:47 +00:00
Cameron Ayer	bb7be23115	satellite/{audit,overlay,satellitedb}: enable reporting offline audits - Remove flag for switching off offline audit reporting. - Change the overlay method used from UpdateUptime to BatchUpdateStats, as this is where the new online scoring is done. - Add a new overlay.AuditOutcome type: AuditOffline. Since we now use the same method to record offline audits as success, failure, and unknown, we need to distinguish offline audits from the rest. Change-Id: Iadcfe10cf13466fa1a1c2dc542db8994a6423355	2020-10-27 10:44:46 +00:00

1 2 3 4 5 ...

1514 Commits