storj

Author	SHA1	Message	Date
Michal Niewrzal	426c8eb31a	private/testplanet: add DeleteBucket method for uplink New method added to make it easy to delete a bucket during tests. Change-Id: Iaae89618cc676ddbbbd4b0df2eeacd143ea6f3c2	2020-02-11 15:58:13 +00:00
Jeff Wendling	7999d24f81	all: use monkit v3 this commit updates our monkit dependency to the v3 version where it outputs in an influx style. this makes discovery much easier as many tools are built to look at it this way. graphite and rothko will suffer some due to no longer being a tree based on dots. hopefully time will exist to update rothko to index based on the new metric format. it adds an influx output for the statreceiver so that we can write to influxdb v1 or v2 directly. Change-Id: Iae9f9494a6d29cfbd1f932a5e71a891b490415ff	2020-02-05 23:53:17 +00:00
Egon Elbre	8dea4f52db	satellite: add control panel Change-Id: Id48246e9bcd4c6ec643277fe740937b2e42ad85b	2020-01-30 08:06:43 -05:00
paul cannon	8ce9ce7f0f	satellite/gracefulexit: wait for errgroup to return credit to Yingrong Change-Id: I538371040d4dcdf6e943c61e8454320fd57b7526	2020-01-28 19:26:43 +00:00
Jeff Wendling	26e33e7e07	satellite/gracefulexit: make orders with right bucket id and action paths are organized as follows: project_id/segment_index/bucket_name/encrypted_key so by picking parts[0] and parts[1], we were using the segment index instead of the bucket name, causing bandwidth to be accounted for incorrectly. additionally, we were using the PUT action instead of the PUT_GRACEFUL_EXIT action, causing the data to be charged incorrectly. we use PUT_REPAIR for now because nodes won't accept uploads with PUT_GRACEFUL_EXIT and our tables need migrations to handle rollups with it. Change-Id: Ife2aff541222bac930c35df8fcf76e8bac5d60b2	2020-01-24 19:27:38 +00:00
Michal Niewrzal	6502454947	satellite/metainfo: move RS configuration to satellite With this change RS configuration will be set on satellite. Uplink will get RS values with the BeginObject request and use them. For backward compatibility and to avoid a super large change, the redundancy scheme stored with the bucket is not touched. This can be done in the future. Change-Id: Ia5f76fc10c37e2c44e4f7b8754f28eafe1f97eff	2020-01-22 09:33:53 +00:00
Egon Elbre	0c0b47823d	satellite: use require.WithinDuration Noticed that assert/require has WithinDuration for comparing time.Time-s. Change-Id: Ia340896443f610d38799b7ef245b5775eecfc92b	2020-01-21 19:43:53 +02:00
Egon Elbre	f3b4bf2b7c	satellite/satellitedb/satellitedbtest: pass ctx as an argument ctx is created in most tests, instead pass in as argument to reduce code duplication. Change-Id: I466c51c008392001129c8b007c9d6b3619935ac4	2020-01-20 16:35:42 +02:00
Egon Elbre	a4026f97b8	satellite: fix test time comparisons Correct way to compare time that may have an error is to use InDelta. Change-Id: I0140892119c44c63fa042bbc7292ab91bb33a350	2020-01-20 10:17:20 +00:00
Egon Elbre	d5438036b5	{satellite,storagenode}/gracefulexit: reduce logging Change-Id: I9f274ede77a582fc43ef14a47bf9341d4e3083df	2020-01-19 22:36:13 +02:00
Yingrong Zhao	76ee8a1b4c	satellite: remove UptimeReputation configs from codebase With the new storage node downtime tracking feature, we need to remove the current uptime reputation configs: UptimeReputationAlpha, UptimeReputationBeta, and UptimeReputationDQ. This is the first step of removing the uptime reputation columns from satellitedb. Change-Id: Ie8fab13295dbf545e33aeda0c4306cda4ba54e36	2020-01-08 18:54:15 +00:00
Egon Elbre	082ec81714	uplink: move to storj.io/uplink (#3746 )	2020-01-08 15:40:19 +02:00
Natalie Ventura Villasana	1cb0f80a8d	satellite/gracefulexit: dq node on exit fail Disqualifies a node when the node fails to complete a graceful exit. Adds a new DisqualifyNode method to the overlay cache, since there wasn't an existing method to disqualify a node but do nothing else to its stats. Adds checks to existing tests to make sure that a storage node that fails a graceful exit is marked as disqualified in the overlay cache. https://storjlabs.atlassian.net/browse/V3-3342 Change-Id: I4d554a519ab59db31ad3b8e28764c8683a6e3888	2020-01-06 19:16:26 -05:00
Egon Elbre	2680bae88c	private/testplanet: remove dependency to uplink Remove direct dependency on uplink.RSConfig, this simplifies moving the config file without introducing weird dependencies. Change-Id: I7fd2a145401e0205d7047631df9d2810241efeec	2020-01-02 09:40:46 +00:00
Natalie Ventura Villasana	aa3e183c2e	satellite/gracefulexit: add ge eligibility check Adds check to see if storage nodes are eligible to initiate graceful exit, by checking their CreatedAt date and seeing if their "age" is greater than the new config value: NodeMinAgeInMonths The default for this value is 6 months for now. https://storjlabs.atlassian.net/browse/V3-3357 Change-Id: Ib807ab8987ddb5a38a27a83886490f73fe8c5816	2019-12-31 09:31:58 -05:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Egon Elbre	d55288cf68	pkg/rpc: replace methods with direct calls to pb Change-Id: I8bd015d8d316a2c12c1daceca1d9fd257f6f57bc	2019-12-22 17:12:43 +02:00
Ethan	b959ccbae6	satellite/gracefulexit: Use proper rpc status codes for disqualified nodes and too many connections Change-Id: I41380026175e7678c7cd3d44211de8eb86ce4d0f	2019-12-20 19:05:28 +00:00
Egon Elbre	afe05edff2	{storagenode,satellite}/gracefulexit: ensure workers finish their work Fixes a data race caused by not waiting for workers to finish before shutting down. Currently this ended up failing logging because it was closed when test tried to write to it. Change-Id: I074045cd83bbf49e658f51353aa7901e9a5d074b	2019-12-17 17:21:52 +02:00
Egon Elbre	7a36507a0a	private/testcontext: ensure we call cleanup everywhere Change-Id: Icb921144b651611d78f3736629430d05c3b8a7d3	2019-12-17 14:16:09 +00:00
littleskunk	6ab72a6e79	satellite/gracefulexit: enable graceful exit in production Change-Id: I526ce4a4de9c318f1333b793e3167f5f86d65adc	2019-12-09 17:32:34 +00:00
Ethan Adams	bba92911cc	fix calculation of durability ratio (#3656 )	2019-11-26 12:04:48 -05:00
Maximillian von Briesen	1339252cbe	satellite/gracefulexit: refactor concurrency (#3624 ) Update PendingMap structure to also handle concurrency control between the sending and receiving sides of the graceful exit endpoint.	2019-11-21 17:03:16 -05:00
Egon Elbre	ee6c1cac8a	private: rename internal to private (#3573 )	2019-11-14 21:46:15 +02:00
Natalie Villasana	1a9757a7f2	satellite/gracefulexit: add count for order limits sent from satellite to exiting node (#3544 )	2019-11-13 09:54:50 -05:00
Yingrong Zhao	69b0ae02bf	satellite/gracefulexit: separate functional code in endpoint (#3476 )	2019-11-08 13:57:51 -05:00
Yingrong Zhao	6331f839ae	satellite/gracefulexit: not allow disqualified node to graceful exit (#3493 )	2019-11-07 12:19:34 -05:00
Ethan Adams	f3dccb56b1	satellite/gracefulexit: Check if pointer has been overwritten or deleted before sending transfer message. (#3481 )	2019-11-07 11:13:05 -05:00
Natalie Villasana	68a7790069	satellite/gracefulexit: select new node filtered by Distinct IP (#3435 )	2019-11-06 16:38:51 -05:00
Egon Elbre	cc032d3151	satellite/metainfo: fix some uses of metainfo.Delete (#3513 ) * satellite/metainfo: rename Delete to UnsynchronizedDelete * fix deletes * make db private * fix typos * also verify on commit object	2019-11-06 18:02:14 +01:00
littleskunk	7eb6724c92	logging: unify logging around satellite ID, node ID and piece ID (#3491 ) * logging: unify logging around satellite ID, node ID and piece ID * unify segment index	2019-11-05 22:04:07 +01:00
Ethan Adams	2eb0cc56fe	satellite/gracefulexit: Check if node already has a piece in the pointer (#3434 )	2019-11-05 14:13:45 -05:00
Maximillian von Briesen	78fedf5db3	satellite/gracefulexit: handle piece not found messages from storagenode (#3456 ) * If a node claims to fail a transfer due to piece not found, remove that node from the pointer, delete the transfer queue item. * If the pointer is piece hash verified, penalize the node. Otherwise, do not penalize the node.	2019-11-05 10:04:39 -05:00
Maximillian von Briesen	f9df0ea591	satellite/gracefulexit: check for unknown error in graceful exit disable test Allow error in graceful exit disable test to be rpcstatus.Unimplemented (grpc) or rpcstatus.Unknown (drpc)	2019-11-01 17:21:30 -04:00
Maximillian von Briesen	590312970d	satellite/gracefulexit: add flag for enabling/disabling graceful exit on the satellite (#3437 )	2019-11-01 16:21:24 +02:00
Ethan Adams	43103ae13f	lower storage node counts in tests (#3427 )	2019-10-31 10:57:54 -04:00
Natalie Villasana	4878135068	satellite/gracefulexit, storagenode/gracefulexit: add timeouts (#3407 )	2019-10-30 13:40:57 -04:00
Egon Elbre	65a8e0bcbc	{satellite,storagenode}/gracefulexit: clearer log messages (#3413 )	2019-10-30 10:21:27 +02:00
Maximillian von Briesen	54594e79c3	satellite/gracefulexit: add metrics on satellite for graceful exit (#3355 )	2019-10-29 16:22:20 -04:00
Maximillian von Briesen	cd0940724c	satellite/gracefulexit: use sync2 cycle inside satellite graceful exit endpoint (#3394 )	2019-10-29 14:40:42 -04:00
Maximillian von Briesen	cd3d3850f9	satellite/gracefulexit: only allow one connection per node to graceful exit endpoint (#3357 )	2019-10-29 13:23:17 -04:00
Ethan Adams	7c2daa4dd9	Use original pointer when calling UpdatePieces (#3397 )	2019-10-28 14:43:46 -04:00
Ethan Adams	9905f2c61e	add piece num to transfer queue PK (#3390 )	2019-10-28 11:08:33 -04:00
Yingrong Zhao	292e64ee2f	satellite/gracefulexit: check duplicate node id before update pointer (#3380 ) * check duplicate node id before update pointer * add test for transfer failure when pointer already contains the receiving node id * check exiting and receiving nodes are still in the pointer * check node id only exists once in a pointer * return error if the existing node doesn't match with the piece info in the pointer * try to recreate the issue on jenkins * should not remove exiting node piece in test * Update satellite/gracefulexit/endpoint.go Co-Authored-By: Maximillian von Briesen <mobyvb@gmail.com> * Update satellite/gracefulexit/endpoint.go Co-Authored-By: Maximillian von Briesen <mobyvb@gmail.com>	2019-10-27 14:20:22 -04:00
Maximillian von Briesen	a4e618fd1f	handle context cancelled in satellite graceful exit endpoint (#3388 )	2019-10-27 10:14:25 -04:00
Ethan Adams	e54d290d2e	satellite/gracefulexit: Add signatures for success/failed exit finished messages. (#3368 ) * add signatures, fix process loop bug, move delete to on success * added tests for signatures * PR comment updates * fixed setting reason by default. * updates for PR comments * added signed failure when verification fails * moved to sign_test * fix panic * removed testplanet from test	2019-10-25 16:36:26 -04:00
Maximillian von Briesen	6df4d7bc73	storagenode/gracefulexit + satellite/gracefulexit: add storagenode-side transfer validation (#3371 ) * Make the exiting node check piece hashes, piece IDs, and piece hash signatures before relaying successful transfer data to the satellite. * Enable immediate graceful exit failure for "successful" transfers that fail satellite-side validation. * Move transfer piece logic in storagenode worker to separate function (to make the worker easier to understand)	2019-10-25 13:16:20 -04:00
Ethan Adams	f0caa6ce5e	satellite/gracefulexit: Update pointer after success (#3369 )	2019-10-25 11:14:22 -04:00
Natalie Villasana	696c567e89	satellite/gracefulexit: add piece hash validation for successful transfer (#3313 )	2019-10-24 15:38:40 -04:00
Yingrong Zhao	fa1ac24e19	satellite/gracefulexit: add failure threshold check (#3329 ) * add overall failure percentage check and inactive time frame check before sending a response to sno * update comment * delete node from transfer queue if it has been inactive for too long * fix linting error * add test config value * fix nil pointer * add config value into testplanet * add unit test for overall failure threshold * move timeframe threshold to chore * update protolock * add chore test * add per piece failure count logic * change config name from EndpointMaxFailures to MaxFailuresPerPiece * address comments * fix linting error * add error handling for no row returned from progress table * fix test for graceful exit chore on storagenode * fix typo InActive -> Inactive * improve readability for failure threshold calculation * update config lock * change error handling for GetProgress in graceful exit endpoint on the satellite side * return proper rpc error in endpoint * add check in chore test for checking finish timestamp and queue	2019-10-24 12:24:42 -04:00
Maximillian von Briesen	abb567f6ae	cmd/satellite: add graceful exit reports command to satellite CLI (#3300 ) * update lock file and add comment * add created at and bytes transferred * cleanup * rename db func to GetGracefulExitNodesByTimeFrame * fix flag * split into two overlay functions * := to = * fix test * add node not found error class * fix overlay test * suggested test changes * review suggestions * get exit status from overlay.Get() * check rows.Err * fix panic when ExitFinishedAt is nil * fix comments in cmdGracefulExit	2019-10-22 21:06:01 -04:00
Ethan Adams	3e0d12354a	storagenode/gracefulexit: Implement storage node graceful exit worker - part 1 (#3322 )	2019-10-22 16:42:21 -04:00
Natalie Villasana	22d0f89941	satellite/gracefulexit: use zap.Stringer instead of zap.String (#3299 )	2019-10-17 10:29:35 -04:00
Ethan Adams	37ab84355f	satellite/gracefulexit: protobuf field name updates (#3284 ) rename piece_id to original_piece_id	2019-10-15 15:59:12 -04:00
Ethan Adams	78ccf14837	fix interface and EOF check (#3251 )	2019-10-12 07:06:20 -06:00
Ethan Adams	a1275746b4	satellite/gracefulexit: Implement the 'process' endpoint on the satellite (#3223 )	2019-10-11 17:18:05 -04:00
Ethan Adams	4c4519f0be	satellite/gracefulexit: add transfer queue for pieces (#3174 ) initial impl of transfer queue updated docs represent the new design how we handle durability during exit	2019-10-07 16:38:05 -04:00
Natalie Villasana	4f2f8ae11b	satellite/overlay: add UpdateExitStatus and GetExitingNodes for graceful exit (#3087 )	2019-10-01 18:18:21 -04:00
Ethan Adams	9edfb6efe0	satellite/satellitedb: Initial GE Satellite DB Implementation (#3049 ) Initial GE Satellite DB impl Add basic CRUD operations for graceful_exit_progress and graceful_exit_transfer_queue tables.	2019-09-25 11:12:44 -06:00


159 Commits
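
Note on commit 26e33e7e07 (2020-01-24): its message says paths are laid out as project_id/segment_index/bucket_name/encrypted_key, so taking parts[0] and parts[1] yields the segment index where the bucket name was intended. The Go sketch below only illustrates that indexing mistake. The helper name splitSegmentPath and the sample path values are hypothetical and are not taken from the storj codebase; the actual fix may look quite different.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// splitSegmentPath is a hypothetical helper (not a storj API). It splits a
// path of the form project_id/segment_index/bucket_name/encrypted_key into
// its four components. The encrypted key may itself contain slashes, so the
// split is limited to four parts.
func splitSegmentPath(path string) (projectID, segmentIndex, bucket, encryptedKey string, err error) {
	parts := strings.SplitN(path, "/", 4)
	if len(parts) < 4 {
		return "", "", "", "", errors.New("path has fewer than four components")
	}
	// parts[1] is the segment index; the bucket name is parts[2].
	return parts[0], parts[1], parts[2], parts[3], nil
}

func main() {
	// Made-up sample path, for illustration only.
	project, _, bucket, _, err := splitSegmentPath("project-1/s0/photos/enc/key")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(project, bucket) // prints "project-1 photos"
}
```

Returning named components rather than a raw slice keeps callers from indexing the wrong position, which is the class of mistake the commit message describes.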