storj

Author	SHA1	Message	Date
Ivan Fraixedes	3c8f1370d2	[v3 2137] - Add more info to find out repair failures (#2623 ) * pkg/datarepair/repairer: Track always time for repair Make a minor change in the worker function of the repairer, that when successful, always track the metric time for repair independently if the time since checker queue metric can be tracked. * storage/postgreskv: Wrap error in Get func Wrap the returned error of the Get function as it is done when the query doesn't return any row. * satellite/metainfo: Move debug msg to the right place NewStore function was writing a debug log message when the DB was connected, however it was always writing it out despite if an error happened when getting the connection. * pkg/datarepair/repairer: Wrap error before logging it Wrap the error returned by process which is executed by the Run method of the repairer service to add context to the error log message. * pkg/datarepair/repairer: Make errors more specific in worker Make the error messages of the "worker" method of the Service more specific and the logged message for such errors. * pkg/storage/repair: Improve error reporting Repair In order of improving the error reporting by the pkg/storage/repair.Repair method, several errors of this method and functions/methods which this one relies one have been updated to be wrapper into their corresponding classes. * pkg/storage/segments: Track path param of Repair method Track in monkit the path parameter passed to the Repair method. * satellite/satellitedb: Wrap Error returned by Delete Wrap the error returned by repairQueue.Delete method to enhance the error with a class and stack and the pkg/storage/segments.Repairer.Repair method get a more contextualized error from it.	2019-07-23 16:28:06 +02:00
Ivan Fraixedes	79e9b62f6d	pkg/storage/segments: Clarify logic in Repair method (#2621 ) Create a new variable rather than reusing the existing one because the name of the existing one is confusing when reading the logic and it requires more time that the logic doesn't have a bug.	2019-07-23 14:16:32 +02:00
Ivan Fraixedes	f420b29d35	[V3-1927] Repairer uploads to max threshold instead of success… (#2423 ) * pkg/datarepair: Add test to check num upload pieces Add a new test for ensuring the number of pieces that the repair process upload when a segment is injured. * satellite/orders: Don't create "put order limits" over total Repair must not create "put order limits" more than the total count. * pkg/datarepair: Update upload repair pieces test Update the test which checks the number of pieces which are uploaded during a repair for using the same excess over the success threshold value than the implementation. * satellites/orders: Limit repair put order for not being total Limit the number of put orders to be used by repair for only uploading pieces to a % excess over the successful threshold. * pkg/datarepair: Change DataRepair test to pass again Make some changes in the DataRepair test to make pass again after the repair upload repaired pieces only until a % excess over success threshold. Also update the steps description of the DataRepair test after it has been changed, to match on what's now, besides to leave it more generic for avoiding having to update it on minimal future refactorings. * satellite: Make repair excess optimal threshold configurable Add a new configuration parameter to the satellite for being able to configure the percentage excess over the optimal threshold, used for determining how many pieces should be repaired/uploaded, rather than having the value hard coded. * repairer: Add configurable param to segments/repairer Add a new parameters to the segment/repairer to calculate the maximum number of excess nodes, based on the optimal threshold, that repaired pieces can be uploaded. This new parameter has been added for not returning more nodes than the number of upload orders for data repair satellite service calculate for repairing pieces. * pkg/storage/ec: Update log message in clien.Repair * satellite: Update configuration lock file	2019-07-12 00:44:47 +02:00
Egon Elbre	d52f764e54	protocol: implement new piece signing and verification (#2525 )	2019-07-11 16:51:40 -04:00
Jeff Wendling	10547cc1ea	segments: send in the object path to the initial CreateSegment… (#2518 ) otherwise, api key restictions will fail because we look like we're asking to put to the bucket metadata path.	2019-07-10 11:33:55 -04:00
Alexander Leitner	1c5db71faf	Change protobuf expirations to use time.Time (#2509 ) * Change protobuf expirations to use time.Time instead of timestamp.Timestamp	2019-07-09 17:54:00 -04:00
Alexander Leitner	3587e1a579	Change pointerdb pointer to use time.Time for Creation date (#2483 )	2019-07-09 00:16:50 +02:00
Maximillian von Briesen	52e5a4eee3	pass logger into repairer and ecclient (#2365 )	2019-07-02 13:08:02 +03:00
littleskunk	fb66867856	fix repair queue (#2405 )	2019-07-01 19:00:53 +02:00
Egon Elbre	385c046723	pkg/pb: rename Order2 to Order, OrderLimit2 to OrderLimit (#2406 )	2019-07-01 18:54:11 +03:00
Natalie Villasana	3ffb42483f	account for disqualified nodes in data repair tests (#2315 )	2019-07-01 11:34:42 -04:00
Natalie Villasana	3f643551e7	remove flakiness in TestDataRepair and TestSegmentStoreRepair (#2335 ) * stop audit loop in repair tests to prevent possible timeout	2019-07-01 11:15:45 -04:00
Ivan Fraixedes	fd2e708587	pkg/storage/segments: Abort repaired test unmet requirements (#2403 ) Abort the repairer test when asserting results which are required later on by the test for being able to continue.	2019-07-01 08:49:40 -04:00
Maximillian von Briesen	b750bc2d0f	Keep some unhealthy pieces in the event of a partial repair (#2380 ) * if repair is partial, keep saving "unhealthy" pieces that are not duplicates	2019-06-28 15:48:51 -04:00
Egon Elbre	b6ad3e9c9f	internal/testrand: new package for random data (#2282 )	2019-06-26 13:38:51 +03:00
Egon Elbre	414648d660	Fix some metainfo.Client leaks (#2327 )	2019-06-25 18:36:23 +03:00
Michal Niewrzal	fdeb834801	Bucket name validation (#2244 )	2019-06-24 11:52:25 +02:00
Kaloyan Raev	ebd9b375fc	Repair should not corrupt files (#2194 )	2019-06-14 12:16:31 +03:00
ethanadams	8f2dca8437	Re-enabling and fixing repairer tests (#2099 ) * Disabled discovery service by changiing from Stop() to Pause() Paused to solve race condition. If discovery is running, it may mark a node "up" after they've been manually marked "down" in this test. * Extend to the repair timeout Fixes intermittent test failures when repairs were taking more than 2 seconds. * Re-enabled test. Disabled discovery service by changiing from Stop() to Pause() * Changed back to Stop. * Revert "Changed back to Stop." This reverts commit 46d410e72dfae63e0c44915be42784cc9a7b5abf. * re-enabling TestIdentifyInjuredSegments * Changed Pause to Stop. Commented on timeout change * testing... * temporarily skipping audit tests * changing back to discover Stop for testing via jenkins * Revert "changing back to discover Stop for testing via jenkins" This reverts commit 6aa8558b11a0053c30e0c8b2dbf0d6c0cb34ee6c. * Changing back to Stop(). Depends on PR 2137 * Revert "temporarily skipping audit tests" This reverts commit 1940ed9b315d663a0eb6c95521780cbcb48cb121. * Removed reference to Graveyard since its been removed	2019-06-10 09:06:21 +02:00
JT Olio	f1641af802	storage: add monkit task to missing places (#2122 ) * storage: add monkit task to missing places Change-Id: I9e17a6b14f7c25bbf698eeecf32785e9add3f26e * fix tests Change-Id: Id078276fa3de61a28eb3d01d4e751732ecbb173f * import order Change-Id: I814e33755b9f10b5219af37cd828cd75eb3da1a4 * remove part of other commit Change-Id: Idaa4c95cd65e97567fb466de49718db8203cfbe1	2019-06-05 16:23:10 +02:00
ethanadams	16e3b77cf5	Enable Scopelint Linter (#2049 ) * added scopelint and correcte issues found * corrected scopelint issue * made updates based on Ivan's suggestions Most were around naming conventions Some were false positives, but I kept them since the test.Run could eventually be changed to run in parallel, which could cause a bug Others were false positives. Added // nolint: scopelint	2019-05-29 09:30:16 -04:00
ethanadams	268dc6b7e4	Enable gocritic linter (#2051 ) * first round cleanup based on go-critic * more issues resolved for ifelsechain and unlambda checks * updated from master and gocritic found a new ifElseChain issue * disable appendAssign. i reports false positives * re-enabled go-critic appendAssign and disabled lint check at code line level * fixed go-critic lint error * fixed // nolint add gocritic specifically	2019-05-29 09:14:25 -04:00
Maximillian von Briesen	c07162beef	address potential divide by 0` (#2065 )	2019-05-28 08:54:30 -06:00
Maximillian von Briesen	5a4ff2c855	add repair monkit stats (#2045 ) * add repair monkit stats * rename values, use meter instead of counter, use success threshold instead of repair threshold * Counter -> Meter * add repair segment size * update names and use ratios for healthy before/after repair * restart jenkins	2019-05-28 16:10:26 +02:00
Jeff Wendling	1bd52b9f90	server side macaroons (#1945 ) What: Adds macaroon support to the server side Why: So that api keys are now macaroons	2019-05-24 10:51:27 -06:00
Bill Thorp	09065b8dec	call GetRemotePieces once (#2003 )	2019-05-20 15:22:03 +02:00
littleskunk	c974e0ce8a	Store repaired Segments and improve Repair Condition (#2000 ) * repair no cutoff longtail * commit repair pieces even if not hitting success threshold * commit repair pieces even if not hitting success threshold * remove useless condition * better error message	2019-05-20 12:50:13 +02:00
littleskunk	d2c95c1d62	improve repair logs (#1999 )	2019-05-20 10:37:46 +02:00
Egon Elbre	1103fa63c0	disable flaky TestSegmentStoreRepair (#1994 )	2019-05-17 23:13:37 +03:00
aligeti	60cf1dafb0	repair segment reassess it missing pieces just before repair (#1939 ) * repair segment reaccess it missing pieces just before repair to see if it actually needs repair	2019-05-16 09:49:10 -04:00
Michal Niewrzal	fe3dfc1587	Move pointerdb.Service to satellite (#1826 )	2019-04-25 10:46:32 +02:00
Kaloyan Raev	8fc5fe1d6f	Refactor pb.Node protobuf (#1785 )	2019-04-22 12:07:50 +03:00
paul cannon	0ae0de75bc	use SerializableMeta to store bucket attributes (#1658 )	2019-04-10 18:27:04 -04:00
Cameron	62deae6a0a	fix bucketID bug in repair (#1719 )	2019-04-09 13:20:00 -04:00
Maximillian von Briesen	bb3b4e4816	Data repair integration test (#1582 )	2019-04-08 13:33:47 -04:00
Kaloyan Raev	bfdee78f05	Introduce NodeDossier type and cleanup overlay.DB interface (#1626 ) Co-authored-by: Natalie Villasana <navillasa@gmail.com> Co-authored-by: Bill Thorp <bill3000@hotmail.com>	2019-04-04 19:34:36 +03:00
Maximillian von Briesen	4d925f783c	Fix repairer unit test (#1557 )	2019-04-03 19:00:25 -04:00
paul cannon	e4a70e3fac	plumb EncryptionScheme, RedundancyScheme through to buckets (#1638 ) We want to use those fields in the bucket-level Pointer objects as bucket defaults, but we need to be able to get at them first. I don't see any strong reason not to make these available, except that it was kind of a pain.	2019-04-02 15:15:31 -06:00
Michal Niewrzal	f80750693c	Store bandwidth from orders on satellite (#1586 )	2019-04-01 16:14:58 -04:00
Egon Elbre	be06fdfd6c	Create orders.Service (#1593 )	2019-03-28 22:09:23 +02:00
Egon Elbre	94e79eda6d	remove overlay endpoint (#1521 )	2019-03-23 10:06:11 +02:00
Egon Elbre	694b6dc1da	make tests run faster (#1553 )	2019-03-22 15:14:17 +02:00
Maximillian von Briesen	db64d6590b	Add repairer tests (#1494 )	2019-03-21 16:26:56 +02:00
Michal Niewrzal	d7feafe56b	Move psserver tests (#1522 )	2019-03-20 23:12:00 +02:00
Kaloyan Raev	d057efb05e	Add Repair method to ECClient (#1509 )	2019-03-19 15:14:59 +02:00
Egon Elbre	05d148aeb5	Storage node and upload/download protocol refactor (#1422 ) refactor storage node server refactor upload and download protocol	2019-03-18 12:55:06 +02:00
Michal Niewrzal	4122c98cb7	Validate piece hash on satellite (#1359 ) The satellite receives pieces signed hashes in Pointer. If signed hash cannot be validated then piece is removed from Pointer and not saved in DB.	2019-02-28 15:14:54 +01:00
Michal Niewrzal	81408a3c9e	Use SignedHash on client/uplink side (#1354 ) * psclient receives storage node hash and compare it to own hash for verification * uplink sends delete request when hashes don't match * valid hashes are propagated up to segments.Store for future sending to satellite	2019-02-25 16:57:54 +01:00
Bill Thorp	373b301736	BWA aliases (#1333 ) aliased RBAs and PBAs	2019-02-22 16:17:35 -05:00
Natalie Villasana	c3d3f41d30	removes some SignedMessage use (#1258 ) Removes most instances of pb.SignedMessage (there's more to take out but they shouldn't hurt anyone as is). There used to be places in psserver where a PieceID was hmac'd with the SatelliteID, which was gotten from a SignedMessage. This PR makes it so some functions access the SatelliteID from the Payer Bandwidth Allocation instead. This requires passing a SatelliteID into psserver functions where they weren't before, so the following proto messages have been changed: * PieceId - satellite_id field added This is so the psserver.Piece function has access to the SatelliteID when it needs to get the namespaced pieceID. This proto message should probably be renamed to PieceRequest, or a new PieceRequest message should be created so this isn't misnamed. * PieceDelete - satellite_id field added This is so the psserver.Delete function has access to the SatelliteID when receiving a request to Delete.	2019-02-19 23:36:08 -06:00

1 2 3

104 Commits