storj

Author	SHA1	Message	Date
Maximillian von Briesen	590312970d	satellite/gracefulexit: add flag for enabling/disabling graceful exit on the satellite (#3437 )	2019-11-01 16:21:24 +02:00
Maximillian von Briesen	d9bb25b4b9	satellite/metainfo: support a wider range of values for RS.Total in satellite metainfo validation (#3431 ) change uplink RS default configuration from 130 to 95	2019-10-31 15:04:33 -04:00
Yingrong Zhao	bfa6699e2c	satellite/repair: add timeout for repair download from a single node (#3418 )	2019-10-30 16:31:08 -04:00
Natalie Villasana	4878135068	satellite/gracefulexit, storagenode/gracefulexit: add timeouts (#3407 )	2019-10-30 13:40:57 -04:00
Cameron	b2ff13f1fa	{cmd/satellite, storj/satellite}: create command to run repair process in isolation (#3341 ) * set up satellite repair run command * add separated repair process to storj-sim * add repairer peer to satellite in testplanet * move api run cmd into api.go * add satellite run repair to entrypoint	2019-10-29 10:55:57 -04:00
Yingrong Zhao	fa1ac24e19	satellite/gracefulexit: add failure threshold check (#3329 ) * add overall failure percentage check and inactive time frame check before sending a response to sno * update comment * delete node from transfer queue if it has been inactive for too long * fix linting error * add test config value * fix nil pointer * add config value into testplanet * add unit test for overall failure threshold * move timeframe threshold to chore * update protolock * add chore test * add per piece failure count logic * change config name from EndpointMaxFailures to MaxFailuresPerPiece * address comments * fix linting error * add error handling for no row returned from progress table * fix test for graceful exit chore on storagenode * fix typo InActive -> Inactive * improve readability for failure threshold calculation * update config lock * change error handling for GetProgress in graceful exit endpoint on the satellite side * return proper rpc error in endpoint * add check in chore test for checking finish timestamp and queue	2019-10-24 12:24:42 -04:00
Bryan White	f468816f13	{internal/version,versioncontrol,cmd/storagenode-updater}: add rollout to storagenode updater (#3276 )	2019-10-21 12:50:59 +02:00
Bryan White	243ba1cb17	{versioncontrol,internal/version,cmd/*}: refactor version control (#3253 )	2019-10-20 09:56:23 +02:00
Egon Elbre	89ed997706	satellite/satellitedb: switch to postgres only (#3320 )	2019-10-18 22:03:10 +03:00
Natalie Villasana	855fca003d	satellite/metrics: create a metrics chore (#3263 ) * add metrics counter and chore * updates metrics observer interval release default and dev default to 15min * add more specific check for remote pointers * add Counter field to metrics chore, add counter tests * rm redundant ObjectCount suffix * make pointer check easier to read * change metrics.Config.Interval to ChoreInterval * rm unneeded var * fix comment * update satellite config lock	2019-10-16 14:08:33 -04:00
Cameron	76ad83f12c	satellite/accounting: add redis support to live accounting (#3213 ) * set up redis support in live accounting * move live.Service interface into accounting package and rename to Cache, pass into satellite * refactor Cache to store one int64 total, add IncrBy method to redis client implementation * add monkit tracing to live accounting	2019-10-16 12:50:29 -04:00
Jess G	87a426f228	internal/testplanet: add satellite.API to testplanet (#3237 )	2019-10-14 16:01:53 -04:00
Jennifer Li Johnson	b185dbbee2	satellite/discovery: remove discovery related code (#3175 )	2019-10-14 10:57:01 -04:00
Ethan Adams	a1275746b4	satellite/gracefulexit: Implement the 'process' endpoint on the satellite (#3223 )	2019-10-11 17:18:05 -04:00
Egon Elbre	e9c36d560f	satellite: make PointerDB an argument to satellite.New (#3233 )	2019-10-10 21:06:26 +03:00
Ethan Adams	4c4519f0be	satellite/gracefulexit: add transfer queue for pieces (#3174 ) initial impl of transfer queue updated docs represent the new design how we handle durability during exit	2019-10-07 16:38:05 -04:00
Maximillian von Briesen	08ed50bcaa	satellite/metainfo: add commit interval to prevent long delays between order limit creation and segment commit (#3149 )	2019-10-01 12:55:02 -04:00
Egon Elbre	94bbb9563d	internal/testplanet: set intervals to 15s by default (#3103 )	2019-09-25 18:41:24 +03:00
Jennifer Li Johnson	724bb44723	Remove Kademlia dependencies from Satellite and Storagenode (#2966 ) What: cmd/inspector/main.go: removes kad commands internal/testplanet/planet.go: Waits for contact chore to finish satellite/contact/nodesservice.go: creates an empty nodes service implementation satellite/contact/service.go: implements Local and FetchInfo methods & adds external address config value satellite/discovery/service.go: replaces kad.FetchInfo with contact.FetchInfo in Refresh() & removes Discover() satellite/peer.go: sets up contact service and endpoints storagenode/console/service.go: replaces nodeID with contact.Local() storagenode/contact/chore.go: replaces routing table with contact service storagenode/contact/nodesservice.go: creates empty implementation for ping and request info nodes service & implements RequestInfo method storagenode/contact/service.go: creates a service to return the local node and update its own capacity storagenode/monitor/monitor.go: uses contact service in place of routing table storagenode/operator.go: moves operatorconfig from kad into its own setup storagenode/peer.go: sets up contact service, chore, pingstats and endpoints satellite/overlay/config.go: changes NodeSelectionConfig.OnlineWindow default to 4hr to allow for accurate repair selection Removes kademlia setups in: cmd/storagenode/main.go cmd/storj-sim/network.go internal/testplanet/planet.go internal/testplanet/satellite.go internal/testplanet/storagenode.go satellite/peer.go scripts/test-sim-backwards.sh scripts/testdata/satellite-config.yaml.lock storagenode/inspector/inspector.go storagenode/peer.go storagenode/storagenodedb/database.go Why: Replacing Kademlia Please describe the tests: • internal/testplanet/planet_test.go: TestBasic: assert that the storagenode can check in with the satellite without any errors TestContact: test that all nodes get inserted into both satellites' overlay cache during testplanet setup • satellite/contact/contact_test.go: TestFetchInfo: Tests that the FetchInfo method returns the correct info • storagenode/contact/contact_test.go: TestNodeInfoUpdated: tests that the contact chore updates the node information TestRequestInfoEndpoint: tests that the Request info endpoint returns the correct info Please describe the performance impact: Node discovery should be at least slightly more performant since each node connects directly to each satellite and no longer needs to wait for bootstrapping. It probably won't be faster in real time on start up since each node waits a random amount of time (less than 1 hr) to initialize its first connection (jitter).	2019-09-19 15:56:34 -04:00
Jess G	7c203b4884	add satelliteSystem to testplanet and update tests (#3066 )	2019-09-17 13:14:49 -07:00
Natalie Villasana	aa3567187e	satellite/audit: worker now verifies and reverifies (#2965 )	2019-09-11 18:37:01 -04:00
Natalie Villasana	dbe90926ca	internal/testplanet: reduce coalesce duration (#3009 )	2019-09-11 18:15:14 -04:00
Natalie Villasana	6d363fb756	satellite/audit: create the audit queue, chore, and worker (#2888 )	2019-09-05 11:40:52 -04:00
Cameron	af5fb8e9c5	satellite/vouchers: deprecate voucher endpoint, return 'please upgrade' error (#2940 ) * voucher endpoint returns 'please upgrade' error, test	2019-09-04 13:21:02 -04:00
Cameron	599324c364	satellite/dbcleanup: delete expired serials from satellite (#2867 ) Creates a new chore, dbcleanup, which can be used for routine deletion of items from the satellite database and adds functionality for deletion of expired serial numbers	2019-08-27 13:12:38 -04:00
Egon Elbre	00b2e1a7d7	all: enable staticcheck (#2849 ) * by having megacheck in disable it also disabled staticcheck * fix closing body * keep interfacer disabled * hide bodies * don't use deprecated func * fix dead code * fix potential overrun * keep stylecheck disabled * don't pass nil as context * fix infinite recursion * remove extraneous return * fix data race * use correct func * ignore unused var * remove unused consts	2019-08-22 13:40:15 +02:00
Egon Elbre	9ec0ceddf3	pkg/revocation: ensure we close revocation databases (#2825 )	2019-08-20 18:04:17 +03:00
Isaac Hess	25154720bd	lib/uplink: remove redis and bolt dependencies (#2812 ) * identity: remove redis and bolt dependencies * identity: move revDB creation to main files	2019-08-19 16:10:38 -06:00
ethanadams	c9b46f2fe2	V3-1987: Optimize audits stats persistence (#2632 ) * Added batch update stats for recordAuditSuccessStatus * Added batch update stats to recordAuditFailStatus * added configurable batch size * build individual update/delete statements so the statements can be batched into 1 call to the DB * notified #config-changes channel and ran make update-satellite-config-lock * updated tests to use batch update stats	2019-07-31 13:21:06 -04:00
Egon Elbre	5d0816430f	rename all the things (#2531 ) * rename pkg/linksharing to linksharing * rename pkg/httpserver to linksharing/httpserver * rename pkg/eestream to uplink/eestream * rename pkg/stream to uplink/stream * rename pkg/metainfo/kvmetainfo to uplink/metainfo/kvmetainfo * rename pkg/auth/signing to pkg/signing * rename pkg/storage to uplink/storage * rename pkg/accounting to satellite/accounting * rename pkg/audit to satellite/audit * rename pkg/certdb to satellite/certdb * rename pkg/discovery to satellite/discovery * rename pkg/overlay to satellite/overlay * rename pkg/datarepair to satellite/repair	2019-07-28 08:55:36 +03:00
Natalie Villasana	f11413bc8e	Implement garbage collection on satellite (#2577 ) * Added a gc package at satellite/gc, which contains the gc.Service, which runs garbage collection integrated with the metainfoloop, and the gc PieceTracker, which implements the metainfo loop Observer interface and stores all of the filters (about which pieces are good) for each node. * Added a gc config located at satellite/gc/service.go (loop disabled by default in release) * Creates bloom filters with pieces to be retained inside the metainfo loop * Sends RetainRequests (or filters with good piece ids) to all storage nodes.	2019-07-24 13:26:43 -04:00
Maximillian von Briesen	6c1c3fb4a7	Add metainfo loop service (#2563 ) Add a metainfo loop service on the satellite that can be subscribed to by various services that need to make use of metainfo information	2019-07-22 09:34:12 -04:00
Alexander Leitner	64b2769de3	discovery: parallelize refresh (#2535 ) * parallelize discovery refresh * add paginateQualifiedtest, address pr comments * Remove duplicate uptime update * Lower concurrency in Testplanet for discovery	2019-07-12 10:35:48 -04:00
Ivan Fraixedes	f420b29d35	[V3-1927] Repairer uploads to max threshold instead of success… (#2423 ) * pkg/datarepair: Add test to check num upload pieces Add a new test for ensuring the number of pieces that the repair process upload when a segment is injured. * satellite/orders: Don't create "put order limits" over total Repair must not create "put order limits" more than the total count. * pkg/datarepair: Update upload repair pieces test Update the test which checks the number of pieces which are uploaded during a repair for using the same excess over the success threshold value than the implementation. * satellites/orders: Limit repair put order for not being total Limit the number of put orders to be used by repair for only uploading pieces to a % excess over the successful threshold. * pkg/datarepair: Change DataRepair test to pass again Make some changes in the DataRepair test to make pass again after the repair upload repaired pieces only until a % excess over success threshold. Also update the steps description of the DataRepair test after it has been changed, to match on what's now, besides to leave it more generic for avoiding having to update it on minimal future refactorings. * satellite: Make repair excess optimal threshold configurable Add a new configuration parameter to the satellite for being able to configure the percentage excess over the optimal threshold, used for determining how many pieces should be repaired/uploaded, rather than having the value hard coded. * repairer: Add configurable param to segments/repairer Add a new parameters to the segment/repairer to calculate the maximum number of excess nodes, based on the optimal threshold, that repaired pieces can be uploaded. This new parameter has been added for not returning more nodes than the number of upload orders for data repair satellite service calculate for repairing pieces. * pkg/storage/ec: Update log message in client.Repair * satellite: Update configuration lock file	2019-07-12 00:44:47 +02:00
Bill Thorp	0e463dccfd	7 day validity window for order limits (#2520 ) * 7 day limit	2019-07-10 17:17:00 -04:00
JT Olio	a79c7d77f3	overlay cache: slight modification of node-is-online rules (#2490 )	2019-07-09 22:36:09 -04:00
Stefan Benten	16156e3b3d	Ensure we force a segment size and account storage before committing them (#2473 )	2019-07-08 18:24:38 -04:00
Egon Elbre	674742d1a7	satellite/datarepair: use reliability cache (#1976 )	2019-07-09 01:04:35 +03:00
Kaloyan Raev	ca0058c9f1	Set MinDownloadTimeout to 5s in testplanet (#2447 )	2019-07-03 17:49:08 +03:00
Natalie Villasana	3f643551e7	remove flakiness in TestDataRepair and TestSegmentStoreRepair (#2335 ) * stop audit loop in repair tests to prevent possible timeout	2019-07-01 11:15:45 -04:00
Yingrong Zhao	bbedff12a6	satellite: rearrange marketing package (#2268 ) * move offer out of marketing package and remove marketing package * fix imports * fix rename errors * remove offer service * change package name from offers to rewards * fix linting * remove unused code and use appropriate comment	2019-06-24 16:51:54 -04:00
Cameron	1283036e37	add storage node voucher request service (#2158 ) * add voucher service on storage node * config field tag syntax, go routines for requests * hook up voucher service in storagenode/peer.go * add voucher config to testplanet * add voucher config to testplanet * add voucher response status INVALID, ACCEPTED, REJECTED * add a test for vouchers service * handle no row from GetValid, test it * add trust pool to voucher service * use trusted list to get satellites * verify vouchers upon receipt * test VerifyVoucher	2019-06-21 18:48:52 -04:00
aligeti	043d603cbe	satellite rs config check with validation check set to false default (#2229 ) * satellite rs config check with validation check	2019-06-21 14:15:58 -04:00
Bill Thorp	8f47fca5d3	Remove audit / uptime ratio fields (#2247 ) * removed ratios	2019-06-21 13:14:53 -04:00
Ivan Fraixedes	3d6b25a043	[v3-1952 test 1 & 3] pkg/audit: Add DQ test for too many failed audits (#2265 ) * pkg/audit: Add DQ test for too many failed audits Add an integration test which checks that a node which fails several audits gets disqualified but not before it reaches the audit reputation disqualification cut-off. * internal/testplanet: Set DQ cut-off config values Set the values of the Overlay cache DQ cut-off configuration parameters used by testplanet.	2019-06-21 18:27:19 +02:00
Egon Elbre	5b030062c0	internal/testplanet: split planet.go file to avoid package naming conflicts (#2279 )	2019-06-21 16:39:43 +03:00

46 Commits