storj

Author	SHA1	Message	Date
Jeff Wendling	828d0b9984	pkg/server: set TCP_USER_TIMEOUT and monitor leaked conns Go will, by default, set tcp keep alives on sockets. But the kernel does not send keep alives to sockets that have a non-empty send queue. That can cause connections that hang forever. So we set TCP_USER_TIMEOUT on all of the sockets as well. That option will close any connection that has not received an ack for any sent data (keep alive or otherwise) in the configured time period. This places an upper bound on the amount of time a socket can be stuck due to a client not acknowledging data. See https://blog.cloudflare.com/when-tcp-sockets-refuse-to-die/ for more information on what these options do and how they interact. Additionally, make sure that we close every connection coming from the listeners by wrapping them in a type with a finalizer that closes the connection, much like the os package does for file handles. It monitors if a connection was closed due to a finalizer so that we can go and look for the bug if we ever see a non-zero value. Change-Id: Idc6c0564224b8dc2e4c9d769e80374ed1fe8cce0	2020-01-03 21:31:09 +00:00
Egon Elbre	e03d3fb577	uplink: move configs to cmd/uplink/cmd Change-Id: Ifc1d3440dcef429c2a6142c16f3e991abf49f1d2	2020-01-02 09:40:57 +00:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Fadila	115b8b0fc8	storagenode/piecestore: delete several pieces in a single request This is part of the deletion performance improvement. See https://storjlabs.atlassian.net/browse/V3-3349 Change-Id: Idcd83a302f2bd5cc3299e1a4195a7e177f452599	2019-12-27 10:58:04 +00:00
Egon Elbre	3849b9a21a	pkg/rpc: remove RawConn Change-Id: I61bc10b82f178a16f27279b85b627553d122c174	2019-12-22 18:01:01 +02:00
Egon Elbre	d55288cf68	pkg/rpc: replace methods with direct calls to pb Change-Id: I8bd015d8d316a2c12c1daceca1d9fd257f6f57bc	2019-12-22 17:12:43 +02:00
Egon Elbre	006baa9ca6	pkg/rpc: remove drpc aliases We need to split up pb package, which means we cannot have a core package that depends on them. Change-Id: I7f4f6fd82f89a51a9b2ad08bf2b1207253b8a215	2019-12-22 16:58:08 +02:00
Egon Elbre	31fbdcc8f7	pkg/encryption: better EncodingDecodingStress The test case wasn't testing all the combinations. But, testing all 3 byte combinations would take too long, instead test special values and +- 1 of them, with some additional noise characters. Change-Id: If53ff25863a1f27c534922bd399fbbbdfefda441	2019-12-21 12:45:24 +02:00
Egon Elbre	6fc009f6e4	uplink/eestream: move Pad to encryption package to break dependency to eestream Change-Id: I0c9bc3c65f161d79812196ac8285405e6be04c9e	2019-12-20 19:32:12 +00:00
Egon Elbre	ea455b6df0	all: remove code to default to grpc We have moved to drpc so we don't need to have code for building with grpc only. Change-Id: I55732314dca0d5b4ce1132b68de4186a15d91b21	2019-12-20 20:12:04 +02:00
JT Olio	389d1821ea	uplink/paths/encryption: support commandline argument to override path cipher to be urlsafe base64 for lists and deletes (#2855 )	2019-12-19 12:29:00 +01:00
Egon Elbre	9ed9d3516f	pkg/peertls: move tests requiring redis or bolt Change-Id: Ib9de8d5ac1123d109b0209de4174fb4da7c67078	2019-12-18 17:11:36 +02:00
Egon Elbre	1eaf9e9ed7	pkg/storj: move non-common types Change-Id: I2dd15c95cd334660f29a528dfb3dec9c30b29cab	2019-12-18 14:14:53 +00:00
Egon Elbre	71d6eef0d7	pkg/storj: remove unnecessary interfaces Change-Id: Ifb308d2654d61a5b9bb3e28ff20282d6a690f0ca	2019-12-18 14:14:43 +00:00
Egon Elbre	7a36507a0a	private/testcontext: ensure we call cleanup everywhere Change-Id: Icb921144b651611d78f3736629430d05c3b8a7d3	2019-12-17 14:16:09 +00:00
Egon Elbre	7455ab771b	pkg/peertls/tlsopts: move test that requires testplanet For splitting core repository we need it not to pull in testplanet even in tests. Change-Id: I04d46b418e6e908185a4da694cf47dc3c5cc65f0	2019-12-17 13:45:51 +00:00
Egon Elbre	b04f9996c5	pkg/rpc: move test that needs testplanet Move rpc test that uses testplanet into private/testplanet. This ensures that rpc doesn't have the whole system as a dependency making it easier to separate. This unfortunately leaves pkg/rpc without specific tests, but we would need to write new tests that only use the core packages. Change-Id: I402ab3c2d50282af159c2ef3371d23b0997fef0a	2019-12-17 13:31:12 +00:00
Cameron Ayer	a4f9865b47	satellite: adds and enables cockroachdb compatibility for tests Change-Id: I85a3ad8c3b9d7e15ea8675b6c55af0002933db57	2019-12-16 22:29:25 +00:00
Isaac Hess	0008aebf80	pkg/rpc: Change drpcheader to save a packet This changes when we write the drpcheader. Rather than making it its own write to the connection, it now prepends the drpc header to the first write on the connection (typically the tls handshake). This results in one less packet being sent at the beginning of each drpc connection. For an operation like uploading a file from uplink, this results in many packets being dropped: one when communicating with the satellite, and one for each communication with the storage nodes. Change-Id: I7644b46e90ffa7acea73ac56831396307352ed7a	2019-12-16 13:33:39 -07:00
Kaloyan Raev	958772467a	cmd/storagenode-updater, pkg/process: Fix logging timestamp After changing how we execute the storagenode-updater process we lost timestamps in the log. The fix is to start using zap logging. The Windows Installer is changed to register the storagenode-updater service in a way that the Windows Service Manager passes the --log.output flag instead of the old --log. The old --log flag is deprecated, but not removed. We will support it for backward compatibility. This is required as the storagenode-updater can auto-update itself, but the Windows Service Manager of this old installation will continue passing the old --log flag when starting it. Change-Id: I690dff27e01335e617aa314032ecbadc4ea8cbd5 Signed-off-by: Kaloyan Raev <kaloyan@storj.io>	2019-12-11 10:05:48 +00:00
Jeff Wendling	23df647a15	pkg/rpc/rpcpool: add idle expiration to connections long lived uplinks could just hold on to connections forever if their client to the storagenode or satellite isn't closed. this will prevent that from happening on the client. more changes will be necessary to add appropriate prevention on the servers. Change-Id: Ib36d85e70cbafb315664ad7657bb70b936b3828c	2019-12-10 20:32:11 +00:00
Cameron Ayer	6fae361c31	replace planet.Start in tests with planet.Run planet.Start starts a testplanet system, whereas planet.Run starts a testplanet and runs a test against it with each DB backend (cockroach compat). Change-Id: I39c9da26d9619ee69a2b718d24ab00271f9e9bc2	2019-12-10 16:55:54 +00:00
Michal Niewrzal	ffd570eb04	uplink/metainfo: remove additional GetObject from download This change removes one additional metainfo.GetObject call during download. Change-Id: I86b4e5165f299069e3e18944fe79c697c6a514d3	2019-12-09 11:48:48 +00:00
Jennifer Johnson	ecb960f506	private/dbutil: distinguishes between db drivers and implementations to allow for different implementations of SQL queries. Change-Id: I2dc8d1d371139aa8bc805e92a2b80b71f580fd64	2019-12-04 18:31:26 +00:00
Andrew Harding	2461ccd469	pkg/private/fpath: subsume AtomicWriteFile AtomicWriteFile is useful primitive to use throughout the codebase Change-Id: I338fc4505ba20d5aece09ddc257286f46298e083	2019-12-03 18:14:08 +00:00
Ivan Fraixedes	bf97ef06fc	storagenode: Add new endpoint to receive satellite requests for… (#3590 ) * pkg/pg: Add new service function storage node Add a new service function to the storage node piece store for deleting pieces when satellites request them. * storagenode/piecestore: Add endpoint to delete piece Add a new endpoint to receive from trusted satellites to delete a piece. * private/testplanet: Fix storagenode mock Add to the storagenode mock the new endpoint method. * proto.lock: Update it with the last protobuf changes * storagenode/piecestore: Reuse test piece upload Extract the repeated logic from several test functions for uploading a test piece to a test helper function. * uplink/piecestore: Implement client side method Implement the client side method of the new piecestore RPC function. * storagenode/piecestore: Add test DeletePiece endpoint Implement a test for the DeletePiece new endpoint method.	2019-11-26 18:47:19 +01:00
Yingrong Zhao	66f1a1680f	add completion receipt to exit-status cli command on storage node (#3650 )	2019-11-26 12:32:26 -05:00
Egon Elbre	36fead0093	satellite/metainfo: add UserAgent support to endpoints (#3548 )	2019-11-26 03:12:37 -08:00
Yingrong Zhao	79a4fff6c7	satellite/referrals: set up referrals service and http endpoints (#3566 )	2019-11-25 16:36:36 -05:00
JT Olio	031ba86de5	argon2: choose a steady parallelism value (#3630 ) * argon2: choose a steady parallelism value Change-Id: I6006da7d7980cda88f5f08ee759612df23a8132d * whoops, not cruft Change-Id: Ied9039f9a9be1d0f6ff3c7d5c4839a83fc7b4b1f * fix broken test file Change-Id: I07288cd6cef32ba387f2f003febff5c297e50997 * fix linting error Change-Id: Icdbda8b709cc100a86f3859303c40edb8dff1e0f	2019-11-22 14:00:04 -07:00
Natalie Villasana	b7a8ffcdff	pkg/pb/referralmanager: update to add satellite ID to Get Tokens request (#3625 )	2019-11-21 16:13:48 -05:00
Isaac Hess	6aeddf2f53	storagenode/pieces: Add Trash and RestoreTrash to piecestore (#3575 ) * storagenode/pieces: Add Trash and RestoreTrash to piecestore * Add index for expiration trash	2019-11-20 09:28:49 -07:00
Michal Niewrzal	5964502ce0	uplink/metainfo: remove GetObject from download Batch (#3596 )	2019-11-19 04:58:26 -08:00
Jeff Wendling	53176dcb0e	pkg/rpc/rpcstatus: do not depend on grpc/drpc build mode if your server is built to make drpc connections, clients can still connect with grpc. thus, your responses to grpc clients must still look the same, so we have to have all of our status wrapping include codes for both the drpc and grpc servers to return the right thing. Change-Id: If99fa0e674dec2e20ddd372a827f1c01b4d305b2	2019-11-18 15:51:58 -07:00
Jeff Wendling	f3b20215b0	pkg/{rpc,server,tlsopts}: pick larger defaults for buffer sizes these may not be optimal but they're probably better based on our previous testing. we can tune better in the future now that the groundwork is there. Change-Id: Iafaee86d3181287c33eadf6b7eceb307dda566a6	2019-11-18 21:22:49 +00:00
Natalie Villasana	3a0b71ae5b	pkg/pb/referralmanager: update RedeemTokenRequest, rm ReserveToken (#3593 )	2019-11-18 16:17:21 -05:00
Maximillian von Briesen	2524cc5d42	pkg/pb: remove unneeded proto imports (#3585 )	2019-11-15 11:55:32 -05:00
Michal Niewrzal	bc16cb5d24	libuplink: remove additional GetBucket for upload/download (#3568 )	2019-11-15 02:06:17 -08:00
Egon Elbre	ee6c1cac8a	private: rename internal to private (#3573 )	2019-11-14 21:46:15 +02:00
Yingrong Zhao	d2a8ab5d7f	pkg/pb: add referral manager protobuf definition (#3561 )	2019-11-14 12:33:00 -05:00
JT Olio	a72bf6c254	pkg/rpc: generate drpc/grpc tags correctly (#3556 ) Change-Id: Iac79d6134246e92876dd57e269a9c96c2de95884	2019-11-12 16:22:21 -07:00
paul cannon	0c025fa937	storage/: remove reverse-key-listing feature We don't use reverse listing in any of our code, outside of tests, and it is only exposed through libuplink in the lib/uplink.(*Project).ListBuckets() API. We also don't know of any users who might have a need for reverse listing through ListBuckets(). Since one of our prospective pointerdb backends can not support backwards iteration, and because of the above considerations, we are going to remove the reverse listing feature. Change-Id: I8d2a1f33d01ee70b79918d584b8c671f57eef2a0	2019-11-12 18:47:51 +00:00
Andrew Harding	168f72d371	Initialize math/rand default source (#3552 )	2019-11-12 10:03:41 -07:00
Jeff Wendling	013e0d94bc	pkg/rpc: ensure connections are quickly closed drpc will call Close on any transport we pass to it, but some transports (like tls.Conn) will attempt to notify the remote side of things. we don't want to do that, so pass a new interface that just closes the underlying socket. Change-Id: I53344d2747de21b3146abe4f82b8394bb8948cb5	2019-11-12 15:53:36 +00:00
Bryan White	7355065dc9	pkg/{cfgstruct,identity}: replace separator in default values when path tag set (#3498 )	2019-11-12 12:31:19 +01:00
Kaloyan Raev	20623fdc96	Increase min required difficulty to 36 in signing service (#3535 )	2019-11-11 12:27:09 +02:00
Ivan Fraixedes	e4a220347a	uplink: Suppress one metainfo call on delete (#3511 ) Change signature of metainfo DeleteObject to get rid of an extra call to kvmetainfo GetBucket method and eliminate one round trip to the satellite when deleting objects.	2019-11-07 10:39:40 +01:00
Jeff Wendling	f62107d3e9	pkg/rpc: fix grpc dial timeouts (#3517 ) grpc doesn't exit dials right away if the context dialer returns an error. since that's the only spot where we were enforcing dial timeouts, dials could just leak for an unknown amount of time. add a timeout above the grpc dial because that's the documented way that grpc expected to be canceled. Change-Id: Ic47ac61ce8a5f721510cc2c4584f63d43fe4f2d5	2019-11-06 16:42:20 -07:00
Michal Niewrzal	ab5c623ac7	cli: should return non-zero code for error (#3469 )	2019-11-05 06:01:26 -08:00
Yaroslav Vorobiov	35edc2bcc3	satellite/payments: invoice creation (#3468 )	2019-11-05 15:16:02 +02:00
Jeff Wendling	17e9044c0f	pkg/rpc/rpcpeer: check both drpc and grpc for peers on a context we don't know if an incoming connection is from drpc or grpc during the migration time, so check both. Change-Id: I2418dde8b651dcc4a23726057178465224a48103	2019-11-01 17:04:53 -06:00
JT Olio	41c0093e5b	drpc: enable by default (#3452 )	2019-11-01 22:43:24 +01:00
Jennifer Li Johnson	76b64b79ba	cmd/identity: allow using redis for RevocationDB (#3259 )	2019-11-01 13:27:47 -04:00
Michal Niewrzal	8786a37f89	uplink/storage: use Batch to optimize upload requests (#3408 )	2019-10-29 08:49:16 -07:00
Ethan Adams	e54d290d2e	satellite/gracefulexit: Add signatures for success/failed exit finished messages. (#3368 ) * add signatures, fix process loop bug, move delete to on success * added tests for signatures * PR comment updates * fixed setting reason by default. * updates for PR comments * added signed failure when verification fails * moved to sign_test * fix panic * removed testplanet from test	2019-10-25 16:36:26 -04:00
Natalie Villasana	696c567e89	satellite/gracefulexit: add piece hash validation for successful transfer (#3313 )	2019-10-24 15:38:40 -04:00
Yingrong Zhao	fa1ac24e19	satellite/gracefulexit: add failure threshold check (#3329 ) * add overall failure percentage check and inactive time frame check before sending a response to sno * update comment * delete node from transfer queue if it has been inactive for too long * fix linting error * add test config value * fix nil pointer * add config value into testplanet * add unit test for overall failure threshold * move timeframe threshold to chore * update protolock * add chore test * add per piece failure count logic * change config name from EndpointMaxFailures to MaxFailuresPerPiece * address comments * fix linting error * add error handling for no row returned from progress table * fix test for graceful exit chore on storagenode * fix typo InActive -> Inactive * improve readability for failure threshold calculation * update config lock * change error handling for GetProgress in graceful exit endpoint on the satellite side * return proper rpc error in endpoint * add check in chore test for checking finish timestamp and queue	2019-10-24 12:24:42 -04:00
Jeff Wendling	51d5d8656a	pkg/rpc: drpc connection pooling keep a pool of connections open when dialing for drpc. this makes it so that long lived clients (like lib/uplink's Project) don't continue to use a bad connection forever. it also allows for concurrent rpcs. Change-Id: If649b286050e4f09c413fadc3e1ce88f5fc6e600	2019-10-22 18:15:24 -06:00
JT Olio	2c6fa3c5f8	pkg/rpc: remove read/write deadlines as a mechanism for request timeouts (#3335 ) libuplink was incorrectly setting timeouts to 10 seconds still, but should have been at least 10 minutes. the order sender was setting them to 1 hour. we don't want timeouts in uplink-side logic as it establishes a minimum rate on tcp streams. instead of all of this, just use tcp keep alive. tcp keep alive packets are sent every 15 seconds and if the peer stops responding the connection dies. this is enabled by default with go. this will kill tcp connections when they stop working. Change-Id: I3d7ad49f71950b3eb43044eedf4b17993116045b	2019-10-22 17:57:24 -06:00
Ethan Adams	3e0d12354a	storagenode/gracefulexit: Implement storage node graceful exit worker - part 1 (#3322 )	2019-10-22 16:42:21 -04:00
Michal Niewrzal	04c2454c71	satellite/metainfo: pass streamID/segmentID between Batch request/response (#3311 )	2019-10-22 03:23:22 -07:00
Bryan White	f468816f13	{internal/version,versioncontrol,cmd/storagenode-updater}: add rollout to storagenode updater (#3276 )	2019-10-21 12:50:59 +02:00
Bryan White	243ba1cb17	{versioncontrol,internal/version,cmd/*}: refactor version control (#3253 )	2019-10-20 09:56:23 +02:00
Egon Elbre	f929310add	pkg/rpc/rpcstatus: fix drpc grpc compatibility (#3306 ) When code is compiled without -tags=drpc the statuses for drpc server weren't handled, which meant an uplink using -tags=drpc didn't get the correct status code.	2019-10-17 15:21:20 -04:00
Yingrong Zhao	87e3764390	storagenode/cmd: add exit-status command for graceful exit (#3264 ) * add exit-status command * remove todo and fix format * fix status display * change startExit to exit progress * fix linting error * add successful column in exit progress * fix test * remove extra new line * fix TYPOS * format the percentage better	2019-10-15 18:07:32 -04:00
Ethan Adams	37ab84355f	satellite/gracefulexit: protobuf field name updates (#3284 ) rename piece_id to original_piece_id	2019-10-15 15:59:12 -04:00
Ethan Adams	1ad2ba7e3e	storagenode/gracefulexit: Add graceful exit chore and worker. (#3262 ) Adds graceful exit chore and worker for V3-2614	2019-10-15 11:29:47 -04:00
Marc Schubert	93d5eeda31	Update dial.go (#3261 ) What: Bring back partial nodeID to debug.trace-out Why: The information is useful for interpreting the trace file and was there up drpc. I just bring it back. https://github.com/storj/storj/blob/v0.21.3/pkg/transport/transport.go#L76 Please describe the tests: Test 1: Test 2: Please describe the performance impact: No impact.	2019-10-14 15:44:15 -06:00
JT Olio	694177e217	pkg/pb: regen gracefulexit.pb.go (#3270 )	2019-10-14 17:06:04 -04:00
Jennifer Li Johnson	b185dbbee2	satellite/discovery: remove discovery related code (#3175 )	2019-10-14 10:57:01 -04:00
JT Olio	6ede140df1	pkg/rpc: defeat MITM attacks in most cases (#3215 ) This change adds a trusted registry (via the source code) of node address to node id mappings (currently only for well known Satellites) to defeat MITM attacks to Satellites. It also extends the uplink UI such that when entering a satellite address by hand, a node id prefix can also be added to defeat MITM attacks with unknown satellites. When running uplink setup, satellite addresses can now be of the form 12EayRS2V1k@us-central-1.tardigrade.io (not even using a full node id) to ensure that the peer contacted is the peer that was expected. When using a known satellite address, the known node ids are used if no override is provided.	2019-10-12 14:34:41 -06:00
Ethan Adams	a1275746b4	satellite/gracefulexit: Implement the 'process' endpoint on the satellite (#3223 )	2019-10-11 17:18:05 -04:00
Isaac Hess	9256399872	CI: test drpc and grpc (#3163 ) * wip: test drpc * Add parallel integration test * Add jenkinsfile.drpc * Remove unnecessary jenkinsfile items * testing: GOFLAGS=-drpc (#3236) * Use GOFLAGS * add debug * revert tags * revert changes * move goflags to the correct place * add sanity check	2019-10-11 08:30:06 -06:00
Yingrong Zhao	743a0fc38b	storagenode/cmd: create start graceful exit CLI (#3202 )	2019-10-11 09:58:12 -04:00
Ethan Adams	447c219d92	satellite/gracefulexit: Add protobuf definitions for communication between storage node and satellite (#3201 )	2019-10-08 13:42:56 -04:00
Jennifer Li Johnson	7ceaabb18e	Delete Bootstrap and Kademlia (#2974 )	2019-10-04 16:48:41 -04:00
Jeff Wendling	4fab22d691	pkg/rpc: don't leak goroutines during a drpc dial we spawned a goroutine to wait on the context's done channel sending the error afterward, but we forgot to ensure the context was eventually done, so the goroutine would be leaked until then. instead, we can just do a select on two channels to get the error rather than spawn a goroutine which makes it impossible to leak a goroutine. Change-Id: I2fdba206ae6ff7a3441b00708b86b36dfeece2b5	2019-10-04 20:09:36 +00:00
Jeff Wendling	64e43e555e	pkg/rpc: return context error if ready after DialContext fails the net package does not make it easy to know if DialContext failed because the context was done. it's important for some of our tests that canceled contexts are detected as such, so we accept the small race that's arguably correct (the context must be canceled asynchronously) to ensure we always return the context error if available. Change-Id: I058064d5c666e5353b74fb5bd300bf7abe537ff5	2019-10-04 20:09:00 +00:00
Jeff Wendling	c9e0aa7c70	pkg/kademlia: make tests run/work with drpc Change-Id: I69372fd8f0d52913e1ad2cf7d01115460ba8eeda	2019-10-03 15:33:25 -06:00
littleskunk	b2e328f118	storagenode/dashboard: update online status (#3168 )	2019-10-03 20:31:39 +02:00
Isaac Hess	94c7df0d6e	pkg/rpc/rpcstatus: Fix return type (#3162 )	2019-10-02 14:46:18 -06:00
Jennifer Li Johnson	29b96a666b	internal/testplanet: fix conn leak (#3132 )	2019-09-27 09:47:57 -06:00
Jeff Wendling	93349f247e	pkg/rpc: add WithInsecure when doing non-tls dials Change-Id: I993f223f4ac78824b75a7725342ebf2ae0f74254	2019-09-27 09:07:14 -06:00
Bryan White	c8aa821ccb	pkg/certificates: move certificate package to root (#3107 )	2019-09-26 09:11:05 -07:00
Jeff Wendling	098cbc9c67	all: use pkg/rpc instead of pkg/transport all of the packages and tests work with both grpc and drpc. we'll probably need to do some jenkins pipelines to run the tests with drpc as well. most of the changes are really due to a bit of cleanup of the pkg/transport.Client api into an rpc.Dialer in the spirit of a net.Dialer. now that we don't need observers, we can pass around stateless configuration to everything rather than stateful things that issue observations. it also adds a DialAddressID for the case where we don't have a pb.Node, but we do have an address and want to assert some ID. this happened pretty frequently, and now there's no more weird contortions creating custom tls options, etc. a lot of the other changes are being consistent/using the abstractions in the rpc package to do rpc style things like finding peer information, or checking status codes. Change-Id: Ief62875e21d80a21b3c56a5a37f45887679f9412	2019-09-25 15:37:06 -06:00
Bryan White	a7040647a4	run certificate authorization endpoint (#3108 )	2019-09-23 15:19:13 -07:00
Jeff Wendling	d32d85a717	pkg/listenmux: resolve deadlock in test it was possible, because we spawned Run before we did any calls to Route, that the listenmux would send multiple connections to the default listener. Fix that by ensuring we call Route before we call Run. Change-Id: Ie8fd754997975969a99fd2a3f8d3010c24cdc73d	2019-09-20 21:16:59 +00:00
Jeff Wendling	a20a7db793	pkg/rpc: build tag based selection of rpc details It provides an abstraction around the rpc details so that one can use drpc or grpc with the same code. It subsumes using the protobuf package directly for client interfaces as well as the pkg/transport package to perform dials. Change-Id: I8f5688bd71be8b0c766f13029128a77e5d46320b	2019-09-20 21:07:33 +00:00
Jennifer Li Johnson	724bb44723	Remove Kademlia dependencies from Satellite and Storagenode (#2966 ) What: cmd/inspector/main.go: removes kad commands internal/testplanet/planet.go: Waits for contact chore to finish satellite/contact/nodesservice.go: creates an empty nodes service implementation satellite/contact/service.go: implements Local and FetchInfo methods & adds external address config value satellite/discovery/service.go: replaces kad.FetchInfo with contact.FetchInfo in Refresh() & removes Discover() satellite/peer.go: sets up contact service and endpoints storagenode/console/service.go: replaces nodeID with contact.Local() storagenode/contact/chore.go: replaces routing table with contact service storagenode/contact/nodesservice.go: creates empty implementation for ping and request info nodes service & implements RequestInfo method storagenode/contact/service.go: creates a service to return the local node and update its own capacity storagenode/monitor/monitor.go: uses contact service in place of routing table storagenode/operator.go: moves operatorconfig from kad into its own setup storagenode/peer.go: sets up contact service, chore, pingstats and endpoints satellite/overlay/config.go: changes NodeSelectionConfig.OnlineWindow default to 4hr to allow for accurate repair selection Removes kademlia setups in: cmd/storagenode/main.go cmd/storj-sim/network.go internal/testplane/planet.go internal/testplanet/satellite.go internal/testplanet/storagenode.go satellite/peer.go scripts/test-sim-backwards.sh scripts/testdata/satellite-config.yaml.lock storagenode/inspector/inspector.go storagenode/peer.go storagenode/storagenodedb/database.go Why: Replacing Kademlia Please describe the tests: • internal/testplanet/planet_test.go: TestBasic: assert that the storagenode can check in with the satellite without any errors TestContact: test that all nodes get inserted into both satellites' overlay cache during testplanet setup • satellite/contact/contact_test.go: TestFetchInfo: Tests that the FetchInfo method returns the correct info • storagenode/contact/contact_test.go: TestNodeInfoUpdated: tests that the contact chore updates the node information TestRequestInfoEndpoint: tests that the Request info endpoint returns the correct info Please describe the performance impact: Node discovery should be at least slightly more performant since each node connects directly to each satellite and no longer needs to wait for bootstrapping. It probably won't be faster in real time on start up since each node waits a random amount of time (less than 1 hr) to initialize its first connection (jitter).	2019-09-19 15:56:34 -04:00
Jess G	93788e5218	remove kademlia: create upsert query to update uptime (#2999 ) * create upsert query for check-in method * add tests * fix lint err * add benchmark test for db query * fix lint and tests * add a unit test, fix lint * add address to tests * replace print w/ b.Fatal * refactor query per CR comments * fix disqualified, only set if null * fix query * add version to updatecheckin query * fix version * fix tests * change version for tests * add version to tests * add IP, add transport, mv unit test * use node.address as arg * add last ip * fix lint	2019-09-19 11:37:31 -07:00
Kaloyan Raev	45df0c5340	storagenode/process: respond to Windows Service events (#3025 )	2019-09-19 19:37:40 +03:00
JT Olio	946ec201e2	metainfo: move api keys to part of the request (#3069 ) What: we move api keys out of the grpc connection-level metadata on the client side and into the request protobufs directly. the server side still supports both mechanisms for backwards compatibility. Why: dRPC won't support connection-level metadata. the only thing we currently use connection-level metadata for is api keys. we need to move all information needed by a request into the request protobuf itself for drpc support. check out the .proto changes for the main details. One fun side-fact: Did you know that protobuf fields 1-15 are special and only use one byte for both the field number and type? Additionally did you know we don't use field 15 anywhere yet? So the new request header will use field 15, and should use field 15 on all protobufs going forward. Please describe the tests: all existing tests should pass Please describe the performance impact: none	2019-09-19 10:19:29 -06:00
Jess G	695de9dcd7	rm noisy debug logs that we dont need (#3083 )	2019-09-18 12:43:57 -07:00
Egon Elbre	186e67e056	pkg/transport: set default timeout to 10 minutes (#3075 )	2019-09-18 11:56:23 -04:00
Maximillian von Briesen	574c96c350	satellite/metainfo: Verify storagenode signature on satellite upload (#2985 )	2019-09-18 09:50:33 -04:00
Jess G	7c203b4884	add satelliteSystem to testplanet and update tests (#3066 )	2019-09-17 13:14:49 -07:00
Natalie Villasana	cc70cd2329	satellite/repair: add metric trackers for segment age before repair (#3056 )	2019-09-17 15:18:48 -04:00
Ivan Fraixedes	febe32bc7a	pkg/miniogw: Add a missed stack trace error (#3035 ) Add the stack trace to an error returned by one of the functions for being able to track down the origin in case the error happens and gets logged.	2019-09-16 23:00:50 +03:00
Jess G	d3ef574b20	pkg/pb: minor changes to contact.proto (#3048 ) * minor fixes to contact proto * simply and rm nodeAddr object from client	2019-09-13 19:37:32 -05:00
Andrew Harding	f550ab5d1c	Uplink "import" command (#2981 ) * uplink import cmd * pkg/process: fix import order * fix golangci-lint failures * remove "help" from the satellite config lock file	2019-09-13 12:33:30 -06:00


1371 Commits