storj

Author	SHA1	Message	Date
Michal Niewrzal	218bbeaffa	Merge 'master' branch Change-Id: Ica5c25607a951076dd9f77e35e308062f71ce3f0	2020-12-07 15:05:52 +01:00
Stefan Benten	494bd5db81	all: golangci-lint v1.33.0 fixes (#3985 )	2020-12-05 17:01:42 +01:00
Ethan Adams	f90ea10a4a	Allow for DB application names per process. (#3983 )	2020-12-04 11:24:39 +01:00
Moby von Briesen	3fc76f4ffe	satellite/downtime: Remove deprecated downtime tracking service. We are no longer planning on implementing downtime penalization using the method described in docs/blueprints/archive/storage-node-downtime-tracking-deprecated.md. Now, we are implementing the design described in docs/blueprints/storage-node-downtime-tracking-with-audits.md. This change removes the downtime estimation chores from the satellite core as well as the package satellite/downtime. A future change will remove the database table. Change-Id: I1a1d3cf9dceeba36255d25243294865b89925518	2020-12-02 15:16:13 -05:00
Jessica Grebenschikov	b261110352	satellite/orders: get bucketID from encrypted metadata in order instead of serial_numbers table We want to stop using the serial_numbers table in satelliteDB. One of the last places using the serial_numbers table is when storagenodes settle orders, we look up the bucket name and project ID from the serial number from the serial_numbers table. Now that we have support to add encrypted metadata into the OrderLimit, this PR makes use of that and now attempts to read the project ID and bucket name from the encrypted orderLimit metadata instead of from the serial_numbers table. For backwards compatibility and to ensure no errors, we will still fallback to the old way of getting that info from the serial_numbers table, but this will be removed in the next release as long as there are no errors. All processes that create orderLimits must have an orders.encryption-keys set. The services that create orderLimits (and thus need to encrypt the order metadata) are the satellite apiProcess, the repair process, audit service (core process), and graceful exit (core process). Only the satellite api process decrypts the order metadata when storagenodes settle orders. This means that the same encryption key needs to be provided in the config for the satellite api process, repair process, and the core process like so: orders.include-encrypted-metadata=true orders.encryption-keys="<"encryptionKeyID>=<encryptionKey>" Change-Id: Ie2c037971713d6fbf69d697bfad7f8b672eedd66	2020-12-01 15:29:32 +00:00
Kaloyan Raev	76199db3c7	private/testplanet: expose Metabase to Test Planet. Change-Id: Ibffa681ffe3d4964e75c68375f3852e53b4497d6	2020-11-30 19:43:06 +00:00
Michal Niewrzal	5a7bc9657d	Merge 'master' branch Change-Id: If583132a821274dc4b78cf5f72b853ba8460c619	2020-11-30 12:57:22 +01:00
JT Olio	0ba516d405	satellite: support pointing db components at different databases the immediate need is to be able to move the repair queue back out of cockroach if we can't save it. Change-Id: If26001a4e6804f6bb8713b4aee7e4fd6254dc326	2020-11-28 18:39:16 +00:00
Michal Niewrzal	efaba85c73	Merge 'master' branch Change-Id: I3520b3e327732929f5167b07a15ddb92d26cae1b	2020-11-24 10:03:20 +01:00
Ethan	2b92bba563	satellite/satellitedb/orders: Handle serial_numbers deletes in smaller increments on CRDB CRDB doesn't like large deletes. While testing in the POC environment we found that deletes on the serial_numbers table could take hours. This change limits deletes to 1000 at a time (configurable) to avoid blocking other queries. Change-Id: I08455e25db1574579dd4d7b7125a08e9c913dff1	2020-11-20 13:44:52 +00:00
Michal Niewrzal	7c384c8293	Merge 'master' branch Change-Id: I1eefd5a56449e577820977d61fa4a22bdd4fc230	2020-11-16 10:02:54 +01:00
Cameron Ayer	5a337c48ec	{cmd,private,storagenode}: create storage dir verification during setup Previously, we created a new file to use for directory verification every time the storage node starts. This is not helpful if the storage node points to the wrong directory when restarting. Now we will only create the file on setup. Now the file should be created only once and will be verified at runtime. Change-Id: Id529f681469138d368e5ea3c63159befe62b1a5b	2020-11-11 11:01:36 -05:00
Michal Niewrzal	7dde184cb5	Merge 'master' branch Change-Id: I6070089128a150a4dd501bbc62a1f8b394aa643e	2020-11-10 11:58:59 +00:00
Kaloyan Raev	3ed4183e52	satellite/metainfo: delete object to use metabase Change-Id: I2ab63a719fdbc1f8a7fbb4ad73d51a2d2dcfadc6	2020-11-10 09:55:23 +00:00
Moby von Briesen	db6bc6503d	satellite/metainfo: Update metainfo RS config to more easily support multiple RS schemes. Make metainfo.RSConfig a valid pflag config value. This allows us to configure the RSConfig as a string like k/m/o/n-shareSize, which makes having multiple supported RS schemes easier in the future. RS-related config values that are no longer needed have been removed (MinTotalThreshold, MaxTotalThreshold, MaxBufferMem, Verify). Change-Id: I0178ae467dcf4375c504e7202f31443d627c15e1	2020-11-09 22:16:13 +00:00
Egon Elbre	e1f37ece08	private/lifecycle: warn on slow service shutdown Adds a warning when service takes over 15s to shutdown. Change-Id: I44307b4b7560ac2978f62a623894a4af4f5a7402	2020-11-06 15:01:54 +00:00
Egon Elbre	cbc1922590	private/dbutil/pgtest: use round robin to pick databases Currently we were picking databases randomly for testing, however a round-robin picking might have more predictable behavior and cause less cockroach timeouts. Change-Id: I74ac0d5b38c89452d3c46d3811330e46e7449514	2020-11-06 12:55:55 +00:00
Egon Elbre	60bb34a096	private/testblobs: fix data race in BadDB The database is accessed concurrently and modifications need to be synchronized. Change-Id: I72a91ae2eac55d48a15aa7b0af8966aa3b038021	2020-11-06 11:56:46 +02:00
Egon Elbre	c55c23f81f	private/testplanet: add STORJ_TESTPLANET_ABSTIME Allow setting STORJ_TESTPLANET_ABSTIME=1 to use absolute time in testplanet logs. Change-Id: I4df5dfc1fc055d9726aed65242ab71338550e671	2020-11-03 15:44:18 +02:00
Egon Elbre	0c23b12038	private/testplanet: use relative time logging Instead of printing RFC3339 timestamp, we'll print relative time since the creation of the testplanet. Before: logger.go:130: 2020-11-02T14:54:53.864+0200 DEBUG versioncontrol addr= 127.0.0.1:30904 After: log.go:54: 00:00.002 DEBUG versioncontrol addr= 127.0.0.1:30945 Change-Id: Ifa423f9d54d4e7c583d9290fe36a791d28166f8f	2020-11-02 17:53:18 +00:00
Egon Elbre	7183dca6cb	all: fix defers in loop defer should not be called in a loop. Change-Id: Ifa5a25a56402814b974bcdfb0c2fce56df8e7e59	2020-11-02 15:06:38 +02:00
Kaloyan Raev	b8c6fb764c	satellite/metainfo: add metabase to metainfo service Change-Id: Ie3ff238b138d8a57d99e32b13f7a71aa624d53e3	2020-10-30 12:49:47 +02:00
Egon Elbre	e0dca4042d	all: add pprof labels for debugger By using pprof.Labels debugger is able to show service/peer names in goroutine names. Change-Id: I5f55253470f7cc7e556f8e8b87f746394e41675f	2020-10-29 15:10:07 +00:00
Egon Elbre	caefde6b32	private/{dbutil,tagsql}: pass ctx to database opening Database opening usually dial and hence we should pass ctx to them. Change-Id: Iaa2875981570d83e65be3710f841cf30349f807b	2020-10-29 10:51:29 +00:00
Egon Elbre	89ce1fe626	storagenode/storagenodedb: add ctx to OpenNew and OpenExisting Database opening usually dial and hence we should pass ctx to them. Change-Id: I9160ae95829f22f347bd525904898a47279a7427	2020-10-29 09:52:37 +02:00
Egon Elbre	d0beaa4a87	pkg/revocation: pass ctx into opening the database Opening a databases requires ctx, this is first step to passing ctx to the appropriate level. Change-Id: I12700f39a320206d8a2a4e054452319f8585b44b	2020-10-29 07:15:36 +00:00
Jessica Grebenschikov	f5880f6833	satellite/orders: rollout phase3 of SettlementWithWindow endpoint Change-Id: Id19fae4f444c83157ce58c933a18be1898430ad0	2020-10-26 14:56:28 +00:00
Yaroslav Vorobiov	139a7ee959	private/migrate: add ablity to create dbs during migration Use tagsql.DB pointer as step database, to propagate changes back and forth between actual database and migration. Adds CreateDB operation to the migration step to be able to create new dbs before executing migration action. Adjusts storagenode database migration to use inner tagsql.DB pointer of each database as step.DB. Adjusts satellite dabase migration, adds proxy migrationDB field to satellite db that wraps itself as tagsql.DB, pointer of which is used as step.DB. Change-Id: Ifed4de5b01a356cf7b37db64d2eaeb7b61982c5c	2020-10-15 15:28:04 +03:00
Egon Elbre	2268cc1df3	all: fix linter complaints Change-Id: Ia01404dbb6bdd19a146fa10ff7302e08f87a8c95	2020-10-13 15:59:01 +03:00
Stefan Benten	14a2050b8d	pkg/auth: move package to consoleauth To avoid further name collisions, the very broad named package gets moved into the consoleauth package where its also mainly being used. Change-Id: Ie563c9700adbf0553baca2b7b8ba4a1d9c29d144	2020-10-06 14:15:07 +02:00
Egon Elbre	4e8d53c8fb	private/dbutil/pgutil: ensure storagenode doesn't depend on pgx pgx is a large dependency and there's no need to include it in storagenode binary. Change-Id: I49c304c6420733d5f095d7edb35d32811210e41a	2020-09-30 14:28:47 +00:00
Yaroslav Vorobiov	a840cb71e7	storagenode: check db version before run Change-Id: I912f63fd62f2bff10341346c28dfb92fcd683806	2020-09-30 10:58:09 +00:00
Egon Elbre	c23a8e3b81	go.mod: update pgx to v4.9.0 Fix query to use TextArray instead of VarcharArray. Fix queries to use the correct type. Change-Id: Ibb7e55adba277d05778118d81ca697470e72c374	2020-09-29 19:03:08 +00:00
Egon Elbre	2d27bc8787	satellite/satellitedb: separate cockroach for migration tests Currently Cockroach migration test is the most heavy with regards to schema changes. This causes other tests to time out. This adds an alternate cockroach instance that is used for migration tests. Change-Id: I01fe9313527ff002f0bb0914dd52c3645b8eaf6d	2020-09-29 09:31:33 +00:00
Jessica Grebenschikov	4a2c66fa06	satellite/accounting: add cache for getting project storage and bw limits This PR adds the following items: 1) an in-memory read-only cache thats stores project limit info for projectIDs This cache is stored in-memory since this is expected to be a small amount of data. In this implementation we are only storing in the cache projects that have been accessed. Currently for the largest Satellite (eu-west) there is about 4500 total projects. So storing the storage limit (int64) and the bandwidth limit (int64), this would end up being about 200kb (including the 32 byte project ID) if all 4500 projectIDs were in the cache. So this all fits in memory for the time being. At some point it may not as usage grows, but that seems years out. The cache is a read only cache. When requests come in to upload/download a file, we will read from the cache what the current limits are for that project. If the cache does not contain the projectID, it will get the info from the database (satellitedb project table), then add it to the cache. The only time the values in the cache are modified is when either a) the project ID is not in the cache, or b) the item in the cache has expired (default 10mins), then the data gets refreshed out of the database. This occurs by default every 10 mins. This means that if we update the usage limits in the database, that change might not show up in the cache for 10 mins which mean it will not be reflected to limit end users uploading/downloading files for that time period.. Change-Id: I3fd7056cf963676009834fcbcf9c4a0922ca4a8f	2020-09-25 16:28:49 +00:00
Stefan Benten	8b4b44d42b	private/web: fix ratelimter IP handling Change-Id: Idab43f15fb5b90d9d831193d0e7119e64513f271	2020-09-05 18:39:49 +02:00
Jennifer Johnson	4e2413a99d	satellite/satellitedb: uses vetted_at field to select for reputable nodes Additionally, this PR changes NewNodeFraction devDefault and testplanet config from 0.05 to 1. This is because many tests relied on selecting nodes that were reputable based on audit and uptime counts of 0, in effect, selecting new nodes as reputable ones. However, since reputation is now indicated by a vetted_at db field that is explicitly set rather than implied by audit and uptime counts, it would be more complicated to try to update all of the nodes' reputations before selecting nodes for tests. Now we just allow all test nodes to be new if needed. Change-Id: Ib9531be77408662315b948fd029cee925ed2ca1d	2020-09-04 16:45:32 +00:00
Michal Niewrzal	aa47e70f03	satellite/metainfo: use metabase.SegmentKey with metainfo.Service Instead of using string or []byte we will be using dedicated type SegmentKey. Change-Id: I6ca8039f0741f6f9837c69a6d070228ed10f2220	2020-09-03 15:11:32 +00:00
Egon Elbre	77b53bd21c	private/lifecycle: log fatal ending to a runner Change-Id: If07b62dad7f4ac235dd51a3a217c2c56d30978ad	2020-09-03 16:54:40 +03:00
Cameron Ayer	ca0c1a5f0c	storagenode/{monitor,pieces}, storage/filestore: add loop to check storage directory writability periodically create and delete a temp file in the storage directory to verify writability. If this check fails, shut the node down. Change-Id: I433e3a8d1d775fc779ae78e7cf3144a05ffd0574	2020-08-31 21:20:49 +00:00
Moby von Briesen	5d21e85529	satellite/audit/queue: Separate audit queue into two separate structs. * The audit worker wants to get items from the queue and process them. * The audit chore wants to create new queues and swap them in when the old queue has been processed. This change adds a "Queues" struct which handles the concurrency issues around the worker fetching a queue and the chore swapping a new queue in. It simplifies the logic of the "Queue" struct to its bare bones, so that it behaves like a normal queue with no need to understand the details of swapping and worker/chore interactions. Change-Id: Ic3689ede97a528e7590e98338cedddfa51794e1b	2020-08-31 20:51:25 +00:00
stefanbenten	4645805b18	private/dbutil: set connMaxLifetime to 30 minutes To prevent longlived unused connections, set the maximum time to 30 minutes to prevent proxies and loadbalancers forcefully cutting the connection. This helps in scenarios with low load/requests to a DB. Change-Id: I7dba15ef97f6f6541e872a6fb1d3a9bbbfe5bb50	2020-08-28 18:00:41 +00:00
Bill Thorp	dbb53151f0	private/testplanet: Decrease metainfo MaxBuckets test value to speed testing. TestMaxOutBuckets is one of our slower tests (50-90s). This change seems to make it 2-12s. It reduces the number of buckets that need to be created. It also removes unnecessary storage nodes. Change-Id: I1012fc6e9258b2f7674b16da4e8b418741c93eea	2020-08-26 17:31:31 +00:00
Jeff Wendling	91698207cf	storagenode: live tracking of order window usage This change accomplishes multiple things: 1. Instead of having a max in flight time, which means we effectively have a minimum bandwidth for uploads and downloads, we keep track of what windows have active requests happening in them. 2. We don't double check when we save the order to see if it is too old: by then, it's too late. A malicious uplink could just submit orders outside of the grace window and receive all the data, but the node would just not commit it, so the uplink gets free traffic. Because the endpoints also check for the order being too old, this would be a very tight race that depends on knowledge of the node system clock, but best to not have the race exist. Instead, we piggy back off of the in flight tracking and do the check when we start to handle the order, and commit at the end. 3. Change the functions that send orders and list unsent orders to accept a time at which that operation is happening. This way, in tests, we can pretend we're listing or sending far into the future after the windows are available to send, rather than exposing test functions to modify internal state about the grace period to get the desired effect. This brings tests closer to actual usage in production. 4. Change the calculation for if an order is allowed to be enqueued due to the grace period to just look at the order creation time, rather than some computation involving the window it will be in. In this way, you can easily answer the question of "will this order be accepted?" by asking "is it older than X?" where X is the grace period. 5. Increases the frequency we check to send up orders to once every 5 minutes instead of once every hour because we already have hour-long buffering due to the windows. This decreases the maximum latency that an order will be reported back to the satellite by 55 minutes. Change-Id: Ie08b90d139d45ee89b82347e191a2f8db1b88036	2020-08-19 19:42:33 +00:00
Cameron Ayer	0155c21b44	private/testplanet, storagenode/{monitor,pieces}: write storage dir verification file on run and verify on loop On run, write the storage directory verification file. Every time the node runs it will write the file even if it already exists. The reason we do this is because if the verification file is missing, the SN doesn't know whether it is an incorrect directory, or it simply hasn't written the file yet, and we want to keep nodes running without needing operator intervention. Once this change has been a part of the minimum version for several releases, we will move the file creation from the run command to the setup command. Run will only verify its existence. Change-Id: Ib7d20e78e711c63817db0ab3036a50af0e8f49cb	2020-08-19 19:12:21 +00:00
Cameron Ayer	586e6f2f13	private/testblobs, storage, storage/filestore: add storage dir verification to filestore Sometimes SNOs fail to properly configure or lose connection to their storage directory which can result in DQ. This causes unnecessary repair and is unfortunate for all parties. This change introduces the creation of a special file in the storage directory at runtime containing the node ID. While the storage node runs, it periodically verifies that it can find said file with the correct contents in the correct location. If not, the node will shut down with an error message. This change will solve the issue of nodes losing access to the storage directory, but it will not solve the issue of nodes pointing to the wrong directory, as the identifying file is created each time the node starts up. After this change has been the minimum version for a few releases, we will remove the creation of the directory-identifying file from the storage node run command and add it to the setup command. Change-Id: Ib7b10e96ac07373219835e39239e93957e7667a4	2020-08-19 17:18:14 +00:00
Yingrong Zhao	14ad7a4f1c	satellite/metainfo: add limiter for objectdeletion and piecedeletion services This PR adds a limiter on the amount of concurrent objects deletion can be handled so we don't run out of memory. Change-Id: Id2ce368af6f86845fcdfd34cb2f5e460efe9b272	2020-08-19 16:08:29 +00:00
Moby von Briesen	708cb48aa6	storagenode/orders: implement orders filestore on storagenode * Add all new orders to the orders filestore instead of the database. * Submit orders from the filestore to the new satellite SettleWindow endpoint. The orders filestore will eventually replace the orders DB completely. For now, we will still be checking the orders DB and submitting those orders if they exist. In a later release, we will completely remove the orders DB, but we need both the DB and filestore for the transitionary period. Change-Id: Iac8780fd5ab770296181bbd313e1d335f072d4dc	2020-08-19 15:00:35 +00:00
Ivan Fraixedes	7f8df74070	private/testplanet: Use config with name set when empty In testplanet Run function we create a new configuration variable on each t.Run for setting the value to the config name field when it's empty, however the new copy of the configuration was not used. Change-Id: I9da34e743f9648850c96556eab0349e742db3aac	2020-08-19 13:12:10 +02:00
Egon Elbre	94a09ce20b	all: add missing dots Change-Id: I93b86c9fb3398c5d3c9121b8859dad1c615fa23a	2020-08-11 17:50:01 +03:00

1 2 3 4 5 ...

304 Commits