storj

Author	SHA1	Message	Date
ccase	034f9845b1	storage: Plumb limit through storage backends. * Plumbs the limit through all backends ensuring they don't do unnecessary work. * Don't arbitrarily limit at the backend with hardcoded defaults. The limit will be set by the caller. Prior to this change the code on recursive in some backends would do 10k results from the database and then only return the first 1k (throwing out 9k of them). Prior to this change some backends had no limit at all (e.g. redis). Change-Id: I1f327eefe095776d123dd11362cd00994c22efdf	2020-01-19 21:23:20 +00:00
Egon Elbre	1abfe42142	satellite: use tagsql Change-Id: I2170dee409fb0c2fe85913ddd36e7811a3b853ed	2020-01-19 14:39:16 +02:00
ccase	14b43b7e9b	storage/postgreskv/schema/data.go: Regenerate migrations that failed to update. Change-Id: I9fd5a9a5414214faea5f8c476778fccbe022cb6c	2020-01-19 11:22:00 +00:00
Stefan Benten	409d4123bb	Add proper Pathdata Index (#3750 )	2020-01-17 00:48:59 +01:00
Cameron Ayer	4424697d7f	satellite/accounting: refactor live accounting to hold current estimated totals live accounting used to be a cache to store writes before they are picked up during the tally iteration, after which the cache is cleared. This created a window in which users could potentially exceed the storage limit. This PR refactors live accounting to hold current estimations of space used per project. This should also reduce DB load since we no longer need to query the satellite DB when checking space used for limiting. The mechanism by which the new live accounting system works is as follows: During the upload of any segment, the size of that segment is added to its respective project total in live accounting. At the beginning of the tally iteration we record the current values in live accounting as `initialLiveTotals`. At the end of the tally iteration we again record the current totals in live accounting as `latestLiveTotals`. The metainfo loop observer in tally allows us to get the project totals from what it observed in metainfo DB which are stored in `tallyProjectTotals`. However, for any particular segment uploaded during the metainfo loop, the observer may or may not have seen it. Thus, we take half of the difference between `latestLiveTotals` and `initialLiveTotals`, and add that to the total that was found during tally and set that as the new live accounting total. Initially, live accounting was storing the total stored amount across all nodes rather than the segment size, which is inconsistent with how we record amounts stored in the project accounting DB, so we have refactored live accounting to record segment size Change-Id: Ie48bfdef453428fcdc180b2d781a69d58fd927fb	2020-01-16 10:26:49 -05:00
Egon Elbre	64fb2d3d2f	Revert "dbutil: statically require all databases accesses to use contexts" This reverts commit `8e242cd012`. Revert because lib/pq has known issues with context cancellation. These issues need to be resolved before these changes can be merged. Change-Id: I160af51dbc2d67c5449aafa406a403e5367bb555	2020-01-15 07:28:00 +00:00
JT Olio	8e242cd012	dbutil: statically require all databases accesses to use contexts this will allow for some nice runtime analysis down the road. also, this allows for wrapping database handles in a way that can interact with these contexts requires https://review.dev.storj.io/c/storj/dbx/+/514 Change-Id: Ib087b7cd73296dd2c1e0331314da34d861f61d2b	2020-01-14 18:20:47 -05:00
JT Olio	86093d0940	postgreskv: drop not null on buckets Change-Id: I2a2bd7709de211a9d1808248af573f1bb630cfd5	2020-01-14 12:07:53 -07:00
JT Olio	e1ba3931ec	postgres2: use cockroachkv impl against postgres this allows for setting $STORJ_METAINFO_POSTGRESQL_USE_ALT=yes if you want to use the cockroachkv implementation for metainfo against postgres Change-Id: I0c9458c83fd67ee63ef4a78351e64a80a0647408	2020-01-13 14:51:56 -06:00
Egon Elbre	b9740f0c0a	storage/cockroachkv: add ctx argument Change-Id: Ib6c29f44722b0354afcd499a0e567f04aef7eb28	2020-01-13 15:57:47 +02:00
Egon Elbre	0835b9024c	private/dbutil/pgutil: add ctx argument Change-Id: Icfd56ca8c1f831ad56c0195a0b883e8f0618daaf	2020-01-13 15:27:06 +02:00
Simon Guindon	5a1b2f49f4	storage/cockroachkv: add application name to the db connection string. CockroachDB collects query metrics and separates them by application name and we were not setting the correct application name for the cockroachkv client. This PR calls our existing function that appends it to the connection string. Change-Id: I4a97ed248c31f8b187c680d84b45472f0d50fd7e	2020-01-10 15:11:08 -05:00
Egon Elbre	d3d75a597f	satellite,storage: clean global ctx usage in tests Change-Id: I89ea5c95fc6895518b464f8eb6a4c74c6ae37651	2020-01-09 10:37:21 +00:00
paul cannon	0135852a0e	storage/postgreskv: use transactional helper We may never need this code to work with CockroachDB, but I'm on a mission to avoid problematic uses of Begin() and BeginTx(), and anywhere they appear is a possible place for someone to copy-and-paste and do something wrong. dbutil.WithTx makes this code a little bit simpler too, so it seems worthwhile. Change-Id: I9b4ab484db4590cad5ab07de515bbf5d9708daed	2020-01-06 23:24:44 +00:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Isaac Hess	7d1e28ea30	storagenode: Include trash space when calculating space used This commit adds functionality to include the space used in the trash directory when calculating available space on the node. It also includes this trash value in the space used cache, with methods to keep the cache up-to-date as files are trashed, restored, and emptied. As part of the commit, the RestoreTrash and EmptyTrash methods have slightly changed signatures. RestoreTrash now also returns the keys that were restored, while EmptyTrash also returns the total disk space recovered. Each of these changes makes it possible to keep the cache up-to-date and know how much space is being used/recovered. Also changed is the signature of PieceStoreAccess.ContentSize method. Previously this method returns only the content size of the blob, removing the size of any header data. This method has been renamed `Size` and returns both the full disk size and content size of the blob. This allows us to only stat the file once, and in some instances (i.e. cache) knowing the full file size is useful. Note: This commit simply adds the trash size data to the piece size data we were already collecting. The piece size data is not accurate for all use-cases (e.g. because it does not contain piece header data); however, this commit does not fix that problem. Now that the ContentSize (Size) method returns the full size of the file, it should be easier to fix this problem in a future commit. Change-Id: I4a6cae09e262c8452a618116d1dc66b687f59f85	2019-12-23 19:07:03 -07:00
Jeff Wendling	efa08d4081	storage/cockroachkv: use different batch size for recursive iteration the relatively small batch size of 128 was chosen so that if we have a set of keys like a/1 a/2 ... a/100000 b c list operations would not have to walk 100k keys inside of a/ before skipping to b. unfortunately, iteration is also used by the metainfo loop. in that case, it's doing a recursive listing, and so there's no need to skip large prefixes. thus, we can use a bigger batch size when recursive listing is requested. Change-Id: I87cf1ba385b6eb2928c5b7cc5e0f7a8c7bd126d9	2019-12-17 18:24:26 +00:00
Egon Elbre	7a36507a0a	private/testcontext: ensure we call cleanup everywhere Change-Id: Icb921144b651611d78f3736629430d05c3b8a7d3	2019-12-17 14:16:09 +00:00
paul cannon	2f7465c294	private/dbutil: register "cockroach" as sql.DB driver this will allow us to inspect the type of `db.Driver()` on *sql.DB connections to correctly differentiate between pg and crdb conns. as a bonus, this moves all concerns about when to replace "cockroach://" with "postgres://" out of view, letting the thin shim "driver" take care of that. Change-Id: Ib24103ab7c508231e681f89a7321b623e4e125e9	2019-12-16 19:10:00 +00:00
paul cannon	94651921c3	storage/testsuite: pass ctx in to bulk setup methods to make them cancelable. Also, * rename BulkDelete->BulkDeleteAll this leaves room for a new method `BulkDelete(items storage.Items)` that does a bulk deletion of a specified list of items, as opposed to deleting _everything_. such a method would be used in the `cleanupItems()` function found in utils.go, because when individual deletes are fairly slow, that step takes way too long during tests. * use BulkDelete method if available nothing currently provides `BulkDelete(items storage.Items) error`, but we made use of it with the Bigtable testing and code, and may make use of it again when adding new kv backends. * and eliminate the global context in test_iterate.go Change-Id: I171c7a3818beffbad969b131e98b9bbe3f324bf2	2019-12-10 20:22:08 +00:00
Jeff Wendling	48da8baab5	storj-sim: work with cockroach:// urls for satellite databases for storj-sim to work, we need to avoid schemas in cockroach urls so we have storj-sim create namespaced databases instead of schemas and we have the migrate command create the database in the same way that it would create a schema for postgres. then it works! a follow up commit will move the creation of the database/schemas into storj-sim's setup step so that we can avoid doing these icky creations during normal migration calls. it will also make the pointerdb have an explicit call to migrate instead of just doing it every time it's opened. Change-Id: If69ef5cb96b6866b0438c761bd445afb3597ae5f	2019-12-09 23:44:00 +00:00
Jeff Wendling	1df7b360d7	satellite/metainfo: Use cockroachdb client for metainfo db Change-Id: I3cf7a00de4f654eacaffbb494f4841c64a2d9ce6	2019-12-05 10:33:54 -07:00
Jeff Wendling	f15192ea40	storage/cockroachkv: initial client implementation Change-Id: I72ae558739b2ca532f31cb64f480b43437b6b309	2019-12-05 16:11:23 +00:00
paul cannon	378b863b2b	private,satellite: unite all the "temp db schema" things first, so that they all work the same way, because it's getting complicated, and second, so that we can do the appropriate thing instead of CREATE SCHEMA for cockroachdb. Change-Id: I27fbaeeb6223a3e06d97bcf692a2d014b31465f7	2019-12-05 15:36:59 +00:00
Ivan Fraixedes	42c61138e8	storage: Improve doc comments delete methods (#3591 ) Improve the documentation of several methods involved in the delete operation to make clear their behavior without having to inspect their logic.	2019-12-02 12:18:20 +01:00
Isaac Hess	a6235d3962	storage/filestore: Monitor when we open files in trash Change-Id: I817bf8349c2e1ba55e1490f06162af1099bebdb0	2019-11-26 14:38:49 -07:00
Isaac Hess	56f8fd2dd7	storagenode/pieces: Add EmptyTrash functionality (#3640 ) * storagenode/pieces: Add EmptyTrash functionality * storagenode/pieces: Fix err * storagenode/pieces: Fix lint	2019-11-26 09:25:21 -07:00
Matt Robinson	9af97d366a	Make sed a little more cross platformable (#3629 )	2019-11-22 11:17:02 -07:00
Isaac Hess	2166c2a21b	storage/filestore: Add Trash and RestoreTrash to Blobs (#3529 ) * storage/filestore: Add Trash and RestoreTrash to Blobs * Change restore to be satellite-specific * Fix comment * Fix merge rename conflict	2019-11-14 15:19:15 -07:00
Egon Elbre	ee6c1cac8a	private: rename internal to private (#3573 )	2019-11-14 21:46:15 +02:00
Egon Elbre	1e64006e32	lint: add staticcheck as a separate step (#3569 )	2019-11-14 10:31:30 +02:00
paul cannon	d5963d81d0	storage/postgreskv: fix ultra-slow nonrecursive list (#3564 ) This is based on Jeff's most excellent work to identify why non-recursive listing under postgreskv was phenomenally slow. It turns out PostgreSQL's query planner was actually using two sequential scans of the pathdata table to do its job. It's unclear for how long that has been happening, but obviously it won't scale any further. The main change is propagating bucket association with pathnames through the CTE so that the query planner lets itself use the pathdata index on (bucket, fullpath) for the skipping-forward part. Jeff also had some changes to the range ends to keep NULL from being used- I believe with the intent of making sure the query planner was able to use the pathdata index. My tests on postgres 9.6 and 11 indicate that those changes don't make any appreciable difference in performance or query plan, so I'm going to leave them off for now to avoid a careful audit of the semantic differences. There is a test included here, which only serves to check that the new version of the function is indeed active. To actually ensure that no sequential scans are being used in the query plan anymore, our tests would need to be run against a test db with lots of data already loaded in it, and that isn't feasible for now. Change-Id: Iffe9a1f411c54a2f742a4abb8f2df0c64fd662cb	2019-11-13 17:52:14 -06:00
paul cannon	bd89f51c66	Keep v0pieceinfo database isolated (#3364 ) * put TestCreateV0 back in StoreForTest * avoid direct handles to V0 pieceinfo db * type mismatch fix * use storage.Blobs interface in store_test.go ..instead of filestore.Store. this will allow filestore.Store to become unexported. * unexport filestore.Store rename it to blobStore. things should use the storage.Blobs interface instead. changes in this commit are purely mechanical (made through the "refactor" tool in Gocode followed by search/replace on the word "Store" within the storage/filestore/ directory). * kill filestore.StoreForTest now that filestore.blobStore is unexported, there isn't a need for a specialized wrapper type. this (not coincidentally) also makes it possible for the WriterForFormatVersion() method on storagenode/pieces.StoreForTest to work, without requiring everything to wrap the store.blobs attribute in a filestore.StoreForTest, which was impractical.	2019-11-13 13:15:31 -06:00
paul cannon	0d0b5a449b	storage/filestore: monkit event for delete queuing (#3507 )	2019-11-12 15:56:57 -05:00
paul cannon	0c025fa937	storage/: remove reverse-key-listing feature We don't use reverse listing in any of our code, outside of tests, and it is only exposed through libuplink in the lib/uplink.(*Project).ListBuckets() API. We also don't know of any users who might have a need for reverse listing through ListBuckets(). Since one of our prospective pointerdb backends can not support backwards iteration, and because of the above considerations, we are going to remove the reverse listing feature. Change-Id: I8d2a1f33d01ee70b79918d584b8c671f57eef2a0	2019-11-12 18:47:51 +00:00
Isaac Hess	4d26d0a6a6	storagenode/pieces: Add migration from v0 piece to v1 piece (#3401 )	2019-11-04 17:59:45 +01:00
Jess G	8d92c288e2	satellitedb: separate migration into subcommand (#3436 ) * separate sadb migration, add version check * update checkversion to do same validation as migration * changes per CR * add sa migration to storj-sim * add different debug port in storj-sim for migration * add wait for exit for storj-sim migration * update sa docker entrypoint to support migration * storj-sim satellite parts all wait for migration * upgrade golang-migrate/migrate to v4 because bug * fix go mod tidy	2019-11-02 13:09:07 -07:00
Jennifer Li Johnson	76b64b79ba	cmd/identity: allow using redis for RevocationDB (#3259 )	2019-11-01 13:27:47 -04:00
Cameron	76ad83f12c	satellite/accounting: add redis support to live accounting (#3213 ) * set up redis support in live accounting * move live.Service interface into accounting package and rename to Cache, pass into satellite * refactor Cache to store one int64 total, add IncrBy method to redis client implementation * add monkit tracing to live accounting	2019-10-16 12:50:29 -04:00
Egon Elbre	e9c36d560f	satellite: make PointerDB an argument to satellite.New (#3233 )	2019-10-10 21:06:26 +03:00
Egon Elbre	a801fab66a	all: add archview annotations (#2964 )	2019-09-10 16:24:16 +03:00
Egon Elbre	00b2e1a7d7	all: enable staticcheck (#2849 ) * by having megacheck in disable it also disabled staticcheck * fix closing body * keep interfacer disabled * hide bodies * don't use deprecated func * fix dead code * fix potential overrun * keep stylecheck disabled * don't pass nil as context * fix infinite recursion * remove extraneous return * fix data race * use correct func * ignore unused var * remove unused consts	2019-08-22 13:40:15 +02:00
Egon Elbre	2d69d47655	all: fix Error.New formatting (#2840 )	2019-08-21 19:30:29 +03:00
Jess G	022f5d2e14	storagenode: add space used cache for pieces (#2753 ) * add cache, update cache w/piece create/delete * add service w/loop to cache to recalculate space used cache * add piecestore cache to other sn svcs to use * add table to persist the total space used * rm cache where not needed * rm stuff from sn svcs * start fixing tests, changes per comments * update commits * add unit tests * fix commiting before we write header bytes * fix cache create test * copy cache map, add started back to recalc * fix test * add test, update comments	2019-08-12 14:43:05 -07:00
paul cannon	17bdb5e9e5	move piece info into files (#2629 ) Deprecate the pieceinfo database, and start storing piece info as a header to piece files. Institute a "storage format version" concept allowing us to handle pieces stored under multiple different types of storage. Add a piece_expirations table which will still be used to track expiration times, so we can query it, but which should be much smaller than the pieceinfo database would be for the same number of pieces. (Only pieces with expiration times need to be stored in piece_expirations, and we don't need to store large byte blobs like the serialized order limit, etc.) Use specialized names for accessing any functionality related only to dealing with V0 pieces (e.g., `store.V0PieceInfo()`). Move SpaceUsed- type functionality under the purview of the piece store. Add some generic interfaces for traversing all blobs or all pieces. Add lots of tests.	2019-08-07 20:47:30 -05:00
Egon Elbre	c8edeb0257	satellite/overlay: rename overlay.Cache to overlay.Service (#2717 )	2019-08-06 19:35:59 +03:00
paul cannon	3ebeda3334	storage/redis: make sure to return after put() (#2625 )	2019-07-24 10:28:13 +03:00
Kaloyan Raev	0e1cb7bfb8	CompareAndSwap in KeyValueStore (#2602 )	2019-07-23 22:46:33 +03:00
Ivan Fraixedes	3c8f1370d2	[v3 2137] - Add more info to find out repair failures (#2623 ) * pkg/datarepair/repairer: Track always time for repair Make a minor change in the worker function of the repairer, that when successful, always track the metric time for repair independently if the time since checker queue metric can be tracked. * storage/postgreskv: Wrap error in Get func Wrap the returned error of the Get function as it is done when the query doesn't return any row. * satellite/metainfo: Move debug msg to the right place NewStore function was writing a debug log message when the DB was connected, however it was always writing it out despite if an error happened when getting the connection. * pkg/datarepair/repairer: Wrap error before logging it Wrap the error returned by process which is executed by the Run method of the repairer service to add context to the error log message. * pkg/datarepair/repairer: Make errors more specific in worker Make the error messages of the "worker" method of the Service more specific and the logged message for such errors. * pkg/storage/repair: Improve error reporting Repair In order of improving the error reporting by the pkg/storage/repair.Repair method, several errors of this method and functions/methods which this one relies one have been updated to be wrapper into their corresponding classes. * pkg/storage/segments: Track path param of Repair method Track in monkit the path parameter passed to the Repair method. * satellite/satellitedb: Wrap Error returned by Delete Wrap the error returned by repairQueue.Delete method to enhance the error with a class and stack and the pkg/storage/segments.Repairer.Repair method get a more contextualized error from it.	2019-07-23 16:28:06 +02:00
Jennifer Li Johnson	53d96be44a	Stylistic Go Cleanup (#2524 )	2019-07-22 15:10:04 -04:00

1 2 3

144 Commits