storj

Author	SHA1	Message	Date
Jeff Wendling	7999d24f81	all: use monkit v3 this commit updates our monkit dependency to the v3 version where it outputs in an influx style. this makes discovery much easier as many tools are built to look at it this way. graphite and rothko will suffer some due to no longer being a tree based on dots. hopefully time will exist to update rothko to index based on the new metric format. it adds an influx output for the statreceiver so that we can write to influxdb v1 or v2 directly. Change-Id: Iae9f9494a6d29cfbd1f932a5e71a891b490415ff	2020-02-05 23:53:17 +00:00
Jeff Wendling	d20db90cff	private/dbutil/txutil: create new transactions for retries it was noticed that if you had a long lived transaction A that was blocking some other transaction B and A was being aborted due to retriable errors, then transaction B was never given priority. this was due to using savepoints to do lightweight retries. this behavior was problematic because we had some queries blocked for over 16 hours, so this commit addresses the issue with two prongs: 1. bound the amount of time we will retry a transaction 2. create new transactions when a retry is needed the first ensures that we never wait for 16 hours, and the value chosen is 10 minutes. that should be long enough for an ample amount of retries for small queries, and huge queries probably shouldn't be retried, even if possible: it's more preferable to find a way to make them smaller. the second ensures that even in the case of retries, queries that are blocked on the aborted transaction gain priority to run. between those two changes, the maximum stall time due to retries should be bounded to around 10 minutes. Change-Id: Icf898501ef505a89738820a3fae2580988f9f5f4	2020-02-01 18:34:28 +00:00
Jeff Wendling	71ff044edb	storagenode/bandwidth: fix tests to not fail for 10 hours near the end of the month Change-Id: I390569a8702164c42edddd3be020e93782227c2e	2020-01-31 16:25:52 -07:00
Egon Elbre	d0b4272467	storagenode: fix global logger in tests https://github.com/storj/storj/wiki/Testing#logging Change-Id: Ic6a31360bcfedae3f37f6b2536a345f00e33cd78	2020-01-31 14:09:28 +00:00
Isaac Hess	4dafd03f11	storagenode: Prevent negative values in piece_space_used, migrate negatives to 0 Change-Id: Ibd663db087058c928190aa52c520f22e9338dd04	2020-01-30 13:03:18 -05:00
Jeff Wendling	21b65ca3b0	storagenode/storagenodedb: migrate to set total to content_size Change-Id: I4906c2fe9cdb3a32c045c98039d4bde6b8b809e3	2020-01-30 08:53:12 -07:00
Isaac Hess	14fd6a9ef0	storagenode/pieces: Track total piece size This change updates the storagenode piecestore apis to expose access to the full piece size stored on disk. Previously we only had access to (and only kept a cache of) the content size used for all pieces. This was inaccurate when reporting the amount of disk space used by nodes. We now have access to the total content size, as well as the total disk usage, of all pieces. The pieces cache also keeps a cache of the total piece size along with the content size. Change-Id: I4fffe7e1257e04c46021a2e37c5adc6fe69bee55	2020-01-23 11:00:24 -07:00
Egon Elbre	21f53e38da	storagenode/storagenodedb/storagenodedbtest: pass ctx as an argument Change-Id: I10b0a8ef3a7d5001e7d361f1873ad5987af1f9c2	2020-01-20 16:56:12 +02:00
Egon Elbre	f3b4bf2b7c	satellite/satellitedb/satellitedbtest: pass ctx as an argument ctx is created in most tests, instead pass in as argument to reduce code duplication. Change-Id: I466c51c008392001129c8b007c9d6b3619935ac4	2020-01-20 16:35:42 +02:00
Egon Elbre	1279eeae39	private/tagsql,storage: fixes to context cancellation Replace all the remaining uses of sql.DB with tagsql.DB to fix issues with context cancellation. Introduce tagsql.Open which helps to get rid of all tagsql.Wrap-s. Use tagsql in cockroachkv and postgreskv. Change-Id: I8946d203341cb85a25976896fc7881e1f704e779	2020-01-20 15:44:39 +02:00
Egon Elbre	3cd584c007	storagenode/gracefulexit: move database test Database tests belong to the interface, not the implementation. Change-Id: I5d76fdc7df0b0f32391ebad1b595ef26b062a9cb	2020-01-19 18:12:01 +00:00
Egon Elbre	7bc76624cf	storagenode/storagenodedb: fix closing in-use database Migration step was closing a database that was used by the migration itself. There is an active transaction over the database. Instead of closing in the same transaction we can wait until restart for the database cleanup. Change-Id: Ic971d8cea81a3ab783f4a1bdc6357009c8b31386	2020-01-19 16:18:46 +02:00
Egon Elbre	25b76fe63f	storagenode/storagenodedb: use tagsql Change-Id: Iba3b34a97b982deb4f72ce55517a294f249b6b55	2020-01-19 14:39:16 +02:00
Egon Elbre	59d06644b9	private/migrate: switch to tagsql Also added temporary types withRebind and withTagTx, which will be later removed. Currently they help to avoid changing the whole codebase at the same time. Change-Id: I7f07ba8f4709a23a463bfa67464628665a05808f	2020-01-19 14:39:16 +02:00
Moby von Briesen	e115bc1903	cmd/storagenode;storagenode/storagenodedb: add preflight database check for storagenode Ensure that database schema matches latest test migration schema before allowing the node to start up. Ensure minimal read/write functionality for each storagenode database before allowing the node to start up. This will eliminate many unhandled audit errors we are seeing. Change-Id: Ic0e628b04a9c35b7a8243f6a81d4683918170ba9	2020-01-16 18:44:46 +00:00
Egon Elbre	81d53b8097	storagenode/storagenodedb: fixes to row handling Change-Id: I3813310b48337428f13678a9fcba5c8a0e0b2b2a	2020-01-16 15:08:37 +00:00
Yingrong Zhao	07c2824d94	storagenode/gracefulexit: fix exit-status command output When exit succeeded, cli should display `Y` in Successful column and `100%` in PercentComplete. Change-Id: I6093eca207ecd618bb332af12e5e455bc8224dde	2020-01-15 14:58:15 +00:00
Egon Elbre	64fb2d3d2f	Revert "dbutil: statically require all databases accesses to use contexts" This reverts commit `8e242cd012`. Revert because lib/pq has known issues with context cancellation. These issues need to be resolved before these changes can be merged. Change-Id: I160af51dbc2d67c5449aafa406a403e5367bb555	2020-01-15 07:28:00 +00:00
JT Olio	8e242cd012	dbutil: statically require all databases accesses to use contexts this will allow for some nice runtime analysis down the road. also, this allows for wrapping database handles in a way that can interact with these contexts requires https://review.dev.storj.io/c/storj/dbx/+/514 Change-Id: Ib087b7cd73296dd2c1e0331314da34d861f61d2b	2020-01-14 18:20:47 -05:00
Egon Elbre	5af1f9e6d1	storagenode/{piecestore,storagenodedb}: use context in queries In endpoint.saveOrder, ensure we always try to save orders such that they can be settled. Change-Id: Ic9ac8f4bf684d8493282912ca97f386c1762e364	2020-01-14 20:27:26 +00:00
Egon Elbre	ff267168c5	private/migrate: add ctx argument Change-Id: I3d65912d89261386413c494c7ed1576fed4dcaf4	2020-01-13 15:52:26 +02:00
Egon Elbre	c7b846589e	private/dbutil/sqliteutil: add ctx argument Change-Id: If1caa9cde746817e62cae32a152eeec81959129c	2020-01-13 15:03:30 +02:00
Qweder93	cf19e141e0	storagenode/notifications: return unread count and fix json id, list-notifications method fix Change-Id: Ic56beac1f388d91a29c9e8266161715d09364520	2020-01-09 17:56:00 +00:00
Yingrong Zhao	ebeee58001	storagenode/gracefulexit: remove satellite entry when node fails precondition Change-Id: I3c215170f10f0053e4f8718ee31d64d93f52ec80	2020-01-08 18:11:58 +00:00
paul cannon	0c88a7b475	private/migrate: use transactional helpers and not Begin() This code needs to work against cockroachDB, so transactions must be retried when a retryable error is returned. This change puts migrate transactions into the dbutil.WithTx transactional helpers to achieve this in the easiest way. Change-Id: Ib930e82d55cb0257357a222ce9131e6e53372c03	2020-01-07 18:25:38 +00:00
Egon Elbre	6615ecc9b6	common: separate repository Change-Id: Ibb89c42060450e3839481a7e495bbe3ad940610a	2019-12-27 14:11:15 +02:00
Isaac Hess	7d1e28ea30	storagenode: Include trash space when calculating space used This commit adds functionality to include the space used in the trash directory when calculating available space on the node. It also includes this trash value in the space used cache, with methods to keep the cache up-to-date as files are trashed, restored, and emptied. As part of the commit, the RestoreTrash and EmptyTrash methods have slightly changed signatures. RestoreTrash now also returns the keys that were restored, while EmptyTrash also returns the total disk space recovered. Each of these changes makes it possible to keep the cache up-to-date and know how much space is being used/recovered. Also changed is the signature of PieceStoreAccess.ContentSize method. Previously this method returned only the content size of the blob, removing the size of any header data. This method has been renamed `Size` and returns both the full disk size and content size of the blob. This allows us to only stat the file once, and in some instances (i.e. cache) knowing the full file size is useful. Note: This commit simply adds the trash size data to the piece size data we were already collecting. The piece size data is not accurate for all use-cases (e.g. because it does not contain piece header data); however, this commit does not fix that problem. Now that the ContentSize (Size) method returns the full size of the file, it should be easier to fix this problem in a future commit. Change-Id: I4a6cae09e262c8452a618116d1dc66b687f59f85	2019-12-23 19:07:03 -07:00
Yingrong Zhao	6e71591b9b	satellitedb;storagenodedb: remove unnecessary use of DB transactions in graceful exit Change-Id: Ief0a28c6750c130896b48bfebfbea7fb3caa810f	2019-12-20 21:24:38 +00:00
Qweder93	e47ec84dee	storagenode notification service and api added Change-Id: I36898d7c43e1768e0cae0da8d83bb20b16f0cdde	2019-12-20 18:42:23 +00:00
Egon Elbre	7a36507a0a	private/testcontext: ensure we call cleanup everywhere Change-Id: Icb921144b651611d78f3736629430d05c3b8a7d3	2019-12-17 14:16:09 +00:00
Vitalii Shpital	53d9bc4530	storagenode/notifications: db created (#3707 )	2019-12-16 19:59:01 +02:00
littleskunk	c2ea75208f	storagenode/orderdb: fix db lock Change-Id: Id1add0ba7ae1b20bd98099bd4d3aff0fcfdd90c9	2019-12-15 23:41:22 +01:00
Jeff Wendling	fb8e78132d	storagenodedb: reenable utccheck in tests Change-Id: If7d64dd4ae58e4b656ff9122ae3195b2a5173cb3	2019-12-10 23:17:14 +00:00
Isaac Hess	6aeddf2f53	storagenode/pieces: Add Trash and RestoreTrash to piecestore (#3575 ) * storagenode/pieces: Add Trash and RestoreTrash to piecestore * Add index for expiration trash	2019-11-20 09:28:49 -07:00
Vitalii Shpital	61c8bcc9a6	web/storagenode: egress chart implemented (#3574 )	2019-11-20 16:37:57 +02:00
Egon Elbre	ee6c1cac8a	private: rename internal to private (#3573 )	2019-11-14 21:46:15 +02:00
Egon Elbre	1a54007f1c	storagenode/storagenodedb: don't log opening of each database (#3571 )	2019-11-14 17:08:16 +02:00
paul cannon	bd89f51c66	Keep v0pieceinfo database isolated (#3364 ) * put TestCreateV0 back in StoreForTest * avoid direct handles to V0 pieceinfo db * type mismatch fix * use storage.Blobs interface in store_test.go ..instead of filestore.Store. this will allow filestore.Store to become unexported. * unexport filestore.Store rename it to blobStore. things should use the storage.Blobs interface instead. changes in this commit are purely mechanical (made through the "refactor" tool in Gocode followed by search/replace on the word "Store" within the storage/filestore/ directory). * kill filestore.StoreForTest now that filestore.blobStore is unexported, there isn't a need for a specialized wrapper type. this (not coincidentally) also makes it possible for the WriterForFormatVersion() method on storagenode/pieces.StoreForTest to work, without requiring everything to wrap the store.blobs attribute in a filestore.StoreForTest, which was impractical.	2019-11-13 13:15:31 -06:00
Ethan Adams	5b0398a718	storagenode/gracefulexit: Exclude finished exits from chore/worker processing. Fix update status bug (#3399 )	2019-10-28 13:59:45 -04:00
Bill Thorp	89c59d06f9	storagenode/storagenodedb: add SQL receiver logic for graceful exit (#3067 ) * added graceful exit db methods	2019-10-01 10:34:03 -04:00
Isaac Hess	2c5e169888	storagenode/storagenodedb: Vacuum info.db to prepare for splitting storagenodedbs (#3134 )	2019-09-27 07:55:51 -06:00
Isaac Hess	580e511b4c	storagenode/storagenodedb: Migrate to separate dbs (#3081 ) * storagenode/storagenodedb: Migrate to separate dbs * storagenode/storagenodedb: Add migration to drop versions tables * Put drop table statements into a transaction. * Fix CI errors. * Fix CI errors. * Changes requested from PR feedback. * storagenode/storagenodedb: fix tx commit	2019-09-23 12:36:46 -07:00
Jennifer Li Johnson	724bb44723	Remove Kademlia dependencies from Satellite and Storagenode (#2966 ) What: cmd/inspector/main.go: removes kad commands internal/testplanet/planet.go: Waits for contact chore to finish satellite/contact/nodesservice.go: creates an empty nodes service implementation satellite/contact/service.go: implements Local and FetchInfo methods & adds external address config value satellite/discovery/service.go: replaces kad.FetchInfo with contact.FetchInfo in Refresh() & removes Discover() satellite/peer.go: sets up contact service and endpoints storagenode/console/service.go: replaces nodeID with contact.Local() storagenode/contact/chore.go: replaces routing table with contact service storagenode/contact/nodesservice.go: creates empty implementation for ping and request info nodes service & implements RequestInfo method storagenode/contact/service.go: creates a service to return the local node and update its own capacity storagenode/monitor/monitor.go: uses contact service in place of routing table storagenode/operator.go: moves operatorconfig from kad into its own setup storagenode/peer.go: sets up contact service, chore, pingstats and endpoints satellite/overlay/config.go: changes NodeSelectionConfig.OnlineWindow default to 4hr to allow for accurate repair selection Removes kademlia setups in: cmd/storagenode/main.go cmd/storj-sim/network.go internal/testplane/planet.go internal/testplanet/satellite.go internal/testplanet/storagenode.go satellite/peer.go scripts/test-sim-backwards.sh scripts/testdata/satellite-config.yaml.lock storagenode/inspector/inspector.go storagenode/peer.go storagenode/storagenodedb/database.go Why: Replacing Kademlia Please describe the tests: • internal/testplanet/planet_test.go: TestBasic: assert that the storagenode can check in with the satellite without any errors TestContact: test that all nodes get inserted into both satellites' overlay cache during testplanet setup • satellite/contact/contact_test.go: 
TestFetchInfo: Tests that the FetchInfo method returns the correct info • storagenode/contact/contact_test.go: TestNodeInfoUpdated: tests that the contact chore updates the node information TestRequestInfoEndpoint: tests that the Request info endpoint returns the correct info Please describe the performance impact: Node discovery should be at least slightly more performant since each node connects directly to each satellite and no longer needs to wait for bootstrapping. It probably won't be faster in real time on start up since each node waits a random amount of time (less than 1 hr) to initialize its first connection (jitter).	2019-09-19 15:56:34 -04:00
Simon Guindon	a2b1e9fa95	storagenode/storagenodedb: refactor both data access objects and migrations to support multiple DB connections (#3057 ) * Split the info.db database into multiple DBs using Backup API. * Remove location. Prev refactor assumed we would need this but don't. * Added VACUUM to reclaim space after splitting storage node databases. * Added unique names to SQLite3 connection hooks to fix testplanet. * Moving DB closing to the migration step. * Removing the closing of the versions DB. It's already getting closed. * Swapping the database connection references on reconnect. * Moved sqlite closing logic away from the boltdb closing logic. * Moved sqlite closing logic away from the boltdb closing logic. * Remove certificate and vouchers from DB split migration. * Removed vouchers and bumped up the migration version. * Use same constructor in tests for storage node databases. * Use same constructor in tests for storage node databases. * Adding method to access underlying SQL database connections and cleanup * Adding logging for migration diagnostics. * Moved migration closing database logic to minimize disk usage. * Cleaning up error handling. * Fix missing copyright. * Fix linting error. * Add test for migration 21 (#3012) * Refactoring migration code into a nicer to use object. * Refactoring migration code into a nicer to use object. * Fixing broken migration test. * Removed unnecessary code that is no longer needed now that we close DBs. * Removed unnecessary code that is no longer needed now that we close DBs. * Fixed bug where an invalid database path was being opened. * Fixed linting errors. * Renamed VersionsDB to LegacyInfoDB and refactored DB lookup keys. * Renamed VersionsDB to LegacyInfoDB and refactored DB lookup keys. * Fix migration test. NOTE: This change does not address new tables satellites and satellite_exit_progress * Removing v22 migration to move into its own PR. * Removing v22 migration to move into its own PR.
* Refactored schema, rebind and configure functions to be re-useable. * Renamed LegacyInfoDB to DeprecatedInfoDB. * Cleaned up closeDatabase function. * Renamed storageNodeSQLDB to migratableDB. * Switched from using errs.Combine() to errs.Group in closeDatabases func. * Removed constructors from storage node data access objects. * Reformatted usage of const. * Fixed broken test snapshots. * Fixed linting error.	2019-09-18 12:17:28 -04:00
Simon Guindon	91d54af705	Add satellites database business objects. (#3055 ) * Add satellites database business objects. * Fixed linting error.	2019-09-16 13:54:53 -04:00
Yingrong Zhao	9f2f1527c5	storagenode/storagenodedb: add new tables for graceful exit (#3008 ) * add database schema * add migration * change table name and update blueprint	2019-09-11 18:57:53 -04:00
Isaac Hess	7718802f0c	storagenode/storagenodedb: prepare for multiple databases (#3005 ) * Migrate test: prepare for multiple databases * Add copyright * Fix unused variables * Move data to testdata, split MultiDBSnapshot from MultiDBState	2019-09-11 14:31:46 -06:00
Isaac Hess	0b32572ae6	migrate: Allow work on separate dbs (#2996 )	2019-09-10 13:42:23 -06:00
Yaroslav Vorobiov	c35ad5cbfc	storagenode/console: update api (#2969 )	2019-09-06 15:01:03 +03:00
Yaroslav Vorobiov	f7403f97b0	storagenode/storageusage: add summary, rename timestamp to interval_start (#2911 )	2019-09-04 17:13:43 +03:00


128 Commits