In production, ~115 storage nodes (out of ~6,500) are not using the new SettlementWithWindow endpoint, even though they have upgraded to > v1.12.
We analyzed the data reported by monkit for the nodes that were above version 1.11 but were not successfully submitting orders to the new endpoint.
The nodes fell into a few categories:
1. Always fail to list orders from the db; never get as far as sending orders from the filestore
2. Successfully list/send orders from the db; never get as far as calling the satellite endpoint to submit filestore orders
3. Successfully list/send orders from the db; successfully list filestore orders, but the satellite endpoint fails (with an "unauthenticated" drpc error)
The code change here adds the following to address these issues:
- modify the query for ordersDB.listUnsentBySatellite so that we no longer select expired orders from the unsent_orders table
- always process any orders that are in the ordersDB and also any orders stored in the filestore
- add monkit monitoring to filestore.ListUnsentBySatellite so that we can see the failures/successes
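A minimal sketch of the monitoring from the last bullet, assuming monkit v3's usual package-scope pattern; the receiver and result types here are simplified stand-ins for the real filestore types:

```go
package orders

import (
	"context"

	"github.com/spacemonkeygo/monkit/v3"
)

var mon = monkit.Package()

// fileStore and unsentInfo are simplified stand-ins for the real types.
type fileStore struct{ /* ... */ }
type unsentInfo struct{ /* ... */ }

// ListUnsentBySatellite lists unsent orders stored in the filestore.
// The deferred mon.Task() call records a timed success/failure sample
// per invocation, which is what lets us see which of the three failure
// categories a node falls into.
func (store *fileStore) ListUnsentBySatellite(ctx context.Context) (infos map[string][]unsentInfo, err error) {
	defer mon.Task()(&ctx)(&err)

	infos = make(map[string][]unsentInfo)
	// ... walk the unsent orders directory and fill infos ...
	return infos, nil
}
```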
Change-Id: I0b473e5d75252e7ab5fa6b5c204ed260ab5094ec
Use a tagsql.DB pointer as the step database, to propagate changes
back and forth between the actual database and the migration.
Adds a CreateDB operation to the migration step to make it possible to
create new dbs before executing the migration action.
Adjusts the storagenode database migration to use the inner tagsql.DB
pointer of each database as step.DB.
Adjusts the satellite database migration: adds a proxy migrationDB field
to the satellite db that wraps itself as tagsql.DB, a pointer to which
is used as step.DB.
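A sketch of the resulting step shape; the field layout is inferred from this description and the tagsql import path is the in-tree one from this era (both are assumptions):

```go
package migrate

import (
	"context"

	"go.uber.org/zap"

	"storj.io/storj/private/tagsql"
)

// Step sketches a single migration step after this change.
type Step struct {
	// DB is a pointer so that when CreateDB (or an earlier step) swaps
	// the underlying handle, later steps and the migration runner see
	// the new database rather than a stale copy.
	DB *tagsql.DB

	Description string
	Version     int

	// CreateDB, when set, runs before Action and may create a brand new
	// database, updating *DB in place.
	CreateDB func(ctx context.Context, log *zap.Logger) error

	// Action runs against the (possibly just-created) database.
	Action func(ctx context.Context, log *zap.Logger, db tagsql.DB, tx tagsql.Tx) error
}
```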
Change-Id: Ifed4de5b01a356cf7b37db64d2eaeb7b61982c5c
Abstract details of writing and reading data to/from orders files so
that adding V1 and future maintenance are easier.
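The abstraction boils down to a pair of small interfaces; a minimal sketch, with names following the general shape described above and Info's fields elided:

```go
package ordersfile

import "io"

// Info holds a single order limit plus the signed order to settle.
type Info struct{ /* limit and order fields elided */ }

// Writable hides the per-version encoding used when appending orders,
// so a V1 format only needs a new implementation of this interface.
type Writable interface {
	Append(*Info) error
	io.Closer
}

// Readable hides the per-version decoding used when settling orders.
type Readable interface {
	ReadOne() (*Info, error) // returns io.EOF once the file is exhausted
	io.Closer
}
```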
Change-Id: I85f4a91761293de1a782e197bc9e09db228933c9
Additionally, this PR changes the NewNodeFraction devDefault and testplanet config from 0.05 to 1.
This is because many tests relied on selecting nodes that counted as reputable based on audit and uptime
counts of 0, in effect selecting new nodes as reputable ones.
However, since reputation is now indicated by a vetted_at db field that is explicitly set
rather than implied by audit and uptime counts, it would be more complicated to
update all of the nodes' reputations before selecting nodes for tests.
Now we just allow all test nodes to be new if needed.
Change-Id: Ib9531be77408662315b948fd029cee925ed2ca1d
This change accomplishes multiple things:
1. Instead of having a max in flight time, which means
we effectively have a minimum bandwidth for uploads
and downloads, we keep track of which windows have
active requests happening in them (see the sketch after this list).
2. We don't double check when we save the order to see if it
is too old: by then, it's too late. A malicious uplink
could just submit orders outside of the grace window and
receive all the data, but the node would just not commit
it, so the uplink gets free traffic. Because the endpoints
also check for the order being too old, this would be a
very tight race that depends on knowledge of the node system
clock, but it's best for the race not to exist at all. Instead, we
piggyback on the in-flight tracking and do the check when
we start to handle the order, and commit at the end.
3. Change the functions that send orders and list unsent
orders to accept a time at which that operation is
happening. This way, in tests, we can pretend we're
listing or sending far into the future after the windows
are available to send, rather than exposing test functions
to modify internal state about the grace period to get
the desired effect. This brings tests closer to actual
usage in production.
4. Change the calculation for if an order is allowed to be
enqueued due to the grace period to just look at the
order creation time, rather than some computation involving
the window it will be in. In this way, you can easily
answer the question of "will this order be accepted?" by
asking "is it older than X?" where X is the grace period.
5. Increases the frequency with which we check for orders to send,
from once every hour to once every 5 minutes, because we already
have hour-long buffering due to the windows. This decreases
the maximum latency with which an order is reported back to
the satellite by 55 minutes.
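A rough sketch of the in-flight window tracking from point 1 and the grace-period check from point 4, using hypothetical names (the real code lives in the storagenode orders store):

```go
package orders

import (
	"sync"
	"time"
)

// activeWindows tracks, per window start time, how many requests are
// still in flight. A window is only eligible for sending once its
// count drops to zero, replacing the old "max in flight time" deadline.
type activeWindows struct {
	mu      sync.Mutex
	windows map[time.Time]int
}

func newActiveWindows() *activeWindows {
	return &activeWindows{windows: map[time.Time]int{}}
}

// windowFor truncates an order's creation time to its hour-long window,
// so "will this order be accepted?" reduces to "is it older than the
// grace period?".
func windowFor(createdAt time.Time) time.Time {
	return createdAt.Truncate(time.Hour)
}

func (a *activeWindows) begin(createdAt time.Time) {
	a.mu.Lock()
	defer a.mu.Unlock()
	a.windows[windowFor(createdAt)]++
}

func (a *activeWindows) done(createdAt time.Time) {
	a.mu.Lock()
	defer a.mu.Unlock()
	w := windowFor(createdAt)
	if a.windows[w]--; a.windows[w] <= 0 {
		delete(a.windows, w)
	}
}

// sendable reports whether a window has no in-flight requests and the
// window plus the grace period has fully elapsed at the given time.
// Passing "now" explicitly is what lets tests pretend to run far in the
// future, as point 3 describes.
func (a *activeWindows) sendable(window, now time.Time, grace time.Duration) bool {
	a.mu.Lock()
	defer a.mu.Unlock()
	return a.windows[window] == 0 && now.Sub(window) > time.Hour+grace
}
```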
Change-Id: Ie08b90d139d45ee89b82347e191a2f8db1b88036
To prevent the storagenode from implicitly recreating missing dbs and storage
(such behaviour leads to audit failures), do not allow the storagenode to
start if any of its dbs or its storage is missing or corrupted, or if the
dedicated storage disk is unmounted, so that the node gets downtime instead.
Change-Id: Ic64e1f0ff4d8ef5b2fddbe7a7e53df4f4bd8652e
Part 2 of moving usedserials into memory:
* Drop usedserials table in storagenodedb
* Use in-memory usedserials store in place of db for order limit
verification
* Update order limit grace period to be only one hour - this means
uplinks must send their order limits to storagenodes within an hour of
receiving them
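A condensed sketch of the in-memory store, using strings for the satellite ID and serial to stay self-contained; the real keys are typed IDs and the real store also bounds its memory use:

```go
package usedserials

import (
	"errors"
	"sync"
	"time"
)

// ErrSerialAlreadyExists signals a replayed order limit.
var ErrSerialAlreadyExists = errors.New("used serials: serial already seen")

type serialKey struct {
	satellite string
	serial    string
}

// Table replaces the dropped usedserials database table. Entries only
// need to survive the one-hour grace period plus the limit duration.
type Table struct {
	mu      sync.Mutex
	serials map[serialKey]time.Time // value: expiration
}

func NewTable() *Table {
	return &Table{serials: map[serialKey]time.Time{}}
}

// Add records a serial, rejecting duplicates that have not yet expired.
func (t *Table) Add(satellite, serial string, now, expiration time.Time) error {
	t.mu.Lock()
	defer t.mu.Unlock()
	key := serialKey{satellite, serial}
	if exp, ok := t.serials[key]; ok && now.Before(exp) {
		return ErrSerialAlreadyExists
	}
	t.serials[key] = expiration
	return nil
}

// DeleteExpired drops entries whose expiration has passed.
func (t *Table) DeleteExpired(now time.Time) {
	t.mu.Lock()
	defer t.mu.Unlock()
	for key, exp := range t.serials {
		if now.After(exp) {
			delete(t.serials, key)
		}
	}
}
```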
Change-Id: I37a0e1d2ca6cb80854a3ef495af2d1d1f92e9f03
In walkNamespaceWithPrefix, log in the case of an "lstat" error, because this may indicate underlying disk corruption.
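A sketch of the idea using the standard library's directory walk; the real walkNamespaceWithPrefix iterates the piece namespace directly, so the function here is illustrative:

```go
package filestore

import (
	"os"
	"path/filepath"

	"go.uber.org/zap"
)

// walkWithLogging surfaces lstat failures instead of silently skipping
// entries, since repeated lstat errors during a namespace walk often
// point at disk-level corruption.
func walkWithLogging(log *zap.Logger, dir string, fn func(path string, info os.FileInfo) error) error {
	return filepath.Walk(dir, func(path string, info os.FileInfo, err error) error {
		if err != nil {
			// filepath.Walk reports lstat failures through err with nil info.
			log.Error("lstat failed during namespace walk; possible disk corruption",
				zap.String("path", path), zap.Error(err))
			return nil // keep walking the rest of the namespace
		}
		return fn(path, info)
	})
}
```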
SG-50
Change-Id: I867c3ffc47cfac325ae90658ec4780d213ff3e63
Currently uploads can cause a lot of IOPS; reduce this by introducing an
in-memory buffer on top of the file.
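A minimal sketch of the approach, assuming a simple write-through blob writer; the buffer size is illustrative, not the value chosen in the change:

```go
package filestore

import (
	"bufio"
	"os"
)

// bufferedBlobWriter stacks a bufio.Writer on top of the piece file so
// many small protocol-sized writes coalesce into a few large ones,
// cutting write IOPS during uploads.
type bufferedBlobWriter struct {
	file *os.File
	buf  *bufio.Writer
}

func newBufferedBlobWriter(file *os.File) *bufferedBlobWriter {
	return &bufferedBlobWriter{file: file, buf: bufio.NewWriterSize(file, 64*1024)}
}

func (w *bufferedBlobWriter) Write(p []byte) (int, error) { return w.buf.Write(p) }

// Commit flushes buffered bytes and syncs before the blob is finalized.
func (w *bufferedBlobWriter) Commit() error {
	if err := w.buf.Flush(); err != nil {
		return err
	}
	return w.file.Sync()
}
```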
Change-Id: I5f4e3e01c0a36258271d180b922107de447bcb59
the joined_at column was being used in ways that implied it should be NOT NULL
even though it was possibly null. we used to get this data
from the satellite db's added_at column as seen in 30369b02,
so backfill it using that data where joined_at is NULL, and
then alter the table to constrain the column to be NOT NULL.
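in the sql-in-go style the migrations use, the step amounts to something like the following; the table name is a placeholder, since only joined_at and added_at are named above:

```go
package satellitedb

// hypothetical migration step: only joined_at and added_at come from
// the commit message; the table name is a placeholder.
var backfillJoinedAt = []string{
	// backfill rows that predate joined_at using the timestamp we
	// already stored from the satellite.
	`UPDATE some_table SET joined_at = added_at WHERE joined_at IS NULL`,
	// now every row has a value, so the constraint can be enforced.
	`ALTER TABLE some_table ALTER COLUMN joined_at SET NOT NULL`,
}
```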
Fixes #3866.
Change-Id: If2d856189209740d985f71dada7b93525e625ef3
According to the docs at https://www.sqlite.org/lang_altertable.html
doing the steps
1. Rename old table
2. Create new table
3. Copy data
4. Drop old table
is incorrect; the correct order is
1. Create new table
2. Copy data
3. Drop old table
4. Rename new into old
Additionally, each step was being run in a different transaction,
which could cause permanent failures if a problem happened during
the migration.
Avoid both of those problems by changing up some previous migrations
that ran in this way. Since they are semantically identical, it's
fine to change up these old migrations. It will help make newer
nodes coming up for the first time more robust.
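The corrected pattern, expressed as the single transaction the adjusted migrations now run (table and columns are placeholders):

```go
package storagenodedb

// rewriteTable shows the documented order (create, copy, drop, rename)
// inside one transaction, so a crash mid-migration can't leave the
// database half-rewritten.
const rewriteTable = `
BEGIN;
CREATE TABLE example_new (
	id    INTEGER PRIMARY KEY,
	value TEXT NOT NULL
);
INSERT INTO example_new (id, value) SELECT id, value FROM example;
DROP TABLE example;
ALTER TABLE example_new RENAME TO example;
COMMIT;
`
```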
Change-Id: I43fb004fa1b6cb2fe2554f9920925420da28fb4a
CreateTables hasn't been quite accurate for a while now; rename it to
MigrateToLatest to be clearer about its behavior.
Change-Id: Ida48e95122a5d9b7a814e922d3698e00024a2ba7
Currently Cockroach isn't performant for concurrent database setup and
tear-down. Instead of a single instance, allow setting multiple potential
connection strings and let the tests pick one connection string
randomly.
This shortens the test run by ~10 minutes.
While we are significantly changing how pgtest works anyway, introduce the
helpers PickPostgres and PickCockroach for selecting the database, to
reduce code duplication in multiple places.
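A sketch of the helpers; the environment variable names and the semicolon separator are assumptions for illustration:

```go
package pgtest

import (
	"math/rand"
	"os"
	"strings"
	"testing"
)

// pickRandom splits a semicolon-separated list of connection strings
// and returns one at random, spreading concurrent test packages across
// database instances.
func pickRandom(tb testing.TB, env string) string {
	var candidates []string
	for _, s := range strings.Split(os.Getenv(env), ";") {
		if s = strings.TrimSpace(s); s != "" {
			candidates = append(candidates, s)
		}
	}
	if len(candidates) == 0 {
		tb.Skipf("%s is not configured, skipping", env)
	}
	return candidates[rand.Intn(len(candidates))]
}

func PickPostgres(tb testing.TB) string  { return pickRandom(tb, "STORJ_TEST_POSTGRES") }
func PickCockroach(tb testing.TB) string { return pickRandom(tb, "STORJ_TEST_COCKROACH") }
```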
Change-Id: I8ad171d5c4c8a4fc081ec2ae9bdd0cc948a80619
* Add migration to storagenode reputation table to add suspended
timestamp
* Send suspended info to storagenode from satellite nodestats endpoint
* Add suspended status to storagenode api
* Add an indicator on the storagenode dashboard informing the operator of
the satellites on which the node is suspended
Change-Id: Ie3669f6069cc0258ba76ec99d17006e1b5fd9c8a
uuid.UUID implements driver.Valuer and sql.Scanner, so it can be used
directly as a query parameter and as a scannable result.
Replace uses of dbutil.BytesToUUID with uuid.FromBytes.
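For illustration, a direct scan after the change, assuming the uuid package is the one from storj.io/common (the query and table are hypothetical):

```go
package example

import (
	"database/sql"

	"storj.io/common/uuid"
)

// lookupID scans a UUID column directly: uuid.UUID satisfies
// sql.Scanner (and driver.Valuer), so the old dance of scanning into a
// []byte and calling dbutil.BytesToUUID is no longer needed.
func lookupID(db *sql.DB, name string) (uuid.UUID, error) {
	var id uuid.UUID
	err := db.QueryRow(`SELECT id FROM nodes WHERE name = ?`, name).Scan(&id)
	return id, err
}
```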
Change-Id: I51a670185ceb3cc2199d5aa2b76bc3fc191ca8fe
storagenodes have like 10 or more databases. without this
tag they all get sent as the same value, stomping on each
other.
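a sketch of the kind of tagging this adds, in monkit v3 terms; the method and type here are stand-ins:

```go
package storagenodedb

import (
	"context"

	"github.com/spacemonkeygo/monkit/v3"
)

var mon = monkit.Package()

// DB is a stand-in for one of the storagenode's many databases.
type DB struct{ name string }

// Query records a sample tagged with the database name, so bandwidth.db,
// orders.db, and friends each get their own series instead of stomping
// on a single shared one.
func (db *DB) Query(ctx context.Context) (err error) {
	defer mon.Task(monkit.NewSeriesTag("db_name", db.name))(&ctx)(&err)
	// ... run the actual query ...
	return nil
}
```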
Change-Id: Ib12019684d6ea8f2a5b83df584056dfa79e3c4b3
* debug
* traces
* cfgstruct
* process
Package `storj/private/version` will be removed as a separate change.
Change-Id: Iadc40faa782e6225513b28218952f02d9c240a9f
In many cases when a storagenode fails the preflight check, it is due to
test_table existing, which is used to determine read/write capabilities
after the initial schema verification. If preflight ends early due to a
failure or stopped storagenode, it may not get the chance to drop this
table.
This change excludes test_table from the schema comparison to ensure
that it never prevents a storagenode from starting up.
It also adds a preflight DB test for the storagenode.
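A sketch of the comparison tweak, with simplified stand-in types for the schema structures:

```go
package preflight

// table and schema are simplified stand-ins for the real schema types.
type table struct{ Name string }
type schema struct{ Tables []table }

// stripTestTable removes the scratch table used by the read/write probe
// before schemas are compared, so a leftover test_table from a crashed
// or interrupted preflight run can never fail the comparison.
func stripTestTable(s *schema) {
	filtered := s.Tables[:0]
	for _, t := range s.Tables {
		if t.Name != "test_table" {
			filtered = append(filtered, t)
		}
	}
	s.Tables = filtered
}
```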
Change-Id: Ib8e71df2e42fda3b2a364fbf7a801891c5831d39
this commit updates our monkit dependency to the v3 version where
it outputs in an influx style. this makes discovery much easier
as many tools are built to look at it this way.
graphite and rothko will suffer some due to no longer being a tree
based on dots. hopefully time will exist to update rothko to
index based on the new metric format.
it adds an influx output for the statreceiver so that we can
write to influxdb v1 or v2 directly.
Change-Id: Iae9f9494a6d29cfbd1f932a5e71a891b490415ff
it was noticed that if you had a long lived transaction A that
was blocking some other transaction B and A was being aborted
due to retriable errors, then transaction B was never given
priority. this was due to using savepoints to do lightweight
retries.
this behavior was problematic because we had some queries blocked
for over 16 hours, so this commit addresses the issue with two
prongs:
1. bound the amount of time we will retry a transaction
2. create new transactions when a retry is needed
the first ensures that we never wait for 16 hours, and the value
chosen is 10 minutes. that should be long enough for an ample
amount of retries for small queries, and huge queries probably
shouldn't be retried, even if possible: it's preferable to
find a way to make them smaller.
the second ensures that even in the case of retries, queries that
are blocked on the aborted transaction gain priority to run.
between those two changes, the maximum stall time due to retries
should be bounded to around 10 minutes.
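a sketch of the resulting shape, using database/sql directly and a stubbed retriable check (the real code inspects cockroach's serialization failure codes):

```go
package txutil

import (
	"context"
	"database/sql"
	"time"
)

// withRetry opens a brand new transaction on every attempt, so anything
// blocked on an aborted transaction gains priority, and bounds the
// whole retry loop to ten minutes so nothing stalls for 16 hours.
func withRetry(ctx context.Context, db *sql.DB, fn func(tx *sql.Tx) error) error {
	deadline := time.Now().Add(10 * time.Minute)
	for {
		tx, err := db.BeginTx(ctx, nil)
		if err != nil {
			return err
		}
		if err = fn(tx); err == nil {
			if err = tx.Commit(); err == nil {
				return nil
			}
		} else {
			_ = tx.Rollback() // abandon entirely; no savepoint-based reuse
		}
		if !retriable(err) || time.Now().After(deadline) {
			return err
		}
	}
}

// retriable is a stub; the real check looks for cockroach's retryable
// serialization failure error codes.
func retriable(err error) bool { return false }
```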
Change-Id: Icf898501ef505a89738820a3fae2580988f9f5f4
This change updates the storagenode piecestore apis to expose access to
the full piece size stored on disk. Previously we only had access to
(and only kept a cache of) the content size used for all pieces. This
was inaccurate when reporting the amount of disk space used by nodes.
We now have access to the total content size, as well as the total disk
usage, of all pieces. The pieces cache also keeps a cache of the total
piece size along with the content size.
Change-Id: I4fffe7e1257e04c46021a2e37c5adc6fe69bee55
Replace all the remaining uses of sql.DB with tagsql.DB to
fix issues with context cancellation.
Introduce tagsql.Open which helps to get rid of all tagsql.Wrap-s.
Use tagsql in cockroachkv and postgreskv.
Change-Id: I8946d203341cb85a25976896fc7881e1f704e779
Migration step was closing a database that was used by
the migration itself, while there was still an active transaction
over the database.
Instead of closing it within the same transaction, we can wait
until restart for the database cleanup.
Change-Id: Ic971d8cea81a3ab783f4a1bdc6357009c8b31386
Also added temporary types withRebind and withTagTx,
which will be removed later. Currently they help to avoid
changing the whole codebase at the same time.
Change-Id: I7f07ba8f4709a23a463bfa67464628665a05808f
For storagenode:
Ensure that the database schema matches the latest test migration schema
before allowing the node to start up.
Ensure minimal read/write functionality for each storagenode database
before allowing the node to start up.
This will eliminate many unhandled audit errors we are seeing.
Change-Id: Ic0e628b04a9c35b7a8243f6a81d4683918170ba9
This reverts commit 8e242cd012.
Revert because lib/pq has known issues with context cancellation.
These issues need to be resolved before these changes can be merged.
Change-Id: I160af51dbc2d67c5449aafa406a403e5367bb555
this will allow for some nice runtime analysis down the road.
also, this allows for wrapping database handles in a way that
can interact with these contexts.
requires https://review.dev.storj.io/c/storj/dbx/+/514
Change-Id: Ib087b7cd73296dd2c1e0331314da34d861f61d2b
This code needs to work against CockroachDB, so transactions must be retried
when a retriable error is returned. This change puts migrate
transactions into the dbutil.WithTx transactional helpers to achieve
this in the easiest way.
Change-Id: Ib930e82d55cb0257357a222ce9131e6e53372c03
This commit adds functionality to include the space used in the trash
directory when calculating available space on the node.
It also includes this trash value in the space used cache, with methods
to keep the cache up-to-date as files are trashed, restored, and
emptied.
As part of the commit, the RestoreTrash and EmptyTrash methods have
slightly changed signatures. RestoreTrash now also returns the keys that
were restored, while EmptyTrash also returns the total disk space
recovered. Each of these changes makes it possible to keep the cache
up-to-date and know how much space is being used/recovered.
Also changed is the signature of the PieceStoreAccess.ContentSize method.
Previously this method returned only the content size of the blob,
excluding the size of any header data. This method has been renamed
`Size` and returns both the full disk size and content size of the blob.
This allows us to only stat the file once, and in some instances (i.e.
cache) knowing the full file size is useful.
Note: This commit simply adds the trash size data to the piece size data
we were already collecting. The piece size data is not accurate for all
use-cases (e.g. because it does not contain piece header data); however,
this commit does not fix that problem. Now that the ContentSize (Size)
method returns the full size of the file, it should be easier to fix
this problem in a future commit.
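A sketch of the renamed accessor; the method name follows the description above, while the type and header-size handling are simplified:

```go
package pieces

import "os"

// blobInfo is a simplified stand-in for a stored blob reference, where
// contentSize is the stored bytes minus the piece header.
type blobInfo struct {
	path       string
	headerSize int64
}

// Size returns the full disk size and the content size of the blob from
// a single os.Stat, so callers (like the cache) that want the full file
// size no longer need a second stat.
func (b *blobInfo) Size() (size, contentSize int64, err error) {
	info, err := os.Stat(b.path)
	if err != nil {
		return 0, 0, err
	}
	size = info.Size()
	return size, size - b.headerSize, nil
}
```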
Change-Id: I4a6cae09e262c8452a618116d1dc66b687f59f85
* put TestCreateV0 back in StoreForTest
* avoid direct handles to V0 pieceinfo db
* type mismatch fix
* use storage.Blobs interface in store_test.go
..instead of filestore.Store. this will allow filestore.Store to become
unexported.
* unexport filestore.Store
rename it to blobStore. things should use the storage.Blobs interface
instead. changes in this commit are purely mechanical (made through the
"refactor" tool in Gocode followed by search/replace on the word "Store"
within the storage/filestore/ directory).
* kill filestore.StoreForTest
now that filestore.blobStore is unexported, there isn't a need for a
specialized wrapper type. this (not coincidentally) also makes it
possible for the WriterForFormatVersion() method on
storagenode/pieces.StoreForTest to work, without requiring everything to
wrap the store.blobs attribute in a filestore.StoreForTest, which was
impractical.
* storagenode/storagenodedb: Migrate to separate dbs
* storagenode/storagenodedb: Add migration to drop versions tables
* Put drop table statements into a transaction.
* Fix CI errors.
* Changes requested from PR feedback.
* storagenode/storagenodedb: fix tx commit
What:
cmd/inspector/main.go: removes kad commands
internal/testplanet/planet.go: Waits for contact chore to finish
satellite/contact/nodesservice.go: creates an empty nodes service implementation
satellite/contact/service.go: implements Local and FetchInfo methods & adds external address config value
satellite/discovery/service.go: replaces kad.FetchInfo with contact.FetchInfo in Refresh() & removes Discover()
satellite/peer.go: sets up contact service and endpoints
storagenode/console/service.go: replaces nodeID with contact.Local()
storagenode/contact/chore.go: replaces routing table with contact service
storagenode/contact/nodesservice.go: creates empty implementation for ping and request info nodes service & implements RequestInfo method
storagenode/contact/service.go: creates a service to return the local node and update its own capacity
storagenode/monitor/monitor.go: uses contact service in place of routing table
storagenode/operator.go: moves operatorconfig from kad into its own setup
storagenode/peer.go: sets up contact service, chore, pingstats and endpoints
satellite/overlay/config.go: changes NodeSelectionConfig.OnlineWindow default to 4hr to allow for accurate repair selection
Removes kademlia setups in:
cmd/storagenode/main.go
cmd/storj-sim/network.go
internal/testplanet/planet.go
internal/testplanet/satellite.go
internal/testplanet/storagenode.go
satellite/peer.go
scripts/test-sim-backwards.sh
scripts/testdata/satellite-config.yaml.lock
storagenode/inspector/inspector.go
storagenode/peer.go
storagenode/storagenodedb/database.go
Why: Replacing Kademlia
Please describe the tests:
• internal/testplanet/planet_test.go:
TestBasic: assert that the storagenode can check in with the satellite without any errors
TestContact: test that all nodes get inserted into both satellites' overlay cache during testplanet setup
• satellite/contact/contact_test.go:
TestFetchInfo: Tests that the FetchInfo method returns the correct info
• storagenode/contact/contact_test.go:
TestNodeInfoUpdated: tests that the contact chore updates the node information
TestRequestInfoEndpoint: tests that the Request info endpoint returns the correct info
Please describe the performance impact: Node discovery should be at least slightly more performant since each node connects directly to each satellite and no longer needs to wait for bootstrapping. It probably won't be faster in real time on start up since each node waits a random amount of time (less than 1 hr) to initialize its first connection (jitter).
* Split the info.db database into multiple DBs using Backup API.
* Remove location. A previous refactor assumed we would need this, but we don't.
* Added VACUUM to reclaim space after splitting storage node databases.
* Added unique names to SQLite3 connection hooks to fix testplanet.
* Moving DB closing to the migration step.
* Removing the closing of the versions DB. It's already getting closed.
* Swapping the database connection references on reconnect.
* Moved sqlite closing logic away from the boltdb closing logic.
* Remove certificate and vouchers from DB split migration.
* Removed vouchers and bumped up the migration version.
* Use same constructor in tests for storage node databases.
* Adding method to access underlying SQL database connections and cleanup
* Adding logging for migration diagnostics.
* Moved migration closing database logic to minimize disk usage.
* Cleaning up error handling.
* Fix missing copyright.
* Fix linting error.
* Add test for migration 21 (#3012)
* Refactoring migration code into a nicer to use object.
* Fixing broken migration test.
* Removed code that is no longer needed now that we close DBs.
* Fixed bug where an invalid database path was being opened.
* Fixed linting errors.
* Renamed VersionsDB to LegacyInfoDB and refactored DB lookup keys.
* Fix migration test. NOTE: This change does not address new tables satellites and satellite_exit_progress
* Removing v22 migration to move it into its own PR.
* Refactored schema, rebind and configure functions to be re-useable.
* Renamed LegacyInfoDB to DeprecatedInfoDB.
* Cleaned up closeDatabase function.
* Renamed storageNodeSQLDB to migratableDB.
* Switched from using errs.Combine() to errs.Group in closeDatabases func.
* Removed constructors from storage node data access objects.
* Reformatted usage of const.
* Fixed broken test snapshots.
* Fixed linting error.