storj

Author	SHA1	Message	Date
Egon Elbre	4e94da3fda	satellite/overlay: add feature flag for node selection cache Also distinguish the purpose for selecting nodes to avoid potential confusion, what should allow caching and what shouldn't. Change-Id: Iee2451c1f10d0f1c81feb1641507400d89918d61	2020-05-06 16:13:47 +03:00
Jennifer Johnson	18078bf7ee	satellite/audit: increases audit worker concurrency to 2 Change-Id: Ibe3e3801b79accffbcfe9e2e02c96fc963894a7f	2020-05-05 11:31:55 +00:00
Egon Elbre	d98b8f6e23	satellite/metainfo,storage: use different limit for metainfo loop Change-Id: I5ef7233930679b977b33f7b3e1dda45c907dcfad	2020-05-05 10:37:20 +00:00
Moby von Briesen	8f60cfc4fb	satellite/overlay: Add flag for enabling/disabling disqualification from suspension mode Add a flag that allows us to easily switch disqualification from suspension mode on or off. A node will only be disqualified from suspension mode if it has been suspended for longer than the grace period AND the SuspensionDQEnabled flag is true. Change-Id: I9e67caa727183cd52ab2042b0a370a1bcaebe792	2020-05-04 17:25:09 +00:00
Yingrong Zhao	9b4a3f8fcc	cmd/uplink: use tracing.enabled flag Previously we are using tracing.sampled to be the switch for turning on/off tracing. However we would like to separate sampling rate from being the switch, so we can set sampling rate to be 0 but still intialize tracing for satellite and storagenodes Change-Id: I27e6ba25ea6f6b612b4e1a57cf1301889ded41ec	2020-04-27 17:54:57 +00:00
Bill Thorp	341aecfe0f	satellite/console: add rate limiter to login, register, password recovery Added a per IP rate limiter to the console web. Cleaned up password check to leak less bcyrpt info. Change-Id: I3c882978bd8de3ee9428cb6434a41ab2fc405fb2	2020-04-24 17:15:49 +00:00
Jess G	825226c98e	satellite/overlay: use node selection cache for uploads (#3859 ) * satellite/overlay: use node selection cache for uploads Change-Id: Ibd16cccee979d0544f2f4a01749af9f36f02a6ad * fix config lock Change-Id: Idd307e4dee8ab92749f1ec3f996419ea0af829fd * start fixing tests Change-Id: I207d373a3b2a2d9312c9e72fe9bd0b01e06ad6cf * fix test, add some more Change-Id: I82b99c2004fca2510965f9b389f87dd4474bc722 * change config name Change-Id: I0c0f7fc726b2565dc3828cb723f5459a940f2a0b * add benchmarks Change-Id: I05fa25bff8d5b65f94d918556855b95163d002e9 * revert bench to put in different PR Change-Id: I0f6942296895594768f19614bd7b2e3b9b106ade * add staleness to benchmark Change-Id: Ia80a310623d5a342afa6d835402170b531b0f870 * add cache config to testplanet Change-Id: I39abdab8cc442694da543115a9e470b2a8a25dff * have repair select old way Change-Id: I25a938457d7d1bcf89fd15130cb6b0ac19585252 * lower testplante config time Change-Id: Ib56a2ed086c06bc6061388d15a10a2526a663af7 * fix test Change-Id: I3868e9cacde2dfbf9c407afab04dc5fc2f286f69	2020-04-24 09:11:04 -07:00
Moby von Briesen	72b93f3120	satellite/satellitedb: disqualify suspended nodes when the grace period passes If a node is suspended and receives an unknown or failing audit, disqualify them if the grace period (default 1w in production) has passed. Migrate the nodes table so any node that is currently suspended gets unsuspended when the satellite starts up. Change-Id: I7b81c68026f823417faa0bf5e5cb5e67c7156b82	2020-04-22 15:45:00 -04:00
Yingrong Zhao	0bdcf123cf	bump monkit, monkit-jaeger, and private to latest Also bump storj.io/common and sync repo Change-Id: If8e60db6bdf0af8077b7befcb1da304c3c4dcae4	2020-04-22 12:30:37 -04:00
Moby von Briesen	178aa8b5e0	satellite/{metainfo,repair}: Delete expired segments from metainfo * Delete expired segments in expired segments service using metainfo loop * Add test to verify expired segments service deletes expired segments * Ignore expired segments in checker observer * Modify checker tests to verify that expired segments are ignored * Ignore expired segments in segment repairer and drop from repair queue * Add repair test to verify that a segment that expires after being added to the repair queue is ignored and dropped from the repair queue Change-Id: Ib2b0934db525fef58325583d2a7ca859b88ea60d	2020-04-22 13:02:31 +00:00
Yingrong Zhao	8375a09c89	cmd: remove InitTracing from satellite and storagenode main.go file Change-Id: I4addbe7d0645f66abfb3e98d74d17035e9624e69	2020-04-20 14:06:26 -04:00
VitaliiShpital	2dce4c232c	web/satellite: redirect to verification page on sign up if inside iframe Change-Id: I606b63fd27bef46597697b491970523e8a3a0cae	2020-04-16 13:35:49 +00:00
VitaliiShpital	158013a866	satellite/console: redirect on account activation Change-Id: I2506ce0fd3832bf46fbcdcc5a42bb83dc926e99a	2020-04-15 11:49:50 +00:00
Moby von Briesen	d7794a4851	satellite/overlay: hardcode default values for audit alpha/beta Alpha=1 and beta=0 are the expected first values for any alpha/beta reputation system we are using in the codebase. So we are removing the configurability of these values. Change-Id: Ic61861b8ea5047fa1438ea6609b1d0048bf0abc3	2020-04-14 19:12:40 +00:00
Cameron Ayer	3ee6c14f54	satellite/downtime: add concurrency to downtime estimation We want to increase our throughput for downtime estimation. This commit adds the ability to reach out to multiple nodes concurrently for downtime estimation. The number of concurrent routines is determined by a new config flag, EstimationConcurrencyLimit. It also increases the default EstimationBatchSize to 1000. Change-Id: I800ce7ec1035885afa194c3c3f64eedd4f6f61eb	2020-04-14 14:39:13 +00:00
Qweder93	3a9422cc9a	satellite/nodestats: add pricing model to endpoint Change-Id: Iddace8e437216a343458f440b543cee61164f233	2020-04-08 14:29:51 +00:00
Yingrong Zhao	96e58d21b4	cmd;pkg/server: init tracing collector in all processes Add tracing handler in drpc server. Initializing tracing collector in admin, satellite api, garbage collection, satellite core, repaier, storagenode. Change-Id: Ie98420e35dfc6913836ebd82b517d9d12877aefc Change-Id: I91057b6265a4ac8bde033dfde692b8a28acca99f	2020-04-07 17:20:59 -04:00
Cameron Ayer	42be4bdc0f	satellite/contact: add timeout to PingBack method Change-Id: I2ec2f82e2e10d8be16f82e9de13ce42358e47c98	2020-04-04 18:26:30 +00:00
Michal Niewrzal	c178a08cb8	satellite/metainfo: add max segment size and max inline size to BeginObject response We want to control inline segment size and segment size on satellite side. We need to return such information to uplink like with redundancy scheme. Change-Id: If04b0a45a2757a01c0cc046432c115f475e9323c	2020-04-02 12:41:28 +00:00
Jeff Wendling	e2ff2ce672	satellite: compensation package and commands Change-Id: I7fd6399837e45ff48e5f3d47a95192a01d58e125	2020-03-30 14:08:14 -06:00
JT Olio	f28100b73f	bump storj.io/private Change-Id: I4ddd5c34521602967b89bd18e2a71a6f1e29f436	2020-03-27 21:57:35 +00:00
Moby von Briesen	a933bcc99a	satellite/repair/repairer/ec.go: add option for downloading pieces onto disk instead of in memory during repair Add flag to satellite repairer, "InMemoryRepair" that allows the satellite to decide whether to download the entire segment being repaired into memory (this is what the satellite already does), or to download it into temporary files on disk that will be read from in the upload phase of repair. This should help with handling high repair traffic on satellites that cannot afford to spend 64mb of memory per repair worker. Updates tests to test repair for both in memory and to disk. Change-Id: Iddf591e165621497c98533d45bfea3c28b08a194	2020-03-27 16:41:00 +00:00
Natalie Villasana	8e0ca0e6f5	satellite/gc: update release default for gc to run separately (#3830 )	2020-03-26 14:44:18 -04:00
Jennifer Johnson	699b635e5d	satellite/overlay: rename newNodePercentage to newNodeFraction Change-Id: Ie66de91f88183b44de0773589e83e4ade9aa997a	2020-03-19 20:09:32 +00:00
Jessica Grebenschikov	5142874144	satellite/gc: move garbage collection to its own process Change-Id: I7235aa83f7c641e31c62ba9d42192b2232dca4a5	2020-03-18 16:44:01 +00:00
Egon Elbre	09e0f3de63	satellite/metainfo/piecedeletion: add Service Change-Id: Id7e32ed569701fa0be66f9527c43a67052994570	2020-03-18 14:50:08 +00:00
Stefan Benten	49a30ce4a7	satellite/payments: Set proper defaults for the release (#3806 ) * Slight adjustments to the migration Change-Id: I68ae81c010c3414fde2845df16ab124f8d17834b * Change Coupon Value Change-Id: I0f241d09e5f716f1d1b3f0688643ba7f614d83c4 * Change AlphaUsage to 5GB Change-Id: I5d25c6b5750684510cda8b14a27f38d5b2b07408 * change config lock Change-Id: Ib7c7a54555ba2387c9aa8dd60a0501b0ee6491dd * Use Scan properly Change-Id: Ie39cf4644e3ddd703a254e2f5e616763dd805235 * Fix Config Lock Change-Id: I558ecc1c1becfaaefc7aea5ad2fe83fd6bf6b561	2020-03-16 22:53:12 +01:00
Stefan Benten	52590197c2	satellite/payments: More Cleanup and Satellite command to ensure we have stripe customers (#3805 )	2020-03-16 20:34:15 +01:00
Stefan Benten	bd603c0751	satellite/payments: Improve Invoice Generation (#3800 )	2020-03-13 17:07:39 +01:00
JT Olio	051569c69f	satellite: enable open registration (and add flag that disables it) SM-441 Change-Id: I47bfedb312089f6d2bfbab013bd74ad4b8aa5f5e	2020-03-11 03:53:34 +01:00
Jessica Grebenschikov	e19e3c1101	pkg/process: Now that we are trying to identify the root cause of the satellite load limitations (i.e. currently the satellite has a max ability of 400 rps for uploads and we need this to be higher), we are using the golang diagnostic tools to collect insight into what the bottlenecks are. We currently have a debug endpoint to gather some cpu and mem data, but it could be useful to have continuous profiling. GCP stackdriver has support for continuous profiling so lets set that up and see if it is helpful to gather more data. This PR adds support for [GCP continuous profiler](https://cloud.google.com/profiler) which allows enabling continuous cpu/mem profiling and the stats are sent to stackdriver in google cloud console. To enable the continuous profiling for a storj component, do the following: - prereq: the workload must be running in GKE and have Stackdriver Profiling IAM role permissions - provide the config flag `debug.profilename` in the config.yaml file for the workload (i.e. satellite api process, etc). The profilename should be the workload name, for example "satellite-api". - once the above config flag is provided, the profiler will be initialized and profiling stats will automatically be sent to GCP project where the workload is running and viewable in the Stackdriver Profile page in the console The current implementation assumes the workload is running in GKE, however if we find if useful we can add support to enable this from anywhere. But for simplicity, its configured this way assuming the main goal is to enable in production systems. Change-Id: Ibf8ebe2df7bf06fdd4951ee6a1e48854dd36ad47	2020-02-25 09:04:23 -08:00
paul cannon	92d86fa044	satellite/repair: fix repair concurrency This new repair timeout (configured as TotalTimeout) will include both the time to download pieces and the time to upload pieces, as well as the time to pop the segment from the repair queue. This is a move from Github PR #3645. Change-Id: I47d618f57285845d8473fcd285f7d9be9b4318c8	2020-02-24 19:57:09 +00:00
Jeff Wendling	f671eb2beb	satellite/satellitedb: use queue for orders to get back fast billing This change adds two new tables to process orders as fast as we used to but in an asynchronous manner and with hopefully less storage usage. This should help scale on cockroach, but limits us to one worker. It lays the groundwork for the order processing pipeline to be queue rather than database driven. For more details, see the added fast billing changes blueprint. It also fixes the orders db so that all the timestamps that are passed to columns that do not contain a time zone are converted to UTC at the last possible opportunity, making it less likely to use the APIs incorrectly. We really should migrate to include timezones on all of our timestamp columns. Change-Id: Ibfda8e7a3d5972b7798fb61b31ff56419c64ea35	2020-02-24 17:07:07 +00:00
Yingrong Zhao	77f67a8086	satellite/metainfo: add timeout for delete request Change-Id: I9cad6d7ea185fc2c0ed4e58b42e4e3a78178a79f	2020-02-20 09:10:16 +00:00
JT Olio	2ae9978304	satellite/gc: skip first gc run rationale: if GC kills the satellite, it would be nice to make it through a repair checker sweep first Change-Id: Id56171dc8e13940cfb6481e36a910bad077a01ed	2020-02-13 13:41:15 +02:00
littleskunk	76849558cb	satellite/gracefulexit: increase performance and tolerate higher error rate Graceful exit is very slow at the moment. Over the last couple days we increase the batch size on Stefans satellite to 1000 but as a side effect the error rate was increased. With a batch size of 500 the error rate looks stable. This PR will increase the default to batch size to 300. Graceful exit will still be painful slow but at least it will be a bit faster. At the same time this PR also increases the number of errors we tolerate. We don't want to DQ slow storage nodes just because they didn't finish all 300 transfers in time. We want to give them more retries. Change-Id: I92e3f99e116d4988457d8b902a88e85ed1bcc1a7	2020-02-12 11:40:15 +00:00
Egon Elbre	dbf46c4aa7	satellite/admin: administrative endpoint Admin server allows creating basic REST and html API-s for different administrative tasks. Change-Id: I3dc1786abe1c87350eed60ec90e48130f44e63cf	2020-02-12 12:12:50 +02:00
Cameron Ayer	b22bf16b35	satellite/overlay: add config flag for node selection free disk requirement Currently SNs report their free disk space once per hour. If a node becomes full, it has to wait until the next contact cycle begins to report; all the while receiving and failing upload requests. By increasing the minimum required disk space, we can give the storage nodes more time to report their space before the completely fill up. This change goes hand-in-hand with another change we want to implement: trigger capacity report on SN immediately upon falling below threshold. Change-Id: I12f778286c6c3f582438b0e2949765ac43325e27	2020-02-11 18:08:25 +00:00
Qweder93	dc075eaa96	satellite/payments : deposit bonuses (credits) added Change-Id: Ib151bbb9b02d655fa619c53bfbc04ed6f3bb39e0	2020-02-11 11:11:42 +00:00
Egon Elbre	a2b2bc676b	pkg/debug: implement control panel Control Panel allows to control different chores and services. Currently this adds controlling of cycles. Change-Id: I734f1676b2a0d883b8f5ba937e93c45ac1a9ce21	2020-01-29 16:30:31 -05:00
littleskunk	e0cb8037c1	satellite/projectusage: reduce usage limit from 5GB to 0GB Change-Id: Ie3d2509613e7a4336e2a8d2b136b32f5f308aafc	2020-01-29 20:38:39 +00:00
Ethan	149273c63f	satellite/metainfo: add cache expiration for project level rate limiting Allow rate limit project cache to expire so we can make project level rate limit changes without restarting the satellite process. Change-Id: I159ea22edff5de7cbfcd13bfe70898dcef770e42	2020-01-29 16:14:10 +00:00
Yaroslav Vorobiov	083b396c16	satellite/payments: allow floating point numbers for pricing Change-Id: I78b60134cf043746efef5371b761939a10f75aaf	2020-01-28 22:52:13 -05:00
littleskunk	a0c9f7f3b0	satellite/projectusage: reduce usage limit from 25GB to 5GB Change-Id: I2819012b520fd687ab8058000aa38d76b8208158	2020-01-29 04:01:09 +01:00
littleskunk	a6c6440ab7	satellite/order: decrease expire time from 7 days to 2 days For the last few month we had no issues with order submission. I would call it stable and now it is time to risk a lower expire time. This will increase the database performance on the satellite and it will reduce the delay for billing. The long term goal is 6h but for that step we need to change graceful exit first. At the moment storage nodes would get disuqlaified for not transfering alle pieces in less than 6 hours. Change-Id: I421a2c2421c5374c4e706e2338f1c2161fedc14c	2020-01-24 23:37:39 +00:00
Michal Niewrzal	6502454947	satellite/metainfo: move RS configuration to satellite With this change RS configuration will be set on satellite. Uplink with get RS values with BeginObject request and will use it. For backward compatibility and to avoid super large change redundancy scheme stored with bucket is not touched. This can be done in future. Change-Id: Ia5f76fc10c37e2c44e4f7b8754f28eafe1f97eff	2020-01-22 09:33:53 +00:00
Ethan	21a5d70a83	satellite/metainfo: Rate limiting - API requests Limits how many times metainfo APIs can be called per second by project ID. If limit is exceeded, the API will return Unauthorized/Too Many requests. Limit per second and the size of the limiter cache per project are configurable, as well as whether the limiter is enabled. Tests added/updated for the new rate_limit field in projects table. Tests added for exceeding limits and disableing limiter. Change-Id: Ic8ad102de3b690a475809d4f684156d5715f20fa	2020-01-21 14:25:04 +00:00
stefanbenten	f4097d518c	satellite: reduce logging of node status Change-Id: I6618cf4bf31b856acd7a28b54011a943c03ab22a	2020-01-18 17:47:59 +00:00
Cameron Ayer	4424697d7f	satellite/accounting: refactor live accounting to hold current estimated totals live accounting used to be a cache to store writes before they are picked up during the tally iteration, after which the cache is cleared. This created a window in which users could potentially exceed the storage limit. This PR refactors live accounting to hold current estimations of space used per project. This should also reduce DB load since we no longer need to query the satellite DB when checking space used for limiting. The mechanism by which the new live accounting system works is as follows: During the upload of any segment, the size of that segment is added to its respective project total in live accounting. At the beginning of the tally iteration we record the current values in live accounting as `initialLiveTotals`. At the end of the tally iteration we again record the current totals in live accounting as `latestLiveTotals`. The metainfo loop observer in tally allows us to get the project totals from what it observed in metainfo DB which are stored in `tallyProjectTotals`. However, for any particular segment uploaded during the metainfo loop, the observer may or may not have seen it. Thus, we take half of the difference between `latestLiveTotals` and `initialLiveTotals`, and add that to the total that was found during tally and set that as the new live accounting total. Initially, live accounting was storing the total stored amount across all nodes rather than the segment size, which is inconsistent with how we record amounts stored in the project accounting DB, so we have refactored live accounting to record segment size Change-Id: Ie48bfdef453428fcdc180b2d781a69d58fd927fb	2020-01-16 10:26:49 -05:00
Jeff Wendling	78c6d5bb32	satellite/satellitedb: reported_serials table for processing orders this commit introduces the reported_serials table. its purpose is to allow for blind writes into it as nodes report in so that we have minimal contention. in order to continue to accurately account for used bandwidth, though, we cannot immediately add the settled amount. if we did, we would have to give up on blind writes. the table's primary key is structured precisely so that we can quickly find expired orders and so that we maximally benefit from rocksdb path prefix compression. we do this by rounding the expires at time forward to the next day, effectively giving us storagenode petnames for free. and since there's no secondary index or foreign key constraints, this design should use significantly less space than the current used_serials table while also reducing contention. after inserting the orders into the table, we have a chore that periodically consumes all of the expired orders in it and inserts them into the existing rollups tables. this is as if we changed the nodes to report as the order expired rather than as soon as possible, so the belief in correctness of the refactor is higher. since we are able to process large batches of orders (typically a day's worth), we can use the code to maximally batch inserts into the rollup tables to make inserts as friendly as possible to cockroach. Change-Id: I25d609ca2679b8331979184f16c6d46d4f74c1a6	2020-01-15 19:21:21 -07:00
Isaac Hess	4950d7106a	satellite/orders: Add write cache for bw rollups Change-Id: I8ba454cb2ab4742cafd6ed09120e4240874831fc	2020-01-13 22:40:51 +00:00
Jeff Wendling	77fd41a02e	satellite: add an expiring lru cache around api keys Change-Id: I995429c66affd33da59b091f28f09ca122070b5e	2020-01-09 22:13:41 -07:00
Natalie Ventura Villasana	6b1829f3c3	satellite/downtime: new chore estimates downtime Adds EstimationChore to the downtime package, which is an independent chore that finds offline nodes given a configurable limit, then uptime checks those nodes, and sets a last contact success or failure given a response. For failed nodes, the chore updates the amount of downtime the node has been offline in the DowntimeTracking table. Design doc section: https://github.com/storj/storj/blob/master/docs/blueprints/storage-node-downtime-tracking.md#estimating-offline-time Jira: https://storjlabs.atlassian.net/browse/V3-2545 Change-Id: I60af95803930bf9b33232b248bb20cca6f0e0b5f	2020-01-09 15:05:13 -05:00
Yingrong Zhao	76ee8a1b4c	satellite: remove UptimeReputation configs from codebase With the new storage node downtime tracking feature, we need remove current uptime reputation configs: UptimeReputationAlpha, UptimeReputationBeta, and UptimeReputationDQ. This is the first step of removing the uptime reputation columns from satellitedb Change-Id: Ie8fab13295dbf545e33aeda0c4306cda4ba54e36	2020-01-08 18:54:15 +00:00
Jeff Wendling	29fe206b9a	satellite/gc: add timeout to retain requests We don't want slowloris nodes to be able to indefinitely block up the satellite, so add a timeout. Some monitoring inspection showed the largest success times being on the order of 30s, so a 1min timeout should be sufficient to kill the misbehaving nodes. Change-Id: I5e2c3480a15f6304e37262d0a4d30d07eae99bb3	2020-01-03 21:46:46 +00:00
Simon Guindon	e1e7cebe49	satellite/metainfo: added rate limiting support to the metainfo loop. As per discussed we decided to rate limit how fast we iterate through the metainfo database in the metainfo loop. This puts in place a mechanism for rate limiting and burst limiting if need be in the future. The default for this rate limiting is still no limits so it stays the same as our previous functionality. Change-Id: I950f7192962b0e49f082d2c4284e2d52b0a925c7	2020-01-03 15:00:29 -05:00
Ethan	05b406e992	satellite:{downtime,overlay}: Implement offline node detection chore https://storjlabs.atlassian.net/browse/V3-3398 Change-Id: I598c3bad819026377d1d113c099dc9bba8b02742	2020-01-03 17:10:03 +00:00
Natalie Ventura Villasana	aa3e183c2e	satellite/gracefulexit: add ge eligibility check Adds check to see if storage nodes are eligible to initiate graceful exit, by checking their CreatedAt date and seeing if their "age" is greater than the new config value: NodeMinAgeInMonths The default for this value is 6 months for now. https://storjlabs.atlassian.net/browse/V3-3357 Change-Id: Ib807ab8987ddb5a38a27a83886490f73fe8c5816	2019-12-31 09:31:58 -05:00
littleskunk	d5c5b57fac	satellite/db: enable DeleteTallies Change-Id: I1e2a6873b3e6398260e053592d676993272b960d	2019-12-18 13:16:06 +00:00
littleskunk	71b58edb2c	satellite/repair: decrease repair interval Change-Id: Id9efdbfaa82521c35dc41e7a52b700522c197e77	2019-12-10 00:36:00 +00:00
littleskunk	6ab72a6e79	satellite/gracefulexit: enable graceful exit in production Change-Id: I526ce4a4de9c318f1333b793e3167f5f86d65adc	2019-12-09 17:32:34 +00:00
Malcolm Bouzi	18a5e614d9	satellite/web: add segmentio plugin (#3405 )	2019-11-27 11:57:59 -05:00
Yingrong Zhao	63e51df9a6	private/testplanet: add a mock referral manager server into testplanet (#3631 )	2019-11-21 17:34:49 -05:00
Matt Robinson	976881f72b	satellite/console: Add security headers (#3615 ) * satellite/console: Add X-Frame-Options and Referrer-Policy security headers * Update to use CSP instead of XFO and include tardigrade.io * Make FrameAncestors a config option * Update satellite-config lock * Make help text for FrameAncestors better	2019-11-21 11:15:22 -05:00
littleskunk	c52c7275ad	satellite/repair: reduce upload timeout (#3597 )	2019-11-18 18:52:56 +01:00
Nikolai Siedov	3fe518d547	satellite: added ability to inject stripe public key post build (#3560 )	2019-11-18 13:38:43 +02:00
Yaroslav Vorobiov	53c6741ba6	satellite/payments: add API for retrieving conversion ratio, convert tokens to USD before applying to balance (#3530 )	2019-11-15 16:59:39 +02:00
Yehor Butko	a8e4e9cb03	satellite/payments: project usage charges (#3512 )	2019-11-15 16:27:44 +02:00
Natalie Villasana	1a9757a7f2	satellite/gracefulexit: add count for order limits sent from satellite to exiting node (#3544 )	2019-11-13 09:54:50 -05:00
Yaroslav Vorobiov	0b32690d0a	satellite/peer: add payments config (#3488 ) * satellite/peer: add payments config * remove stripe-key from console config * update config lock * fix imports * fix config-lock	2019-11-05 21:26:19 +01:00
littleskunk	def3dcbaa9	satellite/audit: increase timeout to 5 minutes (#3480 ) * satellite/audit: increase timeout to 5 minutes * fix lint error	2019-11-05 11:21:25 +01:00
Maximillian von Briesen	590312970d	satellite/gracefulexit: add flag for enabling/disabling graceful exit on the satellite (#3437 )	2019-11-01 16:21:24 +02:00
Maximillian von Briesen	d9bb25b4b9	satellite/metainfo: support a wider range of values for RS.Total in satellite metainfo validation (#3431 ) change uplink RS default configuration from 130 to 95	2019-10-31 15:04:33 -04:00
Yingrong Zhao	bfa6699e2c	satellite/repair: add timeout for repair download from a single node(#3418 )	2019-10-30 16:31:08 -04:00
Natalie Villasana	4878135068	satellite/gracefulexit, storagenode/gracefulexit: add timeouts (#3407 )	2019-10-30 13:40:57 -04:00
Yingrong Zhao	fa1ac24e19	satellite/gracefulexit: add failure threshold check (#3329 ) * add overall failure percentage check and inactive time frame check before sending a response to sno * update comment * delete node from transfer queue if it has been inactive for too long * fix linting error * add test config value * fix nil pointer * add config value into testplanet * add unit test for overall failure threshold * move timeframe threshold to chore * update protolock * add chore test * add per peiece failure count logic * change config name from EndpointMaxFailures to MaxFailuresPerPiece * address comments * fix linting error * add error handling for no row returned from progress table * fix test for graceful exit chore on storagenode * fix typo InActive -> Inactive * improve readability for failure threshold calculation * update config lock * change error handling for GetProgress in graceful exit endpoint on the satellite side * return proper rpc error in endpoint * add check in chore test for checking finish timestamp and queue	2019-10-24 12:24:42 -04:00
littleskunk	2a5526fcc4	satellite/repair: reduce timeout (#3302 )	2019-10-18 13:43:24 +02:00
Natalie Villasana	855fca003d	satellite/metrics: create a metrics chore (#3263 ) * add metrics counter and chore * updates metrics observer interval release default and dev default to 15min * add more specific check for remote pointers * add Counter field to metrics chore, add counter tests * rm redundant ObjectCount suffix * make pointer check easier to read * change metrics.Config.Interval to ChoreInterval * rm unneeded var * fix comment * update satellite config lock	2019-10-16 14:08:33 -04:00
Cameron	76ad83f12c	satellite/accounting: add redis support to live accounting (#3213 ) * set up redis support in live accounting * move live.Service interface into accounting package and rename to Cache, pass into satellite * refactor Cache to store one int64 total, add IncrBy method to redis client implementation * add monkit tracing to live accounting	2019-10-16 12:50:29 -04:00
Jennifer Li Johnson	b185dbbee2	satellite/discovery: remove discovery related code (#3175 )	2019-10-14 10:57:01 -04:00
littleskunk	96aeedcdee	OrderLimit/GracePeriod: Increase time window from 1h to 24h (#3255 ) * OrderLimit/GracePeriod: Increase time window from 1h to 24h * update satellite config lock	2019-10-13 17:40:24 +02:00
Ethan Adams	a1275746b4	satellite/gracefulexit: Implement the 'process' endpoint on the satellite (#3223 )	2019-10-11 17:18:05 -04:00
Ethan Adams	4c4519f0be	satellite/gracefulexit: add transfer queue for pieces (#3174 ) initial impl of transfer queue updated docs represent the new design how we handle durability during exit	2019-10-07 16:38:05 -04:00
Stefan Benten	1db4251234	Satellite/repair: Add Repair Threshold Override to allow earlier repair (#3151 )	2019-10-02 14:58:37 +02:00
Maximillian von Briesen	08ed50bcaa	satellite/metainfo: add commit interval to prevent long delays between order limit creation and segment commit (#3149 )	2019-10-01 12:55:02 -04:00
Bogdan Artemenko	423d35fb3f	satellite/console: Added support URLs and other fields to config file (#3090 )	2019-09-27 10:48:53 -06:00
Stefan Benten	c71f3a3f4a	internal/version: Change default endpoint to query (#3126 ) * change default domain name change default domain name to point to the new version control * Update satellite-config.yaml.lock	2019-09-25 22:55:38 +02:00
Jennifer Li Johnson	724bb44723	Remove Kademlia dependencies from Satellite and Storagenode (#2966 ) What: cmd/inspector/main.go: removes kad commands internal/testplanet/planet.go: Waits for contact chore to finish satellite/contact/nodesservice.go: creates an empty nodes service implementation satellite/contact/service.go: implements Local and FetchInfo methods & adds external address config value satellite/discovery/service.go: replaces kad.FetchInfo with contact.FetchInfo in Refresh() & removes Discover() satellite/peer.go: sets up contact service and endpoints storagenode/console/service.go: replaces nodeID with contact.Local() storagenode/contact/chore.go: replaces routing table with contact service storagenode/contact/nodesservice.go: creates empty implementation for ping and request info nodes service & implements RequestInfo method storagenode/contact/service.go: creates a service to return the local node and update its own capacity storagenode/monitor/monitor.go: uses contact service in place of routing table storagenode/operator.go: moves operatorconfig from kad into its own setup storagenode/peer.go: sets up contact service, chore, pingstats and endpoints satellite/overlay/config.go: changes NodeSelectionConfig.OnlineWindow default to 4hr to allow for accurate repair selection Removes kademlia setups in: cmd/storagenode/main.go cmd/storj-sim/network.go internal/testplane/planet.go internal/testplanet/satellite.go internal/testplanet/storagenode.go satellite/peer.go scripts/test-sim-backwards.sh scripts/testdata/satellite-config.yaml.lock storagenode/inspector/inspector.go storagenode/peer.go storagenode/storagenodedb/database.go Why: Replacing Kademlia Please describe the tests: • internal/testplanet/planet_test.go: TestBasic: assert that the storagenode can check in with the satellite without any errors TestContact: test that all nodes get inserted into both satellites' overlay cache during testplanet setup • satellite/contact/contact_test.go: TestFetchInfo: Tests that the FetchInfo method returns the correct info • storagenode/contact/contact_test.go: TestNodeInfoUpdated: tests that the contact chore updates the node information TestRequestInfoEndpoint: tests that the Request info endpoint returns the correct info Please describe the performance impact: Node discovery should be at least slightly more performant since each node connects directly to each satellite and no longer needs to wait for bootstrapping. It probably won't be faster in real time on start up since each node waits a random amount of time (less than 1 hr) to initialize its first connection (jitter).	2019-09-19 15:56:34 -04:00
Jennifer Li Johnson	ce3203e910	update NodeSelectionConfig.OnlineWindow to 4hr default (#3082 )	2019-09-18 14:57:57 -04:00
Andrew Harding	f550ab5d1c	Uplink "import" command (#2981 ) * uplink import cmd * pkg/process: fix import order * fix golangci-lint failures * remove "help" from the satellite config lock file	2019-09-13 12:33:30 -06:00
Natalie Villasana	aa3567187e	satellite/audit: worker now verifies and reverifies (#2965 )	2019-09-11 18:37:01 -04:00
Natalie Villasana	6d363fb756	satellite/audit: create the audit queue, chore, and worker (#2888 )	2019-09-05 11:40:52 -04:00
Cameron	af5fb8e9c5	satellite/vouchers: deprecate voucher endpoint, return 'please upgrade' error (#2940 ) * voucher endpoint returns 'please upgrade' error, test	2019-09-04 13:21:02 -04:00
Yingrong Zhao	10a896bf73	web/marketing: static asset path (#2872 ) * use relative path instead of absolute path * add template func baseURL * add a method * update storj-sim * add comment	2019-08-30 18:43:53 -04:00
Cameron	599324c364	satellite/dbcleanup: delete expired serials from satellite (#2867 ) Creates a new chore, dbcleanup, which can be used for routine deletion of items from the satellite database and adds functionality for deletion of expired serial numbers	2019-08-27 13:12:38 -04:00
Natalie Villasana	243cedb628	satellite/audit: implement reservoir struct and RemoteSegment observer method (#2744 )	2019-08-21 11:49:27 -04:00
ethanadams	1a69ec8318	satellite/orders: document protocol and fix typos (#2813 ) * Addressing comments from PR 2762 * Rebuild of orders.pb.go after comments added to proto file * run update-satellite-config-lock for spelling fix.	2019-08-19 09:36:11 -04:00
ethanadams	8df683a265	Update satellite settlement endpoint to batch order processing into transactions. (#2762 ) Update satellite settlement endpoint to batch order processing into transactions	2019-08-15 15:05:43 -04:00
littleskunk	3e41767f22	satellie/gc: enable garbage collection on the satellite (#2765 )	2019-08-12 20:30:09 +02:00
Egon Elbre	c8edeb0257	satellite/overlay: rename overlay.Cache to overlay.Service (#2717 )	2019-08-06 19:35:59 +03:00
Jeff Wendling	21a3bf89ee	cmd/uplink: use scopes to open (#2501 ) What: Change cmd/uplink to use scopes It moves the fields that will be subsumed by scopes into an explicit legacy section and hides their configuration flags. Why: So that it can read scopes in from files and stuff	2019-08-05 11:01:20 -06:00
ethanadams	c9b46f2fe2	V3-1987: Optimize audits stats persistence (#2632 ) * Added batch update stats for recordAuditSuccessStatus * Added batch update stats to recordAuditFailStatus * added configurable batch size * build individual update/delete statements so the statements can be batched into 1 call to the DB * notified #config-changes channel and ran make update-satellite-config-lock * updated tests to use batch update stats	2019-07-31 13:21:06 -04:00
Natalie Villasana	f11413bc8e	Implement garbage collection on satellite (#2577 ) * Added a gc package at satellite/gc, which contains the gc.Service, which runs garbage collection integrated with the metainfoloop, and the gc PieceTracker, which implements the metainfo loop Observer interface and stores all of the filters (about which pieces are good) for each node. * Added a gc config located at satellite/gc/service.go (loop disabled by default in release) * Creates bloom filters with pieces to be retained inside the metainfo loop * Sends RetainRequests (or filters with good piece ids) to all storage nodes.	2019-07-24 13:26:43 -04:00
Maximillian von Briesen	6c1c3fb4a7	Add metainfo loop service (#2563 ) Add a metainfo loop service on the satellite that can be subscribed to by various services that need to make use of metainfo information	2019-07-22 09:34:12 -04:00
Alexander Leitner	64b2769de3	discovery: parallelize refresh (#2535 ) * parallelize discovery refresh * add paginateQualifiedtest, address pr comments * Remove duplicate uptime update * Lower concurrency in Testplanet for discovery	2019-07-12 10:35:48 -04:00
Ivan Fraixedes	f420b29d35	[V3-1927] Repairer uploads to max threshold instead of success… (#2423 ) * pkg/datarepair: Add test to check num upload pieces Add a new test for ensuring the number of pieces that the repair process upload when a segment is injured. * satellite/orders: Don't create "put order limits" over total Repair must not create "put order limits" more than the total count. * pkg/datarepair: Update upload repair pieces test Update the test which checks the number of pieces which are uploaded during a repair for using the same excess over the success threshold value than the implementation. * satellites/orders: Limit repair put order for not being total Limit the number of put orders to be used by repair for only uploading pieces to a % excess over the successful threshold. * pkg/datarepair: Change DataRepair test to pass again Make some changes in the DataRepair test to make pass again after the repair upload repaired pieces only until a % excess over success threshold. Also update the steps description of the DataRepair test after it has been changed, to match on what's now, besides to leave it more generic for avoiding having to update it on minimal future refactorings. * satellite: Make repair excess optimal threshold configurable Add a new configuration parameter to the satellite for being able to configure the percentage excess over the optimal threshold, used for determining how many pieces should be repaired/uploaded, rather than having the value hard coded. * repairer: Add configurable param to segments/repairer Add a new parameters to the segment/repairer to calculate the maximum number of excess nodes, based on the optimal threshold, that repaired pieces can be uploaded. This new parameter has been added for not returning more nodes than the number of upload orders for data repair satellite service calculate for repairing pieces. * pkg/storage/ec: Update log message in clien.Repair * satellite: Update configuration lock file	2019-07-12 00:44:47 +02:00
Bill Thorp	0e463dccfd	7 day validity window for order limits (#2520 ) * 7 day limit	2019-07-10 17:17:00 -04:00
Stefan Benten	16156e3b3d	Ensure we force a segment size and account storage before committing them (#2473 )	2019-07-08 18:24:38 -04:00
Egon Elbre	674742d1a7	satellite/datarepair: use reliability cache (#1976 )	2019-07-09 01:04:35 +03:00
littleskunk	a2362f92dc	Rollback uptime disqualification (#2417 )	2019-07-02 10:39:36 +02:00
littleskunk	9e62423f47	reduce vetting requirement (#2416 )	2019-07-02 01:02:23 +02:00
JT Olio	8c57434ded	pkg/process/metrics: add an instance prefix (#2190 ) * pkg/process/metrics: add an instance prefix the distinction between which satellite is sending which data should go in the instance field, not the suffix or application fields. (un)fortunately, the instance id is deliberately not configurable because we don't want it to be easy to accidentally have multiple applications collide with the same instance id. so we're currently stuffing the human readable instance in the suffix. :( perhaps a reasonable tradeoff would be an optional instance prefix that allows operators to put their domain name in the instance Change-Id: I6fcc8498be908c5740439cc00f77474ad151febd * linting Change-Id: I9f9a44fa9a2634ef5e4f89548d42d57ce9e4450e	2019-06-24 16:45:37 -06:00
Maximillian von Briesen	fd6a4d96f2	change uptime dq threshold to 0.4 (#2313 ) * change uptime dq threshold to 0.4 * update config lock	2019-06-24 12:18:32 -04:00
Cameron	1283036e37	add storage node voucher request service (#2158 ) * add voucher service on storage node * config field tag syntax, go routines for requests * hook up voucher service in storagenode/peer.go * add voucher config to testplanet * add voucher config to testplanet * add voucher response status INVALID, ACCEPTED, REJECTED * add a test for vouchers service * handle no row from GetValid, test it * add trust pool to voucher service * use trusted list to get satellites * verify vouchers upon receipt * test VerifyVoucher	2019-06-21 18:48:52 -04:00
aligeti	043d603cbe	satellite rs config check with validation check set to false default (#2229 ) * satellite rs config check with validation check	2019-06-21 14:15:58 -04:00
Bill Thorp	8f47fca5d3	Remove audit / uptime ratio fields (#2247 ) * removed ratios	2019-06-21 13:14:53 -04:00
JT Olio	568b000e9b	satellite: make order expiration configurable (#2251 )	2019-06-21 13:38:40 +03:00
Natalie Villasana	edb3d1cbf8	pkg/overlay: update node selection config values for reputation (#2264 )	2019-06-20 15:01:50 -04:00
Natalie Villasana	9386187fe6	add disqualification and new reputation system into overlay cache (#2227 )	2019-06-20 09:56:04 -04:00
Natalie Villasana	b30c35d306	change ReputationAuditOmega (et al.) to AuditReputationWeight (#2232 )	2019-06-18 14:17:25 -04:00
JT Olio	e58a06bd0c	config: update release values to match prod (#2192 )	2019-06-15 18:19:19 +02:00
littleskunk	319cc77a34	increase audit timeout (#2208 )	2019-06-14 13:53:49 +02:00
JT Olio	df2fad15d8	pkg/process/logging: different defaults for release/dev (#2191 ) * pkg/process/logging: different defaults for release/dev Change-Id: I55be80430a31668fededf479b052e106ab18d9ce * linting Change-Id: I4e50d4c9569b7324c4704c14df7dd3228dbb7dd5 * Trigger Jenkins * fix lock file * use dev=debug and prod=info	2019-06-13 10:43:39 -06:00
Egon Elbre	fdddaa2a47	Fix marketingweb code (#2177 ) * set to only listen on 127.0.0.1, move static files to same location, better template handling * handle error * fix path in storj-sim * revert template handling changes * code shouldn't panic on invalid tempalte * do not rewrite once writing has started * write correct error code * use filepath for path handling * revert change * fix * fix mod tidy * use correct error code for not found, avoid infinite loop on failure	2019-06-12 09:42:39 -04:00
Yingrong Zhao	af66d9c6e4	Open a new port on satellite for admin GUI (#1901 ) * Set up new port 8090 for in offers Clean up commented code Rename offers to offersweb Remove unused code Add todos for adding front-end templates Add middleware for only allow local access Add comment Fix linting error Remove commented code Update storj-sim Check request IP against Host IP Use net pakcage to retrieve IP address Rename service to marketing * Add wrapper for all errors * fix conflicts * update the config file * fix linting error * remove unused packages * remove global runtime var and add flag to storj-sim for mar static dir * remove debugging lines * add new config for test data and check if static dir flag is set before passing to mux * change 'console' to 'marketing' for test data config * fix linting errors * update config flag * Trigger Jenkins * Trigger CLA	2019-06-11 11:00:59 -04:00
Ivan Fraixedes	f624b213a3	reputation: Add configuration parameters (#2150 ) * reputation: Add configuration parameters Add the configuration parameters which will be used by the algorithm which will calculate the storage node reputation. Because the reputation calculation is based on audit and uptime check results some configuration parameters are in pkg/audit, others in pkg/discovery and other in the satellite which will combine the both reputation results to obtain the storage node reputation for repair and uplink. * satellite-config: Refresh lock file with new params Refresh the Satellite configuration yaml lock file with the new parameters added in this branch.	2019-06-11 12:14:01 +02:00
Egon Elbre	749846b42b	Update golangci-lint (#2159 )	2019-06-10 11:52:09 +03:00
JT Olio	469d485f62	dbutil: reduce defaults so sum is < 100 (#2157 )	2019-06-09 21:04:22 +02:00
JT Olio	43d4f3daf5	discovery: remove graveyard (#2145 )	2019-06-07 08:40:51 +03:00
Stefan Benten	23213a7a29	Fix Config Comments in DButil Package (#2119 ) * Fix Config Comments * Updating lock file * Update lock file * Set Actual time.Duration * Fix Issue	2019-06-04 17:53:38 -06:00
JT Olio	d02427e41a	db: set max open conns, conn max lifetime, add db stat monitoring (#2117 )	2019-06-04 23:30:21 +02:00
paul cannon	d15eaed588	add capability of logging all GRPC calls/payloads (#2067 )	2019-06-04 14:55:24 +02:00
JT Olio	3fe8343b6c	repairer: fix config comments (#2105 )	2019-06-04 14:13:31 +02:00
Yaroslav Vorobiov	6809129e6f	Console add stripe service (#2080 )	2019-06-03 16:46:57 +03:00
Kaloyan Raev	2ab95b533e	Check errors for possible outcomes from audit's DownloadShares (#2072 )	2019-06-03 12:17:09 +03:00
JT Olio	e60ff9dcbb	process/metrics: have metrics suffix default to dev/release status (#2073 ) What: this will make it so release binaries default to whatever-release instead of whatever-dev in metrics collection Why: So we can monitor release binaries with default configuration without getting drowned out by dev binaries	2019-05-31 16:47:48 -06:00
Fadila	5b730e3073	Make maxReverifyCount configurable (#2071 ) * make max reverify count configurable	2019-05-31 17:23:00 +02:00
Cameron	590b1a5a1d	Satellite voucher service (#2043 ) * set up voucher service skeleton, basic test * add VetNode db method * basic test for VetNode * encode and sign voucher functions * fill out and sign vouchers * test pass/fail voucher request * match EncodeVoucher to other Encode functions	2019-05-30 15:52:33 -04:00
aligeti	934ebf9cbf	Added the irreparable repair functionality (#1955 ) * Added the irreparable repair functionality	2019-05-30 11:18:20 -04:00
Stefan Benten	8912d7149c	Fixes Auth Issue (#2064 )	2019-05-28 16:32:51 +02:00
Cameron	4058c29ca4	filter duplicate node IPs (#1890 ) * add last_ip field to dbx model node, generate dbx * add last_ip to node proto, generate pb * migrate * resolve address in transport.DialNode, update lastIp in cache.UpdateAddress * use net.SplitHostPort to isolate host address from port * define DistinctIPs flag * add test for GetIP * select last_ip when querying for nodes * if distinctIPs flag == true, query for nodes with distinct IPs * some basic tests * change last_ip to field 14 in proto * remove comments * check err * change distinctIPs to distinctIP * exclude IPs from newNodes in query for reputable nodes * add index on last_ip * only add to excludedIPs if flag is true * test half new nodes returns distinct IPs * fix alignment * add test * rework ip filter query, add retry logic, add switch for database driver * add retry to SelectNewNodes * change discovery intervals so IPs don't get overwritten * remove TestGetIP * edit updating node stats in test * split exclude into nodeIDs and IPs * separate non-distinct IP query into other function * trigger checks * remove else block	2019-05-22 16:06:27 -04:00
JT Olio	32b3f8fef0	cmd/storagenode: pull more things into releaseDefaults (#1980 )	2019-05-21 13:48:47 +02:00
paul cannon	02be91b029	real-time tracking of space used per project (#1910 ) Ran into difficulties trying to find the ideal solution for sharing these counts between multiple satellite servers, so for now this is a dumb solution storing recent space-usage changes in a big dumb in-memory map with a big dumb lock around it. The interface used, though, should allow us to swap out the implementation without much difficulty elsewhere once we know what we want it to be.	2019-05-09 20:39:21 -05:00
Bill Thorp	89c5e70003	defaults now commented out (#1878 ) * defaults now commented out, unless custom / user / override	2019-05-08 08:14:00 -04:00
Ivan Fraixedes	f8df461249	v3-1660: Create a test that breaks when satellite config file has changed (#1898 ) * scripts: check for satellite configuration changes Create a simple script that checks if the satellite configuration has differed with the last changes. * Makefile: add target to check satellite cfg changes Add a Makefile target which executes the script that checks if the last changes has made a change in the satellite configuration. * ci: add satellite cfg check on integration stage * FIXUP: use releae defaults rather than development ones * FIXUP: show the message when config differs & some cleanups * scripts: add script to update the satellite cfg lock Add a script for allowing to update the satellite configuration lock file and add Makefile target to run it. * scripts/testsdata: update satellite cfg lock Update the satellite configuration lock file with the last changes that satellite has suffered upstream.	2019-05-07 17:20:04 -04:00

... 4 5 6 7 8

395 Commits