The admission/v3 protocol now supports including arbitrary key/value
headers in each packet of metrics. This commit adds support for this,
so the lua config file can declare a filter that takes the key/value
headers into account.
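As an illustration, a filter over such headers might look like the
following sketch; Packet, Header, and HeaderFilter are hypothetical
names here, not the actual admission/v3 API:

```go
package main

import "fmt"

// Header is one key/value pair attached to a metrics packet
// (hypothetical type, for illustration only).
type Header struct {
	Key, Value string
}

// Packet carries the metric payload plus its headers.
type Packet struct {
	Headers []Header
	Data    []byte
}

// HeaderFilter reports whether a packet should be accepted, mirroring
// what a filter declared in the lua config might express.
type HeaderFilter func(p *Packet) bool

func main() {
	// accept only packets tagged with env=prod
	filter := HeaderFilter(func(p *Packet) bool {
		for _, h := range p.Headers {
			if h.Key == "env" && h.Value == "prod" {
				return true
			}
		}
		return false
	})

	p := &Packet{Headers: []Header{{Key: "env", Value: "prod"}}}
	fmt.Println(filter(p)) // true
}
```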
Change-Id: I41de8c018d33304ccf46ec221ae689d55c5fb1ee
When a storagenode begins to run low on capacity, we want to notify
the satellites before it completely runs out of space. To achieve this,
at the end of an upload request the SN checks whether its available
space has fallen below a certain threshold; if so, it triggers a
notification to the satellites.
The new NotifyLowDisk method on the monitor chore is implemented using
the common/sync2.Cooldown type, which allows us to execute contact only
once within a given timeframe, avoiding hammering the satellites with
requests.
This PR contains changes to the storagenode/contact package, namely
moving the methods involving actual satellite communication out of
Chore and into Service. This allows us to ping satellites from the
monitor chore.
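A minimal sketch of the pattern, with a simplified stand-in for the
Cooldown type (the real common/sync2 API may differ):

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// cooldown lets an action run at most once per interval; a stand-in
// for common/sync2.Cooldown, whose exact API may differ.
type cooldown struct {
	mu   sync.Mutex
	last time.Time
	gap  time.Duration
}

func (c *cooldown) allow() bool {
	c.mu.Lock()
	defer c.mu.Unlock()
	if time.Since(c.last) < c.gap {
		return false
	}
	c.last = time.Now()
	return true
}

type monitor struct {
	cool      cooldown
	threshold int64 // bytes of free space below which we notify
}

// NotifyLowDisk contacts the satellites unless we contacted them
// recently (hypothetical signature, for illustration).
func (m *monitor) NotifyLowDisk(available int64) {
	if available >= m.threshold {
		return
	}
	if !m.cool.allow() {
		return // already notified within the cooldown window
	}
	fmt.Println("contacting satellites: low disk, bytes left =", available)
}

func main() {
	m := &monitor{threshold: 5 << 30, cool: cooldown{gap: time.Hour}}
	m.NotifyLowDisk(1 << 30) // triggers contact
	m.NotifyLowDisk(1 << 30) // suppressed by the cooldown
}
```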
Change-Id: I668455748cdc6741291b61130d8ef9feece86458
common/pb moved the grpc code to a separate package, common/pb/pbgrpc.
This commit updates this repository to use it.
Change-Id: I2de2a190688871cf9cb61f7ea511f8a01e264e4e
We are trying to identify the root cause of the satellite load
limitations (currently the satellite can handle at most about 400 rps
for uploads, and we need this to be higher), so we are using the golang
diagnostic tools to collect insight into what the bottlenecks are. We
currently have a debug endpoint to gather some cpu and mem data, but it
could be useful to have continuous profiling. GCP Stackdriver has
support for continuous profiling, so let's set that up and see if it is
helpful for gathering more data.
This PR adds support for the
[GCP continuous profiler](https://cloud.google.com/profiler), which
allows enabling continuous cpu/mem profiling; the stats are sent to
Stackdriver in the Google Cloud console.
To enable continuous profiling for a storj component, do the following:
- prereq: the workload must be running in GKE and have the Stackdriver
  Profiler IAM role permissions
- provide the config flag `debug.profilename` in the config.yaml file
  for the workload (e.g. the satellite api process). The profile name
  should be the workload name, for example "satellite-api".
- once the above config flag is provided, the profiler will be
  initialized, and profiling stats will automatically be sent to the
  GCP project where the workload is running, viewable on the
  Stackdriver Profiler page in the console
The current implementation assumes the workload is running in GKE;
however, if we find it useful we can add support for enabling this from
anywhere. But for simplicity, it's configured this way since the main
goal is to enable it in production systems.
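Wiring this in is small. A minimal sketch, assuming the workload runs
in GKE (where the project ID and credentials are discovered
automatically) and simplifying the flag plumbing to a plain argument:

```go
package main

import (
	"log"

	"cloud.google.com/go/profiler"
)

// initProfiler starts continuous profiling when a profile name is
// configured; with an empty name, profiling stays disabled.
func initProfiler(profileName string) {
	if profileName == "" {
		return
	}
	if err := profiler.Start(profiler.Config{Service: profileName}); err != nil {
		// profiling is best-effort; don't take the process down
		log.Printf("failed to start continuous profiler: %v", err)
	}
}

func main() {
	initProfiler("satellite-api")
	// ... start the rest of the process ...
}
```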
Change-Id: Ibf8ebe2df7bf06fdd4951ee6a1e48854dd36ad47
With commit 3331b443e7, the satellite will
start calling `DeletePieces`. Therefore, we can remove the old endpoint
once the above commit is deployed to all satellites.
Change-Id: I0124bc00a7cb808d119eb59f8fcd7fadf68158bb
we want to return to the user as quickly as possible while continuing
to delete the remaining pieces on the storagenodes.
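a minimal sketch of the pattern, with hypothetical names (the real
code paths differ): acknowledge the delete right away and let the
remaining piece deletions continue in the background.

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// deletePiece stands in for the rpc that deletes one piece on a node.
func deletePiece(ctx context.Context, node string) {
	fmt.Println("deleting piece on", node)
}

func deleteObject(ctx context.Context, nodes []string) error {
	// ... delete the object's metadata synchronously here ...

	// detach from the request context so the background work isn't
	// cancelled when the handler returns
	bg := context.WithoutCancel(ctx)
	go func() {
		for _, n := range nodes {
			deletePiece(bg, n)
		}
	}()
	return nil // return to the user without waiting on the nodes
}

func main() {
	_ = deleteObject(context.Background(), []string{"node-a", "node-b"})
	time.Sleep(100 * time.Millisecond) // demo only: let the goroutine run
}
```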
Change-Id: I04e9e7a80b17a8c474c841cceae02bb21d2e796f
this commit updates our monkit dependency to the v3 version, which
outputs in an influx style. this makes discovery much easier,
as many tools are built to consume metrics this way.
graphite and rothko will suffer somewhat due to the metrics no longer
being a dot-delimited tree. hopefully time will exist to update rothko
to index based on the new metric format.
it adds an influx output for the statreceiver so that we can
write to influxdb v1 or v2 directly.
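for reference, a point in influx line protocol looks roughly like
this (the metric and tag names here are made up):

```
function_times,scope=storj.io/storj/satellite/metainfo,name=commit_object count=42i,sum=1.25 1579000000000000000
```

measurement and tags come first, then the fields, then a nanosecond
timestamp; tools discover series by filtering on tags rather than by
walking a dot-delimited tree.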
Change-Id: Iae9f9494a6d29cfbd1f932a5e71a891b490415ff
it was noticed that if you had a long-lived transaction A that
was blocking some other transaction B, and A was being aborted
due to retriable errors, then transaction B was never given
priority. this was due to using savepoints to do lightweight
retries.
this behavior was problematic because we had some queries blocked
for over 16 hours, so this commit addresses the issue with two
prongs:
1. bound the amount of time we will retry a transaction
2. create new transactions when a retry is needed
the first ensures that we never wait for 16 hours; the value
chosen is 10 minutes. that should be long enough for an ample
number of retries for small queries, and huge queries probably
shouldn't be retried even if possible: it's preferable to
find a way to make them smaller.
the second ensures that even in the case of retries, queries that
are blocked on the aborted transaction gain priority to run.
between those two changes, the maximum stall time due to retries
should be bounded to around 10 minutes.
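a minimal sketch of both prongs against database/sql (the helper and
error names are hypothetical):

```go
package dbutil

import (
	"context"
	"database/sql"
	"errors"
	"time"
)

// errRetriable stands in for however the driver reports an aborted,
// retriable transaction.
var errRetriable = errors.New("retriable: transaction aborted")

func withTx(ctx context.Context, db *sql.DB, fn func(*sql.Tx) error) error {
	// prong 1: bound the total time spent retrying
	deadline := time.Now().Add(10 * time.Minute)
	for {
		tx, err := db.BeginTx(ctx, nil)
		if err != nil {
			return err
		}
		err = fn(tx)
		if err == nil {
			return tx.Commit()
		}
		// prong 2: abort entirely instead of rolling back to a
		// savepoint, so transactions blocked on us gain priority
		_ = tx.Rollback()
		if !errors.Is(err, errRetriable) || time.Now().After(deadline) {
			return err
		}
		// loop around and begin a brand new transaction
	}
}
```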
Change-Id: Icf898501ef505a89738820a3fae2580988f9f5f4
We moved PathCipher to encryption.Store, and we need to adjust
storj/uplink for those changes. The uplink repo also uses libuplink to
run tests, so we first need to adjust libuplink in storj/storj and
then storj/uplink.
Change-Id: I84f23e6bad18ac139f72c19939dc526f9f46d88b
this is to help protect against intentional or unintentional
slowloris-style problems where a client keeps a tcp connection
alive but never sends any data. because grpc is great, we have
to spawn a separate goroutine for every read/write to the stream
so that we can return from the server handler to cancel it if
necessary. yep. really.
additionally, we update the rpcstatus package to do some stack
trace capture and add a Wrap method for the times where we want
to just use the existing error.
also fixes a number of TODOs where we attach status codes to the
returned errors in the endpoints.
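a minimal sketch of the read half of that workaround (the stream
abstraction here is hypothetical, not our actual rpc types):

```go
package rpctimeout

import (
	"context"
	"time"
)

// stream abstracts the Recv half of a grpc server stream.
type stream[M any] interface {
	Recv() (M, error)
}

type recvResult[M any] struct {
	msg M
	err error
}

// recvWithTimeout races Recv against a timer, since a blocked Recv
// cannot be interrupted directly; on timeout the handler can return,
// which tears down the connection of a client that never sends data.
func recvWithTimeout[M any](ctx context.Context, s stream[M], d time.Duration) (M, error) {
	ch := make(chan recvResult[M], 1)
	go func() {
		msg, err := s.Recv()
		ch <- recvResult[M]{msg, err} // buffered: goroutine exits even if we gave up
	}()
	timer := time.NewTimer(d)
	defer timer.Stop()
	var zero M
	select {
	case r := <-ch:
		return r.msg, r.err
	case <-timer.C:
		return zero, context.DeadlineExceeded
	case <-ctx.Done():
		return zero, ctx.Err()
	}
}
```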
Change-Id: Id8bb8ff84aa34e0f711b0cf9bce3908b36a1d3c1
This reverts commit 8e242cd012.
Revert because lib/pq has known issues with context cancellation.
These issues need to be resolved before these changes can be merged.
Change-Id: I160af51dbc2d67c5449aafa406a403e5367bb555
this will allow for some nice runtime analysis down the road.
also, this allows wrapping database handles in a way that
can interact with these contexts.
requires https://review.dev.storj.io/c/storj/dbx/+/514
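as an illustration only (hypothetical names; this is not the dbx
change itself), a wrapped handle might observe each query like so:

```go
package dbwrap

import (
	"context"
	"database/sql"
	"log"
	"time"
)

// DB wraps *sql.DB so every query can be observed together with the
// context it ran under.
type DB struct {
	*sql.DB
}

func (db *DB) QueryContext(ctx context.Context, query string, args ...interface{}) (*sql.Rows, error) {
	start := time.Now()
	rows, err := db.DB.QueryContext(ctx, query, args...)
	// a real implementation would report to the monitoring context
	// rather than log
	log.Printf("query %q took %s (err=%v)", query, time.Since(start), err)
	return rows, err
}
```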
Change-Id: Ib087b7cd73296dd2c1e0331314da34d861f61d2b
Adds a check to see whether storage nodes are eligible to initiate
graceful exit, by checking their CreatedAt date and seeing if
their "age" is greater than the new config value
NodeMinAgeInMonths.
The default for this value is 6 months for now.
https://storjlabs.atlassian.net/browse/V3-3357
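A minimal sketch of the age check, using the CreatedAt and
NodeMinAgeInMonths names from this change (the surrounding wiring is
hypothetical):

```go
package main

import (
	"fmt"
	"time"
)

// eligibleForGracefulExit reports whether a node is old enough to
// initiate graceful exit.
func eligibleForGracefulExit(createdAt time.Time, minAgeMonths int, now time.Time) bool {
	cutoff := now.AddDate(0, -minAgeMonths, 0)
	// eligible only if the node was created at least minAgeMonths ago
	return !createdAt.After(cutoff)
}

func main() {
	now := time.Now()
	fmt.Println(eligibleForGracefulExit(now.AddDate(0, -7, 0), 6, now)) // true
	fmt.Println(eligibleForGracefulExit(now.AddDate(0, -2, 0), 6, now)) // false
}
```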
Change-Id: Ib807ab8987ddb5a38a27a83886490f73fe8c5816