NodeSelection struct is used to make decisions (and assertions) related to node selection.
Usually we don't use email and wallet for placement decisions, as they are not reliable.
But there are cases when we know that the email address is confirmed. They can also be used for upper-bound estimations (if the same wallet is used for too many pieces in a segment, it's a sign of risk, even if not all risks can be detected with this approach, as one owner can use different wallets).
Long story short: let's put wallet and email into SelectedNode.
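For illustration, the addition amounts to something like this (a minimal sketch; the field names and comments are assumptions based on the description above, not the exact definition):
```
// Only the newly added fields are shown; existing fields are omitted.
type SelectedNode struct {
	// ... existing node selection fields ...

	Email  string // operator email; not used for placement decisions by default
	Wallet string // operator wallet; usable for upper-bound risk estimations
}
```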
Change-Id: I922185e3769d43eb7762b8d60d88ecd3d50991bb
When we do `satellite run api --placement '...'`, the placement rules are not parsed well.
The problem comes from `viper.AllSettings()`, and the main logic is something like this (from a new unit test):
```
// p is the original placement rule definition (a string); t is the test's *testing.T.
r := ConfigurablePlacementRule{}
err := r.Set(p)
require.NoError(t, err)
serialized := r.String()
r2 := ConfigurablePlacementRule{}
err = r2.Set(serialized)
require.NoError(t, err)
require.Equal(t, p, r2.String())
```
`AllSettings()` evaluates the placement rules in `ConfigurablePlacementRules` and stores the string representation.
The problem is that we don't have a proper `String()` implementation (it prints out the structs instead of the original definition).
There are two main solutions for this problem:
1. We can fix `String()`. When we parse a placement rule, the `String()` method should print out the original definition.
2. We can switch to using a pure string as the configuration parameter, and parse the rules only when required.
I feel that option 1 is error prone. We could do it (and in this patch I added a lot of `String()` implementations), but it's hard to be sure that our `String()` logic stays in line with the parsing logic.
Therefore I decided to make the configuration value of the placements a string (or a wrapper around a string).
That's the main reason why this patch seems to be big, as I updated all the usages.
But the main part is at the beginning of `placement.go` (configuration parsing is no longer a pflag.Value implementation, but a separate step).
And in `filter.go` (a few more `String()` implementations for filters).
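The rough shape of the change, as a hedged sketch (`PlacementConfigValue` and `ParsePlacementRules` are hypothetical names used for illustration; only `ConfigurablePlacementRule` and its `Set()` are taken from the test above):
```
// The config/flag value stays an opaque string; nothing is parsed while viper/pflag handles it.
type PlacementConfigValue string

// ParsePlacementRules is the separate, explicit parsing step that runs only when the
// placement rules are actually needed, instead of inside a pflag.Value Set() call.
func ParsePlacementRules(value PlacementConfigValue) (ConfigurablePlacementRule, error) {
	rules := ConfigurablePlacementRule{}
	err := rules.Set(string(value))
	return rules, err
}
```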
https://github.com/storj/storj/issues/6248
Change-Id: I47c762d3514342b76a2e85683b1c891502a0756a
As I learned, `Include` was supposed to communicate that some internal change is also "included" into the filters during the check -> filters might be stateful.
But that's not the case any more after 552242387, where we removed the only stateful filter.
Change-Id: I7c36ddadb2defbfa3b6b67bcc115e4427ba9e083
We would like to make it easier to accept multiple annotations.
Examples:
```
country("GB") && annotation(...)
annotated(annotated(X,...),...)
```
Change-Id: I92e622e8b985b314dadddf83b17976c245eb2069
This patch is a one-liner: the rangedloop checker should check the subnets only if it's not turned off with a placement annotation.
(see in satellite/repair/checker/observer.go).
But I didn't find any unit test to cover that part, so I had to write one. I preferred to write it as a unit test, not an integration test, which requires a mock repair queue (observer_unit_test.go, mock.go).
Because it's a small patch, I also included another small change: creating a helper method to check if the AutoExcludeSubnet annotation is defined.
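Such a helper might look roughly like this (a hedged sketch: `GetAnnotation`, `AutoExcludeSubnet` and `AutoExcludeSubnetOFF` are assumed identifiers from the nodeselection package, and the helper name is made up for illustration):
```
// subnetCheckEnabled reports whether subnet-based checking is still desired for the
// placement's node filter, i.e. it has not been turned off via the AutoExcludeSubnet annotation.
func subnetCheckEnabled(filter NodeFilter) bool {
	return GetAnnotation(filter, AutoExcludeSubnet) != AutoExcludeSubnetOFF
}
```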
Change-Id: I2666b937074ab57f603b356408ef108cd55bd6fd
It's stateful, therefore it can hit naive users (NodeFilters couldn't be reused for more than one iteration).
But it looks like we don't need it, as `SelectBySubnet` does the same job.
Change-Id: Ie85b7f9c2bd9a47293f4e3b359f8b619215c7649
This patch makes it easier to configure existing placement rules only with string.
1. placement(n) rule can be used to reuse earlier definitions
2. && can be used in addition to all(n1,n2)
3. country(c) accepts exclusions (like '!RU'), regions ('EU','EEA'), all and none
See the 'full example' unit test, which uses all of these in a realistic example.
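Purely as an illustration (the authoritative syntax is whatever the 'full example' test uses; this string is not copied from it), a rule combining the constructs above could look like:
```
// illustrative only: reuses an earlier placement, chains with &&, and excludes a country
const exampleRule = `placement(0) && country("!RU")`
```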
https://github.com/storj/storj/issues/6126
Change-Id: Ica76f016ebd002eb7ea8103d4258bacd6a6d77bf
There are cases when we would like to override the default placement=0 rule.
For example, when we would like to exclude tagged nodes from the selection (by default).
Therefore we can't use a shortcut any more; we should always check the placement rules, even if we use placement=0.
TODO: we need to update common, and rename `EveryCountry` to `DefaultPlacement`, just to avoid confusion.
https://github.com/storj/storj/issues/6126
Change-Id: Iba6c655bd623e04351ea7ff91fd741785dc193e4
In the repair subsystem, it is necessary to acquire several extra
properties of nodes that are holding pieces of things or may be
selected to hold pieces. We need to know if a node is 'online' (the
definition of "online" may change somewhat depending on the situation),
if a node is in the process of graceful exit, and whether a node is
suspended. We can't just filter out nodes with all of these properties,
because sometimes we need to know properties about nodes even when the
nodes are suspended or gracefully exiting.
I thought the best way to do this was to add fields to SelectedNode,
and (to avoid any confusion) arrange for the added fields to be
populated wherever SelectedNode is returned, whether or not the new
fields are necessarily going to be used.
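For illustration, the kind of fields meant here (a sketch only; the actual field names and semantics in SelectedNode may differ):
```
// Only the newly discussed fields are shown; existing fields are omitted.
type SelectedNode struct {
	// ... existing fields (ID, address, last_net, ...) ...

	Online    bool // whether the node currently counts as online (definition depends on the caller)
	Exiting   bool // whether the node is in the process of graceful exit
	Suspended bool // whether the node is suspended
}
```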
If people would rather I use a separate type from SelectedNode, I can do
that instead.
Change-Id: I7804a0e0a15cfe34c8ff47a227175ea5862a4ebc
Current node selection logic (in case of using SelectBySubnet):
1. selects one subnet randomly
2. selects one node randomly from the subnet
3. applies the placement NodeFilters to the node and ignores it if it doesn't match
This logic is wrong:
1. Imagine that we have a subnet with two DE nodes and one GB node.
2. We would like to select DE nodes.
3. If the GB node is selected (randomly) in step 2, step 3 will ignore the subnet, even if there are good (DE) nodes in there.
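One way to avoid throwing away the whole subnet (a hedged sketch of the idea, not necessarily the exact fix in this patch) is to apply the placement filters to the subnet's nodes first, and only then pick randomly among the matching ones:
```
import "math/rand"

// pickFromSubnet filters the subnet's nodes first and then selects randomly among the
// matches, so a single non-matching node no longer disqualifies the whole subnet.
// (Types and names are illustrative.)
func pickFromSubnet(r *rand.Rand, nodes []*SelectedNode, match func(*SelectedNode) bool) *SelectedNode {
	matching := make([]*SelectedNode, 0, len(nodes))
	for _, node := range nodes {
		if match(node) {
			matching = append(matching, node)
		}
	}
	if len(matching) == 0 {
		return nil // no suitable node in this subnet
	}
	return matching[r.Intn(len(matching))]
}
```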
Change-Id: I7673f52c89b46e0cc7b20a9b74137dc689d6c17e
We don't need it any more, as CountryCode uses location.Set, which supports exclusion (`Without`).
Change-Id: Ie311ae19fefa0bc9a0161496af1233ef4a6607df
stats/size/count is not used by any production code, and it's not required, as we can assert the state with other checks.
Real motivation: the next commits will make the Selector of the State configurable, therefore we won't have one single Stat; it depends on the request parameters.
(we plan to support both network and id based randomization)
Change-Id: I631828fc0046d2fef5b7a674fc0268a0446e9655
* Handle failed country code conversion.
* Avoid potential issues with a data race due to a shared slice (see the sketch below).
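The usual fix for the shared-slice issue is to hand out a copy instead of the cached slice itself (a generic sketch, not the exact code of this change):
```
// copyNodes returns a fresh slice, so callers can't race on the cached backing array
// while the cache keeps mutating or reusing it.
func copyNodes(cached []*SelectedNode) []*SelectedNode {
	nodes := make([]*SelectedNode, len(cached))
	copy(nodes, cached)
	return nodes
}
```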
Updates #6028
Change-Id: If7beef2619abd084e1f4109de2d323f834a6090a
All the files in uploadselection are (in fact) related to generic node selection, and are used not only for upload,
but also for download, repair, etc.
Change-Id: Ie4098318a6f8f0bbf672d432761e87047d3762ab
We use two different Node types in `overlay` and `uploadnodeselection` and convert back and forth.
Using the same object would allow us to use a unified node selection interface everywhere.
Change-Id: Ie71e29d60184ee0e5b4547eb54325f09c418f73c
We were using the UploadSelectionCache previously, which does _not_ have
all nodes, or even all online nodes, in it. So all nodes with less than
MinimumVersion, or with less than MinimumDiskSpace, or nodes suspended
for unknown audit errors, or nodes that have started graceful exit, were
all missing, and ended up having empty last_nets. Even with all that,
I'm kind of surprised how many nodes this involved, but using the upload
selection cache was definitely wrong.
This change uses the download selection cache instead, which excludes
nodes only when they are disqualified, gracefully exited (completely),
or offline.
Change-Id: Iaa07c988aa29c1eb05796ac48a6f19d69f5826c1
The query for GetNodesNetworkInOrder is causing far too much load on the
database. Since it is not critical that the repair checker have
perfectly up-to-date node network information, we can use a cache
instead.
Change-Id: I07ad45bfdeb46529da093941a06c2da8a00ce878
Up to now, we have been implementing the DistinctIP preference with code
in two places:
1. On check-in, the last_net is determined by taking the /24 or /64
(in ResolveIPAndNetwork()) and we store it with the node record.
2. On node selection, a preference parameter defines whether to return
results that are distinct on last_net.
It can be observed that we have never yet had the need to switch from
DistinctIP to !DistinctIP, or from !DistinctIP to DistinctIP, on the
same satellite, and we will probably never need to do so in an automated
way. It can also be observed that this arrangement makes tests more
complicated, because we often have to arrange for test nodes to have IP
addresses in different /24 networks (a particular pain on macOS).
Those two considerations, plus some pending work on the repair framework
that will make repair take last_net into consideration, motivate this
change.
With this change, in the #2 place, we will _always_ return results that
are distinct on last_net. We implement the DistinctIP preference, then,
by making the #1 place (ResolveIPAndNetwork()) more flexible. When
DistinctIP is enabled, last_net will be calculated as it was before. But
when DistinctIP is _off_, last_net can be the same as address (IP and
port). That will effectively implement !DistinctIP because every
record will have a distinct last_net already.
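A hedged sketch of the idea behind the more flexible last_net calculation (the function name and exact masking details are illustrative, not the literal ResolveIPAndNetwork() implementation):
```
import "net"

// lastNetFor derives the stored last_net value for a node.
func lastNetFor(ip net.IP, port string, distinctIP bool) string {
	if !distinctIP {
		// every record gets a unique last_net (IP and port), so selecting
		// distinct last_net values effectively disables the subnet restriction
		return net.JoinHostPort(ip.String(), port)
	}
	if v4 := ip.To4(); v4 != nil {
		return v4.Mask(net.CIDRMask(24, 32)).String() // /24 for IPv4, as before
	}
	return ip.Mask(net.CIDRMask(64, 128)).String() // /64 for IPv6, as before
}
```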
As a side effect, this flexibility will allow us to change the rules
about last_net construction arbitrarily. We can do tests where last_net
is set to the source IP, or to a /30 prefix, or a /16 prefix, etc., and
be able to exercise the production logic without requiring a virtual
network bridge.
This change should be safe to make without any migration code, because
all known production satellite deployments use DistinctIP, and the
associated last_net values will not change for them. They will only
change for satellites with !DistinctIP, which are mostly test
deployments that can be recreated trivially. For those satellites which
are both permanent and !DistinctIP, node selection will suddenly start
acting as though DistinctIP is enabled, until the operator runs a single
SQL update "UPDATE nodes SET last_net = last_ip_port". That can be done
either before or after deploying software with this change.
I also assert that this will not hurt performance for production
deployments. It's true that adding the distinct requirement to node
selection makes things a little slower, but the distinct requirement is
already present for all production deployments, and they will see no
change.
Refs: https://github.com/storj/storj/issues/5391
Change-Id: I0e7e92498c3da768df5b4d5fb213dcd2d4862924
For nodes in excluded areas, we don't necessarily want to remove them
from the pointer, but we do want to increase the number of pieces in the
segment in case those excluded area nodes go down. To do that, we
increase the number of pieces repaired by the number of pieces in
excluded areas.
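In other words, the repair target is bumped by the count of pieces held in excluded countries (an illustrative formula; the variable names are assumptions, not the checker's actual code):
```
// piecesToRepair: repair enough extra pieces that the segment stays healthy
// even if every piece stored in an excluded country is lost.
func piecesToRepair(target, healthy, healthyInExcludedCountries int) int {
	return target - healthy + healthyInExcludedCountries
}
```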
Change-Id: I0424f1bcd7e93f33eb3eeeec79dbada3b3ea1f3a
Create global config to specify a list of country codes that should be
excluded from node selection during uploads.
This exclusion is not implemented when the upload selection cache is
disabled.
Change-Id: Ic41e8b4f18857a11045668eac23107da99668a72
This commit doesn't change any behavior, just organizes the code in a
different way to make it easier to implement different Criteria
for including nodes. Today we use NodeID and Subnet based selection,
but later the Criteria can be extended with different kinds of
placement rules (like geofencing).
The changed nodeselection code is used by segment allocation (upload) and repair,
and excludes nodes from an in-memory selection.
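A rough illustration of the direction (the interface name and method signature are hypothetical, not necessarily what this change introduces):
```
import "storj.io/common/storj"

// Criteria is a hypothetical shape for a pluggable exclusion rule.
type Criteria interface {
	// Excluded reports whether the node should be left out of the in-memory selection.
	Excluded(id storj.NodeID) bool
}
```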
Resolves https://github.com/storj/storj/issues/4240
Change-Id: I0c1955fe16a045e3b76d7e50b2e1f4575a7ff095
Currently the nodeselection package only contains state for uploads; move these to a subpackage,
such that we can make another "downloadselection" for downloads. Then move the selection logic
from overlay to nodeselection.
Change-Id: I0fc42bcae3a29db2728dae9f3863b1e95bf5165b
* Deduplicate NodeID list prior to fetching IPs.
* Use NodeSelectionCache for fetching reliable IPs.
* Return the number of segments, reliable pieces and all pieces.
Change-Id: I13e679caab275488b4037624b840a4068dad9589
Currently the node selection cache is biased towards the same subnet. This
implements static node selection for the distinct case such that it selects
subnets with equal probability rather than node IDs.
This is mostly a copy-paste + modifications of the previous node selection
state.
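A minimal sketch of the subnet-first idea (illustrative only, not the actual cache code): pick a subnet uniformly at random first, then a node inside it, so every subnet gets equal probability regardless of how many node IDs it contains.
```
import "math/rand"

// selectBySubnet chooses a subnet uniformly at random, then one of its nodes.
func selectBySubnet[T any](r *rand.Rand, nodesBySubnet map[string][]T) T {
	subnets := make([]string, 0, len(nodesBySubnet))
	for subnet := range nodesBySubnet {
		subnets = append(subnets, subnet)
	}
	nodes := nodesBySubnet[subnets[r.Intn(len(subnets))]]
	return nodes[r.Intn(len(nodes))]
}
```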
Change-Id: Ia5c0aaf68e7feca78fbbd7352ad369fcb77c3a05