storj

Author	SHA1	Message	Date
Michal Niewrzal	573ce712f2	satellite/bloomfilter: don't create BF for disqualified nodes Currently we have a large set of nodes that are already disqualified and we are not sending bloom filters to them. The issue is that we are still generating filters for them during the garbage collection process, even if we have only a segment with one piece which was stored on this node. This consumes additional memory and processing power. This change modifies the logic behind `AllPieceCounts` (renamed to ActiveNodesPieceCounts) to return piece counts for all nodes except disqualified ones (even with piece count = 0). With this change we can modify the GC observer to skip nodes that were not returned by ActiveNodesPieceCounts. Change-Id: Ic75159135abe535084d8aeee560bb801a4a03e17	2023-11-22 07:07:51 +00:00
Ethan Adams	e65345336f	satellite/placement: Add test for new "datacenter" placement tag Change-Id: Ic9a79bb46256fbe8b1159de2bbf787b4a63c5c8f	2023-10-27 18:15:32 +00:00
Márton Elek	4bf5d75a8a	satellite/overlay: dedicated error type for placement parsing Change-Id: Ifd78341bf4760db784b20bf80cf22018088873c7	2023-10-24 11:41:19 +00:00
Clement Sam	ab57cd30b0	satellite: remove all code references to nodes.type column Once this is merged and deployed, we can go ahead and change the db schema. Note that this will stop updating the `type` column in the nodes table but it shouldn't be a problem because this column has a default value set. Updates https://github.com/storj/storj/issues/5426 Change-Id: I2470ebacbcb0e60cf894617eb69b593227357283	2023-10-19 19:00:33 +00:00
Márton Elek	a63a69dfd9	satellite/nodeselection: support OR in placement definition Change-Id: Icc7fd465b28c0c6f09f50c4ab8bffbcc77631dbd	2023-10-19 15:21:15 +00:00
Clement Sam	bce022ea7a	satellite/overlay: remove Type field from NodeDossier The overlay.NodeDossier struct only tracks information about a storagenode, the field is deprecated and no longer needed. This is a kademlia left-over. Updates https://github.com/storj/storj/issues/5426 Change-Id: Ie278ffd88d1b9a9fde6c81eb5f0e287bab8c9ef0	2023-10-18 18:21:26 +00:00
Márton Elek	c3fbac2e7a	satellite/overlay/placement: better error message for placement parsing Given the placement.txt with the content: ``` 9:exclude(placement(10)) 10:country("DE") ``` Without patch: ``` placement-test --placement=/tmp/placement.txt countrycode=DE Error: type mismatch: reflect: Call using zero Value argument ``` With the patch: ``` placement-test --placement=/tmp/placement.txt countrycode=DE Error: Error in line 'exclude(placement(10))' when placement rule is parsed: Placement 10 is referenced before defined. Please define it first! ``` Change-Id: I9ad81016d4a57fdb32f3ff9031b5591f9a7cd2a6	2023-10-16 09:28:45 +00:00
Márton Elek	b28439be24	cmd/tools/placement-test: cli to test placement configuration Change-Id: I7308fbf8fcd740fc136e87d9c2c08eaeb461a106	2023-09-28 10:01:53 +00:00
Michal Niewrzal	e1215d5da8	satellite/overlay: add AOST to GetParticipatingNodes method This method sometimes ends with a transaction error, most probably because it's trying to do a full table scan on the nodes table, which is heavily used. Adding AOST should help with DB contention. Change-Id: Ibd4358d28dc26922b60c6b30862f20e7c0662cd1	2023-09-27 12:00:10 +00:00
Márton Elek	98921f9faa	satellite/overlay: fix placement selection config parsing When we do `satellite run api --placement '...'`, the placement rules are not parsed well. The problem is based on `viper.AllSettings()`, and the main logic is something like this (from a new unit test): ``` r := ConfigurablePlacementRule{} err := r.Set(p) require.NoError(t, err) serialized := r.String() r2 := ConfigurablePlacementRule{} err = r2.Set(serialized) require.NoError(t, err) require.Equal(t, p, r2.String()) ``` `AllSettings` evaluates the placement rules in `ConfigurablePlacementRules` and stores the string representation. The problem is that we don't have a proper `String()` implementation (it prints out the structs instead of the original definition). There are two main solutions for this problem: 1. We can fix the `String()`. When we parse a placement rule, the `String()` method should print out the original definition. 2. We can switch to using a pure string as the configuration parameter, and parse the rules only when required. I feel that 1 is error prone; we can do it (and in this patch I added a lot of `String()` implementations), but it's hard to be sure that our `String()` logic is in line with the parsing logic. Therefore I decided to make the configuration value of the placements a string (or a wrapper around string). That's the main reason why this patch seems to be big, as I updated all the usages. But the main part is in the beginning of `placement.go` (configuration parsing is not a pflag.Value implementation any more, but a separate step) and in `filter.go` (a few more String implementations for filters). https://github.com/storj/storj/issues/6248 Change-Id: I47c762d3514342b76a2e85683b1c891502a0756a	2023-09-21 14:31:41 +00:00
Márton Elek	f4fe983b1e	satellite/{placement,nodeselection}: introduce empty() and notEmpty() for tag value selection It helps to implement rules like `tag("nodeid","select",notEmpty())` Change-Id: If7a4532eacc0e4e670ffe81d504aab9d5b34302f	2023-09-14 19:30:29 +00:00
Márton Elek	f40baf8629	go.mod: bump dependencies (private,uplink,common) Change-Id: I6c55735b45cadaf36697eff53e78b5b09afe9dea	2023-09-06 13:28:22 +02:00
Márton Elek	e2006d821c	satellite/overlay: change Reliable and KnownReliable as GetParticipatingNodes and GetNodes, respectively. We now want these functions to include offline and suspended nodes as well, so that we can force immediate repair when pieces are out of placement or in excluded countries. With that change, the old names no longer made sense. Change-Id: Icbcbad43dbde0ca8cbc80a4d17a896bb89b078b7	2023-09-02 23:34:50 +00:00
Márton Elek	c202929413	satellite/nodeselection: rename (NodeFilter).MatchInclude to Match As I learned, the `Include` was supposed to communicate that some internal change is also "included" in the filters during the check -> filters might be stateful. But that's not the case any more after `552242387`, where we removed the only stateful filter. Change-Id: I7c36ddadb2defbfa3b6b67bcc115e4427ba9e083	2023-08-31 16:17:52 +02:00
Márton Elek	ca0ea50cba	satellite/overlay: remove/deprecate NodeSelectionCache.Disabled Once upon a time, at the dawn of the implementation of Storj, all the nodes were read from the database directly, every time. After a while -- due to performance reasons -- it was changed for upload and download: all the nodes are read for a short period of time, and used from memory. This is the version which was improved recently to support advanced node selections using placement. But we still have an old configuration value `service.config.NodeSelectionCache.Disabled`, and the db based implementation: `service.FindStorageNodesWithPreferences(ctx, req, &service.config.Node)`. For safety, we need to remove this option, to make sure that we use the cache, which has the advanced features. This patch was supposed to be a very small one (just removing a method and a config: https://review.dev.storj.io/c/storj/storj/+/11074/1/satellite/overlay/service.go), but it turned out that we need to update a lot of unit tests. These unit tests used the old implementation (which is not used in production any more). The tests which used both implementations are just updated to use only the new one. The tests which used only the old implementation are refactored (but keeping the test cases), using real unit tests (without DB, working on OSX, fast). Closes https://github.com/storj/storj/issues/6217 Change-Id: I023f92c7e34235665cf8474513e67b2fcc4763eb	2023-08-31 09:46:29 +00:00
Artur M. Wolff	37d6df23fa	satellite: implement metainfo.GetBucketLocation endpoint Updates storj/storj-private#408 Updates storj/storj-private#409 Change-Id: Idaaca74b4a5c9c7907d095e0a3a5f29e52843ce6	2023-08-28 13:48:07 +02:00
Márton Elek	5c12a3406d	satellite/nodeselection: improve annotation composability We would like to make it easier to accept multiple annotations. Examples: ``` country("GB") && annotation(...) annotated(annotated(X,...),...) ``` Change-Id: I92e622e8b985b314dadddf83b17976c245eb2069	2023-08-28 09:27:04 +00:00
Márton Elek	84ea80c1fd	satellite/repair/checker: respect autoExcludeSubnet annotation in checker rangedloop This patch is a one-liner: the rangedloop checker should check the subnets only if it's not turned off with a placement annotation (see satellite/repair/checker/observer.go). But I didn't find any unit test to cover that part, so I had to write one, and I preferred to write it as a unit test, not an integration test, which requires a mock repair queue (observer_unit_test.go mock.go). Because it's a small change, I also included another small change: creating a helper method to check if the AutoExcludeSubnet annotation is defined. Change-Id: I2666b937074ab57f603b356408ef108cd55bd6fd	2023-08-23 13:45:09 +00:00
Márton Elek	4ccce11893	satellite/overlay: improve realistic placement rule test 10 --> node tag inclusion in raw format 11 --> same, but using same subnet is enabled 12 --> same as 11 but with US restrictions Change-Id: I20792689e0caf5fe190f566a770d70c3b3824793	2023-08-23 13:56:35 +02:00
Márton Elek	5522423871	satellite/nodeselection: remove AutoExcludeSubnet filter It's stateful, therefore it can hit naive users (NodeFilters couldn't be reused for more than one iteration). But it looks like we don't need it, as `SelectBySubnet` does the same job. Change-Id: Ie85b7f9c2bd9a47293f4e3b359f8b619215c7649	2023-08-18 08:31:00 +00:00
Márton Elek	b218002752	satellite/overlay/placement: improve placement configurability with &&, placement, region and ! support in country This patch makes it easier to configure existing placement rules only with string. 1. placement(n) rule can be used to reuse earlier definitions 2. && can be used in addition to all(n1,n2) 3. country(c) accepts exclusions (like '!RU'), regions ('EU','EEA'), all and none See the 'full example' unit test, which uses all of these, in a realistic example. https://github.com/storj/storj/issues/6126 Change-Id: Ica76f016ebd002eb7ea8103d4258bacd6a6d77bf	2023-08-17 16:12:53 +00:00
Márton Elek	da08117fcd	satellite/~placement: do not ignore placement check for placement=0 There are cases when we would like to override the default placement=0 rule. For example when we would like to exclude tagged nodes from the selection (by default). Therefore we couldn't use a shortcut any more, we should always check the placement rules, even if we use placement=0. TODO: we need to update common, and rename `EveryCountry` to `DefaultPlacement`, just to avoid confusion. https://github.com/storj/storj/issues/6126 Change-Id: Iba6c655bd623e04351ea7ff91fd741785dc193e4	2023-08-16 07:06:56 +00:00
Márton Elek	c08792f066	satellite/overlay: implement an exclude filter for placement configuration https://github.com/storj/storj/issues/6126 Change-Id: I05215b5d46bec958001cc020edf1fa97b00d3299	2023-08-15 17:29:29 +00:00
Márton Elek	0e17b1018c	satellite/{nodeselection,overlay}: support annotations on node filters Change-Id: I844d8a25042750aae189175842113e2f052d5b17	2023-08-15 16:49:57 +00:00
Márton Elek	0b02a48a10	satellite/nodeselection: SelectBySubnet should use placement filters for all nodes Current node selection logic (in case of using SelectBySubnet): 1. selects one subnet randomly 2. selects one node randomly from the subnet 3. applies the placement NodeFilters to the node and ignores it if it doesn't match This logic is wrong: 1. Imagine that we have a subnet with two DE and one GB nodes. 2. We would like to select DE nodes. 3. If the GB node is selected (randomly) in step 2, step 3 will ignore the subnet, even if there are good (DE) nodes in it. Change-Id: I7673f52c89b46e0cc7b20a9b74137dc689d6c17e	2023-08-04 10:48:15 +02:00
Márton Elek	f7b39aaed4	satellite/nodeselection: remove stats/size from nodeselection state stats/size/count is not used by any production code, and it's not required, as we can assert the state with other checks. real motivation: next commits will make the Selector of the State configurable, therefore we won't have one single Stat, it depends on the request parameters. (we plan to support both network and id based randomization) Change-Id: I631828fc0046d2fef5b7a674fc0268a0446e9655	2023-08-01 18:29:41 +00:00
Márton Elek	6f002f4220	satellite/overlay: NR placement should exclude nodes without geofencing information https://github.com/storj/storj-private/issues/378 Change-Id: If2af02083496e5a8eefe27beabb406388ee50644	2023-07-31 09:55:54 +02:00
Egon Elbre	465941b345	satellite/{nodeselection,overlay}: use location.Set location.Set is faster for comparisons. Updates #6028 Change-Id: I764eb5cafc507f908e4168b16a7994cc7721ce4d	2023-07-11 17:16:30 +00:00
Egon Elbre	9370bc4580	satellite/{nodeselection,overlay}: bump common and fix some potential issues * Handle failed country code conversion. * Avoid potential issues with a data-race due to shared slice. Updates #6028 Change-Id: If7beef2619abd084e1f4109de2d323f834a6090a	2023-07-11 11:13:41 +00:00
Michal Niewrzal	1d62dc63f5	satellite/repair/repairer: fix NumHealthyInExcludedCountries calculation Currently, we have an issue where, while counting unhealthy pieces, we count twice a piece which is in an excluded country and is outside the segment placement. This can cause unnecessary repair. This change also takes another step to move RepairExcludedCountryCodes from the overlay config into the repair package. Change-Id: I3692f6e0ddb9982af925db42be23d644aec1963f	2023-07-10 12:01:19 +02:00
Márton Elek	97a89c3476	satellite: switch to use nodefilters instead of old placement.AllowedCountry placement.AllowedCountry is the old way to specify placement; with the new approach we can use a more generic (dynamic) method, which can check full node information instead of just the country code. 90% of this patch is just search and replace: * we need to use NodeFilters instead of placement.AllowedCountry * which means, we need an initialized PlacementRules available everywhere * which means we need to configure the placement rules The remaining 10% is the placement.go, where we introduced a new type of configuration (lightweight expression language) to define any kind of placement without code change. Change-Id: Ie644b0b1840871b0e6bbcf80c6b50a947503d7df	2023-07-07 16:55:45 +00:00
Márton Elek	70cdca5d3c	satellite: move satellite/nodeselection/uploadselection => satellite/nodeselection All the files in uploadselection are (in fact) related to generic node selection, and used not only for upload, but for download, repair, etc... Change-Id: Ie4098318a6f8f0bbf672d432761e87047d3762ab	2023-07-07 10:32:03 +02:00
Márton Elek	ddf1f1c340	satellite/{nodeselection,overlay}: NodeFilters for dynamic placement implementations Change-Id: Ica3a7b535fa6736cd8fb12066e615b70e1fa65d6	2023-07-06 12:08:01 +00:00
Michal Niewrzal	21c1e66a85	satellite/overlay: refactor ReliabilityCache to keep more data ReliabilityCache will now use the refactored overlay Reliable method. This method will provide more info about nodes (e.g. country code) and with this we are able to add two dedicated methods to classify pieces: * OutOfPlacementPieces * PiecesNodesLastNetsInOrder With those new methods we will fix the issue where an offline but reliable node won't be checked for clumped pieces and off placement pieces. https://github.com/storj/storj/issues/5998 Change-Id: I9ffbed9f07f4881c9db3bd0e5f0412f1a418dd82	2023-07-05 11:19:10 +02:00
Michal Niewrzal	f2cd7b0928	satellite/overlay: refactor Reliable to be used with repair checker Currently we are using Reliable to get missing pieces for the repair checker. The issue is that the checker now looks at more things than just missing pieces (clumped/off placement pieces) and using only the node ID is not enough. We have an issue where we are skipping offline nodes from the clumped and off placement pieces checks. Reliable was refactored to get data (e.g. country, lastNet) about all reliable nodes. The list is split into online and offline. This data will be cached for quick use by the repair checker. It will also be possible to check node metadata like country code or lastNet. We are also slowly moving the `RepairExcludedCountryCodes` config from overlay to repair, which makes more sense for it. This is the first part of the changes. https://github.com/storj/storj/issues/5998 Change-Id: If534342488c0e440affc2894a8fbda6507b8959d	2023-07-05 10:56:31 +02:00
Márton Elek	500b6244f8	satellite/satellitedb: create table for node tags Change-Id: I884bb740974e6b8241aa6b85faf266b85fe892d4	2023-07-05 09:38:53 +02:00
Márton Elek	d38b8fa2c4	satellite/nodeselection: use the same Node object from overlay and nodeselection We use two different Node types in `overlay` and `uploadnodeselection` and converting back and forth. Using the same object would allow us to use a unified node selection interface everywhere. Change-Id: Ie71e29d60184ee0e5b4547eb54325f09c418f73c	2023-07-03 16:59:33 +00:00
Michal Niewrzal	98f4f249b2	satellite/overlay: refactor KnownReliable to be used with repairer Currently we are using KnownUnreliableOrOffline to get missing pieces for the segment repairer (GetMissingPieces). The issue is that the repairer now looks at more things than just missing pieces (clumped/off placement pieces). KnownReliable was refactored to get data (e.g. country, lastNet) about all reliable nodes from the provided list. The list is split into online and offline. This way we will be able to use results from this method for all checks: missing pieces, clumped pieces, out of placement pieces. This is the first part of the changes to handle different kinds of pieces in the segment repairer. https://github.com/storj/storj/issues/5998 Change-Id: I6cbaf59cff9d6c4346ace75bb814ccd985c0e43e	2023-06-27 13:27:23 +02:00
Michal Niewrzal	eb407b2ae3	satellite/overlay: delete unused KnownOffline method Change-Id: Ief9288fee83f9c381dd7840f48333babcd3d6bf7	2023-06-23 13:24:30 +00:00
Michal Niewrzal	9e3fd4d514	satellite/overlay: delete unused method Change-Id: I87828fcac4f4a9fb08c86af188aa6ea28c5c64af	2023-06-22 12:45:59 +00:00
Michal Niewrzal	f7c7851519	satellite/metainfo: filter metainfo.GetObjectIPs by bucket/object placement For now we will use bucket placement to determine if we should exclude some node IPs from metainfo.GetObjectIPs results. Bucket placement is retrieved directly from DB in parallel to metabase GetStreamPieceCountByNodeID request. GetObjectIPs is not heavily used so additional request to DB shouldn't be a problem for now. https://github.com/storj/storj/issues/5950 Change-Id: Idf58b1cfbcd1afff5f23868ba2f71ce239f42439	2023-06-07 16:52:02 +00:00
Michal Niewrzal	fe21fd42f7	satellite/overlay: add GetNodesOutOfPlacement method We would like to verify if nodes matches specific placement e.g. to validate segment pieces are correctly geofenced. https://github.com/storj/storj/issues/5896 Change-Id: I842767dccc121a3c60224f677ab55e5dc150c76e	2023-05-30 14:57:20 +02:00
Michal Niewrzal	c48bd81e5f	satellite/satellitedb: update SelectAllStorageNodes* to set country code Methods SelectAllStorageNodesUpload and SelectAllStorageNodesDownload are not returning full info with overlay.SelectedNode because it's missing CountryCode. Change-Id: Ie3cb396bf28d7ec4c6ab8927e5bb560236036aa6	2023-05-26 11:02:29 +00:00
paul cannon	c856d45cc0	satellite/overlay: fix GetNodesNetworkInOrder We were using the UploadSelectionCache previously, which does _not_ have all nodes, or even all online nodes, in it. So all nodes with less than MinimumVersion, or with less than MinimumDiskSpace, or nodes suspended for unknown audit errors, or nodes that have started graceful exit, were all missing, and ended up having empty last_nets. Even with all that, I'm kind of surprised how many nodes this involved, but using the upload selection cache was definitely wrong. This change uses the download selection cache instead, which excludes nodes only when they are disqualified, gracefully exited (completely), or offline. Change-Id: Iaa07c988aa29c1eb05796ac48a6f19d69f5826c1	2023-05-19 08:08:08 +00:00
paul cannon	958d8676d0	satellite/overlay: remove unnecessary test helper Change-Id: I8439eec4ed440f60353fc620ca906a917a03613c	2023-05-17 17:04:54 +00:00
paul cannon	75d10fe4fa	satellite/overlay: use UploadSelectionCache for GetNodesNetworkInOrder The query for GetNodesNetworkInOrder is causing far too much load on the database. Since it is not critical that the repair checker have perfectly up-to-date node network information, we can use a cache instead. Change-Id: I07ad45bfdeb46529da093941a06c2da8a00ce878	2023-05-16 17:32:09 +00:00
Michal Niewrzal	36e046375c	satellite/repair/checker: remove segments loop parts We are switching completely to ranged loop. https://github.com/storj/storj/issues/5368 Change-Id: I8583549973cd36aa0e0c482c20d7a75cb7568ab3	2023-05-08 12:19:13 +00:00
Michal Niewrzal	1aa24b9f0d	satellite/audit: remove segments loop parts We are switching completely to ranged loop. https://github.com/storj/storj/issues/5368 Change-Id: I9cec0ac454f40f19d52c078a8b1870c4d192bd7a	2023-04-24 15:52:11 +00:00
Egon Elbre	f40a0cb7ba	satellite/*: use typed lrucache and ReadCache Change-Id: Ieee535dd8735a95dd196a77413e4a25a6a72342c	2023-04-21 10:49:08 +00:00
paul cannon	915f3952af	satellite/repair: repair pieces on the same last_net We avoid putting more than one piece of a segment on the same /24 network (or /64 for ipv6). However, it is possible for multiple pieces of the same segment to move to the same network over time. Nodes can change addresses, or segments could be uploaded with dev settings, etc. We will call such pieces "clumped", as they are clumped into the same net, and are much more likely to be lost or preserved together. This change teaches the repair checker to recognize segments which have clumped pieces, and put them in the repair queue. It also teaches the repair worker to repair such segments (treating clumped pieces as "retrievable but unhealthy"; i.e., they will be replaced on new nodes if possible). Refs: https://github.com/storj/storj/issues/5391 Change-Id: Iaa9e339fee8f80f4ad39895438e9f18606338908	2023-04-06 17:34:25 +00:00

273 Commits