storj

Author	SHA1	Message	Date
Michal Niewrzal	0a3ee6ff8a	satellite/metabase: remove old object segments on overwrite While adding support for pending_objects table one case was missed. When we are uploading object to location where old objects exists we are not removing old object segments at all. Old object is overwritten with new object metadata but segments remains without corresponding object. This fix removes all existing committed objects (with it's segments) before committing new object. https://github.com/storj/storj/issues/6255 Change-Id: Id657840edf763fd6aec8191788d819191b074fb7	2023-09-11 14:16:47 +00:00
Michal Niewrzal	df037564d7	satellite/zombiedeletion: remove inactive uploads from pending_objects With zombie deletion chore we are removing inactive pending objects from objects table but new we need also to do this for pending_objects table. https://github.com/storj/storj/issues/6050 Change-Id: Ia29116c103673a1d9e10c2f16654022572210a8a	2023-09-07 18:47:29 +00:00
Michal Niewrzal	005fb19a7b	satellite/metabase: adjust BucketEmpty to use pending_objects table Extends metabase.BucketEmpty logic to check also pending_objects table for any entry. https://github.com/storj/storj/issues/6057 Change-Id: Ia26c272de24a983b308a0b692e6bd5800487eb98	2023-08-21 15:01:59 +00:00
Michal Niewrzal	16588033fd	satellite/metabase: delete bucket deletes also from pending_objects While deleting bucket we need also to delete pending objects from pending_objects table. Part of https://github.com/storj/storj/issues/6048 Change-Id: Icc83eaecf8388704e0b6329c397e8028debcf672	2023-08-21 14:05:13 +00:00
Michal Niewrzal	ae2cba1d23	satellite/metabase: add IteratePendingObjectsByKeyNew method New metabase method IteratePendingObjectsByKeyNew to iterate over entries in pending_objects table with the same object key. Implementation and tests are mostly copy of code for IteratePendingObjectsByKey. Main difference is that pending_objects table have StreamID column part of primary key instead Version. Method will be used to support new table in metainfo.ListPendingObjectStreams request. After full transition to pending_objects table we should remove 'New' suffix from methods names. Part of https://github.com/storj/storj/issues/6047 Change-Id: Ifc1ecbc534f8510fbd70c4ec676cf2bf8abb94cb	2023-08-21 08:08:03 +00:00
Michal Niewrzal	929dc80091	satellite/metabase: add IteratePendingObjects method New metabase method IteratePendingObjects to iterate over entries in pending_objects table. Implementation and tests are mostly copy of code that we are using to iterate over objects table. Main difference is that pending_objects table have StreamID column part of primary key instead Version. Also structure of pending object is smaller than the one from object table but it's a detail. Method will be used to support new table in metainfo.ListObjects request. Next step will be to port rest of iterator implementation to support pending_objects table in metainfo.ListPendingObjectStreams. Part of https://github.com/storj/storj/issues/6047 Change-Id: Ia578182f88840539f3668d4a242953e061eace02	2023-08-21 08:07:16 +00:00
Michal Niewrzal	5a8ef89824	satellite/{metainfo,metabase}: delete from pending_objects table We are deleting pending objects while aborting multipart upload. We are using metainfo BeginDeleteObject to do that. This change starts using DeletePendingObjectNew to delete entry from pending_objects table when request indicates that object is in this table. Part of https://github.com/storj/storj/issues/6048 Change-Id: I4478a9c13c8e3db48dc5de3087ef03d1b4c47a5c	2023-08-21 08:06:23 +00:00
Egon Elbre	dc41978743	all: fix golangci failures Change-Id: I07421388d53c837e35a4727cead26fc21c324d04	2023-08-09 11:44:44 +03:00
Michal Niewrzal	887209bc24	satellite/metabase: fix CommitObject query for postgres Query was not working for postgres because types were not correctly set. Change-Id: Ic898aa37b71a4754e4ebe82085f3563fc954616a	2023-08-08 15:31:44 +00:00
Michal Niewrzal	7be844351d	satellite/metainfo: remove ServerSideCopyDuplicateMetadata https://github.com/storj/storj/issues/5891 Change-Id: Ib5440169107acca6e832c2280e1ad12dfd380f28	2023-08-08 12:15:10 +00:00
Michal Niewrzal	9550b5f4a5	satellite/metabase: drop object deletion code for copy with references With this change we are removing code responsible for deleting objects and supporting server side copies created with references. In practice we are restoring delete queries that we had before server side copy implementation (with small exception, see bellow). From deletion queries we are also removing parts with segment metadata as result because we are not longer sending explicit delete requests to storage nodes. https://github.com/storj/storj/issues/5891 Change-Id: Iee4e7a9688cff27a60fb95d60dd233d996f41c85	2023-08-08 11:31:12 +00:00
Michal Niewrzal	03f8ad323d	satellite/metabase: remove segment_copies support from ListSegments We don't need to support segment copies with references anymore. We migrated to copies where all metadata are copied from original segment to copy. https://github.com/storj/storj/issues/5891 Change-Id: Ic91dc21b0386ddf5c51aea45530024cd463e8ba9	2023-08-08 11:21:08 +00:00
Michal Niewrzal	a5cbec7b3b	satellite/metabase: drop bucket deletion code for copy with references With this change we are removing code responsible to handle deleting bucket and supporting server side copies created with references. In practice we are restoring delete queries that we had before server side copy implementation (with small exception, see bellow). From deletion queries we are also removing parts with segment metadata as result because we are not longer sending explicit delete requests to storage nodes. https://github.com/storj/storj/issues/5891 Change-Id: If866d9f3a1b01e9ebd9b49c4740a6425ba06dd43	2023-08-08 09:07:44 +00:00
Michal Niewrzal	03c52f184e	satellite/metabase: adjust CommitObject to use pending_objects table Change is adjusting CommitObject to use `pending_objects` table to commit object. Satellite stream id is used to determine if we need to use `pending_objects` or `objects` table during commit. General goal is to support both tables until `objects` table will be free from pending objects. Part of https://github.com/storj/storj/issues/6046 Change-Id: I2ebe0cd6b446727c98c8e210d4d00504dd0dacb6	2023-08-03 10:21:22 +00:00
Michal Niewrzal	f40805763e	satellite/metabase: adjust segment commit to use pending_objects table Change is adjusting CommitSegment to check pending object existence in `pending_objects` or `objects` table. Satellite stream id is used to determine if we need to use `pending_objects` or `objects` table. General goal is to support both tables until `objects` table will be free from pending objects. Whenever it will be needed code will be supporting both tables at once. Part of https://github.com/storj/storj/issues/6046 Change-Id: I954444a53b4733ae6fc909420573242b02746787	2023-08-02 16:56:25 +00:00
Michal Niewrzal	7b2006a883	satellite/metabase: adjust BeginSegment to use pending_objects table Change is adjusting BeginSegment to check pending object existence in `pending_objects` or `objects` table. Satellite stream id is used to determine if we need to use `pending_objects` or `objects` table. General goal is to support both tables until `objects` table will be free from pending objects. Whenever it will be needed code will be supporting both tables at once. Part of https://github.com/storj/storj/issues/6046 Change-Id: I08aaa605c23d82695fde352fdbd0a7fd11f46bb5	2023-08-02 15:30:23 +00:00
Michal Niewrzal	cebf255d64	satellite/metabase: adjust BeginObjectNextVersion to use pending_objects Change is adjusting BeginObjectNextVersion to create pending object in `pending_objects` or `objects` table depends on configuration. This is first change to move pending objects from objects table. General goal is to support both tables until `objects` table will be free from pending objects. Whenever it will be needed code will be supporting both tables at once. To be able to decide if we need to use `pending_objects` table or `objects` table we extend satellite stream id to keep that information for later use. BeginObjectExactVersion will be not adjusted because at the moment it's used only in tests. Part of https://github.com/storj/storj/issues/6046 Change-Id: Ibf21965f63cca5e1775469994a29f1fd1261af4e	2023-08-02 14:42:26 +00:00
Michal Niewrzal	4cc167a6bd	satellite/metabase: use better label for ignoring FTS queries For some test queries we are using workaround to filter them out from full table scan detection. To avoid confustion what is this all about we are changing label to be more descriptive. Change-Id: I41a744e8faf400e3e8de7e416d8f4242f9093fce	2023-07-26 20:01:38 +00:00
Michal Niewrzal	3d9c217627	satellite/metabase: add pending_objects table This change adds only schema definition of pending_objects table and small amount of supporting code which will be useful for testing later. With this table we would like to achieve two major things: * simplify `objects` table, before we will start working on object versioning * gain performance by removing need to filter `objects` results with `status` column, which is not indexed and we would like to avoid that https://github.com/storj/storj/issues/6045 Change-Id: I6097ce1c644a8a3dad13185915fe01989ad41d90	2023-07-26 20:00:58 +00:00
Michal Niewrzal	0303920da7	satellite/metainfo: remove unused method Change-Id: I08e307e6909cdc46951c5f3112d77a685e67fe2e	2023-07-18 08:45:29 +00:00
Michal Niewrzal	99128ab551	satellite/metabase: reuse Pieces while looping segments Segments loop implementation is using lots of memory to convert alias pieces to pieces for each segment while iteration. To improve situation this change is reusing Pieces between batch pages. This should signifcantly reduce memory usage for ranged loop executions. Change-Id: I469188779908facb19ad85c6bb7bc3657111cc9a	2023-07-12 09:29:34 +00:00
Michal Niewrzal	1d62dc63f5	satellite/repair/repairer: fix NumHealthyInExcludedCountries calculation Currently, we have issue were while counting unhealthy pieces we are counting twice piece which is in excluded country and is outside segment placement. This can cause unnecessary repair. This change is also doing another step to move RepairExcludedCountryCodes from overlay config into repair package. Change-Id: I3692f6e0ddb9982af925db42be23d644aec1963f	2023-07-10 12:01:19 +02:00
Márton Elek	97a89c3476	satellite: switch to use nodefilters instead of old placement.AllowedCountry placement.AllowedCountry is the old way to specify placement, with the new approach we can use a more generic (dynamic method), which can check full node information instead of just the country code. The 90% of this patch is just search and replace: * we need to use NodeFilters instead of placement.AllowedCountry * which means, we need an initialized PlacementRules available everywhere * which means we need to configure the placement rules The remaining 10% is the placement.go, where we introduced a new type of configuration (lightweight expression language) to define any kind of placement without code change. Change-Id: Ie644b0b1840871b0e6bbcf80c6b50a947503d7df	2023-07-07 16:55:45 +00:00
paul cannon	a4d68b9b7e	satellite/metabase: server-side copy copies metadata ..instead of using segment_copies and ancestor_stream_id, etc. This bypasses reference counting entirely, depending on our mark+sweep+ bloomfilter garbage collection strategy to get rid of pieces once they are no longer part of a segment. This is only safe to do after we have stopped passing delete requests on to storage nodes. Refs: https://github.com/storj/storj/issues/5889 Change-Id: I37bdcffaa752f84fd85045235d6875b3526b5ecc	2023-07-06 14:40:59 +00:00
Michal Niewrzal	f2cd7b0928	satellite/overlay: refactor Reliable to be used with repair checker Currently we are using Reliable to get missing pieces for repair checker. The issue is that now checker is looking at more things than just missing pieces (clumped/off, placement pieces) and using only node ID is not enough. We have issue where we are skipping offline nodes from clumped and off placement pieces check. Reliable was refactored to get data (e.g. country, lastNet) about all reliable nodes. List is split into online and offline. This data will be cached for quick use by repair checker. It will be also possible to check nodes metadata like country code or lastNet. We are also slowly moving `RepairExcludedCountryCodes` config from overlay to repair which makes more sens for it. This this first part of changes. https://github.com/storj/storj/issues/5998 Change-Id: If534342488c0e440affc2894a8fbda6507b8959d	2023-07-05 10:56:31 +02:00
Michal Niewrzal	96d3c41c14	satellite/metabase: convert bucket name to bytes for queries In case some invalid characters in bucket name we need to cast bucket name to byte array for query argument. This change is doing this for some missed cases. Change-Id: I47d0d8e3c85a69bdf63de1137adcd533dcfe50a8	2023-06-29 10:43:35 +00:00
Michal Niewrzal	e129841130	satellite/metabase: remove AOST from deleteInactiveObjectsAndSegments By mistake AOST was added to query in deleteInactiveObjectsAndSegments in DeleteZombieObjects. Delete statement is not supporting it. Unfortunately unit tests didn't cover this case. This change removes AOST from mentioned method and it adding AOST cases to unit tests. Change-Id: Ib7f65134290df08c490c96b7e367d12f497a3373	2023-06-28 13:24:14 +00:00
Michal Niewrzal	2b2bca8e81	satellite/accounting/tally: save tallies in a batches Because we are saving all tallies as a single SQL statement we finally reached maximum message size. With this change we will call SaveTallies multiple times in batches. https://github.com/storj/storj/issues/5977 Change-Id: I0c7dd27779b1743ede66448fb891e65c361aa3b0	2023-06-22 17:02:26 +00:00
Michal Niewrzal	ac1ff0e7e2	satellite/accounting/tally: handle well bucket names with escape char Some of tally queries are not passing bucket name as byte but as string. If bucket contains some special characters encoding to bytea can fail. This change makes sure all parts of tally passes bucket name correctly. Change-Id: I7330d546b44d86a2e4614c814580e9e5262370ed	2023-06-21 15:04:14 +00:00
Egon Elbre	edbea5efe1	go.mod: bump to pgx/v5 Change-Id: I31cf3bec1d7db94f0f612f6ed04b782f8b04d876	2023-06-14 18:32:54 +03:00
Michal Niewrzal	6c08d5024e	satellite/metainfo: stop sending delete requests to SN We decided that we will stop sending explicit delete requests to nodes and we will cleanup deleted with GC instead. https://github.com/storj/storj/issues/5888 Change-Id: I65a308cca6fb17e97e3ba85eb3212584c96a32cd	2023-06-14 10:46:02 +00:00
Michal Niewrzal	61933bc6f0	satellite/{metainfo,metabase}: optimize expired/zombie objects deletion * optimize SQL for zombie objects deletion query by reducing some direct selects to segments table * set AOST for expired/zombie object deletion (was 0 since now) https://github.com/storj/storj/issues/5881 Change-Id: I50482151d056a86fe0e31678a463f413d410759d	2023-06-09 11:22:46 +00:00
Michal Niewrzal	aea3baf6a9	satellite/accounting/tally: calculate pending object count Change-Id: I4ee6072f4c60fafd809e8184ada9c1abf7edd8aa	2023-06-09 10:15:27 +00:00
Jeff Wendling	32f683fe9d	satellite/orders: filter nodes based on segment placement this change adds code to CreateGetOrderLimits to filter out any nodes that are not in the placement specified by the segment. notably, it does not change the audit or repair order limits. the list segments code had to be changed to include getting the placement field from the database. Change-Id: Ice3e42a327811bb20928c619a72ed94e0c1464ac	2023-06-05 13:56:22 -04:00
Michal Niewrzal	9b3488276d	satellite/gracefulexit: use node alias instead id with observer Using node alias helps using less cpu and memory. Fixes https://github.com/storj/storj/issues/5654 Change-Id: If3a5c7810732cbb1bff4dcb78706c81aca56b71b	2023-05-18 22:37:46 +00:00
Michal Niewrzal	c0e7f463fe	satellite/metabase: remove segmentsloop package Last change to remove segments loop from codebase. https://github.com/storj/storj/issues/5237 Change-Id: I77b12911b6b4e390a7385e6e8057c7587e74b70a	2023-05-18 19:08:29 +00:00
Michal Niewrzal	4bdbb25d83	satellite/metabase/rangedloop: move Segment definition We will remove segments loop soon so we need first to move Segment definition to rangedloop package. https://github.com/storj/storj/issues/5237 Change-Id: Ibe6aad316ffb7073cc4de166f1f17b87aac07363	2023-05-16 12:37:17 +00:00
Michal Niewrzal	2592aaef9c	satellite/gc/bloomfilter: remove segments loop parts We are switching completely to ranged loop. https://github.com/storj/storj/issues/5368 Change-Id: I1a22ac4b242998e287b2b7d8167b64e850b61a0f	2023-05-15 11:46:26 +00:00
Michal Niewrzal	36e046375c	satellite/repair/checker: remove segments loop parts We are switching completely to ranged loop. https://github.com/storj/storj/issues/5368 Change-Id: I8583549973cd36aa0e0c482c20d7a75cb7568ab3	2023-05-08 12:19:13 +00:00
Jeremy Wharton	f61230a670	satellite/console/dbcleanup: create console DB cleanup chore A chore responsible for purging data from the console DB has been implemented. Currently, it removes old records for unverified user accounts. We plan to extend this functionality to include expired project member invitations in the future. Resolves #5790 References #5816 Change-Id: I1f3ef62fc96c10a42a383804b3b1d2846d7813f7	2023-05-05 19:11:53 +00:00
Egon Elbre	ee28103b29	satellite/metabase: fix error message The "object not found" included an additional prefix "metabase:", which broke uplink error message detection. Ideally, changing an internal error message shouldn't break the uplink. Change-Id: I5ce7789cc11742d3435af1ec555bc96927f1bedc	2023-05-02 13:53:57 +00:00
Michal Niewrzal	6a55682bc6	satellite/accounting/nodetally: remove segments loop parts We are switching completely to ranged loop. https://github.com/storj/storj/issues/5368 Change-Id: I6176a129ba14cf83fb635048d09e6748276b52a1	2023-04-24 14:25:53 +00:00
Egon Elbre	2405bc8f3b	satellite/metabase: stop using the common error type Updates https://github.com/storj/storj/issues/5291 Change-Id: I7b57a4b454d3619cb5d8ae4cd92f818ad2839c8b	2023-04-19 19:18:18 +03:00
Egon Elbre	54beea82da	satellite/metabase: define a local ErrObjectNotFound Updates https://github.com/storj/storj/issues/5291 Change-Id: If847cd95a9b500fb18535d20b46b1b1714021584	2023-04-19 12:03:01 +00:00
Egon Elbre	48256c91b5	storage: move errors to better locations Change-Id: Ia44570949a8f6bb50220dc838c5b6aa21e851a4d	2023-04-06 17:26:29 +03:00
Jeff Wendling	7b06575f6f	satellite/meta{base,info}: reduce db round trips for download This combines the ListStreamPositions and GetSegmentByPosition calls with a ListSegments call that now knows how to return only the segments within a Range, just like ListStreamPositions. It would theoretically be possible to also include the GetObjectLastCommitted call by having it do one of three queries based on the incoming request range, but that would mean duplicating the data for the object in every single row that is returned for each segment in the range. One gross thing that ListSegments has to do now is update the first segment returned with the information from any ancestor segment because GetSegmentByPosition used to do that. It only updates the first segment so that it doesn't do O(N) database queries. It seems difficult to have it do a single query to update all of the segments at once. I'm not certain this change should be merged on this basis alone. This change has made me think a couple of things should happen: 1. Server side copy with ancestor segments strikes again making the code less clear and potentially more buggy or inefficient for a rare case (empirically <0.1%) 2. The download code requests individual segments from the satellite lazily as part of its download which requires the satellite telling it the locations of all of the segments which requires the satellite querying the locations of all of the segments. Instead the download RPC could return the orders for all of the segments for a range and the download code could issue N download calls rather than 1 download call and N get segment calls. I believe both sides of the code paths would be simpler and more efficient this way. 3. In looking at the timing information for downloads when testing this, we really need to focus on getting the auth key and bandwidth limit verification times down. Here's the timing I saw: - 42ms: validate auth - 52ms: bandwidth usage checking - 14ms: get object info - 26ms: get segment position info - 26ms: getting the first segment full info - 20ms: unaccounted for by spans - 6ms: creating the orders This change will remove 26ms, but there's a good 90ms in just validation. With improved semantics hitting the database only once and improved validation, a download rpc taking ~30ms seems doable compared to our current ~200ms. Change-Id: I4109dba082eaedb79e634c61dbf86efa93ab1222	2023-04-03 16:49:00 +00:00
Michal Niewrzal	dc9cbe36ae	satellite/metabase: sort stats by creation time not number of entries By accident query to get latest table stats was ordered by 'row_count' column instead 'create'. We need latest stats so we need ordering by creation time. Change-Id: I9d0a0edda8bab59c3d96b7a15cd6502ed51633fc	2023-03-30 12:31:45 +00:00
Michal Niewrzal	54b6e1614a	satellite/metainfo: drop MultipleVersions config flag This flag was in general one time switch to enable versions internally. New we can remove it as it makes code more complex. Change-Id: I740b6e8fae80d5fac51d9425793b02678357490e	2023-03-29 13:45:09 +00:00
paul cannon	ae5947327b	satellite/accounting: Use metabase.AliasPiece with tally observer We want to eliminate usages of LoopSegmentEntry.Pieces, because it is costing a lot of cpu time to look up node IDs with every piece of every segment we read. In this change, we are eliminating use of LoopSegmentEntry.Pieces in the node tally observer (both the ranged loop and segments loop variants). It is not necessary to have a fully resolved nodeID until it is time to store totals in the database. We can use NodeAliases as the map key instead, and resolve NodeIDs just before storing totals. Refs: https://github.com/storj/storj/issues/5622 Change-Id: Iec12aa393072436d7c22cc5a4ae1b63966cbcc18	2023-03-29 12:24:05 +00:00
Michal Niewrzal	06b51258be	satellite/metabase: use table stats if are up to date Currently, to get number of entries in segments table we are doing heavy SELECT count(*) operation. For biggest satellite it's taking 25min now. We are using this method to get stat before and after segments loop so it adds almost 1h to overall loop time. With current version of crdb we are using this additional code won't be used because global configuration for stats refresh rate is inaccurate for such large table like `segments`. Soon we should be able to upgrade crdb and be able to adjust refresh rate per table and configure it to satisfy defined threshold. https://github.com/storj/storj/issues/5544 Change-Id: I05cfd9154f08894d2bc56bf716b436d1b03b87f1	2023-03-13 14:54:13 +00:00

1 2 3 4 5 ...

289 Commits