restic

Commit Graph

Author	SHA1	Message	Date
Gilbert Gilb's	536ebefff4	feat(backends/s3): add warmup support before repacks and restores (#5173 ) * feat(backends/s3): add warmup support before repacks and restores This commit introduces basic support for transitioning pack files stored in cold storage to hot storage on S3 and S3-compatible providers. To prevent unexpected behavior for existing users, the feature is gated behind new flags: - `s3.enable-restore`: opt-in flag (defaults to false) - `s3.restore-days`: number of days for the restored objects to remain in hot storage (defaults to `7`) - `s3.restore-timeout`: maximum time to wait for a single restoration (default to `1 day`) - `s3.restore-tier`: retrieval tier at which the restore will be processed. (default to `Standard`) As restoration times can be lengthy, this implementation preemptively restores selected packs to prevent incessant restore-delays during downloads. This is slightly sub-optimal as we could process packs out-of-order (as soon as they're transitioned), but this would really add too much complexity for a marginal gain in speed. To maintain simplicity and prevent resources exhautions with lots of packs, no new concurrency mechanisms or goroutines were added. This just hooks gracefully into the existing routines. Limitations: - Tests against the backend were not written due to the lack of cold storage class support in MinIO. Testing was done manually on Scaleway's S3-compatible object storage. If necessary, we could explore testing with LocalStack or mocks, though this requires further discussion. - Currently, this feature only warms up before restores and repacks (prune/copy), as those are the two main use-cases I came across. Support for other commands may be added in future iterations, as long as affected packs can be calculated in advance. - The feature is gated behind a new alpha `s3-restore` feature flag to make it explicit that the feature is still wet behind the ears. - There is no explicit user notification for ongoing pack restorations. While I think it is not necessary because of the opt-in flag, showing some notice may improve usability (but would probably require major refactoring in the progress bar which I didn't want to start). Another possibility would be to add a flag to send restores requests and fail early. See https://github.com/restic/restic/issues/3202 * ui: warn user when files are warming up from cold storage * refactor: remove the PacksWarmer struct It's easier to handle multiple handles in the backend directly, and it may open the door to reducing the number of requests made to the backend in the future.	2025-02-01 18:26:27 +00:00
Michael Eischer	e77681f2cd	remove unnecessary min function	2025-01-28 19:52:22 +01:00
Michael Eischer	9331461a13	prune: correctly account for duplicates in max-unused check The size comparison for `--max-unused` only accounted for unused but not for duplicate data. For repositories with a large amount of duplicates this can result in a situation where no data gets pruned even though the amount of unused data is much higher than specified.	2025-01-19 17:47:49 +01:00
Michael Eischer	b7ff8ea9cd	repository: expose cache via method	2025-01-13 22:40:18 +01:00
Michael Eischer	99e105eeb6	repository: restrict SaveUnpacked and RemoveUnpacked Those methods now only allow modifying snapshots. Internal data types used by the repository are now read-only. The repository-internal code can bypass the restrictions by wrapping the repository in an `internalRepository` type. The restriction itself is implemented by using a new datatype WriteableFileType in the SaveUnpacked and RemoveUnpacked methods. This statically ensures that code cannot bypass the access restrictions. The test changes are somewhat noisy as some of them modify repository internals and therefore require some way to bypass the access restrictions. This works by capturing an `internalRepository` or `Backend` when creating the Repository using a test helper function.	2025-01-13 22:39:57 +01:00
knbr13	bbb492ee65	remove duplicate imports	2025-01-05 13:53:20 +02:00
greatroar	b5c28a7ba2	internal/restic: Use IDSet.Clone + use maps package One place where IDSet.Clone is useful was reinventing it, using a conversion to list, a sort, and a conversion back to map. Also, use the stdlib "maps" package to implement as much of IDSet as possible. This requires changing one caller, which assumed that cloning nil would return a non-nil IDSet.	2024-10-03 21:14:29 +02:00
Michael Eischer	80ed863aab	repository: remove redundant cleanup code The temp files used by the packer manager are either delete after creation (unix) or marked as delete on close (windows). Thus, no explicit cleanup is necessary.	2024-08-31 17:37:25 +02:00
Michael Eischer	6024597028	drop support for s3legacy layout	2024-08-31 17:25:24 +02:00
Michael Eischer	943b6ccfba	index: remove support for legacy index format	2024-08-31 17:12:43 +02:00
Srigovind Nayak	88174cd0a4	cache: remove redundant index file cleanup addressing code review comments	2024-08-17 00:21:49 +05:30
Srigovind Nayak	b7d014b685	Revert "repository: removed redundant prepareCache method from Repository" This reverts commit `720609f8ba`.	2024-08-17 00:18:13 +05:30
Srigovind Nayak	720609f8ba	repository: removed redundant prepareCache method from Repository * remove the prepareCache method from the Repository * changed the signature of the SetIndex function to no longer return an error	2024-08-11 23:41:07 +05:30
Michael Eischer	400ae55940	replace deprecated usages of math/rand	2024-08-10 19:34:49 +02:00
Michael Eischer	0b19f6cf5a	Switch back to sha256 from the std library The std library now also supports the sha assembly instructions on ARM64. Thus, sha256-simd no longer provides a performance benefit.	2024-08-10 19:16:10 +02:00
Michael Eischer	d2f7c5a9c6	Merge pull request #4978 from konidev20/fix-gh-4949-repair-index-spurious-index rewrite: skip saving empty indexes during MasterIndex.Rewrite	2024-08-03 18:53:57 +00:00
Srigovind Nayak	068d5b95c3	rewrite: skip saving empty indexes during MasterIndex.Rewrite	2024-08-03 23:34:59 +05:30
Michael Eischer	ae1cb889dd	Add more checks for canceled contexts	2024-07-31 19:30:47 +02:00
Viktor Szépe	ac00229386	Fix typos	2024-07-03 20:02:06 +02:00
Michael Eischer	b80aa7b1cc	repository: prevent initialization if a snapshot exists	2024-06-14 20:37:01 +02:00
Michael Eischer	bab760369f	repository: double check that there is not repository before init Apparently, calling `Stat` on the config file can be unreliable for some backends.	2024-06-09 00:05:32 +02:00
Michael Eischer	496e57f956	hashing: move to repository package	2024-05-25 13:13:03 +02:00
Michael Eischer	5e0ea8fcfa	pack: move to repository package	2024-05-25 13:13:03 +02:00
Michael Eischer	50ec408302	index: move to repository package	2024-05-25 13:13:03 +02:00
Michael Eischer	8e5d7d719c	cache: move to backend package	2024-05-24 23:04:06 +02:00
Michael Eischer	3c7b7efdc9	repository: remove prune plan parts once they are no longer necessary	2024-05-24 22:18:14 +02:00
Michael Eischer	462b82a060	index: reduce size of compressed indexes use the same index size for compressed and uncompressed indexes. Otherwise, decoding the index of a compressed repository requires significantly more memory.	2024-05-24 22:18:14 +02:00
Michael Eischer	77873f5a9d	repository: let prune control data structure of usedBlobs set	2024-05-24 22:18:14 +02:00
Michael Eischer	2033c02b09	index: replace CountedBlobSet with AssociatedSet	2024-05-24 22:18:14 +02:00
Michael Eischer	93098e9265	prune: hide implementation details of counted blob set	2024-05-24 21:42:56 +02:00
Michael Eischer	027cc64737	repository: fix prune heuristic to allow resuming interrupted runs Pack files created by interrupted prune runs, appear to consist only of duplicate blobs on the next run. This caused the previous heuristic to ignore those pack files. Now, a duplicate blob in a specific pack file is also selected if that pack file only contains duplicate blobs. This allows prune to select the already rewritten pack files.	2024-05-24 21:33:17 +02:00
Michael Eischer	57d69aa640	index: cleanup SaveIndex method	2024-05-24 21:33:17 +02:00
Michael Eischer	5f7b48e65f	index: replace Save() method with Rewrite and SaveFallback Rewrite implements a streaming rewrite of the index that excludes the given packs. For this it loads all index files from the repository and only modifies those that require changes. This will reduce the index churn when running prune. Rewrite does not require the in-memory index and thus can drop it to significantly reduce the memory usage. However, `prune --unsafe-recovery` cannot use this strategy and requires a separate method to save the whole in-memory index. This is now handled using SaveFallback.	2024-05-24 21:33:17 +02:00
Michael Eischer	68fa0e0305	prune: no longer disable automatic index updates this allows prune to resume an interrupted prune run.	2024-05-24 21:33:17 +02:00
Michael Eischer	76e6719f2e	repository: make CreateIndexFromPacks method private	2024-05-24 21:33:17 +02:00
Michael Eischer	04ad9f0c0c	repository: remove Packer and SavePacker from public interface	2024-05-24 21:33:17 +02:00
Michael Eischer	550d1eeac3	repository: remove SaveIndex from interface The method is now only indirectly accessible via Prune or RepairIndex.	2024-05-24 21:33:17 +02:00
Michael Eischer	447b486c20	index: deduplicate index loading of check and repository	2024-05-24 21:33:17 +02:00
Michael Eischer	864995271e	repository: unwrap BlobHandle parameters of LookupBlob The method now uses the same parameters as LookupBlobSize.	2024-05-24 21:33:17 +02:00
Michael Eischer	1266a4932f	repository: fix parameter order of LookupBlobSize All methods should use blobType followed by ID.	2024-05-24 21:33:17 +02:00
Michael Eischer	0bb0720348	test cleanups	2024-05-24 21:33:17 +02:00
Michael Eischer	0aa5c53842	repository: replace HasBlob with LookupBlobSize	2024-05-24 21:33:17 +02:00
Michael Eischer	8f1e70cd9b	repository: remove clearIndex and packSize from public interface	2024-05-24 21:33:17 +02:00
Michael Eischer	4df887406f	repository: inline MasterIndex interface into Repository interface	2024-05-24 21:33:17 +02:00
Michael Eischer	b1266867d2	repository: wait max 1 minutes for lock removal if context is canceled The toplevel context in restic only canceled if the user interrupts a restic operation. If the network connection has failed this can require waiting the full retry duration of 15 minutes which is a bad user experience for interactive usage. Thus limit the delay to one minute in this case.	2024-05-24 20:24:02 +02:00
Michael Eischer	223aa22cb0	replace some uses of restic.Repository with finegrained interfaces	2024-05-18 21:42:51 +02:00
Michael Eischer	291c9677de	restic/repository: remove Backend() method	2024-05-18 21:42:51 +02:00
Michael Eischer	673496b091	repository: clean cache between CheckPack retries The cache cleanup pattern is also used in ListPack etc.	2024-05-18 21:42:51 +02:00
Michael Eischer	d2c26e33f3	repository: remove further usages of repo.Backend()	2024-05-18 21:42:51 +02:00
Michael Eischer	8a425c2f0a	remove usages of repo.Backend() from tests	2024-05-18 21:42:51 +02:00

1 2 3 4 5 ...

432 Commits