scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 06:05:53 +00:00

Author	SHA1	Message	Date
Kefu Chai	04434c02b3	sstables: print generation without {:d} the formatter for sstables::generation_type does not support "d" specifier, so we should not use "{:d}" for printing it. this works before `d7c90b5239`, but after that change, generation_type is not an alias of int64_t anymore. and its formatter does not support "d", so we should either specialize fmt::formatter<generation_type> to support it or just drop the specifier. since seastar::format() is using ```c++ fmt::format_to(fmt::appender(out), fmt::runtime(fmt), std::forward<A>(a)...); ``` to print the arguments with given fmt string, we cannot identify these kind of error at compile time. at runtime, if we have issues like this, {fmt} would throw exception like: ``` terminate called after throwing an instance of 'fmt::v9::format_error' what(): invalid format specifier ``` when constructing the `std::runtime_error` instance. so, in this change, "d" is removed. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14427	2023-07-03 13:53:13 +03:00
Pavel Emelyanov	d1de796f6b	sstable: Move XFS renamer hack into fs storage The method sits on sstable, but is called only from fs storage and it's the only place that really needs it Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #14230	2023-06-14 12:35:04 +03:00
Pavel Emelyanov	66e43912d6	code: Switch to seastar API level 7 In that level no io_priority_class-es exist. Instead, all the IO happens in the context of current sched-group. File API no longer accepts prio class argument (and makes io_intent arg mandatory to impls). So the change consists of - removing all usage of io_priority_class - patching file_impl's inheritants to updated API - priority manager goes away altogether - IO bandwidth update is performed on respective sched group - tune-up scylla-gdb.py io_queues command The first change is huge and was made semi-autimatically by: - grep io_priority_class \| default_priority_class - remove all calls, found methods' args and class' fields Patching file_impl-s is smaller, but also mechanical: - replace io_priority_class& argument with io_intent* one - pass intent to lower file (if applicatble) Dropping the priority manager is: - git-rm .cc and .hh - sed out all the #include-s - fix configure.py and cmakefile The scylla-gdb.py update is a bit hairry -- it needs to use task queues list for IO classes names and shares, but to detect it should it checks for the "commitlog" group is present. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13963	2023-06-06 13:29:16 +03:00
Botond Dénes	5a14c3311a	Merge 'Break S3 upload 50Gb file limit' from Pavel Emelyanov Current S3 uploading sink has implicit limit for the final file size that comes from two places. First, S3 protocol declares that uploading parts count from 1 to 10000 (inclusive). Second, uploading sink sends out parts once they grow above S3 minimal part size which is 5Mb. Since sstables puts data in 128kb (or smaller) portions, parts are almost exactly 5Mb in size, so the total uploading size cannot grow above ~50Gb. That's too low. To break the limit the new sink (called jumbo sink) uses the UploadPartCopy S3 call that helps splicing several objects into one right on the server. Jumbo sink starts uploading parts into an intermediate temporary object called a piece and named ${original_object}_${piece_number}. When the number of parts in current piece grows above the configured limit the piece is finalized and upload-copied into the object as its next part, then deleted. This happens in the background, meanwhile the new piece is created and subsequent data is put into it. When the sink is flushed the current piece is flushed as is and also squashed into the object. The new jumbo sink is capable of uploading ~500Tb of data, which looks enough. fixes: #13019 Closes #13577 * github.com:scylladb/scylladb: sstables: Switch data and index sink to use jumbo uploader s3/test: Tune-up multipart upload test alignment s3/test: Add jumbo upload test s3/client: Wait for background upload fiber on close-abort c3/client: Implement jumbo upload sink s3/client: Move memory buffers to upload_sink from base s3/client: Move last part upload out of finalize_upload() s3/client: Merge do_flush() with upload_part() s3/client: Rename upload_sink -> upload_sink_base	2023-05-25 11:44:06 +03:00
Botond Dénes	c2aee26278	Merge 'Keep sstables garbage collection in sstable_directory' from Pavel Emelyanov Currently temporary directories with incomplete sstables and pending deletion log are processed by distributed loader on start. That's not nice, because for s3 backed sstables this code makes no sense (and is currently a no-op because of incomplete implementation). This garbage collecting should be kept in sstable_directory where it can off-load this work onto lister component that is storage-aware. Once g.c. code moved, it allows to clean the class sstable list of static helpers a bit. refs: #13024 refs: #13020 refs: #12707 Closes #13767 * github.com:scylladb/scylladb: sstable: Toss tempdir extension usage sstable: Drop pending_delete_dir_basename() sstable: Drop is_pending_delete_dir() helper sstable_directory: Make garbage_collect() non-static sstable_directory: Move deletion log exists check distributed_loader: Move garbage collecting into sstable_directory distributed_loader: Collect garbace collecting in one call sstable: Coroutinize remove_temp_dir() sstable: Coroutinize touch_temp_dir() sstable: Use storage::temp_dir instead of hand-crafted path	2023-05-19 08:50:13 +03:00
Kefu Chai	03be1f438c	sstables: move get_components_lister() into sstable_directory sstables_manager::get_component_lister() is used by sstable_directory. and almost all the "ingredients" used to create a component lister are located in sstable_directory. among the other things, the two implementations of `components_lister` are located right in `sstable_directory`. there is no need to outsource this to sstables_manager just for accessing the system_keyspace, which is already exposed as a public function of `sstables_manager`. so let's move this helper into sstable_directory as a member function. with this change, we can even go further by moving the `components_lister` implementations into the same .cc file. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13853	2023-05-18 08:43:35 +03:00
Pavel Emelyanov	ed50fda1fe	sstable: Toss tempdir extension usage The tempdir for filesystem-based sstables is {generation}.sstable one. There are two places that need to know the ".sstable" extention -- the tempdir creating code and the tempdir garbage-collecting code. This patch simplifies the sstable class by patching the aforementioned functions to use newly introduced tempdir_extension string directly, without the help of static one-line helpers. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 15:19:38 +03:00
Pavel Emelyanov	22299a31c8	sstable: Coroutinize remove_temp_dir() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 15:16:23 +03:00
Pavel Emelyanov	9db5e9f77f	sstable: Coroutinize touch_temp_dir() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 15:15:38 +03:00
Pavel Emelyanov	7e506354fd	sstable: Use storage::temp_dir instead of hand-crafted path When opening an sstable on filesystem it's first created in a temporary directory whose path is saved in storage::temp_dir variable. However, the opening method constructs the path by hand. Fix that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 15:14:04 +03:00
Pavel Emelyanov	b58ad040d2	sstables: Switch data and index sink to use jumbo uploader These two can grow large. Non-jumbo sink is effectively limited with 10000 parts, since each is ~5Mb the maximum uploadable data/index happens to be 50Gb which is too small. Other components shouldn't grow that big and continue using simple and a bit faster uploading sink. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-16 12:23:18 +03:00
Botond Dénes	20ff122a84	Merge 'Delete S3 sstables without the help of deletion log' from Pavel Emelyanov There are two layers of stables deletion -- delete-atomically and wipe. The former is in fact the "API" method, it's called by table code when the specific sstable(s) are no longer needed. It's called "atomically" because it's expected to fail in the middle in a safe manner so that subsequent boot would pick the dangling parts and proceed. The latter is a low-level removal function that can fail in the middle, but it's not of _its_ care. Currently the atomic deletion is implemented with the help of sstable_directory::delete_atomically() method that commits sstables files names into deletion log, then calls wipe (indirectly), then drops the deletion log. On boot all found deletion logs are replayed. The described functionality is used regardless of the sstable storage type, even for S3, though deletion log is an overkill for S3, it's better be implemented with the help of ownership table. In fact, S3 storage already implements atomic deletion in its wipe method thus being overly careful. So this PR - makes atomic deletion be storage-specific - makes S3 wipe non-atomic fixes: #13016 note: Replaying sstables deletion from ownership table on boot is not here, see #13024 Closes #13562 * github.com:scylladb/scylladb: sstables: Implement atomic deleter for s3 storage sstables: Get atomic deleter from underlying storage sstables: Move delete_atomically to manager and rename	2023-05-15 08:57:47 +03:00
Pavel Emelyanov	8bca54902c	sstables: Implement atomic deleter for s3 storage The existing storage::wipe() method of s3 is in fact atomic deleter -- it commits "deleting" status into ownership table, deletes the objects from server, then removes the entry from ownership table. So the atomic deleter does the same and the .wipe() just removes the objects, because it's not supposed to be atomic. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-12 17:52:13 +03:00
Pavel Emelyanov	6a8139a4fe	sstables: Get atomic deleter from underlying storage While the driver isn't known without the sstable itself, we have a vector of them can can get it from the front element. This is not very generic, but fortunately all sstables here belong to the same table and, respectively, to the same storage and even prefix. The latter is also assert-checked by the sstable_directory atomic deleter code. For now S3 storage returns the same directory-based deleter, but next patch will change that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-12 17:52:13 +03:00
Pavel Emelyanov	e6760482b2	sstables: Keep clients shared across sstables Nowadays each sstable gets its own instance of an s3::client. This patch keeps clients on storage_manager's endpoints map and when creating a storage for an sstable -- grab the shared pointer from the map, thus making one client serve all sstables over there (except for those that duplicated their files with the help of foreign-info, but that's to be handled by next patches). Moving the ownership of a client to the storage_manager level also means that the client has to be closed on manager's stop, not on sstable destroy. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-11 19:39:01 +03:00
Kefu Chai	a69282e69b	sstables: storage: coroutinize idempotent_link_file() Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 16:47:00 +08:00
Kefu Chai	2eefcb37eb	sstables: extract storage out this change extracts the storage class and its derived classes out into storage.cc and storage.hh. for couple reasons: - for better readability. the sstables.hh is over 1005 lines. and sstables.cc 3602 lines. it's a little bit difficult to figure out how the different parts in these sources interact with each other. for instance, with this change, it's clear some of helper functions are only used by file_system_storage. - probably less inter-source dependency. by extracting the sources files out, they can be compiled individually, so changing one .cc file does not impact others. this could speed up the compilation time. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 16:47:00 +08:00

17 Commits