scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 20:46:56 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	c0b922a8af	sstable_directory: Construct with state This is to replace full path sitting on this object eventually. For now they have to co-exist, but state will be used to make_sstable()-s from manager with its new API Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-14 14:56:01 +03:00
Pavel Emelyanov	6fc62c2d9f	distributed_loader: Make sstable with desired state when populating This still needs to conver state to directory name internally as sstable_directory instances are hashed on populator by subdir string. Also the full string path is printed in logs. All this is now internal to populate method and will be fixed later Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-14 14:45:52 +03:00
Pavel Emelyanov	b0064f5c55	distributed_loader: Make sstable with upload state when uploading Just make use of the new shiny sstables_manager API Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-14 14:45:52 +03:00
Pavel Emelyanov	c257ad90e1	sstable_directory: Merge verify and g.c. calls Name it .prepare() and remove the sstable_directory() public method Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-14 14:45:51 +03:00
Pavel Emelyanov	07d4672054	distributed_loader: Merge verify and gc invocations Both are launched on shard-0, no need to invoke_on two times Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-14 14:41:48 +03:00
Pavel Emelyanov	c3b23fc03d	Merge 'Skip mode validation for snapshots' from Benny Halevy Skip over verification of owner and mode of the snapshots sub-directory as this might race with scylla-manager trying to delete old snapshots concurrently. Fixes #12010 Closes #14892 * github.com:scylladb/scylladb: distributed_loader: process_sstable_dir: do not verify snapshots utils/directories: verify_owner_and_mode: add recursive flag	2023-08-02 13:05:47 +03:00
Benny Halevy	845b6f901b	distributed_loader: process_sstable_dir: do not verify snapshots Skip over verification of owner and mode of the snapshots sub-directory as this might race with scylla-manager trying to delete old snapshots concurrently. Fixes #12010 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-07-31 16:01:46 +03:00
Aleksandra Martyniuk	6796721c3d	replica: add methods to get table or table id	2023-07-25 17:13:24 +02:00
Aleksandra Martyniuk	52afd9d42d	replica: wrap column families related maps into tables_metadata As a preparation for ensuring access safety for column families related maps, add tables_metadata, access to members of which would be protected by rwlock.	2023-07-25 16:13:00 +02:00
Aleksandra Martyniuk	7a7e287d8c	compaction: add reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction. A struct and some functions are moved from replica/distributed_loader.cc to compaction/task_manager_module.cc.	2023-07-19 17:15:40 +02:00
Kefu Chai	aeb160a654	sstables: use sstables_manager::uuid_stable_identifier() instead of accessing the `feature_service`'s member variable, use the accessor provided by sstable_manager. so we always access the this setting via a single channel. this should helps with the readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14658	2023-07-13 10:31:06 +03:00
Michał Chojnowski	b511d57fc8	Revert "Merge 'Compaction resharding tasks' from Aleksandra Martyniuk" This reverts commit `2a58b4a39a`, reversing changes made to `dd63169077`. After patch `87c8d63b7a`, table_resharding_compaction_task_impl::run() performs the forbidden action of copying a lw_shared_ptr (_owned_ranges_ptr) on a remote shard, which is a data race that can cause a use-after-free, typically manifesting as allocator corruption. Note: before the bad patch, this was avoided by copying the _contents_ of the lw_shared_ptr into a new, local lw_shared_ptr. Fixes #14475 Fixes #14618 Closes #14641	2023-07-11 19:11:37 +03:00
Aleksandra Martyniuk	85cc85fc5a	replica: delete unused functions and struct	2023-06-28 11:41:43 +02:00
Aleksandra Martyniuk	837d77ba8c	compaction: add reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction.	2023-06-28 11:41:43 +02:00
Raphael S. Carvalho	83c70ac04f	utils: Extract pretty printers into a header Can be easily reused elsewhere. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-06-26 21:58:20 -03:00
Aleksandra Martyniuk	19ec5b4256	replica: delete unused function	2023-06-23 15:57:43 +02:00
Aleksandra Martyniuk	e3e2d6b886	compaction: add table_reshaping_compaction_task_impl	2023-06-23 15:57:37 +02:00
Kefu Chai	f014ccf369	Revert "Revert "Merge 'treewide: add uuid_sstable_identifier_enabled support' from Kefu Chai"" This reverts commit `562087beff`. The regressions introduced by the reverted change have been fixed. So let's revert this revert to resurrect the uuid_sstable_identifier_enabled support. Fixes #10459	2023-06-21 13:02:40 +03:00
Tomasz Grabiec	36da062bcb	db: Use table sharder in compaction	2023-06-21 00:58:24 +02:00
Tomasz Grabiec	ad983ac23d	sstables: Compute sstable shards using sharder from erm when loading schema::get_sharder() does not use the correct sharder for tablet-based tables. Code which is supposed to work with all kinds of tables should obtain the sharder from erm::get_sharder().	2023-06-21 00:58:24 +02:00
Botond Dénes	562087beff	Revert "Merge 'treewide: add uuid_sstable_identifier_enabled support' from Kefu Chai" This reverts commit `d1dc579062`, reversing changes made to `3a73048bc9`. Said commit caused regressions in dtests. We need to investigate and fix those, but in the meanwhile let's revert this to reduce the disruption to our workflows. Refs: #14283	2023-06-19 08:49:27 +03:00
Kamil Braun	33c19baabc	db: system_keyspace: take simpler service references in `make` Take references to services which are initialized earlier. The references to `gossiper`, `storage_service` and `raft_group0_registry` are no longer needed. This will allow us to move the `make` step right after starting `system_keyspace`.	2023-06-18 13:39:27 +02:00
Kefu Chai	2d265e860d	replica,sstable: introduce invalid generation id the invalid sstable id is the NULL of a sstable identifier. with this concept, it would be a lot simpler to find/track the greatest generation. the complexity is hidden in the generation_type, which compares the a) integer-based identifiers b) uuid-based identifiers c) invalid identitifer in different ways. so, in this change * the default constructor generation_type is now public. * we don't check for empty generation anymore when loading SSTables or enumerating them. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-06-15 17:54:59 +08:00
Kefu Chai	939fa087cc	sstables, replica: pass uuid_sstable_identifiers to generation generator before this change, we assume that generation is always integer based. in order to enable the UUID-based generation identifier if the related option is set, we should populate this option down to generation generator. because we don't have access to the cluster features in some places where a new generation is created, a new accessor exposing feature_service from sstable manager is added. Fixes #10459 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-06-15 17:54:59 +08:00
Pavel Emelyanov	66e43912d6	code: Switch to seastar API level 7 In that level no io_priority_class-es exist. Instead, all the IO happens in the context of current sched-group. File API no longer accepts prio class argument (and makes io_intent arg mandatory to impls). So the change consists of - removing all usage of io_priority_class - patching file_impl's inheritants to updated API - priority manager goes away altogether - IO bandwidth update is performed on respective sched group - tune-up scylla-gdb.py io_queues command The first change is huge and was made semi-autimatically by: - grep io_priority_class \| default_priority_class - remove all calls, found methods' args and class' fields Patching file_impl-s is smaller, but also mechanical: - replace io_priority_class& argument with io_intent* one - pass intent to lower file (if applicatble) Dropping the priority manager is: - git-rm .cc and .hh - sed out all the #include-s - fix configure.py and cmakefile The scylla-gdb.py update is a bit hairry -- it needs to use task queues list for IO classes names and shares, but to detect it should it checks for the "commitlog" group is present. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13963	2023-06-06 13:29:16 +03:00
Raphael S. Carvalho	156d771101	compaction: Fix sstable cleanup after resharding on refresh Problem can be reproduced easily: 1) wrote some sstables with smp 1 2) shut down scylla 3) moved sstables to upload 4) restarted scylla with smp 2 5) ran refresh (resharding happens, adds sstable to cleanup set and never removes it) 6) cleanup (tries to cleanup resharded sstables which were leaked in the cleanup set) Bumps into assert "Assertion `!sst->is_shared()' failed", as cleanup picks a shared sstable that was leaked and already processed by resharding. Fix is about not inserting shared sstables into cleanup set, as shared sstables are restricted to resharding and cannot be processed later by cleanup (nor it should because resharding itself cleaned up its input files). Dtest: https://github.com/scylladb/scylla-dtest/pull/3206 Fixes #14001. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #14147	2023-06-06 12:14:03 +03:00
Pavel Emelyanov	a19b8af187	table: Relocate ks.make_directory_for_column_family() This method initializes storage for table naturally belongs to that class. So rename it while moving. Also, there's no longer need to carry table name and uuid as arguments, being table method it can just get the paths to work on from config Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-26 18:15:41 +03:00
Pavel Emelyanov	6db5f08eab	distributed_loader: Use cf.dir() instead of ks.column_family_directory() These two return the same, but the latter makes it the harder way Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-26 17:59:47 +03:00
Botond Dénes	c2aee26278	Merge 'Keep sstables garbage collection in sstable_directory' from Pavel Emelyanov Currently temporary directories with incomplete sstables and pending deletion log are processed by distributed loader on start. That's not nice, because for s3 backed sstables this code makes no sense (and is currently a no-op because of incomplete implementation). This garbage collecting should be kept in sstable_directory where it can off-load this work onto lister component that is storage-aware. Once g.c. code moved, it allows to clean the class sstable list of static helpers a bit. refs: #13024 refs: #13020 refs: #12707 Closes #13767 * github.com:scylladb/scylladb: sstable: Toss tempdir extension usage sstable: Drop pending_delete_dir_basename() sstable: Drop is_pending_delete_dir() helper sstable_directory: Make garbage_collect() non-static sstable_directory: Move deletion log exists check distributed_loader: Move garbage collecting into sstable_directory distributed_loader: Collect garbace collecting in one call sstable: Coroutinize remove_temp_dir() sstable: Coroutinize touch_temp_dir() sstable: Use storage::temp_dir instead of hand-crafted path	2023-05-19 08:50:13 +03:00
Pavel Emelyanov	c3fca9481c	replica: Use global_table_ptr in distributed loader The loader has very similar global_column_family_ptr class for its distributed loadings. Now it can use the "standard" one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 18:14:34 +03:00
Pavel Emelyanov	7429205632	sstable_directory: Make garbage_collect() non-static When non static the call can use sstable_directory::_sstable_dir path, not the provided argument. The main benefit is that the method can later be moved onto lister so that filesystem and ownership-table listers can process dangling bits differently. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 15:16:23 +03:00
Pavel Emelyanov	3d7122d2fe	distributed_loader: Move garbage collecting into sstable_directory It's the directory that owns the components lister and can reason about the way to pick up dangling bits, be it local directories or entries from the ownership table. First thing to do is to move the g.c. code into sstable_directory. While at it -- convert ssting dir into fs::path dir and switch logger. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 15:16:23 +03:00
Pavel Emelyanov	99f924666f	distributed_loader: Collect garbace collecting in one call When the loader starts it first scans the directory for sstables' tempdirs and pending deletion logs. Put both into one call so that it can be moved more easily later. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 15:16:23 +03:00
Kefu Chai	9b35faf485	treewide: replace generation_type::value() with generation_type::as_int() * replace generation_type::value() with generation_type::as_int() * drop generation_value() because we will switch over to UUID based generation identifier, the member function or the free function generation_value() cannot fulfill the needs anymore. so, in this change, they are consolidated and are replaced by "as_int()", whose name is more specific, and will also work and won't be misleading even after switching to UUID based generation identifier. as `value()` would be confusing by then: it could be an integer or a UUID. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-06 18:24:45 +08:00
Benny Halevy	e2023877f2	sstable_directory: parallel_for_each_restricted: do not move container Commit `ecbd112979` `distributed_loader: reshard: consider sstables for cleanup` caused a regression in loading new sstables using the `upload` directory, as seen in e.g. https://jenkins.scylladb.com/view/master/job/scylla-master/job/dtest-daily-release/230/testReport/migration_test/TestMigration/Run_Dtest_Parallel_Cloud_Machines___FullDtest___full_split000___test_migrate_sstable_without_compression_3_0_md_/ ``` query = "SELECT COUNT() FROM cf" statement = SimpleStatement(query) s = self.patient_cql_connection(node, 'ks') result = list(s.execute(statement)) > assert result[0].count == expected_number_of_rows, \ "Expected {} rows. Got {}".format(expected_number_of_rows, list(s.execute("SELECT FROM ks.cf"))) E AssertionError: Expected 1 rows. Got [] E assert 0 == 1 E +0 E -1 ``` The reason for the regression is that the call to `do_for_each_sstable` in `collect_all_shared_sstables` to search for sstables that need cleanup caused the list of sstables in the sstable directory to be moved and cleared. parallel_for_each_restricted moves the container passed to it into a `do_with` continuation. This is required for parallel_for_each_restricted. However, moving the container is destructive and so, the decision whether to move or not needs to be the caller's, not the callee. This patch changes the signature of parallel_for_each_restricted to accept a lvalue reference to the container rather than a rvalue reference, allowing the callers to decide whether to move or not. Most callers are converted to move the container, as they effectively do today, and a new method, `filter_sstables` was added for the `collect_all_shared_sstables` us case, that allows the `func` that processes each sstable to decide whether the sstable is kept in `_unshared_local_sstables` or not. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-04 11:36:25 +03:00
Avi Kivity	7b7d9bcb14	Merge 'Do not access owned_ranges_ptr across shards in update_sstable_cleanup_state' from Benny Halevy This series fixes a few issues caused by `f1bbf705f9` (`f1bbf705f9`): - table, compaction_manager: prevent cross shard access to owned_ranges_ptr - Fixes #13631 - distributed_loader: distribute_reshard_jobs: pick one of the sstable shard owners - compaction: make_partition_filter: do not assert shard ownership - allow the filtering reader now used during resharding to process tokens owned by other shards Closes #13635 * github.com:scylladb/scylladb: compaction: make_partition_filter: do not assert shard ownership distributed_loader: distribute_reshard_jobs: pick one of the sstable shard owners table, compaction_manager: prevent cross shard access to owned_ranges_ptr	2023-05-01 22:51:00 +03:00
Benny Halevy	c7d064b8b1	distributed_loader: distribute_reshard_jobs: pick one of the sstable shard owners When distributing the resharding jobs, prefer one of the sstable shard owners based on foreign_sstable_open_info. This is particularly important for uploaded sstables that are resharded since they require cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 15:13:16 +03:00
Benny Halevy	2f61de8f7b	table, compaction_manager: prevent cross shard access to owned_ranges_ptr Seen after `f1bbf705f9` in debug mode distributed_loader collect_all_shared_sstables copies compaction::owned_ranges_ptr (lw_shared_ptr<const dht::token_range_vector>) across shards. Since update_sstable_cleanup_state is synchronous, it can be passed a const refrence to the token_range_vector instead. It is ok to access the memory read-only across shards and since this happens on start-up, there are no special performance requirements. Fixes #13631 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 15:12:13 +03:00
Kefu Chai	576adbdbc5	replica, test: create generation id using generator reuse generation_generator for generating generation identifiers for less repeatings. also, add allow update generator to update its lastest known generation id. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-21 22:02:30 +08:00
Raphael S. Carvalho	fe6df3d270	sstable_loader: Discard SSTable bloom filter on load-and-stream Load-and-stream reads the entire content from SSTables, therefore it can afford to discard the bloom filter that might otherwise consume a significant amount of memory. Bloom filters are only needed by compaction and other replica::table operations that might want to check the presence of keys in the SSTable files, like single-partition reads. It's not uncommon to see Data:Filter ratio of less than 100:1, meaning that for ~300G of data, filters will take ~3G. In addition to saving memory footprint, it also reduces operation time as load-and-stream no longer have to read, parse and build the filters from disk into memory. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-04-13 11:34:22 -03:00
Botond Dénes	f1bbf705f9	Merge 'Cleanup sstables in resharding and other compaction types' from Benny Halevy This series extends sstable cleanup to resharding and other (offstrategy, major, and regular) compaction types so to: * cleanup uploaded sstables (#11933) * cleanup staging sstables after they are moved back to the main directory and become eligible for compaction (#9559) When perform_cleanup is called, all sstables are scanned, and those that require cleanup are marked as such, and are added for tracking to table_state::cleanup_sstable_set. They are removed from that set once released by compaction. Along with that sstables set, we keep the owned_ranges_ptr used by cleanup in the table_state to allow other compaction types (offstrategy, major, or regular) to cleanup those sstables that are marked as require_cleanup and that were skipped by cleanup compaction for either being in the maintenance set (requiring offstrategy compaction) or in staging. Resharding is using a more straightforward mechanism of passing the owned token ranges when resharding uploaded sstables and using it to detect sstable that require cleanup, now done as piggybacked on resharding compaction. Closes #12422 * github.com:scylladb/scylladb: table: discard_sstables: update_sstable_cleanup_state when deleting sstables compaction_manager: compact_sstables: retrieve owned ranges if required sstables: add a printer for shared_sstable compaction_manager: keep owned_ranges_ptr in compaction_state compaction_manager: perform_cleanup: keep sstables in compaction_state::sstables_requiring_cleanup compaction: refactor compaction_state out of compaction_manager compaction: refactor compaction_fwd.hh out of compaction_descriptor.hh compaction_manager: compacting_sstable_registration: keep a ref to the compaction_state compaction_manager: refactor get_candidates compaction_manager: get_candidates: mark as const table, compaction_manager: add requires_cleanup sstable_set: add for_each_sstable_until distributed_loader: reshard: update sstable cleanup state table, compaction_manager: add update_sstable_cleanup_state compaction_manager: needs_cleanup: delete unused schema param compaction_manager: perform_cleanup: disallow empty sorted_owened_ranges distributed_loader: reshard: consider sstables for cleanup distributed_loader: process_upload_dir: pass owned_ranges_ptr to reshard distributed_loader: reshard: add optional owned_ranges_ptr param distributed_loader: reshard: get a ref to table_state distributed_loader: reshard: capture creator by ref distributed_loader: reshard: reserve num_jobs buckets compaction: move owned ranges filtering to base class compaction: move owned_ranges into descriptor	2023-04-11 14:52:29 +03:00
Benny Halevy	db7fa9f3be	distributed_loader: reshard: update sstable cleanup state Since the sstables are loaded from foreign open info we should mark them for cleanup if needed (and owned_ranges_ptr is provided). This will allow a later patch to enable filtering for cleanup only for sstable sets containing sstables that require cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:11:00 +03:00
Benny Halevy	d0690b64c1	table, compaction_manager: add update_sstable_cleanup_state update_sstable_cleanup_state calls needs_cleanup and inserts (or erases) the sstable into the respective compaction_state.sstables_requiring_cleanup set. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:10:55 +03:00
Benny Halevy	1baca96de1	compaction_manager: needs_cleanup: delete unused schema param It isn't needed. The sstable already has a schema. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:03:53 +03:00
Benny Halevy	ecbd112979	distributed_loader: reshard: consider sstables for cleanup When called from `process_upload_dir` we pass a list of owned tokens to `reshard`. When they are available, run resharding, with implicit cleanup, also on unshared sstables that need cleanup. Fixes #11933 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:01:38 +03:00
Benny Halevy	3ccbb28f2a	distributed_loader: process_upload_dir: pass owned_ranges_ptr to reshard To facilitate implicit cleanup of sstables via resharding. Refs #11933 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:59:38 +03:00
Benny Halevy	aa4b18f8fb	distributed_loader: reshard: add optional owned_ranges_ptr param For passing owned_ranges_ptr from distributed_loader::process_upload_dir. Refs #11933 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:57:41 +03:00
Benny Halevy	f540af930b	distributed_loader: reshard: get a ref to table_state We don't reference the table itself, only as_table_state. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:57:11 +03:00
Benny Halevy	c6b7fcc26f	distributed_loader: reshard: capture creator by ref Now that reshard is a coroutine, creator is preserved in the coroutine frame until completion so we can simply capture it by reference now. Note that previously it was moved into the compaction descriptor, but the capture wasn't mutable so it was copied anyhow and this change doesn't introduced a regression. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:56:07 +03:00
Benny Halevy	7c9d16ff96	distributed_loader: reshard: reserve num_jobs buckets We know in advance how many buckets we need. We still need to emplace the first bucket upfront. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:55:35 +03:00

1 2 3 4

157 Commits