scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-19 16:15:07 +00:00

Author	SHA1	Message	Date
Benny Halevy	3ca0c6c0a5	compaction_manager: try_perform_cleanup: set owned_ranges_ptr with compaction disabled Otherwise regular compaction can sneak in and see !cs.sstables_requiring_cleanup.empty() with cs.owned_ranges_ptr == nullptr and trigger the internal error in `compaction_task_executor::compact_sstables`. Fixes scylladb/scylladb#14296 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #14297	2023-06-27 08:47:13 +03:00
Aleksandra Martyniuk	74e5b4ebfc	compaction: turn major_compaction_task_executor into major_compaction_task_impl major_compaction_task_executor inherits both from compaction_task_executor and major_compaction_task_impl. Thanks to that an executed operation is represented in task manager.	2023-06-20 12:12:49 +02:00
Aleksandra Martyniuk	4922f4cf80	compaction: take gate holder out of task executor In the following commits, classes deriving from compaction_task_executor will be alive longer than they are kept in compaction_manager::_tasks. Thus, the compaction_task_executor::_gate_holder would be held, blocking other compactions. compaction_task_executor::_gate_holder is moved outside of compaction_task_executor object.	2023-06-20 12:12:45 +02:00
Aleksandra Martyniuk	e317ffe23a	compaction: extend signature of some methods Extend a signature of table::compact_all_sstables and compaction_manager::perform_major_compaction so that they get the info of a covering task. This allows to easily create child tasks that cover compaction group compaction.	2023-06-20 10:45:34 +02:00
Aleksandra Martyniuk	3007fbeee3	compaction: rename compaction_task_executor methods compaction_task_executor methods are renamed to prevent name colisions between compaction_task_executor and tasks::task_manager::task::impl.	2023-06-20 10:45:34 +02:00
Pavel Emelyanov	5412c7947a	backlog_controller: Unwrap scheduling_group Some time ago (`997a34bf8c`) the backlog controller was generalized to maintain some scheduling group. Back then the group was the pair of seastar::scheduling_group and seastar::io_priority_class. Now the latter is gone, so the controller's notion of what sched group is can be relaxed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #14266	2023-06-16 12:02:14 +03:00
Pavel Emelyanov	66e43912d6	code: Switch to seastar API level 7 In that level no io_priority_class-es exist. Instead, all the IO happens in the context of current sched-group. File API no longer accepts prio class argument (and makes io_intent arg mandatory to impls). So the change consists of - removing all usage of io_priority_class - patching file_impl's inheritants to updated API - priority manager goes away altogether - IO bandwidth update is performed on respective sched group - tune-up scylla-gdb.py io_queues command The first change is huge and was made semi-autimatically by: - grep io_priority_class \| default_priority_class - remove all calls, found methods' args and class' fields Patching file_impl-s is smaller, but also mechanical: - replace io_priority_class& argument with io_intent* one - pass intent to lower file (if applicatble) Dropping the priority manager is: - git-rm .cc and .hh - sed out all the #include-s - fix configure.py and cmakefile The scylla-gdb.py update is a bit hairry -- it needs to use task queues list for IO classes names and shares, but to detect it should it checks for the "commitlog" group is present. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13963	2023-06-06 13:29:16 +03:00
Raphael S. Carvalho	156d771101	compaction: Fix sstable cleanup after resharding on refresh Problem can be reproduced easily: 1) wrote some sstables with smp 1 2) shut down scylla 3) moved sstables to upload 4) restarted scylla with smp 2 5) ran refresh (resharding happens, adds sstable to cleanup set and never removes it) 6) cleanup (tries to cleanup resharded sstables which were leaked in the cleanup set) Bumps into assert "Assertion `!sst->is_shared()' failed", as cleanup picks a shared sstable that was leaked and already processed by resharding. Fix is about not inserting shared sstables into cleanup set, as shared sstables are restricted to resharding and cannot be processed later by cleanup (nor it should because resharding itself cleaned up its input files). Dtest: https://github.com/scylladb/scylla-dtest/pull/3206 Fixes #14001. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #14147	2023-06-06 12:14:03 +03:00
Benny Halevy	17795757d3	compaction_manager: compact_sstables: fix typo in log message about cleanup Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #14151	2023-06-06 11:17:02 +03:00
Raphael S. Carvalho	23443e0574	compaction: Fix incremental compaction for sstable cleanup After `c7826aa910`, sstable runs are cleaned up together. The procedure which executes cleanup was holding reference to all input sstables, such that it could later retry the same cleanup job on failure. Turns out it was not taking into account that incremental compaction will exhaust the input set incrementally. Therefore cleanup is affected by the 100% space overhead. To fix it, cleanup will now have the input set updated, by removing the sstables that were already cleaned up. On failure, cleanup will retry the same job with the remaining sstables that weren't exhausted by incremental compaction. New unit test reproduces the failure, and passes with the fix. Fixes #14035. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #14038	2023-05-31 06:46:12 +03:00
Benny Halevy	bb59687116	table: signal compaction_manager when staging sstables become eligible for cleanup perform_cleanup may be waiting for those sstables to become eligible for cleanup so signal it when table::move_sstables_from_staging detects an sstable that requires cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-17 11:33:22 +03:00
Benny Halevy	a5a8020ecd	compaction_manager: perform_cleanup: wait until all candidates are cleaned up cleanup_compaction should resolve only after all sstables that require cleanup are cleaned up. Since it is possible that some of them are in staging and therefore cannot be cleaned up, retry once a second until they become eligible. Timeout if there is no progress within 5 minutes to prevent hanging due to view building bug. Fixes #9559 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-17 11:31:07 +03:00
Benny Halevy	be4e23437f	compaction_manager: perform_cleanup: perform_offstrategy if needed It is possible that cleanup will be executed right after repair-based node operations, in which case we have a 5 minutes timer before off-strategy compaction is started. After marking the sstables that need cleanup, perform offstrategy compaction, if needed. This will implicitly cleanup those sstables as part of offstrategy compaction, before they are even passed for view update (if the table has views/secondary index). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-17 11:31:07 +03:00
Benny Halevy	53fbf9dd32	compaction_manager: perform_cleanup: update_sstables_cleanup_state in advance Scan all sstables to determine which of them requires cleanup before calling perform_task_on_all_files. This allows for cheaper no-op return when no sstable was identified as requiring cleanup, and also it will allow triggering offstrategy compaction if needed, after selecting the sstables for cleanup, in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-17 11:31:07 +03:00
Botond Dénes	3d75158fda	Merge 'Allow no owned token ranges in cleanup compaction' from Benny Halevy It is possible that a node will have no owned token ranges in some keyspaces based on their replication strategy, if the strategy is configured to have no replicas in this node's data center. In this case we should go ahead with cleanup that will effectively delete all data. Note that this is current very inefficient as we need to filter every partition and drop it as unowned. It can be optimized by either special casing this case or, better, use skip forward to the next owned range. This will skip to end-of-stream since there are no owned ranges. Fixes #13634 Also, add a respective rest_api unit test Closes #13849 * github.com:scylladb/scylladb: test: rest_api: test_storage_service: add test_storage_service_keyspace_cleanup_with_no_owned_ranges compaction_manager: perform_cleanup: handle empty owned ranges	2023-05-11 15:05:06 +03:00
Benny Halevy	2fc142279f	compaction_manager: perform_cleanup: hold on to sstable_set around yielding Updates to the compaction_group sstable sets are never done in place. Instead, the update is done on a mutable copy of the sstable set, and the lw_shared result is set back in the compaction_group. (see for example compaction_group::set_main_sstables) Therefore, there's currently a risk in perform_cleanup `get_sstables` lambda that if it yield while in set.for_each_sstable, the sstable_set might be replaced and the copy it is traversing may be destroyed. This was introduced in `c2bf0e0b72`. To prevent that, hold on to set.shared_from_this() around set.for_each_sstable. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #13852	2023-05-11 09:46:53 +03:00
Benny Halevy	c720754e37	compaction_manager: perform_cleanup: handle empty owned ranges It is possible that a node will have no owned token ranges in some keyspaces based on their replication strategy, if the strategy is configured to have no replicas in this node's data center. In this case we should go ahead with cleanup that will effectively delete all data. Note that this is current very inefficient as we need to filter every partition and drop it as unowned. It can be optimized by either special casing this case ot, better, use skip forward to the next owned range. This will skip to end-of-stream since there are no owned ranges. Fixes #13634 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-10 15:11:53 +03:00
Botond Dénes	022465d673	Merge 'Tone down offstrategy log message' from Benny Halevy In many cases we trigger offstrategy compaction opportunistically also when there's nothing to do. In this case we still print to the log lots of info-level message and call `run_offstrategy_compaction` that wastes more cpu cycles on learning that it has nothing to do. This change bails out early if the maintenance set is empty and prints a "Skipping off-strategy compaction" message in debug level instead. Fixes #13466 Also, add an group_id class and return it from compaction_group and table_state. Use that to identify the compaction_group / table_state by "ks_name.cf_name compaction_group=idx/total" in log messages. Fixes #13467 Closes #13520 * github.com:scylladb/scylladb: compaction_manager: print compaction_group id compaction_group, table_state: add group_id member compaction_manager: offstrategy compaction: skip compaction if no candidates are found	2023-05-02 08:05:18 +03:00
Benny Halevy	9768046d7c	compaction_manager: print compaction_group id Add a formatter to compaction::table_state that prints the table ks_name.cf_name and compaction group id. Fixes #13467 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-24 10:07:03 +03:00
Benny Halevy	1134ca2767	compaction_manager: offstrategy compaction: skip compaction if no candidates are found In many cases we trigger offstrategy compaction opportunistically also when there's nothing to do. In this case we still print to the log lots of info-level message and call `run_offstrategy_compaction` that wastes more cpu cycles on learning that it has nothing to do. This change bails out early if the maintenance set is empty and prints a "Skipping off-strategy compaction" message in debug level instead. Fixes #13466 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-24 09:23:32 +03:00
Benny Halevy	2f61de8f7b	table, compaction_manager: prevent cross shard access to owned_ranges_ptr Seen after `f1bbf705f9` in debug mode distributed_loader collect_all_shared_sstables copies compaction::owned_ranges_ptr (lw_shared_ptr<const dht::token_range_vector>) across shards. Since update_sstable_cleanup_state is synchronous, it can be passed a const refrence to the token_range_vector instead. It is ok to access the memory read-only across shards and since this happens on start-up, there are no special performance requirements. Fixes #13631 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 15:12:13 +03:00
Raphael S. Carvalho	9760149e8d	compaction: Don't bump compaction shares during major execution Commit `49892a0`, back in 2018, bumps the compaction shares by 200 to guarantee a minimum base line. However, after commit `e3f561d`, major compaction runs in maintenance group meaning that bumping shares became completely irrelevant and only causes regular compaction to be unnecessarily more aggressive. Fixes #13487. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #13488	2023-04-13 08:20:25 +03:00
Benny Halevy	4db961ecac	compaction_manager: compact_sstables: retrieve owned ranges if required If any of the sstables to-be-compacted requires cleanup, retrive the owned_ranges_ptr from the table_state. With that, staging sstables will eventually be cleaned up via regular compaction. Refs #9559 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:36:10 +03:00
Benny Halevy	d87925d9fc	compaction_manager: keep owned_ranges_ptr in compaction_state When perform_cleanup adds sstables to sstables_requiring_cleanup, also save the owned_ranges_ptr in the compaction_state so it could be used by other compaction types like regular, reshape, or major compaction. When the exhausted sstables are released, check if sstables_requiring_cleanup is empty, and if it is, clear also the owned_ranges_ptr. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:30:53 +03:00
Benny Halevy	c2bf0e0b72	compaction_manager: perform_cleanup: keep sstables in compaction_state::sstables_requiring_cleanup As a first step towards parallel cleanup by (regular) compaction and cleanup compaction, filter all sstables in perform_cleanup and keep the set of sstables in the compaction_state. Erase from that set when the sstables are unregistered from compaction. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:30:39 +03:00
Benny Halevy	b3192b9f16	compaction: refactor compaction_state out of compaction_manager To use it both from compaction_manager and compaction_descriptor in a following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:28:16 +03:00
Benny Halevy	690697961c	compaction_manager: compacting_sstable_registration: keep a ref to the compaction_state To be used for managing sstables requiring cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:18:02 +03:00
Benny Halevy	cac60a09ac	compaction_manager: refactor get_candidates Allow getting candidates for compaction from an arbitrary range of sstable, not only the in_strategy_sstables. To be used by perform_cleanup to mark all sstables that require cleanup, even if they can't be compacted at this time. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:16:57 +03:00
Benny Halevy	bbfe839a73	compaction_manager: get_candidates: mark as const Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:16:12 +03:00
Benny Halevy	6ebafe74b9	table, compaction_manager: add requires_cleanup Returns true iff any of the sstables in the set requries cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:14:36 +03:00
Benny Halevy	d0690b64c1	table, compaction_manager: add update_sstable_cleanup_state update_sstable_cleanup_state calls needs_cleanup and inserts (or erases) the sstable into the respective compaction_state.sstables_requiring_cleanup set. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:10:55 +03:00
Benny Halevy	1baca96de1	compaction_manager: needs_cleanup: delete unused schema param It isn't needed. The sstable already has a schema. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:03:53 +03:00
Benny Halevy	ac9f8486ba	compaction_manager: perform_cleanup: disallow empty sorted_owened_ranges I'm not sure why this was originally supported, maybe for upgrade sstables where we may want to rewrite the sstables without filtering any tokens, but perform_sstable_upgrade is now following a different code path and uses `rewrite_sstables` directly, without pigybacking on cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:03:03 +03:00
Benny Halevy	09df04c919	compaction: move owned_ranges into descriptor Move the owned_ranges_ptr, currently used only by cleanup and upgrade compactions, to the generic compaction descriptor so we apply cleanup in other compaction types. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:52:12 +03:00
Botond Dénes	9a02315c6b	Merge 'Compaction reevaluation bug fixes' from Raphael "Raph" Carvalho A problem in compaction reevaluation can cause the SSTable set to be left uncompacted for indefinite amount of time, potentially causing space and read amplification to be suboptimal. Two revaluation problems are being fixed, one after off-strategy compaction ended, and another in compaction manager which intends to periodically reevaluate a need for compaction. Fixes https://github.com/scylladb/scylladb/issues/13429. Fixes https://github.com/scylladb/scylladb/issues/13430. Closes #13431 * github.com:scylladb/scylladb: compaction: Make compaction reevaluation actually periodic replica: Reevaluate regular compaction on off-strategy completion	2023-04-05 13:51:21 +03:00
Raphael S. Carvalho	457c772c9c	replica: Make compaction_group responsible for deleting off-strategy compaction input Compaction group is responsible for deleting SSTables of "in-strategy" compactions, i.e. regular, major, cleanup, etc. Both in-strategy and off-strategy compaction have their completion handled using the same compaction group interface, which is compaction_group::table_state::on_compaction_completion(..., sstables::offstrategy offstrategy) So it's important to bring symmetry there, by moving the responsibility of deleting off-strategy input, from manager to group. Another important advantage is that off-strategy deletion is now throttled and gated, allowing for better control, e.g. table waiting for deletion on shutdown. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #13432	2023-04-05 08:37:48 +03:00
Raphael S. Carvalho	156ac0a67a	compaction: Make compaction reevaluation actually periodic The manager intended to periodically reevaluate compaction need for each registered table. But it's not working as intended. The reevaluation is one-off. This means that compaction was not kicking in later for a table, with low to none write activity, that had expired data 1 hour from now. Also make sure that reevaluation happens within the compaction scheduling group. Fixes #13430. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-04-04 09:16:19 -03:00
Botond Dénes	8167f11a23	Merge 'Move compaction manager tasks out of compaction manager' from Aleksandra Martyniuk Task manager compaction tasks that cover compaction group compaction need access to compaction_manager::tasks. To avoid circular dependency and be able to rely on forward declaration, task needs to be moved out of compaction manager. To avoid naming confusion compaction_manager::task is renamed. Closes #13226 * github.com:scylladb/scylladb: compaction: use compaction namespace in compaction_manager.cc compaction: rename compaction::task compaction: move compaction_manager::task out of compaction manager compaction: move sstable_task definition to source file	2023-04-03 15:40:42 +03:00
Aleksandra Martyniuk	0ceee3e4b3	compaction: use compaction namespace in compaction_manager.cc	2023-03-29 15:28:14 +02:00
Aleksandra Martyniuk	d7d570e39d	compaction: rename compaction::task To avoid confusion with task manager tasks compaction::task is renamed to compaction::compaction_task_exector. All inheriting classes are modified similarly.	2023-03-29 15:23:18 +02:00
Aleksandra Martyniuk	f24391fbe4	compaction: move compaction_manager::task out of compaction manager compaction_manager::task needs to be accessed from task manager compaction tasks. Thus, compaction_manager::task and all inheriting classes are moved from compaction manager to compaction namespace.	2023-03-29 15:21:24 +02:00
Aleksandra Martyniuk	37cafec9d5	compaction: move sstable_task definition to source file	2023-03-29 14:53:43 +02:00
Raphael S. Carvalho	1ffe2f04ef	compaction: add table_state param to compaction_strategy::notify_completion() once compaction_strategy is made staless, the state must be retrieved in notify_completion() through table_state. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 13:40:02 -03:00
Avi Kivity	32cc975b2f	compaction: error on invalid scrub type gcc allows an enum to contain a value outside its enum set, so we need to handle it. Since it shouldn't happen, signal an internal error.	2023-03-21 13:42:49 +02:00
Botond Dénes	dc3d47b1e4	Merge 'Get compaction history without using qctx' from Pavel Emelyanov There are two methods to mess with compaction history -- update and get. The former had been patched to use local system-keyspace instance by `907fd2d3` (system_keyspace: De-static compaction history update) now it's time for the latter (spoiler: it's only used by the API handler) Closes #12889 * github.com:scylladb/scylladb: system_keyspace; Make get_compaction_history non static and drop qctx api, compaction_manager: Get compaction history via manager system_keyspace: Move compaction_history_entry to namespace scope	2023-02-16 19:05:48 +02:00
Pavel Emelyanov	52f69643b6	api, compaction_manager: Get compaction history via manager Right now the API handler directly calls static method from system keyspace. Patching it to call compaction manager instead will let the latter use on-board plugged system keyspace for that. If the system keyspace is not plugged, it means early boot or late shutdown, not a good time to get compaction history anyway. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-02-16 11:27:38 +03:00
Kefu Chai	0cb842797a	treewide: do not define/capture unused variables these warnings are found by Clang-17 after removing `-Wno-unused-lambda-capture` and '-Wno-unused-variable' from the list of disabled warnings in `configure.py`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-02-15 22:57:18 +02:00
Kefu Chai	d4315245a1	main: use defer_verbose_shutdown() to shutdown compaction manager * use `defer_verbose_shutdown()` to shutdown compaction manager `EDQUOT` is quite similar as `ENOSPC`, in the sense that both of them are caused by environmental issues. before this change, `compaction_manager` filters the ENOSPC exceptions thrown by `compaction_manager::really_do_stop()`, so they are not propagated to caller when calling `compaction_manager::stop()` -- only a warning message is printed in the log. but `EDQUOT` is not handled. after this change, the exception raised by compaction manager's stop process is not filtered anymore and is handled by `defer_verbose_shutdown()` instead, which is able to check the type of exception, and print out error message in the log. so the `ENOSPC` and `EDQUOT` errors are taken care of, and more visible from user's perspective as they are printed as errors instead of warning. but they are not printed using the `compaction_manager` logger anymore. so if our testing or user's workflow depends on this behavior, the related setting should be updated accordingly. Fixes #12626 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-02-07 16:00:40 +08:00
Botond Dénes	511c0123a2	Merge 'Add compaction module to task manager' from Aleksandra Martyniuk Introduces task manager's compaction module. That's an initial part of integration of compaction with task manager. When fully integrated, task manager will allow user to track compaction operations, check status and progress of each individual one. It will help with creating an asynchronous version of rest api that forces any compaction. Currently, users can see with /task_manager/list_modules api call that compaction is one of the modules accessible through task manager. They won't get any additional information though, since compaction tasks are not created yet. A shared_ptr to compaction module is kept in compaction manager. Closes #12635 * github.com:scylladb/scylladb: compaction: test: pass task_manager to compaction_manager in test environment compaction: create and register task manager's module for compaction tasks: add task_manager constructor without arguments	2023-02-06 09:25:05 +02:00
Aleksandra Martyniuk	12789adb95	compaction: test: pass task_manager to compaction_manager in test environment Each instance of compaction manager should have compaction module pointer initialized. All contructors get task_manager reference with which the module is created.	2023-02-03 15:15:11 +01:00

1 2 3 4 5 ...

317 Commits