scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 17:40:34 +00:00

Author	SHA1	Message	Date
Aleksandra Martyniuk	634c0d44ef	tasks: fix tasks abort Currently if task_manager::task::impl::abort preempts before children are recursively aborted and then the task gets unregistered, we hit use after free since abort uses children vector which is no longer alive. Modify abort method so that it goes over all tasks in task manager and aborts those with the given parent. Fixes: #19304. (cherry picked from commit `3463f495b1`)	2024-06-20 14:47:14 +00:00
Kefu Chai	eb9216ef11	compaction: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16707	2024-01-10 11:07:36 +02:00
Raphael S. Carvalho	dd1a6d6309	compaction: Add splitting compaction task to manager The task for splitting compaction will run until all sstables in the main set are split. The only exceptions are shutdown or user has explicitly asked for abort. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-12-17 11:40:09 -03:00
Raphael S. Carvalho	c96938c49b	compaction: remove scrub-specific code from rewrite_sstables_compaction_task_executor Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-12-17 11:40:09 -03:00
Yaniv Kaul	c658bdb150	Typos: fix typos in comments Fixes some typos as found by codespell run on the code. In this commit, I was hoping to fix only comments, not user-visible alerts, output, etc. Follow-up commits will take care of them. Refs: https://github.com/scylladb/scylladb/issues/16255 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com>	2023-12-02 22:37:22 +02:00
Aleksandra Martyniuk	aa7bba2d8b	compaction: abort task manager compaction tasks Set top level compaction tasks as abortable. Compaction tasks which have no children, i.e. compaction task executors, have abort method overriden to stop compaction data.	2023-11-24 15:44:34 +01:00
Botond Dénes	0ae1335daa	Revert "Merge 'compaction: abort compaction tasks' from Aleksandra Martyniuk" This reverts commit `11cafd2fc8`, reversing changes made to `2bae14f743`. Reverting because this series causes frequent CI failures, and the proposed quickfix causes other failures of its own. Fixes: #16113	2023-11-22 17:44:07 +02:00
Botond Dénes	11cafd2fc8	Merge 'compaction: abort compaction tasks' from Aleksandra Martyniuk Compaction tasks which do not have a parent are abortable through task manager. Their children are aborted recursively. Compaction tasks of the lowest level are aborted using existing compaction task executors stopping mechanism. Closes scylladb/scylladb#16050 * github.com:scylladb/scylladb: test: test abort of compaction task that isn't started yet test: test running compaction task abort tasks: fail if a task was aborted compaction: abort task manager compaction tasks	2023-11-14 14:55:17 +02:00
Aleksandra Martyniuk	599d6ebd52	compaction: abort task manager compaction tasks Set top level compaction tasks as abortable. Compaction tasks which have no children, i.e. compaction task executors, have abort method overriden to stop compaction data.	2023-11-13 15:46:58 +01:00
Pavel Emelyanov	f4696f21a8	test/utils: Drop compaction_manager_test This class only provides a .run() method which allocates a task and calls sstables::test_env::perform_compaction(). This can be done in a helper method, no need for the whole class for it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-11-13 11:44:51 +03:00
Pavel Emelyanov	9fd270566a	test/sstables: Introduce test_env_compaction_manager::perform_compaction() Take it from compaction_manager_test::run() which is simplified overwite of the compaction_manager::perform_compaction(). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-11-13 11:44:51 +03:00
Pavel Emelyanov	aec3fc493a	test/utils: Move compaction_manager_test::propagate_replacement() The purpose of this method is to turn public the private compaction_manager method of the same name. The caller of this method is having sstable_test_env at hand with its test_env_compaction_manager, so the de-private-isation call can be moved. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-11-13 11:44:51 +03:00
Botond Dénes	1cccc86813	Revert "Merge 'compaction: abort compaction tasks' from Aleksandra Martyniuk" This reverts commit `2860d43309`, reversing changes made to `a3621dbd3e`. Reverting because rest_api.test_compaction_task started failing after this was merged. Fixes: #16005	2023-11-09 10:43:11 +01:00
Botond Dénes	2860d43309	Merge 'compaction: abort compaction tasks' from Aleksandra Martyniuk Compaction tasks which do not have a parent are abortable through task manager. Their children are aborted recursively. Compaction tasks of the lowest level are aborted using existing compaction task executors stopping mechanism. Closes scylladb/scylladb#15083 * github.com:scylladb/scylladb: test: test abort of compaction task that isn't started yet test: test running compaction task abort tasks: fail if a task was aborted compaction: abort task manager compaction tasks	2023-11-08 08:45:16 +02:00
Benny Halevy	a1acf6854b	everywhere: reduce dependencies on i_partitioner.hh Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-11-05 20:47:44 +02:00
Aleksandra Martyniuk	0681795417	compaction: abort task manager compaction tasks Set top level compaction tasks as abortable. Compaction tasks which have no children, i.e. compaction task executors, have abort method overriden to stop compaction data.	2023-10-19 10:47:17 +02:00
Aleksandra Martyniuk	3553556708	compaction: keep compaction_progress_monitor in compaction_task_executor Keep compaction_progress_monitor in compaction_task_executor and pass a reference to it further, so that the compaction progress could be retrieved out of it.	2023-10-12 17:03:46 +02:00
Aleksandra Martyniuk	f42be12f43	repair: release resources of shard_repair_task_impl Before integration with task manager the state of one shard repair was kept in repair_info. repair_info object was destroyed immediately after shard repair was finished. In an integration process repair_info's fields were moved to shard_repair_task_impl as the two served the similar purposes. Though, shard_repair_task_impl isn't immediately destoyed, but is kept in task manager for task_ttl seconds after it's complete. Thus, some of repair_info's fields have their lifetime prolonged, which makes the repair state change delayed. Release shard_repair_task_impl resources immediately after shard repair is finished. Fixes: #15505. Closes scylladb/scylladb#15506	2023-09-26 17:09:47 +03:00
Raphael S. Carvalho	ff8510445d	compaction: Make get_candidates() work with frozen_sstable_run too This is done in preparation for ICS to retrieve candidates as sstable runs. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-09-25 17:18:21 -03:00
Avi Kivity	61440d20c3	Merge 'Enable incremental compaction on off-strategy' from Raphael "Raph" Carvalho Off-strategy suffers with a 100% space overhead, as it adopted a sort of all or nothing approach. Meaning all input sstables, living in maintenance set, are kept alive until they're all reshaped according to the strategy criteria. Input sstables in off-strategy are very likely to be mostly disjoint, so it can greatly benefit from incremental compaction. The incremental compaction approach is not only good for decreasing disk usage, but also memory usage (as metadata of input and output live in memory), and file desc count, which takes memory away from OS. Turns out that this approach also greatly simplifies the off-strategy impl in compaction manager, as it no longer have to maintain new unused sstables and mark them for deletion on failure, and also unlink intermediary sstables used between reshape rounds. Fixes https://github.com/scylladb/scylladb/issues/14992. Closes scylladb/scylladb#15400 * github.com:scylladb/scylladb: test: Verify that off-strategy can do incremental compaction compaction: Clear pending_replacement list when tombstone GC is disabled compaction: Enable incremental compaction on off-strategy compaction: Extend reshape type to allow for incremental compaction compaction: Move reshape_compaction in the source compaction: Enable incremental compaction only if replacer callback is engaged	2023-09-21 20:12:19 +03:00
Raphael S. Carvalho	42050f13a0	compaction: Enable incremental compaction on off-strategy Off-strategy suffers with a 100% space overhead, as it adopted a sort of all or nothing approach. Meaning all input sstables, living in maintenance set, are kept alive until they're all reshaped according to the strategy criteria. Input sstables in off-strategy are very likely to mostly disjoint, so it can greatly benefit from incremental compaction. The incremental compaction approach is not only good for decreasing disk usage, but also memory usage (as metadata of input and output live in memory), and file desc count, which takes memory away from OS. Turns out that this approach also greatly simplifies the off-strategy impl in compaction manager, as it no longer have to maintain new unused sstables and mark them for deletion on failure, and also unlink intermediary sstables used between reshape rounds. Fixes #14992. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-09-21 11:15:46 -03:00
Aleksandra Martyniuk	932f39e37c	compaction: warn about compaction_done() compaction_done() returns ready future before compaction_task_executor::run_compaction() even though the compaction did not start. Make compaction_done() private and add a comment to warn against incorrect usage.	2023-09-09 11:19:11 +02:00
Aleksandra Martyniuk	515b8d4890	compaction: modify lowest compaction tasks' run method For compaction_task_executors, unlike for all other task manager tasks, run method does not embrace operations performed in a scope of a task, but only waits until shared_future connected with the operations is resolved. Apart from breaking task manager task conventions, such a run method must consider all corner cases, not to break task manager or compaction manager functionality. To fix existing and prevent further bugs related to task manager and compaction manager coexistence, call perform_task inside run method and wait for it in a standard way. Executors that are not going to be reflected in task manager run call perform_task the old way.	2023-09-09 11:19:11 +02:00
Aleksandra Martyniuk	832df38d26	compaction: pass do_throw_if_stopping to compaction_task_executor As a preparation for further changes, keep do_throw_if_stopping flag as a member of compaction_task_executor.	2023-09-09 11:19:11 +02:00
Benny Halevy	96055414c7	compaction_manager: gracefully handle gate close Check if the compaction_state gate is closed along with _state != state::enabled and return early in this case. At this point entering the gate is guaranteed to succeed. So enter the gate before calling `perform_compaction` keeping the std::optional<gate_holder> throughout the compaction task. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-09-05 09:17:25 +03:00
Aleksandra Martyniuk	e0ce711e4f	compaction: do not swallow compaction_stopped_exception for reshape Loop in shard_reshaping_compaction_task_impl::run relies on whether sstables::compaction_stopped_exception is thrown from run_custom_job. The exception is swallowed for each type of compaction in compaction_manager::perform_task. Rethrow an exception in perfrom task for reshape compaction. Fixes: #15058. Closes #15067	2023-08-21 12:41:55 +03:00
Aleksandra Martyniuk	e9d94894f1	compaction: release resources of compaction executors Before compaction task executors started inheriting from compaction_task_impl, they were destructed immediately after compaction finished. Destructors of executors and their fields performed actions that affected global structures and statistics and had impact on compaction process. Currently, task executors are kept in memory much longer, as their are tracked by task manager. Thus, destructors are not called just after the compaction, which results in compaction stats not being updated, which causes e.g. infinite cleanup loop. Add release_resources() method which is called at the end of compaction process and does what destructors used to. Fixes: #14966. Fixes: #15030. Closes #15005	2023-08-16 15:51:17 +03:00
Aleksandra Martyniuk	db932c7106	compaction: hold gate immediately after task executor is created If make_task call in compaction_manager::perform_compaction yields, compaction_task_executor::_compaction_state may be gone and gate won't be held. Hold gate immediately after compaction_task_executor is created. Add comment not to call prepare_task without preparation. Refs: #14971. Fixes: #14977. Closes #14999	2023-08-11 13:56:38 +02:00
Botond Dénes	108e510a23	Merge 'Update sstable_requiring_cleanup on compaction completion' from Benny Halevy Currently `sstable_requiring_cleanup` is updated using `compacting_sstable_registration`, but that mechanism is not used by offstrategy compaction, leading to #14304. This series introduces `compaction_manager::on_compaction_completion` that intercepts the call to the table::on_compaction_completion. This allows us to update `sstable_requiring_cleanup` right before the compacted sstables are deleted, making sure they are no leaked to `sstable_requiring_cleanup`, which would hold a reference to them until cleanup attempts to clean them up. `cleanup_incremental_compaction_test` was adjusted to observe the sstables `on_delete` (by adding a new observer event) to detect the case where cleanup attempts to delete the leaked sstables and fails since they were already deleted from the file system by offstrategy compaction. The test fails with the fix and passes with it. Fixes #14304 Closes #14858 * github.com:scylladb/scylladb: compaction_manager: on_compaction_completion: erase sstables from sstables_requiring_cleanup compaction/leveled_compaction_strategy: ideal_level_for_input: special case max_sstable_size==0 sstable: add on_delete observer compaction_manager: add on_compaction_completion sstable_compaction_test: cleanup_incremental_compaction_test: verify sstables_requiring_cleanup is empty	2023-08-09 11:03:45 +03:00
Kefu Chai	47c9b25bac	compaction_manager: correct comment on compaction_task_executor::state when it comes to `regular_compaction_task_executor`, we repeat the compaction until the compaction can not proceed, so after an iteration of compaction completes successfully, the task can still continue with yet another round of the compaction as it sees appropriate. so let's update the comment to reflect this fact. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14891	2023-08-09 09:49:18 +03:00
Benny Halevy	df66895080	compaction_manager: add on_compaction_completion Pass the call to the table on_compaction_completion so we can manage the sstables requiring cleanup state along the way. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-08-08 08:12:05 +03:00
Benny Halevy	ea64ae54f8	sstable_compaction_test: cleanup_incremental_compaction_test: verify sstables_requiring_cleanup is empty Make sure that there are no sstables_requiring_cleanup after cleanup compaction. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-08-08 08:12:01 +03:00
Botond Dénes	4d538e1363	Merge 'Task manager tasks covering compaction group compaction' from Aleksandra Martyniuk All compaction task executors, except for regular compaction one, become task manager compaction tasks. Creating and starting of major_compaction_task_executor is modified to be consistent with other compaction task executors. Closes #14505 * github.com:scylladb/scylladb: test: extend test_compaction_task.py to cover compaction group tasks compaction: turn custom_task_executor into compaction_task_impl compaction: turn sstables_task_executor into sstables_compaction_task_impl compaction: change sstables compaction tasks type compaction: move table_upgrade_sstables_compaction_task_impl compaction: pass task_info through sstables compaction compaction: turn offstrategy_compaction_task_executor into offstrategy_compaction_task_impl compaction: turn cleanup_compaction_task_executor into cleanup_compaction_task_impl comapction: use optional task info in major compaction compaction: use perform_compaction in compaction_manager::perform_major_compaction	2023-08-04 10:11:00 +03:00
Kefu Chai	6c66030b7b	compaction: add formatter for compaction_task_executor add fmt formatter for `compaction_task_executor::state` and `compaction_task_executor` and its derived classes. this is a part of a series to migrating from `operator<<(ostream&, ..)` based formatting to fmtlib based formatting. the goal here is to enable fmtlib to print `compaction_task_executor`, its derived classes and `compaction_task_executor::state` without the help of `operator<<`. since all of the callers of 'operator<<' of these types now use formatters, the operator<< are removed in this change. the helpers like `to_string()` and `describe()` are removed as well, as it'd be more consistent if we always use fmtlib for formatting instead of inventing APIs with different names. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14906	2023-08-02 09:15:43 +03:00
Aleksandra Martyniuk	139e147ae1	compaction: turn custom_task_executor into compaction_task_impl custom_task_executor inherits both from compaction_task_executor and compaction_task_impl.	2023-07-28 10:51:55 +02:00
Aleksandra Martyniuk	1853a5a355	compaction: turn sstables_task_executor into sstables_compaction_task_impl sstables_task_executor inherits both from compaction_task_executor and sstables_compaction_task_impl. Delete unused perform_task_on_all_files version.	2023-07-28 10:51:55 +02:00
Aleksandra Martyniuk	71db8645d5	compaction: pass task_info through sstables compaction	2023-07-28 10:51:55 +02:00
Aleksandra Martyniuk	4e439ac957	compaction: turn offstrategy_compaction_task_executor into offstrategy_compaction_task_impl offstrategy_compaction_task_executor inherits both from compaction_task_executor and offstrategy_compaction_task_impl.	2023-07-28 10:51:55 +02:00
Aleksandra Martyniuk	92f2987217	compaction: turn cleanup_compaction_task_executor into cleanup_compaction_task_impl cleanup_compaction_task_executor inherits both from compaction_task_executor and cleanup_compaction_task_impl. Add a new version of compaction_manager::perform_task_on_all_files which accepts only the tasks that are derived from compaction_task_impl. After all task executors' conversions are done, the new version replaces the original one.	2023-07-28 10:48:58 +02:00
Aleksandra Martyniuk	8317e4dd7f	comapction: use optional task info in major compaction To make it consistent with the upcoming methods, methods triggering major compaction get std::optional<tasks::task_info> as an argument. Thanks to that we can distinguish between a task that has no parent and the task which won't be registered in task manager.	2023-07-28 09:25:21 +02:00
Kefu Chai	1b7bde2e9e	compaction_manager: use range in compacting_sstable_registration simpler than the "begin, end" iterator pair. and also tighten the type constraints, now require the value type to be sstables::shared_sstable. this matches what we are expecting in the implementation. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14678	2023-07-27 09:40:20 +03:00
Kefu Chai	fdf61d2f7c	compaction_manager: prevent gc-only sstables from being compacted before this change, there are chances that the temporary sstables created for collecting the GC-able data create by a certain compaction can be picked up by another compaction job. this wastes the CPU cycles, adds write amplification, and causes inefficiency. in general, these GC-only SSTables are created with the same run id as those non-GC SSTables, but when a new sstable exhausts input sstable(s), we proactively replace the old main set with a new one so that we can free up the space as soon as possible. so the GC-only SSTables are added to the new main set along with the non-GC SSTables, but since the former have good chance to overlap the latter. these GC-only SSTables are assigned with different run ids. but we fail to register them to the `compaction_manager` when replacing the main sstable set. that's why future compactions pick them up when performing compaction, when the compaction which created them is not yet completed. so, in this change, * to prevent sstables in the transient stage from being picked up by regular compactions, a new interface class is introduced so that the sstable is always added to registration before it is added to sstable set, and removed from registration after it is removed from sstable set. the struct helps to consolidate the regitration related logic in a single place, and helps to make it more obvious that the timespan of an sstable in the registration should cover that in the sstable set. * use a different run_id for the gc sstable run, as it can overlap with the output sstable run. the run_id for the gc sstable run is created only when the gc sstable writer is created. because the gc sstables is not always created for all compactions. please note, all (indirect) callers of `compaction_task_executor::compact_sstables()` passes a non-empty `std::function` to this function, so there is no need to check for empty before calling it. so in this change, the check is dropped. Fixes #14560 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14725	2023-07-20 15:47:48 +03:00
Aleksandra Martyniuk	e3b068be4d	compaction: add compaction_manager::perform_compaction method	2023-07-17 15:54:33 +02:00
Aleksandra Martyniuk	33cb156ee3	compaction: switch state after compaction is done Compaction task executors which inherit from compaction_task_impl may stay in memory after the compaction is finished. Thus, state switch cannot happen in destructor. Switch state to none in perform_task defer.	2023-07-17 15:54:33 +02:00
Kefu Chai	057701299c	compaction_manager: remove unnecessary include also, remove unnecessary forward declarations. * compaction_manager_test_task_executor is only referenced in the friend declaration. but this declaration does not need a forward declaration of the friend class * compaction_manager_test_task_executor is not used anywhere. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14680	2023-07-13 14:59:39 +03:00
Aleksandra Martyniuk	4922f4cf80	compaction: take gate holder out of task executor In the following commits, classes deriving from compaction_task_executor will be alive longer than they are kept in compaction_manager::_tasks. Thus, the compaction_task_executor::_gate_holder would be held, blocking other compactions. compaction_task_executor::_gate_holder is moved outside of compaction_task_executor object.	2023-06-20 12:12:45 +02:00
Aleksandra Martyniuk	e317ffe23a	compaction: extend signature of some methods Extend a signature of table::compact_all_sstables and compaction_manager::perform_major_compaction so that they get the info of a covering task. This allows to easily create child tasks that cover compaction group compaction.	2023-06-20 10:45:34 +02:00
Aleksandra Martyniuk	3007fbeee3	compaction: rename compaction_task_executor methods compaction_task_executor methods are renamed to prevent name colisions between compaction_task_executor and tasks::task_manager::task::impl.	2023-06-20 10:45:34 +02:00
Raphael S. Carvalho	156d771101	compaction: Fix sstable cleanup after resharding on refresh Problem can be reproduced easily: 1) wrote some sstables with smp 1 2) shut down scylla 3) moved sstables to upload 4) restarted scylla with smp 2 5) ran refresh (resharding happens, adds sstable to cleanup set and never removes it) 6) cleanup (tries to cleanup resharded sstables which were leaked in the cleanup set) Bumps into assert "Assertion `!sst->is_shared()' failed", as cleanup picks a shared sstable that was leaked and already processed by resharding. Fix is about not inserting shared sstables into cleanup set, as shared sstables are restricted to resharding and cannot be processed later by cleanup (nor it should because resharding itself cleaned up its input files). Dtest: https://github.com/scylladb/scylla-dtest/pull/3206 Fixes #14001. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #14147	2023-06-06 12:14:03 +03:00
Benny Halevy	a5a8020ecd	compaction_manager: perform_cleanup: wait until all candidates are cleaned up cleanup_compaction should resolve only after all sstables that require cleanup are cleaned up. Since it is possible that some of them are in staging and therefore cannot be cleaned up, retry once a second until they become eligible. Timeout if there is no progress within 5 minutes to prevent hanging due to view building bug. Fixes #9559 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-17 11:31:07 +03:00

1 2 3 4 5

219 Commits