scylladb

Author	SHA1	Message	Date
Botond Dénes	493b6bc65f	Merge 'Guard tables in compaction tasks' from Benny Halevy Currently, if a compaction function enters the table or compaction_group async_gate, we can't stop it on the table/compaction_group stop path as they co_await their respective async_gate.close(). This series introduces a table_ptr smart pointer to guards the table object by entering its async_gate, and it also defers awaiting the gate.close future till after stopping ongoing compaction so that closing the gate will prevent starting new compactions while ongoing compaction can be stopped and finally awaiting the close() future will wait for them to unwind and exit the gate after being stopped. Fixes #16305 Closes scylladb/scylladb#16351 * github.com:scylladb/scylladb: compaction: run_on_table: skip compaction also on gate_closed_exception compaction: run_on_table: hold table table: add table_holder and hold method table: stop: allow compactions to be stopped while closing async_gate	2023-12-12 12:50:17 +02:00
Benny Halevy	7843025a53	compaction: run_on_table: skip compaction also on gate_closed_exception Similar to the no_such_column_family error, gate_closed_exception indicates that the table is stopped and we should skip compaction on it gracefully. Fixes #16305 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-12-12 08:46:37 +02:00
Benny Halevy	92c718c60a	compaction: run_on_table: hold table To ensure the table will not be dropped while the compaction task is ongoing. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-12-12 08:45:59 +02:00
Aleksandra Martyniuk	ceec5577d8	api: compaction: pass pointer to top level compaction tasks As a preparation for asynchronous compaction api, from which we cannot take values by reference, top level compaction tasks get pointers which need to be set to nullptr when they are not needed (like in async api).	2023-12-11 11:36:10 +01:00
Kefu Chai	f483309165	compaction, api: drop unused functions run_on_existing_tables() is not used at all. and we have two of them. in this change, let's drop them. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16304	2023-12-06 14:31:08 +02:00
Yaniv Kaul	c658bdb150	Typos: fix typos in comments Fixes some typos as found by codespell run on the code. In this commit, I was hoping to fix only comments, not user-visible alerts, output, etc. Follow-up commits will take care of them. Refs: https://github.com/scylladb/scylladb/issues/16255 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com>	2023-12-02 22:37:22 +02:00
Benny Halevy	b12b142232	api: add /storage_service/compact For major compacting all tables in the database. The advantage of this api is that `commitlog->force_new_active_segment` happens only once in `database::flush_all_tables` rather than once per keyspace (when `nodetool compact` translates to a sequence of `/storage_service/keyspace_compaction` calls). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-11-28 16:37:42 +02:00
Benny Halevy	66ba983fe0	compaction_manager: flush_all_tables before major compaction Major compaction already flushes each table to make sure it considers any mutations that are present in the memtable for the purpose of tombstone purging. See `64ec1c6ec6` However, tombstone purging may be inhibited by data in commitlog segments based on `gc_time_min` in the `tombstone_gc_state` (See `f42eb4d1ce`). Flushing all sstables in the database release all references to commitlog segments and there it maximizes the potential for tombstone purging, which is typically the reason for running major compaction. However, flushing all tables too frequently might result in tiny sstables. Since when flushing all keyspaces using `nodetool flush` the `force_keyspace_compaction` api is invoked for keyspace successively, we need a mechanism to prevent too frequent flushes by major compaction. Hence a `compaction_flush_all_tables_before_major_seconds` interval configuration option is added (defaults to 24 hours). In the case that not all tables are flushed prior to major compaction, we revert to the old behavior of flushing each table in the keyspace before major-compacting it. Fixes scylladb/scylladb#15777 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-11-28 16:37:42 +02:00
Benny Halevy	1fd85bd37b	api: compaction: add flush_memtables option When flushing is done externally, e.g. by running `nodetool flush` prior to `nodetool compact`, flush_memtables=false can be passed to skip flushing of tables right before they are major-compacted. This is useful to prevent creation of small sstables due to excessive memtable flushing. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-11-28 16:37:42 +02:00
Aleksandra Martyniuk	9c2c964b8e	test: test abort of compaction task that isn't started yet Test whether a task which parent was aborted has a proper status.	2023-11-24 19:25:27 +01:00
Aleksandra Martyniuk	aa7bba2d8b	compaction: abort task manager compaction tasks Set top level compaction tasks as abortable. Compaction tasks which have no children, i.e. compaction task executors, have abort method overriden to stop compaction data.	2023-11-24 15:44:34 +01:00
Botond Dénes	0ae1335daa	Revert "Merge 'compaction: abort compaction tasks' from Aleksandra Martyniuk" This reverts commit `11cafd2fc8`, reversing changes made to `2bae14f743`. Reverting because this series causes frequent CI failures, and the proposed quickfix causes other failures of its own. Fixes: #16113	2023-11-22 17:44:07 +02:00
Aleksandra Martyniuk	6af581301b	test: test abort of compaction task that isn't started yet Test whether a task which parent was aborted has a proper status.	2023-11-14 10:36:38 +01:00
Aleksandra Martyniuk	599d6ebd52	compaction: abort task manager compaction tasks Set top level compaction tasks as abortable. Compaction tasks which have no children, i.e. compaction task executors, have abort method overriden to stop compaction data.	2023-11-13 15:46:58 +01:00
Botond Dénes	1cccc86813	Revert "Merge 'compaction: abort compaction tasks' from Aleksandra Martyniuk" This reverts commit `2860d43309`, reversing changes made to `a3621dbd3e`. Reverting because rest_api.test_compaction_task started failing after this was merged. Fixes: #16005	2023-11-09 10:43:11 +01:00
Aleksandra Martyniuk	56221f2161	test: test abort of compaction task that isn't started yet Test whether a task which parent was aborted has a proper status.	2023-10-19 10:47:20 +02:00
Aleksandra Martyniuk	0681795417	compaction: abort task manager compaction tasks Set top level compaction tasks as abortable. Compaction tasks which have no children, i.e. compaction task executors, have abort method overriden to stop compaction data.	2023-10-19 10:47:17 +02:00
Aleksandra Martyniuk	198119f737	compaction: add get_progress method to compaction_task_impl compaction_task_impl::get_progress is used by the lowest level compaction tasks which progress can be taken from compaction_progress_monitor.	2023-10-12 17:16:05 +02:00
Aleksandra Martyniuk	3553556708	compaction: keep compaction_progress_monitor in compaction_task_executor Keep compaction_progress_monitor in compaction_task_executor and pass a reference to it further, so that the compaction progress could be retrieved out of it.	2023-10-12 17:03:46 +02:00
Aleksandra Martyniuk	d799adc536	tasks: change task_manager::task::impl::is_internal() Most of the time only the roots of tasks tree should be non internal. Change default implementation of is_internal and delete overrides consistent with it. Closes scylladb/scylladb#15353	2023-09-26 14:49:49 +03:00
Aleksandra Martyniuk	e0ce711e4f	compaction: do not swallow compaction_stopped_exception for reshape Loop in shard_reshaping_compaction_task_impl::run relies on whether sstables::compaction_stopped_exception is thrown from run_custom_job. The exception is swallowed for each type of compaction in compaction_manager::perform_task. Rethrow an exception in perfrom task for reshape compaction. Fixes: #15058. Closes #15067	2023-08-21 12:41:55 +03:00
Aleksandra Martyniuk	139e147ae1	compaction: turn custom_task_executor into compaction_task_impl custom_task_executor inherits both from compaction_task_executor and compaction_task_impl.	2023-07-28 10:51:55 +02:00
Aleksandra Martyniuk	71db8645d5	compaction: pass task_info through sstables compaction	2023-07-28 10:51:55 +02:00
Aleksandra Martyniuk	4e439ac957	compaction: turn offstrategy_compaction_task_executor into offstrategy_compaction_task_impl offstrategy_compaction_task_executor inherits both from compaction_task_executor and offstrategy_compaction_task_impl.	2023-07-28 10:51:55 +02:00
Aleksandra Martyniuk	92f2987217	compaction: turn cleanup_compaction_task_executor into cleanup_compaction_task_impl cleanup_compaction_task_executor inherits both from compaction_task_executor and cleanup_compaction_task_impl. Add a new version of compaction_manager::perform_task_on_all_files which accepts only the tasks that are derived from compaction_task_impl. After all task executors' conversions are done, the new version replaces the original one.	2023-07-28 10:48:58 +02:00
Aleksandra Martyniuk	77dcdd743e	compaction: add shard_reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction on one shard.	2023-07-19 17:19:10 +02:00
Aleksandra Martyniuk	f73178a114	compaction: invoke resharding on sharded database In reshard_sstables_compaction_task_impl::run() we call sharded<sstables::sstable_directory>::invoke_on_all. In lambda passed to that method, we use both sharded sstable_directory service and its local instance. To make it straightforward that sharded and local instances are dependend, we call sharded<replica::database>::invoke_on_all instead and access local directory through the sharded one.	2023-07-19 17:19:10 +02:00
Aleksandra Martyniuk	fa10c352a1	compaction: move run_resharding_jobs into reshard_sstables_compaction_task_impl::run()	2023-07-19 17:19:10 +02:00
Aleksandra Martyniuk	7a7e287d8c	compaction: add reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction. A struct and some functions are moved from replica/distributed_loader.cc to compaction/task_manager_module.cc.	2023-07-19 17:15:40 +02:00
Michał Chojnowski	b511d57fc8	Revert "Merge 'Compaction resharding tasks' from Aleksandra Martyniuk" This reverts commit `2a58b4a39a`, reversing changes made to `dd63169077`. After patch `87c8d63b7a`, table_resharding_compaction_task_impl::run() performs the forbidden action of copying a lw_shared_ptr (_owned_ranges_ptr) on a remote shard, which is a data race that can cause a use-after-free, typically manifesting as allocator corruption. Note: before the bad patch, this was avoided by copying the _contents_ of the lw_shared_ptr into a new, local lw_shared_ptr. Fixes #14475 Fixes #14618 Closes #14641	2023-07-11 19:11:37 +03:00
Aleksandra Martyniuk	87c8d63b7a	compaction: add shard_reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction on one shard.	2023-06-28 11:43:12 +02:00
Aleksandra Martyniuk	db6e4a356b	compaction: invoke resharding on sharded database In reshard_sstables_compaction_task_impl::run() we call sharded<sstables::sstable_directory>::invoke_on_all. In lambda passed to that method, we use both sharded sstable_directory service and its local instance. To make it straightforward that sharded and local instances are dependend, we call sharded<replica::database>::invoke_on_all instead and access local directory through the sharded one.	2023-06-28 11:43:12 +02:00
Aleksandra Martyniuk	1acaed026a	compaction: move run_resharding_jobs into reshard_sstables_compaction_task_impl::run()	2023-06-28 11:43:11 +02:00
Aleksandra Martyniuk	837d77ba8c	compaction: add reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction.	2023-06-28 11:41:43 +02:00
Aleksandra Martyniuk	0d6dd3eeda	compaction: replica: copy struct and functions from distributed_loader.cc As a preparation for integrating resharding compaction with task manager a struct and some functions are copied from replica/distributed_loader.cc to compaction/task_manager_module.cc.	2023-06-28 11:41:42 +02:00
Raphael S. Carvalho	83c70ac04f	utils: Extract pretty printers into a header Can be easily reused elsewhere. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-06-26 21:58:20 -03:00
Aleksandra Martyniuk	f9a527b06d	compaction: move reshape function to shard_reshaping_table_compaction_task_impl::run()	2023-06-23 16:22:53 +02:00
Aleksandra Martyniuk	1960904a72	compaction: add shard_reshaping_compaction_task_impl shard_reshaping_compaction_task_impl covers reshaping compaction on one shard.	2023-06-23 16:22:38 +02:00
Aleksandra Martyniuk	e3e2d6b886	compaction: add table_reshaping_compaction_task_impl	2023-06-23 15:57:37 +02:00
Aleksandra Martyniuk	dace5fb004	compaction: copy reshape to task_manager_module.cc distributed_loader::reshape is copied to compaction/task_manager_module.cc as it will be used in reshape compaction tasks.	2023-06-23 12:53:16 +02:00
Aleksandra Martyniuk	e317ffe23a	compaction: extend signature of some methods Extend a signature of table::compact_all_sstables and compaction_manager::perform_major_compaction so that they get the info of a covering task. This allows to easily create child tasks that cover compaction group compaction.	2023-06-20 10:45:34 +02:00
Aleksandra Martyniuk	fecdd75cd6	compaction: fix indentation	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	53c24c0f7d	compaction: create table_upgrade_sstables_compaction_task_impl Implementation of task_manager's task that covers upgrade sstables compaction of one table.	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	143919cfa7	compaction: create table_offstrategy_keyspace_compaction_task_impl Implementation of task_manager's task that covers offstrategy keyspace compaction of one table.	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	55ef1c24e1	compaction: create table_cleanup_keyspace_compaction_task_impl Implementation of task_manager's task that covers cleanup keyspace compaction of one table.	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	5c7832ab59	compaction: create table_major_keyspace_compaction_task_impl Implementation of task_manager's task that covers major keyspace compaction of one table.	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	d0c4028d64	compaction: add helpers for table tasks scheduling In shard compaction tasks per table tasks will be created all at once and then they will wait for their turn to run. A function that allows waking up tasks one after another and a function that makes the task wait for its turn are added.	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	6dacc45c70	compaction: add run_on_table Extract code which runs a function on a particular table from run_on_existing_tables to run_on_table.	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	5c65ac00ef	compaction: pass std::string to run_on_existing_tables Keyspace argument passed to run_on_existing_tables has its type changed from std::string_view to std::string.	2023-05-31 14:59:24 +02:00
Aleksandra Martyniuk	f48b57e7b9	compaction: use table_info in compaction tasks Task manager compaction tasks need table names for logs. Thus, compaction tasks store table infos instead of table ids. get_table_ids function is deleted as it isn't used anywhere.	2023-05-30 09:58:55 +02:00

1 2

65 Commits