scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-29 12:47:02 +00:00

Author	SHA1	Message	Date
Raphael S. Carvalho	97985a68a1	compaction: Fix incremental compaction for sstable cleanup After `c7826aa910`, sstable runs are cleaned up together. The procedure which executes cleanup was holding reference to all input sstables, such that it could later retry the same cleanup job on failure. Turns out it was not taking into account that incremental compaction will exhaust the input set incrementally. Therefore cleanup is affected by the 100% space overhead. To fix it, cleanup will now have the input set updated, by removing the sstables that were already cleaned up. On failure, cleanup will retry the same job with the remaining sstables that weren't exhausted by incremental compaction. New unit test reproduces the failure, and passes with the fix. Fixes #14035. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #14038 (cherry picked from commit `23443e0574`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #14195	2023-06-13 09:57:59 +03:00
Botond Dénes	cfa8fa1d77	Merge 'Backport compaction reevaluation fixes to branch-5.1' from Raphael "Raph" Carvalho Fixes #13429. Fixes #12390. Fixes #13430. Closes #14009 * github.com:scylladb/scylladb: compaction: Make compaction reevaluation actually periodic compaction_manager: Fix reactor stalls during periodic submissions compaction_manager: reindent postponed_compactions_reevaluation() compaction_manager: coroutinize postponed_compactions_reevaluation() compaction_manager: make postponed_compactions_reevaluation() return a future replica: Reevaluate regular compaction on off-strategy completion	2023-05-25 07:55:17 +03:00
Raphael S. Carvalho	6cdd5ccabd	compaction: Make compaction reevaluation actually periodic The manager intended to periodically reevaluate compaction need for each registered table. But it's not working as intended. The reevaluation is one-off. This means that compaction was not kicking in later for a table, with low to none write activity, that had expired data 1 hour from now. Also make sure that reevaluation happens within the compaction scheduling group. Fixes #13430. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `156ac0a67a`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-23 21:30:47 -03:00
Raphael S. Carvalho	204baa0c1e	compaction_manager: Fix reactor stalls during periodic submissions Every 1 hour, compaction manager will submit all registered table_state for a regular compaction attempt, all without yielding. This can potentially cause a reactor stall if there are 1000s of table states, as compaction strategy heuristics will run on behalf of each, and processing all buckets and picking the best one is not cheap. This problem can be magnified with compaction groups, as each group is represented by a table state. This might appear in dashboard as periodic stalls, every 1h, misleading the investigator into believing that the problem is caused by a chronological job. This is fixed by piggybacking on compaction reevaluation loop which can yield between each submission attempt if needed. Fixes #12390. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12391 (cherry picked from commit `67ebd70e6e`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-23 21:18:36 -03:00
Avi Kivity	3556d2b4e8	compaction_manager: reindent postponed_compactions_reevaluation() (cherry picked from commit `d2b1d2f695`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-23 21:18:18 -03:00
Avi Kivity	6b699c9667	compaction_manager: coroutinize postponed_compactions_reevaluation() So much nicer. (cherry picked from commit `1669025736`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-23 21:17:58 -03:00
Avi Kivity	316ea63ea0	compaction_manager: make postponed_compactions_reevaluation() return a future postponed_compactions_reevaluation() runs until compaction_manager is stopped, checking if it needs to launch new compactions. Make it return a future instead of stashing its completion somewhere. This makes is easier to convert it to a coroutine. (cherry picked from commit `d2c44cba77`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-23 21:17:26 -03:00
Raphael S. Carvalho	0c9a0faf0d	compaction: Fix inefficiency when updating LCS backlog tracker LCS backlog tracker uses STCS tracker for L0. Turns out LCS tracker is calling STCS tracker's replace_sstables() with empty arguments even when higher levels (> 0) only had sstables replaced. This unnecessary call to STCS tracker will cause it to recompute the L0 backlog, yielding the same value as before. As LCS has a fragment size of 0.16G on higher levels, we may be updating the tracker multiple times during incremental compaction, which operates on SSTables on higher levels. Inefficiency is fixed by only updating the STCS tracker if any L0 sstable is being added or removed from the table. This may be fixing a quadratic behavior during boot or refresh, as new sstables are loaded one by one. Higher levels have a substantial higher number of sstables, therefore updating STCS tracker only when level 0 changes, reduces significantly the number of times L0 backlog is recomputed. Refs #12499. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12676 (cherry picked from commit `1b2140e416`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-02-07 12:45:34 -03:00
Raphael S. Carvalho	43d46a241f	compaction: LCS: don't reshape all levels if only a single breaks disjointness LCS reshape is compacting all levels if a single one breaks disjointness. That's unnecessary work because rewriting that single level is enough to restore disjointness. If multiple levels break disjointness, they'll each be reshaped in its own iteration, so reducing operation time for each step and disk space requirement, as input files can be released incrementally. Incremental compaction is not applied to reshape yet, so we need to avoid "major compaction", to avoid the space overhead. But space overhead is not the only problem, the inefficiency, when deciding what to reshape when overlapping is detected, motivated this patch. Fixes #12495. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12496 (cherry picked from commit `f2f839b9cc`)	2023-02-05 20:06:31 +02:00
Aleksandra Martyniuk	88016de43e	compaction: request abort only once in compaction_data::stop compaction_manager::task (and thus compaction_data) can be stopped because of many different reasons. Thus, abort can be requested more than once on compaction_data abort source causing a crash. To prevent this before each request_abort() we check whether an abort was requested before. Closes #12004 (cherry picked from commit `7ead1a7857`) Fixes #12002.	2022-11-17 19:15:43 +02:00
Pavel Emelyanov	dff7f3c5ba	compaction_manager: Swallow ENOSPCs in ::stop() When being stopped compaction manager may step on ENOSPC. This is not a reason to fail stopping process with abort, better to warn this fact in logs and proceed as if nothing happened refs: #11245 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-13 15:36:44 +03:00
Raphael S. Carvalho	eaded57b2e	compaction: Properly handle stop request for off-strategy If user stops off-strategy via API, compaction manager can decide to give up on it completely, so data will sit unreshaped in maintenance set, preventing it from being compacted with data in the main set. That's problematic because it will probably lead to a significant increase in read and space amplification until off-strategy is triggered again, which cannot happen anytime soon. Let's handle it by moving data in maintenance set into main one, even if unreshaped. Then regular compaction will be able to continue from where off-strategy left off. Fixes #11543. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #11545 (cherry picked from commit `a04047f390`)	2022-10-02 14:20:17 +03:00
Benny Halevy	14faa3b6f4	compaction_manager: perform_cleanup, perform_sstable_upgrade: use a lw_shared_ptr for owned token ranges And completely get rid of the dependency on replica::database. Also, add respective rest_api tests. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-02 08:08:11 +03:00
Benny Halevy	e1fe598760	compaction: cleanup, upgrade: use a lw_shared_ptr for owned token ranges Currently they are copied for the get_sstables function so this change reduces copies. Also, it will allow further decoupling of compaction_manager from replica::database, by letting the caller of perform_cleanup and perform_sstable_upgrade get the owned token ranges from db and pass it to the perform_* functions in the following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-02 07:57:41 +03:00
Benny Halevy	e4e92d44ae	main: start compaction_manager as a sharded service And pass a reference to it to the database rather than having the database construct its own compaction_manager. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-02 07:50:15 +03:00
Benny Halevy	7f70949693	compaction_manager: keep config as member Rather than keeping separate, duplicated members. And define helpers to get those members. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-02 07:48:01 +03:00
Benny Halevy	450ecd60c6	backlog_controller: scheduling_group: define default member initializers To prepare for the next patch, implement default initialization of the scheduling_group and io_priority_class, to the default values. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-02 07:38:40 +03:00
Aleksandra Martyniuk	6ea5bc96d7	scrub compaction: return status indicating aborted operations over the rest api Performing compaction scrub user did not know whether an operation was aborted. If compaction scrub is aborted, return status the user gets over rest api is set to 1.	2022-07-29 09:35:20 +02:00
Aleksandra Martyniuk	f1980f8dc6	scrub compaction: count validation errors and return status over the rest api Performing compaction scrub user did not know whether any validation errors were encountered. The number of validation errors per given compaction scrub is gathered and summed from each shard. Basing on that value return status over the rest api is set to 3 if any validation errors were encountered.	2022-07-29 09:35:20 +02:00
Aleksandra Martyniuk	7d457cffb8	scrub compaction: count validation errors for specific scrub task The number of validation errors per given compaction scrub on given shard is passed up to perform_task() function.	2022-07-29 09:35:20 +02:00
Aleksandra Martyniuk	3a805a9d9b	compaction: extract statistics in compaction_result Statistics from compaction_result are extracted to new struct compaction_stats and stored as a field of compaction_result.	2022-07-29 09:35:20 +02:00
Aleksandra Martyniuk	a80c187b20	scrub compaction: register validation errors in metrics The number of validation errors is registered in metrics. Metrics provide common counters for all scrub operation within a compaction manager, though. Thus, to check the exact number of validation errors, the comparison of counters before and after scrub operation needs to be done.	2022-07-29 09:35:20 +02:00
Aleksandra Martyniuk	ab85dab05d	scrub compaction: count validation errors The number of validation errors encountered during scrub compaction is counted.	2022-07-29 09:35:20 +02:00
Benny Halevy	f26e655646	compaction_manager: add maybe_wait_for_sstable_count_reduction Called from try_flush_memtable_to_sstable, maybe_wait_for_sstable_count_reduction will wait for compaction to catch up with memtable flush if there the bucket to compact is inflated, having too many sstables. In that case we don't want to add fuel to the fire by creating yet another sstable. Fixes #4116 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-28 14:43:30 +03:00
Benny Halevy	69d4a16908	time_window_compaction_strategy: get_sstables_for_compaction: clean up code Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-28 14:22:03 +03:00
Benny Halevy	c450f3ee11	time_window_compaction_strategy: make get_sstables_for_compaction idempotent To make sure fully_expired sstables are not missed if get_sstables_for_compaction is called just heuristically, change the state by setting _last_expired_check to the current time only when no fully_expired_sstables are found among the candidates. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-28 14:22:03 +03:00
Benny Halevy	3d07882431	time_window_compaction_strategy: get_sstables_for_compaction: improve debug messages Print the compaction_strategy `this` pointer so we can distinguish between different instance of the compaction_strategy object (some code paths copy it and some may instantiate a branch new compaction_strategy object). The motivation is detecting when the side effects of this function are applied on the "master" instance, stored in the table shard. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-28 14:22:03 +03:00
Benny Halevy	a149022ed4	leveled_manifest: pass compaction_counter as const& It is not modified by the leveld_manifest functions. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-28 14:22:03 +03:00
Igor Ribeiro Barbosa Duarte	8dd0f4672d	compaction: Make compaction_static_shares liveupdateable This patch makes compaction_static_shares liveupdateable to avoid having to restart the cluster after updating this config. Signed-off-by: Igor Ribeiro Barbosa Duarte <igor.duarte@scylladb.com>	2022-07-19 10:10:46 -03:00
Igor Ribeiro Barbosa Duarte	c2ee6492e6	backlog_controller: Unify backlog_controller constructors This patch adds the _static_shares variable to the backlog_controller so that instead of having to use a separate constructor when controller is disabled, we can use a single constructor and periodically check on the adjust method if we should use the static shares or the controller. This will be useful on the next patches to make compaction_static_shares and memtable_flush_static_shares live updateable. Signed-off-by: Igor Ribeiro Barbosa Duarte <igor.duarte@scylladb.com>	2022-07-19 10:06:12 -03:00
Raphael S. Carvalho	246e945086	compaction: remove forward declaration of replica::table compaction_manager.cc still cannot stop including replica/database.hh because upgrade and scrub still take replica::database as param, but I'll remove it soon in another series. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	a94d974835	compaction_manager: make add() and remove() switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	31655acb5e	compaction_manager: make run_custom_job() switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	9a1efc69d0	compaction_manager: major: switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	cebe6e22cb	compaction_manager: scrub: switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	d29f7070d9	compaction_manager: upgrade: switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	c2678ca661	compaction: table_state: add get_sstables_manager() That will be needed for retrieving sstable manager in perform_sstable_upgrade(). Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	bdd049afd6	compaction_manager: cleanup: switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	f547e0f2fb	compaction_manager: offstrategy: switch to table_state() Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	538d412fba	compaction_manager: rewrite_sstables(): switch to table_state rewrite_sstables() is used by maintenance compactions that perform an operation on a single file at a time. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	79e385057f	compaction_manager: make run_with_compaction_disabled() switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	79f91fe61e	compaction_manager: compaction_reenabler: switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	7c1d178f4e	compaction_manager: make submit(T) switch to table_state Now that submit() switched to table_state, compaction_reenabler and friends can switch to table_state too. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	a176022272	compaction_manager: task: switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	43136a3ca7	compaction: table_state: Add is_auto_compaction_disabled_by_user() auto_compaction_disabled_by_user is a configuration that can be enabled or disabled on a particular table. We're adding this interface to avoid having to push the configuration for every compaction_state, which would result in redundant information as the configuration value is the same for all table states. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	1deeeff825	compaction: table_state: Add on_compaction_completion() The idea is that we'll have a single on-completion interface for both "in-strategy" and off-strategy compactions, so not to pollute table_state with one interface for each. replica::table::on_compaction_completion is being moved into private namespace. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	1520580212	compaction: table_state: Add make_sstable() compaction_manager needs this interface when setting the sstable creation lambda in compaction_descriptor, which is then forwarded into the actual compaction procedure. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	956c3997cb	compaction_manager: make can_proceed switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	7a9908dbf1	compaction_manager: make stop compaction procedures switch to table_state they're used to stop all ongoing compaction on behalf of a given table T. Today, each table has a single table_state representing it, but after we implement compaction groups, we'll need to call the procedure for each group in a table. But the discussion doesn't belong here, as compaction group work will only come later. By the time being, we're only making compaction manager fully switch to table_state. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Raphael S. Carvalho	b6126395e1	compaction_manager: make get_compactions() switch to table_state The only external user of get_compactions() doesn't use any filtering, so after table_state switch, one will be allowed to get all jobs running associated with a table_state. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00

1 2 3 4 5 ...

431 Commits