scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 05:53:13 +00:00

Author	SHA1	Message	Date
Aleksandra Martyniuk	8afa54d4f6	compaction: create task manager's task for offstrategy keyspace compaction on one shard Implementation of task_manager's task that covers local offstrategy keyspace compaction.	2023-03-30 10:49:09 +02:00
Aleksandra Martyniuk	73860b7c9d	compaction: create task manager's task for offstrategy keyspace compaction Implementation of task_manager's task covering offstrategy keyspace compaction that can be started through storage_service api.	2023-03-30 10:44:56 +02:00
Aleksandra Martyniuk	e8ef8a51d5	compaction: create offstrategy_compaction_task_impl offstrategy_compaction_task_impl serves as a base class of all concrete offstrategy compaction task classes.	2023-03-30 10:28:17 +02:00
Avi Kivity	472b155d76	Merge 'Allow each compaction group to have its own compaction strategy state' from Raphael "Raph" Carvalho This is important for multiple compaction groups, as they cannot share state that must span a single SSTable set. The solution is about: 1) Decoupling compaction strategy from its state; making compaction_strategy a pure stateless entity 2) Each compaction group storing its own compaction strategy state 3) Compaction group feeds its state into compaction strategy whenever needed Closes #13351 * github.com:scylladb/scylladb: compaction: TWCS: wire up compaction_strategy_state compaction: LCS: wire up compaction_strategy_state compaction: Expose compaction_strategy_state through table_state replica: Add compaction_strategy_state to compaction group compaction: Introduce compaction_strategy_state compaction: add table_state param to compaction_strategy::notify_completion() compaction: LCS: extract state into a separate struct compaction: TWCS: prepare for stateless strategy compaction: TWCS: extract state into a separate struct compaction: add const-qualifier to a few compaction_strategy methods	2023-03-29 18:57:11 +03:00
Raphael S. Carvalho	989afbf83b	compaction: TWCS: wire up compaction_strategy_state TWCS no longer keeps internal state, and will now rely on state managed by each compaction group through compaction::table_state. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-28 08:48:15 -03:00
Raphael S. Carvalho	233fe6d3dc	compaction: LCS: wire up compaction_strategy_state LCS no longer keeps internal state, and will now rely on state managed by each compaction group through compaction::table_state. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-28 08:48:15 -03:00
Raphael S. Carvalho	2186a75e9b	compaction: Expose compaction_strategy_state through table_state That will allow compaction_strategy to access the compaction group state through compaction::table_state, which is the interface at which replica talks to the compaction layer. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-28 08:48:10 -03:00
Raphael S. Carvalho	25f73a4181	compaction: Introduce compaction_strategy_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 15:46:11 -03:00
Raphael S. Carvalho	1ffe2f04ef	compaction: add table_state param to compaction_strategy::notify_completion() once compaction_strategy is made staless, the state must be retrieved in notify_completion() through table_state. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 13:40:02 -03:00
Raphael S. Carvalho	2ffaae97a4	compaction: LCS: extract state into a separate struct Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 13:40:02 -03:00
Raphael S. Carvalho	e2f38baa92	compaction: TWCS: prepare for stateless strategy Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 13:40:01 -03:00
Raphael S. Carvalho	017f432b8f	compaction: TWCS: extract state into a separate struct This is a step towards decoupling compaction strategy (computation) and its state. Making the former stateless. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 13:38:47 -03:00
Raphael S. Carvalho	232e71f2cf	compaction: add const-qualifier to a few compaction_strategy methods Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 11:13:10 -03:00
Botond Dénes	b5afdf56c3	Merge 'Cleanup keyspace compaction task' from Aleksandra Martyniuk Task manager task implementations of classes that cover cleanup keyspace compaction which can be started through /storage_service/keyspace_compaction/ api. Top level task covers the whole compaction and creates child tasks on each shard. Closes #12712 * github.com:scylladb/scylladb: test: extend test_compaction_task.py to test cleanup compaction compaction: create task manager's task for cleanup keyspace compaction on one shard compaction: create task manager's task for cleanup keyspace compaction api: add get_table_ids to get table ids from table infos compaction: create cleanup_compaction_task_impl	2023-03-27 11:52:51 +03:00
Avi Kivity	32cc975b2f	compaction: error on invalid scrub type gcc allows an enum to contain a value outside its enum set, so we need to handle it. Since it shouldn't happen, signal an internal error.	2023-03-21 13:42:49 +02:00
Aleksandra Martyniuk	27b999808f	compaction: create task manager's task for cleanup keyspace compaction on one shard Implementation of task_manager's task that covers cleanup keyspace compaction on one shard.	2023-03-13 16:35:39 +01:00
Aleksandra Martyniuk	7dd27205f6	compaction: create task manager's task for cleanup keyspace compaction Implementation of task_manager's task covering cleanup keyspace compaction that can be started through storage_service api.	2023-03-13 16:35:39 +01:00
Aleksandra Martyniuk	8801f326c6	compaction: create cleanup_compaction_task_impl	2023-03-13 16:35:39 +01:00
Kefu Chai	0b3d25ab1b	build: cmake: add missing linkages these dependencies were found when trying to compile `user_function_test`. whenever a library libfoo references another one, say, libbar, the corresponding linkage from libfoo to libbar is added. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-03-08 22:53:42 +08:00
Botond Dénes	fb898d214c	Merge 'Shard major compaction task' from Aleksandra Martyniuk Implementation of task_manager's task that covers major keyspace compaction on one shard. Closes #12662 * github.com:scylladb/scylladb: test: extend major keyspace compaction tasks test compaction: create task manager's task for major keyspace compaction on one shard	2023-03-02 15:06:31 +02:00
Botond Dénes	91d64372db	Merge 'cmake: sync with `configure.py` (8/n)' from Kefu Chai - build: cmake: extract more subsystem out into its own CMakeLists.txt - build: cmake: remove swagger_gen_files - build: cmake: remove stale TODO comments - build: cmake: expose scylla_gen_build_dir - build: cmake: link against cryptopp - build: cmake: add missing source to utils - build: cmake: move lib sources into test-lib - build: cmake: add test/perf Closes #13059 * github.com:scylladb/scylladb: build: cmake: add expr_test test build: cmake: allow test to specify the sources build: cmake: add test/perf build: cmake: move lib sources into test-lib build: cmake: add missing source to utils build: cmake: link against cryptopp build: cmake: expose scylla_gen_build_dir build: cmake: remove stale TODO comments build: cmake: remove swagger_gen_files build: cmake: extract more subsystem out into its own CMakeLists.txt	2023-03-02 14:22:35 +02:00
Botond Dénes	1b5f8916d6	Merge 'Generalize sstable::move_to_new_dir() method' from Pavel Emelyanov This method requires callers to remember that the sstable is the collection of files on a filesystem and to know what exact directory they are all in. That's not going to work for object storage, instead, sstable should be moved between more abstract states. This PR replaces move_to_new_dir() call with the change_state() one that accepts target sub-directory string and moves files around. Currently supported state changes: * staging -> normal * upload -> normal \| staging * any -> quarantine All are pretty straightforward and move files between table basedir subdirectories with the exception that upload -> quarantine should move into upload/quarantine subdirectory. Another thing to keep in mind, that normal state doesn't have its subdir but maps directory to table's base directory. Closes #12648 * github.com:scylladb/scylladb: sstable: Remove explicit quarantization call test: Move move_to_new_dir() method from sstable class sstable, dist.-loader: Introduce and use pick_up_from_upload() method sstables, code: Introduce and use change_state() call distributed_loader: Let make_sstables_available choose target directory	2023-03-02 09:22:14 +02:00
Kefu Chai	563fbb2d11	build: cmake: extract more subsystem out into its own CMakeLists.txt namely, cdc, compaction, dht, gms, lang, locator, mutation_writer, raft, readers, replica, service, tools, tracing and transport. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-03-02 10:15:25 +08:00
Aleksandra Martyniuk	b188060535	compaction: create task manager's task for major keyspace compaction on one shard Implementation of task_manager's task that covers major keyspace compaction on one shard.	2023-03-01 18:56:26 +01:00
Aleksandra Martyniuk	159e603ac4	compaction: create task manager's task for major keyspace compaction Implementation of task_manager's task covering major keyspace compaction that can be started through storage_service api.	2023-02-23 15:48:05 +01:00
Aleksandra Martyniuk	6b1d7f5979	compaction: copy run_on_existing_tables to task_manager_module.cc Copy run_on_existing_tables from api/storage_service.cc to compaction/task_manager_module.cc	2023-02-23 15:31:59 +01:00
Aleksandra Martyniuk	b908369e85	compaction: add major_compaction_task_impl All major compaction tasks will share some methods like type or abort. The common part of the tasks should be inherited from major_compaction_task_impl.	2023-02-22 09:52:04 +01:00
Aleksandra Martyniuk	be101078a0	compacition: add pure virtual compaction_task_impl Add compaction_task_impl that is a pure virtual class from which all compaction tasks implementations will inherit.	2023-02-22 09:51:57 +01:00
Pavel Emelyanov	8a061bd862	sstables, code: Introduce and use change_state() call The call moves the sstable to the specified state. The change state is translated into the storage driver state change which is for todays filesystem storage means moving between directories. The "normal" state maps to the base dir of the table, there's no dedicated subdir for this state and this brings some trouble into the play. The thing is that in order to check if an sstable is in "normal" state already its impossible to compare filename of its path to any pre-defined values, as tables' basdirs are dynamic. To overcome this, the change-state call checks that the sstable is in one of "known" sub-states, and assumes that it's in normal state otherwise. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-02-21 17:39:34 +03:00
Aleksandra Martyniuk	4f67c0c36a	compaction: add compaction module getter to compaction manager	2023-02-20 11:19:29 +01:00
Botond Dénes	dc3d47b1e4	Merge 'Get compaction history without using qctx' from Pavel Emelyanov There are two methods to mess with compaction history -- update and get. The former had been patched to use local system-keyspace instance by `907fd2d3` (system_keyspace: De-static compaction history update) now it's time for the latter (spoiler: it's only used by the API handler) Closes #12889 * github.com:scylladb/scylladb: system_keyspace; Make get_compaction_history non static and drop qctx api, compaction_manager: Get compaction history via manager system_keyspace: Move compaction_history_entry to namespace scope	2023-02-16 19:05:48 +02:00
Pavel Emelyanov	52f69643b6	api, compaction_manager: Get compaction history via manager Right now the API handler directly calls static method from system keyspace. Patching it to call compaction manager instead will let the latter use on-board plugged system keyspace for that. If the system keyspace is not plugged, it means early boot or late shutdown, not a good time to get compaction history anyway. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-02-16 11:27:38 +03:00
Kefu Chai	0cb842797a	treewide: do not define/capture unused variables these warnings are found by Clang-17 after removing `-Wno-unused-lambda-capture` and '-Wno-unused-variable' from the list of disabled warnings in `configure.py`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-02-15 22:57:18 +02:00
Avi Kivity	69a385fd9d	Introduce schema/ module Schema related files are moved there. This excludes schema files that also interact with mutations, because the mutation module depends on the schema. Those files will have to go into a separate module. Closes #12858	2023-02-15 11:01:50 +02:00
Avi Kivity	c5e4bf51bd	Introduce mutation/ module Move mutation-related files to a new mutation/ directory. The names are kept in the global namespace to reduce churn; the names are unambiguous in any case. mutation_reader remains in the readers/ module. mutation_partition_v2.cc was missing from CMakeLists.txt; it's added in this patch. This is a step forward towards librarization or modularization of the source base. Closes #12788	2023-02-14 11:19:03 +02:00
Kefu Chai	d4315245a1	main: use defer_verbose_shutdown() to shutdown compaction manager * use `defer_verbose_shutdown()` to shutdown compaction manager `EDQUOT` is quite similar as `ENOSPC`, in the sense that both of them are caused by environmental issues. before this change, `compaction_manager` filters the ENOSPC exceptions thrown by `compaction_manager::really_do_stop()`, so they are not propagated to caller when calling `compaction_manager::stop()` -- only a warning message is printed in the log. but `EDQUOT` is not handled. after this change, the exception raised by compaction manager's stop process is not filtered anymore and is handled by `defer_verbose_shutdown()` instead, which is able to check the type of exception, and print out error message in the log. so the `ENOSPC` and `EDQUOT` errors are taken care of, and more visible from user's perspective as they are printed as errors instead of warning. but they are not printed using the `compaction_manager` logger anymore. so if our testing or user's workflow depends on this behavior, the related setting should be updated accordingly. Fixes #12626 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-02-07 16:00:40 +08:00
Botond Dénes	511c0123a2	Merge 'Add compaction module to task manager' from Aleksandra Martyniuk Introduces task manager's compaction module. That's an initial part of integration of compaction with task manager. When fully integrated, task manager will allow user to track compaction operations, check status and progress of each individual one. It will help with creating an asynchronous version of rest api that forces any compaction. Currently, users can see with /task_manager/list_modules api call that compaction is one of the modules accessible through task manager. They won't get any additional information though, since compaction tasks are not created yet. A shared_ptr to compaction module is kept in compaction manager. Closes #12635 * github.com:scylladb/scylladb: compaction: test: pass task_manager to compaction_manager in test environment compaction: create and register task manager's module for compaction tasks: add task_manager constructor without arguments	2023-02-06 09:25:05 +02:00
Aleksandra Martyniuk	12789adb95	compaction: test: pass task_manager to compaction_manager in test environment Each instance of compaction manager should have compaction module pointer initialized. All contructors get task_manager reference with which the module is created.	2023-02-03 15:15:11 +01:00
Raphael S. Carvalho	5a784c3c6d	treewide: Use new sstable_set::size() wherever possible That's the preferred alternative because it's zero copy. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-02-03 10:38:04 -03:00
Aleksandra Martyniuk	47ef689077	compaction: create and register task manager's module for compaction As an initial part of integration of compaction with task manager, compaction module is added. Compaction module inherits from tasks::task_manager::module and shared_ptr to it is kept in compaction manager. No compaction tasks are created yet.	2023-02-03 13:52:30 +01:00
Raphael S. Carvalho	1b2140e416	compaction: Fix inefficiency when updating LCS backlog tracker LCS backlog tracker uses STCS tracker for L0. Turns out LCS tracker is calling STCS tracker's replace_sstables() with empty arguments even when higher levels (> 0) only had sstables replaced. This unnecessary call to STCS tracker will cause it to recompute the L0 backlog, yielding the same value as before. As LCS has a fragment size of 0.16G on higher levels, we may be updating the tracker multiple times during incremental compaction, which operates on SSTables on higher levels. Inefficiency is fixed by only updating the STCS tracker if any L0 sstable is being added or removed from the table. This may be fixing a quadratic behavior during boot or refresh, as new sstables are loaded one by one. Higher levels have a substantial higher number of sstables, therefore updating STCS tracker only when level 0 changes, reduces significantly the number of times L0 backlog is recomputed. Refs #12499. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12676	2023-02-01 15:19:07 +02:00
Benny Halevy	82011fc489	dht: incremental_owned_ranges_checker: belongs_to_current_node: mark as const Its _it member keeps state about the current range. Although it's modified by the method, this is an implementation detail that irrelevant to the caller, hence mark the belongs_to_current_node method as const (and noexcept while at it). This allows the caller, cleanup_compaction, to use it from inside a const method, without having to mark its respective member as mutable too. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #12634	2023-01-25 14:52:21 +02:00
Raphael S. Carvalho	f2f839b9cc	compaction: LCS: don't reshape all levels if only a single breaks disjointness LCS reshape is compacting all levels if a single one breaks disjointness. That's unnecessary work because rewriting that single level is enough to restore disjointness. If multiple levels break disjointness, they'll each be reshaped in its own iteration, so reducing operation time for each step and disk space requirement, as input files can be released incrementally. Incremental compaction is not applied to reshape yet, so we need to avoid "major compaction", to avoid the space overhead. But space overhead is not the only problem, the inefficiency, when deciding what to reshape when overlapping is detected, motivated this patch. Fixes #12495. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12496	2023-01-17 09:55:15 +02:00
Raphael S. Carvalho	67ebd70e6e	compaction_manager: Fix reactor stalls during periodic submissions Every 1 hour, compaction manager will submit all registered table_state for a regular compaction attempt, all without yielding. This can potentially cause a reactor stall if there are 1000s of table states, as compaction strategy heuristics will run on behalf of each, and processing all buckets and picking the best one is not cheap. This problem can be magnified with compaction groups, as each group is represented by a table state. This might appear in dashboard as periodic stalls, every 1h, misleading the investigator into believing that the problem is caused by a chronological job. This is fixed by piggybacking on compaction reevaluation loop which can yield between each submission attempt if needed. Fixes #12390. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12391	2022-12-24 13:43:16 +02:00
Raphael S. Carvalho	e6fb3b3a75	compaction: Delete atomically off-strategy input sstables After commit `a57724e711`, off-strategy no longer races with view building, therefore deletion code can be simplified and piggyback on mechanism for deleting all sstables atomically, meaning a crash midway won't result in some of the files coming back to life, which leads to unnecessary work on restart. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #12245	2022-12-16 08:15:49 +02:00
Botond Dénes	8f8284783a	Merge 'Fix handling of non-full clustering keys in the read path' from Tomasz Grabiec This PR fixes several bugs related to handling of non-full clustering keys. One is in trim_clustering_row_ranges_to(), which is broken for non-full keys in reverse mode. It will trim the range to position_in_partition_view::after_key(full_key) instead of position_in_partition_view::before_key(key), hence it will include the key in the resulting range rather than exclude it. Fixes #12180 after_key() was creating a position which is after all keys prefixed by a non-full key, rather than a position which is right after that key. This will issue will be caught by cql_query_test::test_compact_storage in debug mode when mutation_partition_v2 merging starts inserting sentinels at position after_key() on preemption. It probably already causes problems for such keys as after_key() is used in various parts in the read path. Refs #1446 Closes #12234 * github.com:scylladb/scylladb: position_in_partition: Make after_key() work with non-full keys position_in_partition: Introduce before_key(position_in_partition_view) db: Fix trim_clustering_row_ranges_to() for non-full keys and reverse order types: Fix comparison of frozen sets with empty values	2022-12-15 10:47:12 +02:00
Tomasz Grabiec	23e4c83155	position_in_partition: Make after_key() work with non-full keys This fixes a long standing bug related to handling of non-full clustering keys, issue #1446. after_key() was creating a position which is after all keys prefixed by a non-full key, rather than a position which is right after that key. This will issue will be caught by cql_query_test::test_compact_storage in debug mode when mutation_partition_v2 merging starts inserting sentinels at position after_key() on preemption. It probably already causes problems for such keys.	2022-12-14 14:47:33 +01:00
Pavel Emelyanov	9bdea110a6	code: Reduce fanout of sstables(_manager)?.hh over headers This change removes sstables.hh from some other headers replacing it with version.hh and shared_sstable.hh. Also this drops sstables_manager.hh from some more headers, because this header propagates sstables.hh via self. That change is pretty straightforward, but has a recochet in database.hh that needs disk-error-handler.hh. Without the patch touch sstables/sstable.hh results in 409 targets recompillation, with the patch -- 299 targets. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #12222	2022-12-07 14:34:19 +02:00
Avi Kivity	d2b1d2f695	compaction_manager: reindent postponed_compactions_reevaluation()	2022-12-05 22:02:27 +02:00
Avi Kivity	1669025736	compaction_manager: coroutinize postponed_compactions_reevaluation() So much nicer.	2022-12-05 22:01:41 +02:00

1 2 3 4 5 ...

509 Commits