scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-08 07:53:20 +00:00

Author	SHA1	Message	Date
Benny Halevy	4062cd17e0	test: hashers_test: mutation_fragment_sanity_check: stop semaphore To stop the semaphore as required we need run the test in a seastar thread. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211024053402.990142-1-bhalevy@scylladb.com>	2021-10-24 11:29:23 +03:00
Benny Halevy	0746b5add6	storage_service: replicate_to_all_cores: update all keyspaces Currently we update the effective_replication_map only on non-system keyspace, leaving the system keyspace, that uses the local replication strategy, with the empty replication_map, as it was first initialized. This may lead to a crash when get_ranges is called later as seen in #9494 where get_ranges was called from the perform_sstable_upgrade path. This change updates the effective_replication_map on all keyspaces rather than just on the non-system ones and adds a unit test that reproduces #9494 without the fix and passes with it. Fixes #9494 Test: unit(dev), database_test(debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211020143217.243949-1-bhalevy@scylladb.com>	2021-10-20 17:54:23 +03:00
Nadav Har'El	e4a6569258	config: experimental flag UNUSED_CDC shouldn't be distinct from UNUSED When an experimental feature graduates from being experimental, we want to continue allow the old "--experimental-features=..." option to work, in case some user's configuration uses it - just do nothing. The way we do it is to map in db::experimental_features_t::map() the feature's name to the UNUSED value - this way the feature's name is accepted, but doesn't change anything. When the CDC feature graduated from being experimental, a new bit UNUSED_CDC was introduced to do the same thing. This separate bit was not actually necessary - if we ever check for UNUSED_CDC bit anywhere in the code it means the flag isn't actually unused ;-) And we don't check it. So simplify the code by conflating UNUSED_CDC into UNUSED. This will also make it easy to build from db::experimental_features_t::map() a list of current experimental features - now it will simply be those that do not map to UNUSED. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211013105107.123544-1-nyh@scylladb.com>	2021-10-20 17:54:17 +03:00
Kamil Braun	22061831c1	Merge 'cql3: keyspace prepare_options: expand replication_factor also for fully qualified NetworkTopologyStrategy' from Benny Halevy It was auto-expanded only if the strategy name was the short "NetworkTopologyStrategy" name. Fixes #9302. Closes #9304. * 'prepare_options' of https://github.com/bhalevy/scylla: cql3: keyspace prepare_options: expand replication_factor also for fully qualified NetworkTopologyStrategy abstract_replication_strategy: add to_qualified_class_name	2021-10-18 16:40:57 +03:00
Raphael S. Carvalho	062436829c	compaction/TWCS: optimize reshape for disjoint sstables spanning multiple windows After `a4053dbb72`, data segregation is postponed to offstrategy, so reshape procedure is called with disjoint sstables which belong to different windows, so let's extend the optimization for disjoint sstables which span more than one window. In this way, write amplification is reduced for offstrategy compaction, as all disjoint sstables will be compacted at once. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211013203046.233540-2-raphaelsc@scylladb.com>	2021-10-18 16:40:57 +03:00
Raphael S. Carvalho	aa4aba40aa	sstables: sstable_run: introduce estimate_droppable_tombstone_ratio Make it possible to estimate dropppable tombstones for sstable runs. The result is averaged by number of fragments composing the run. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211014143424.353357-1-raphaelsc@scylladb.com>	2021-10-18 12:24:08 +03:00
Benny Halevy	b9aa92edd4	cql3: keyspace prepare_options: expand replication_factor also for fully qualified NetworkTopologyStrategy It was auto-expanded only if the strategy name was the short "NetworkTopologyStrategy" name. Fixes #9302 Test: cql_query_test.test_rf_expand(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-18 12:18:07 +03:00
Avi Kivity	4f3b8f38e2	Merge "Add effective_replication_map" from Benny " The current api design of abstract_replication_strategy provides a can_yield parameter to calls that may stall when traversing the token metadata in O(n^2) and even in O(n) for a large number of token ranges. But, to use this option the caller must run in a seastar thread. It can't be used if the caller runs a coroutine or plain async tasks. Rather than keep adding threads (e.g. in storage_service::load_and_stream or storage_service::describe_ring), the series offers an infrastructure change: precalculating the token->endpoints map once, using an async task, and keeping the results in a `effective_replication_map` object. The latter can be used for efficient and stall-free calls, like get_natural_endpoints, or get_ranges/get_primary_range, replacing their equivalents in abstract_replication_strategy, and dropping the public abstract_replication_strategy::calculate_natural_endpoints and its internal cached_endpoints map. Other than the performance benefits of: 1. The current calls require running a thread to yield. Precalculating the map (using async task) allows us to use synchronous calls without stalling the rector. 2. The replication maps can and should be shared between keyspaces that use the same replication strategy. (Will be sent as a follow-up to the series) The bigger benefits (courtesy of Avi Kivity) are laying the groundwork for: 1. atomic replication metadata - an operation can capture a replication map once, and then use consistent information from the map without worrying that it changes under its feet. We may even be able to s/inet_address/replica_ptr/ later. 2. establish boundaries on the use of replication information - by making a replication map not visible, and observing when its reference count drops to zero, we can tell when the new replication map is fully in use. When we start writing to a new node we'll be able to locate a point in time where all writes that were not aware of the new node were completed (this is the point where we should start streaming). Notes: * The get_natural_endpoints method that uses the effective_replication_map is still provided as a abstract_replication_strategy virtual method so that local_strategy can override it and privide natural endpoints for any search token, even in the absence of token_metadata, when\ called early-on, before token_metadata has been established. The effective_replication_map materializes the replication strategy over a given replication strategy options and token_metadata. Whenever either of those change for a keyspace, we make a new effective_replication_map and keep it in the keyspace for latter use. Methods that depend on an ad-hoc token_metadata (e.g. during node operations like bootstrap or replace) are still provided by abstract_replication_strategy. TODO: - effective_replication_map registry - Move pending ranges from token_metadata to replication map - get rid of abstract_replication_strategy::get_range_addresses(token_metadata&) - calculate replication map and use it instead. Test: unit(dev, debug) Dtest: next-gating, bootstrap_test.py update_cluster_layout_tests.py alternator_tests.py -a 'dtest-full,!dtest-heavy' (release) " * tag 'effective_replication_strategy-v6' of github.com:bhalevy/scylla: (44 commits) effective_replication_map: add get_range_addresses abstract_replication_strategy: get rid of shared_token_metadata member and ctor param abstract_replication_strategy: recognized_options: pass const topology& abstract_replication_strategy: precacluate get_replication_factor for effective_replication_map token_metadata: get rid of now-unused sync methods abstract_replication_strategy: get rid of do_calculate_natural_endpoints abstract_replication_strategy: futurize get_address_ranges abstract_replication_strategy: futurize get_range_addresses abstract_replication_strategy: futurize get_ranges(inet_address ep, token_metadata_ptr) abstract_replication_strategy: move get_ranges and get_primary_ranges to effective_replication_map compaction_manager: pass owned_ranges via cleanup/upgrade options abstract_replication_strategy: get rid of cached_endpoints all replication strategies: get rid of do_get_natural_endpoints storage_proxy: use effective_replication_map token_metadata_ptr along with endpoints abstract_replication_strategy: move get_natural_endpoints_without_node_being_replaced to effective_replication_map storage_service: bootstrap: add log messages storage_service: get_mutable_token_metadata_ptr: always invalidate_cached_rings shared_token_metadata: set: check version monotonicity token_metadata: use static ring version token_metadata: get rid of copy constructor and assignment operator ...	2021-10-13 20:28:30 +03:00
Tomasz Grabiec	d8832b9fd8	Merge 'Memtable make reversing reader' from Michał Radwański Make a reader that reads from memtable in reverse order. This draft PR includes two commits, out of which only the second is relevant for review. Described in #9133. Refs #1413. Closes #9174 * github.com:scylladb/scylla: partition_snapshot_reader: pop_range_tombstone returns reference (instead of value) when possible. memtable: enable native reversing partition_snapshot_reader: reverse ck_range when needed by Reversing memtable, partition_snapshot_reader: read from partition in reverse partition_snapshot_reader: rows_position and rows_iter_type supporting reverse iteration partition_snapshot_reader: split responsibility of ck_range partition_snapshot_reader: separate _schema into _query_schema and _partition_schema query: reverse clustering_range test: cql_query_test: fix test_query_limit for reversed queries	2021-10-13 20:24:02 +03:00
Benny Halevy	8c85197c6c	abstract_replication_strategy: get rid of shared_token_metadata member and ctor param It is not used any more. Methods either use the token_metadata_ptr in the effective_replication_map, or receive an ad-hoc token_metadata. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 16:10:06 +03:00
Benny Halevy	4d2561ff75	abstract_replication_strategy: precacluate get_replication_factor for effective_replication_map Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 16:10:06 +03:00
Benny Halevy	dfdc8d4ddb	abstract_replication_strategy: move get_ranges and get_primary_ranges* to effective_replication_map Provide a sync get_ranges method by effective_replication_map that uses the precalculated map to get all token ranges owned by or replicated on a given endpoint. Reuse do_get_ranges as common infrastructure for all 3 cases: get_ranges, get_primary_ranges, and get_primary_ranges_within_dc. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 16:09:51 +03:00
Benny Halevy	5483269dfb	compaction_manager: pass owned_ranges via cleanup/upgrade options So they can be easily computed using an async task before constructing the compaction object in a following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 14:17:46 +03:00
Benny Halevy	3393df45eb	token_metadata, storage_service: unify token_metadata_lock and merge_lock. Serialize the metadata changes with keyspace create, update, or drop. This will become necessary in the following patch when we update the effective_replication_map on all keyspaces and we want instances on all shards end up with the same replication map. Note that storage_service::keyspace_changed is called from the scheme_merge path so it already holds the merge_lock. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 13:01:25 +03:00
Benny Halevy	eb752c3f69	test: network_topology_strategy_test: use effective_replication_map to get_natural_endpoints Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 12:53:09 +03:00
Benny Halevy	a1c573e6d3	abstract_replication_strategy: make calculate_natural_endpoints_sync private And with that rename calculate_natural_endpoints(const token& search_token, const token_metadata&, can_yield) to do_calculate_natural_endpoints and make it protected, With this patch, all its external users call the async version, so rename it back to calculate_natural_endpoints, and make calculate_natural_endpoints_sync private since it's being called only within abstract_replication_strategy. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 12:39:36 +03:00
Nadav Har'El	33f8ec09df	Merge 'treewide: improve compatibility with gcc 11' from Avi Kivity Our source base drifted away from gcc compatibility; this mostly restores the ability to build with gcc. An important exception is coroutines that have an initializer list [1]; this still doesn't work. We aim to switch back to gcc 11 if/when this gives us better C++ compatibility and performance. Test: unit (dev) [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98056 Closes #9459 * github.com:scylladb/scylla: test: radix_tree_printer: avoid template specialization in class context test: raft: avoid ignored variable errors test: reader_concurrency_semaphore_test: isolate from namespace of source_location test: cql_query_test: drop unused lambda assert_replication_not_contains test: commitlog_test: don't use deprecated seastar::unaligned_cast test: adjust signed/unsigned comparisons in loops and boost tests build: silence some gcc 11 warnings sstables: processing_result_generator: make coroutine support palatable for C++20 compilers managed_bytes: avoid compile-time loop in converting constructor service: service_level_controller: drop unused variable sl_compare raft: disambiguate promise name in raft::active_read locator: azure_snitch: use full type name in definition of globals cql3: statements: create_service_level_statement: don't ignore replace_defaults() cql3: statement_restrictions: adjust call to std::vector deduction guide types: remove recursive constraint in deserialize_value cql3: restrictions: relax constraint on visitor_with_binary_operator_content treewide: handle switch statements that return cql3: expr: correct type of captured map value_type cdc: adjust type of streams_count alternator: disambiguate attrs_to_get in table_requests	2021-10-11 16:54:01 +03:00
Pavel Emelyanov	99d8994835	storage_service: Remove view update generator from It's not used by storage service any longer. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-11 11:09:02 +03:00
Pavel Emelyanov	e106e0571a	distributed_loader: Fix methods visibility Most of the methods are marked public, but only few of them should. Test needs a bit more, however, so the distributed_loader_for_tests is declared as friend class. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-11 11:03:29 +03:00
Michał Radwański	771f3b12bd	memtable: enable native reversing This commit consists of changes, which need to reside in a single commit, so that the tests pass on each of the commits. 1. Remove do_make_flat_reader which disabled reverse reads by making the slice a forward one. Remove call to get_ranges which would do superfluous reversal of clustering ranges. 2. test: cql_query_test: remove expectation that the test_query_limit fails for reversed queries, since reversed queries no longer require linear memory wrt. the result size, when paginated.	2021-10-10 20:38:18 +02:00
Avi Kivity	cdb50b1972	test: reader_concurrency_semaphore_test: isolate from namespace of source_location More modern gcc uses std::source_location instead of std::experimental::source_location. Rely on seastar::compat to get it right for us.	2021-10-10 18:17:53 +03:00
Avi Kivity	a08bcc0528	test: cql_query_test: drop unused lambda assert_replication_not_contains gcc complains that it exists.	2021-10-10 18:17:53 +03:00
Avi Kivity	9166d1ab1d	test: commitlog_test: don't use deprecated seastar::unaligned_cast unaligned_cast is deprecated, and gcc complains that it violates strict aliasing rules. Switch to std::copy_n() instead.	2021-10-10 18:17:53 +03:00
Avi Kivity	9907303bf5	test: adjust signed/unsigned comparisons in loops and boost tests gcc complains about comparing a signed loop induction variable with an unsigned limit, or comparing an expected value and measured value. Fix by using unsigned throughout, except in one case where the signed value was needed for the data_value constructor.	2021-10-10 18:16:50 +03:00
Kamil Braun	96f18c4bb0	test: test_sstable_reversing_reader_random_schema: fix the workaround for #9352 The test generates random mutations and eliminates mutations whose keys tokenize to 0, in particular it eliminates mutations with empty partition keys (which should not end up in sstables). However it would do that after using the randomly generated mutations to create their reversed versions. So the reversed versions of mutations with empty partition keys would stay. Fix by placing the workaround earlier in the test. Closes #9447	2021-10-07 14:01:43 +03:00
Benny Halevy	90fd4d5ed7	test: sstable_conforms_to_mutation_source_test: test_sstable_reversing_reader_random_schema: auto-close reader on exception I stumbled upon this failure in dev mode: ``` test/boost/sstable_conforms_to_mutation_source_test.cc(0): Entering test case "test_sstable_reversing_reader_random_schema" sstable_conforms_to_mutation_source_test: ./seastar/src/core/fstream.cc:205: virtual seastar::file_data_source_impl::~file_data_source_impl(): Assertion `_reads_in_progress == 0' failed. Aborting on shard 0. ``` Since dev mode has no debug symbols I can't decode the stack trace so I'm not 100% sure about the root cause and I couldn't reproduce it in release or debug modes yet. One vulnerability in the current code is that r1 won't be closed if an exception is thrown before r1 and r2 are moved to `compare_readers` so this change adds a deferred close of r1 in this case. Test: sstable_conforms_to_mutation_source_test(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211006144009.696412-1-bhalevy@scylladb.com>	2021-10-06 17:53:49 +03:00
Avi Kivity	2d25705db0	cql3: deinline non-trivial methods in selection.hh This allows us to forward-declare raw_selector, which in turn reduces indirect inclusions of expression.hh from 147 to 58, reducing rebuilds when anything in that area changes. Includes that were lost due to the change are restored in individual translation units. Closes #9434	2021-10-05 12:58:55 +02:00
Avi Kivity	d3f8148807	utils: untie rjson.hh from base64.hh base64.hh pulls in the huge rjson.hh, so if someone just wants a base64 codec they have to pull in the entire rapidjson library. Move the json related parts of base64.hh to rjson.hh and adjust includes and namespaces. In practice it doesn't make much difference, as all users of base64 appear to want json too. But it's cleaner not to mix the two. Closes #9433	2021-10-05 12:57:54 +02:00
Avi Kivity	3a67c661d4	Merge "Improve parallelizm of mutation source tests" from Pavel E " There's a run_mutation_source_tests lib helper that runs a bunch of tests sequentially. The problem is that it does 4 different flavors of it each being a certain decoration over provided reader. This amplification makes some test cases run enormous amount of time without any chance for parallelizm. The simplest way to help running those cases in parallel is to teach the slowest cases to run different flavors of mutation source tests in dedicated cases. This patch makes it so. The resulting timings are dev debug sequential run: 2m1s 53m50s --parallel-cases (+ this patch): 1m3s 31m15s tests: unit(dev, debug) " * 'br-parallel-mutation-source-tests' of https://github.com/xemul/scylla: test: Split multishard combining reader case test: Split database test case test: Split run_mutation_source_tests	2021-10-05 12:22:52 +03:00
Kamil Braun	0c24c18d0c	test: cql_query_test: fix test_query_limit for reversed queries (Single-partition) reversed queries are no longer unlimited but some places still treat them as such. This causes, for example, shorter pages for such queries, which breaks a test that expects certain results to come in a single page.	2021-10-05 11:22:39 +02:00
Pavel Emelyanov	b742e6cbb6	test: Split multishard combining reader case All the cases in this test also run mutation source tests and the case with single-fragment buffer takes times more time to execute than the others. Splitting this single case so that it runs mutation source tests flavours in different cases improves the test parallelizm. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-05 11:57:02 +03:00
Pavel Emelyanov	30075094ac	test: Split database test case The test_database_with_data_in_sstables_is_a_mutation_source case runs the mutation source tests in one go. The problem is that on each step a whole new ks:cf is created which takes the majority of the tests time. In the end of the day this case is the slowest one in the suite being up to two times longer (depending on mode) than the #2 on this list. This patch splits the case into 4 so that each mutation source flavor is run in separate case. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-05 11:53:18 +03:00
Tomasz Grabiec	e89b9799b8	Merge 'sstable mx reader: implement reverse single-partition reads' from Kamil Braun Until now reversed queries were implemented inside `querier::consume_page` (more precisely, inside the free function `consume_page` used by `querier::consume_page`) by wrapping the passed-in reader into `make_reversing_reader` and then consuming fragments from the resulting reversed reader. The first couple of commits change that by pushing the reversing down below the `make_combined_reader` call in `table::query`. This allows working on improving reversing for memtables independently from reversing for sstables. We then extend the `index_reader` with functions that allow reading the promoted index in reverse. We introduce `partition_reversing_data_source`, which wraps an sstable data file and returns data buffers with contents of a single chosen partition as if the rows were stored in reverse order. We use the reversing source and the extended index reader in `mx_sstable_mutation_reader` to implement efficient (at least in theory) reversed single-partition reads. The patchset disables cache for reversed reads. Fast-forwarding is not supported in the mx reader for reversed queries at this point. Details in commit messages. Read the commits in topological order for best review experience. Refs: #9134 (not saying "Fixes" because it's only for single-partition queries without forwarding) Closes #9281 * github.com:scylladb/scylla: table: add option to automatically bypass cache for reversed queries test: reverse sstable reader with random schema and random mutations sstables: mx: implement reversed single-partition reads sstables: mx: introduce partition_reversing_data_source sstables: index_reader: add support for iterating over clustering ranges in reverse clustering_key_filter: clustering_key_filter_ranges owning constructor flat_mutation_reader: mention reversed schema in make_reversing_reader docstring clustering_key_filter: document clustering_key_filter_ranges::get_ranges	2021-10-04 15:37:34 +02:00
Kamil Braun	9bf6be5509	test: reverse sstable reader with random schema and random mutations The test generates a random set of mutations and creates two readers: - one by reversing the mutations, creating an sstable out of the result, and querying it in reverse, - one by creating an sstable directly from the mutations and querying it in forward mode. It checks that the readers give equal results. The test already managed to find a bug where offsets returned by the sstable index were interpreted incorrectly as absolute instead of relative. It also helped find another bug unrelated to reversing (#9352). Surprisingly few tests use the random schema and random mutation utilities which seem to be quite powerful.	2021-10-04 15:24:12 +02:00
Avi Kivity	148a12f3da	Merge "Keep storage_service less aware of cdc internals" from Pavel E " The storage_service is involved in the cdc_generation_service guts more than needed. - the bool _for_testing bit is cdc-only - there's API-only cdc_generation_service getter - cdc_g._s. startup code partially sits in s._s. one This patch cleans most of the above leaving only the startup _cdc_gen_id on board. tests: unit(dev) refs: #2795 " * 'br-storage-service-vs-cdc-2' of https://github.com/xemul/scylla: api: Use local sharded<cdc::generation_service> reference main: Push cdc::generation_service via API storage_service: Ditch for_testing boolean cdc: Replace db::config with generation_service::config cdc: Drop db::config from description_generator cdc: Remove all arguments from maybe_rewrite_streams_descriptions cdc: Move maybe_rewrite_streams_descriptions into after_join cdc: Squash two methods into one cdc: Turn make_new_cdc_generation a service method cdc: Remove ring-delay arg from make_new_cdc_generation cdc: Keep database reference on generation_service	2021-10-04 14:56:05 +03:00
Avi Kivity	1bac93e075	Merge "simplifications and layer violation fix for compaction manager" from Raphael "This series removes layer violation in compaction, and also simplifies compaction manager and how it interacts with compaction procedure." * 'compaction_manager_layer_violation_fix/v4' of github.com:raphaelsc/scylla: compaction: split compaction info and data for control compaction_manager: use task when stopping a given compaction type compaction: remove start_size and end_size from compaction_info compaction_manager: introduce helpers for task compaction_manager: introduce explicit ctor for task compaction: kill sstables field in compaction_info compaction: kill table pointer in compaction_info compaction: simplify procedure to stop ongoing compactions compaction: move management of compaction_info to compaction_manager compaction: move output run id from compaction_info into task	2021-10-04 13:09:31 +03:00
Eliran Sinvani	c38ceafdcf	Service Level Controller: Add an extention point to the API (#9374 ) In order to ease future extensions to the information being sent by the service level configuration change API, we pack the additional parameters (other the the service level options) to the interface in a structure. This will allow an easy expansion in the future if more parameters needs to be sent to the observer.i Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2021-10-01 10:20:28 +03:00
Raphael S. Carvalho	9067a13eac	compaction: split compaction info and data for control compaction_info must only contain info data to be exported to the outside world, whereas compaction_data will contain data for controlling compaction behavior and stats which change as compaction progresses. This separation makes the interface clearer, also allowing for future improvements like removing direct references to table in compaction. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2021-09-30 13:16:57 -03:00
Raphael S. Carvalho	4ce745e0b6	compaction: simplify procedure to stop ongoing compactions Today, compactions are tracked by both _compactions and _tasks, where _compactions refer to actual ongoing compaction tasks, whereas _tasks refer to manager tasks which is responsible for spawning new compactions, retry them on failure, etc. As each task can only have one ongoing compaction at a time, let's move compaction into task, such that manager won't have to look at both when deciding to do something like stopping a task. So stopping a task becomes simpler, and duplication is naturally gone. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2021-09-30 13:16:21 -03:00
Raphael S. Carvalho	efed06e2e4	compaction: move management of compaction_info to compaction_manager Today, compaction is calling compaction manager to register / deregister the compaction_info created by it. This is a layer violation because manager sits one layer above compaction, so manager should be responsible for managing compaction info. From now on, compaction_info will be created and managed by compaction_manager. compaction will only have a reference to info, which it can use to update the world about compaction progress. This will allow compaction_manager to be simplified as info can be coupled with its respective task, allowing duplication to be removed and layer violation to be fixed. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2021-09-30 13:15:00 -03:00
Raphael S. Carvalho	1f5b17fdc5	compaction: move output run id from compaction_info into task this run id is used to track partial runs that are being written to. let's move it from info into task, as this is not an external info, but rather one that belongs to compaction_manager. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2021-09-30 13:13:20 -03:00
Pavel Emelyanov	f669fbd230	storage_service: Ditch for_testing boolean Nowadays it purely controls whether or not to inject delays into timestamps generation by cdc. The same effect can be achieved by configuring the cdc::generation_service directly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 16:04:12 +03:00
Pavel Emelyanov	db623c5f64	cdc: Replace db::config with generation_service::config This is to push the service towards general idea that each component should have its own config and db::config to stay in main. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 16:04:12 +03:00
Pavel Emelyanov	5e2a049266	cdc: Keep database reference on generation_service The service effectively depends on it when rewrites streams descriptions. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 15:34:03 +03:00
Pavel Emelyanov	e6b920017a	main: Replace cql_config_updater with updateable_value The cql_config_updater is a sharded<> service that exists in main and whose goal is to make sure some db::config's values are propagated into cql_config. There's a more handy updateable_value<> glue for that. tests: unit(dev) refs: #2795 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20210927090402.25980-1-xemul@scylladb.com>	2021-09-30 07:23:43 +03:00
Botond Dénes	970fe9a339	mutation_writer: partition_based_splitting_writer: limit number of max buckets Recently we observed an OOM caused by the partition based splitting writer going crazy, creating 1.7K buckets while scrubbing an especially broken sstable. To avoid situations like that in the future, this patch provides a max limit for the number of live buckets. When the number of buckets reach this number, the largest bucket is closed and replaced by a bucket. This will end up creating more output sstables during scrub overall, but now they won't all be written at the same time causing insane memory pressure and possibly OOM. Scrub compaction sets this limit to 100, the same limit the TWCS's timestamp based splitting writer uses (implemented through the classifier - time_window_compaction_strategy::max_data_segregation_window_count). Fixes: #9400 Tests: unit(dev) Closes #9401	2021-09-29 16:31:29 +03:00
Avi Kivity	b3c95a1fc6	commitlog: reduce inclusions of commitlog.hh due to db::commitlog::force_sync (#9379 ) There are now 231 translation units that indirectly include commitlog.hh due to the need to have access to db::commitlog::force_sync. Move that type to a new file commitlog_types.hh and make it available without access to the commitlog class. This reduces the number of translation units that depend on commitlog.hh to 84, improving compile time.	2021-09-29 16:13:44 +03:00
Tomasz Grabiec	11a3b411c5	Merge 'mutation_source_test: test reverse reads' from Botond Dénes Currently no mutation-source supports reading in reverse natively but we are working on changing that, adding native reverse read support to memtable, cache and sstable readers. To ensure that all mutation sources work in a correct and uniform manner when reading in reverse, we add a reverse test to the mutation source test suite. This test reverses the data that it passes to `populate()`, then reads in forward order (in reverse compared to the data order). For this we use the currently established reverse read API: reverse schema (schema order == query order) and half-reversed (legacy) slice. All mutation sources are prepared to work with reversed reads, using the `make_reversing_reader()` adapter. As we progress with our native reverse support, we will replace these adapters with native reversing support. As part of this, we push down the reversing reader adapter currently existing on the `query::consume_page()` level, to the individual mutation sources. Closes #9384 * github.com:scylladb/scylla: test: mutation_reader_test: reversed version of test_clustering_order_merger_sstable_set querier: consume_page(): remove now unused max_size parameter test/lib: mutation_source_test: test reading in reverse test: mutation_reader_test: clustering_combined_reader_mutation_source_test: prepare for reading in reverse test: flat_mutation_reader_test: test_reverse_reader_is_mutation_source: prepare for reading in reverse test: mutation_reader_test: test_manual_paused_evictable_reader_is_mutation_source: use query schema instead of table schema treewide: move reversing to the mutation sources mutation_query: reconcilable_result_builder: document reverse query preconditions sstable_set: time_series_sstable_set: reverse mode mutlishard_mutation_query: set max result size on used permits db/virtual_table: streaming_virtual_table::as_mutation_source(): use query schema instead of table schema flat_mutation_reader: make_reversing_reader(): add convenience stored slice mutation_reader: evictable_reader: add reverse read support flat_mutation_reader: make_flat_mutation_reader_from_fragments(): add reverse read support flat_mutation_reader: flat_mutation_reader_from_mutations(): add reverse read support flat_mutation_reader: flat_mutation_reader_from_mutations(): document preconditions query-request: introduce `half_reverse_slice` flat_mutation_reader_assertions: log what's expected	2021-09-29 12:57:57 +02:00
Avi Kivity	d4aa6c2746	Merge "compaction: Update backlog tracker correctly when schema is updated" from Raphael " Backlog tracker isn't updated correctly when facing a schema change, and may leak a SSTable if compaction strategy is changed, which causes backlog to be computed incorrectly. Most of these problems happen because sstable set and tracker are updated independently, so it could happen that tracker lose track (pun intended) of changes applied to set. The first patch will fix the leak when strategy is changed, and the third patch will make sure that tracker is updated atomically with sstable set, so these kind of problems will not happen anymore. Fixes #9157 " * 'fixes_to_backlog_tracker_v4' of github.com:raphaelsc/scylla: compaction: Update backlog tracker correctly when schema is updated compaction: Don't leak backlog of input sstable when compaction strategy is changed compaction: introduce compaction_read_monitor_generator::remove_exhausted_sstables() compaction: simplify removal of monitors	2021-09-29 13:55:37 +03:00
Kamil Braun	075a894a89	test: mutation_reader_test: reversed version of test_clustering_order_merger_sstable_set	2021-09-29 12:15:48 +03:00

1 2 3 4 5 ...

1291 Commits