scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-27 11:55:15 +00:00

Author	SHA1	Message	Date
Benny Halevy	5e41228fe8	test: everywhere: use seastar::testing::local_random_engine Use the thread_local seastar::testing::local_random_engine in all seastar tests so they can be reproduced using the --random-seed option. Test: unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210112103713.578301-2-bhalevy@scylladb.com>	2021-01-13 11:07:29 +02:00
Dejan Mircevski	d79c2cab63	cql3: Use correct comparator in timeuuid min/max The min/max aggregators use aggregate_type_for comparators, and the aggregate_type_for<timeuuid> is regular uuid. But that yields wrong results; timeuuids should be compared as timestamps. Fix it by changing aggregate_type_for<timeuuid> from uuid to timeuuid, so aggregators can distinguish betwen the two. Then specialize the aggregation utilities for timeuuid. Add a cql-pytest and change some unit tests, which relied on naive uuid comparators. Fixes #7729. Tests: unit (dev, debug) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #7910	2021-01-13 11:07:29 +02:00
Avi Kivity	96d64b7a1f	Merge "Wire interposer consumer for memtable flush" from Raphael " Without interposer consumer on flush, it could happen that a new sstable, produced by memtable flush, will not conform to the strategy invariant. For example, with TWCS, this new sstable could span multiple time windows, making it hard for the strategy to purge expired data. If interposer is enabled, the data will be correctly segregated into different sstables, each one spanning a single window. Fixes #4617. tests: - mode(dev). - manually tested it by forcing a flush of memtable spanning many windows " * 'segregation_on_flush_v2' of github.com:raphaelsc/scylla: test: Add test for TWCS interposer on memtable flush table: Wire interposer consumer for memtable flush table: Add write_memtable_to_sstable variant which accepts flat_mutation_reader table: Allow sstable write permit to be shared across monitors memtable: Track min timestamp table: Extend cache update to operate a memtable split into multiple sstables	2021-01-13 11:07:29 +02:00
Botond Dénes	4b254a26ab	test/boost/sstable_datafile_test: sstable_scrub_test: disable key validation The test violates clustering key order on purpose to produce a corrupt sstable (to test scrub). Disable key validation so when we move the validator into the writer itself in the next patch it doesn't abort the test.	2021-01-11 09:12:56 +02:00
Raphael S. Carvalho	d265bb9bdb	test: Add test for TWCS interposer on memtable flush Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2021-01-04 16:55:06 -03:00
Piotr Sarna	e26aa836a9	schema_change_test: skip distributed system tables in digest With previous design of the schema change test, a regeneration was necessary each time a new distributed system table was added. It was not the original purpose of the test to keep track of new distributed tables which simply propagate on their own, so the test case is now modified: internal distributed tables are not part of the schema digest anymore, which means that changes inside them will not cause mismatches. This change involves a one-shot regeneration of all digests, which due to historical reasons included internal distributed tables in the digest, but no further regenerations should ever be necessary when a new internal distributed table is added.	2021-01-04 10:24:40 +01:00
Gleb Natapov	d3aa17591c	migration_manager: drop announce_locally flag It looks like the history of the flag begins in Cassandra's https://issues.apache.org/jira/browse/CASSANDRA-7327 where it is introduced to speedup tests by not needing to start the gossiper. The thing is we always start gossiper in our cql tests, so the flag only introduce noise. And, of course, since we want to move schema to use raft it goes against the nature of the raft to be able to apply modification only locally, so we better get rid of the capability ASAP. Tests: units(dev, debug) Message-Id: <20201230111101.4037543-2-gleb@scylladb.com>	2021-01-03 13:58:09 +02:00
Avi Kivity	f7b731bc46	Merge 'Fix potential reactor stall on LCS compaction completion' from Raphael Carvalho On every compaction completion, sstable set is rebuilt from scratch. With LCS and ~160G of data per shard, it means we'll have to create a new sstable set with ~1000 entries whenever compaction completes, which will likely result in reactor stalling for a significant amount of time. Fixes #7758. Closes #7842 * github.com:scylladb/scylla: table: Fix potential reactor stall on LCS compaction completion table: decouple preparation from execution when updating sstable set table: change rebuild_sstable_list to return new sstable set row_cache: allow external updater to decouple preparation from execution	2020-12-28 21:16:17 +02:00
Pavel Emelyanov	7ac435f67c	test: Enhance test for range_tombstone_list de-overlapping The range_tombstone_list always (unless misused?) contains de-overlapped entries. There's a test_add_random that checks this, but it suffers from several problems: - generated "random" ranges are sequential and may only overlap on their borders - test uses the keys of the same prefix length Enhance the generator part to produce a purely random sequence of ranges with bound keys of arbitrary length. Just pay attention to generate the "valid" individual ranges, whose start is not ahead of the end. Also -- rename the test to reflect what it's doing and increase the number of iterations. tests: unit(dev) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20201228115525.20327-1-xemul@scylladb.com>	2020-12-28 18:26:48 +02:00
Raphael S. Carvalho	8dd7280107	table: Fix potential reactor stall on LCS compaction completion On every compaction completion, sstable set is rebuilt from scratch. With LCS and ~160G of data per shard, it means we'll have to create a new sstable set with ~1000 entries whenever compaction completes, which will likely result in reactor stalling for a significant amount of time. This is fixed by futurizing build_new_sstable_list(), so it will yield whenever needed. Fixes #7758. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-12-28 13:17:50 -03:00
Raphael S. Carvalho	43f0200b8f	table: change rebuild_sstable_list to return new sstable set procedure is changed to return the new set, so caller will be responsible for replacing the old set with the new one. this will allow our future work where building new set and enabling it will be decoupled. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-12-28 13:17:47 -03:00
Raphael S. Carvalho	198b87503f	row_cache: allow external updater to decouple preparation from execution External updater may do some preparatory work like constructing a new sstable list, and at the end atomically replace the old list by the new one. Decoupling the preparation from execution will give us the following benefits: - the preparation step can now yield if needed to avoid reactor stalls, as it's been futurized. - the execution step will now be able to provide strong exception guarantees, as it's now decoupled from the preparation step which can be non-exception-safe. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-12-28 13:17:45 -03:00
Benny Halevy	56aa49ca81	token_metadata: shared_token_metadata: add mutate_token_metadata mutate_token_metadata acquires the shared_token_metadata lock, clones the token_metadata (using clone_async) and calls an asynchronous functor on the cloned copy of the token_metadata to mutate it. If the functor is successful, the mutated clone is set back to to the shared_token_metadata, otherwise, the clone is destroyed. With that, get rid of shared_token_metadata::clone Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-22 11:22:19 +02:00
Benny Halevy	e089c22ec1	token_metdata: futurize update_normal_tokens The function complexity if O(#tokens) in the worst case as for each endpoint token to traverses _token_to_endpoint_map lineraly to erase the endpoint mapping if it exists. This change renames the current implementation of update_normal_tokens to update_normal_tokens_sync and clones the code as a coroutine that returns a future and may yield if needed. Eventually we should futurize the whole token_metadata and abstract_replication_strategy interface and get rid of the synchronous functions. Until then the sync version is still required from call sites that are neither returning a future nor run in a seastar thread. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-22 10:35:15 +02:00
Avi Kivity	b3e39d81aa	Merge 'Avoid scanning sstables in parallel for TWCS single-partition queries' from Kamil Braun We introduce a new single-key sstable reader for sstables created by `TimeWindowCompactionStrategy`. The reader uses the fact that sstables created by TWCS are mostly disjoint with respect to the contained `position_in_partition`s in order to avoid having multiple sstable readers opened at the same time unnecessarily. In case there are overlapping ranges (for example, in the current time-window), it performs the necessary merging (it uses `clustering_order_reader_merger`, introduced recently). The reader uses min/max clustering key metadata present in `md` sstables in order to decide when to open or close a sstable reader. The following experiment was performed: 1. create a TWCS table with 1 minute windows 2. fill the table with 8 equal windows of data (each window flushed to a separate sstable) 3. perform `select * from ks.t where pk = 0 limit 1` query with and without the change The expectation is that with the commit, only one sstable will be opened to fetch that one row; without the commit all 8 sstables would be opened at once. The difference in the value of `scylla_reactor_aio_bytes_read` was measured (value after the query minus value before the query), both with and without the commit. With the commit, the difference was 67584. Without the commit, the difference was 528384. 528384 / 67584 ~= 7.8. Fixes #6418. Closes #7437 * github.com:scylladb/scylla: sstables: gather clustering key filtering statistics in TWCS single key reader sstables: use time_series_sstable_set in time_window_compaction_strategy sstable_set: new reader for TWCS single partition queries mutation_reader_test: test clustering_order_reader_merger with time_series_sstable_set sstable_set: introduce min_position_reader_queue sstable_set: introduce time_series_sstable_set sstables: add min_position and max_position accessors sstable_set: make create_single_key_sstable_reader a virtual method clustering_order_reader_merger: fix the 0 readers case	2020-12-19 23:53:18 +02:00
Kamil Braun	b41139a07f	mutation_reader_test: test clustering_order_reader_merger with time_series_sstable_set	2020-12-18 16:33:27 +01:00
Kamil Braun	5e846b33b8	clustering_order_reader_merger: fix the 0 readers case With 0 readers the merger would produce a `partition_end` fragment when it should immediately return `end_of_stream` instead.	2020-12-18 12:30:40 +01:00
Avi Kivity	a60c81b615	Merge 'cql3: Fix handling of impossible restrictions on a primary-key column' from Dejan Mircevski There were two problems with handling conflicting equalities on the same PK column (eg, c=1 AND c=0): 1. When the column is indexed, Scylla crashed (#7772) 2. Computing ranges and slices was throwing an exception This series fixes them both; it also happens to resolve some old TODOs from restriction_test. Tests: unit (dev, debug) Closes #7804 * github.com:scylladb/scylla: cql3: Fix value_for when restriction is impossible cql3: Fix range computation for p=1 AND p=1	2020-12-17 12:01:36 +02:00
Dejan Mircevski	46b4b59945	cql3: Fix value_for when restriction is impossible Previously, single_column_restrictions::value_for() assumed that a column's restriction specifies exactly one value for the column. But since `37ebe521e3`, multiple equalities on the same column are allowed, so the restriction could be a conjunction of conflicting equalities (eg, c=1 AND c=0). That violates an assert and crashes Scylla. This patch fixes value_for() by gracefully handling the impossible-restriction case. Fixes #7772 Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-12-16 15:00:29 -05:00
Dejan Mircevski	4bb1107652	cql3: Fix range computation for p=1 AND p=1 Previously compute_bounds was assuming that primary-key columns are restricted by exactly one equality, resulting in the following error: query 'select p from t where p=1 and p=1' failed: std::bad_variant_access (std::get: wrong index for variant) This patch removes that assumption and deals correctly with the multiple-equalities case. As a byproduct, it also stops raising "invalid null value" exceptions for null RHS values. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-12-16 14:46:48 -05:00
Avi Kivity	fdb47c954d	Merge "idl: allow IDL compiler to parse `const` specifiers for template arguments" from Pavel S " This patch series consists of the following patches: 1. The first one turned out to be a massive rewrite of almost everything in `idl-compiler.py`. It aims to decouple parser structures from the internal representation which is used in the code-generation itself. Prior to the patch everything was working with raw token lists and the code was extremely fragile and hard to understand and modify. Moreover, every change in the parser code caused a cascade effect of breaking things at many different places, since they were relying on the exact format of output produced by parsing rules. Now there is a bunch of supplementary AST structures which provide hierarchical and strongly typed structure as the output of parsing routine. It is much easier to verify (by the means of `isinstance`, for example) and extend since the internal structures used in code-generation are decoupled from the structure of parsing rules, which are now controlled by custom parse actions providing high-level abstractions. It is tested manually by checking that the old code produces exactly the same autogenerated sources for all Scylla IDLs as the new one. 2 and 3. Cosmetics changes only: fixed a few typos and moved from old-fashioned `string.Template` to python f-strings. This improves readability of the idl-compiler code by a lot. Only one non-functional whitespace change introduced. 4. This patch adds a very basic support for the parser to understand `const` specifier in case it's used with a template parameter for a data member in a class, e.g. struct my_struct { std::vector<const raft::log_entry> entries; }; It actually does two things: * Adjusts `static_asserts` in corresponding serializer methods to match const-ness of fields. * Defines a second serializer specialization for const type in `.dist.hh` right next to non-const one. This seems to be sufficient for raft-related uses for now. Please note there is no support for the following cases, though: const std::vector<raft::log_entry> entries; const raft::term_t term; None of the existing IDLs are affected by the change, so that we can gradually improve on the feature and write the idl unit-tests to increase test coverage with time. 5. A basic unit-test that writes a test struct with an `std::vector<S<const T>>` field and reads it back to verify that serialization works correctly. 6. Basic documentation for AST classes. TODO: should also update the docs in `docs/IDL.md`. But it is already quite outdated, and some changes would even be out of scope for this patch set. " * 'idl-compiler-refactor-v5' of https://github.com/ManManson/scylla: idl: add docstrings for AST classes idl: add unit-test for `const` specifiers feature idl: allow to parse `const` specifiers for template arguments idl: fix a few typos in idl-compiler idl: switch from `string.Template` to python f-strings and format string in idl-compiler idl: Decouple idl-compiler data structures from grammar structure	2020-12-16 14:05:33 +02:00
Pavel Solodovnikov	1e6df841a5	idl: add unit-test for `const` specifiers feature Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-12-15 16:03:18 +03:00
Avi Kivity	19aaf8eb83	Merge "Remove global storage service from index manager" from Pavel E " The initial intent was to remove call for global storage service from secondary index manager's create_view_for_index(), but while fixing it one of intermediate schema table's helper managed to benefit from it by re-using the database reference flying by. The cleanup is done by simply pushing the database reference along the stack from the code that already has it down the create_view_for_index(). tests: unit(dev) " * 'br-no-storages-in-index-and-schema' of https://github.com/xemul/scylla: schema-tables: Use db from make_update_table_mutations in make_update_indices_mutations schema-tables: Add database argument to make_update_table_mutations schema-tables: Factor out calls getting database instance index-manager: Move feature evaluation one level up	2020-12-13 12:41:51 +02:00
Pekka Enberg	c990f2bd34	Merge 'Reinstate [[nodiscard]] support' from Avi Kivity The switch to clang disabled the clang-specific -Wunused-value since it generated some harmless warnings. Unfortunately, that also prevent [[nodiscard]] violations from warning. Fix by clearing all instances of the warning (including [[nodiscard]] violations that crept in while it was disabled) and reinstating the warning. Closes #7767 * github.com:scylladb/scylla: build: reinstate -Wunused-value warning for [[nodiscard]] test: lib: don't ignore future in compare_readers() test: mutation_test: check both ranges when comparing summaries serialializer: silence unused value warning in variant deserializer	2020-12-12 09:54:05 +02:00
Pavel Emelyanov	89fd524c5a	schema-tables: Add database argument to make_update_table_mutations There are 3 callers of this helper (cdc, migration manager and tests) and all of them already have the database object at hands. The argument will be used by next patch to remove call for global storage proxy instance from make_update_indices_mutations. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-12-11 21:21:22 +03:00
Raphael S. Carvalho	e4b55f40f3	sstables: Fix sstable reshaping for STCS The heuristic of STCS reshape is correct, and it built the compaction descriptor correctly, but forgot to return it to the caller, so no reshape was ever done on behalf of STCS even when the strategy needed it. Fixes #7774. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20201209175044.1609102-1-raphaelsc@scylladb.com>	2020-12-10 12:45:25 +02:00
Avi Kivity	f802356572	Revert "Revert "Merge "raft: fix replication if existing log on leader" from Gleb"" This reverts commit `dc77d128e9`. It was reverted due to a strange and unexplained diff, which is now explained. The HEAD on the working directory being pulled from was set back, so git thought it was merging the intended commits, plus all the work that was committed from HEAD to master. So it is safe to restore it.	2020-12-08 19:19:55 +02:00
Pavel Emelyanov	b837cf25b1	test: Make multishard_mutation_query test do less scans When built by clang this dev-mode test takes ~30 minutes to complete. Let's reduce this time by reducing the scale of the test if DEVEL is set. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-12-08 15:55:04 +03:00
Avi Kivity	732d83dc0e	test: mutation_test: check both ranges when comparing summaries A copy/paste error means we ignore the termination of one of the ranges. Change the comma expression to a disjunction to avoid the unused value warning from clang. The code is not perfect, since if the two ranges are not the same size we'll invoke undefined behavior, but it is no worse than before (where we ignored the comparison completely).	2020-12-07 16:47:52 +02:00
Pavel Solodovnikov	56c0fcfcb2	cql_query_test: handle `bounce_to_shard` msg in `test_null_value_tuple_floating_types_and_uuids` Use `prepared_on_shard` helper function to handle `bounce_to_shard` messages that can happen when using LWT statements. Fixes: #7757 Tests: unit(dev) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20201204172944.601730-1-pa.solodovnikov@scylladb.com>	2020-12-06 19:34:13 +02:00
Avi Kivity	ca950e6f08	Merge "Remove get_local_storage_service() from counters" from Pavel E " The storage service is called there to get the cached value of db::system_keyspace::get_local_host_id(). Keeping the value on database decouples it from storage service and kills one more global storage service reference. tests: unit(dev) " * 'br-remove-storage-service-from-counters-2' of https://github.com/xemul/scylla: counters: Drop call to get_local_storage_service and related counters: Use local id arg in transform_counter_update_to_shards database: Have local id arg in transform_counter_updates_to_shards() storage_service: Keep local host id to database	2020-12-06 16:15:21 +02:00
Avi Kivity	dc77d128e9	Revert "Merge "raft: fix replication if existing log on leader" from Gleb" This reverts commit `0aa1f7c70a`, reversing changes made to `72c59e8000`. The diff is strange, including unrelated commits. There is no understanding of the cause, so to be safe, revert and try again.	2020-12-06 11:34:19 +02:00
Pavel Emelyanov	62214e2258	database: Have local id arg in transform_counter_updates_to_shards() There are two places that call it -- database code itself and tests. The former already has the local host id, so just pass one. The latter are a bit trickier. Currently they use the value from storage_service created by storage_service_for_tests, but since this version of service doesn't pass through prepare_to_join() the local_host_id value there is default-initialized, so just default-initialize the needed argument in place. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-12-04 15:09:30 +03:00
Avi Kivity	a95c2a946c	Merge 'mutation_reader: introduce clustering_order_reader_merger' from Kamil Braun This abstraction is used to merge the output of multiple readers, each opened for a single partition query, into a non-decreasing stream of mutation_fragments. It is similar to `mutation_reader_merger`, but an important difference is that the new merger may select new readers in the middle of a partition after it already returned some fragments from that partition. It uses the new `position_reader_queue` abstraction to select new readers. It doesn't support multi-partition (ring range) queries. The new merger will be later used when reading from sstable sets created by TimeWindowCompactionStrategy. This strategy creates many sstables that are mostly disjoint w.r.t the contained clustering keys, so we can delay opening sstable readers when querying a partition until after we have processed all mutation fragments with positions before the keys contained by these sstables. A microbenchmark was added that compares the existing combining reader (which uses `mutation_reader_merger` underneath) with a new combining reader built using the new `clustering_order_reader_merger` and a simple queue of readers that returns readers from some supplied set. The used set of readers is built from the following ranges of keys (each range corresponds to a single reader): `[0, 31]`, `[30, 61]`, `[60, 91]`, `[90, 121]`, `[120, 151]`. The microbenchmark runs the reader and divides the result by the number of mutation fragments. The results on my laptop were: ``` $ build/release/test/perf/perf_mutation_readers -t clustering_combined.* -r 10 single run iterations: 0 single run duration: 1.000s number of runs: 10 test iterations median mad min max clustering_combined.ranges_generic 2911678 117.598ns 0.685ns 116.175ns 119.482ns clustering_combined.ranges_specialized 3005618 111.015ns 0.349ns 110.063ns 111.840ns ``` `ranges_generic` denotes the existing combining reader, `ranges_specialized` denotes the new reader. Split from https://github.com/scylladb/scylla/pull/7437. Closes #7688 * github.com:scylladb/scylla: tests: mutation_source_test for clustering_order_reader_merger perf: microbenchmark for clustering_order_reader_merger mutation_reader_test: test clustering_order_reader_merger in memory test: generalize `random_subset` and move to header mutation_reader: introduce clustering_order_reader_merger	2020-12-02 12:15:35 +02:00
Kamil Braun	502ed2e9f7	tests: mutation_source_test for clustering_order_reader_merger	2020-12-02 11:13:58 +01:00
Kamil Braun	2da723b9c8	cdc: produce postimage when inserting with no regular columns When a row was inserted into a table with no regular columns, and no such row existed in the first place, postimage would not be produced. Fix this. Fixes #7716. Closes #7723	2020-12-01 18:01:23 +02:00
Dejan Mircevski	e45af3b9b8	index: Ensure restriction is supported in find_idx Previously, statement_restrictions::find_idx() would happily return an index for a non-EQ restriction (because it checked only the column name, not the operator). This is incorrect: when the selected index is for a non-EQ restriction, it is impossible to query that index table. Fixes #7659. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #7665	2020-12-01 15:16:48 +02:00
Kamil Braun	4f7e2bf920	mutation_reader_test: test clustering_order_reader_merger in memory	2020-11-30 11:55:44 +01:00
Kamil Braun	0b36c5e116	mutation_reader: introduce clustering_order_reader_merger This abstraction is used to merge the output of multiple readers, each opened for a single partition query, into a non-decreasing stream of mutation_fragments. It is similar to `mutation_reader_merger`, an important difference is that the new merger may select new readers in the middle of a partition after it already returned some fragments from that partition. It uses the new `position_reader_queue` abstraction to select new readers. It doesn't support multi-partition (ring range) queries. The new merger will be later used when reading from sstable sets created by TimeWindowCompactionStrategy. This strategy creates many sstables that are mostly disjoint w.r.t the contained clustering keys, so we can delay opening sstable readers when querying a partition until after we have processed all mutation fragments with positions before the keys contained by these sstables.	2020-11-30 11:55:44 +01:00
Avi Kivity	ea9c058be3	Merge 'Don't use secondary indices for multi-column restrictions' from Dejan Mircevski Fix #7680 by never using secondary index for multi-column restrictions. Modify expr::is_supported_by() to handle multi-column correctly. Tests: unit (dev) Closes #7699 * github.com:scylladb/scylla: cql3/expr: Clarify multi-column doesn't use indexing cql3: Don't use index for multi-column restrictions test: Add eventually_require_rows	2020-11-30 12:38:26 +02:00
Dejan Mircevski	5bc7e31284	restrictions: Forbid mixing ck=0 and (ck)=(0) Reject the previously accepted case where the multi-column restriction applied to just a single column, as it causes a crash downstream. The user can drop the parentheses to avoid the rejection. Fixes #7710 Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #7712	2020-11-29 17:06:41 +02:00
Avi Kivity	0584db1eb3	Merge "Unstall cleanup_compaction::get_ranges_for_invalidation" from Benny " This series adds maybe_yield called from cleanup_compaction::get_ranges_for_invalidation to avoid reactor stalls. To achieve that, we first extract bool_class can_yield to utils/maybe_yield.hh, and add a convience helper: utils::maybe_yield(can_yield) that conditionally calls seastar::thread::maybe_yield if it can (when called in a seastar thread). With that, we add a can_yield parameter to dht::to_partition_ranges and dht::partition_range::deoverlap (defaults to false), and use it from cleanup_compaction::get_ranges_for_invalidation, as the latter is always called from `consume_in_thread`. Fixes #7674 Test: unit(dev) " * tag 'unstall-get_ranges_for_invalidation-v2' of github.com:bhalevy/scylla: compaction: cleanup_compaction: get_ranges_for_invalidation: add yield points dht/i_partitioner: to_partition_ranges: support yielding locator: extract can_yield to utils/maybe_yield.hh	2020-11-29 14:10:39 +02:00
Dejan Mircevski	db63b40347	cql3: Don't use index for multi-column restrictions The downstream code expects a single-column restriction when using an index. We could fix it, but we'd still have to filter the rows fetched from the index table, unlike the code that queries the base table directly. For instance, WHERE (c1,c2,c3) = (1,2,3) with an index on c3 can fetch just the right rows from the base table but all the c3=3 rows from the index table. Fixes #7680 Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-11-25 10:39:04 -05:00
Gleb Natapov	6130fb8b39	raft: commit a dummy entry after leader change After a node becomes leader it needs to do two things: send an append message to establish its leadership and commit one entry to make sure all previous entries with smaller terms are committed as well.	2020-11-24 11:35:18 +01:00
Benny Halevy	157a964a63	locator: extract can_yield to utils/maybe_yield.hh Move the definition of bool_class can_yield to a standalone header file and define there a maybe_yield(can_yield) helper. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-24 12:23:56 +02:00
Avi Kivity	e8ff77c05f	Merge 'sstables: a bunch of refactors' from Kamil Braun 1. sstables: move `sstable_set` implementations to a separate module All the implementations were kept in sstables/compaction_strategy.cc which is quite large even without them. `sstable_set` already had its own header file, now it gets its own implementation file. The declarations of implementation classes and interfaces (`sstable_set_impl`, `bag_sstable_set`, and so on) were also exposed in a header file, sstable_set_impl.hh, for the purposes of potential unit testing. 2. mutation_reader: move `mutation_reader::forwarding` to flat_mutation_reader.hh Files which need this definition won't have to include mutation_reader.hh, only flat_mutation_reader.hh (so the inclusions are in total smaller; mutation_reader.hh includes flat_mutation_reader.hh). 3. sstables: move sstable reader creation functions to `sstable_set` Lower level functions such as `create_single_key_sstable_reader` were made methods of `sstable_set`. The motivation is that each concrete sstable_set may decide to use a better sstable reading algorithm specific to the data structures used by this sstable_set. For this it needs to access the set's internals. A nice side effect is that we moved some code out of table.cc and database.hh which are huge files. 4. sstables: pass `ring_position` to `create_single_key_sstable_reader` instead of `partition_range`. It would be best to pass `partition_key` or `decorated_key` here. However, the implementation of this function needs a `partition_range` to pass into `sstable_set::select`, and `partition_range` must be constructed from `ring_position`s. We could create the `ring_position` internally from the key but that would involve a copy which we want to avoid. 5. sstable_set: refactor `filter_sstable_for_reader_by_pk` Introduce a `make_pk_filter` function, which given a ring position, returns a boolean function (a filter) that given a sstable, tells whether the sstable may contain rows with the given position. The logic has been extracted from `filter_sstable_for_reader_by_pk`. Split from #7437. Closes #7655 * github.com:scylladb/scylla: sstable_set: refactor filter_sstable_for_reader_by_pk sstables: pass ring_position to create_single_key_sstable_reader sstables: move sstable reader creation functions to `sstable_set` mutation_reader: move mutation_reader::forwarding to flat_mutation_reader.hh sstables: move sstable_set implementations to a separate module	2020-11-24 09:23:57 +02:00
Kamil Braun	d158921966	sstables: add `may_have_partition_tombstones` method For sstable versions greater or equal than md, the `min_max_column_names` sstable metadata gives a range of position-in-partitions such that all clustering rows stored in this sstable have positions in this range. Partition tombstones in this context are understood as covering the entire range of clustering keys; thus, if the sstable contains at least one partition tombstone, the sstable position range is set to be the range of all clustered rows. Therefore, by checking that the position range is not the range of all clustered rows we know that the sstable cannot have any partition tombstones. Closes #7678	2020-11-23 23:30:19 +02:00
Piotr Sarna	5a9dc6a3cc	Merge 'Cleanup CDC tests after CDC became GA' from Piotr Jastrzębski Now that CDC is GA, it should be enabled in all the tests by default. To achieve that the PR adds a special db::config::add_cdc_extension() helper which is used in cql_test_envm to make sure CDC is usable in all the tests that use cql_test_env.m As a result, cdc_tests can be simplified. Finally, some trailing whitespaces are removed from cdc_tests. Tests: unit(dev) Closes #7657 * github.com:scylladb/scylla: cdc: Remove trailing whitespaces from cdc_tests cdc: Remove mk_cdc_test_config from tests config: Add add_cdc_extension function for testing cdc: Add missing includes to cdc_extension.hh	2020-11-20 13:56:29 +01:00
Kamil Braun	40d8bfa394	sstables: move sstable reader creation functions to `sstable_set` Lower level functions such as `create_single_key_sstable_reader` were made methods of `sstable_set`. The motivation is that each concrete sstable_set may decide to use a better sstable reading algorithm specific to the data structures used by this sstable_set. For this it needs to access the set's internals. A nice side effect is that we moved some code out of table.cc and database.hh which are huge files.	2020-11-19 17:52:39 +01:00
Avi Kivity	70689088fd	Merge "Remove reference on database from global qctx" from Pavel E " The qctx is global object that references query processor and database to let the rest of the code query system keyspace. As the first step of de-globalizing it -- remove the database reference from it. After the set the qctx remains a simple wrapper over the query processor (which is already de-globalized) and the query processor in turn is mostly needed only to parse the query string into prepared statement only. This, in turn, makes it possible to remove the qctx later by parsing the query strings on boot and carrying _them_ around, not the qctx itself. tests: unit(dev), dtest(simple_cluster_driver_test:dev), manual start/stop " * 'br-remove-database-from-qctx' of https://github.com/xemul/scylla: query-context: Remove database from qctx schema-tables: Use query processor referece in save_system(_keyspace)?_schema system-keyspace: Rewrite force_blocking_flush system-keyspace: Use cluster_name string in check_health system-keyspace: Use db::config in setup_version query-context: Kill global helpers test: Use cql_test_env::evecute_cql instead of qctx version code: Use qctx::evecute_cql methods, not global ones system-keyspace: Do not call minimal_setup for the 2nd time system-keyspace: Fix indentation after previous patch system-keyspace: Do not do invoke_on_all by hands system-keyspace: Remove dead code	2020-11-19 18:31:51 +02:00

1 2 3 4 5 ...

709 Commits