scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-03 21:47:10 +00:00

Author	SHA1	Message	Date
Michael Livshin	fb6c79015a	flat_reader_assertions: do not accumulate out-of-range tombstones Also remove the incorrect difference in range tombstone checking behavior between `produces_range_tombstone()` and `produces(const range_tombstone&)` by having both turn on checking. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-28 17:11:54 +02:00
Michael Livshin	9fa4d9a2bb	flat_reader_assertions: refactor resetting accumulated tombstone lists Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-28 17:11:54 +02:00
Michael Livshin	2221aeff0e	flat_mutation_reader_test: fix "test_flat_mutation_reader_consume_single_partition" Since `flat_reader_assertions::produces(const range_tombstone&,...)` records the range tombstone for checking, be sure to explicitly pass in a clustering range that does not extend beyond the mock-read part of the mutation. Also (provisionally) change the assertion method to accept clustering ranges. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-28 17:11:54 +02:00
Michael Livshin	9bacce4359	memtable::make_flat_reader(): return flat_mutation_reader_v2 This is just a facade change. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-28 17:11:54 +02:00
Michael Livshin	2337d48b41	tests: flat_reader_assertions: improve range tombstone checking `produces_range_tombstone()` is smart enough to not just try to read one range tombstone from the input and compare it to the passed reference, but to read as many range tombstones as the reader is looking at (including none) using `may_produce_tombstones()` and record those appropriately. When `produces(const schema&, const mutation_fragment&)` is passed a range tombstone as the second argument, it does not do anything special -- it just reads one fragment, disregards it (!), and applies its second argument to both "expected" and "encountered" range tombstone lists. The right thing here is to use the same logic as `produces_range_tombstone()`; upcoming memtable-related reader changes (which result in more split range tombstones) cause some unit tests to fail without fixing this. Refactor the relevant logic into a private method (`apply_rt()`) and use that in both places. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-28 17:11:54 +02:00
Raphael S. Carvalho	96cfe7d530	test/lib/sstable_utils: Set bytes_on_disk for fake SSTables Not precise, as bytes_on_disk accounts for all components, but good enough for testing purposes. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-02-24 18:57:45 -03:00
Avi Kivity	cbba80914d	memtable: move to replica module and namespace Memtables are a replica-side entity, and so are moved to the replica module and namespace. Memtables are also used outside the replica, in two places: - in some virtual tables; this is also in some way inside the replica, (virtual readers are installed at the replica level, not the coordinator), so I don't consider it a layering violation - in many sstable unit tests, as a convenient way to create sstables with known input. This is a layering violation. We could make memtables their own module, but I think this is wrong. Memtables are deeply tied into replica memory management, and trying to make them a low-level primitive (at a lower level than sstables) will be difficult. Not least because memtables use sstables. Instead, we should have a memtable-like thing that doesn't support merging and doesn't have all other funky memtable stuff, and instead replace the uses of memtables in sstable tests with some kind of make_flat_mutation_reader_from_unsorted_mutations() that does the sorting that is the reason for the use of memtables in tests (and live with the layering violation meanwhile). Test: unit (dev) Closes #10120	2022-02-23 09:05:16 +02:00
Avi Kivity	adc08d0ab9	Merge "Drop v1 input support for mutation compactor" from Botond " Currently the mutation compactor supports v1 and v2 input and has a v1 output. The next step is to add a v2 output but this would lead to a full conversion matrix which we want to avoid. So in preparation we drop the v1 input support. Most inputs were already v2, but there were some notable exceptions: tests, the compacting reader and the multishard query code. The former two were a simple mechanical update but the latter required some further work because it turned out the v2 version of evictable reader wasn't used yet and thus it managed to hide some bugs and dropped features. While at it, we migrate all evictable and multishard reader users to the v2 variant of the respective readers and drop the v1 variant completely. With this the road is open to a v2 compactor output and therefore to a v2 sstable writer. Tests: unit(dev, release), dtest(paging_additional_test.py) " * 'compact-mutation-v2-only-input/v5' of https://github.com/denesb/scylla: test/lib/test_utils: return OK from check() variants repair/row_level: use evictable reader v2 db/view/view_updating_consumer: migrate to v2 test/boost/mutation_reader_test: add v2 specific evictable reader tests test: migrate to evictable reader v2 and multishard combining reader v2 compact_mutation: drop support for v1 input test: pass v2 input to mutation_compaction test/boost/mutation_test: simplify test_compaction_data_stream_split test mutation_partition: do_compact(): do drop row tombstones covered by higher order tombstones multishard_mutation_query: migrate to v2 mutation_fragment_v2: range_tombstone_change: add memory_usage() evictable_reader_v2: terminate active range tombstones on reader recreation evictable_reader_v2: restore handling of non-monotonically increasing positions evictable_reader_v2: simplify handling of reader recreation mutation: counter_write_query: use v2 reader mutation: migrate consume() to v2 
mutation_fragment_v2,flat_mutation_reader_v2: mirror v1 concept organization mutation_reader: compacting_reader: require a v2 input reader db/view/view_builder: use v2 reader test/lib/flat_mutation_reader_assertions: adjust has_monotonic_positions() to v2 spec	2022-02-21 14:32:55 +02:00
Botond Dénes	841b982e51	test/lib/test_utils: return OK from check() variants The various require() and check() methods in test_utils.hh were introduced to replace BOOST_REQUIRE() and BOOST_CHECK() respectively in multi-shard concurrent tests, specifically those in tests/boost/multishard_mutation_query_test.cc. This was done literally, just replacing BOOST_REQUIRE() with require() and BOOST_CHECK() with check(). The problem is that check() is missing a feature BOOST_CHECK() had: while BOOST_CHECK() doesn't cause an immediate test failure, just logging an error if the condition fails, it remembers this failure and will fail the test in the end. check() did not have this feature and this caused potential errors to just be logged while the test could still pass fine, causing false-positive test passes. This patch fixes this by returning a [[nodiscard]] bool from the check() methods. The caller can & these together over all calls to check() methods and manually fail the test in the end. We choose this method over a hidden global (like BOOST_CHECK() does) for simplicity's sake.	2022-02-21 12:29:25 +02:00
Botond Dénes	e3c618beba	test: migrate to evictable reader v2 and multishard combining reader v2 All reads are now using the v2 version of these readers, test them instead of the old v1.	2022-02-21 12:29:24 +02:00
Botond Dénes	bba20f5cce	test/lib/flat_mutation_reader_assertions: adjust has_monotonic_positions() to v2 spec The v2 spec allows for non-strictly monotonically increasing positions, but has_monotonic_positions() tried to enforce it. Relax the check so it conforms to the spec.	2022-02-21 12:27:55 +02:00
Benny Halevy	9a308bc496	test: lib: register_compaction: do not allow null table Require to pass the table to be compacted so register_compaction finds the real compaction state rather than making a bogus one. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220220120939.2362590-1-bhalevy@scylladb.com>	2022-02-21 12:06:32 +02:00
Michael Livshin	3fef604075	sstables_manager: add get_local_host_id() method and support Since ME sstable format includes originating host id in stats metadata, local host id needs to be made available for writing and validation. Both Scylla server (where local host id comes from the `system.local` table) and unit tests (where it is fabricated) must be accommodated. Regardless of how the host id is obtained, it is stored in the db config instance and accessed through `sstables_manager`. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-16 18:21:24 +02:00
Michael Livshin	0895188851	sstables_manager: formalize inheritability The class is already inherited from in tests (along with overriding a non-virtual method), so this seems to be called for. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-16 18:21:24 +02:00
Michael Livshin	0b1447c702	add "sstable_format" config Initialize it to "md" until ME format support is complete (i.e. storing originating host id in sstable stats metadata is implemented), so at present there is no observable change by default. Also declare "enable_sstables_md_format" unused -- the idea, going forward, being that only "sstable_format" controls the written sstable file format and that no more per-format enablement config options shall be added. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-16 18:21:24 +02:00
Avi Kivity	6572b297a2	treewide: clean up stray license blurbs After the mechanical change in `fcb8d040e8` ("treewide: use Software Package Data Exchange (SPDX) license identifiers"), a few stray license blurbs or fragments thereof remain. In two cases these were extra blurbs in code generators intended for the generated code, in others they were just missed by the script. Clean them up, adding an SPDX license identifier where needed. Closes #10072	2022-02-13 14:16:16 +02:00
Piotr Dulikowski	ffd439d908	cql_test_env: optimize handling result_message::exception The single_node_cql_env uses query_processor::execute_xyz family of methods to perform operations. Due to previous commits in this series, they allocate one more task than before - a continuation that converts result_message::exception into an exceptional future. We can recover that one task by using variants of those methods which do not perform a conversion, and turn .finally() invocations into .then()s which perform conversion manually.	2022-02-08 11:08:42 +01:00
Michał Sala	66a93d3000	cql3: query_processor: add forward_service reference to query_processor	2022-02-01 21:14:41 +01:00
Kamil Braun	a664ac7ba5	treewide: require `group0_guard` when performing schema changes `announce` now takes a `group0_guard` by value. `group0_guard` can only be obtained through `migration_manager::start_group0_operation` and moved, it cannot be constructed outside `migration_manager`. The guard will be a method of ensuring linearizability for group 0 operations.	2022-01-24 15:20:35 +01:00
Kamil Braun	283ac7fefe	treewide: pass mutation timestamp from call sites into `migration_manager::prepare_*` functions The functions which prepare schema change mutations (such as `prepare_new_column_family_announcement`) would use internally generated timestamps for these mutations. When schema changes are managed by group 0 we want to ensure that timestamps of mutations applied through Raft are monotonic. We will generate these timestamps at call sites and pass them into the `prepare_` functions. This commit prepares the APIs.	2022-01-24 15:12:50 +01:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes were applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Botond Dénes	518abe7415	test/lib/mutation_diff: force textual conversion If the compared mutations have binary keys, `colordiff` will declare the file as binary and will refuse to compare them, beyond a very unhelpful "binary files differ" summary. Add "-a" to the command line to force treating all files as text. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20220117131347.106585-1-bdenes@scylladb.com>	2022-01-17 15:27:53 +02:00
Avi Kivity	fec0c09756	Merge "Convert scrub and validation to v2" from Botond " As a prerequisite the mutation fragment stream validator is converted to v2 as well (but it still supports v1). We get one step closer to eliminate conversions altogether from compaction.cc. Tests: unit(dev) " * 'scrub-v2/v1' of https://github.com/denesb/scylla: mutation_writer: remove v1 version segregate_by_partition() compaction/compaction: remove v1 version of validate and scrub reader factory methods tools/scylla-sstable: migrate to v2 test/boost/sstable_compaction_test: migrate validation tests to v2 test/boost/sstable_compaction_test: migrate scrub tests to v2 test/lib/simple_schema: add v2 of make_row() and make_static_row() compaction: use v2 version of mutation_writer::segregate_by_partition() mutation_writer: add v2 version of segregate_by_partition() compaction: migrate scrub and validate to v2 mutation_fragment_stream_validator: migrate validator to v2	2022-01-16 18:25:07 +02:00
Botond Dénes	da0c5adcc3	test/lib/simple_schema: add v2 of make_row() and make_static_row()	2022-01-14 08:54:26 +02:00
Gleb Natapov	100b44f5ff	test: use new schema announcement api in cql_test_env.cc	2022-01-13 23:09:02 +02:00
Avi Kivity	4392c20bd3	replica: move distributed_loader into replica module distributed_loader is replica-side thing, so it belongs in the replica module ("distributed" refers to its ability to load sstables in their correct shards). So move it to the replica module.	2022-01-10 15:25:28 +02:00
Avi Kivity	bbad8f4677	replica: move ::database, ::keyspace, and ::table to replica namespace Move replica-oriented classes to the replica namespace. The main classes moved are ::database, ::keyspace, and ::table, but a few ancillary classes are also moved. There are certainly classes that should be moved but aren't (like distributed_loader) but we have to start somewhere. References are adjusted treewide. In many cases, it is obvious that a call site should not access the replica (but the data_dictionary instead), but that is left for separate work. scylla-gdb.py is adjusted to look for both the new and old names.	2022-01-07 12:04:38 +02:00
Avi Kivity	ae3a360725	database: Move database, keyspace, table classes to replica/ directory The database, keyspace, and table classes represent the replica-only part of the objects after which they are named. Reading from a table doesn't give you the full data, just the replica's view, and it is not consistent since reconciliation is applied on the coordinator. As a first step in acknowledging this, move the related files to a replica/ subdirectory.	2022-01-06 17:07:30 +02:00
Asias He	a8ad385ecd	repair: Get rid of the gc_grace_seconds The gc_grace_seconds is a very fragile and broken design inherited from Cassandra. Deleted data can be resurrected if cluster wide repair is not performed within gc_grace_seconds. This design pushes the job of keeping the database consistent to the user. In practice, it is very hard to guarantee repair is performed within gc_grace_seconds all the time. For example, repair workload has the lowest priority in the system which can be slowed down by the higher priority workload, so that there is no guarantee when a repair can finish. A gc_grace_seconds value that used to work might not work after data volume grows in a cluster. Users might want to avoid running repair during a specific period where latency is the top priority for their business. To solve this problem, an automatic mechanism to protect against data resurrection is proposed and implemented. The main idea is to remove the tombstone only after the range that covers the tombstone is repaired. In this patch, a new table option tombstone_gc is added. The option is used to configure tombstone gc mode. For example: 1) GC a tombstone after gc_grace_seconds cqlsh> ALTER TABLE ks.cf WITH tombstone_gc = {'mode':'timeout'} ; This is the default mode. If no tombstone_gc option is specified by the user, the old gc_grace_seconds based gc will be used. 2) Never GC a tombstone cqlsh> ALTER TABLE ks.cf WITH tombstone_gc = {'mode':'disabled'}; 3) GC a tombstone immediately cqlsh> ALTER TABLE ks.cf WITH tombstone_gc = {'mode':'immediate'}; 4) GC a tombstone after repair cqlsh> ALTER TABLE ks.cf WITH tombstone_gc = {'mode':'repair'}; In addition to the 'mode' option, another option 'propagation_delay_in_seconds' is added. It defines the max time a write could possibly delay before it eventually arrives at a node. A new gossip feature TOMBSTONE_GC_OPTIONS is added. The new tombstone_gc option can only be used after the whole cluster supports the new feature. 
A mixed cluster works with no problem. Tests: compaction_test.py, ninja test Fixes #3560 [avi: resolve conflicts vs data_dictionary]	2022-01-04 19:48:14 +02:00
Avi Kivity	9e74556413	Merge 'Support reverse reads in the row cache natively' from Tomasz Grabiec This change makes row cache support reverse reads natively so that reversing wrappers are not needed when reading from cache and thus the read can be executed efficiently, with similar cost as the forward-order read. The database is serving reverse reads from cache by default after this. Before, it was bypassing cache by default after `703aed3277`. Refs: #1413 Tests: - unit [dev] - manual query with build/dev/scylla and cache tracing on Closes #9454 * github.com:scylladb/scylla: tests: row_cache: Extend test_concurrent_reads_and_eviction to run reverse queries row_cache: partition_snapshot_row_cursor: Print more details about the current version vector row_cache: Improve trace-level logging config: Use cache for reversed reads by default config: Adjust reversed_reads_auto_bypass_cache description row_cache: Support reverse reads natively mvcc: partition_snapshot: Support slicing range tombstones in reverse test: flat_mutation_reader_assertions: Consume expected range tombstones before end_of_partition row_cache: Log produced range tombstones test: Make produces_range_tombstone() report ck_ranges tests: lib: random_mutation_generator: Extract make_random_range_tombstone() partition_snapshot_row_cursor: Support reverse iteration utils: immutable-collection: Make movable intrusive_btree: Make default-initialized iterator cast to false	2021-12-29 16:53:25 +02:00
Pavel Emelyanov	7a15f1c402	batch_\|modification_statement: Make get_mutations accept query_processor This completes the batch_ and modification_statement rework. Also touch the private batch_statement::read_command while at it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 10:54:28 +03:00
Benny Halevy	2f2e3b2e84	test: lib: index_reader_assertions: close reader before it is destroyed Otherwise, it may trip an assertion when the underlying file is closed, as seen in e.g.: https://jenkins.scylladb.com/view/master/job/scylla-master/job/next/4318/artifact/testlog/x86_64_release/sstable_3_x_test.test_read_rows_only_index.4174.log ``` test/boost/sstable_3_x_test.cc(0): Entering test case "test_read_rows_only_index" sstable_3_x_test: ./seastar/src/core/fstream.cc:205: virtual seastar::file_data_source_impl::~file_data_source_impl(): Assertion `_reads_in_progress == 0' failed. Aborting on shard 0. Backtrace: 0x22557e8 0x2286842 0x7f2799e99a1f /lib64/libc.so.6+0x3d2a1 /lib64/libc.so.6+0x268a3 /lib64/libc.so.6+0x26788 /lib64/libc.so.6+0x35a15 0x222c53d 0x222c548 0xb929cc 0xc0b23b 0xa84bbf 0x24d0111 ``` Decoded: ``` __GI___assert_fail at :? ~file_data_source_impl at ./build/release/seastar/./seastar/src/core/fstream.cc:205 ~file_data_source_impl at ./build/release/seastar/./seastar/src/core/fstream.cc:202 std::default_delete<seastar::data_source_impl>::operator()(seastar::data_source_impl) const at /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/unique_ptr.h:85 (inlined by) ~unique_ptr at /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/unique_ptr.h:361 (inlined by) ~data_source at ././seastar/include/seastar/core/iostream.hh:55 (inlined by) ~input_stream at ././seastar/include/seastar/core/iostream.hh:254 (inlined by) ~continuous_data_consumer at ././sstables/consumer.hh:484 (inlined by) ~index_consume_entry_context at ././sstables/index_reader.hh:116 (inlined by) std::default_delete<sstables::index_consume_entry_context<sstables::index_consumer> >::operator()(sstables::index_consume_entry_context<sstables::index_consumer>) const at /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/unique_ptr.h:85 (inlined by) ~unique_ptr at 
/usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/unique_ptr.h:361 (inlined by) ~index_bound at ././sstables/index_reader.hh:395 (inlined by) ~index_reader at ././sstables/index_reader.hh:435 std::default_delete<sstables::index_reader>::operator()(sstables::index_reader*) const at /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/unique_ptr.h:85 (inlined by) ~unique_ptr at /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/unique_ptr.h:361 (inlined by) ~index_reader_assertions at ././test/lib/index_reader_assertions.hh:31 (inlined by) operator() at ./test/boost/sstable_3_x_test.cc:4630 ``` Test: unit(dev), sstable_3_x_test.test_read_rows_only_index(release X 10000) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211222132858.2155227-1-bhalevy@scylladb.com>	2021-12-22 15:33:22 +02:00
Raphael S. Carvalho	e1e8e020fe	tests: Allow memtable to be flushed through column_family_for_tests Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211217160055.96693-1-raphaelsc@scylladb.com>	2021-12-21 07:21:26 +02:00
Botond Dénes	55bb70a878	Merge "Make sure TWCS per-window major includes all files" from Raphael " TWCS performs STCS on a window as long as it's the most recent one. From there on, TWCS will compact all files in the past window into a single file. With some moderate write load, it could happen that there's still some compaction activity in that past window, meaning that per-window major may miss some files being currently compacted. As a result, a past window may contain more than 1 file after all compaction activity is done on its behalf, which may increase read amplification. To avoid that, TWCS will now make sure that per-window major is serialized, to make sure no files are missed. Fixes #9553. tests: unit(dev). " * 'fix_twcs_per_window_major_v3' of https://github.com/raphaelsc/scylla: TWCS: Make sure major on past window is done on all its sstables TWCS: remove needless param for STCS options TWCS: kill unused param in newest_bucket() compaction: Implement strategy control and wire it compaction: Add interface to control strategy behavior.	2021-12-20 17:12:50 +02:00
Avi Kivity	e772fcbd57	Merge "Convert combined reader to v2" from Botond " Users are adjusted by sprinkling `upgrade_to_v2()` and `downgrade_to_v1()` where necessary (or removing any of these where possible). No attempt was made to optimize and reduce the amount of v1<->v2 conversions. This is left for follow-up patches to keep this set small. The combined reader is composed of 3 layers: 1. fragment producer - pop fragments from readers, return them in batches (each fragment in a batch having the same type and pos). 2. fragment merger - merge fragment batches into single fragments 3. reader implementation glue-code Converting layers (1) and (3) was mostly mechanical. The logic of merging range tombstone changes is implemented at layer (2), so the two different producer (layer 1) implementations we have share this logic. Tests: unit(dev) " * 'combined-reader-v2/v4' of https://github.com/denesb/scylla: test/boost/mutation_reader_test: add test_combined_reader_range_tombstone_change_merging mutation_reader: convert make_clustering_combined_reader() to v2 mutation_reader: convert position_reader_queue to v2 mutation_reader: convert make_combined_reader() overloads to v2 mutation_reader: combined_reader: convert reader_selector to v2 mutation_reader: convert combined reader to v2 mutation_reader: combined_reader: attach stream_id to mutation_fragments flat_mutation_reader_v2: add v2 version of empty reader test/boost/mutation_reader_test: clustering_combined_reader_mutation_source_test: fix end bound calculation	2021-12-20 14:01:03 +02:00
Botond Dénes	2364144b19	mutation_reader: convert position_reader_queue to v2 By removing the converting (v1->v2) constructor of `reader_and_upper_bound` and adjusting its users.	2021-12-20 09:29:05 +02:00
Botond Dénes	aeddcf50a1	mutation_reader: convert make_combined_reader() overloads to v2 Just sprinkle the right amount of downgrade_to_v1() and upgrade_to_v2() calls at call sites; no attempt at optimization was made.	2021-12-20 09:29:05 +02:00
Tomasz Grabiec	87c921dff5	test: flat_mutation_reader_assertions: Consume expected range tombstones before end_of_partition There may be unconsumed but expected fragments in the stream at the time of the call to produces_partition_end(). Call check_rts() sooner to avoid failures.	2021-12-19 22:41:35 +01:00
Tomasz Grabiec	5f45d45c55	test: Make produces_range_tombstone() report ck_ranges	2021-12-19 22:41:35 +01:00
Tomasz Grabiec	26ed0081a4	tests: lib: random_mutation_generator: Extract make_random_range_tombstone()	2021-12-19 22:41:35 +01:00
Avi Kivity	a97731a7e5	migration_manager: replace uses of get_storage_proxy and get_local_storage_proxy with constructor-provided reference A static helper also gained a storage_proxy parameter.	2021-12-16 21:05:47 +02:00
Avi Kivity	d768e9fac5	cql3, related: switch to data_dictionary Stop using database (and including database.hh) for schema related purposes and use data_dictionary instead. data_dictionary::database::real_database() is called from several places, for these reasons: - calling yet-to-be-converted code - callers with a legitimate need to access data (e.g. system_keyspace) but with the ::database accessor removed from query_processor. We'll need to find another way to supply system_keyspace with data access. - to gain access to the wasm engine for testing whether user-defined functions compile. We'll have to find another way to do this as well. The change is a straightforward replacement. One case in modification_statement had to change a capture, but everything else was just a search-and-replace. Some files that lost "database.hh" gained "mutation.hh", which they previously had access to through "database.hh".	2021-12-15 13:54:23 +02:00
Avi Kivity	399e2895f1	test: cql_test_env: provide access to data_dictionary Allow tests to have access to the data_dictionary.	2021-12-15 13:54:18 +02:00
Avi Kivity	3ac622bdd8	Merge "Add v2 versions of make_forwardable() and make_flat_mutation_reader_from_fragments()" from Botond " These two readers are crucial for writing tests for any composable reader so we need v2 versions of them before we can convert and test the combined reader (for example). As these two readers are often used in situations where the payload they deliver is specially crafted for the test at hand, we keep their v1 versions too to avoid conversion meddling with the tests. Tests: unit(dev) " * 'forwarding-and-fragment-reader-v2/v1' of https://github.com/denesb/scylla: flat_mutation_reader_v2: add make_flat_mutation_reader_from_fragments() test/lib/mutation_source_test: don't force v1 reader in reverse run mutation_source: add native_version() getter flat_mutation_reader_v2: add make_forwardable() position_in_partition: add after_key(position_in_partition_view) flat_mutation_reader: make_forwardable(): fix indentation flat_mutation_reader: make_forwardable(): coroutinize reader	2021-12-14 20:43:09 +02:00
Raphael S. Carvalho	49f40c8791	compaction: Implement strategy control and wire it This implements strategy control interface for both manager and tests, and wire it. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2021-12-13 16:05:23 -03:00
Gleb Natapov	e9fafea5c1	migration_manager: pass raft_gr to the migration manager Migration manager will use raft group zero to distribute schema changes.	2021-12-11 12:31:07 +02:00
Botond Dénes	20e45987b5	test/lib/mutation_source_test: don't force v1 reader in reverse run Currently in the reverse run we wrap the test-provided mutation-source and create a v1 reader with it, forcing a conversion if the mutation-source has a v2 factory. Worse still, if the test is v2 native, there will be a double conversion. This patch fixes this by creating a wrapper mutation-source appropriate to the version of the underlying factory of the wrapped mutation-source.	2021-12-10 15:48:49 +02:00
Avi Kivity	79bcdc104e	Merge "Fix stateful multi-range scans" from Botond " Currently stateful (readers being saved and resumed on page boundaries) multi-range scans are broken in multiple ways. Trying to use them can result in anything from use-after-free (#6716) to getting corrupt data (#9718). Luckily no-one is doing such queries today, but this started to change recently as code such as Alternator TTL and distributed aggregate reads started using this. This series fixes both problems and adds a unit test too exercising this previously completely unused code-path. Fixes: #6716 Fixes: #9718 Tests: unit(dev, release, debug) " * 'fix-stateful-multi-range-scans/v1' of https://github.com/denesb/scylla: test/boost/multishard_mutation_query_test: add multi-range test test/boost/multishard_mutation_query_test: add multi-range support multishard_mutation_query: don't drop data during stateful multi-range reads multishard_combining_reader: reader_lifecycle_policy: allow saving read range on fast-forward	2021-12-07 12:19:56 +02:00
Botond Dénes	953603199e	multishard_combining_reader: reader_lifecycle_policy: allow saving read range on fast-forward The reader_lifecycle_policy API was created around the idea of shard readers (optionally) being saved and reused on the next page. To do this, the lifecycle policy has to also be able to control the lifecycle of by-reference parameters of readers: the slice and the range. This was possible from day 1, as the readers are created through the lifecycle policy, which can intercept and replace the said parameters with copies that are created in stable storage. There was one hole in the design though: fast-forwarding, which can change the range of the read, without the lifecycle policy knowing about this. In practice this results in fast-forwarded readers being saved together with the wrong range, their range reference becoming stale. The only lifecycle implementation prone to this is the one in `multishard_mutation_query.cc`, as it is the only one actually saving readers. It will fast-forward its reader when the query happens over multiple ranges. There were no problems related to this so far because no one passes more than one range to said functions, but this is incidental. This patch solves this by adding an `update_read_range()` method to the lifecycle policy, allowing the shard reader to update the read range when being fast forwarded. To allow the shard reader to also have control over the lifecycle of this range, a shared pointer is used. This control is required because when an `evictable_reader` is the top-level reader on the shard, it can invoke `create_reader()` with an edited range after `update_read_range()`, replacing the fast-forwarded-to range with a new one, yanking it out from under the feet of the evictable reader itself. By using a shared pointer here, we can ensure the range stays alive while it is the current one.	2021-12-03 10:27:44 +02:00
Avi Kivity	3b82ef854d	Merge "Some compaction manager cleanups" from Raphael " couple of preparatory changes for coroutinization of manager " * 'some_compaction_manager_cleanups_v5' of github.com:raphaelsc/scylla: compaction_manager: move check_for_cleanup into perform_cleanup() compaction_manager: replace get_total_size by one liner compaction_manager: make consistent usage of type and name table compaction_manager: simplify rewrite_sstables() compaction_manager: restore indentation	2021-12-02 19:53:13 +02:00
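
Note on commit 841b982e51 (test/lib/test_utils: return OK from check() variants): the pattern that message describes, a `[[nodiscard]] bool` returned from each check and combined by the caller with `&`, might look roughly like the sketch below. This is only an illustration under assumptions: the `check()` signature and the `run_checks()` helper are hypothetical stand-ins, not the actual declarations in test_utils.hh.

```cpp
#include <iostream>
#include <string_view>

// Hypothetical stand-in for a check() helper: logs a failed condition but
// does not abort, and reports the outcome to the caller. [[nodiscard]] makes
// the compiler warn if the result is silently dropped.
[[nodiscard]] bool check(bool condition, std::string_view what) {
    if (!condition) {
        std::cerr << "check failed: " << what << '\n';
    }
    return condition;
}

// Hypothetical test body: every check runs (no short-circuiting), and the
// combined result records whether any of them failed.
bool run_checks(int produced_rows, int expected_rows) {
    bool ok = true;
    ok &= check(produced_rows >= 0, "row count is non-negative");
    ok &= check(produced_rows == expected_rows, "row count matches");
    return ok;
}
```

A caller would then fail the test once at the end if `run_checks()` returned false, which is the "manually fail the test in the end" step from the commit message; using `&=` rather than `&&` keeps later checks running (and logging) after an earlier one has failed.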

493 Commits