scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 15:03:06 +00:00

Author	SHA1	Message	Date
Benny Halevy	a126160d7e	frozen_mutation: move unfreeze_gently to async_utils Unfreeze_gently doesn't have to be a method of frozen_mutation. It might as well be implemented as a free function reading from a frozen_mutation and preparing a mutation gently. The logic will be used in a later patch to make a canonical mutation directly from a frozen_mutation instead of unfreezing it and then converting it to a canonical_mutation. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-05-02 19:27:56 +03:00
Benny Halevy	aa27ef8811	mutation: add freeze_gently Allow yielding in between serializing of range tombstones and rows to prevent reactor stalls due to large mutations with many rows or range tombstones. mutations that have many cells might still stall but those are considered infrequent enough to ignore for now. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-05-02 19:27:56 +03:00
Benny Halevy	c485ed6287	canonical_mutation: add to_mutation_gently to_mutation_gently generates mutation from canonical_mutation asynchronously using the newly introduced mutation_partition accept_gently method. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-05-02 19:27:54 +03:00
Benny Halevy	e1411f3911	mutation_partition: add apply_gently To be used for freezing mutations or making canonical mutations gently. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-05-02 18:45:24 +03:00
Benny Halevy	15e8ecb670	mutation_partition: apply_monotonically: do not support schema upgrade Currently, if the input mutation_partition requires schema upgrade, apply_monotonically always silently reverts to being non-preemptible, even if the caller passed is_preemptible::yes. To prevent that from happening, put the burden of upgrading the mutation_partition schem on the caller, which is today the apply() methods, which are synchronous anyhow. With that, we reduce the proliferation of the `apply_monotonically` overloads and keep only the low level one (which could potentially be private as well, as it's called only from within the mutation/ source files and from tests) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-05-02 18:42:41 +03:00
Benny Halevy	e5ca65f78b	test/perf: report also log_allocations/op Currently perf-simple-query --write ignores log allocations that happen on the memtable apply path. This change adds tracking and accounting of the number of log allocation, and reporting of thereof. For reference, here's the output of build/release/scylla perf-simple-query --write --default-log-level=error --random-seed=1 -c 1 ``` random-seed=1 enable-cache=1 Running test with config: {partitions=10000, concurrency=100, mode=write, frontend=cql, query_single_key=no, counters=no} Disabling auto compaction 78073.55 tps ( 59.4 allocs/op, 16.3 logallocs/op, 14.3 tasks/op, 52991 insns/op, 0 errors) 77263.59 tps ( 59.3 allocs/op, 16.0 logallocs/op, 14.3 tasks/op, 53282 insns/op, 0 errors) 79913.07 tps ( 59.3 allocs/op, 16.0 logallocs/op, 14.3 tasks/op, 53295 insns/op, 0 errors) 79554.32 tps ( 59.3 allocs/op, 16.0 logallocs/op, 14.3 tasks/op, 53284 insns/op, 0 errors) 79151.53 tps ( 59.3 allocs/op, 16.0 logallocs/op, 14.3 tasks/op, 53289 insns/op, 0 errors) median 79151.53 tps ( 59.3 allocs/op, 16.0 logallocs/op, 14.3 tasks/op, 53289 insns/op, 0 errors) median absolute deviation: 761.54 maximum: 79913.07 minimum: 77263.59 ``` Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-05-02 18:42:41 +03:00
Patryk Jędrzejczak	b8e3bf4b09	topology_coordinator: clear obsolete generations earlier We want to clear CDC generations that are no longer needed (because all writes are already using a new generation) so they don't take space and are not sent during snapshot transfers (see e.g. scylladb/scylladb#17545). The condition used previously was that we clear generations which were closed (i.e., a new generation started at this time) more than 24h ago. This is a safe choice, but too conservative: we could easily end up with a large number of obsolete generations if we boot multiple nodes during 24h (which is especially easy to do with tablets.) Change this bound from 24h to `5s + ring_delay`. The choice is explained in a comment in the code. Also, prevent `test_cdc_generation_clearing` from being flaky by firing the `increase_cdc_generation_leeway` error injection on the server being the topology coordinator. Ref: scylladb/scylladb#17545	2024-05-02 12:46:33 +02:00
Patryk Jędrzejczak	f61c50baa4	test: test_raft_snapshot_request: improve the last assertion The last assertion in the test is very sensitive to changes. The constant has already been increased from 0 to 1 due to flakiness. The old comment explains it. In the following patch, we change the CDC generation publisher so that it clears the obsolete CDC generations earlier. This change would make this assertion flaky again. After restarting the servers, the new topology coordinator could remove the first generation if it became obsolete. This operation appends a new entry to the log. If it happened after triggering snapshot, the assertion could fail with `2 <= 1`. We could increase the constant again to unflake the test, but we better improve it once and for all. We change the assertion so that it's not sensitive to changes in the code based on Raft. The explanation is in the new comment.	2024-05-02 12:46:33 +02:00
Patryk Jędrzejczak	44791a849e	test: test_raft_snapshot_request: find raft leader after restart Finding the new Raft leader after restart simplifies the test and makes it easier to reason about. There are two improvements: - we only need to wait until the leader appends a command, so the read barrier becomes unnecessary, - we only need to trigger snapshot on the leader. We also use the knowledge about the leader in the following patch.	2024-05-02 12:46:33 +02:00
Patryk Jędrzejczak	41198998c5	test: test_raft_shanpshot_request: simplify appended_command We shorten the code and remove the unused `log_size` variable.	2024-05-02 12:46:31 +02:00
Ferenc Szili	b06af5b2b9	sstable: write dead_rows count to system.large_partitions	2024-05-02 11:49:10 +02:00
Nadav Har'El	5558143014	test/alternator: test addressing LSI using REST API The name of the Scylla table backing an Alternator LSI looks like basename:!lsiname. Some REST API clients (including Scylla Manager) when they send a "!" character in the REST API request may decide to "URL encode" it - convert it to %21. Because of a Seastar bug (https://github.com/scylladb/seastar/issues/725) Scylla's REST API server forgets to do the URL decoding, which leads to the REST API request failing to address the LSI table. This patch introduces a test for this bug, which fails without the Seastar issue being fixed, and passes afterwards (i.e., after the previous patch that starts to use the new, fixed, Seastar API). The test creates an LSI, uses the REST API to find its name and then tries to call some REST API ("compaction_strategy") on this table name, after deliberately URL-encoding it. Refs #5883. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-05-02 12:33:54 +03:00
Pavel Emelyanov	67736b5cd3	Reapply "Merge 'Drain view_builder in generic drain' from ScyllaDB" This reverts commit `9c2a836607`.	2024-05-02 08:16:14 +03:00
Nadav Har'El	4e78e2d506	test/cql-pytest, cdc: add test for what happens when log name is taken In our CDC implementation, the CDC log table for table "xyz" is always called "xyz_scylla_cdc_log". If this table name is taken, and the user tries to create a table "xyz" with CDC enabled - or enable CDC on the table "xyz", the creation/enabling should fail gracefully, with a clear error message. This test verifies this. The new test passes - the code is already correct. I just wanted to verify that it is (and to prevent future regressions). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#18485	2024-05-01 14:46:19 +03:00
Kefu Chai	bd0d246b57	tools/scylla-nodetool: implement the resetlocalschema command Fixes #18468 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18470	2024-05-01 08:49:11 +03:00
Raphael S. Carvalho	b980634ff2	test: Verify tablet cleanup is properly retried on failure Doesn't test only coordinator ability to retry on failure, but also that replica will be able to properly continue cleanup of a storage group from where it left off (when failure happened), not leave any sstables behind. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#18426	2024-04-30 19:27:17 +02:00
Avi Kivity	ea15ddc7dc	Merge 'Fix population of non-normal sstables from registry' from Pavel Emelyanov On boot sstables are populated from normal location as well as from quarantine and staging. It turned out that sstables listed in registry (S3-backed ones) are not populated from non-normal states. Closes scylladb/scylladb#18439 * github.com:scylladb/scylladb: test: Add test for how quarantined sstables registry entries are loaded sstable_directory: Use sstable location to initialize registry lister	2024-04-30 18:10:11 +03:00
Avi Kivity	329b135b5e	Merge 'chunked_vector: fix use after free in emplace back' from Benny Halevy Currently, push_back or emplace_back reallocate the last chunk before constructing the new element. If the arg passed to push_back/emplace_back is a reference to an existing element in the vector, reallocating the last chunk will invalidate the arg reference before it is used. This patch changes the order when reallocating the last chunk in reserve_for_emplace_back: First, a new chunk_ptr is allocated. Then, the back_element is emplaced in the newly allocated array. And only then, existing elements in the current last chunk are migrated to the new chunk. Eventually, the new chunk replaces the existing chunk. If no reservation is requried, the back element is emplaced "in place" in the current last chunk. Fixes scylladb/scylladb#18072 Closes scylladb/scylladb#18073 * github.com:scylladb/scylladb: test: chunked_managed_vector_test: add test_push_back_using_existing_element utils: chunked_vector: reserve_for_emplace_back: emplace before migrating existing elements utils: chunked_vector: push_back: call emplace_back utils: chunked_vector: define min_chunk_capacity utils: chunked*vector: use std::clamp	2024-04-30 18:09:04 +03:00
Gleb Natapov	f2b0a5e9e1	storage_service: do not take API lock for removenode operation if topology coordinator is enabled Topology coordinator serialize operations internally, so there is no need to have an external lock. Fixes: scylladb/scylladb#17681	2024-04-30 15:13:50 +03:00
Gleb Natapov	0a7101923c	test: return file mark from wait_for that points after the found string Returning file mark allows to start searching from the point where the previous string was found.	2024-04-30 15:06:32 +03:00
Botond Dénes	0ace90ad04	test: add test for cleaning up cached querier on tablet migration Check that a cached querier, which exists prior to a migration, will be cleaned up afterwards. This reproduces #18110. The test fails before the fix for the above and passes afterwards.	2024-04-30 01:47:16 -04:00
Botond Dénes	3c813fbb99	reader_concurrency_semaphore: add range param to evict_inactive_reads_for_table() When the new optional parameter has a value, evict only inactive reads, whose ranges overlap with the provided range. The range for the inactive read is provided in `register_inactive_read()`. If the inactive read has no range, ovarlap is assumed and the read is evicted. This will be used to evict all inactive reads that could potentially use a cleaned-up tablet.	2024-04-30 01:31:08 -04:00
Piotr Dulikowski	35f456c483	Merge 'Extend `ALTER TABLE ... DROP` to allow specifying timestamp of column drop' from Michał Jadwiszczak In order to correctly restore schema from `DESC SCHEMA WITH INTERNALS`, we need a way to drop a column with a timestamp in the past. Example: - table t(a int pk, b int) - insert some data1 - drop column b - add column b int - insert some data2 If the sstables weren't compacted, after restoring the schema from description: - we will loss column b in data2 if we simply do `ALTER TABLE t DROP b` and `ALTER TABLE t ADD b int` - we will resurrect column b in data1 if we skip dropping and re-adding the column Test for this: https://github.com/scylladb/scylla-dtest/pull/4122 Fixes #16482 Closes scylladb/scylladb#18115 * github.com:scylladb/scylladb: docs/cql: update ALTER TABLE docs test/cqlpytest: add test for prepared `ALTER TABLE ... DROP ... USING TIMESTAMP ?` test/cql-pytest: remove `xfail` from alter table with timestamp tests cql3/statements: extend `ALTER TABLE ... DROP` to allow specifying timestamp of column drop cql3/statements: pass `query_options` to `prepare_schema_mutations()` cql3/statements: add bound terms to alter table statement cql3/statements: split alter_table_statement into raw and prepared schema: allow to specify timestamp of dropped column	2024-04-29 14:05:05 +02:00
Piotr Dulikowski	dec652de9e	test: topology: test that upgrade succeeds after recent removal Adds a regression test for scylladb/scylladb#18198 - start a two node cluster in legacy topology mode, use nodetool removenode on one of the nodes, upgrade the remaining 1-node cluster and observe that it succeeds.	2024-04-29 13:33:40 +02:00
Pavel Emelyanov	5e23493d25	test: Add test for how quarantined sstables registry entries are loaded Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-26 16:54:43 +03:00
Kamil Braun	d8313dda43	Merge 'db: config: move consistent-topology-changes out of experimental and make it the default for new clusters' from Patryk Jędrzejczak We move consistent cluster management out of experimental and make it the default for new clusters in 6.0. In code, we make the `consistent-topology-changes` flag unused and assumed to be true. In 6.0, the topology upgrade procedure will be manual and voluntary, so some clusters will still be using the gossip-based topology even though they support the raft-based topology. Therefore, we need to continue testing the gossip-based topology. This is possible by using the `force-gossip-topology-changes` flag introduced in scylladb/scylladb#18284. Ref scylladb/scylladb#17802 Closes scylladb/scylladb#18285 * github.com:scylladb/scylladb: docs: raft.rst: update after removing consistent-topology-changes treewide: fix indentation after the previous patch db: config: make consistent-topology-changes unused test: lib: single_node_cql_env: restart a node in noninitial run_in_thread calls test: test_read_required_hosts: run with force-gossip-topology-changes storage_service: join_cluster: replace force_gossip_based_join with force-gossip-topology-changes storage_service: join_token_ring: fix finish_setup_after_join calls	2024-04-26 14:45:29 +02:00
Patryk Jędrzejczak	3a100cd16c	test: test_raft_recovery_stuck: ensure raft upgrade procedure failed We have log browsing in test.py now, so we can fix this TODO easily. Closes scylladb/scylladb#18425	2024-04-26 10:16:49 +02:00
Botond Dénes	d566eec89a	Merge 'treewide: remove {dclocal_,}read_repair_chance options' from Kefu Chai dclocal_read_repair_chance and read_repair_chance have been removed in Cassandra 3.11 and 4.x, see https://issues.apache.org/jira/browse/CASSANDRA-13910. if we expose these properties via DDL, Cassandra would fail to consume the CQL statement creating the table when performing migration from Scylla to Cassandra 4.x, as the latter does not understand these properties anymore. currently the default values of `dc_local_read_repair_chance` and `read_repair_chance` are both "0". so they are practically disabled, unless user deliberately set them to a value greater than 0. also, as a side effect, Cassandra 4.x has better support of Python3. the cqlsh shipped along with Cassandra 3.11.16 only supports python2.7, see https://github.com/apache/cassandra/blob/cassandra-3.11.16/bin/cqlsh.py it errors out if the system only provides python3 with the error of ``` No appropriate python interpreter found. ``` but modern linux systems do not provide python2 anymore. so, in this change, we deprecate these two options. Fixes #3502 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18087 * github.com:scylladb/scylladb: docs: drop documents related to {,dclocal_}read_repair_chance treewide: remove {dclocal_,}read_repair_chance options	2024-04-26 10:48:47 +03:00
Michał Jadwiszczak	7cbce78480	test/cqlpytest: add test for prepared `ALTER TABLE ... DROP ... USING TIMESTAMP ?`	2024-04-26 07:01:02 +02:00
Michał Jadwiszczak	27a4331dcd	test/cql-pytest: remove `xfail` from alter table with timestamp tests Previous patch introduced `ALTER TABLE ... DROP .. USING TIMESTAM ...` so those test should no longer fail. Refs #9929	2024-04-25 21:27:40 +02:00
Avi Kivity	c2b8ca7d71	Merge 'cql3: statements: change default tombstone_gc mode for tablets' from Aleksandra Martyniuk Repair may miss some tablets that migrated across nodes. So if tombstones expire after some timeout, then we can have data resurrection. Set default tombstone_gc mode to "repair" for tables which use tablets (if repair is required). Fixes: #16627. Closes scylladb/scylladb#18013 * github.com:scylladb/scylladb: test: check default value of tombstone_gc test: topology: move some functions to util.py cql3: statements: change default tombstone_gc mode for tablets	2024-04-25 19:18:37 +03:00
Lakshmi Narayanan Sreethar	6af2659b57	sstables: reclaim_memory_from_components: do not update _recognised_components When reclaiming memory from bloom filters, do not remove them from _recognised_components, as that leads to the on-disk filter component being left back on disk when the SSTable is deleted. Fixes #18398 Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com> Closes scylladb/scylladb#18400	2024-04-25 19:15:59 +03:00
Marcin Maliszkiewicz	7085339f72	cql3: test: include get_mutations_internal log in test.py We have a concurrent modification conflict in tests and suspect duplicated requests but since we don't log successful requests we have no way to verify if that's the case. get_mutations_internal log will help to tell wchich nodes are trying to push auth or service levels mutations into raft. Refs scylladb/scylladb#18319 Closes scylladb/scylladb#18413	2024-04-25 17:17:53 +03:00
Kefu Chai	e9b31cb4c1	test: locator_topology: s/get0()/get()/ this change addresses the leftover of `9e8805bb49` Refs `9e8805bb49` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18390	2024-04-25 16:03:01 +03:00
Patryk Jędrzejczak	3a34bb18cd	db: config: make consistent-topology-changes unused We make the `consistent-topology-changes` experimental feature unused and assumed to be true in 6.0. We remove code branches that executed if `consistent-topology-changes` was disabled.	2024-04-25 14:33:21 +02:00
Patryk Jędrzejczak	77342ffb34	test: lib: single_node_cql_env: restart a node in noninitial run_in_thread calls In the following commit, we make the `consistent-topology-changes` experimental feature unused. Then, all unit tests in the boost suite will start using the raft-based topology by default. Unfortunately, tests with multiple `single_node_cql_env::run_in_thread` calls (usually coming from the `do_with_cql_env_thread` calls) would fail. In a noninitial `run_in_thread` call, a node is started as if it booted for the first time. On the other hand, it has its persistent state from previous boots. Hence, the node can behave strangely and unexpectedly. In particular, `SYSTEM.TOPOLOGY` is not empty and the assertion that expects it to be empty when we boot for the first time fails. We fix this issue by making noninitial `run_in_thread` calls behave as normal restarts. After this change, `test_schema_digest_does_not_change_with_disabled_features` starts failing. This test copies the data directory before booting for the first time, so the new `_sys_ks.local().build_bootstrap_info().get();` makes the node incorrectly think it restarts. Then, after noticing it is not a part of group 0, the node would start the raft upgrade procedure if we didn't run it in the raft RECOVERY mode. This procedure would get stuck because it depends on messaging being enabled even if the node communicates only with itself and messaging is disabled in boost tests.	2024-04-25 14:33:21 +02:00
Patryk Jędrzejczak	88038d958a	test: test_read_required_hosts: run with force-gossip-topology-changes In one of the following commits, we make the `consistent-topology-changes` experimental feature unused. Then, all unit tests in the boost suite will start using the raft-based topology by default. Unfortunately, some tests would start failing and `test_read_required_hosts` is one of them. `tablet_cql_test_config` in `tablets_test.cc` doesn't use `consistent-topology-changes`, so all test cases in this file run incorrectly wit the gossip-based topology changes. With `consistent-topology-changes`, only `test_read_required_hosts` fails. The failure happens on `auto table2 = add_table(e).get();`: ``` ERROR 2024-04-17 11:14:16,083 [shard 0:main] load_balancer - Replica 9b94d710-fbfb-11ee-9c4f-448617b47e11:0 of tablet 9b94d713-fbfb-11ee-9c4f-448617b47e11:0 not found in topology ``` This test case needs to be investigated and rewritten so that it passes with the raft-based topology. However, we don't want this issue to block the process of making the `consistent-topology-changes` experimental feature unused. We leave a FIXME and we will open a new issue to track it.	2024-04-25 14:33:21 +02:00
Patryk Jędrzejczak	213f2f6882	storage_service: join_cluster: replace force_gossip_based_join with force-gossip-topology-changes The `force_gossip_based_join` error injection does exactly what we expect from `force-gossip-topology-changes` so we can do a simple replacement. We prefer a flag over an error injection because we will use it a lot in CI jobs' configurations, some tests, manual testing etc. It's much more convenient. Moreover, the flag can be used in the release mode, so we re-enable all tests that were disabled in release mode only because of using the `force_gossip_based_join` error injection. The name of the `force-gossip-topology-changes` flag suggests that using it should always succesfully force the gossip-based topology or, if forcing is not possible, the booting should fail. We don't want a node with `force-gossip-topology-changes=true` that silently boots in the raft-topology mode. We achieve it by throwing a runtime error from `join_cluster` in two cases: - the node is restarting in the cluster that is using raft topology - the node is joining the cluster that is using raft topology	2024-04-25 14:33:21 +02:00
Kefu Chai	c323c93fa4	treewide: remove {dclocal_,}read_repair_chance options dclocal_read_repair_chance and read_repair_chance have been removed in Cassandra 3.11 and 4.x, see https://issues.apache.org/jira/browse/CASSANDRA-13910. if we expose the properties via DDL, Cassandra would fails to consume the CQL statement to creating the table when performing migration from Scylla to Cassandra 4.x, as the latter does not understand these properties anymore. currently the default values of `dc_local_read_repair_chance` and `read_repair_chance` are both "0". so this is practically disabled, unless user deliberately set them to a value greater than 0. also, as a side effect, Cassandra 4.x has better support of Python3. the cqlsh shipped along with Cassandra 3.11.16 only supports python2.7, see https://github.com/apache/cassandra/blob/cassandra-3.11.16/bin/cqlsh.py it errors out if the system only provides python3 with the error of ``` No appropriate python interpreter found. ``` but modern linux systems do not provide python2 anymore. so, in this change, we deprecate these two options. Fixes #3502 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-04-25 17:15:27 +08:00
Botond Dénes	ca26899c36	Merge 'sstable: large data handler needs to count range tombstones as rows' from Ferenc Szili When issuing warnings about partitions with the number of rows above a configured threshold, the large partitions handler does not take into consideration the number of range tombstone markers in the total rows count. This fix adds the number of range tombstone markers to the total number of rows and saves this total in system.large_partitions.rows (if it is above the threshold). It also adds a new column range_tombstones to the system.large_partitions table which only contains the number of range tombstone markers for the given partition. This PR fixes the first part of issue #13968 It does not cover distinguishing between live and dead rows. A subsequent PR will handle that. Closes scylladb/scylladb#18346 * github.com:scylladb/scylladb: sstables: add docs changes for system.large_partitions sstable: large data handler needs to count range tombstones as rows	2024-04-25 11:38:30 +03:00
Kamil Braun	3363f6e1e8	Merge 'Fix write failures during node replace with same IP with topology over raft' from Gleb Currently a new node is marked as alive too late, after it is already reported as a pending node. The patch series changes replace procedure to be the same as what node_ops do: first stop reporting the IP of the node that is being replaced as a natural replica for writes, then mark the IP is alive, and only after that report the IP as a pending endpoint. Fixes: scylladb/scylladb#17421 * 'gleb/17421-fix-v2' of github.com:scylladb/scylla-dev: test_replace_reuse_ip: add data plane load sync_raft_topology_nodes: make replace procedure similar to nodeops one storage_service: topology_coordinator: fix indentation after previous patch storage_service: topology coordinator: drop ring check in node_state::replacing state	2024-04-24 17:09:01 +02:00
Petr Gusev	bc98774f83	test_replace_reuse_ip: add data plane load In this commit we enhance test_replace_reuse_ip to reproduce #17421. We create a test table and run insert queries on it while the first node is being replaced. In this form the test fails without the fix from the previous commit. Some insert requests fail with [Unavailable exception] "Cannot achieve consistency level for cl QUORUM...".	2024-04-24 16:59:24 +03:00
Aleksandra Martyniuk	06f6aaf2cf	test: check default value of tombstone_gc Add a test which checks whether default tombstone_gc value is properly set and if it does not override previous setting.	2024-04-24 10:57:51 +02:00
Aleksandra Martyniuk	e0d498716a	test: topology: move some functions to util.py Move functions marked with asynccontextmanager from test/topology/test_mv.py to test/topology/util.py so that they can be used in other tests.	2024-04-24 10:57:51 +02:00
Aleksandra Martyniuk	58f72f9019	cql3: statements: change default tombstone_gc mode for tablets Currently, if tombstone_gc mode isn't specified for a table, then "timeout" is used by default. With tablets, running "nodetool repair -pr" may miss a tablet if it migrated across the nodes. Then, if we expire tombstones for ranges that weren't repaired, we may get data resurrection. Set default tombstone_gc mode value for DDLs that don't specify it. It's set to "repair" for tables which use tablets unless they use local replication strategy or rf = 1. Otherwise it's set to "timeout".	2024-04-24 10:42:10 +02:00
Kamil Braun	8876b9b0ef	test/pylib: random_tables: use IF NOT EXISTS when creating keyspace Due to Python driver's unexpected behavior, "CREATE KEYSPACE" statement may sometimes get executed twice (scylladb/python-driver#317), leading to "Keyspace ... already exists" error in our tests (scylladb/scylladb#17654). Work around this by using "IF NOT EXISTS". Fixes: scylladb/scylladb#17654 Closes scylladb/scylladb#18368	2024-04-24 10:09:26 +03:00
Botond Dénes	572003c469	Merge 'Cleanup the way snapshot details are propagated via API' from Pavel Emelyanov There's a database::get_snapshot_details() method that returns collection of all snapshots for all ks.cf out there and there are several snapshot_details aux structures around it. This PR keeps only one "details" and cleans up the way it propagates from database up to the respective API calls. Closes scylladb/scylladb#18317 * github.com:scylladb/scylladb: snapshot_ctl: Brush up true_snapshots_size() internals snapshot_ctl: Remove unused details struct snapshot_ctl: No double recoding of details database,snapshots: Move database::snapshot_details into snapshot_ctl database,snapshots: Make database::get_snapshot_details() return map, not vector table,snapshots: Move table::snapshot_details into snapshot_ctl	2024-04-23 16:28:25 +03:00
Ferenc Szili	98bec4e02a	sstable: large data handler needs to count range tombstones as rows When issuing warnings about partitions with the number of rows above a configured threshold, the large partitions handler does not take into consideration the number of range tombstone markers in the total rows count. This fix adds the number of range tombstone markers to the total number of rows and saves this total in system.large_partitions.rows (if it is above the threshold). It also adds a new column range_tombstones to the system.large_partitions table which only contains the number of range tombstone markers for the given partition. This PR fixes the first part of issue #13968 It does not cover distinguishing between live and dead rows. A subsequent PR will handle that.	2024-04-22 15:24:18 +02:00
Pavel Emelyanov	e8f10be12e	snapshot_ctl: No double recoding of details Currently database::get_snapshot_details() returns a collection of snapshots. The snapshot_ctl converts this collection into similarly looking one with slightly different structures inside. The resulting collection is converted one more time on the API layer into another similarly looking map. This patch removes the intermediate conversion. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-19 20:04:32 +03:00
Pavel Emelyanov	f6bc283bbb	database,snapshots: Make database::get_snapshot_details() return map, not vector So that it's in-sync with table::get_snapshot_details(). Next patches will improve this place even further. Also, there can be many snapshots and vector can grow large, but that's less of an issue here. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-19 20:04:25 +03:00

... 100 101 102 103 104 ...

11801 Commits