scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-30 05:07:05 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	c0b922a8af	sstable_directory: Construct with state This is to replace full path sitting on this object eventually. For now they have to co-exist, but state will be used to make_sstable()-s from manager with its new API Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-14 14:56:01 +03:00
Avi Kivity	b120d35c58	Merge 'Relax cql_test_env services maintenance' from Pavel Emelyanov To add a sharded service to the cql_test_env one needs to patch it in 5 or 6 places - add cql_test_env reference - add cql_test_env constructor argument - initialize the reference in initializer list - add service variable to do_with method - pass the variable to cql_test_env constructor - (optionally) export it via cql_test_env public method Steps 1 through 5 are annoying, things get much simpler if look like - add cql_test_env variable - (optionally) export it via cql_test_env public method This is what this PR does refs: #2795 Closes #15028 * github.com:scylladb/scylladb: cql_test_env: Drop local *this reference cql_test_env: Drop local references cql_test_env: Move most of the stuff in run_in_thread() cql_test_env: Open-code env start/stop and remove both cql_test_env: Keep other services as class variables cql_test_env: Keep services as class variables cql_test_env: Construct env early cql_test_env: De-static fdpinger variable cql_test_env: Define all services' variables early cql_test_env: Keep group0_client pointer	2023-08-13 20:24:52 +03:00
Gleb Natapov	70b5360a73	cql3: Extend the scope of group0_guard during DDL statement execution Currently we hold group0_guard only during DDL statement's execute() function, but unfortunately some statements access underlying schema state also during check_access() and validate() calls which are called by the query_processor before it calls execute. We need to cover those calls with group0_guard as well and also move retry loop up. This patch does it by introducing new function to cql_statement class take_guard(). Schema altering statements return group0 guard while others do not return any guard. Query processor takes this guard at the beginning of a statement execution and retries if service::group0_concurrent_modification is thrown. The guard is passed to the execute in query_state structure. Fixes: #13942 Message-ID: <ZNSWF/cHuvcd+g1t@scylladb.com>	2023-08-13 14:19:39 +03:00
Pavel Emelyanov	64ddc9e4b4	cql_test_env: Drop local this reference The auto& env = this is also now excessive, so drop it Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:30:34 +03:00
Pavel Emelyanov	de679d7c36	cql_test_env: Drop local references The local auto& foo = env._foo references in run_in_thread() a no longer needed, the code that uses foo can be switched to use _foo (this->_foo) instead Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:29:42 +03:00
Pavel Emelyanov	487ecae517	cql_test_env: Move most of the stuff in run_in_thread() Thw do_with() method is static and cannot just access cql_test_env variable's fields, using local references instead. To simplify this, most of the method's content is moved to non-static run_in_thread() method Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:28:40 +03:00
Pavel Emelyanov	2c175660f2	cql_test_env: Open-code env start/stop and remove both These two just make more churn in next patch, so drop both Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:28:03 +03:00
Pavel Emelyanov	10f9292fe8	cql_test_env: Keep other services as class variables There are more services on do_with() stack that are not referenced from the cql_test_env. Move them to be class variables too Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:27:19 +03:00
Pavel Emelyanov	08a3be3b17	cql_test_env: Keep services as class variables Now they are duplicated -- variables exist on do_with() stack and the class references some of them. This patch makes is vice-versa -- all the variables are on the cql_test_env and do_with() references them. The latter will change soon Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:26:21 +03:00
Pavel Emelyanov	b31d2097b8	cql_test_env: Construct env early Its constructor is _just_ assigning references and setting up rlimits. Both can happen early Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:25:49 +03:00
Pavel Emelyanov	49d4760655	cql_test_env: De-static fdpinger variable So that it could be moved onto cql_test_env as a class member Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:25:25 +03:00
Pavel Emelyanov	749c5baf21	cql_test_env: Define all services' variables early Nowadays they are all scattered along the .do_with() function. Keeping them in one early place makes it possible to relocate them onto the cql_test_env later Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:23:54 +03:00
Pavel Emelyanov	d36737f094	cql_test_env: Keep group0_client pointer It's now reference, but some time later it won't be able to get initialized construction-time, so turn it into pointer Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 15:23:16 +03:00
Pavel Emelyanov	da98355bc8	test: Remove require_..._exists from cql_test_env Not used by any code anymore. This makes cql_test_env shorter and nicer Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:46:36 +03:00
Pavel Emelyanov	64c8a59e9b	test: Open-code ks.cf name parse into cdc_test The test uses qualified ks.cf name to find the schema, but it's the only test case that does it. There's no point in maintaining a dedicated helper on the cql_test_env just for that Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:46:36 +03:00
Pavel Emelyanov	6ead9a5255	test: Don't use require_table_exists() in test/lib/random_schema This check is pointless. The subsequent call to find_column_family() would call on_internal_error() in case schema is not found, and since cql_test_env sets abort-on-internal-error to true, this would fail just like that Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:46:36 +03:00
Pavel Emelyanov	b4c84f9174	test: Use BOOST_REQUIRE(!db.has_schema()) Surprisingly there's a dedicated helper for the check opposite to the one fixed in the previous patch. Fix one too Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:46:36 +03:00
Pavel Emelyanov	063baabaee	test: Use BOOST_REQUIRE(db.has_schema()) Same as in previous patch, the cql_test_env::require_table_exists() helper is exactly the same, but returns future and asserts on failures for no gain Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:46:32 +03:00
Pavel Emelyanov	a128108acd	test: Use BOOST_REQUIRE(db.has_keyspace()) The cql_test_env::require_keyspace_exists() performs exactly the same check, but is future-returning function for no reason and it assert()s on failure, that's less informative (not that it ever failed) than BOOST_REQUIRE Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:46:29 +03:00
Pavel Emelyanov	53c309063e	test: Threadify cql_query_test::test_compact_storage case It's like in previous patch, and for the same reason, but the change is a bit more complicated because it uses resolved futures' results in few places, so it likely deserves separate commit Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:39:40 +03:00
Pavel Emelyanov	59db35fba0	test: Threadify some cql_query_test cases Those two use straight .then-s sequences, no point in keeping them that long. Being threads makes next patches shorter and nicer Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-12 11:38:51 +03:00
Kamil Braun	8f658fb139	Merge 's3/client: check for available port before starting minio server' from Kefu Chai there is chance that the default port of 9000 has been used on the host running the test, in that case, we should try to use another available port. so, in this change, we try ports in the ranges of [9000, 9000+1000), and use the first one which is not connectable. Fixes #14985 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14997 * github.com:scylladb/scylladb: test: stop using HostRegistry in MinioServer s3/client: check for available port before starting minio server	2023-08-10 14:01:13 +02:00
Alejo Sanchez	e2122163f5	test/pylib: protect double call to cluster stop test.py schedules calls to cluster .uninstall() and .stop() making double calls to it running at the same time. Mark the cluster as not running early on. While there, do the same for .stop_gracefully() for consistency. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Closes #14987	2023-08-10 13:37:49 +02:00
Kefu Chai	0c0a59bf62	test: stop using HostRegistry in MinioServer since MinioServer find a free port by itself, there is no need to provide it an IP address for it anymore -- we can always use 127.0.0.1. so, in this change, we just drop the HostRegistry parameter passed to the constructor of MinioServer, and pass the host address in place of it. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-08-09 23:40:22 +08:00
Kamil Braun	59c410fb97	Merge 'migration_manager: announce: provide descriptions for all calls' from Patryk Jędrzejczak The `system.group0_history` table provides useful descriptions for each command committed to Raft group 0. One way of applying a command to group 0 is by calling `migration_manager::announce`. This function has the `description` parameter set to empty string by default. Some calls to `announce` use this default value which causes `null` values in `system.group0_history`. We want `system.group0_history` to have an actual description for every command, so we change all default descriptions to reasonable ones. Going further, We remove the default value for the `description` parameter of `migration_manager::announce` to avoid using it in the future. Thanks to this, all commands in `system.group0_history` will have a non-null description. Fixes #13370 Closes #14979 * github.com:scylladb/scylladb: migration_manager: announce: remove the default value of description test: always pass empty description to migration_manager::announce migration_manager: announce: provide descriptions for all calls	2023-08-09 16:58:41 +02:00
Kefu Chai	29554b0fc6	s3/client: check for available port before starting minio server there is chance that the default port of 9000 has been used on the host running the test, in that case, we should try to use another available port. so, in this change, we try ports in the ranges of [9000, 9000+1000), and use the first one which is not connectable. Fixes #14985 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-08-09 17:33:42 +08:00
Botond Dénes	108e510a23	Merge 'Update sstable_requiring_cleanup on compaction completion' from Benny Halevy Currently `sstable_requiring_cleanup` is updated using `compacting_sstable_registration`, but that mechanism is not used by offstrategy compaction, leading to #14304. This series introduces `compaction_manager::on_compaction_completion` that intercepts the call to the table::on_compaction_completion. This allows us to update `sstable_requiring_cleanup` right before the compacted sstables are deleted, making sure they are no leaked to `sstable_requiring_cleanup`, which would hold a reference to them until cleanup attempts to clean them up. `cleanup_incremental_compaction_test` was adjusted to observe the sstables `on_delete` (by adding a new observer event) to detect the case where cleanup attempts to delete the leaked sstables and fails since they were already deleted from the file system by offstrategy compaction. The test fails with the fix and passes with it. Fixes #14304 Closes #14858 * github.com:scylladb/scylladb: compaction_manager: on_compaction_completion: erase sstables from sstables_requiring_cleanup compaction/leveled_compaction_strategy: ideal_level_for_input: special case max_sstable_size==0 sstable: add on_delete observer compaction_manager: add on_compaction_completion sstable_compaction_test: cleanup_incremental_compaction_test: verify sstables_requiring_cleanup is empty	2023-08-09 11:03:45 +03:00
Pavel Emelyanov	f1515c610e	code: Remove query-context.hh The whole thing is unused now, so the header is no longer needed Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-08 11:11:07 +03:00
Pavel Emelyanov	413d81ac16	code: Remove qctx Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-08 11:10:56 +03:00
Benny Halevy	7a7c8d0d23	compaction_manager: on_compaction_completion: erase sstables from sstables_requiring_cleanup Erase retired sstable from compaction_state::sstables_requiring_cleanup also on_compaction_completion (in addition to compacting_sstable_registration::release_compacting for offstrategy compaction with piggybacked cleanup or any other compaction type that doesn't use compacting_sstable_registration. Add cleanup_during_offstrategy_incremental_compaction_test that is modeled after cleanup_incremental_compaction_test to check that cleanup doesn't attempt to cleanup already-deleted sstables that were left over by offstrategy compaction in sstables_requiring_cleanup. Fixes #14304 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-08-08 08:16:46 +03:00
Benny Halevy	ea64ae54f8	sstable_compaction_test: cleanup_incremental_compaction_test: verify sstables_requiring_cleanup is empty Make sure that there are no sstables_requiring_cleanup after cleanup compaction. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-08-08 08:12:01 +03:00
Patryk Jędrzejczak	866c9a904d	test: always pass empty description to migration_manager::announce In the next commit, we remove the default value for the description parameter of migration_manager::announce to avoid using it in the future. However, many calls to announce in tests use the default value. We have to change it, but we don't really care about descriptions in the tests, so we pass the empty string everywhere.	2023-08-07 14:38:11 +02:00
Avi Kivity	4f7e83a4d0	cql3: select_statement: reject DISTINCT with GROUP BY on clustering keys While in SQL DISTINCT applies to the result set, in CQL it applies to the table being selected, and doesn't allow GROUP BY with clustering keys. So reject the combination like Cassandra does. While this is not an important issue to fix, it blocks un-xfailing other issues, so I'm clearing it ahead of fixing those issues. An issue is unmarked as xfail, and other xfails lose this issue as a blocker. Fixes #12479 Closes #14970	2023-08-07 15:35:59 +03:00
Botond Dénes	fa4aec90e9	Merge 'test: tasks: Fix task_manager/wait_task test ' from Aleksandra Martyniuk Rewrite test that checks whether task_manager/wait_task works properly. The old version didn't work. Delete functions used in old version. Closes #14959 * github.com:scylladb/scylladb: test: rewrite wait_task test test: move ThreadWrapper to rest_util.py	2023-08-07 09:04:29 +03:00
Avi Kivity	6c1e44e237	Merge 'Make replica::database and cql3::query_processor share wasm manager' from Pavel Emelyanov This makes it possible to remove remaining users of the global qctx. The thing is that db::schema_tables code needs to get wasm's engine, alien runner and instance cache to build wasm context for the merged function or to drop it from cache in the opposite case. To get the wasm stuff, this code uses global qctx -> query_processor -> wasm chain. However, the functions (un)merging code already has the database reference at hand, and its natural to get wasm stuff from it, not from the q.p. which is not available So this PR packs the wasm engine, runner and cache on sharded<wasm::manager> instance, makes the manager be referenced by both q.p. and database and removes the qctx from schema tables code Closes #14933 * github.com:scylladb/scylladb: schema_tables: Stop using qctx database: Add wasm::manager& dependency main, cql_test_env, wasm: Start wasm::manager earlier wasm: Shuffle context::context() wasm: Add manager::remove() wasm: Add manager::precompile() wasm: Move stop() out of query_processor wasm: Make wasm sharded<manager> query_processor: Wrap wasm stuff in a struct	2023-08-06 17:00:28 +03:00
Avi Kivity	412629a9a1	Merge 'Export tablet load-balancer metrics' from Tomasz Grabiec The metrics are registered on-demand when load-balancer is invoked, so that only leader exports the metrics. When leader changes, the old leader will stop exporting. The metrics are divided into two levels: per-dc and per-node. In prometheus, they will have appropriate labels for dc and host_id values. Closes #14962 * github.com:scylladb/scylladb: tablet_allocator: unregister metrics when leadership is lost tablets: load_balancer: Export metrics service, raft: Move balance_tablets() to tablet_allocator tablet_allocator: Start even if tablets feature is not enabled main, storage_service: Pass tablet allocator to storage_service	2023-08-06 16:58:27 +03:00
Tomasz Grabiec	f26e65d4d4	tablets: Fix crash on table drop Before the patch, tablet metadata update was processed on local schema merge before table changes. When table is dropped, this means that for a while table will exist without a corresponding tablet map. This can cause memtable flush for this table to fail, resulting in intentional abort(). That's because sstable writing attempts to access tablet map to generate sharding metadata. If auto_snapshot is enabled, this is much more likely to happen, because we flush memtables on table drop. To fix the problem, process tablet metadata after dropping tables, but before creating tables. Fixes #14943 Closes #14954	2023-08-06 16:45:43 +03:00
Tomasz Grabiec	67c7aadded	service, raft: Move balance_tablets() to tablet_allocator The implementation will access metrics registered from tablet_allocator.	2023-08-05 21:48:08 +02:00
Tomasz Grabiec	5bfc8b0445	main, storage_service: Pass tablet allocator to storage_service Tablet balancing will be done through tablet_allocator later.	2023-08-05 03:10:26 +02:00
Pavel Emelyanov	fa93ac9bfd	database: Add wasm::manager& dependency The dependency is needed by db::schema_tables to get wasm manager for its needs. This patch prepares the ground. Now the wasm::manager is shared between replica::database and cql3::query_processor Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-04 19:47:50 +03:00
Pavel Emelyanov	f4e7ffa0fc	main, cql_test_env, wasm: Start wasm::manager earlier It will be needed by replica::database and should be available that early. It doesn't depend on anything and can be moved in the starting order safely Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-04 19:47:50 +03:00
Pavel Emelyanov	243f2217dd	wasm: Make wasm sharded<manager> The wasm::manager is just cql3::wasm_context renamed. It now sits in lang/wasm* and is started as a sharded service in main (and cql test env). This move also needs some headers shuffling, but it's not severe This change is required to make it possible for the wasm::manager to be shared (by reference) between q.p. and replica::database further Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-08-04 19:47:50 +03:00
Aleksandra Martyniuk	629f893355	test: rewrite wait_task test Rewrite test that checks whether task_manager/wait_task works properly. The old version didn't work. Delete functions used in old version.	2023-08-04 13:34:58 +02:00
Aleksandra Martyniuk	9d2e55fd37	test: move ThreadWrapper to rest_util.py Move ThreadWrapper to rest_util.py so it can be reused in different tests.	2023-08-04 13:29:03 +02:00
Botond Dénes	4d538e1363	Merge 'Task manager tasks covering compaction group compaction' from Aleksandra Martyniuk All compaction task executors, except for regular compaction one, become task manager compaction tasks. Creating and starting of major_compaction_task_executor is modified to be consistent with other compaction task executors. Closes #14505 * github.com:scylladb/scylladb: test: extend test_compaction_task.py to cover compaction group tasks compaction: turn custom_task_executor into compaction_task_impl compaction: turn sstables_task_executor into sstables_compaction_task_impl compaction: change sstables compaction tasks type compaction: move table_upgrade_sstables_compaction_task_impl compaction: pass task_info through sstables compaction compaction: turn offstrategy_compaction_task_executor into offstrategy_compaction_task_impl compaction: turn cleanup_compaction_task_executor into cleanup_compaction_task_impl comapction: use optional task info in major compaction compaction: use perform_compaction in compaction_manager::perform_major_compaction	2023-08-04 10:11:00 +03:00
Michał Jadwiszczak	b92d47362f	schema::describe: print 'synchronous_updates' only if it was specified While describing materialized view, print `synchronous_updates` option only if the tag is present in schema's extensions map. Previously if the key wasn't present, the default (false) value was printed. Fixes: #14924 Closes #14928	2023-08-04 09:52:37 +03:00
Kefu Chai	d8d91379e7	test: remove unnecessary check in compaction_manager_basic_test we wait for the same condition couple lines before, so no need to check it again using `BOOST_CHECK_EQUAL()`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14921	2023-08-04 09:26:22 +03:00
Kefu Chai	d4ee84ee1e	s3/test: nuke tempdir but keep $tempdir/log before this change, if the object_store test fails, the tempdir will be preserved. and if our CI test pipeline is used to perform the test, the test job would scan for the artifacts, and if the test in question fails, it would take over 1 hour to scan the tempdir. to alleviate the pain, let's just keep the scylla logging file no matter the test fails or succeeds. so that jenkins can scan the artifacts faster if the test fails. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14880	2023-08-03 11:07:59 +03:00
Konstantin Osipov	df97135583	test.py: forward the optional property file when creating a server To support multi-DC tests we need to provide a property file when creating a server. Forward it from the test client to test.py. Closes #14683	2023-08-02 13:45:19 +02:00
Kamil Braun	b835acf853	Merge 'Cluster features on raft: topology coordinator + check on boot' from Piotr Dulikowski This PR implements the functionality of the raft-based cluster features needed to safely manage and enable cluster features, according to the cluster features on raft design doc. Enabling features is a two phase process, performed by the topology coordinator when it notices that there are no topology changes in progress and there are some not-yet enabled features that are declared to be supported by all nodes: 1. First, a global barrier is performed to make sure that all nodes saw and persisted the same state of the `system.topology` table as the coordinator and see the same supported features of all nodes. When booting, nodes are now forbidden to revoke support for a feature if all nodes declare support for it, a successful barrier this makes sure that no node will restart and disable the features. 2. After a successful barrier, the features are marked as enabled in the `system.topology` table. The whole procedure is a group 0 operation and fails if the topology table is modified in the meantime (e.g. some node changes its supported features set). For now, the implementation relies on gossip shadow round check to protect from nodes without all features joining the cluster. In a followup, a new joining procedure will be implemented which involves the topology coordinator and lets it verify joining node's cluster features before the new node is added to group 0 and to the cluster. A set of tests for the new implementation is introduced, containing the same tests as for the non-raft-based cluster feature implementation plus one additional test, specific to this implementation. Closes #14722 * github.com:scylladb/scylladb: test: topology_experimental_raft: cluster feature tests test: topology: fix a skipped test storage_service: add injection to prevent enabling features storage_service: initialize enabled features from first node topology_state_machine: add size(), is_empty() group0_state_machine: enable features when applying cmds/snapshots persistent_feature_enabler: attach to gossip only if not using raft feature_service: enable and check raft cluster features on startup storage_service: provide raft_topology_change_enabled flag from outside storage_service: enable features in topology coordinator storage_service: add barrier_after_feature_update topology_coordinator: exec_global_command: make it optional to retake the guard topology_state_machine: add calculate_not_yet_enabled_features	2023-08-02 12:32:27 +02:00

1 2 3 4 5 ...

5424 Commits