scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Author	SHA1	Message	Date
Kefu Chai	20da130cdf	mutation: specialize fmt::formatter<range_tombstone_{entry,list}> this is a part of a series to migrating from `operator<<(ostream&, ..)` based formatting to fmtlib based formatting. the goal here is to enable fmtlib to print `range_tombstone_list` and `range_tombstone_entry` without the help of `operator<<`. the corresponding `operator<<()` for `range_tombstone_entry` is moved into test, where it is used. and the other one is dropped in this change, as all its callers are now using fmtlib for formatting now. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13627	2023-04-26 09:00:25 +03:00
Kamil Braun	a29b8cd02b	Merge 'cql3: fix a few misformatted printouts of column names in error messages' from Nadav Har'El Fix a few cases where instead of printing column names in error messages, we printed weird stuff like ASCII codes or the address of the name. Fixes #13657 Closes #13658 * github.com:scylladb/scylladb: cql3: fix printing of column_specification::name in some error messages cql3: fix printing of column_definition::name in some error messages	2023-04-25 14:21:09 +02:00
Tomasz Grabiec	a717c803c7	tests: row_cache: Add reproducer for reader producing missing closing range tombstone Adds a reproducer for #12462, which doesn't manifest in master any more after `f73e2c992f`. It's still useful to keep the test to avoid regresions. The bug manifests by reader throwing: std::logic_error: Stream ends with an active range tombstone: {range_tombstone_change: pos={position: clustered,ckp{},-1}, {tombstone: timestamp=-9223372036854775805, deletion_time=2}} The reason is that prior to the rework of the cache reader, range_tombstone_generator::flush() was used with end_of_range=true to produce the closing range_tombstone_change and it did not handle correctly the case when there are two adjacent range tombstones and flush(pos, end_of_range=true) is called such that pos is the boundary between the two. Closes #13665	2023-04-25 14:20:57 +02:00
Botond Dénes	8765442f3f	Merge 'utils: add basic_xx_hasher' from Benny Halevy Consolidate `bytes_view_hasher` and abstract_replication_strategy `factory_key_hasher` which are the same into a reusable utils::basic_xx_hasher. To be used in a followup series for netw:msg_addr. Closes #13530 * github.com:scylladb/scylladb: utils: hashing: use simple_xx_hasher utils: hashing: add simple_xx_hasher utils: hashers: add HasherReturning concept hashing: move static_assert to source file	2023-04-25 09:53:47 +02:00
Botond Dénes	b9491c0134	Merge 'Test the column_family rest api' from Benny Halevy Add a test for get/enable/disable auto_compaction via to column_family api. And add log messages for admin operations over that api. Closes #13566 * github.com:scylladb/scylladb: api: column_family: add log messages for admin operation test: rest_api: add test_column_family	2023-04-25 09:53:47 +02:00
Kefu Chai	b0a01d85e9	s3/test: collect log on exit the temporary directory holding the log file collecting the scylla subprocess's output is specified by the test itself, and it is `test_tempdir`. but unfortunately, cql-pytest/run.py is not aware of this. so `cleanup_all()` is not able to print out the logging messages at exit. as, please note, cql-pytest/run.py always collect "log" file under the directory created using `pid_to_dir()` where pid is the spawned subprocesses. but `object_store/run` uses the main process's pid for its reusable tempdir. so, with this change, we also register a cleanup func to printout the logging message when the test exits. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-25 09:53:47 +02:00
Alejo Sanchez	c06e01cfba	test/topology: log stages for concurrent test For concurrent schema changes test, log when the different stages of the test are finished. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Closes #13654	2023-04-25 09:53:47 +02:00
Kefu Chai	cc87e10f40	dht: print pk in decorated_key with "pk" prefix this change ensures that `dk._key` is formatted with the "pk" prefix. as in `3738fcb`, the `operator<<` for partition_key was removed. so the compiler has to find an alternative when trying to fulfill the needs when this operator<< is called. fortunately, from the compiler's perspective, `partition_key` has an `operator managed_bytes_view`, and this operator does not have the explicit specifier, and, `managed_bytes_view` does support `operator<<`. so this ends up with a change in the format of `decorated_key` when it is printed using `operator<<`. the code compiles. but unfortunately, the behavior is changed, and it breaks scylla-dtest/cdc_tracing_info_test.py where the partition_key is supposed to be printed like "pk{010203}" instead of "010203". the latter is how `managed_bytes_view` is formatted. a test is added accordingly to avoid future changes which break the dtest. Fixes scylladb#13628 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13653	2023-04-25 09:53:47 +02:00
Nadav Har'El	4eabb3f429	cql3: fix printing of column_definition::name in some error messages Printing a column_definition::name() in an error message is wrong, because it is "bytes" and printed as hexadecimal ASCII codes :-( Some error messages in cql3/operation.cc incorrectly used name() and should be changed to name_as_text(), as was correctly done in a few other error messages in the same file. This patch also fixes a few places in the test/cql approval tests which "enshrined" the wrong behavior - printing things like 666c697374696e74 in error messages - and now needs to be fixed for the right behavior. Fixes #13657 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2023-04-25 10:46:47 +03:00
Benny Halevy	f4fefec343	utils: hashing: add simple_xx_hasher And a respective unit test. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-24 14:06:43 +03:00
Pavel Emelyanov	28a01c9e60	Merge 'test: object_store: fix various pylint warnings' from Kefu Chai when reading this source code, there are a handful issues reported by my flycheck plugin. none of them is critical, but better off fixing them. Closes #13612 * github.com:scylladb/scylladb: test: object_store: specify timeout test: object_store: s/exit/sys.exit/ test: object_store: do not declare a global variable for read test: object_store: remove unused imports	2023-04-24 13:45:01 +03:00
Kefu Chai	642854f36f	test: s/os.P_NOWAIT/os.WNOHANG/ `os.P_NOWAIT` is supposed to be used in spawn calls, while `os.WNOHANG` is used as in the options parameter passed to wait calls. fortunately, `P_NOWAIT` is defined as "1" in CPython, and `os.WNOHANG` is defined as "1" in linux kernel. that's why the existing implementation works. but we should not rely on this coincidence. so, in this change, `os.P_NOWAIT` is replaced with `os.WNOHANG` for correctness and for better readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13646	2023-04-24 11:42:34 +03:00
Botond Dénes	864d27f9af	Merge 'clear_gently: handle null unique_ptr and optional values' from Benny Halevy This series adds handling of null std::unique_ptr to utils::clear_gently and handling of std::optional and seastar::optimized_optional (both engaged and disengaged cases). Also, unit tests were added to tests the above cases. Fixes #13636 Closes #13638 * github.com:scylladb/scylladb: utils: clear_gently: add variants for optional values utils: clear_gently: do not clear null unique_ptr	2023-04-24 10:27:32 +03:00
Botond Dénes	9e757d9c6d	Merge 'De-globalize storage proxy' from Pavel Emelyanov All users of global proxy are gone (), proxy can be made fully main/cql_test_env local. () one test case still needs it, but can get it via cql_test_env Closes #13616 * github.com:scylladb/scylladb: code: Remove global proxy schema_change_test: Use proxy from cql_test_env test: Carry proxy reference on cql_test_env	2023-04-24 09:38:00 +03:00
Botond Dénes	1750bb34b7	Merge 'sstables, replica: add generation generator' from Kefu Chai this is the first step to the uuid-based generation identifier. the goal is to encapsulate the generation related logic in generator, so its consumers do not have to understand the difference between the int64_t based generation and UUID v1 based generation. this commit should not change the behavior of existing scylla. it just allows us to derive from `generation_generator` so we can have another generator which generates UUID based generation identifier. Closes #13073 * github.com:scylladb/scylladb: replica, test: create generation id using generator sstables: add generation_generator test: sstables: use generate_n for generating ids for testing	2023-04-24 09:31:08 +03:00
Botond Dénes	7f04d8231d	Merge 'gms: define and use generation and version types' from Benny Halevy This series cleans up the generation and value types used in gms / gossiper. Currently we use a blend of int, int32_t, and int64_t around messaging. This change defines gms::generation_type and gms::version_type as int32_t and add check in non-release modes that the respective int64 value passed over messaging do not overflow 32 bits. Closes #12966 * github.com:scylladb/scylladb: gossiper: version_generator: add {debug_,}validate_gossip_generation gms: gossip_digest: use generation_type and version_type gms: heart_beat_state: use generation_type and version_type gms: versioned_value: use version_type gms: version_generator: define version_type and generation_type strong types utils: move generation-number to gms utils: add tagged_integer gms: versioned_value: make members private scylla-gdb: add get_gms_versioned_value gms: versioned_value: delete unused compare_to function gms: gossip_digest: delete unused compare_to function	2023-04-24 08:44:48 +03:00
Benny Halevy	002865018f	utils: clear_gently: add variants for optional values Implement clear_gently for std:;optional<T> and seastar::optimized_optional<T> and respective unit tests. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 21:34:02 +03:00
Benny Halevy	12877ad026	utils: clear_gently: do not clear null unique_ptr Otherwise the null pointer is dereferenced. Add a unit test reproducing the issue and testing this fix. Fixes #13636 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 21:33:11 +03:00
Pavel Emelyanov	5e201b9120	database: Remove compaction_manager.hh inclusion into database.hh The only reason why it's there (right next to compaction_fwd.hh) is because the database::table_truncate_state subclass needs the definition of compaction_manager::compaction_reenabler subclass. However, the former sub is not used outside of database.cc and can be defined in .cc. Keeping it outside of the header allows dropping the compaction_manager.hh from database.hh thus greatly reducing its fanout over the code (from ~180 indirect inclusions down to ~20). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13622	2023-04-23 16:27:11 +03:00
Benny Halevy	5dc7b7811c	gms: gossip_digest: use generation_type and version_type Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 08:48:01 +03:00
Benny Halevy	2d20ee7d61	gms: version_generator: define version_type and generation_type strong types Derived from utils::tagged_integer, using different tags, the types are incompatible with each other and require explicit typecasting to- and from- their value type. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 08:47:17 +03:00
Benny Halevy	f5f566bdd8	utils: add tagged_integer A generic template for defining strongly typed integer types. Use it here to replace raft::internal::tagged_uint64. Will be used for defining gms generation and version as strong and distinguishable types in following patches. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 08:37:32 +03:00
Kefu Chai	c2488fc516	test: object_store: specify timeout just in case scylla does not behave as expected, so we can identify the issue and error out sooner without hang forever until the whole test timesout. this issue was identified by pylint, see https://pylint.readthedocs.io/en/latest/user_guide/messages/warning/missing-timeout.html Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-22 00:38:37 +08:00
Tomasz Grabiec	bd0b299322	Merge 'Manage CDC generations when bootstrapping nodes using Raft Group 0 topology coordinator' from Kamil Braun Introduce a new table `CDC_GENERATIONS_V3` (`system.cdc_generations_v3`). The table schema is a copy-paste of the `CDC_GENERATIONS_V2` schema. The difference is that V2 lives in `system_distributed_keyspace` and writes to it are distributed using regular `storage_proxy` replication mechanisms based on the token ring. The V3 table lives in `system_keyspace` and any mutations written to it will go through group 0. Extend the `TOPOLOGY` schema with new columns: - `new_cdc_generation_data_uuid` will be stored as part of a bootstrapping node's `ring_slice`, it stores UUID of a newly introduced CDC generation which is used as partition key for the `CDC_GENERATIONS_V3` table to access this new generation's data. It's a regular column, meaning that every row (corresponding to a node) will have its own. - `current_cdc_generation_uuid` and `current_cdc_generation_timestamp` together form the ID of the newest CDC generation in the cluster. (the uuid is the data key for `CDC_GENERATIONS_V3`, the timestamp is when the CDC generation starts operating). Those are static columns since there's a single newest CDC generation. When topology coordinator handles a request for node to join, calculate a new CDC generation using the bootstrapping node's tokens, translate it to mutation format, and insert this mutation to the CDC_GENERATIONS_V3 table through group 0 at the same time we assign tokens to the node in Raft topology. The partition key for this data is stored in the bootstrapping node's `ring_slice`. After inserting new CDC generation data , we need to pick a timestamp for this generation and commit it, telling all nodes in the cluster to start using the generation for CDC log writes once their clocks cross that timestamp. We introduce a separate step to the bootstrap saga, before `write_both_read_old`, called `commit_cdc_generation`. In this step, the coordinator takes the `new_cdc_generation_data_uuid` stored in a bootstrapping node's `ring_slice` - which serves as the key to the table where the CDC generation data is stored - and combines it with a timestamp which it generates a bit into the future (as in old gossiper-based code, we use 2 * ring_delay, by default 1 minute). This gives us a CDC generation ID which we commit into the topology state as the `current_cdc_generation_id` while switching the saga to the next step, `write_both_read_old`. Once a new CDC generation is committed to the cluster by the topology coordinator, we also need to publish it to the user-facing description tables so CDC applications know which streams to read from. This uses regular distributed table writes underneath (tables living in the `system_distributed` keyspace) so it requires `token_metadata` to be nonempty. We need a hack for the case of bootstrapping the first node in the cluster - turning the tokens into normal tokens earlier in the procedure in `token_metadata`, but this is fine for the single-node case since no streaming is happening. When a node notices that a new CDC generation was introduced in `storage_service::topology_state_load`, it updates its internal data structures that are used when coordinating writes to CDC log tables. We include the current CDC generation data in topology snapshot transfers. Some fixes and refactors included. Closes #13385 * github.com:scylladb/scylladb: docs: cdc: describe generation changes using group 0 topology coordinator cdc: generation_service: add a FIXME cdc: generation_service: add legacy_ prefix for gossiper-based functions storage_service: include current CDC generation data in topology snapshots db: system_keyspace: introduce `query_mutations` with range/slice storage_service: hold group 0 apply mutex when reading topology snapshot service: raft_group0_client: introduce `hold_read_apply_mutex` storage_service: use CDC generations introduced by Raft topology raft topology: publish new CDC generation to the user description tables raft topology: commit a new CDC generation on node bootstrap raft topology: create new CDC generation data during node bootstrap service: topology_state_machine: make topology::find const db: system_keyspace: small refactor of `load_topology_state` cdc: generation: extract pure parts of `make_new_generation` outside db: system_keyspace: add storage for CDC generations managed by group 0 service: topology_state_machine: better error checking for state name (de)serialization service: raft: plumbing `cdc::generation_service&` cdc: generation: `get_cdc_generation_mutations`: take timestamp as parameter cdc: generation: make `topology_description_generator::get_sharding_info` a parameter sys_dist_ks: make `get_cdc_generation_mutations` public sys_dist_ks: move find_schema outside `get_cdc_generation_mutations` sys_dist_ks: move mutation size threshold calculation outside `get_cdc_generation_mutations` service/raft: group0_state_machine: signal topology state machine in `load_snapshot`	2023-04-21 18:11:27 +02:00
Kefu Chai	f85da1bd30	test: object_store: s/exit/sys.exit/ the former is expected to be used in an interactive session, not in an application. see also: https://docs.python.org/3/library/constants.html#constants-added-by-the-site-module and https://docs.python.org/3/library/sys.html#sys.exit Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-21 23:25:59 +08:00
Kefu Chai	c7b62fbf81	test: object_store: do not declare a global variable for read we only need to declare a variable with `global` when we need to write to it, but if we just want to read it, there is no need to declare it. because the way how python looks up for a variable when reading from it enables python to find the global variables (and apparently the functions!). but when we assign a variable in python, the interpreter would have to tell in which scope the variable lives. by default the local scope is used, and a new variable is added to `locals()`. but in this case, we just read from it. so no need to add the `global` statement. see also https://docs.python.org/3/reference/simple_stmts.html#global Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-21 23:25:59 +08:00
Kefu Chai	4989a59a0b	test: object_store: remove unused imports Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-21 23:25:59 +08:00
Kefu Chai	576adbdbc5	replica, test: create generation id using generator reuse generation_generator for generating generation identifiers for less repeatings. also, add allow update generator to update its lastest known generation id. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-21 22:02:30 +08:00
Kefu Chai	a2aa133822	treewide: use std::lexicographical_compare_threeway this the standard library offers `std::lexicographical_compare_threeway()`, and we never uses the last two addition parameters which are not provided by `std::lexicographical_compare_threeway()`. there is no need to have the homebrew version of trichotomic compare function. in this change, * all occurrences of `lexicographical_tri_compare()` are replaced with `std::lexicographical_compare_threeway()`. * ``lexicographical_tri_compare()` is dropped. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13615	2023-04-21 14:28:18 +03:00
Pavel Emelyanov	739455c3aa	code: Remove global proxy No code needs global proxy anymore. Keep on-stack values in main and cql_test_env and keep the pointer on debug:: namespace. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-21 14:18:59 +03:00
Pavel Emelyanov	f953fb2f52	schema_change_test: Use proxy from cql_test_env There's one place where test case calls for storage proxy and currently does it via global refernece. Time to switch it to cql_test_env's one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-21 14:18:00 +03:00
Pavel Emelyanov	681a19f54c	test: Carry proxy reference on cql_test_env All sharded<> services are created by cql_test_env on the stack. The cql_test_env() is then used to keep references on some of them and to export them to test cases via its methods. Proxy is missing on that exportable list, but will be needed, so add one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-21 14:16:54 +03:00
Kamil Braun	55f43e532c	Merge 'get rid of gms/failure_detector' from Benny Halevy Move gms::arrival_window to api/failure_detector which is its only user. and get rid of the rest, which is not used, now that we use direct_failure_detector instead. TODO: integare direct_failure_detector with failure_detector api. Closes #13576 * github.com:scylladb/scylladb: gms: get rid of unused failure_detector api: failure_detector: remove false dependency on failure_detector::arrival_window test: rest_api: add test_failure_detector	2023-04-21 11:47:44 +02:00
Kamil Braun	f7408130c9	Merge 'Fix topology management when raft-based topology is enabled' from Tomasz Grabiec Fixes a problem when raft-based topology is enabled, which loads topology from storage. It starts by clearing topology and then adding nodes one by one. Before this patch, this violates internal invariant of topology object which puts the local node as the first node. This would manifest by triggering an assert in topology::pop_node() which throws if popping the node at index 0 in order to keep the information about local node around. This is normally prevented by a check in topology::remove_node() which avoid calling pop_node() if removing the local node. But since there is no node which is marked as local, this check allows the first node to be popped. To fix the problem I lift the invariant that local node is always in _nodes. We still have information about local node in config. Instead of keeping it in _nodes, we recognize it as part of indexing. We also allow removing the local node like a regular node. The path which reloads topology works correctly after this, the local node will be recognized when (if) it is added to the topology. Fixes #13495 Closes #13498 * github.com:scylladb/scylladb: locator: topology: Fix move assignment locator: topology: Add printer tests: topology: Test that topology clearing preserves information about local node locator: topology: Recognize local node as part of indexing it locator: topology: Fix get_location(ep) for local node locator: topology: Fix typo locator: topology: Preserve config when cloning	2023-04-21 11:45:08 +02:00
Alejo Sanchez	ce87aedd30	test: topology smp test with custom cluster Instead of decommission of initial cluster, use custom cluster. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Closes #13589	2023-04-21 10:43:54 +02:00
Kefu Chai	b0ef053552	test: sstables: use generate_n for generating ids for testing so we don't need to keep a `prev_gen` around, this also prepares for the coming change to use generation generator. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-21 15:45:16 +08:00
Kefu Chai	c5fa1ac9f7	sstable: specialize fmt::formatter<component_type> this is a part of a series to migrating from `operator<<(ostream&, ..)` based formatting to fmtlib based formatting. the goal here is to enable fmtlib to print `component_type` without the help of `operator<<`. the corresponding `operator<<()` are dropped dropped in this change, as all its callers are now using fmtlib for formatting now. also, please note, to enable fmtlib to format `std::set<component_type>` in `test/boost/sstable_3_x_test.cc` , we need to include `<fmt/ranges.h>` in that source file. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13598	2023-04-21 09:49:24 +03:00
Kefu Chai	ecb5380638	treewide: s/boost::lexical_cast<std::string>/fmt::to_string()/ this change replaces all occurrences of `boost::lexical_cast<std::string>` in the source tree with `fmt::to_string()`. for couple reasons: * `boost::lexical_cast<std::string>` is longer than `fmt::to_string()`, so the latter is easier to parse and read. * `boost::lexical_cast<std::string>` creates a stringstream under the hood, so it can use the `operator<<` to stringify the given object. but stringstream is known to be less performant than fmtlib. * we are migrating to fmtlib based formatting, see #13245. so using `fmt::to_string()` helps us to remove yet another dependency on `operator<<`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13611	2023-04-21 09:43:53 +03:00
Benny Halevy	3f1ac846d8	gms: get rid of unused failure_detector The legacy failure_detector is now unused and can be removed. TODO: integare direct_failure_detector with failure_detector api. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-21 09:08:27 +03:00
Benny Halevy	35de60670c	test: rest_api: add test_failure_detector Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-21 09:06:15 +03:00
Nadav Har'El	9c3907bb3c	test/cql-pytest: reproducers for incorrect AVG of "decimal" type This patch contains tests reproducing issue #13601 and the corresponding Cassandra issue CASSANDRA-18470. These issues are about what the AVG aggregation does for arbitrary-precision "decimal" numbers - the tests we add here show examples where the current behavior doesn't make sense: The problem is that "decimal" has arbitrary precision - so, should an average of 1/3 be returned as 0.3 or 0.33333333333333333? This is not specified, so Scylla (and Cassandra) decided to pick the result precision based on the input precision. In particular, the average of 1 and 2 is returned as 2 (zero digits after the decimal point, like in the inputs) instead of the expected 1.5. Arguably this isn't useful behavior. The test adds a second test which fails on Cassandra, but does pass on Scylla: Cassandra returns as the average of 1, 2, 2, 3 the integer 1 whereas the correct average is 2 (and Scylla returns it correctly). The reason why this bug is even worse on Cassandra is that Scylla's AVG only loses precision when dividing the sum and count, but Cassandra tries to maintain only the average, and loses precision at every step. Refs #13601 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13603	2023-04-21 08:32:30 +03:00
Avi Kivity	0c64dd12b1	test: raft_server_test: fix string compare for clang 15 Clang 15 rejects string compares where the left-hand-side is a C string, so help it along by converting it ourselves. Closes #13582	2023-04-21 06:38:10 +03:00
Tomasz Grabiec	0ec700cd00	locator: topology: Fix move assignment Defaulted assignment doesn't update node::_topology.	2023-04-20 23:39:18 +02:00
Tomasz Grabiec	3dfd49fe62	tests: topology: Test that topology clearing preserves information about local node	2023-04-20 23:39:18 +02:00
Tomasz Grabiec	7d3384089a	locator: topology: Recognize local node as part of indexing it Fixes a problem when raft-based topology is enabled, which loads topology from storage. It starts by clearing topology and then adding nodes one by one. Before this patch, this violates internal invariant of topology object which puts the local node as the first node. This would manifest by triggering an assert in topology::pop_node() which throws if popping the node at index 0 in order to keep the information about local node around. This is normally prevented by a check in topology::remove_node() which avoid calling pop_node() if removing the local node. But since there is no node which is marked as local, this check allows the first node to be popped. To fix the problem I lift the invariant that local node is always in _nodes. We still have information about local node in config. Instead of keeping it in _nodes, we recognize it as part of indexing. We also allow removing the local node like a regular node. The path which reloads topology works correctly after this, the local node will be recognized when (if) it is added to the topology. Fixes #13495	2023-04-20 23:39:18 +02:00
Botond Dénes	1426c623eb	Merge 'Tune up S3 unit tests environment usage (and a bit more)' from Pavel Emelyanov The tests in question are using MINIO_SERVER_ADDRESS environment variable to export minio server address from pylib to test cases. Also they use hard-coded public bucket name. Both plays badly with AWS S3, the former due to MINIO_... in its name and the latter because public bucket name can be any. So this PR puts address and public bucket name into S3_..._FOR_TEST environment variables and fixes output stream closure on failure while at it. Detached from #13493 Closes #13546 * github.com:scylladb/scylladb: s3/test: Rename MINIO_SERVER_ADDRESS environment variable s3/test: Keep public bucket name in environment s3/test: Fix upload stream closure test/lib: Add getenv_safe() helper	2023-04-20 18:01:12 +03:00
Kamil Braun	59b692e799	service: raft: plumbing `cdc::generation_service&` Pass a reference to the service into places. It shall be used later, by the group 0 state machine and topology coordinator.	2023-04-20 15:38:37 +02:00
Botond Dénes	66ee73641e	test/cql-pytest/nodetool.py: no_autocompaction_context: use the correct API This `with` context is supposed to disable, then re-enable autocompaction for the given keyspaces, but it used the wrong API for it, it used the column_family/autocompaction API, which operates on column families, not keyspaces. This oversight led to a silent failure because the code didn't check the result of the request. Both are fixed in this patch: * switch to use `storage_service/auto_compaction/{keyspace}` endpoint * check the result of the API calls and report errors as exceptions Fixes: #13553 Closes #13568	2023-04-20 16:21:16 +03:00
Kamil Braun	8d7b5f1710	Merge 'test/pylib: topology fix asyncio fixture and fix logger' from Alecco Remove unnecessary asyncio marker and re-introduce top level logger instance. Closes #13561 * github.com:scylladb/scylladb: test/pylib: add missing logger test/pylib: remove unnecessary asyncio marker	2023-04-20 14:23:05 +02:00
Alejo Sanchez	11561a73cb	test/pylib: ManagerClient helpers to wait for... server to see other servers after start/restart When starting/restarting a server, provide a way to wait for the server to see at least n other servers. Also leave the implementation methods available for manual use and update previous tests, one to wait for a specific server to be seen, and one to wait for a specific server to not be seen (down). Fixes #13147 Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Closes #13438	2023-04-20 14:22:31 +02:00

1 2 3 4 5 ...

4730 Commits