scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Author	SHA1	Message	Date
Botond Dénes	f4f757af23	db/system_keyspace: replace storage_proxy::query*() with replica:: equivalent Use the recently introduced replica side query utility functions to query the content of the system tables. This allows us to cut the dependency of the system keyspace on storage proxy. The methods still take storage proxy parameter, this will be replaced with replica::database in the next patch. There is still one hidden storage proxy dependency left, via clq3::query_processor. This will be addressed later.	2023-05-12 07:27:55 -04:00
Botond Dénes	f5d41ac88c	replica: add query.hh Containing utility methods to query data from the local replica. Intended to be used to read system tables, completely bypassing storage proxy in the process. This duplicates some code already found in storage proxy, but that is a small price to pay, to be able to break some circular dependencies involving storage proxy, that have been plaguing us since time immemorial. One thing we lose with this, is the smp service level using in storage proxy. If this becomes a problem, we can create one in database and use it in these methods too. Another thing we lose is increasing `replica_cross_shard_ops` storage proxy stat. I think this is not a problem at all as these new functions are meant to be used by internal users, which will reduce the internal noise in this metric, which is meant to indicate users not using shard-aware clients.	2023-05-12 07:26:18 -04:00
Asias He	7fcc403122	tombstone_gc: Fix gc_before for immediate mode The immediate mode is similar to timeout mode with gc_grace_seconds zero. Thus, the gc_before returned should be the query_time instead of gc_clock::time_point::max in immediate mode. Setting gc_before to gc_clock::time_point::max, a row could be dropped by compaction even if the ttl is not expired yet. The following procedure reproduces the issue: - Start 2 nodes - Insert data ``` CREATE KEYSPACE ks2a WITH REPLICATION = { 'class' : 'SimpleStrategy', 'replication_factor' : 2 }; CREATE TABLE ks2a.tb (pk int, ck int, c0 text, c1 text, c2 text, PRIMARY KEY(pk, ck)) WITH tombstone_gc = {'mode': 'immediate'}; INSERT into ks2a.tb (pk,ck, c0, c1, c2) values (10 ,1, 'x', 'y', 'z') USING TTL 1000000; INSERT into ks2a.tb (pk,ck, c0, c1, c2) values (20 ,1, 'x', 'y', 'z') USING TTL 1000000; INSERT into ks2a.tb (pk,ck, c0, c1, c2) values (30 ,1, 'x', 'y', 'z') USING TTL 1000000; ``` - Run nodetool flush and nodetool compact - Compaction drops all data ``` ~128 total partitions merged to 0. ``` Fixes #13572 Closes #13800	2023-05-11 15:10:00 +03:00
Botond Dénes	3d75158fda	Merge 'Allow no owned token ranges in cleanup compaction' from Benny Halevy It is possible that a node will have no owned token ranges in some keyspaces based on their replication strategy, if the strategy is configured to have no replicas in this node's data center. In this case we should go ahead with cleanup that will effectively delete all data. Note that this is current very inefficient as we need to filter every partition and drop it as unowned. It can be optimized by either special casing this case or, better, use skip forward to the next owned range. This will skip to end-of-stream since there are no owned ranges. Fixes #13634 Also, add a respective rest_api unit test Closes #13849 * github.com:scylladb/scylladb: test: rest_api: test_storage_service: add test_storage_service_keyspace_cleanup_with_no_owned_ranges compaction_manager: perform_cleanup: handle empty owned ranges	2023-05-11 15:05:06 +03:00
Anna Stuchlik	7f7ab3ae3e	doc: fix the broken Glossary link Fixes https://github.com/scylladb/scylladb/issues/13805 This commit fixes the redirection required by moving the Glossary page from the top of the page tree to the Reference section. As the change was only merged to master (not to branch-5.2), it is not working for version 5.2, which is now the latest stable version. For this reason, "stable" in the path must be replaced with "master". Closes #13847	2023-05-11 10:30:59 +03:00
Botond Dénes	24cb351655	Merge 'test: sstable_test: avoid using helper using generation_type::int_t ' from Kefu Chai the series drops some of the callers using SSTable generation as integer. as the generation of SSTable is but an identifier, we should not use it as an integer out of generation_type's implementation. Closes #13845 github.com:scylladb/scylladb: test: drop unused helper functions test: sstable_mutation_test: avoid using helper using generation_type::int_t test: sstable_move_test: avoid using helper using generation_type::int_t test: sstable_*test: avoid using helper using generation_type::int_t test: sstable_3_x_test: do not use reuseable_sst() accepting integer	2023-05-11 10:17:02 +03:00
Benny Halevy	2fc142279f	compaction_manager: perform_cleanup: hold on to sstable_set around yielding Updates to the compaction_group sstable sets are never done in place. Instead, the update is done on a mutable copy of the sstable set, and the lw_shared result is set back in the compaction_group. (see for example compaction_group::set_main_sstables) Therefore, there's currently a risk in perform_cleanup `get_sstables` lambda that if it yield while in set.for_each_sstable, the sstable_set might be replaced and the copy it is traversing may be destroyed. This was introduced in `c2bf0e0b72`. To prevent that, hold on to set.shared_from_this() around set.for_each_sstable. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #13852	2023-05-11 09:46:53 +03:00
Benny Halevy	0b91bfbcc5	test: rest_api: test_storage_service: add test_storage_service_keyspace_cleanup_with_no_owned_ranges Test cleanup on a keyspace after altering it replication factor to 0. Expect no sstables to remain. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-11 08:16:31 +03:00
Kefu Chai	29284d64a5	test: drop unused helper functions all users of these two helpers have switched to their alternatives, so there is no need to keep them. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-11 12:32:37 +08:00
Kefu Chai	b036d2b50c	test: sstable_mutation_test: avoid using helper using generation_type::int_t this change is one of the series which drops most of the callers using SSTable generation as integer. as the generation of SSTable is but an identifier, we should not use it as an integer out of generation_type's implementation. so, in this change, instead of using `generation_type::int_t` in the helper functions, we just pass `generation_type` in place of integer. also, since `generate_clustered()` is only used by functions in the same compilation unit, let's take the opportunity to mark it `static`. and there is no need to pass generation as a template parameter, we just pass it as a regular parameter. we will divert other callers of `reusable_sst(..., generation_type::int)` in following-up changes in different ways. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-11 12:32:22 +08:00
Kefu Chai	689e1e99d6	test: sstable_move_test: avoid using helper using generation_type::int_t this change is one of the series which drops most of the callers using SSTable generation as integer. as the generation of SSTable is but an identifier, we should not use it as an integer out of generation_type's implementation. so, in this change, instead of using `generation_type::int_t` in helper functions, we just use `generation_type`. please note, despite that we'd prefer generating the generations using generator, the SSTables used by the tests modified by this change are stored in the repo, to ensure that the tests are always able to find the SSTable files, we keep them unchanged instead of using generation_generator, or a random generation for the testing. we will divert other callers of `reusable_sst(..., generation_type::int)` in following-up changes in different ways. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-11 12:32:22 +08:00
Kefu Chai	bfd6caffbb	test: sstable_*test: avoid using helper using generation_type::int_t this change is one of the series which drops most of the callers using SSTable generation as integer. as the generation of SSTable is but an identifier, we should not use it as an integer out of generation_type's implementation. so, in this change, instead of using the helper accepting int, we switch to the one which accepts generation_type by offering a default paramter, which is a generation created using 1. this preserves the existing behavior. we will divert other callers of `reusable_sst(..., generation_type::int)` in following-up changes in different ways. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-11 12:32:22 +08:00
Kefu Chai	ab8efbf1ab	test: sstable_3_x_test: do not use reuseable_sst() accepting integer this change is one of the series which drops most of the callers using SSTable generation as integer. as the generation of SSTable is but an identifier, we should not use it as an integer out of generation_type's implementation. so, in this change, instead of using the helper accepting int, we switch to the one which accepts generation_type. also, as no callers are using the last parameter of `make_test_sstable()`, let's drop it . Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-11 12:32:21 +08:00
Nadav Har'El	f1cad230bb	Merge 'cql: enable setting permissions on resources with quoted UDT names' from Wojciech Mitros This series fixes an issue with altering permissions on UDFs with parameter types that are UDTs with quoted names and adds a test for it. The issue was caused by the format of the temporary string that represented the UDT in `auth::resource`. After parsing the user input to a raw type, we created a string representing the UDT using `ut_name::to_string()`. The segment of the resulting string that represented the name of the UDT was not quoted, making us unable to parse it again when the UDT was being `prepare`d. Other than for this purpose, the `ut_name::to_string()` is used only for logging, so the solution was modifying it to maybe quote the UDT name. Ref: https://github.com/scylladb/scylladb/pull/12869 Closes #13257 * github.com:scylladb/scylladb: cql-pytest: test permissions for UDTs with quoted names cql: maybe quote user type name in ut_name::to_string() cql: add a check for currently used stack in parser cql-pytest: add an optional name parameter to new_type()	2023-05-10 19:10:29 +03:00
Wojciech Mitros	1f45c7364c	cql: check permissions for used functions when creating a UDA Currently, when creating a UDA, we only check for permissions for creating functions. However, the creator gains all permissions to the UDA, including the EXECUTE permission. This enables the user to also execute the state/reduce/final functions that were used in the UDA, even if they don't have the EXECUTE permissions on them. This patch adds checks for the missing EXECUTE permissions, so that the UDA can be only created if the user has all required permissions. The new permissions that are now required when creating a UDA are now granted in the existing UDA test. Fixes #13818 Closes #13819	2023-05-10 18:06:04 +03:00
Wojciech Mitros	a86b9fa0bb	auth: fix formatting of function resource with no arguments Currently, when a function has no arguments, the function_args() method, which is supposed to return a vector of string_views representing the arguments of the function, returns a nullopt instead, as if it was a functions_resource on all functions or all functions in a keyspace. As a result, the functions_resource can't be properly formatted. This is fixed in this patch by returning an empty vector instead, and the fix is confirmed in a cql-pytest. Fixes #13842 Closes #13844	2023-05-10 17:07:33 +03:00
Benny Halevy	3771d48488	sstables: mx: validate: close consumer context data_consume_rows keeps an input_stream member that must be closed. In particular, on the error path, when we destroy it possibly with readaheads in flight. Fixes #13836 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #13840	2023-05-10 17:05:43 +03:00
Benny Halevy	c720754e37	compaction_manager: perform_cleanup: handle empty owned ranges It is possible that a node will have no owned token ranges in some keyspaces based on their replication strategy, if the strategy is configured to have no replicas in this node's data center. In this case we should go ahead with cleanup that will effectively delete all data. Note that this is current very inefficient as we need to filter every partition and drop it as unowned. It can be optimized by either special casing this case ot, better, use skip forward to the next owned range. This will skip to end-of-stream since there are no owned ranges. Fixes #13634 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-10 15:11:53 +03:00
Avi Kivity	171a6cbbaa	cql3: untyped_result_set: document performance characteristics untyped_result_set is optimized towards conventience and safety, so note that. Closes #13661	2023-05-10 15:03:12 +03:00
Nadav Har'El	e57252092c	Merge 'cql3: result_set, selector: change value type to managed_bytes_opt' from Avi Kivity CQL evolved several expression evaluation mechanisms: WHERE clause, selectors (the SELECT clause), and the LWT IF clause are just some examples. Most now use expressions, which use managed_bytes_opt as the underlying value representation, but selectors still use bytes_opt. This poses two problems: 1. bytes_opt generates large contiguous allocations when used with large blobs, impacting latency 2. trying to use expressions with bytes_opt will incur a copy, reducing performance To solve the problem, we harmonize the data types to managed_bytes_opt (#13216 notwithstanding). This is somewhat difficult since the source of the values are views into a bytes_ostream. However, luckily bytes_ostream and managed_bytes_view are mostly compatible so with a little effort this can be done. The series is neutral wrt performance: before: ``` 222118.61 tps ( 61.1 allocs/op, 12.1 tasks/op, 43092 insns/op, 0 errors) 224250.14 tps ( 61.1 allocs/op, 12.1 tasks/op, 43094 insns/op, 0 errors) 224115.66 tps ( 61.1 allocs/op, 12.1 tasks/op, 43092 insns/op, 0 errors) 223508.70 tps ( 61.1 allocs/op, 12.1 tasks/op, 43107 insns/op, 0 errors) 223498.04 tps ( 61.1 allocs/op, 12.1 tasks/op, 43087 insns/op, 0 errors) ``` after: ``` 220708.37 tps ( 61.1 allocs/op, 12.1 tasks/op, 43118 insns/op, 0 errors) 225168.99 tps ( 61.1 allocs/op, 12.1 tasks/op, 43081 insns/op, 0 errors) 222406.00 tps ( 61.1 allocs/op, 12.1 tasks/op, 43088 insns/op, 0 errors) 224608.27 tps ( 61.1 allocs/op, 12.1 tasks/op, 43102 insns/op, 0 errors) 225458.32 tps ( 61.1 allocs/op, 12.1 tasks/op, 43098 insns/op, 0 errors) ``` Though I expect with some more effort we can eliminate some copies. Closes #13637 * github.com:scylladb/scylladb: cql3: untyped_result_set: switch to managed_bytes_view as the cell type cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt cql3: untyped_result_set: always own data types: abstract_type: add mixed-type versions of compare() and equal() utils/managed_bytes, serializer: add conversion between buffer_view<bytes_ostream> and managed_bytes_view utils: managed_bytes: add bidirectional conversion between bytes_opt and managed_bytes_opt utils: managed_bytes: add managed_bytes_view::with_linearized() utils: managed_bytes: mark managed_bytes_view::is_linearized() const	2023-05-10 15:01:45 +03:00
Wojciech Mitros	9ae1b02144	service: revoke permissions on functions when a function/keyspace is dropped Currently, when a user has permissions on a function/all functions in keyspace, and the function/keyspace is dropped, the user keeps the permissions. As a result, when a new function/keyspace is created with the same name (and signature), they will be able to use it even if no permissions on it are granted to them. Simliarly to regular UDFs, the same applies to UDAs. After this patch, the corresponding permissions on functions are dropped when a function/keyspace is dropped. Fixes #13820 Closes #13823	2023-05-10 14:39:42 +03:00
Botond Dénes	bb62038119	Merge 'Scrub compaction task' from Aleksandra Martyniuk Task manager's tasks covering scrub compaction on top, shard and table level. For this levels we have common scrub tasks for each scrub mode since they share code. Scrub modes will be differentiated on compaction group level. Closes #13694 * github.com:scylladb/scylladb: test: extend test_compaction_task.py to test scrub compaction compaction: add table_scrub_sstables_compaction_task_impl compaction: add shard_scrub_sstables_compaction_task_impl compaction: add scrub_sstables_compaction_task_impl api: get rid of unnecessary std::optional in scrub compaction: rename rewrite_sstables_compaction_task_impl	2023-05-10 14:18:20 +03:00
Anna Stuchlik	4898a20ae9	doc: add troubleshooting for failed schema sync Fixes https://github.com/scylladb/scylladb/issues/12133 This commit adds a Troubleshooting article to support users when schema sync failed on their cluster. Closes #13709	2023-05-10 14:01:36 +03:00
Avi Kivity	1a3545b13d	Merge 'data_dictionary: define helpers in options and define == operator only' from Kefu Chai in this series, `data_dictionary::storage_options` is refactored so that each dedicated storage option takes care of itself, instead of putting all the logic into `storage_options`. cleaner this way. as the next step, i will add yet another set of options for the tiered_storage which is backed by the s3_storage and the local filesystem_storage. with this change, we will be able to group the per-option functionalities together by the option thy are designed for, instead of sharding them by the actual function. Closes #13826 * github.com:scylladb/scylladb: data_dictionary: define helpers in options data_dictionary: only define operator== for storage options	2023-05-10 12:59:57 +03:00
Avi Kivity	e252dbcfb8	Merge ' readers,mutation: move mutation_fragment_stream_validator to mutation/' from Botond Dénes The validator classes have their definition in a header located in mutation/, while their implementation is located in a .cc in readers/mutation_reader.cc. This PR fixes this inconsistency by moving the implementation into mutation/mutation_fragment_stream_validator.cc. The only change is that the validator code gets a new logger instance (but the logger variable itself is left unchanged for now). Closes #13831 * github.com:scylladb/scylladb: mutation/mutation_fragment_stream_validator.cc: rename logger readers,mutation: move mutation_fragment_stream_validator to mutation/	2023-05-10 12:54:53 +03:00
Kamil Braun	7d9ab44e81	Merge 'token_metadata: read remapping for write_both_read_new' from Gusev Petr When new nodes are added or existing nodes are deleted, the topology state machine needs to shunt reads from the old nodes to the new ones. This happens in the `write_both_read_new` state. The problem is that previously this state was not handled in any way in `token_metadata` and the read nodes were only changed when the topology state machine reached the final 'owned' state. To handle `write_both_read_new` an additional `interval_map` inside `token_metadata` is maintained similar to `pending_endpoints`. It maps the ranges affected by the ongoing topology change operation to replicas which should be used for reading. When topology state sm reaches the point when it needs to switch reads to a new topology, it passes `request_read_new=true` in a call to `update_pending_ranges`. This forces `update_pending_ranges` to compute the ranges based on new topology and store them to the `interval_map`. On the data plane, when a read on coordinator needs to decide which endpoints to use, it first consults this `interval_map` in `token_metadata`, and only if it doesn't contain a range for current token it uses normal endpoints from `effective_replication_map`. Closes #13376 * github.com:scylladb/scylladb: storage_proxy, storage_service: use new read endpoints storage_proxy: rename get_live_sorted_endpoints->get_endpoints_for_reading token_metadata: add unit test for endpoints_for_reading token_metadata: add endpoints for reading sequenced_set: add extract_set method token_metadata_impl: extract maybe_migration_endpoints helper function token_metadata_impl: introduce migration_info token_metadata_impl: refactor update_pending_ranges token_metadata: add unit tests token_metadata: fix indentation token_metadata_impl: return unique_ptr from clone functions	2023-05-10 10:03:30 +02:00
Avi Kivity	550aa01242	Merge 'Restore raft::internal::tagged_uint64 type' from Benny Halevy Change `f5f566bdd8` introduced tagged_integer and replaced raft::internal::tagged_uint64 with utils::tagged_integer. However, the idl type for raft::internal::tagged_uint64 was not marked as final, but utils::tagged_integer is, breaking the on-the-wire compatibility. This change restores the use of raft::internal::tagged_uint64 for the raft types and adds back an idl definition for it that is not marked as final, similar to the way raft::internal::tagged_id extends utils::tagged_uuid. Fixes #13752 Closes #13774 * github.com:scylladb/scylladb: raft, idl: restore internal::tagged_uint64 type raft: define term_t as a tagged uint64_t idl: gossip_digest: include required headers	2023-05-09 22:51:25 +03:00
Kefu Chai	d8cd62b91a	compaction/compaction: initialize local variable the initial `validation_errors` should be zero. so let's initialize it instead of leaving it to uninitialized. this should address following warning from Clang-16: ``` /usr/bin/clang++ -DDEBUG -DDEBUG_LSA_SANITIZER -DFMT_DEPRECATED_OSTREAM -DFMT_SHARED -DSANITIZE -DSCYLLA_BUILD_MODE=debug -DSCYLLA_ENABLE_ERROR_INJECTION -DSEASTAR_API_LEVEL=6 -DSEASTAR_DEBUG -DSEASTAR_DEBUG_SHARED_PTR -DSEASTAR_DEFAULT_ALLOCATOR -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SHUFFLE_TASK_QUEUE -DSEASTAR_TYPE_ERASE_MORE -DXXH_PRIVATE_API -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/cmake/seastar/gen/include -I/home/kefu/dev/scylladb/build/cmake/gen -isystem /home/kefu/dev/scylladb/build/cmake/rust -Wall -Werror -Wno-error=deprecated-declarations -Wno-c++11-narrowing -Wno-mismatched-tags -Wno-overloaded-virtual -Wno-unsupported-friend -march=westmere -Og -g -gz -std=gnu++20 -fvisibility=hidden -U_FORTIFY_SOURCE -DSEASTAR_SSTRING -Wno-error=unused-result "-Wno-error=#warnings" -fstack-clash-protection -fsanitize=address -fsanitize=undefined -fno-sanitize=vptr -MD -MT compaction/CMakeFiles/compaction.dir/compaction.cc.o -MF compaction/CMakeFiles/compaction.dir/compaction.cc.o.d -o compaction/CMakeFiles/compaction.dir/compaction.cc.o -c /home/kefu/dev/scylladb/compaction/compaction.cc /home/kefu/dev/scylladb/compaction/compaction.cc:1681:9: error: variable 'validation_errors' is uninitialized when used here [-Werror,-Wuninitialized] validation_errors += co_await sst->validate(permit, descriptor.io_priority, cdata.abort, [&schema] (sstring what) { ^~~~~~~~~~~~~~~~~ /home/kefu/dev/scylladb/compaction/compaction.cc:1676:31: note: initialize the variable 'validation_errors' to silence this warning uint64_t validation_errors; ^ = 0 ``` the change which introduced this local variable was `7ba5c9cc6a`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13813	2023-05-09 22:49:29 +03:00
Avi Kivity	8c6229d229	Merge 'sstable: encode value using UUID' from Kefu Chai in this series, we encode the value of generation using UUID to prepare for the UUID generation identifier. simpler this way, as we don't need to have two ways to encode integer or a timeduuid: uuid with a zero timestamp, and a variant. also, add a `from_string()` factory method to convert string to generation to hide the the underlying type of value from generation_type's users. Closes #13782 * github.com:scylladb/scylladb: sstable: use generation_type::from_string() to convert from string sstable: encode int using UUID in generation_type	2023-05-09 22:07:23 +03:00
Avi Kivity	996f717dfc	Merge 'cql3/prepare_expr: force token() receiver name to be partition key token' from Jan Ciołek Let's say that we have a prepared statement with a token restriction: ```cql SELECT * FROM some_table WHERE token(p1, p2) = ? ``` After calling `prepare` the drivers receives some information about the prepared statment, including names of values bound to each bind marker. In case of a partition token restriction (`token(p1, p2) = ?`) there's an expectation that the name assigned to this bind marker will be `"partition key token"`. In a recent change the code handling `token()` expressions has been unified with the code that handles generic function calls, and as a result the name has changed to `token(p1, p2)`. It turns out that the Java driver relies on the name being `"partition key token"`, so a change to `token(p1, p2)` broke some things. This patch sets the name back to `"partition key token"`. To achieve this we detect any restrictions that match the pattern `token(p1, p2, p3) = X` and set the receiver name for X to `"partition key token"`. Fixes: #13769 Closes #13815 * github.com:scylladb/scylladb: cql-pytest: test that bind marker is partition key token cql3/prepare_expr: force token() receiver name to be partition key token	2023-05-09 20:44:46 +03:00
Anna Stuchlik	c64109d8c7	doc: add driver support for Serverless Fixes https://github.com/scylladb/scylladb/issues/13453 This is V2 of https://github.com/scylladb/scylladb/pull/13710/. This commit adds: - the information about which ScyllaDB drivers support ScyllaDB Cloud Serverless. - language and organization improvements to the ScyllaDB CQL Drivers page. Closes #13825	2023-05-09 20:43:22 +03:00
Kefu Chai	c872ade50f	sstable: use generation_type::from_string() to convert from string in this change, * instead of using "\d+" to match the generation, use "[^-]", * let generation_type to convert a string to generation before this change, we casts the matched string in SSTable file name to integer and then construct a generation identifier from the integer. this solution has a strong assumption that the generation is represented with an integer, we should not encode this assumption in sstable.cc, instead we'd better let generation_type itself to take care of this. also, to relax the restriction of regex for matching generation, let's just use any characters except for the delimeter -- "-". Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 22:57:39 +08:00
Kefu Chai	478c13d0d4	sstable: encode int using UUID in generation_type since we already use UUID for encoding an bigint in SSTable registry table, let's just use the same approach for encoding bigint in generation_type, to be more consistent, and less repeatings this way. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 22:57:38 +08:00
Petr Gusev	08529a1c6c	storage_proxy, storage_service: use new read endpoints We use set_topology_transition_state to set read_new state in storage_service::topology_state_load based on _topology_state_machine._topology.tstate. This triggers update_pending_ranges to compute and store new ranges for read requests. We use this information in storage_proxy::get_endpoints_for_reading when we need to decide which nodes to use for reading.	2023-05-09 18:42:03 +04:00
Petr Gusev	052b91fb1f	storage_proxy: rename get_live_sorted_endpoints->get_endpoints_for_reading We are going to use remapped_endpoints_for_reading, we need to make sure we use it in the right place. The get_live_sorted_endpoints function looks like what we need - it's used in all read code paths. From its name, however, this was not obvious. Also, we add the parameter ks_name as we'll need it to pass to remapped_endpoints_for_reading.	2023-05-09 18:42:03 +04:00
Petr Gusev	15fe4d8d69	token_metadata: add unit test for endpoints_for_reading	2023-05-09 18:42:03 +04:00
Petr Gusev	0e4e2df657	token_metadata: add endpoints for reading In this patch we add token_metadata::set_topology_transition_state method. If the current state is write_both_read_new update_pending_ranges will compute new ranges for read requests. The default value of topology_transition_state is null, meaning no read ranges are computed. We will add the appropriate set_topology_transition_state calls later. Also, we add endpoints_for_reading method to get read endpoints based on the computed ranges.	2023-05-09 18:41:59 +04:00
Kefu Chai	d24687ea26	data_dictionary: define helpers in options instead of dispatching and implementing the per-option handling right in `storage_option`, define these helpers in the dedicated option themselves, so `storage_option` is only responsible for dispatching. much cleaner this way. this change also makes it easier to add yet another storage backend. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 21:51:52 +08:00
Kefu Chai	152d0224dc	data_dictionary: only define operator== for storage options as the only user of these comparison operators is `storage_options::can_update_to()`, which just check if the given `storage_options` is equal to the stored one. so no need to define the <=> operator. also, no need to add the `friend` specifier, as the options are plain struct, all the member variables are public. make the comparison operator a member function instead of a free function, as in C++20 comparision operators are symmetric. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 21:51:45 +08:00
Botond Dénes	ef7b7223d5	mutation/mutation_fragment_stream_validator.cc: rename logger This code inherited its logger variable name from mutation reader, rename it to better match its new context.	2023-05-09 07:55:13 -04:00
Botond Dénes	8681f3e997	readers,mutation: move mutation_fragment_stream_validator to mutation/ The validator classes have their definition in a header located in mutation/, while their implementation is located in a .cc in readers/mutation_reader.cc. This patch fixes this inconsistency by moving the implementation into mutation/mutation_fragment_stream_validator.cc. The only change is that the validator code gets a new logger instance (but the logger variable itself is left unchanged for now).	2023-05-09 07:55:13 -04:00
Botond Dénes	287ccce1cc	Merge 'sstables: extract storage out ' from Kefu Chai this change extracts the storage class and its derived classes out into their own source files. for couple reasons: - for better readability. the sstables.hh is over 1005 lines. and sstables.cc 3602 lines. it's a little bit difficult to figure out how the different parts in these sources interact with each other. for instance, with this change, it's clear some of helper functions are only used by file_system_storage. - probably less inter-source dependency. by extracting the sources files out, they can be compiled individually, so changing one .cc file does not impact others. this could speed up the compilation time. Closes #13785 * github.com:scylladb/scylladb: sstables: storage: coroutinize idempotent_link_file() sstables: extract storage out	2023-05-09 14:03:40 +03:00
Jan Ciolek	9ad1c5d9f2	cql-pytest: test that bind marker is partition key token When preparing a query each bind marker gets a name. For a query like: ```cql SELECT * FROM some_table WHERE token(p1, p2) = ? ``` The bind marker's name should be `"partition key token"`. Java driver relies on this name, having something else, like `"token(p1, p2)"` be the name breaks the Java driver. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-05-09 12:33:06 +02:00
Jan Ciolek	8a256f63db	cql3/prepare_expr: force token() receiver name to be partition key token Let's say that we have a prepared statement with a token restriction: ```cql SELECT * FROM some_table WHERE token(p1, p2) = ? ``` After calling `prepare` the drivers receives some information about the prepared statment, including names of values bound to each bind marker. In case of a partition token restriction (`token(p1, p2) = ?`) there's an expectation that the name assigned to this bind marker will be `"partition key token"`. In a recent change the code handling `token()` expressions has been unified with the code that handles generic function calls, and as a result the name has changed to `token(p1, p2)`. It turns out that the Java driver relies on the name being `"partition key token"`, so a change to `token(p1, p2)` broke some things. This patch sets the name back to `"partition key token"`. To achieve this we detect any restrictions that match the pattern `token(p1, p2, p3) = X` and set the receiver name for X to `"partition key token"`. Fixes: #13769 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-05-09 12:32:57 +02:00
Petr Gusev	b2e5d8c21c	sequenced_set: add extract_set method Can be useful if we want to reuse the set when we are done with this sequenced_set instance.	2023-05-09 13:56:38 +04:00
Petr Gusev	0567ab82ac	token_metadata_impl: extract maybe_migration_endpoints helper function We are going to add a function in token_metadata to get read endpoints, similar to pending_endpoints_for. So in this commit we extract the maybe_migration_endpoints helper function, which will be used in both cases.	2023-05-09 13:56:38 +04:00
Petr Gusev	030f0f73aa	token_metadata_impl: introduce migration_info We are going to store read_endpoints in a way similar to pending ranges, so in this commit we add migration_info - a container for two boost::icl::interval_map. Also, _pending_ranges_interval_map is renamed to _keyspace_to_migration_info, since it captures the meaning better.	2023-05-09 13:56:38 +04:00
Petr Gusev	56c2b3e893	token_metadata_impl: refactor update_pending_ranges Now update_pending_ranges is quite complex, mainly because it tries to act efficiently and update only the affected intervals. However, it uses the function abstract_replication_strategy::get_ranges, which calls calculate_natural_endpoints for every token in the ring anyway. Our goal is to start reading from the new replicas for ranges in write_both_read_new state. In the current code structure this is quite difficult to do, so in this commit we first simplify update_pending_ranges. The main idea of the refactoring is to build a new version of token_metadata based on all planned changes (join, bootstrap, replace) and then for each token range compare the result of calculate_natural_endpoints on the old token_metadata and on the new one. Those endpoints that are in the new version and are not in the old version should be added to the pending_ranges. The add_mapping function is extracted for the future - we are going to use it to handle read mappings. Special care is taken when replacing with the same IP. The coordinator employs the get_natural_endpoints_without_node_being_replaced function, which excludes such endpoints from its result. If we compare the new (merged) and current token_metadata configurations, such endpoints will also be absent from pending_endpoints since they exist in both. To address this, we copy the current token_metadata and remove these endpoints prior to comparison. This ensures that nodes being replaced are treated like those being deleted.	2023-05-09 13:56:28 +04:00
Petr Gusev	3120cabf56	token_metadata: add unit tests We are going to refactor update_pending_ranges, so in this commit we add some simple unit tests to ensure we don't break it.	2023-05-09 13:56:06 +04:00
Benny Halevy	adfb79ba3e	raft, idl: restore internal::tagged_uint64 type Change `f5f566bdd8` introduced tagged_integer and replaced raft::internal::tagged_uint64 with utils::tagged_integer. However, the idl type for raft::internal::tagged_uint64 was not marked as final, but utils::tagged_integer is, breaking the on-the-wire compatibility. This change defines the different raft tagged_uint64 types in idl/raft_storage.idl.hh as non-final to restore the way they were serialized prior to `f5f566bdd8` Fixes #13752 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-09 12:38:20 +03:00

1 2 3 4 5 ...

36737 Commits