scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-21 09:00:35 +00:00

Author	SHA1	Message	Date
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes were applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Pavel Emelyanov	b29d3f1758	cas_request: Make read_command() accept query_processor Just replace the argument and patch the callers Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 10:54:28 +03:00
Avi Kivity	a55b434a2b	treewide: extend copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Solodovnikov	2187a59089	treewide: move `service::cas_request` out from `storage_proxy.hh` And remove all remaining inclusions of `storage_proxy.hh` in the headers. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-06-06 19:18:49 +03:00
Pavel Solodovnikov	92fd515186	lwt: for each statement in cas_request provide a row in CAS result set Previously batch statement result set included rows for only those updates which have a prefetch data present (i.e. there was an "old" (pre-existing) row for a key). Also, these rows were sorted not in the order in which statements appear in the batch, but in the order of updated clustering keys. If we have a batch which updates a few non-existent keys, then it's impossible to figure out which update inserted a new key by looking at the query response. Not only because the responses may not correspond to the order of statements in the batch, but even some rows may not show up in the result set at all. The patch proposes the following fix: For conditional batch statements the result set now always includes a row for each LWT statement, in the same order in which individual statements appear in the batch. This way we can always tell which update did actually insert a new key or update the existing one. `update_parameters::prefetch_data::row::is_in_cas_result_set` member variable was removed as well as supporting code in `cas_request::applies_to` which iterated through cas updates and marked individual `prefetch_data` rows as "need to be in cas result set". Instead now `cas_request::applies_to` is significantly simplified since it doesn't do anything more than checking `stmt.applies_to()` in short-circuiting manner. A few tests for the issue are written, other lwt-batch-related tests were adjusted accordingly to include rows in result set for each statement inside conditional batches. Tests: unit(dev, debug) Co-authored-by: Konstantin Osipov <kostja@scylladb.com> Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-09-04 13:13:26 +03:00
Pavel Solodovnikov	feaf2b6320	cas_request: move `modification_statement::build_cas_result_set` to `cas_request` This is just a plain move of the code from `modification_statement` to `cas_request` without changes in the logic, which will further help to refactor `build_cas_result_set` behavior to include a row for each LWT statement and order rows in the order of statements in a batch. Tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-09-04 12:25:06 +03:00
Pavel Solodovnikov	0f0ff73a58	cas_request: extract `find_old_row` helper function Factor out little helper function which finds a pre-existing row for a given `cas_row_update` (matching the primary key). Used in `cas_request::applies_to`. Will be used in a subsequent patch to move `modification_statement::build_cas_result_set` into `cas_request`. Tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-09-04 12:09:31 +03:00
Wojciech Mitros	45215746fe	increase the maximum size of query results to 2^64 Currently, we cannot select more than 2^32 rows from a table because we are limited by types of variables containing the numbers of rows. This patch changes these types and sets new limits. The new limits take effect while selecting all rows from a table - custom limits of rows in a result stay the same (2^32-1). In classes which are being serialized and used in messaging, in order to be able to process queries originating from older nodes, the top 32 bits of new integers are optional and stay at the end of the class - if they're absent we assume they equal 0. The backward compatibility was tested by querying an older node for a paged selection, using the received paging_state with the same select statement on an upgraded node, and comparing the returned rows with the result generated for the same query by the older node, additionally checking if the paging_state returned by the upgraded node contained new fields with correct values. Also verified if the older node simply ignores the top 32 bits of the remaining rows number when handling a query with a paging_state originating from an upgraded node by generating and sending such a query to an older node and checking the paging_state in the reply(using python driver). Fixes #5101.	2020-08-03 17:32:49 +02:00
Botond Dénes	92a7b16cba	query: read_command: add max_result_size This field will replace max size which is currently passed once per established rpc connection via the CLIENT_ID verb and stored as an auxiliary value on the client_info. For now it is unused, but we update all sites creating a read command to pass the correct value to it. In the next patch we will phase out the old max size and use this field to pass max size on each verb instead.	2020-07-28 18:00:29 +03:00
Pavel Emelyanov	3df4f3078f	storage_proxy: Move hint_wrapper from .hh to .cc It's only used there, but requires mutation_query.hh, which can thus be removed from storage_proxy.hh Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-17 17:40:25 +03:00
Gleb Natapov	0fed86e4c6	lwt: change cas_request::apply signature Change the way query result is passed from getting a reference to a result to getting a foreign_ptr<lw_shared_ptr<query::result>>. This will allow cas_request to keep it without copying.	2020-05-05 12:38:23 +03:00
Avi Kivity	6c84dd0045	cql3: update_statement: do not set query option always_return_static_content for list read-before-write The query option always_return_static_content was added for lightweight transactions in commits `e0b31dd273` (infrastructure) and `65b86d155e` (actual use). However, the flag was added unconditionally to update_parameters::options. This caused it to be set for list read-modify-write operations, not just for lightweight transactions. This is a little wasteful, and worse, it breaks compatibility as old nodes do not understand the always_return_static_content flag and complain when they see it. To fix, remove the always_return_static_content from update_parameters::options and only set it from compare-and-swap operations that are used to implement lightweight transactions. Fixes #5593. Reviewed-by: Gleb Natapov <gleb@scylladb.com> Message-Id: <20200114135133.2338238-1-avi@scylladb.com>	2020-01-14 16:15:20 +02:00
Konstantin Osipov	90346236ac	cql: propagate const property through prepared statement tree. cql_statement is a class representing a prepared statement in Scylla. It is used concurrently during execution, so it is important that its state is not changed by execution. Add const qualifier to the execution methods family, throughout the cql hierarchy. Mark a few places which do mutate prepared statement state during execution as mutable. While these are not affecting production today, as code ages, they may become a source of latent bugs and should be moved out of the prepared state or evaluated at prepare eventually: cf_property_defs::_compaction_strategy_class list_permissions_statement::_resource permission_altering_statement::_resource property_definitions::_properties select_statement::_opts	2019-11-26 14:18:17 +03:00
Avi Kivity	6aed3b7471	Merge "cql: trivial cleanup" from Vova * 'cql-trivial-cleanup' of ssh://github.com/scylladb/scylla-dev: cql: rename modification_statement::_sets_a_collection to _selects_a_collection cql: rename _column_conditions to _regular_conditions cql: remove unnecessary optional around prefetch_data	2019-11-13 15:12:10 +02:00
Konstantin Osipov	6159c012db	schema: pre-allocate the bitset of column_set The number of columns is usually small, and avoiding a resize speeds up bit manipulation functions.	2019-11-13 11:41:51 +03:00
Konstantin Osipov	191acec7ab	schema: rename column_mask to column_set Since it contains a precise set of columns, it's more accurate to call it a set, not a mask. Besides, the name column_mask is already used for column options on storage level.	2019-11-13 11:41:30 +03:00
Konstantin Osipov	0ad0369684	cql: remove unnecessary optional around prefetch_data	2019-11-12 20:15:24 +03:00
Vladimir Davydov	65b86d155e	cql: add static row to CAS failure result if there are static conditions Even if no rows match clustering key restrictions of a conditional statement with static columns conditions, we still must include the static column value into the CAS failure result set. For example, the following conditional DELETE statement create table t(k int, c int, s int static, v int, primary key(k, c)); insert into t(k, s) values(1, 1); delete v from t where k=1 and c=1 if v=1 and s=1; must return [applied=False, v=null, s=1] not just [applied=False, v=null, s=null] To fix that, set partition_slice::option::always_return_static_content for querying rows used for checking conditions so that we have the static row in update_parameters::prefetch_data even if no regular row matches clustering column restrictions. Plus modify cas_request::applies_to() so that it sets is_in_cas_result_set flag for the static row in case there are static column conditions, but the result set happens to be empty. As pointed out by Tomek, there's another reason to set partition_slice::option::always_return_static_content apart from building a correct result set on CAS failure. There could be a batch with two statements, one with clustering key restrictions which select no row, and another statement with only static column conditions. If we didn't enable this flag, we wouldn't get a static row even if it exists, and static column conditions would evaluate as if the static row didn't exist, for example, the following batch create table t(k int, c int, s int static, primary key(k, c)); insert into t(k, s) values(1, 1); begin batch insert into t(k, c) values(1, 1) if not exists update t set s = 2 where k = 1 if s = 1 apply batch; would fail although it clearly must succeed.	2019-10-28 22:30:37 +03:00
Vladimir Davydov	57d284d254	cql: exclude statements not checked by cas from result set Apart from conditional statements, there may be other reading statements in a batch, e.g. manipulating lists. We must not include rows fetched for them into the CAS result set. For instance, the following CAS batch: create table t(p int, c int, i int, l list<int>, primary key(p, c)); insert into t(p, c, i) values(1, 1, 1) insert into t(p, c, i, l) values(1, 1, 1, [1, 2, 3]) begin batch update t set i=3 where p=1 and c=1 if i=2 update t set l=l-[2] where p=1 and c=2 apply batch; is supposed to return [applied] | p | c | i ----------+---+---+--- False | 1 | 1 | 1 not [applied] | p | c | i ----------+---+---+--- False | 1 | 1 | 1 False | 1 | 2 | 1 To filter out such collateral rows from the result set, let's mark rows checked by conditional statements with a special flag.	2019-10-28 21:50:43 +03:00
Vladimir Davydov	8fbf344f03	cql: ignore clustering key if statement checks only static columns In case a CQL statement has only static columns conditions, we must ignore clustering key restrictions. Example: create table t(p int, c int, s int static, v int, primary key(p, c)); insert into t(p, s) values(1, 1); update t set v=1 where p=1 and c=1 if s=1; This conditional statement must successfully insert row (p=1, c=1, v=1) into the table even though there's no regular row with p=1 and c=1 in the table before it's executed, because the statement condition only applies to the static column s, which exists and matches.	2019-10-28 21:13:19 +03:00
Konstantin Osipov	e555dc502e	lwt: implement basic lightweight transactions support Support single-statement conditional updates as well as batches. This patch almost fully rewrites column_condition.cc, implementing is_satisfied_by(). Most of the remaining complications in column_condition implementation come from the need to properly handle frozen and multi-cell collection in predicates - up until now it was not possible to compare entire collection values between each other. This is further complicated since multi-cell lists and sets are returned as maps. We can no longer assume that the columns fetched by prefetch operation are non-frozen collections. IF EXISTS/IF NOT EXISTS condition fetches all columns, besides, a column may be needed to check other condition. When fetching the old row for LWT or to apply updates on list/columns, we now calculate precisely the list of columns to fetch. The primary key columns are also included in CAS batch result set, and are thus also prefetched (the user needs them to figure out which statements failed to apply). The patch is cross-checked for compatibility with cassandra-3.11.4-1545-g86812fa502 but does deviate from the origin in handling of conditions on static row cells. This is addressed in future series.	2019-10-27 23:42:49 +03:00
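
Commit 45215746fe in the list above describes widening row counts to 64 bits while staying compatible with older nodes: the top 32 bits travel as an optional trailing field, and a missing field is read as 0. The Python sketch below only illustrates that arithmetic. The function names `split_row_count` and `join_row_count` are hypothetical and do not come from the ScyllaDB source, which is C++ and uses its own serialization layer.

```python
from typing import Optional, Tuple

MASK32 = 0xFFFFFFFF  # selects the low 32 bits of an integer


def split_row_count(count: int) -> Tuple[int, int]:
    """Split a 64-bit row count into (low 32 bits, high 32 bits).

    Hypothetical helper: stands in for writing the legacy field and
    the new optional trailing field.
    """
    if not 0 <= count < 2**64:
        raise ValueError("count must fit in 64 bits")
    return count & MASK32, count >> 32


def join_row_count(low: int, high: Optional[int] = None) -> int:
    """Rebuild the count; an absent high half (older sender) counts as 0."""
    return ((high or 0) << 32) | (low & MASK32)


# A count above 2^32 survives the round trip when both halves are present.
low, high = split_row_count(5_000_000_000)
assert join_row_count(low, high) == 5_000_000_000

# A reader that only knows the low field sees a truncated value,
# which mirrors an older node ignoring the top 32 bits.
legacy_view = join_row_count(low)
```

Keeping the high half optional and at the end of the structure is what lets an older reader stop after the fields it knows about, under the assumption that its deserializer tolerates trailing data.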

21 Commits