It seems that batch prepared statements always return false for
depends_on_keyspace and depends_on_column_family. This, in turn,
causes the removal criteria for the cache to always evaluate to false,
which results in the queries never being evicted.
Here we change the functions to return the true state, meaning
they will return true if any of the sub-queries depends on
the keyspace or column family.
In this fix we first make the API more coherent and then use the new API to implement
the batch statement's dependency test.
Fixes #10129
Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>
Closes #10132
* github.com:scylladb/scylla:
prepared_statements: Invalidate batch statement too
cql3 statements: Change dependency test API to better express its purpose
Recently, coordinator_result was introduced as an alternative to
exceptions. It was placed in the main "exceptions/exceptions.hh" header,
which virtually every single source file in Scylla includes.
But unfortunately, it brings in some heavy header files and templates,
leading to a lot of wasted build time - ClangBuildAnalyzer measured that
we include exceptions.hh in 323 source files, taking almost two seconds
each on average.
In this patch, we split the coordinator_result feature into a separate
header file, "exceptions/coordinator_result.hh", and only the few places
which need it include the header file. Unfortunately, some of these
few places are themselves headers, so the new header file ends up being
included in 100 source files - but 100 is still much less than 323 and
perhaps we can reduce this number further later.
After this patch, the total Scylla object-file size is reduced by 6.5%
(the object size is a proxy for build time, which I didn't directly
measure). ClangBuildAnalyzer reports that now each of the 323 includes
of exceptions.hh only takes 80ms, coordinator_result.hh is only included
100 times, and virtually all the cost to include it comes from Boost's
result.hh (400ms per inclusion).
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20220228204323.1427012-1-nyh@scylladb.com>
cql3 statements: Change dependency test API to better express its purpose
Cql statements used to have two API functions, depends_on_keyspace and
depends_on_column_family. The latter took as a parameter only a table
name, which makes no sense. There could be multiple tables with the same
name, each in a different keyspace, and it doesn't make sense to
generalize the test - i.e. to ask "Does a statement depend on any table
named XXX?"
In this change we unify the two calls into one - depends_on - which takes a
keyspace name and optionally also a table name. That way, every logical
dependency test that makes sense is supported by a single API call.
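A hedged sketch of the resulting call shape (simplified, not the exact
Scylla signature):

#include <optional>
#include <string_view>

struct cql_statement {
    virtual ~cql_statement() = default;

    // Old API (sketch): two separate tests; the second took only a table
    // name, with no keyspace to scope it.
    //   virtual bool depends_on_keyspace(std::string_view ks_name) const;
    //   virtual bool depends_on_column_family(std::string_view cf_name) const;

    // New API (sketch): a single test, always scoped to a keyspace and
    // optionally narrowed to a table within it.
    virtual bool depends_on(std::string_view ks_name,
                            std::optional<std::string_view> cf_name) const = 0;
};

// Usage (sketch):
//   stmt.depends_on("ks", std::nullopt);  // depends on anything in keyspace "ks"?
//   stmt.depends_on("ks", "t");           // depends on table "ks.t"?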
Modifies the modification_statement code so that it converts a failed
`result<>` into a `result_message::exception` without involving the C++
exception runtime.
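Roughly the idea, as a heavily simplified sketch (the stand-in types below
are not the real Scylla ones):

#include <exception>
#include <memory>
#include <variant>

// Stand-ins for the real types.
struct result_message { virtual ~result_message() = default; };
struct exception_message : result_message {   // plays the role of result_message::exception
    std::exception_ptr ex;
    explicit exception_message(std::exception_ptr e) : ex(std::move(e)) {}
};
struct rows_message : result_message {};

// Stand-in for result<>: either a value or a stored error.
template <typename T>
using result = std::variant<T, std::exception_ptr>;

// A failed result<> is wrapped directly into an exception-carrying result
// message instead of being rethrown, so the C++ exception runtime (throw,
// unwind, catch) is never engaged on this path.
std::unique_ptr<result_message> to_message(result<std::unique_ptr<rows_message>> r) {
    if (auto* err = std::get_if<std::exception_ptr>(&r)) {
        return std::make_unique<exception_message>(std::move(*err));
    }
    return std::move(std::get<std::unique_ptr<rows_message>>(r));
}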
Instead of lengthy blurbs, switch to single-line, machine-readable
standardized (https://spdx.dev) license identifiers. The Linux kernel
switched long ago, so there is strong precedent.
Three cases are handled: AGPL-only, Apache-only, and dual licensed.
For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0),
reasoning that our changes are extensive enough to apply our license.
The changes were applied mechanically with a script, except for
licenses/README.md.
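For a dual-licensed source file this boils down to a single header comment,
e.g.:

// SPDX-License-Identifier: AGPL-3.0-or-later and Apache-2.0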
Closes #9937
This completes the batch_ and modification_statement rework.
Also touch the private batch_statement::read_command while at it.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
There are some internal methods that take a proxy argument. Replace
most of them with query_processor; the next patch will fix the rest --
those that interact with the batch statement.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The batch_, modification_ and select_ statements get the proxy from
the query processor just to push it through the execution stage. Simplify
that by pushing the query processor itself.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This is mostly a sed script that replaces the methods' first argument,
plus fixes for the resulting compiler errors.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Stop using database (and including database.hh) for schema related
purposes and use data_dictionary instead.
data_dictionary::database::real_database() is called from several
places, for these reasons:
- calling yet-to-be-converted code
- callers with a legitimate need to access data (e.g. system_keyspace)
but with the ::database accessor removed from query_processor.
We'll need to find another way to supply system_keyspace with
data access.
- to gain access to the wasm engine for testing whether user-defined
functions compile. We'll have to find another way to
do this as well.
The change is a straightforward replacement. One case in
modification_statement had to change a capture, but everything else
was just a search-and-replace.
Some files that lost "database.hh" gained "mutation.hh", which they
previously had access to through "database.hh".
The class is repurposed to be more generic and also to be able
to hold additional metadata related to function calls within
a CQL statement. Rename all methods appropriately.
Visitor functions in AST nodes (`collect_marker_specification`)
are also renamed to a more generic `fill_prepare_context`.
The name `prepare_context` designates that this metadata
structure is a byproduct of the `stmt::raw::prepare()` call and
is needed only for the "prepare" step of query execution.
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
storage_proxy.hh is huge and includes many headers itself, so
remove its inclusions from headers and re-add smaller headers
where needed (and storage_proxy.hh itself in source files that
need it).
Ref #1.
Currently the statement's execute() method accepts the storage
proxy as the first argument. This is enough for all of them
but the schema-altering ones, because the latter need to call
the migration manager's announce.
To provide the migration manager to those who need it, we need
some higher-level service than the proxy. The
query processor seems to be a good candidate for it.
That said -- all the .execute()s now accept the query
processor instead of the proxy and get the proxy itself from
the query processor.
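In sketch form (signatures simplified, return types and most parameters
omitted; the accessor names are illustrative):

// Stand-ins for the real services.
struct storage_proxy {};
struct migration_manager {};

struct query_processor {
    storage_proxy& get_proxy();                   // data statements still reach the proxy
    migration_manager& get_migration_manager();   // schema-altering statements need this
};

struct cql_statement {
    virtual ~cql_statement() = default;
    // Before (sketch): virtual ... execute(storage_proxy& proxy, ...);
    // After (sketch): the higher-level service is passed, and each statement
    // pulls what it needs (the proxy, the migration manager) from it.
    virtual void execute(query_processor& qp /*, query_state, options, ... */) = 0;
};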
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This is just a plain move of the code from `modification_statement`
to `cas_request` without changes in the logic, which will further
help to refactor `build_cas_result_set` behavior to include a row
for each LWT statement and order rows in the order of statements
in a batch.
Tests: unit(dev, debug)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
This field will replace the max size which is currently passed once per
established rpc connection via the CLIENT_ID verb and stored as an
auxiliary value on the client_info. For now it is unused, but we update
all sites creating a read command to pass the correct value to it. In the
next patch we will phase out the old max size and use this field to pass
the max size on each verb instead.
This patch adds a new `LWT` flag to `cql3::prepared_metadata`.
That allows clients to clearly distinguish between lwt and
non-lwt statements without the need to execute some custom parsing
logic (e.g. parsing the prepared query with regular expressions),
which is obviously quite fragile.
The feature is meant to be further utilized by client drivers
to use primary replicas consistently when dealing with conditional
statements.
Whether to use the lwt optimization flag or not is handled by a negotiation
procedure between the scylla server and the client library via SUPPORTED/STARTUP
messages (`LWT_ADD_METADATA_MARK` extension).
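A rough sketch of the mechanics (the flag bit and struct layout below are
illustrative; the real flag lives in cql3::prepared_metadata):

#include <cstdint>

// Illustrative flag bit; the actual value is defined by the protocol extension.
enum : uint32_t { lwt_flag = 1u << 31 };

struct prepared_metadata {
    uint32_t flags = 0;
};

struct statement {
    bool has_conditions = false;   // true for IF ... (LWT) statements
};

// When the client negotiated LWT_ADD_METADATA_MARK via SUPPORTED/STARTUP,
// conditional statements get the LWT bit set in their prepared metadata, so
// drivers can pick a consistent primary replica without re-parsing the query.
void mark_lwt(prepared_metadata& md, const statement& st, bool extension_negotiated) {
    if (extension_negotiated && st.has_conditions) {
        md.flags |= lwt_flag;
    }
}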
Tests: unit(dev, debug), manual testing with modified scylla/gocql driver
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Both `cql3::column_condition` and `cql3::column_condition::raw`
classes are marked as `final`: it's safe to use lw_shared_ptr
instead of generic `seastar::shared_ptr`.
Tests: unit(dev, debug)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Message-Id: <20200428202249.82785-1-pa.solodovnikov@scylladb.com>
* Pass raw::select_statement::parameters as lw_shared_ptr
* Some more const cleanups here and there
* lists,maps,sets::equals now accept const-ref to *_type_impl
instead of shared_ptr
* Remove unused `get_column_for_condition` from modification_statement.hh
* More methods now accept const-refs instead of shared_ptr
Every call site where a shared_ptr was required as an argument
has been inspected to be sure that no dangling references are
possible.
Tests: unit(dev, debug)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Message-Id: <20200220153204.279940-1-pa.solodovnikov@scylladb.com>
`parsed_statement::get_bound_variables` is assumed to always
return a nonnull pointer to a `variable_specifications` instance.
In this case using a pointer is superfluous and can be safely
replaced by a plain reference.
Also add a default ctor and a utility method `set_bound_variables`
to the `variable_specifications` class to actually reset the
contents of the class instance.
Tests: unit(dev, debug)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Message-Id: <20200120195839.164296-1-pa.solodovnikov@scylladb.com>
Instances of `variable_specifications` are passed around as
shared_ptr's, which are redundant in this case since the class
is marked as `final`. Use `lw_shared_ptr` instead since we know
for sure it's not a polymorphic pointer.
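The gist of the choice, in a small sketch (the class body is elided;
lw_shared_ptr is seastar's cheaper, non-polymorphic shared pointer):

#include <seastar/core/shared_ptr.hh>

// `final` guarantees the object is never held through a base-class pointer,
// so the lighter, non-polymorphic lw_shared_ptr is sufficient.
class variable_specifications final {
    // ... specifications of the statement's bound variables ...
};

seastar::lw_shared_ptr<variable_specifications> make_specs() {
    // make_lw_shared allocates the object and its reference count together.
    return seastar::make_lw_shared<variable_specifications>();
}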
Tests: unit(debug)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Message-Id: <20191225232853.45395-1-pa.solodovnikov@scylladb.com>
cql_statement is a class representing a prepared statement in Scylla.
It is used concurrently during execution, so it is important that its
state is not changed by execution.
Add a const qualifier to the execution methods family, throughout the
cql hierarchy.
Mark the few places which do mutate prepared statement state during
execution as mutable. While these do not affect production today,
as code ages, they may become a source of latent bugs and should be
moved out of the prepared state or evaluated at prepare eventually
(a sketch of the pattern follows the list):
cf_property_defs::_compaction_strategy_class
list_permissions_statement::_resource
permission_altering_statement::_resource
property_definitions::_properties
select_statement::_opts
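In sketch form, the pattern being enforced (names are simplified stand-ins,
not the actual Scylla signatures):

// A prepared statement may be executed by many requests concurrently, so
// execution must not modify it: the execute() family becomes const.
struct cql_statement {
    virtual ~cql_statement() = default;
    virtual void execute(/* query_processor&, query_state&, options, ... */) const = 0;
};

struct select_statement : cql_statement {
    // A leftover piece of state that is still written during execution has
    // to be marked mutable (and should eventually be moved out of the
    // prepared statement or computed at prepare time).
    mutable int _cached_opts = 0;   // illustrative stand-in for e.g. _opts

    void execute(/* ... */) const override {
        _cached_opts = 1;           // allowed only because the member is mutable
    }
};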
Merged patch series from Juliusz Stasiewicz:
Welcome to my first PR to Scylla!
The task was intended as a warm-up ("noob") exercise; its description is
here: #4182. Sorry, I also couldn't help it and did some scouting: edited
descriptions of some metrics and shortened a few annoyingly long LoC.
* 'cql-trivial-cleanup' of ssh://github.com/scylladb/scylla-dev:
cql: rename modification_statement::_sets_a_collection to _selects_a_collection
cql: rename _column_conditions to _regular_conditions
cql: remove unnecessary optional around prefetch_data
Since it contains a precise set of columns, it's more
accurate to call it a set, not a mask. Besides, the name
column_mask is already used for column options on storage
level.
This is merely to avoid confusion: we use the _sets prefix to indicate that
there are operations over static/regular columns (_sets_static_columns,
_sets_regular_columns), but _sets_a_collection is set for both operations
and conditions. So let's rename it to _selects_a_collection and add some
comments.
It's weird that modification_statement has _static_conditions for
conditions on static columns and _column_conditions for conditions on
regular columns, as if conditions on static columns are not column
conditions. Let's rename _column_conditions to _regular_conditions to
avoid confusion.
This patch adds "type" label to the following CQL metrics:
inserts
updates
deletes
batches
statements_in_batches
The label is set to "cas" for conditional statements and "non-cas" for
unconditional statements.
Note, for a batch to be accounted as CAS, it is enough to have just one
conditional statement. In this case all statements within the batch are
accounted as CAS as well.
Even if no rows match the clustering key restrictions of a conditional
statement with static column conditions, we still must include the
static column value in the CAS failure result set. For example,
the following conditional DELETE statement
create table t(k int, c int, s int static, v int, primary key(k, c));
insert into t(k, s) values(1, 1);
delete v from t where k=1 and c=1 if v=1 and s=1;
must return
[applied=False, v=null, s=1]
not just
[applied=False, v=null, s=null]
To fix that, set partition_slice::option::always_return_static_content
for querying rows used for checking conditions so that we have the
static row in update_parameters::prefetch_data even if no regular row
matches clustering column restrictions. Plus modify cas_request::
applies_to() so that it sets is_in_cas_result_set flag for the static
row in case there are static column conditions, but the result set
happens to be empty.
As pointed out by Tomek, there's another reason to set partition_slice::
option::always_return_static_content apart from building a correct
result set on CAS failure. There could be a batch with two statements,
one with clustering key restrictions which select no row, and another
statement with only static column conditions. If we didn't enable this
flag, we wouldn't get a static row even if it exists, and static column
conditions would evaluate as if the static row didn't exist, for
example, the following batch
create table t(k int, c int, s int static, primary key(k, c));
insert into t(k, s) values(1, 1);
begin batch
insert into t(k, c) values(1, 1) if not exists
update t set s = 2 where k = 1 if s = 1
apply batch;
would fail although it clearly must succeed.
In case a CQL statement has only static column conditions, we must
ignore clustering key restrictions.
Example:
create table t(p int, c int, s int static, v int, primary key(p, c));
insert into t(p, s) values(1, 1);
update t set v=1 where p=1 and c=1 if s=1;
This conditional statement must successfully insert row (p=1, c=1, v=1)
into the table even though there's no regular row with p=1 and c=1 in
the table before it's executed, because the statement condition only
applies to the static column s, which exists and matches.
If a modification statement doesn't have a clustering column restriction
while the table has static columns, then EXISTS condition just needs to
check if there's a static row in the partition, i.e. it doesn't need to
select any regular rows. Let's treat such an EXISTS condition like a static
column condition so that we can ignore its clustering key range while
checking CAS conditions.
Currently, we set _sets_regular_columns/_sets_static_columns flags when
adding regular/static conditions to modification_statement. We use them
in the applies_only_to_static_columns() function, which returns true iff
_sets_static_columns is set and _sets_regular_columns is clear. We
assume that if this function returns true then the statement only deals
with static columns and so must not have clustering key restrictions.
Usually, that's true, but there's one exception: a DELETE FROM ...
statement that deletes whole rows. Technically, this statement doesn't
have any column operations, i.e. the _sets_regular_columns flag is clear.
So if such a statement happens to have a static condition, we will
assume that it only applies to static columns and mistakenly raise an
error.
Example:
create table t(k int, c int, s int static, v int, primary key(k, c));
delete from t where k=1 and c=1 if s=1;
To fix this, let's not set the above-mentioned flags when adding
conditions and instead check whether the _column_conditions array is empty in
applies_only_to_static_columns().
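Roughly, following the description above (member names simplified; not the
exact production logic):

#include <vector>

struct column_condition {};

struct modification_statement_sketch {
    bool _sets_static_columns = false;    // now set by column *operations* only
    bool _sets_regular_columns = false;   // ditto
    std::vector<column_condition> _column_conditions;   // conditions on regular columns

    // Before (sketch): returned _sets_static_columns && !_sets_regular_columns,
    // with both flags also set when conditions were added, so a whole-row
    // DELETE with a static condition was misclassified.
    // After (sketch): regular-column conditions are detected by inspecting
    // the array itself.
    bool applies_only_to_static_columns() const {
        return _sets_static_columns && !_sets_regular_columns
            && _column_conditions.empty();
    }
};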
Support single-statement conditional updates as well as batches.
This patch almost fully rewrites column_condition.cc, implementing
is_satisfied_by().
Most of the remaining complications in the column_condition implementation
come from the need to properly handle frozen and multi-cell
collections in predicates - up until now it was not possible
to compare entire collection values with each other. This is further
complicated since multi-cell lists and sets are returned as maps.
We can no longer assume that the columns fetched by the prefetch operation
are non-frozen collections. An IF EXISTS/IF NOT EXISTS condition
fetches all columns; besides, a column may be needed to check another
condition.
When fetching the old row for LWT or to apply updates to list columns,
we now calculate precisely the list of columns to fetch.
The primary key columns are also included in the CAS batch result set,
and are thus also prefetched (the user needs them to figure out which
statements failed to apply).
The patch is cross-checked for compatibility with cassandra-3.11.4-1545-g86812fa502
but does deviate from the origin in handling of conditions on static
row cells. This is addressed in a future series.
cql_statement_opt_metadata is an intermediate node
in the cql (prepared) statement hierarchy, parenting
modification_statement and batch_statement. If there
is an IF condition in such statements, they return a result set,
and thus have result set metadata.
The metadata itself is filled in a subsequent patch.
Add checks for conditional modification statement limitations:
- WHERE clustering_key IN (list) IF condition is not supported
since a condition is evaluated for a single row/cell, so
allowing multiple rows to match the WHERE clause would create
ambiguity,
- the same is true for conditional range deletions.
- ensure all clustering restrictions are eq for conditional delete
We must not allow statements like
create table t(p int, c int, v int, primary key (p, c));
delete from t where p=1 and c>0 if v=1;
because there may be more than one row in a partition satisfying the
WHERE clause, in which case it's unclear which of them should satisfy
the IF condition: all or just one.
Raising an error on such a statement is consistent with Cassandra's
behavior.
Rather than passing a pointer to a cql_stats member corresponding to
the statement type, pass a reference to a cql_stats object and use
statement_type, which is already stored in modification_statement, for
determining which counter to increment. This will allow us to account
for conditional statements, which will have a separate set of counters,
right in modification_statement::execute() - all we'll need to do is
add the new counters and bump them in case execute_with_condition is
called.
While we are at it, remove extra inclusions from statement_type.hh so as
not to introduce any extra dependencies for cql_stats.hh users.
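For illustration, a hedged sketch of the counting scheme (the layout below
is illustrative, not the actual cql_stats structure):

#include <cstdint>

enum class statement_type { insert, update, del };

// Illustrative layout: one counter per statement type and per cas/non-cas flavor.
struct cql_stats {
    uint64_t inserts[2] = {0, 0};
    uint64_t updates[2] = {0, 0};
    uint64_t deletes[2] = {0, 0};
};

// With a reference to the whole stats object plus the statement type, the
// statement itself picks the counter (and, later, the cas/non-cas flavor),
// instead of the caller passing a pointer to one specific member.
void account(cql_stats& stats, statement_type type, bool is_conditional) {
    const int cas = is_conditional ? 1 : 0;
    switch (type) {
    case statement_type::insert: ++stats.inserts[cas]; break;
    case statement_type::update: ++stats.updates[cas]; break;
    case statement_type::del:    ++stats.deletes[cas]; break;
    }
}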
Message-Id: <20191022092258.GC21588@esperanza>
They are used more extensively with the introduction of lightweight
transactions, and pre-computing makes it easier to reason about the
complexity of the scenarios where they are involved.
Pre-compute the column mask of columns to prefetch when preparing
a modification statement and use it to build the partition_slice
object for the read command. Fetch only the required columns.
Lightweight transactions build on this by adding the
columns used in conditions and in the cas result set to the column
mask of columns to read. Batch statements unite all column
masks to build a single relation for all rows modified by
conditional statements of a batch.
Refactor get_mutations() so that the read command and
apply_updates() functions can be used in lightweight transactions.
Move read_command creation to its own method, as well as apply_updates().
Rewrite get_mutations() using the new API.
Avoid unnecessary shared pointers.
Get rid of an unnecessary optional around
update_parameters::prefetch_data.
update_parameters won't own prefetch_data in the future anyway,
since prefetch_data can be shared among multiple modification
statements of a batch, each statement having its own options
and hence its own update_parameters instance.
client_state holds state to generate monotonically increasing unique
timestamps. Queries with a SERIAL consistency level need it to generate
a paxos round.
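The usual shape of such a generator, as a sketch (not necessarily the exact
Scylla code):

#include <algorithm>
#include <chrono>
#include <cstdint>

// Per-client monotonic timestamp source: wall-clock microseconds, bumped by
// at least one on every call, so two queries from the same client never get
// equal or decreasing timestamps.
class timestamp_source {
    int64_t _last = 0;
public:
    int64_t next() {
        using namespace std::chrono;
        int64_t now = duration_cast<microseconds>(
            system_clock::now().time_since_epoch()).count();
        _last = std::max(now, _last + 1);
        return _last;
    }
};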
The current cql transport code acquires a permit before processing a query and
releases it when the query gets a reply, but some queries leave work behind.
If the work is allowed to accumulate without any limit a server may
eventually run out of memory. To prevent that, the permit system should
account for the background work as well. This patch is a first step in
this direction. It passes a permit down to the storage proxy where it will
later be held by background work.