According to Cassandra's documentation, if CONSISTENCY is set to SERIAL
or LOCAL SERIAL, all write requests must fail. However, batched writes
bypass this check. Fix this.
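For illustration, a hedged cqlsh sketch of the expected behavior
(table t is hypothetical):
CONSISTENCY SERIAL;
insert into t(p, v) values(1, 1);  -- already rejected
begin batch
update t set v=1 where p=1
apply batch;  -- must be rejected too once batches go through the same check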
This patch resurrects Cassandra's code validating a consistency level
for CAS requests. Basically, it makes CAS requests use a special
function instead of validate_for_write to make error messages more
coherent.
Note, we don't need to resurrect requireNetworkTopologyStrategy as
EACH_QUORUM should work just fine for both CAS and non-CAS writes.
Looks like it is just an artefact of a rebase in the Cassandra
repository.
* 'cql-trivial-cleanup' of ssh://github.com/scylladb/scylla-dev:
cql: rename modification_statement::_sets_a_collection to _selects_a_collection
cql: rename _column_conditions to _regular_conditions
cql: remove unnecessary optional around prefetch_data
Since it contains a precise set of columns, it's more
accurate to call it a set, not a mask. Besides, the name
column_mask is already used for column options at the storage
level.
This is merely to avoid confusion: we use the _sets prefix to indicate that
there are operations over static/regular columns (_sets_static_columns,
_sets_regular_columns), but _sets_a_collection is set for both operations
and conditions. So let's rename it to _selects_a_collection and add some
comments.
It's weird that modification_statement has _static_conditions for
conditions on static columns and _column_conditions for conditions on
regular columns, as if conditions on static columns are not column
conditions. Let's rename _column_conditions to _regular_conditions to
avoid confusion.
This patch adds "type" label to the following CQL metrics:
inserts
updates
deletes
batches
statements_in_batches
The label is set to "cas" for conditional statements and "non-cas" for
unconditional statements.
Note, for a batch to be accounted as CAS, it is enough to have just one
conditional statement. In this case all statements within the batch are
accounted as CAS as well.
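For illustration, a hedged CQL sketch of the classification (schema is
illustrative):
create table t(p int primary key, v int);
insert into t(p, v) values(1, 1);                 -- counted as non-cas
insert into t(p, v) values(1, 1) if not exists;   -- counted as cas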
Apart from conditional statements, there may be other reading statements
in a batch, e.g. statements manipulating lists. We must not include rows
fetched for them in the CAS result set. For instance, the following CAS batch:
create table t(p int, c int, i int, l list<int>, primary key(p, c));
insert into t(p, c, i) values(1, 1, 1);
insert into t(p, c, i, l) values(1, 2, 1, [1, 2, 3]);
begin batch
update t set i=3 where p=1 and c=1 if i=2
update t set l=l-[2] where p=1 and c=2
apply batch;
is supposed to return
 [applied] | p | c | i
-----------+---+---+---
     False | 1 | 1 | 1
not
 [applied] | p | c | i
-----------+---+---+---
     False | 1 | 1 | 1
     False | 1 | 2 | 1
To filter out such collateral rows from the result set, let's mark rows
checked by conditional statements with a special flag.
If a CQL statement only updates static columns, i.e. has no clustering
key restrictions, we still fetch a regular row so that we can check it
against the EXISTS condition. In this case we must be especially careful: we
can't simply pass the row to modification_statement::applies_to, because
it may turn out that the row has no static columns set, i.e. there is in
fact no static row in the partition. So we filter out such rows without
static columns right in cas_request::applies_to before passing them
further to modification_statement::applies_to.
Example:
create table t(p int, c int, s int static, primary key(p, c));
insert into t(p, c) values(1, 1);
insert into t(p, s) values(1, 1) if not exists;
The conditional statement must succeed in this case.
If a modification statement doesn't have a clustering column restriction
while the table has static columns, then the EXISTS condition just needs to
check whether there's a static row in the partition, i.e. it doesn't need to
select any regular rows. Let's treat such an EXISTS condition like a static
column condition so that we can ignore its clustering key range while
checking CAS conditions.
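For illustration, a hedged CQL sketch of such a statement (schema is
illustrative):
create table t(p int, c int, s int static, primary key(p, c));
update t set s=1 where p=1 if exists;
Here IF EXISTS only needs to know whether partition 1 has a static row;
no regular rows have to be selected to evaluate it.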
This will allow us to add helper methods and store extra info in each
row. For example, we can add a method for checking if a row has static
columns. Also, to build the CAS result set, we need to differentiate rows
fetched to check conditions from those fetched for reading operations.
Using a struct as the row container will allow us to store this information
in each prefetched row.
Currently, we set the _sets_regular_columns/_sets_static_columns flags when
adding regular/static conditions to modification_statement. We use them
in the applies_only_to_static_columns() function, which returns true iff
_sets_static_columns is set and _sets_regular_columns is clear. We
assume that if this function returns true then the statement only deals
with static columns and so must not have clustering key restrictions.
Usually, that's true, but there's one exception: a DELETE FROM ...
statement that deletes whole rows. Technically, this statement doesn't
have any column operations, i.e. the _sets_regular_columns flag is clear.
So if such a statement happens to have a static condition, we will
assume that it only applies to static columns and mistakenly raise an
error.
Example:
create table t(k int, c int, s int static, v int, primary key(k, c));
delete from t where k=1 and c=1 if s=1;
To fix this, let's not set the above-mentioned flags when adding
conditions and instead check whether the _column_conditions array is empty
in applies_only_to_static_columns().
modification_statement::process_where_clause() assumes that both
operations and conditions have been added to the statement when it's
called: it uses this information to raise an error in case the statement
restrictions are incompatible with operations or conditions. Currently,
operations are set before this function is called, but conditions are not.
This results in an "Invalid restrictions on clustering columns since
the {} statement modifies only static columns" error while trying to
execute the following statements:
create table t(k int, c int, s int static, v int, primary key(k, c));
delete s from t where k=1 and c=1 if v=1;
update t set s=1 where k=1 and c=1 if v=1;
Fix this by always initializing conditions before processing the WHERE
clause.
Support single-statement conditional updates as well as batches.
This patch almost fully rewrites column_condition.cc, implementing
is_satisfied_by().
Most of the remaining complications in the column_condition implementation
come from the need to properly handle frozen and multi-cell
collections in predicates - up until now it was not possible
to compare entire collection values with each other. This is further
complicated by the fact that multi-cell lists and sets are returned as maps.
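For illustration, a hedged CQL sketch of conditions comparing whole
collection values (schema is illustrative):
create table t(p int primary key, l list<int>, fl frozen<list<int>>);
update t set l=[1, 2] where p=1 if l=[1, 2, 3];
update t set fl=[4] where p=1 if fl=[1, 2, 3];
Both the multi-cell and the frozen list are compared as entire values here.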
We can no longer assume that the columns fetched by the prefetch operation
are non-frozen collections. An IF EXISTS/IF NOT EXISTS condition
fetches all columns; besides, a column may be needed to check another
condition.
When fetching the old row for LWT or to apply updates to list columns,
we now calculate precisely the list of columns to fetch.
The primary key columns are also included in the CAS batch result set,
and are thus also prefetched (the user needs them to figure out which
statements failed to apply).
The patch is cross-checked for compatibility with cassandra-3.11.4-1545-g86812fa502
but does deviate from the origin in its handling of conditions on static
row cells. This is addressed in a future series.
cql_statement_opt_metadata is an interim node
in the CQL (prepared) statement hierarchy, parenting
modification_statement and batch_statement. If there
is an IF condition in such statements, they return a result set,
and thus have result set metadata.
The metadata itself is filled in a subsequent patch.
Add checks for conditional modification statement limitations:
- WHERE clustering_key IN (list) IF condition is not supported,
since a condition is evaluated for a single row/cell, so
allowing multiple rows to match the WHERE clause would create
ambiguity (see the sketch below);
- the same is true for conditional range deletions;
- ensure all clustering restrictions are eq for a conditional delete.
We must not allow statements like
create table t(p int, c int, v int, primary key (p, c));
delete from t where p=1 and c>0 if v=1;
because there may be more than one row in a partition satisfying the
WHERE clause, in which case it's unclear which of them should satisfy
the IF condition: all or just one.
Raising an error on such a statement is consistent with Cassandra's
behavior.
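Similarly, a hedged sketch of an IN restriction combined with IF that
must be rejected (using the same table as above):
delete from t where p=1 and c in (1, 2) if v=1;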
Rather than passing a pointer to a cql_stats member corresponding to
the statement type, pass a reference to a cql_stats object and use
statement_type, which is already stored in modification_statement, for
determining which counter to increment. This will allow us to account for
conditional statements, which will have a separate set of counters,
right in modification_statement::execute() - all we'll need to do is
add the new counters and bump them when execute_with_condition is
called.
While we are at it, remove extra inclusions from statement_type.hh so as
not to introduce any extra dependencies for cql_stats.hh users.
Message-Id: <20191022092258.GC21588@esperanza>
Merge a patch series from Piotr Jastrzębski (haaawk):
This PR introduces CDC in its minimal version.
It is now possible to create a table with CDC enabled or to enable/disable
CDC on an existing table. There is management of the CDC log and description
related to enabling/disabling CDC for a table.
For now only the primary key of the changed data is logged.
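For illustration, a hedged sketch of the CQL involved (keyspace, table
and schema are illustrative):
create table ks.t(pk int, ck int, v int, primary key(pk, ck))
  with cdc = {'enabled': true};
alter table ks.t with cdc = {'enabled': false};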
To be able to co-locate CDC streams with related base table partitions, we
needed to propagate information about the number of shards per node.
This was done through gossip.
There is an assumption that all the nodes use the same value for
sharding_ignore_msb_bits. If that does not hold, we would have to gossip
sharding_ignore_msb_bits around together with the number of shards.
Fixes #4986.
Tests: unit(dev, release, debug)
They are used more extensively with the introduction of lightweight
transactions, and pre-computing makes it easier to reason about the
complexity of the scenarios where they are involved.
Pre-compute the column mask of columns to prefetch when preparing
a modification statement and use it to build the partition_slice
object for the read command. Fetch only the required columns.
Lightweight transactions build on this by adding the
columns used in conditions and in the CAS result set to the column
mask of columns to read. Batch statements unite all column
masks to build a single relation for all rows modified by
conditional statements of a batch.
Refactor get_mutations() so that the read command and
apply_updates() functions can be used in lightweight transactions.
Move read_command creation to its own method, as well as apply_updates().
Rewrite get_mutations() using the new API.
Avoid unnecessary shared pointers.
Get rid of an unnecessary optional around
update_parameters::prefetch_data.
update_parameters won't own prefetch_data in the future anyway,
since prefetch_data can be shared among multiple modification
statements of a batch, each statement having its own options
and hence its own update_parameters instance.
Move prefetch_data_builder class from modification_statement.cc
to update_parameters.cc.
We're going to use the same builder to build a result set
for condition evaluation and to apply updates of batch statements, so it
needs to be shared.
No other changes.
client_state holds state to generate monotonically increasing unique
timestamps. Queries with a SERIAL consistency level need it to generate
a Paxos round.
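For illustration, a hedged sketch of such a query in cqlsh (table t is
hypothetical):
CONSISTENCY SERIAL;
select * from t where p=1;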
The current CQL transport code acquires a permit before processing a query and
releases it when the query gets a reply, but some queries leave work behind.
If this work is allowed to accumulate without any limit, the server may
eventually run out of memory. To prevent that, the permit system should
account for the background work as well. This patch is a first step in
that direction: it passes a permit down to the storage proxy, where it will
later be held by background work.
Replace stdx::optional and stdx::string_view with the C++ std
counterparts.
Some instances of boost::variant were also replaced with std::variant,
namely those that called seastar::visit.
Scylla now requires GCC 8 to compile.
Signed-off-by: Duarte Nunes <duarte@scylladb.com>
Message-Id: <20190108111141.5369-1-duarte@scylladb.com>
This header, which is easily replaced with a forward declaration,
introduces a dependency on database.hh everywhere. Remove it and scatter
includes of database.hh in source files that really need it.
It has two unrelated users: cql for validation, and storage_proxy for
complicated calculations. Split the simple stuff into a new header to reduce
dependencies.
* seastar d59fcef...b924495 (2):
> build: Fix protobuf generation rules
> Merge "Restructure files" from Jesse
Includes fixup patch from Jesse:
"
Update Seastar `#include`s to reflect restructure
All Seastar header files are now prefixed with "seastar" and the
configure script reflects the new locations of files.
Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com>
Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com>
"
sprint() recently became more strict, throwing on sprint("%s", 5). Replace
with the more modern format().
Mechanically converted with https://github.com/avikivity/unsprint.
Require a timeout parameter for storage_proxy::mutate_begin() and
all its callers (all the way to thrift and cql modification_statement
and batch_statement).
This should fix spurious debug-mode test failures, where overcommit
and general debug slowness result in the default timeouts being
exceeded. Since the tests use infinite timeouts, they should not
time out any more.
Tests: unit (release), with an extra patch that aborts
when a non-infinite timeout is detected.
Message-Id: <20180707204424.17116-1-avi@scylladb.com>
query::result_view already operates on views of a serialised
query::result. However, until now the value of a cell was always
linearised and copied. This patch makes use of ser::buffer_view to avoid
that.
Determine which timeout we need to apply at prepare time. We
don't know the numerical value (since it depends on whoever is
executing the query, not just the statement type), but we know
which member of timeout_config we need, so determine and remember
that.
This commit adds the implementation of the INSERT JSON clause,
which accepts a JSON object as a parameter and inserts the appropriate
values into the appropriate columns, as defined in the given JSON.
Example:
INSERT INTO testme JSON '{
"id" : 77,
"name" : "Jones",
"ranking" : 8.5
}'
References #2058
The storage_proxy represents the entire cluster, so there's never a need
to access it on a remote shard; the local shard instance will contact
remote shards or remote nodes as needed.
Simplify the API by passing storage_proxy references instead of
seastar::sharded<storage_proxy> references. query_processor and
other callers are adjusted to call seastar::sharded::local() first.
Message-Id: <20180415142656.25370-2-avi@scylladb.com>
As yet more parameters and return values are about to be added to all
storage_proxy::query_* methods, we need a way that scales better than
changing the signatures every time. To this end we aggregate all
non-mandatory query parameters into `coordinator_query_options` and all
return values into `coordinator_query_result`.
This way new fields can be simply added to the respective structs while
the signatures of the methods themselves and their client code can
remain unchanged.
preferred_replicas are added to the parameters and last_replicas are
added to the return type. The preferred replicas will be used as a hint
for the selection of the replicas to send the read requests to. The last
replicas (returned) are the replicas actually selected for the read.
This will allow queries to consistently hit the same replicas for each
page, thus reusing readers created on these replicas.
For convenience a query() overload is provided that doesn't take or
return the preferred and last replicas.
This patch only adds the parameters and propagates them down to
query_singular() and query_partition_key_range(). The code to actually
use these preferred-replicas will be added in later patches.
The reason for separating this is to reduce noise and improve
reviewability of those functional changes later.