scylladb

Author	SHA1	Message	Date
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes were applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	d768e9fac5	cql3, related: switch to data_dictionary Stop using database (and including database.hh) for schema related purposes and use data_dictionary instead. data_dictionary::database::real_database() is called from several places, for these reasons: - calling yet-to-be-converted code - callers with a legitimate need to access data (e.g. system_keyspace) but with the ::database accessor removed from query_processor. We'll need to find another way to supply system_keyspace with data access. - to gain access to the wasm engine for testing whether user-defined functions compile. We'll have to find another way to do this as well. The change is a straightforward replacement. One case in modification_statement had to change a capture, but everything else was just a search-and-replace. Some files that lost "database.hh" gained "mutation.hh", which they previously had access to through "database.hh".	2021-12-15 13:54:23 +02:00
Avi Kivity	595cc328b1	Merge 'cql3: Remove term, replace with expression' from Jan Ciołek This PR finally removes the `term` class and replaces it with `expression`. * There was some trouble with `lwt_cache_id` in `expr::function_call`. The current code works the following way: * for each `function_call` inside a `term` that describes a pk restriction, `prepare_context::add_pk_function_call` is called. * `add_pk_function_call` takes a `::shared_ptr<cql3::functions::function_call>`, sets its `cache_id` and pushes this shared pointer onto a vector of all collected function calls * Later when some condition is met we want to clear cache ids of all those collected function calls. To do this we iterate through shared pointers collected in `prepare_context` and clear cache id for each of them. This doesn't work with `expr::function_call` because it isn't kept inside a shared pointer. To solve this I put the `lwt_cache_id` inside a shared pointer and then `prepare_context` collects these shared pointers to cache ids. I also experimented with doing this without any shared pointers, maybe we could just walk through the expression and clear the cache ids ourselves. But the problem is that expressions are copied all the time, we could clear the cache in one place, but forget about a copy. Doing it using shared pointers more closely matches the original behaviour. The experiment is on the [term2-pr3-backup-altcache](https://github.com/cvybhu/scylla/tree/term2-pr3-backup-altcache) branch * `shared_ptr<term>` being `nullptr` could mean: * It represents a cql value `null` * That there is no value, like `std::nullopt` (for example in `attributes.hh`) * That it's a mistake, it shouldn't be possible A good way to distinguish between optional and mistake is to look for `my_term->bind_and_get()`, we then know that it's not an optional value. 
* On the other hand `raw_value` cast to bool means: * `false` - null or unset * `true` - some value, maybe empty I ran a simple benchmark on my laptop to see how performance is affected: ``` build/release/test/perf/perf_simple_query --smp 1 -m 1G --operations-per-shard 1000000 --task-quota-ms 10 ``` * On master (`a21b1fbb2f`) I get: ``` 176506.60 tps ( 77.0 allocs/op, 12.0 tasks/op, 45831 insns/op) median 176506.60 tps ( 77.0 allocs/op, 12.0 tasks/op, 45831 insns/op) median absolute deviation: 0.00 maximum: 176506.60 minimum: 176506.60 ``` * On this branch I get: ``` 172225.30 tps ( 75.1 allocs/op, 12.1 tasks/op, 46106 insns/op) median 172225.30 tps ( 75.1 allocs/op, 12.1 tasks/op, 46106 insns/op) median absolute deviation: 0.00 maximum: 172225.30 minimum: 172225.30 ``` Closes #9481 * github.com:scylladb/scylla: cql3: Remove remaining mentions of term cql3: Remove term cql3: Rename prepare_term to prepare_expression cql3: Make prepare_term return an expression instead of term cql3: expr: Add size check to evaluate_set cql3: expr: Add expr::contains_bind_marker cql3: expr: Rename find_atom to find_binop cql3: expr: Add find_in_expression cql3: Remove term in operations cql3: Remove term in relations cql3: Remove term in multi_column_restrictions cql3: Remove term in term_slice, rename to bounds_slice cql3: expr: Remove term in expression cql3: expr: Add evaluate_IN_list(expression, options) cql3: Remove term in column_condition cql3: Remove term in select_statement cql3: Remove term in update_statement cql3: Use internal cql format in insert_prepared_json_statement cache types: Add map_type_impl::serialize(range of <bytes, bytes>) cql3: Remove term in cql3/attributes cql3: expr: Add constant::view() method cql3: expr: Implement fill_prepare_context(expression) cql3: expr: add expr::visit that takes a mutable expression cql3: expr: Add receiver to expr::bind_variable	2021-11-30 16:39:39 +02:00
Jan Ciolek	e458340821	cql3: Remove term term isn't used anywhere now. We can remove it and all classes that derive from it. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Jan Ciolek	dcd3199037	cql3: Rename prepare_term to prepare_expression prepare_term now takes an expression and returns a prepared expression. It should be renamed to prepare_expression. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Jan Ciolek	219f1a4359	cql3: Make prepare_term return an expression instead of term prepare_term is now the only function that uses terms. Change it so that it returns expression instead of term and remove all occurrences of expr::to_expression(prepare_term(...)) Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Jan Ciolek	3b4dc39eb8	cql3: Remove term in relations Replace uses of term with expression in cql3/*relation.hh Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Jan Ciolek	7f2ecf1aa2	cql3: Remove term in multi_column_restrictions Replace all uses of term with expression in cql3/multi_column_restrictions Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Avi Kivity	9424f6e12f	cql3: replace seastar::sprint() with fmt::format() sprint() is obsolete. Note that some calls were to helper functions that use sprint(), not to sprint() directly, so both the helpers and the callers were modified.	2021-10-27 17:02:00 +03:00
Avi Kivity	ad285c3c84	cql3: expr: hide column_specification_or_tuple column_specification_or_tuple was introduced since some terms were prepared using a single receiver (e.g. receiver = <term>) and some using multiple receivers (e.g. (r1, r2) = <term>). Some term types supported both. To hide this complexity, the term->expr conversion used a single interface for both variations (column_expression_or_tuple), but now that we got rid of the term class and there are no virtual functions any more, we can just use two separate functions for the two variants. Internally we still use column_expression_or_tuple, it can be removed later.	2021-08-26 16:17:49 +03:00
Avi Kivity	cb2560728a	cql3: relation: convert to_term() to expressions Now that the entire relation hierarchy was converted to expressions, also convert relation::to_term().	2021-08-26 15:56:44 +03:00
Avi Kivity	b6e17ed111	cql3: multi_column_relation: convert term::raw to expressions Change term::raw in multi_column_relation to expressions. Because a single raw class is used to represent multiple shapes (IN ? and IN (x, y, z)), some of the expressions are optional, corresponding to nullables before the conversion. to_term() is not converted, since it's part of the larger relation hierarchy.	2021-08-26 15:36:42 +03:00
Avi Kivity	218f4d87f8	cql3: column_condition: relax types around abstract_marker::in_raw We can only convert expressions to term::raw, not the subclass abstract_marker::in_raw, so relax the types. They will all be converted to expressions. Relaxing types isn't good, but the structure is enforced now by the grammar (and dynamically using variant casts), and in the future by a typecheck pass (which will allow us to remove the many variations of markers).	2021-08-26 14:55:17 +03:00
Avi Kivity	9a158cd7b5	cql3: eliminate multi_column_raw Now that the signatures of term::raw::prepare and multi_column_raw::prepare are identical, we can eliminate multi_column_raw, replacing it with term::raw where needed. In some cases we delete it from the inheritance chain since we reach term::raw via a different base class. Note that a dynamic_cast<> is eliminated, so we compensate for the addition of runtime checks in the previous patch by the deletion of runtime checks in this patch.	2021-08-26 14:11:42 +03:00
Pavel Solodovnikov	49ddd269ea	cql3: rename `variable_specifications` to `prepare_context` The class is repurposed to be more generic and also be able to hold additional metadata related to function calls within a CQL statement. Rename all methods appropriately. Visitor functions in AST nodes (`collect_marker_specification`) are also renamed to a more generic `fill_prepare_context`. The name `prepare_context` designates that this metadata structure is a byproduct of `stmt::raw::prepare()` call and is needed only for "prepare" step of query execution. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-07-24 14:33:33 +03:00
Avi Kivity	a55b434a2b	treewide: extend copyright statements to present day	2021-06-06 19:18:49 +03:00
Dejan Mircevski	c0c93982d0	cql3: Replace CK-bound mode with comparison_order Instead of defining this enum in multi_column_restriction::slice, put it in the expr namespace and add it to binary_operator. We will need it when we switch bounds calculation from multi_column_restriction to expr classes. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2021-03-10 21:25:43 -05:00
Calle Wilund	58489dc003	cql3::restrictions: Add SCYLLA_CLUSTERING_BOUND keyword for sstableloader Refs #8093 Refs /scylladb/scylla-tools-java#218 Adds keyword that can preface value tuples in (a, b, c) > (1, 2, 3) expressions, forcing the restriction to bypass column sort order treatment, and instead just create the raw ck bounds accordingly. This is a very limited, and simple version, but since we only need to cover this above exact syntax, this should be sufficient. v2: * Add small cql test v3: * Added comment in multi_column_restriction::slice, on what "mode" means and is for * Added small document of our internal CQL extension keywords, including this. v4: * Added a few more cases to tests to verify multi-column restrictions * Reworded docs a bit v5: * Fixed copy-paste error in comment v6: * Added negative (error) test cases v7: * Added check + reject of trying to combine SCYLLA_CLUST... slice and normal one Closes #8094	2021-03-03 07:06:45 +01:00
Dejan Mircevski	d97605f4f8	cql3: Drop operator_type from the parser Replace operator_type with the nicer-behaved oper_t in CQL parser and, consequently, in the relation hierarchy and column_condition. After this, no references to operator_type remain in live code. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-08-18 12:27:00 +02:00
Rafael Ávila de Espíndola	ad6d65dbbd	Everywhere: Explicitly instantiate make_shared seastar::make_shared has a constructor taking a T&&. There is no such constructor in std::make_shared: https://en.cppreference.com/w/cpp/memory/shared_ptr/make_shared This means that we have to move from make_shared(T(...)) to make_shared<T>(...) if we don't want to depend on the idiosyncrasies of seastar::make_shared. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-21 10:33:49 -07:00
Rafael Ávila de Espíndola	abba521199	cql3: Add a create_multi_column_relation helper This moves a few calls to make_shared to a single location. This makes it easier to drop a dependency on the differences between seastar::make_shared and std::make_shared. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-21 10:33:49 -07:00
Pavel Solodovnikov	f6e765b70f	cql3: pass `column_specification` via lw_shared_ptr `column_specification` class is marked as "final": it's safe to use non-polymorphic pointer "lw_shared_ptr" instead of a more generic "shared_ptr". tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200427084016.26068-1-pa.solodovnikov@scylladb.com>	2020-04-27 12:47:42 +03:00
Pavel Solodovnikov	8efb02146f	cql3: const cleanups and API de-pointerization * Pass raw::select_statement::parameters as lw_shared_ptr * Some more const cleanups here and there * lists,maps,sets::equals now accept const-ref to _type_impl instead of shared_ptr Remove unused `get_column_for_condition` from modification_statement.hh * More methods now accept const-refs instead of shared_ptr Every call site where a shared_ptr was required as an argument has been inspected to be sure that no dangling references are possible. Tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200220153204.279940-1-pa.solodovnikov@scylladb.com>	2020-02-20 18:14:49 +02:00
Pavel Solodovnikov	a46f235092	cql3: prefer passing schema as const ref instead of shared_ptr De-pointerize cql3 code APIs further: change some call sites to pass `schema` as const-ref instead of `shared_ptr`. Affected functions known to be expecting always non-null pointer to schema and don't store or pass the pointer somewhere else, assuming it's safe to give them just a reference. Tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200218142338.69824-1-pa.solodovnikov@scylladb.com>	2020-02-18 20:13:10 +02:00
Pavel Solodovnikov	bf95bd0916	cql3: more functions marked as const The following functions are now "const": * `term::collect_marker_specification` * `relation::to_term` * `multi_item_terminal::get_elements` * `raw_update::is_compatible_with` Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200213142445.35312-1-pa.solodovnikov@scylladb.com>	2020-02-16 11:22:30 +02:00
Pavel Solodovnikov	e1b22b6a4c	cql3: get rid of lw_shared_ptr for `variable_specifications` `parsed_statement::get_bound_variables` is assumed to always return a nonnull pointer to `variable_specifications` instance. In this case using a pointer is superfluous and can be safely replaced by a plain reference. Also add a default ctor and a utility method `set_bound_variables` to the `variable_specifications` class to actually reset the contents of the class instance. Tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200120195839.164296-1-pa.solodovnikov@scylladb.com>	2020-01-22 12:51:02 +02:00
Pavel Solodovnikov	aba9a11ff0	cql: pass variable_specifications via lw_shared_ptr Instances of `variable_specifications` are passed around as shared_ptr's, which are redundant in this case since the class is marked as `final`. Use `lw_shared_ptr` instead since we know for sure it's not a polymorphic pointer. Tests: unit(debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20191225232853.45395-1-pa.solodovnikov@scylladb.com>	2019-12-29 16:26:26 +02:00
Dejan Mircevski	21d7722594	cql3: Add LIKE relation Add a new type of relation with operator LIKE. Handle it in relation::to_restriction by introducing a new virtual method for it. The temporary implementation of this method returns null; that will be replaced in a subsequent patch. Add abstract_type::is_string() to recognize string columns and disallow LIKE operator on non-string columns. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-07-04 10:54:30 +02:00
Nadav Har'El	92d5f61ba5	cql: support single-value IN restriction wherever EQ restriction is supported There are several places where IN restrictions are not currently supported, especially in queries involving a secondary index. However, when the IN restriction has just a single value, it is nothing more than an equality restriction and can be converted into one and be supported. So this patch does exactly this. Note that Cassandra has done this conversion since August 2016, and therefore supports the special case of single-value IN even where general IN is not supported. So it's important for Cassandra compatibility that we do this conversion too. This patch also includes a test with two queries involving a secondary index that were previously disallowed because of the "IN" on the primary key or the indexed column - and are now allowed when the IN restriction has just a single value. A third query tested is not related to secondary indexes, but confirms we don't break multi-column single-value IN queries. Fixes #4455. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190428160317.23328-1-nyh@scylladb.com>	2019-04-30 12:13:06 +01:00
Avi Kivity	cb7ee5c765	cql3: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Duarte Nunes	ced4b6e4ff	cql3: Allow renaming an identifier in a relation This patch adds a utility function to rename an identifier occurring in a cql3 relation. This function will be used when renaming an identifier in a view's where clause. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Paweł Dziepak	0afbbb9d44	cql3: fix empty IN () restriction Values inside IN () restrictions may be either in a vector _in_values or a marker (_in_marker or _value). To determine which one is appropriate we check whether _in_values is empty, which is wrong because IN clause can be empty (and there is no marker in such case). This is fixed by using the presence of a marker to determine whether a vector of values or a marker should be used. Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-08-13 10:45:27 +02:00
Paweł Dziepak	f5a9fdfc61	cql3: translate MultiColumnRelation.java to C++ Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-08-04 10:59:06 +02:00

35 Commits