scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 22:13:19 +00:00

Author	SHA1	Message	Date
Dejan Mircevski	f9b00a4318	cql: Fix mixed selection with GROUP BY GROUP BY is currently supported by simple_selection, the class used when all selectors are simple. But when selectors are mixed, we use selection_with_processing, which does not yet support GROUP BY. This patch fixes that. It also adapts one testcase in filtering_test to the new behavior of simple_selector. The test currently expects the last value seen, but simple_selector now outputs the first value seen. (More details: the WHERE clause implicitly selects the columns it references, and unit tests are forced to provide expected values for these columns. The user-visible result is unchanged in the test; users never see the WHERE column values due to filtering in cql::transport, outside unit tests.) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-14 12:50:39 -04:00
Dejan Mircevski	06e3b36164	cql: Allow mixing of aggregate and simple selectors Scylla currently rejects SELECT statements with both simple and aggregate selectors, but Cassandra allows them. This patch brings parity to Scylla. Fixes #4447. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-14 10:34:02 -04:00
Dejan Mircevski	d51e4a589d	Implement grouping in selection processing Make result_set_builder obey its _group_by_cell_indices by recognizing group boundaries and resetting the selectors. Also make simple_selectors work correctly when grouping. Fixes #2206. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-08 11:05:36 -04:00
Dejan Mircevski	c3929aee3a	Propagate GROUP BY indices to result_set_builder Ensure that the indices recorded in select_statement are passed to result_set_builder when one is created for processing the cell values. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-08 10:10:10 -04:00
Piotr Sarna	b0ab4c28cf	schema: add column_definition::is_hidden_from_cql Right now the only columns hidden from CQL are view virtual columns, but in case of expanding this set, a helper function is provided.	2019-02-27 15:07:54 +01:00
Piotr Sarna	4dc0b0672c	cql3: add multi-column restrictions filtering It's now possible to pass multi-column restrictions to queries that require filtering. Fixes #3574	2019-02-19 13:24:25 +01:00
Piotr Sarna	1dadae212a	cql3: add checking for previous partition count to filtering Filtering now needs to take into account per partition limits as well, and for that it's essential to be able to compare partition keys and decide which rows should be dropped - if previous page(s) contained rows with the same partition key, these need to be taken into consideration too.	2019-02-18 11:06:43 +01:00
Piotr Sarna	b965c3778f	cql3: obey per partition limit for filtering Filtering queries now take into account the limit of rows per single partition provided by the user.	2019-02-18 10:29:34 +01:00
Piotr Sarna	87c23372fb	cql3: fix filtering with LIMIT with regard to paging Previously the limit was erroneously applied per page instead of being accumulated, which might have caused returning too many rows. As of now, LIMIT is handled properly inside restrictions filter. Fixes #4100	2019-01-17 13:25:09 +01:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Piotr Sarna	4f5ee3dfcd	cql3: add counting dropped rows in filtering pager Counter for dropped rows is added to the filtering pager. This metric can be used later to implement applying LIMIT to filtering queries properly. Dropped rows are returned on visitor::accept_partition_end.	2018-11-29 14:06:59 +01:00
Piotr Sarna	65f21cc518	cql3: check filtering restrictions only if applicable Primary key restrictions should be checked only when they need filtering - otherwise it's superfluous, since they were already applied on query level.	2018-11-28 13:58:16 +01:00
Piotr Sarna	0fc7d63842	cql3: enable filtering for CONTAINS restriction With contains::is_satisfied_by(bytes_view) implemented, it's possible to enable filtering support for CONTAINS restriction. Fixes #3573	2018-11-14 14:39:21 +01:00
Eliran Sinvani	ded3a03356	cql3: rename selection metadata manipulation functions In the past the addition of non serializable columns was being used only for post ordering of result sets. The newly added ALLOW FILTERING feature will need to use these functions for other post-processing operations, i.e. filtering. The renaming accounts for the new and existing uses for the function. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-18 17:52:04 +03:00
Piotr Sarna	5b5c9f2707	cql3: fix a 'pratition_key' typo partition_key got misspelled with 'pratition_key' typo in the original series. Message-Id: <de59fe6161df5442b19d8ba4336e2f828b7ede32.1535981852.git.sarna@scylladb.com>	2018-09-04 16:05:09 +03:00
Nadav Har'El	3f3a76aa8f	Do not allow selecting a virtual column For issue #3362, we will need to add to a materialized view also unselected base-table columns as "virtual columns". We need these columns to exist to keep view rows alive, but we don't want the user to be able to see them. In this patch we prevent SELECTing the virtual columns of the view, and also exclude the virtual columns from a "SELECT *" on a view. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-08-16 15:34:22 +03:00
Piotr Sarna	8c18aaa511	cql3: pass query options to restrictions filter Query options may contain bound values needed for checking filtering restrictions. Previously, empty query_options{} were used, which caused prepared statements to fail. Fixes #3677	2018-08-09 17:44:45 +02:00
Piotr Sarna	aadbfc6b84	cql3: throw instead of log for collection filtering Original series that introduced filtering logged a warning when collection restrictions appeared. Instead, an exception should be thrown until collection restrictions are supported for ALLOW FILTERING clauses. Message-Id: <ddaf342d4d6766fadb756f66e5afa0b99ce054f8.1531220558.git.sarna@scylladb.com>	2018-07-10 14:44:29 +03:00
Piotr Sarna	77aa97f62a	cql3: fix ALLOW FILTERING iterator In original series cell iterator for regular cells was erroneously taken by copy instead of by reference, which will result in iterating over the first value indefinitely. Also, the same iterator was not updated for collections, which is fixed too. Message-Id: <83297adf8121de4fd37257c87f250d61ea9ec80b.1530892191.git.sarna@scylladb.com>	2018-07-06 17:23:12 +01:00
Piotr Sarna	a08fba19e3	cql3: optimize filtering partition keys and static rows If any restriction on partition key or static row part fails, it will be so for every row that belongs to a partition. Hence, full check of the rest of the rows is skipped.	2018-07-05 10:50:43 +02:00
Piotr Sarna	2a0b720102	cql3: add filtering visitor In order to filter results of an 'ALLOW FILTERING' query, a visitor that can take optional filter for result_builder is provided. It defaults to nop_filter, which accepts all rows.	2018-07-05 10:50:43 +02:00
Piotr Sarna	1cf5653f89	cql3: move result_set_builder functions to header Moving function definitions to header is a preparation step before turning result_set_builder into a template.	2018-07-05 10:50:43 +02:00
Paweł Dziepak	3f1184d16d	cql3: selection: add is_trivial() cql3::result_generator supports only trivial selections.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	4704c4efab	query::result: avoid copying and linearising cell value query::result_view already operates on views of a serialised query::result. However, until now the value of a cell was always linearised and copied. This patch makes use of ser::buffer_view to avoid that.	2018-06-25 09:21:47 +01:00
Calle Wilund	6c8b5fc09d	schema_tables: Use v3 schema tables and formats Switches system/schema_* for system_schema/*, updates schema/schema builder and uses to hold/expect v3 style info (i.e. types & dropped).	2017-05-10 16:44:48 +00:00
Paweł Dziepak	fce6e0987f	cql3: selection: do not panic when seeing counters At this stage counters cells are already long_type values, so no special handling is necessary.	2017-02-02 10:35:14 +00:00
Tomasz Grabiec	bc6486b304	Use gc_clock instead of db_clock where possible Some code paths were obtaining db_clock timestamp to only convert it to gc_clock later. Avoid this. In the future we could make gc_clock cheaper cause it has low precision. Message-Id: <1482401190-2035-1-git-send-email-tgrabiec@scylladb.com>	2016-12-22 13:27:55 +02:00
Pekka Enberg	e1e8ca2788	cql3: Fix selecting same column multiple times Under the hood, the selectable::add_and_get_index() function deliberately filters out duplicate columns. This causes simple_selector::get_output_row() to return a row with all duplicate columns filtered out, which triggers an assertion because of row mismatch with metadata (which contains the duplicate columns). The fix is rather simple: just make selection::from_selectors() use selection_with_processing if the number of selectors and column definitions doesn't match -- like Apache Cassandra does. Fixes #1367 Message-Id: <1477989740-6485-1-git-send-email-penberg@scylladb.com>	2016-11-01 09:09:01 +00:00
Duarte Nunes	cb0516a76c	schema: Remove compact_column concept This is a confusing one, and can be replaced by the fact that dense schemas have a single regular column. Ref #1542 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-08-03 17:21:41 +00:00
Duarte Nunes	529c3a3ae6	column_kind: Drop compact_column A compact column is a dense schema's single regular column. The fact that it is a different column_kind has led to various bugs (#1535, derived by the schema being dense and the column being regular. Fixes #1542 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-08-03 17:21:37 +00:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	63006e5dd2	query: Serialize collection cells using CQL format We want the format of query results to be eventually defined in the IDL and be independent of the format we use in memory to represent collections. This change is a step in this direction. The change decouples format of collection cells in query results from our in-memory representation. We currently use collection_mutation_view, after the change we will use CQL binary protocol format. We use that because it requires less transformations on the coordinator side. One complication is that some list operations need to retrieve keys used in list cells, not only values. To satisfy this need, new query option was added called "collections_as_maps" which will cause lists and sets to be reinterpreted as maps matching their underlying representation. This allows the coordinator to generate mutations referencing existing items in lists.	2016-02-15 17:05:55 +01:00
Tomasz Grabiec	9d11968ad8	Rename serialization_format to cql_serialization_format	2016-02-15 16:53:56 +01:00
Tomasz Grabiec	916a91c913	query: Split send_timestamp_and_expiry into two separate options It's cleaner that way. They don't need to come together.	2016-02-15 16:53:56 +01:00
Paweł Dziepak	3287022000	cql3: do not assume that clustering key is full In case of schemas that use compact storage it is possible that trailing components of clustering keys are not set. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-10 05:46:26 +01:00
Avi Kivity	79f7431a03	db: change collection_mutation::{one,view} not to use nested classes Nested classes cannot be forward-declared, so change the naming not to use them. Follows atomic_cell{,_view}.	2015-11-13 17:13:07 +02:00
Calle Wilund	4a1a17defc	cql3::selection: Move result set building visitor to result_set_builder Allows its use (and partial override - hint hint) in more place than one.	2015-11-10 13:12:33 +01:00
Calle Wilund	23b6240dad	cql3::selection: Fix some constness correctness	2015-11-10 13:12:33 +01:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Paweł Dziepak	f6a93be655	cql3: skip compact value columns with no name Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-08-14 14:53:35 +02:00
Paweł Dziepak	a6d0ed205b	cql3: use api::missing_timestamp for missing timestamps A missing timestamp is a missing one, not the smallest one. Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-07-16 16:23:03 +02:00
Tomasz Grabiec	5ba1486ae7	db: Rename "ttl" to "expiry" when it's used as time point To avoid confusion with "ttl" the duration.	2015-05-06 17:27:22 +02:00
Avi Kivity	6290dee438	db: const correctness for abstract_type and friends Types are immutable.	2015-04-29 15:40:38 +03:00
Avi Kivity	3d38708434	cql3: pass a database& instance to most foo::raw::prepare() variants To prepare a user-defined type, we need to look up its name in the keyspace. While we get the keyspace name as an argument to prepare(), it is useless without the database instance. Fix the problem by passing a database reference along with the keyspace. This percolates through the class structure, so most cql3 raw types end up receiving this treatment. Origin gets along without it by using a singleton. We can't do this due to sharding (we could use a thread-local instance, but that's ugly too). Hopefully the transition to a visitor will clean this up.	2015-04-20 16:15:34 +03:00
Tomasz Grabiec	ee906471ab	cql3: Move method implementations to .cc	2015-04-15 20:44:59 +02:00
Tomasz Grabiec	878a740b9d	db: Write query results in serialized form This gives about 30% increase in tps in: build/release/tests/perf/perf_simple_query -c1 --query-single-key This patch switches query result format from a structured one to a serialized one. The problems with structured format are: - high level of indirection (vector of vectors of vectors of blobs), which is not CPU cache friendly - high allocation rate due to fine-grained object structure On replica side, the query results are probably going to be serialized in the transport layer anyway, so this change only subtracts work. There is no processing of the query results on replica other than concatenation in case of range queries. If query results are collected in serialized form from different cores, we can concatenate them without copying by simply appending the fragments into the packet. This optimization is not implemented yet. On coordinator side, the query results would have to be parsed from the transport layer buffers anyway, so this also doesn't add work, but again saves allocations and copying. The CQL server doesn't need complex data structures to process the results, it just goes over it linearly consuming it. This patch provides views, iterators and visitors for consuming query results in serialized form. Currently the iterators assume that the buffer is contiguous but we could easily relax this in future so that we can avoid linearization of data received from seastar sockets. The coordinator side could be optimized even further for CQL queries which do not need processing (eg. select * from cf where ...) we could make the replica send the query results in the format which is expected by the CQL binary protocol client. So in the typical case the coordinator would just pass the data using zero-copy to the client, prepending a header. We do need structure for prefetched rows (needed by list manipulations), and this change adds query result post-processing which converts serialized query result into a structured one, tailored particularly for prefetched rows needs. This change also introduces partition_slice options. In some queries (maybe even in typical ones), we don't need to send partition or clustering keys back to the client, because they are already specified in the query request, and not queried for. The query results hold now keys as optional elements. Also, meta-data like cell timestamp and ttl is now also optional. It is only needed if the query has writetime() or ttl() functions in it, which it typically won't have.	2015-04-15 20:44:50 +02:00
Avi Kivity	aa0e7c4b23	cql3: fix result_set_builder skipping empty rows result_set_builder's API is: new_row add add add new_row add add add new_row add add add build Since there is no end_row, it relies on an internal flag to see (in new_row and in build) whether we need to end a previous row. The problem is that we check if the row is empty(), which is true both for the first row, and for an empty row (if add() is never called, e.g. "SELECT COUNT(*) FROM tab"). Fix by using optional<> to mark whether the row exists (new_row has been called). This is ugly, but matches origin. We should improve that by adding an explicit end_row().	2015-04-06 19:04:39 +03:00
Avi Kivity	86f378e4d7	cql3: convert result set to use serialization format rather than protocol_version	2015-03-30 14:28:16 +03:00
Tomasz Grabiec	8912417dd0	cql3: Convert classes from org.cassandra.cql3.selection package	2015-03-11 14:56:10 +01:00

49 Commits