scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-12 19:02:12 +00:00

Author	SHA1	Message	Date
Piotr Sarna	000ce24306	cql3: solve JSON case-sensitivity issues This commit fixes two closely related issues with handling case-sensitive column names in JSON: * according to doc, case-sensitive names should be wrapped with additional pair of double quotes during JSON SELECT * logic error in parse_json() prevented INSERT JSON from working properly on case-sensitive column names This commit is followed by updated cql_query_test, which checks case-sensitive cases as well. Message-Id: <82d9d5e193a656e99bc86b297c00662a6fb808a0.1524576066.git.sarna@scylladb.com>	2018-04-24 16:30:55 +03:00
Nadav Har'El	1ec5688b0b	Materialized Views: fix incorrect limitations on row filtering This patch fixes several cases where it was disallowed to create a materialized view with a filter ("where ..."), for no good reason. After this patch, these cases will be allowed. Fixes #2367. In ordinary SELECT queries, certain types of filtering which is known to be deceptively inefficient is now allowed. For example, trying to query a range of partition keys cannot be done without reading the entire database (because the murmur3 tokenizer randomizes the order of partitions). Restricting two partition key components also cannot be done without reading excessive amount of the entire partition. So Scylla, following Cassandra, chooses to disallow such SELECT queries, and give an error message. However, the same SELECT statements should be allowed when defining a materialized view. In this case, the filter is just used to check an individual row - not to search for one - so there is no performance concern. Unfortunately the existing code did these validations while building the SELECT statement's "restrictions", in code shared by both uses of SELECT (query and MV definition). It was easy to move one of the validations to later code which runs after the restriction has already been built (and knows if it is working for query or MV), but because of the way the "restrictions" objects (translated from Cassandra 2's code) hide what they contain, many of the checks are harder to perform after having built the restrictions object. So instead, we add in strategic places in the restriction-handling code a new "allow_filtering" flag. If restrictions are built with allow_filtering=true, the extra performance-oriented tests on the filtering restrictions is not done. Materialized views sets allow_filtering=true. The allow_filtering flag will also be useful later when we want to support the "ALLOW FILTERING" query option which is currently not supported properly (we have several open issues on that). However note that this patch doesn't complete that support: I left a FIXME in the spot where we set allow_filtering in the Materialized Views case, but in the futre also need to set it if the user specified "ALLOWED FILTERING" in the query. This patch also enables several unit tests written by Duarte which used to fail because of this bug, and now pass. These tests verify that the restrictions are now allowed and filter the view as desired; But I also added test code to verify that the same restrictions are still forbidden, as before, when used in ordinary SELECT queries. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180423124343.17591-1-nyh@scylladb.com>	2018-04-23 14:08:04 +01:00
Avi Kivity	f7b102238a	cql3: change cql_statement methods to accept a local storage_proxy The storage_proxy represents the entire cluster, so there's never a need to access it on a remote shard; the local shard instance will contact remote shard or remote nodes as needed. Simplify the API by passing storage_proxy references instead of seastar::sharded<storage_proxy> references. query_processor and other callers are adjusted to call seastar::sharded::local() first. Message-Id: <20180415142656.25370-2-avi@scylladb.com>	2018-04-16 10:18:28 +02:00
Avi Kivity	dc0c458c12	Merge "First series on JSON support in CQL" from Piotr " This series introduces 'SELECT JSON' clause support for CQL. Things implemented: * expanding CQL grammar with JSON keyword * converting values to JSON format * serving 'SELECT JSON ' clauses tests for 'SELECT JSON' " * 'json_ops' of https://github.com/psarna/scylla: tests: add cql unit tests for SELECT JSON cql3: Add JSON token to CQL grammar cql3: add support for SELECT JSON clause cql3: add to_json_string function to types	2018-04-11 18:26:53 +03:00
Piotr Sarna	15545da572	cql3: add support for SELECT JSON clause This commit adds the implementation of SELECT JSON clause which returns rows in JSON format. Each returned row has a single '[json]' column. References #2058	2018-04-11 17:12:02 +02:00
Piotr Sarna	a5b6047ffa	cql3: add row-wise read statistics Database read metrics is now extended by total number of rows read, exported through cql_rows_read field. Closes #3146 Message-Id: <02f0816c509f3d7fea06da22869eea61548284e2.1522919708.git.sarna@scylladb.com>	2018-04-05 13:39:08 +03:00
Botond Dénes	2e2abf6edb	storage_proxy: add coordinator_query_options and coordinator_query_result As yet more parameters and return-values are about to be added to all storage_proxy::query_* methods we need a way that scales better than changing the signatures every time. To this end we aggregate all non-mandatory query parameters into `coordinator_query_options` and all return values into `coordinator_query_result`. This way new fields can be simply added to the respective structs while the signatures of the methods themselves and their client code can remain unchanged.	2018-03-19 15:17:35 +02:00
Botond Dénes	eac597d726	Add preferred and last replicas to the signature of query() preferred_replicas are added to the parameters and last_replicas are added to the return type. The preferred replicas will be used as a hint for the selection of the replicas to send the read requests to. The last replicas (returned) are the replicas actually selected for the read. This will allow queries to consistently hit the same replicas for each page thus reusing readers created on these replicas. For convenience a query() overload is provided that doesn't take or return the preferred and last replicas. This patch only adds the parameters and propagates them down to query_singular() and query_partition_key_range(). The code to actually use these preferred-replicas will be added in later patches. This reason for separating this is to reduce noise and improve reviewability for those functional changes later.	2018-03-13 10:34:34 +02:00
Nadav Har'El	fa284f6307	Add query UUID to read command This patch adds the parameter to read_command which is needed for caching of readers during multiple pages of a paged queries, which we will introduce in the next patches. The query_uuid is a UUID of a previously saved reader, which the replica is now asked to recall and resume (if this saved reader is no longer in the cache, it is fine, a new reader will be started). Additionally a helper flag is_first_page is added so that the replica can avoid doing any cache lookups (and incrementing miss counters) for the first page. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-03-13 10:34:34 +02:00
Duarte Nunes	ac6abf8021	Merge 'CQL clustering column secondary indexing support' from Pekka "This patch series adds support for clustering column secondary indexing. Fixes #2961 Tests: unit-tests (release)" * 'penberg/cql-2i-clustering-key-indexing/v2' of github.com:penberg/scylla: tests/cql_query_test: Add indexed clustering key query test cql3: Fix clustering column secondary indexing cql3/statements: Add values() helper to restrictions cql3/restrictions: Fix multi_column_restriction::values() cql3/restrictions: Fix single_column_primary_key_restrictions::values()	2018-02-12 18:49:34 +00:00
Paweł Dziepak	b635fec9bf	cql3/select_statement: do not capture stack variables by reference Default capture by reference considered harmful in async code.	2018-02-08 14:46:10 +00:00
Pekka Enberg	0128f802ed	cql3: Fix clustering column secondary indexing Fix clustering column indexing by lifting the limitation of only considering non-primary key restrictions in select_statement::find_index_partition_ranges().	2018-02-06 16:57:27 +02:00
Glauber Costa	08a0c3714c	allow request-specific read timeouts in storage proxy reads Timeouts are a global property. However, for tables in keyspaces like the system keyspace, we don't want to uphold that timeout--in fact, we wan't no timeout there at all. We already apply such configuration for requests waiting in the queued sstable queue: system keyspace requests won't be removed. However, the storage proxy will insert its own timeouts in those requests, causing them to fail. This patch changes the storage proxy read layer so that the timeout is applied based on the column family configuration, which is in turn inherited from the keyspace configuration. This matches our usual way of passing db parameters down. In terms of implementation, we can either move the timeout inside the abstract read executor or keep it external. The former is a bit cleaner, the the latter has the nice property that all executors generated will share the exact same timeout point. In this patch, we chose the latter. We are also careful to propagate the timeout information to the replica. So even if we are talking about the local replica, when we add the request to the concurrency queue, we will do it in accordance with the timeout specified by the storage proxy layer. After this patch, Scylla is able to start just fine with very low timeouts--since read timeouts in the system keyspace are now ignored. Fixes #2462 Implementation notes, and general comments about open discussion in 2462: * Because we are not bypassing the timeout, just setting it high enough, I consider the concerns about the batchlog moot: if we fail for any other reason that will be propagated. Last case, because the timeout is per-CF, we could do what we do for the dirty memory manager and move the batchlog alone to use a different timeout setting. * Storage proxy likes specifying its timeouts as a time_point, whereas when we get low enough as to deal with the read_concurrency_config, we are talking about deltas. So at some point we need to convert time_points to durations. We do that in the database query functions. v2: - use per-request instead of per-table timeouts. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-01-12 07:43:21 -05:00
Vladimir Krivopalov	41eb278899	Only allow DISTINCT SELECT queries with partition key restrictions. Fixes #2049 Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <75e69626d797e63fb1e93a9120f135d4959fad1c.1512162540.git.vladimir@scylladb.com>	2017-12-03 11:59:11 +02:00
Pekka Enberg	9048f741ad	cql3: Secondary-index backed select statements This patch adds support for secondary-index backed select statements. Current select_statement class is split into two separate classes: primary_key_select_statement that retains regular query behavior and indexed_table_select_statement that introduces the new secondary-index backed query logic. One of the two behaviors is selected at query preparation time to minimize overhead for non-indexed queries.	2017-11-03 10:12:58 +02:00
Amnon Heiman	08c81427b9	Add paging for internal queries Usually, internal queries are used for short queries. Sometimes though, like in the case of get compaction history, there could be a large amount of results. Without paging it will overload the system. This patch adds the ability to use paging internally. Using paging will be done explicitely, all the relevant information would be store in an internal_query_state, that would hold both the paging state but also the query so consecutive calls can be made. To use paging use the query method with a function. The function gets beside a statement and its parameters a function that will be used for each of the returned rows. For example if qp is a query_processor: qp.query("SELECT * from system.compaction_history", [] (const cql3::untyped_result_set::row& row) { .... // do something with row ... return stop_iteration::no; // keep on reading }); Will run the function on each of the compaction history table rows. To stop the iteration, the function can return stop_iteration::yes.	2017-07-20 17:43:51 +03:00
Duarte Nunes	6ac73b57fb	cql3/statements/select_statement: Remove dead code Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170522100230.17393-1-duarte@scylladb.com>	2017-05-22 14:32:12 +03:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Duarte Nunes	d7701087af	cql3/restrictions/statement_restrictions: Consider statement type Now that update_statement uses statement_restrictions, we need our validation logic to take the statement type into account, in particular to deal with insertion statements which only set static columns but specify clustering values. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-10 19:54:42 +02:00
Pekka Enberg	dfee4d2bb0	cql3: Fix partition key bind indices for prepared statements Fix the CQL front-end to populate the partition key bind index array in result message prepared metadata, which is needed for CQL binary protocol v4 to function correctly. Fixes #2355. Message-Id: <1494247871-3148-1-git-send-email-penberg@scylladb.com>	2017-05-08 16:33:17 +03:00
Vlad Zolotarov	ff55b76562	cql3::query_processor: use weak_ptr for passing the prepared statements around Use seastar::checked_ptr<weak_ptr<pepared_statement>> instead of shared_ptr for passing prepared statements around. This allows an easy tracking and handling of statements invalidation. This implementation will throw an exception every time an invalidated statement reference is dereferenced. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-12 12:24:03 -04:00
Avi Kivity	27c42359bc	Merge seastar upstream * seastar 6b21197...2ebe842 (6): > Merge "Various improvements to execution stages" from Paweł > app-template: allow apps to specify a name for help message > bool_class: avoid initializing object of incomplete type > app-template: make sure we can still get help with required options > prometheus: Http handler that returns prometheus 0.4 protobuf or text format > Update DPDK to 17.02 Includes patch from Pawel to adjust to updated execution_stage interface.	2017-03-26 10:50:21 +03:00
Duarte Nunes	bfb8a3c172	materialized views: Replace db::view::view class The write path uses a base schema at a particular version, and we want it to use the materialized views at the corresponding version. To achieve this, we need to map the state currently in db::view::view to a particular schema version, which this patch does by introducing the view_info class to hold the state previously in db::view::view, and by having a view schema directly point to it. The changes in the patch are thus: 1) Introduce view_info to hold the extra view state; 2) Point to the view_info from the schema; 3) Make the functions in the now stateless db::view::view non-member; 4) Remove the db::view::view class. All changes are structural and don't affect current behavior. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-03-15 15:50:05 +01:00
Paweł Dziepak	d005b20071	cql3: make select statement an execution stage	2017-03-09 09:27:43 +00:00
Pekka Enberg	2bd560118e	cql3/statements/select_statement: Unset value support	2017-01-27 09:24:36 +02:00
Tomasz Grabiec	bc6486b304	Use gc_clock instead of db_clock where possible Some code paths were obtaining db_clock timestamp to only convert it to gc_clock later. Avoid this. In the future we could make gc_clock cheaper cause it has low precision. Message-Id: <1482401190-2035-1-git-send-email-tgrabiec@scylladb.com>	2016-12-22 13:27:55 +02:00
Duarte Nunes	124802e196	cql3: Add function to build view's select statement This patch adds an utility function that creates a raw select statement from a set of columns and a where clause. It is intended to be used to create the prepared select statement used by the view class. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Duarte Nunes	088dfdb108	select_statement: Consider materialized views This patch considers materialized views in select_statement::check_access(). Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Duarte Nunes	8792fed651	create_view_statement: Complete implementation Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Duarte Nunes	a9c17b0a52	select_statement: Propagate for_view argument This patch propagates the for_view argument, used by statement_restrictions to ensure IS NOT NULL can be used when creating a materialized view. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Asias He	937f28d2f1	Convert to use dht::partition_range_vector and dht::token_range_vector	2016-12-19 14:08:50 +08:00
Asias He	e5485f3ea6	Get rid of query::partition_range Use dht::partition_range instead	2016-12-19 08:09:25 +08:00
Duarte Nunes	7ce859799b	select_statement: Don't always trim result set Trimming the result set is only needed when the query contains an "IN" relation, an ORDER BY clause, and defines a limit, which is the case where we query different ranges concurrently. We don't use the result_merger to trim since we first need to reorder the rows. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-15 11:00:46 +00:00
Duarte Nunes	fee0b7fa48	query_result_merger: Limit rows This patch makes the row limit enforced by the storage_proxy layer. It adds a row limit to the query_result_merger, useful when merging results for concurrent queries. More importantly, it provides guarantees that upper layers may be relying on implicitly (e.g., the paging code). Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-15 11:00:36 +00:00
Duarte Nunes	108011a839	query_result_merger: Limit partitions This patch adds a partition limit to the query_result_merger, useful when merging results for concurrent queries. This change also makes the partition limit enforced by the storage_proxy layer, no changes being needed by the upper layers, namely the Thrift interface. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-15 10:27:41 +00:00
Paweł Dziepak	dde4bd5051	cql3: allow short reads with paged queries Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-12-14 14:28:37 +00:00
Vlad Zolotarov	fa4e1db0cb	cql: add a counter for CQL read (SELECT) requests - Add a "reads" counter to a cql3::cql_stats struct. - Store a reference for a query_processor::_cql_stats in the select_statement object. - Increment a "reads" counter where needed. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-11-03 11:48:57 -04:00
Vlad Zolotarov	7606588267	cql3::query_processor: add cql_stats - Add cql_stats member. - Pass it to cql3::raw::parsed_statement::prepare() virtual method. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-11-03 11:48:57 -04:00
Vlad Zolotarov	25be28bb3c	tracing: set a table_name parameter in a SELECT statement Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	baa6496816	service::storage_proxy: READ instrumentation: store trace state object in abstract_read_executor Having a trace_state_ptr in the storage_proxy level is needed to trace code bits in this level. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:59 +03:00
Vlad Zolotarov	54a758dfff	cql3::select_statement: simplify the tracing code by using a tracing::make_trace_info() helper Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:58 +03:00
Vlad Zolotarov	a5022a09a4	tracing: use 'write' instead of 'flush' and 'store' for consistency with seastar's API In names of functions and variables: s/flush_/write_/ s/store_/write_/ In a i_tracing_backend_helper: s/flush()/kick()/ Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:57 +03:00
Duarte Nunes	f013425bb5	query: Ensure timestamp is last param in read_command Since the timestamp is not serialized, it must always be the last parameter of query::read_command. This patch reorders it with the partition_limit parameters and updates callers that specified a timestamp argument. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1468312334-10623-1-git-send-email-duarte@scylladb.com>	2016-07-12 10:41:54 +01:00
Pekka Enberg	f64c25a495	cql3/statements/select_statement: Unify coding style The coding style in select_statement.cc is very inconsistent which makes the code hard to read. Clean that up. Message-Id: <1464871790-21031-1-git-send-email-penberg@scylladb.com>	2016-06-02 16:17:21 +02:00
Vlad Zolotarov	4c17a422e0	cql3: instrument a SELECT query to send tracing info Instrument a coordinator of a SELECT query to send tracing session info to the corresponding replica Nodes. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-06-01 20:17:25 +03:00
Vlad Zolotarov	6e26909b02	query::read_command: add an optional trace_info field Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-06-01 20:17:19 +03:00
Avi Kivity	0135b4d5cd	cql3: constify metadata users Metadata usually doesn't change after it is created; make that visible in the code, allowing further optimizations to be applied later. Message-Id: <1464334638-7971-3-git-send-email-avi@scylladb.com>	2016-05-31 09:12:11 +03:00
Avi Kivity	25b3d74f45	cql3: Split select_statement::raw_statement into raw namespace cql3::select_statement::raw_statement -> cql3::raw::select_statement Message-Id: <1464609556-3756-4-git-send-email-avi@scylladb.com>	2016-05-31 09:09:30 +03:00
Avi Kivity	caf8d4f0e6	cql3: separate parsed_statement and parsed_statment::prepared cql3::statements::parsed_statement -> cql3::statements::raw::parsed_statement cql3::statements::parsed_statement::prepared -> cql3::statements::prepared_statement Message-Id: <1464609556-3756-2-git-send-email-avi@scylladb.com>	2016-05-31 09:09:10 +03:00
Gleb Natapov	7f6b12c97a	query: add user provided timestamp to read_command If read query supplies timestamp move it to read_command to be used later otherwise get local timestamp.	2016-05-24 15:19:35 +03:00

1 2

96 Commits