scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 06:05:53 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	e55034a33e	cql3: batch_statement: use external_memory_usage() to get mutation size batch_statement::verify_batch_size() verifies that the total size of mutations generated by the batch statement is smaller than certain configurable thresholds. This is done by a custom mutation_partition visitor, which violates atomic_cell_view::value() preconditions by calling it even for dead cells. The simples solution is to use mutation_partition::external_memory_usage() instead. Message-Id: <20180619131405.12601-1-pdziepak@scylladb.com>	2018-06-19 16:26:52 +03:00
Piotr Sarna	61e3ee6c3c	cql3: fix supernumerary column on view update Patch `f39891a999` fixed 3443, but also introduced a regression in dtest - new column was unconditionally added to view during ALTER TABLE ADD, while it should only be the case for "include all columns" views. This patch fixes the regression (spotted by query_new_column_test). References #3443 Message-Id: <7410d965255a514d78cf0ce941a3236b9d8ddbbd.1529399135.git.sarna@scylladb.com>	2018-06-19 16:26:51 +03:00
Avi Kivity	0cf4cf5981	cql: make modification_statement execution_stage scheduling aware Inherit scheduling from the caller, preventing a fall back into the main group.	2018-06-18 18:30:21 +03:00
Avi Kivity	9479d3f345	cql: make batch_statement execution_stage scheduling aware Inherit scheduling from the caller, preventing a fall back into the main group.	2018-06-18 18:30:21 +03:00
Avi Kivity	fdfc347595	cql: make select_statement execution_stage scheduling aware Inherit scheduling from the caller, preventing a fall back into the main group.	2018-06-18 18:30:21 +03:00
Piotr Sarna	70ba8c8317	cql3: update token order comments Comments about token order were outdated with token column patches and they are now up to date. Fixes #3423	2018-06-06 09:02:37 +02:00
Avi Kivity	aab6b0ee27	Merge "Introduce new in-memory representation for cells" from Paweł " This is the first part of the first step of switching Scylla. It covers converting cells to the new serialisation format. The actual structure of the cells doesn't differ much from the original one with a notable exception of the fact that large values are now fragmented and linearisation needs to be explicit. Counters and collections still partially rely on their old, custom serialisation code and their handling is not optimial (although not significantly worse than it used to be). The new in-memory representation allows objects to be of varying size and makes it possible to provide deserialisation context so that we don't need to keep in each instance of an IMR type all the information needed to interpret it. The structure of IMR types is described in C++ using some metaprogramming with the hopes of making it much easier to modify the serialisation format that it would be in case of open-coded serialisation functions. Moreover, IMR types can own memory thanks to a limited support for destructors and movers (the latter are not exactly the same thing as C++ move constructors hence a different name). This makes it (relatively) to ensure that there is an upper bound on the size of all allocations. For now the only thing that is converted to the IMR are atomic_cells and collections which means that the reduction in the memory footprint is not as big as it can be, but introducing the IMR is a big step on its own and also paves the way towards complete elimination of unbounded memory allocations. The first part of this patchset contains miscellaneous preparatory changes to various parts of the Scylla codebase. They are followed by introduction of the IMR infrastructure. Then structure of cells is defined and all helper functions are implemented. Next are several treewide patches that mostly deal with propagating type information to the cell-related operations. Finally, atomic_cell and collections are switched to used the new IMR-based cell implementation. The IMR is described in much more detail in imr/IMR.md added in "imr: add IMR documentation". Refs #2031. Refs #2409. perf_simple_query -c4, medians of 30 results: ./perf_base ./perf_imr diff read 308790.08 309775.35 0.3% write 402127.32 417729.18 3.9% The same with 1 byte values: ./perf_base1 ./perf_imr1 diff read 314107.26 314648.96 0.2% write 463801.40 433255.96 -6.6% The memory footprint is reduced, but that is partially due to removal of small buffer optimisation (whether it will be restored depends on the exact mesurements of the performance impact). Generally, this series was not expected to make a huge difference as this would require converting whole rows to the IMR. Memory footprint: Before: mutation footprint: - in cache: 1264 - in memtable: 986 After: mutation footprint: - in cache: 1104 - in memtable: 866 Tests: unit (release, debug) " * tag 'imr-cells/v3' of https://github.com/pdziepak/scylla: (37 commits) tests/mutation: add test for changing column type atomic_cell: switch to new IMR-based cell reperesentation atomic_cell: explicitly state when atomic_cell is a collection member treewide: require type for creating collection_mutation_view treewide: require type for comparing cells atomic_cell: introduce fragmented buffer value interface treewide: require type to compute cell memory usage treewide: require type to copy atomic_cell treewide: require type info for copying atomic_cell_or_collection treewide: require type for creating atomic_cell atomic_cell: require column_definition for creating atomic_cell views tests: test imr representation of cells types: provide information for IMR data: introduce cell data: introduce type_info imr/utils: add imr object holder imr: introduce concepts imr: add helper for allocating objects imr: allow creating lsa migrators for IMR objects imr: introduce placeholders ...	2018-05-31 19:21:15 +03:00
Piotr Sarna	360326fdc5	cql3: add compatibility with libjsoncpp < 1.6.0 Only libjsoncpp >= 1.6.0 offers a safe name() method for value iterators. For older versions, deprecated memberName() is used instead. Note that memberName() was deprecated because of its inability to deal with embedded null characters. Fixes #3471 Message-Id: <e64a62bfc24ef06daee238d79d557fe6ec8979d3.1527758708.git.sarna@scylladb.com>	2018-05-31 18:00:22 +03:00
Paweł Dziepak	aa25f0844f	atomic_cell: introduce fragmented buffer value interface As a prepratation for the switch to the new cell representation this patch changes the type returned by atomic_cell_view::value() to one that requires explicit linearisation of the cell value. Even though the value is still implicitly linearised (and only when managed by the LSA) the new interface is the same as the target one so that no more changes to its users will be needed.	2018-05-31 15:51:11 +01:00
Duarte Nunes	c4f267bdfe	database: Refresh view dependent fields when altering base A view schema's view_info contains the id of the base regular column that view includes in its primary key. Since the column id of a particular column can potentially change with a new schema version, we need to refresh the stored column id. We weren't doing that when unselected base columns are added, and this patch fixes it by triggering an update of the view schema when base columns are added and the view contains a base regular column in its PK. Fixes #3443 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180530194536.51202-1-duarte@scylladb.com>	2018-05-31 12:10:49 +03:00
Avi Kivity	b70febe246	cql: cql_statement: remove execute_internal() With no callers, it can be safely removed.	2018-05-27 12:40:27 +03:00
Avi Kivity	eb19798f99	cql: select_statement: make execute() and execute_internal() equivalent execute_internal(), for some code paths, differs from execute by the following: 1. it uses CL_ONE unconditionally 2. it has no query timeout 3. it doesn't use execution stages for other code paths, it just calls execute. As preparation for getting rid of execute_internal(), unify the two code paths. Commit `4859b759b9` caused the consistency level and timeouts to be provided by the caller, so using the caller provided parameters instead of overriding them does not change behavior.	2018-05-27 12:36:02 +03:00
Avi Kivity	d998f06633	cql: schema_altering_statement: make execute() and execute_internal() equivalent To get rid of execute_internal(), make the normal execute() equivalent and call it instead of having two different paths.	2018-05-27 11:08:55 +03:00
Nadav Har'El	1b29dd44f7	secondary index: fix wrong results returned in certain cases The current secondary-index search code, in indexed_table_select_statement::do_execute(), begins by fetching a list of partitions, and then the content of these partitions from the base table. However, in some cases, when the table has clustering columns and not searching on the first one of them, doing this work in partition granularity is wrong, and yields wrong results as demonstrated in issue #3405. So in this patch, we recognize the cases where we need to work in clustering row granularity, and in those cases use the new functions introduced in the previous patches - find_index_clustering_rows() and the execute() variant taking a list of primary-keys of rows. Fixes #3405. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:56:03 +03:00
Nadav Har'El	adf6d742be	secondary index: method for fetching list of rows from base table We add a new variant of select_statement::execute() which allows selecting an arbitrary list of clustering rows. The existing execute() variant can't do that - it can only take a list of partitions, and read the same clustering rows from all of them. The new select variant is not needed for regular CQL queries (which do not have a syntax allowing reading a list of rows with arbitrary primary keys), but we will need it for secondary index search, for solving issue #3405. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:54:36 +03:00
Nadav Har'El	a096a82adc	secondary index: method for fetching list of rows from index We already have a method find_index_partition_ranges(), to fetch a list of partition keys from the secondary index. However, as we shall see in the following patches (and see also issue #3405), getting a list of entire partitions is not always enough - the secondary index actually holds a list of primary keys, which includes clustering keys, and in some queries we can't just ignore them. So this patch provides a new method find_index_clustering_rows(), to query the secondary index and get a list of matching clustering keys. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:53:29 +03:00
Nadav Har'El	083b2ae573	select_statement.cc: refactor find_index_partition_ranges() The function find_index_partition_ranges() is used in secondary index searches for fetching a list of matching partition. In a following patch, we want to add a similar function for getting a list of rows. To avoid duplicate code, in this patch we split parts of find_index_partition_ranges() into two new functions: 1. get_index_schema() returns a pointer to the index view's schema. 2. read_posting_list() reads from this view the posting list (i.e., list of keys) for the current searched value. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:50:45 +03:00
Nadav Har'El	7dc9b77682	select_statement.cc: fix variable lifetime errors do_with() provides code a reference to an object which will be kept alive. It is a mistake to make a copy of this object or of parts of it, because then the lifetime of this copy will have to be maintained as well. In particular, it is a mistake to do do_with(..., [] (auto x) { ... }) - note how "auto x" appears instead of the correct "auto& x". This causes the object to be copied, and its lifetime not maintained. This patch fixes several cases where this rule was broken in select_statement.cc. I could not reproduce actual crashes caused by these mistakes, but in theory they could have happened. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:46:12 +03:00
Vlad Zolotarov	9723988926	cql3::statements::batch_statement: introduce a single_statement class This is a helper class needed to control the handling process of a single statement in the current batch. In particular it has the boolean defining if the authorization is needed for this statement. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-05-22 20:15:03 -04:00
Piotr Sarna	40bf5d671b	cql: add secondary index metrics This commit adds basic secondary index metrics to cql_stats: * total number of indexes creates * total number of indexes dropped * total number of reads from a secondary index * total number of rows read from a secondary index References #3384 Reviewed-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <d5eda7a343cee547c921dd4d289ecb1ac1c2bf24.1526374243.git.sarna@scylladb.com>	2018-05-15 17:59:53 +03:00
Nadav Har'El	f5536d607e	secondary index: fix multiple appearance of rows This patch fixes a bug where queries using a secondary index would, in some cases, produce the same rows multiple times. The problem was that the code begins by finding a list of primary keys that match the search, and then work on the partitions containing them. If multiple rows matched in the same partition, the partition was considered multiple times, and the same rows were output multiple times. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180510203141.17157-1-nyh@scylladb.com>	2018-05-13 20:08:14 +02:00
Duarte Nunes	a23bda3393	Merge 'Implement separate timeout for range queries' from Avi " This patchset implements separate timeouts for range queries, and lays the foundations for separate timeouts for other query types. While the feature in itself is worthy, the real motivation is to have the timeouts decided by the caller, instead of storage_proxy. This in turn is required to disentangle each layer behaving differently depending on whether the query is internal or not; instead, the goal is to have each caller declare its needs in terms of consistency level and timeouts, and have the lower layers implement its requirements instead of making their own decisions. Fixes #3013. Tests: unit (release) " * tag '3013/v1.1' of https://github.com/avikivity/scylla: storage_proxy: remove default_query_timeout() storage_proxy: don't use default timeouts query_options: augment with timeout_config thrift: configure thrift transport and handler with a timeout_config transport: configure native transport with a timeout_config cql3: define and populate timeout_config_selector timeout_config: introduce timeout configuration	2018-05-13 20:05:50 +02:00
Jesse Haber-Kucharsky	4ffb4c6788	cql3: Include custom options in LIST ROLES An implementation of `authenticator` can support custom options for a each role. If, to make up an example, the authenticator supported the `region` key, then a role would be created as follows: CREATE ROLE jsmith WITH OPTIONS = { 'region': 'north_america' } AND PASSWORD = 'super_secure'; LIST ROLES will now print this custom option map as an additional column with the heading "options". However, none of the implementations of `authenticator` in Scylla currently support OPTIONS, so LIST ROLES will in practice, for now, print the empty set: role \| super \| login \| options -----------+-------+-------+--------- cassandra \| True \| True \| {}	2018-05-09 21:17:14 -04:00
Nadav Har'El	21d7507b74	secondary index: move stuff out of db/index directory The db/index directory contains just a few lines of code that exists there for historical reasons. It's confusing that we have both db/index and index/ directory related to secondary-indexing. This patch moves what little is still in db/index/ to index/. In the future we should probably get rid of the "secondary_index" class we had there, but for now, let's at least not have a whole new directory for it. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180501101246.21143-1-nyh@scylladb.com>	2018-05-01 13:21:24 +03:00
Avi Kivity	d8dd7e05a7	storage_proxy: don't use default timeouts Require all callers to supply timeouts instead of relying on defaults. Since all callers now have the timeouts set up, they can easily supply them.	2018-04-30 13:19:53 +03:00
Avi Kivity	49fdf01b5d	cql3: define and populate timeout_config_selector Determine which timeout we need to apply at prepare time. We don't know the numerical value (since it depends on whoever is executing the query, not just the statement type), but we know which member of timeout_config we need, so determine and remember that.	2018-04-30 13:19:49 +03:00
Nadav Har'El	8012f231ca	materialized views: fix another case-sensitivity bug We had another case-sensitivity bug in materialized views, where if a case-sensitive (quoted) column name was listed explicitly on "SELECT" (instead of implicitly, e.g., in "SELECT *") the column name was incorrectly folded to lower-case and inserts would fail. This patch fixes the code, where a "SELECT" statement was built using the desired column names, but column names that needed quoting were not being quoted. The bug was in a helper function build_select_statement() which took column name strings and failed to quote them. We clean up this function to take column definitions instead of strings - and take care of the quoting itself. It also needs to quote the table's name in the select statement being built. Fixes #3391. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180429221857.6248-6-nyh@scylladb.com>	2018-04-30 00:27:23 +02:00
Nadav Har'El	a0bc0d2d11	secondary index: fix support for compound partition key In the current code, if the base table has a compound partition key (i.e., multiple partition-key columns) searching its secondary indexes didn't work. There is no real reason why this, it was a just a bug in preparing the second query: Every SI query is converted to two queries. The first queries the associated materialized view, to find a list of primary keys. Those we need to use in a second query, of the base table. The second query needs to list, as restrictions, the keys found above. When a partition key is compound, its components build one key and one restriction. But in the buggy code, we incorrectly used each component as a separate (improperly formatted) key and restriction, and obviously this didn't work. This patch also adds a test that reproduces this problem and confirms its fix. In the fixed code I also found another incorrect use of to_cql_string() (which could break case-sensitive primary key column names) and changed it to to_string(). Fixes #3210. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180429124138.24406-1-nyh@scylladb.com>	2018-04-29 14:40:13 +01:00
Nadav Har'El	d674b6f672	secondary index: fix bug in indexing case-sensitive column names CQL normally folds identifiers such as column names to lowercase. However, if the column name is quoted, case-sensitive column names and other strange characters can be used. We had a bug where such columns could be indexed, but then, when trying to use the index in a SELECT statement, it was not found. The existing code remembered the index's column after converting it to CQL format (adding quotes). But such conversion was unnecessary, and wrong, because the rest of the code works with bare strings and does not involve actual CQL statements. So the fix avoids this mistaken conversion. This patch also includes a test to reproduce this problem. Fixes #3154. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180424154920.15924-1-nyh@scylladb.com>	2018-04-24 16:57:17 +01:00
Piotr Sarna	000ce24306	cql3: solve JSON case-sensitivity issues This commit fixes two closely related issues with handling case-sensitive column names in JSON: * according to doc, case-sensitive names should be wrapped with additional pair of double quotes during JSON SELECT * logic error in parse_json() prevented INSERT JSON from working properly on case-sensitive column names This commit is followed by updated cql_query_test, which checks case-sensitive cases as well. Message-Id: <82d9d5e193a656e99bc86b297c00662a6fb808a0.1524576066.git.sarna@scylladb.com>	2018-04-24 16:30:55 +03:00
Vladimir Krivopalov	fc644a8778	Fix Scylla to compile with older versions of JsonCpp (<= 1.7.0). Old versions of JsonCpp declare the following typedefs for internally used aliases: typedef long long int Int64; typedef unsigned long long int UInt64; In newer versions (1.8.x), those are declared as: typedef int64_t Int64; typedef uint64_t UInt64; Those base types are not identical so in cases when a type has constructors overloaded only for specific integral types (such as Json::Value in JsonCpp or data_value in Scylla), an attempt to pack/unpack an integer from/to a JSON object causes ambiguous calls. Fixes #3208 Tests: unit {release}. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <e9fff9f41e0f34b15afc90b5439be03e4295623e.1524556258.git.vladimir@scylladb.com>	2018-04-24 10:58:38 +03:00
Avi Kivity	8a8f688dbf	Merge "Materialized views: Fixes to update generation" from Duarte " Fixes to several issues around view update generation, pertaining to timestamp and TTL management. Fixes #3361 Fixes #3360 Fixes #3140 Refs #3362 Tests: unit(release, debug), dtest(materialized_views.py) " Reviewed-by: Nadav Har'El <nyh@scylladb.com> * 'materialized-views/fixes-galore/v2' of http://github.com/duarten/scylla: mutation_partition: Clarify comment about emptiness tests: Add view_complex_test tests/view_schema_test: Complete test db/view: Move cells instead of copying in add_cells_to_view() db/view: Handle unselected base columns and corner cases mutation_partition: Regular base column in view determines row liveness db/view: Don't avoid read-before-write when view PK matches base db/view: Process base updates to column unselected by its views db/view: Consider partition tombstone when generating updates tests/view_schema_test: Remove unneeded test mutation_fragment: Allow querying if row is live view_info: Add view_column() overload view_info: Explicitly initialize base-dependent fields cql3/alter_table_statement: Forbid dropping columns of MV base tables	2018-04-23 16:49:29 +03:00
Nadav Har'El	1ec5688b0b	Materialized Views: fix incorrect limitations on row filtering This patch fixes several cases where it was disallowed to create a materialized view with a filter ("where ..."), for no good reason. After this patch, these cases will be allowed. Fixes #2367. In ordinary SELECT queries, certain types of filtering which is known to be deceptively inefficient is now allowed. For example, trying to query a range of partition keys cannot be done without reading the entire database (because the murmur3 tokenizer randomizes the order of partitions). Restricting two partition key components also cannot be done without reading excessive amount of the entire partition. So Scylla, following Cassandra, chooses to disallow such SELECT queries, and give an error message. However, the same SELECT statements should be allowed when defining a materialized view. In this case, the filter is just used to check an individual row - not to search for one - so there is no performance concern. Unfortunately the existing code did these validations while building the SELECT statement's "restrictions", in code shared by both uses of SELECT (query and MV definition). It was easy to move one of the validations to later code which runs after the restriction has already been built (and knows if it is working for query or MV), but because of the way the "restrictions" objects (translated from Cassandra 2's code) hide what they contain, many of the checks are harder to perform after having built the restrictions object. So instead, we add in strategic places in the restriction-handling code a new "allow_filtering" flag. If restrictions are built with allow_filtering=true, the extra performance-oriented tests on the filtering restrictions is not done. Materialized views sets allow_filtering=true. The allow_filtering flag will also be useful later when we want to support the "ALLOW FILTERING" query option which is currently not supported properly (we have several open issues on that). However note that this patch doesn't complete that support: I left a FIXME in the spot where we set allow_filtering in the Materialized Views case, but in the futre also need to set it if the user specified "ALLOWED FILTERING" in the query. This patch also enables several unit tests written by Duarte which used to fail because of this bug, and now pass. These tests verify that the restrictions are now allowed and filter the view as desired; But I also added test code to verify that the same restrictions are still forbidden, as before, when used in ordinary SELECT queries. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180423124343.17591-1-nyh@scylladb.com>	2018-04-23 14:08:04 +01:00
Piotr Sarna	cdcbf654a8	cql3: add support for INSERT JSON clause This commit adds the implementation of INSERT JSON clause which accepts JSON object as parameter and inserts appropriate values into appropriate columns, as defined in given JSON. Example: INSERT INTO testme JSON '{ "id" : 77, "name" : "Jones", "ranking" : 8.5 }' References #2058	2018-04-23 12:00:57 +02:00
Duarte Nunes	b77b71436d	cql3/alter_table_statement: Forbid dropping columns of MV base tables When a view's PK only contains the columns that form the base's PK, then the liveness of a particular view row is determined not only by the base row's marker, but also by the selected and, more importantly, unselected columns. The fact that unselected columns can keep a view row alive also requires that users cannot drop columns of base tables with materialized views, which this patch implements. Refs #3362 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Avi Kivity	f7b102238a	cql3: change cql_statement methods to accept a local storage_proxy The storage_proxy represents the entire cluster, so there's never a need to access it on a remote shard; the local shard instance will contact remote shard or remote nodes as needed. Simplify the API by passing storage_proxy references instead of seastar::sharded<storage_proxy> references. query_processor and other callers are adjusted to call seastar::sharded::local() first. Message-Id: <20180415142656.25370-2-avi@scylladb.com>	2018-04-16 10:18:28 +02:00
Avi Kivity	dc0c458c12	Merge "First series on JSON support in CQL" from Piotr " This series introduces 'SELECT JSON' clause support for CQL. Things implemented: * expanding CQL grammar with JSON keyword * converting values to JSON format * serving 'SELECT JSON ' clauses tests for 'SELECT JSON' " * 'json_ops' of https://github.com/psarna/scylla: tests: add cql unit tests for SELECT JSON cql3: Add JSON token to CQL grammar cql3: add support for SELECT JSON clause cql3: add to_json_string function to types	2018-04-11 18:26:53 +03:00
Piotr Sarna	15545da572	cql3: add support for SELECT JSON clause This commit adds the implementation of SELECT JSON clause which returns rows in JSON format. Each returned row has a single '[json]' column. References #2058	2018-04-11 17:12:02 +02:00
Piotr Sarna	a5b6047ffa	cql3: add row-wise read statistics Database read metrics is now extended by total number of rows read, exported through cql_rows_read field. Closes #3146 Message-Id: <02f0816c509f3d7fea06da22869eea61548284e2.1522919708.git.sarna@scylladb.com>	2018-04-05 13:39:08 +03:00
Tomasz Grabiec	52c61df930	Relax includes To avoid unnecessary recompilations. Message-Id: <1522168295-994-1-git-send-email-tgrabiec@scylladb.com>	2018-03-28 10:49:07 +03:00
Jesse Haber-Kucharsky	849cf49b8d	Roles are implemented Fixes #1941.	2018-03-26 00:52:59 -04:00
Nadav Har'El	c809dd2e66	Materialized Views: change order of view creation verification Changed the order to check a couple of error conditions after checking for too many or missing primary key columns. This order (showing the too many or missing key columns first) is more useful, and is the order in Cassadra's code. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180320161121.13392-2-nyh@scylladb.com>	2018-03-21 09:47:41 +00:00
Nadav Har'El	871cecfd3b	Materialized Views: fix checking that view key includes base key A view's primary key must include all the columns of the base's primary key. If we don't check this and fail the table's creation, we can discover problems later on when using the table, as demonstrated in issue #2720. We had such checking code (translated from the same code in Java) but it had an extra "else" which caused nothing to be put in "missing_pk_columns" so the error was never recognized. Also, when the error does happen, we should print the column's name_as_text(), not name() which is (surprisingly) just a number. Fixes #2720. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180320161121.13392-1-nyh@scylladb.com>	2018-03-21 09:47:41 +00:00
Duarte Nunes	237184324e	Merge 'Make the read repair decision per-query instead of per-page' from Botond " Since `f8613a8415` we have reader-caching on replicas for single-partition queries. This caching works best when all pages of a query are sent to the same replicas consistently and thus they can reuse the cached readers there. The propability-based nature of read-repair works against this as on any given page a read-repair will be attempted or not based on probability. This will cause hight drop-rates on the replicas used for read-repair as the cached reader will not be reusable if the replica was skipped for one or more pages. To fix this make the repair-decision once, on the first page of the query and store the decision in the paging-state. On all remaining pages of the query use this stored decision. Tests: unit-tests(release, debug), dtest(paging_advanced_tests.py) Refs: #1865 " * 'per_query_repair_decision/v2' of https://github.com/denesb/scylla: Make the read-repair decision only once storage_proxy: add coordinator_query_options and coordinator_query_result Add query_read_repair_decision to paging-state	2018-03-20 11:59:41 +00:00
Nadav Har'El	da110d612e	Materialized Views: Fix "IS NOT NULL" checking When creating a materialized view, the user must provide a "IS NOT NULL" restriction for each of the created view's primary columns. If such a restriction is missing, the view creation should fail. In #2628 we noticed that sometimes it wasn't failing, but later updates to such table would fail, which is a bug. There is actually one special case where "IS NOT NULL" is optional: It is optional on the base's partition key column (when there is just one of these) because it is already assumed that the partition key in its entirety can never be. Our "IS NOT NULL" test, validate_primary_key(), had two logic errors which caused it to miss some cases of missing "IS NOT NULL": 1. Instead of checking whether a certain column is a the base's only partition-key column, and avoid testing IS NOT NULL just for that specific column, the code tested whether the schema has such a column, and if it did, the test was skipped for all columns. 2. When the code found the one new column in the view's primary key, it was so happy to find it that it immediately returned, and forgot to test the IS NOT NULL on that column :-) Both errors are fixed by this patch. See the next patch for a unit test. Fixes #2628. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180319233657.522-1-nyh@scylladb.com>	2018-03-20 00:30:18 +00:00
Botond Dénes	2e2abf6edb	storage_proxy: add coordinator_query_options and coordinator_query_result As yet more parameters and return-values are about to be added to all storage_proxy::query_* methods we need a way that scales better than changing the signatures every time. To this end we aggregate all non-mandatory query parameters into `coordinator_query_options` and all return values into `coordinator_query_result`. This way new fields can be simply added to the respective structs while the signatures of the methods themselves and their client code can remain unchanged.	2018-03-19 15:17:35 +02:00
Jesse Haber-Kucharsky	6a360c2d17	auth: Grant all permissions to object creator When a table, keyspace, or role is created, the creator now is automatically granted all applicable permissions on the object. This behavior is consistent with Apache Cassandra. Fixes #3216.	2018-03-14 01:54:31 -04:00
Jesse Haber-Kucharsky	c502fe24ce	auth: Unify handling for unsupported errors Instead of some functions in `allow_all_authorizer` throwing exceptions and others being silently pass-through, we consistently return exception futures with `auth::unsupported_authorization_operation`. These errors are converted to `invalid_request_exception` in the CQL error and ignored where appropriate in the auth subsystem.	2018-03-14 01:54:28 -04:00
Jesse Haber-Kucharsky	9117a689cf	auth: Fix `const` correctness This patch came about because of an important (and obvious, in hindsight) realization: instances of the authorizer, role manager, and authenticator are clients for access-control state and not the state itself. This is reflected directly in Scylla: `auth::service` is sharded across cores and this is possible because each instance queries and modifies the same global state. To give more examples, the value of an instance of `std::vector<int>` is the structure of the container and its contents. The value of `int file_descriptor` is an identifier for state maintained elsewhere. Having watched an excellent talk by Herb Sutter [1] and having read an informative blog post [2], it's clear that a member function marked `const` communicates that the observable state of the instance is not modified. Thus, the member functions of the role-manager, authenticator, and authorizer clients should not be marked `const` only if the state of the client itself is observably changed. By this principle, member functions which do not change the state of the client, but which mutate the global state the client is associated with (for example, by creating a role) are marked `const`. The `start` (and `stop`) functions of the client have the dual role of initializing (finalizing) both the local client state and the external state; they are not marked `const`. [1] https://herbsutter.com/2013/01/01/video-you-dont-know-const-and-mutable/ [2] http://talesofcpp.fusionfenix.com/post-2/episode-one-to-be-or-not-to-be-const	2018-03-14 01:32:43 -04:00
Botond Dénes	eac597d726	Add preferred and last replicas to the signature of query() preferred_replicas are added to the parameters and last_replicas are added to the return type. The preferred replicas will be used as a hint for the selection of the replicas to send the read requests to. The last replicas (returned) are the replicas actually selected for the read. This will allow queries to consistently hit the same replicas for each page thus reusing readers created on these replicas. For convenience a query() overload is provided that doesn't take or return the preferred and last replicas. This patch only adds the parameters and propagates them down to query_singular() and query_partition_key_range(). The code to actually use these preferred-replicas will be added in later patches. This reason for separating this is to reduce noise and improve reviewability for those functional changes later.	2018-03-13 10:34:34 +02:00

1 2 3 4 5 ...

571 Commits