scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 11:30:36 +00:00

Author	SHA1	Message	Date
Avi Kivity	c8a66efb6a	cql: query_processor: don't call cql_statement::execute_internal() any more All cql_statement::execute_internal() overrides now either throw or call execute(). Since we shouldn't be calling the throwing overrides internally, we can safely call execute() instead. This allows us to get rid of execute_internal().	2018-05-27 12:37:37 +03:00
Avi Kivity	eb19798f99	cql: select_statement: make execute() and execute_internal() equivalent execute_internal(), for some code paths, differs from execute by the following: 1. it uses CL_ONE unconditionally 2. it has no query timeout 3. it doesn't use execution stages for other code paths, it just calls execute. As preparation for getting rid of execute_internal(), unify the two code paths. Commit `4859b759b9` caused the consistency level and timeouts to be provided by the caller, so using the caller provided parameters instead of overriding them does not change behavior.	2018-05-27 12:36:02 +03:00
Avi Kivity	d998f06633	cql: schema_altering_statement: make execute() and execute_internal() equivalent To get rid of execute_internal(), make the normal execute() equivalent and call it instead of having two different paths.	2018-05-27 11:08:55 +03:00
Duarte Nunes	4859b759b9	Merge 'Make all timeouts explicit' from Avi " This patchset makes all users of query_processor specify their timeouts explicitly, in preparation for the removal of cql_statement::execute_internal() (whose main function was to override timeouts). " * tag 'cql-explicit-timeouts/v1' of https://github.com/avikivity/scylla: query_processor: require clients to specify timeout configuration query_processor: un-default consistency level in make_internal_options	2018-05-26 16:10:58 +02:00
Avi Kivity	6e97609049	Merge "Improve support for data types handling in SSTables 3.x" from Vladimir " Firstly, this patchset removes the is_fixed_length() function of abstract_type in favour of value_length_if_fixed(). Secondly, it fixed the byte_type to be compatible with Cassandra which erroneously treats it as a variable-length data type. Lastly, it adds a unit test covering all non-composite CQL data types for writing. Tests: unit {release} " * 'projects/sstables-30/different-data-types/v1' of https://github.com/argenet/scylla: tests: Add a unit test for writing different data types to SSTables 3.x format. types: Treat byte_type as a variable-length type for compatibility reasons. types: Remove is_value_fixed() and use value_length_if_fixed() instead.	2018-05-26 10:24:35 +03:00
Vladimir Krivopalov	0951153292	tests: Add a unit test for writing different data types to SSTables 3.x format. This tests covers all non-composite CQL data types. The resulting files are dumped using sstabledump as follows: [ { "partition" : { "key" : [ "key" ], "position" : 0 }, "rows" : [ { "type" : "row", "position" : 174, "liveness_info" : { "tstamp" : "1525385507816568" }, "cells" : [ { "name" : "asciival", "value" : "hello" }, { "name" : "bigintval", "value" : 9223372036854775807 }, { "name" : "blobval", "value" : "0x6772656174" }, { "name" : "boolval", "value" : true }, { "name" : "dateval", "value" : "2017-05-05" }, { "name" : "decimalval", "value" : 5.45 }, { "name" : "doubleval", "value" : 36.6 }, { "name" : "durationval", "value" : 1h4m48s20ms }, { "name" : "floatval", "value" : 7.62 }, { "name" : "inetval", "value" : "192.168.0.110" }, { "name" : "intval", "value" : -2147483648 }, { "name" : "smallintval", "value" : 32767 }, { "name" : "timeuuidval", "value" : "50554d6e-29bb-11e5-b345-feff819cdc9f" }, { "name" : "timeval", "value" : "19:45:05.090000000" }, { "name" : "tinyintval", "value" : 127 }, { "name" : "tsval", "value" : "2015-05-01 09:30:54.234Z" }, { "name" : "uuidval", "value" : "01234567-0123-0123-0123-0123456789ab" }, { "name" : "varcharval", "value" : "привет" }, { "name" : "varintval", "value" : 123 } ] } ] } ] Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-25 21:41:23 -07:00
Vladimir Krivopalov	3981dd6dd6	types: Treat byte_type as a variable-length type for compatibility reasons. Although values of the byte_type that corresponds to CQL TINYINT type always occupy only a single byte, Cassandra treats this it as a variable-length type for SSTables 3.0 reading and writing. While it is clearly a mistake at Cassandra side, we have to stay compatible. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-25 21:41:23 -07:00
Vladimir Krivopalov	24cb062834	types: Remove is_value_fixed() and use value_length_if_fixed() instead. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-25 21:41:23 -07:00
Duarte Nunes	4db0b4af58	Merge 'secondary index: Fixes for tables with multiple clustering columns' from Nadav " This patch series fixes #3405: secondary-index search only provided correct results in certain cases, where entire partitions or contiguous partition slices matched the query. When this was not the case, and individual clustering rows match or do not match the query, the wrong results were returned. To fix this bug, we need to fix the two stages of secondary-index search: 1. In the first stage, we read from the index MV a list of row keys (i.e., primary keys) matching the query. We can no longer remember just the partition keys, and need to keep the list of full primary keys. 2. In the second stage, we have a list of rows (not partitions) and need to read their selected contents to return to the user. Since CQL queries do not have a syntax to select an arbitrary list of rows, we have to add new code to do such a selection. Because we provide an ad-hoc, inefficient, implementation for the row selection described in stage 2, these patches leave two paths in the code: The old path, efficiently selecting entire partitions, and the new path, selecting individual rows. The old path is still used when it is applicable, which is when a partition key column or the first clustering key column is searched. " * 'si-fix-v4' of http://github.com/nyh/scylla: secondary index: test multiple clustering column secondary index: fix wrong results returned in certain cases secondary index: method for fetching list of rows from base table secondary index: method for fetching list of rows from index select_statement.cc: refactor find_index_partition_ranges() select_statement.cc: fix variable lifetime errors	2018-05-24 21:36:18 +01:00
Nadav Har'El	a6d9ea2fb5	secondary index: test multiple clustering column This patch adds a test for secondary indexes on a table which has many columns - two partition key column, two clustering key columns, and two regular columns. We add a bunch of data in various rows and partitions, index all columns and search on this data and verify the results. This test exposed various bugs in secondary index search, including issue #3405. After we fixed those bugs, the test now passes. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:56:57 +03:00
Nadav Har'El	1b29dd44f7	secondary index: fix wrong results returned in certain cases The current secondary-index search code, in indexed_table_select_statement::do_execute(), begins by fetching a list of partitions, and then the content of these partitions from the base table. However, in some cases, when the table has clustering columns and not searching on the first one of them, doing this work in partition granularity is wrong, and yields wrong results as demonstrated in issue #3405. So in this patch, we recognize the cases where we need to work in clustering row granularity, and in those cases use the new functions introduced in the previous patches - find_index_clustering_rows() and the execute() variant taking a list of primary-keys of rows. Fixes #3405. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:56:03 +03:00
Nadav Har'El	adf6d742be	secondary index: method for fetching list of rows from base table We add a new variant of select_statement::execute() which allows selecting an arbitrary list of clustering rows. The existing execute() variant can't do that - it can only take a list of partitions, and read the same clustering rows from all of them. The new select variant is not needed for regular CQL queries (which do not have a syntax allowing reading a list of rows with arbitrary primary keys), but we will need it for secondary index search, for solving issue #3405. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:54:36 +03:00
Nadav Har'El	a096a82adc	secondary index: method for fetching list of rows from index We already have a method find_index_partition_ranges(), to fetch a list of partition keys from the secondary index. However, as we shall see in the following patches (and see also issue #3405), getting a list of entire partitions is not always enough - the secondary index actually holds a list of primary keys, which includes clustering keys, and in some queries we can't just ignore them. So this patch provides a new method find_index_clustering_rows(), to query the secondary index and get a list of matching clustering keys. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:53:29 +03:00
Nadav Har'El	083b2ae573	select_statement.cc: refactor find_index_partition_ranges() The function find_index_partition_ranges() is used in secondary index searches for fetching a list of matching partition. In a following patch, we want to add a similar function for getting a list of rows. To avoid duplicate code, in this patch we split parts of find_index_partition_ranges() into two new functions: 1. get_index_schema() returns a pointer to the index view's schema. 2. read_posting_list() reads from this view the posting list (i.e., list of keys) for the current searched value. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:50:45 +03:00
Nadav Har'El	7dc9b77682	select_statement.cc: fix variable lifetime errors do_with() provides code a reference to an object which will be kept alive. It is a mistake to make a copy of this object or of parts of it, because then the lifetime of this copy will have to be maintained as well. In particular, it is a mistake to do do_with(..., [] (auto x) { ... }) - note how "auto x" appears instead of the correct "auto& x". This causes the object to be copied, and its lifetime not maintained. This patch fixes several cases where this rule was broken in select_statement.cc. I could not reproduce actual crashes caused by these mistakes, but in theory they could have happened. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:46:12 +03:00
Avi Kivity	0b8d06ebf9	Merge seastar upstream * seastar a48fe69...12cffef (5): > variant_utils: don't pass variant by rref to boost::apply_visitor > Revert "build: fix compilation issues on cmake. missing stdc++-fs" > reactor: prevent expected overflow from triggering ubsan warning > cmake: Add cmake option to disable testing altogether > build: fix compilation issues on cmake. missing stdc++-fs	2018-05-24 12:17:56 +03:00
Avi Kivity	f893dc61f0	Merge "Implement reading columns from SSTable 3 format" from Piotr " This patchset implements reading row columns from SSTable 3 format data file. Tests: units (release) " * 'haaawk/sstables3/read-columns-v4' of ssh://github.com/scylladb/seastar-dev: (21 commits) Add test for reading column values of different types. Support all fixed size column types from SSTable 3.x Add abstract_type::value_length_if_fixed Add test for simple table with value flat_reader_assertions: Add produces_row taking column values Implement reading rows and columns in data_consume_rows_context_m Introduce column_flags_m Add column_translation to data_consume_rows_context_m Pass schema to data_consume_context Add column_translation.hh consumer_m: Add consume methods for consuming rows and columns Extract make_atomic_cell from mp_row_consumer_k_l Rename NON_STATIC_ROW_* states to ROW_BODY_* Add liveness_info and use it in reading sstables Add helper methods for parsing simple types. Add unfiltered_flags_m::has_all_columns data_consume_context: use make_unique instead of new Pass serialization_header to data_consume_rows_context* Use disk_string_vint_size for bytes_array_vint_size Introduce disk_string_vint_size type ...	2018-05-24 10:11:25 +03:00
Takuya ASADA	e0d49aae37	dist/debian: fix missing --configfile parameter on pdebuild We need to specify --configfile on pdebuild too, otherwise we will always fail to build .deb on newly created build environment. Only reason why we still able to build .deb is we already copied .pbuilderrc to home directory on existing build environment. Fixes #3456 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180523204112.24669-1-syuu@scylladb.com>	2018-05-24 10:10:27 +03:00
Piotr Jastrzebski	7869bd98b1	Add test for reading column values of different types. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	a572d126e4	Support all fixed size column types from SSTable 3.x Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	7a25819e5a	Add abstract_type::value_length_if_fixed This info is used by SSTable 3.x format to read column values without reading their lengths. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	f58f10d708	Add test for simple table with value Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	0a5d06b2f3	flat_reader_assertions: Add produces_row taking column values Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	9348006092	Implement reading rows and columns in data_consume_rows_context_m Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	f6e1c38486	Introduce column_flags_m This will be used for reading columns from data file. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	609854e21a	Add column_translation to data_consume_rows_context_m Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	7fd222e639	Pass schema to data_consume_context It will be needed to obtain column_translation that will be added to data_consume_context in the next patch. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	d3f3cd36dd	Add column_translation.hh It contains a class that manages mapping between sstable columns and schema column definitions. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Piotr Jastrzebski	25b8cf9e4c	consumer_m: Add consume methods for consuming rows and columns Also implement them in mp_row_consumer_m. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:53:29 +02:00
Piotr Jastrzebski	94e3138dc5	Extract make_atomic_cell from mp_row_consumer_k_l It will be used in both mp_row_consumer_k_l and mp_row_consumer_m. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	c6d5ebc274	Rename NON_STATIC_ROW_* states to ROW_BODY_* New name describes the states in a better way as those states will be used both for static and non-static rows. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	10c669d2b5	Add liveness_info and use it in reading sstables Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	b2f9841dd4	Add helper methods for parsing simple types. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	d8cd8e04ed	Add unfiltered_flags_m::has_all_columns Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	51d079e17c	data_consume_context: use make_unique instead of new Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	54ef775501	Pass serialization_header to data_consume_rows_context* This header is needed to parse data for SSTable 3.0 format Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	b849eefc8c	Use disk_string_vint_size for bytes_array_vint_size Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:39:52 +02:00
Piotr Jastrzebski	76f0f2693d	Introduce disk_string_vint_size type Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:30:03 +02:00
Piotr Jastrzebski	5ca4bfd69a	disk_array_vint_size: Remove unused Size template parameter Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 16:15:44 +02:00
Duarte Nunes	4eb47d136b	Merge 'Introduce authorized_prepared_statements_cache' from Vlad " This series introduces a cache of already authenticated prepared statements which is meant to optimize the prepared statement lookup when authentication is enabled. This cache allows to perform a single cache lookup per EXECUTE operation as opposed to at least 2 lookups: one in the prepared statements cache and one in the authentication cache. Tests: - cql_query_test {debug, release}. - cassandra-stress with authentication enabled and with short eviction timeout. - Manual (with printouts) checks: - Tested the eviction due to eviction in the prepared_statements_cache: - Artificially decreased the prepared_statements_cache size and ran c-s with different keyspaces. - Verified that the corresponding authorized_prepared_statements_cache entry is evicted and re-populated. - Tested the BATCH of prepared statements (with dtest infrastructure): - Verified that for each prepared statement authorized_prepared_statements_cache is updated only once: - The batch contained a few entries of the same prepared statement. " * 'authorized_prepared_statements_cache-v3' of https://github.com/vladzcloudius/scylla: cql3: use authorized_prepared_statements_cache in the BATCH processing cql3::statements::batch_statement: introduce a single_statement class cql3: introduce the authorized_prepared_statements_cache class loading_shared_values: introduce the templated find() overload tests: loading_cache_test: add a tests for a loading_cache::remove(key)/remove(iterator) utils::loading_cache: add remove(key)/remove(iterator) methods cql3::query_processor: properly stop() prepared_statements_cache object	2018-05-23 14:40:09 +01:00
Avi Kivity	3dd2f68712	dist: drop libunwind dependency Since Seastar no longer (`1f005fb434`) requires libunwind, we can drop it from our dependency list. This helps the power build, for which no libunwind is available. Fixes #3453. Message-Id: <20180523114750.10753-1-avi@scylladb.com>	2018-05-23 13:53:29 +02:00
Avi Kivity	1f005fb434	Merge seastar upstream * seastar 5da5d4e...a48fe69 (1): > backtrace: drop libwind in favor of libc backtrace()	2018-05-23 14:42:14 +03:00
Duarte Nunes	eed09dfdf9	mutation_partition: Throw std::out_of_range with backtrace on cell_at Makes it easier to investigate bugs. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180521133753.16375-1-duarte@scylladb.com>	2018-05-23 13:51:54 +03:00
Avi Kivity	701e6f2cff	Merge "Implement backlog controller for TWCS" from Glauber " This series implements the backlog tracker for TWCS, allowing it to be controlled. The backlog for a TWCS colum family is just the sum of the SizeTiered backlogs for all the windows that we know about. A possible optimization for this is to stop tracking windows after they become old enough and revert to zero backlog. I reverted that last minute, though, since this will probably cause the backlog to completely misrepresent reality if we import SSTables into old buckets with things like repairs or nodetool refresh. " * 'twcs-backlog-v4.1' of github.com:glommer/scylla: backlog: implement backlog tracker for the TWCS STCS_backlog: allow users to query for the total bytes managed backlog: keep track of maximum timestamp in write monitor memtable: also keep track of max timestamp	2018-05-23 13:37:49 +03:00
Glauber Costa	44a89d654b	backlog: implement backlog tracker for the TWCS The TWCS backlog is relatively simple: we just need to keep track of which SSTable belong to which time window (and actually as usual, just their sizes). That is an easy thing to do since we can statically calculate the time bound from the timestamp. Once we do that we can just sum the backlogs for each individual window. Time windows that are well enough into the past can be at some point discarded when their backlogs become zero. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-05-23 06:20:21 -04:00
Nadav Har'El	433fc6c36e	keys.hh: simplify empty clustering-key check The exploded_clustering_prefix type has a convenient is_empty() method and an even more convenient "operator bool" shortcut. Unfortunately, the other clustering prefix types (clustering_key_prefix, clustering_key_prefix_view) have, for historic reasons, an is_empty method which takes a schema parameter. That also means they can't have an "operator bool" shortcut. But checking if a prefix doesn't really need the schema - all we need to check is whether the byte representation is empty. The result is simpler and more efficient code, and easier to use. It is also more consistent - all clustering-key-related types will have an "operator bool" instead of just some of them. To avoid massive code changes, we leave a is_empty(schema) variant, which simply calls is_empty(). There's already precedent for that - various methods which have a variant taking schema (and ignoring it) and one taking nothing. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180521174220.13262-1-nyh@scylladb.com>	2018-05-23 11:46:23 +02:00
Takuya ASADA	300af65555	dist/common/scripts/scylla_setup: abort running script when one of setup failed in silent mode Current script silently continues even one of setup fails, need to abort. Fixes #3433 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180522180355.1648-1-syuu@scylladb.com>	2018-05-23 11:05:33 +03:00
Vlad Zolotarov	82f7d1d006	cql3: use authorized_prepared_statements_cache in the BATCH processing Like with the EXECUTE command avoid authorizing the same prepared statement twice - this time in the context of processing the BATCH command. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-05-22 20:15:03 -04:00
Vlad Zolotarov	9723988926	cql3::statements::batch_statement: introduce a single_statement class This is a helper class needed to control the handling process of a single statement in the current batch. In particular it has the boolean defining if the authorization is needed for this statement. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-05-22 20:15:03 -04:00
Vlad Zolotarov	a138c59991	cql3: introduce the authorized_prepared_statements_cache class Add a cache that would store the checked weak pointer to already authorized prepared statements and which key is a tuple of an authenticated_user and key of the prepared_statements_cache. The entries will be held as long as the corresponding prepared statement is valid (cached) and will be discarded with the period equal to the refresh period of the permissions cache. Entries are also going to be discarded after 60 minutes if not used. The purpose of this new cache is to save the lookup in the permissions cache for already authenticated resource (whatever is needed to be authenticated for the particular prepared statement). This is meant to improve the cache coherency as well (since we are going to look in a single cache instead of two). Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-05-22 20:15:03 -04:00

1 2 3 4 5 ...

15497 Commits