scylladb

Author	SHA1	Message	Date
Kefu Chai	0ae81446ef	./: not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16766	2024-01-17 16:30:14 +02:00
Avi Kivity	c5e4bf51bd	Introduce mutation/ module Move mutation-related files to a new mutation/ directory. The names are kept in the global namespace to reduce churn; the names are unambiguous in any case. mutation_reader remains in the readers/ module. mutation_partition_v2.cc was missing from CMakeLists.txt; it's added in this patch. This is a step forward towards librarization or modularization of the source base. Closes #12788	2023-02-14 11:19:03 +02:00
Avi Kivity	2739ac66ed	treewide: drop cql_serialization_format Now that we don't accept cql protocol version 1 or 2, we can drop cql_serialization format everywhere, except when in the IDL (since it's part of the inter-node protocol). A few functions had duplicate versions, one with and one without a cql_serialization_format parameter. They are deduplicated. Care is taken that `partition_slice`, which communicates the cql_serialization_format across nodes, still presents a valid cql_serialization_format to other nodes when transmitting itself and rejects protocol 1 and 2 serialization\ format when receiving. The IDL is unchanged. One test checking the 16-bit serialization format is removed.	2023-01-03 19:54:13 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Nadav Har'El	de21455dfe	Rename one logger which had a space in its name We had a logger called "query result log", with spaces, which made it impossible to enable it with the REST API due to missing percent decoding support in our HTTP server (see #9614). Although that HTTP server bug should be fixed as well (in Seastar - see scylladb/seastar#725), there is no good reason to have a logger name with a space in it. This is the only logger whose name has a space: We have 77 other loggers using underscores (_) in their name, and only 9 using hyphens (-). So in this patch we choose the more popular alternative - an underscore. Fixes #9614. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211205093732.1092553-1-nyh@scylladb.com>	2021-12-05 12:18:21 +02:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Botond Dénes	1a3ee71b39	treewide: use query_mutations() instead of mutation::query() We want to retire the latter.	2021-01-22 15:36:37 +02:00
Michał Chojnowski	0d5c5b8645	query-result-set: don't linearize in result_set_builder::deserialize We can deserialize directly from fragmented buffers now.	2020-12-04 09:19:39 +01:00
Wojciech Mitros	45215746fe	increase the maximum size of query results to 2^64 Currently, we cannot select more than 2^32 rows from a table because we are limited by types of variables containing the numbers of rows. This patch changes these types and sets new limits. The new limits take effect while selecting all rows from a table - custom limits of rows in a result stay the same (2^32-1). In classes which are being serialized and used in messaging, in order to be able to process queries originating from older nodes, the top 32 bits of new integers are optional and stay at the end of the class - if they're absent we assume they equal 0. The backward compatibility was tested by querying an older node for a paged selection, using the received paging_state with the same select statement on an upgraded node, and comparing the returned rows with the result generated for the same query by the older node, additionally checking if the paging_state returned by the upgraded node contained new fields with correct values. Also verified if the older node simply ignores the top 32 bits of the remaining rows number when handling a query with a paging_state originating from an upgraded node by generating and sending such a query to an older node and checking the paging_state in the reply(using python driver). Fixes #5101.	2020-08-03 17:32:49 +02:00
Botond Dénes	6660a5df51	result_memory_accounter: remove default constructor If somebody wants to bypass proper memory accounting they should at the very least be forced to consider if that is indeed wise and think a second about the limit they want to apply.	2020-07-28 18:00:29 +03:00
Rafael Ávila de Espíndola	2b45edd97e	query-result-set: Assert that we don't have null values Null values are represented with dead cells and never included in a result_set. To enforce that, this adds a non_null_data_value that wraps a data_value and whose constructor calls on_internal_error if a null data_value is passed. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-01-29 13:24:10 -08:00
Rafael Ávila de Espíndola	66290c3bb9	query-result-set: Avoid a copy during construction No functionality change. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-01-29 13:24:10 -08:00
Kamil Braun	270cf2b289	query-result-set: generalize result_set_builder to UDTs.	2019-10-25 12:04:44 +02:00
Tomasz Grabiec	ecff716f40	query-result-set: Give more context on failure We've seen schema application failing with marshal_exception here. That's not enough information to figure out what is the problem. Knowing which table and column is affected would make diagnosis much easier in certain cases. This patch wraps errors in query::deserialization_error with more information. Example output: query::deserialization_error (failed on column system_schema.tables#bloom_filter_fp_chance \ (version: c179c1d7-9503-3f66-a5b3-70e72af3392a, id: 0, index: 0, type: org.apache.cassandra.db.marshal.DoubleType):\ seastar::internal::backtraced<marshal_exception> (marshaling error: read_simple - not enough bytes (expected 8, got 3) Message-Id: <20190221113219.13018-1-tgrabiec@scylladb.com>	2019-02-21 11:35:27 +00:00
Piotr Jastrzebski	147cc031db	Move map_type_impl out of types.hh to types/map.hh Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-24 09:56:38 +01:00
Paweł Dziepak	4704c4efab	query::result: avoid copying and linearising cell value query::result_view already operates on views of a serialised query::result. However, until now the value of a cell was always linearised and copied. This patch makes use of ser::buffer_view to avoid that.	2018-06-25 09:21:47 +01:00
Duarte Nunes	6b4b429883	query-result: Introduce class result_options Introduce class result_options to carry result options through the request pipeline, which at this point mean the result type and the digest algorithm. This class allows us to encapsulate the concrete digest algorithm to use. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-02-01 00:22:50 +00:00
Paweł Dziepak	3fe5ed3cd9	query: use result_view::consume() where appropriate Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Gleb Natapov	2a00c06dd5	query: fix non full clustering key deserialization Clustering key prefix may have less columns than described in schema. Deserailiaztion should stop when end of buffer is reached. Message-Id: <20160503140420.GP23113@scylladb.com>	2016-05-04 17:42:28 +02:00
Avi Kivity	db03295c8a	Merge "Fix query digest mismatch" from Tomasz "Currently data query digest includes cells and tombstones which may have expired or be covered by higher-level tombstones. This causes digest mismatch between replicas if some elements are compacted on one of the nodes and not on others. This mismatch triggers read-repair which doesn't resolve because mutations received by mutation queries are not differing, they are compacted already. The fix adds compacting step before writing and digesting query results by reusing the algorithm used by mutation query. This is not the most optimal way to fix this. The compaction step could be folded with the query writing, there is redundancy in both steps. However such change carries more risk, and thus was postponed. perf_simple_query test (cassandra-stress-like partitions) shows regression from 83k to 77k (7%) ops/s. Fixes #1165."	2016-04-08 12:13:29 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	f15c380a4f	database: Compact mutations when executing data queries Currently data query digest includes cells and tombstones which may have expired or be covered by higher-level tombstones. This causes digest mismatch between replicas if some elements are compacted on one of the nodes and not on others. This mismatch triggers read-repair which doesn't resolve because mutations received by mutation queries are not differing, they are compacted already. The fix adds compacting step before writing and digesting query results by reusing the algorithm used by mutation query. This is not the most optimal way to fix this. The compaction step could be folded with the query writing, there is redundancy in both steps. However such change carries more risk, and thus was postponed. perf_simple_query test (cassandra-stress-like partitions) shows regression from 83k to 77k (7%) ops/s. Fixes #1165.	2016-04-07 19:56:58 +02:00
Paweł Dziepak	82d2a2dccb	specify whether query::result, result_digest or both are needed Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-11 18:27:13 +00:00
Tomasz Grabiec	63006e5dd2	query: Serialize collection cells using CQL format We want the format of query results to be eventually defined in the IDL and be independent of the format we use in memory to represent collections. This change is a step in this direction. The change decouples format of collection cells in query results from our in-memory representation. We currently use collection_mutation_view, after the change we will use CQL binary protocol format. We use that because it requires less transformations on the coordinator side. One complication is that some list operations need to retrieve keys used in list cells, not only values. To satisfy this need, new query option was added called "collections_as_maps" which will cause lists and sets to be reinterpreted as maps matching their underlying representation. This allows the coordinator to generate mutations referencing existing items in lists.	2016-02-15 17:05:55 +01:00
Tomasz Grabiec	5f756fcbe5	query: Add cql_format property to partition_slice It will specify in which format CQL values should be serialized. Will allow for rolling out new CQL binary protocol versions without stalling reads.	2016-02-15 17:05:55 +01:00
Tomasz Grabiec	9d11968ad8	Rename serialization_format to cql_serialization_format	2016-02-15 16:53:56 +01:00
Tomasz Grabiec	22254e94cc	query::result_set: Add constructor from mutation	2016-01-08 21:10:26 +01:00
Avi Kivity	2c3591cbd9	data_value de-any-fication We use boost::any to convert to and from database values (stored in serlialized form) and native C++ values. boost::any captures information about the data type (how to copy/move/delete etc.) and stores it inside the boost::any instance. We later retrieve the real value using boost::any_cast. However, data_value (which has a boost::any member) already has type information as a data_type instance. By teaching data_type intances about the corresponding native type, we can elimiante the use of boost::any. While boost::any is evil and eliminating it improves efficiency somewhat, the real goal is growing native type support in data_type. We will use that later to store native types in the cache, enabling O(log n) access to collections, O(1) access to tuples, and more efficient large blob support.	2015-10-30 17:38:51 +01:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Avi Kivity	deccb0f718	result_set: fix deserialization of collection types Collection cells in query results are serialized using the "mutation form", and must be deserialized using that format as well. Fixes "--logger-log-level storage_proxy=trace" crasher.	2015-09-03 20:39:30 +03:00
Tomasz Grabiec	9aec8b9591	query: result_set: Use type's to_string() to get prettier printout	2015-07-09 19:55:00 +02:00
Tomasz Grabiec	cb9b48d9be	query: result_set: Add static columns even if there are no clustered rows	2015-07-09 19:55:00 +02:00
Tomasz Grabiec	724099abe9	query: result_set_builder: Use the set of columns which was queried for This builder is the one used to build the convenient result_set (not on fast path). The builder was assuming that the whole set of columns was always queried, which resulted in buffer underflow exceptions during parsing of the results if this was not the case. Let's also handle queries which have narrower column sets.	2015-07-09 11:35:31 +03:00
Tomasz Grabiec	4c008e059a	result_set: Store schema pointer with result_set	2015-07-02 13:25:46 +02:00
Tomasz Grabiec	f388139a7b	result_set_builder: Move to source file	2015-07-02 13:25:46 +02:00
Tomasz Grabiec	a1f6dec067	result_set: Introduce from_raw_result() factory method	2015-07-02 13:25:46 +02:00
Tomasz Grabiec	c9e5508e3c	result_set_builder: Make build() return unwrapped object It's better to let the user decide which kind (if any) of smart pointer to wrap it into.	2015-07-02 13:25:46 +02:00
Pekka Enberg	ee3dbcd294	query-result-set: Add operator<< for result sets Add operator<< for result sets to make debugging schema merging code issues easier. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-06-05 12:59:57 +03:00
Pekka Enberg	446555f2de	query-result-set.hh: Use data_value instead of boost::any Switch to data_value in preparation for adding support for comparison operators. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-05-27 11:49:12 +03:00
Avi Kivity	6290dee438	db: const correctness for abstract_type and friends Types are immutable.	2015-04-29 15:40:38 +03:00
Pekka Enberg	f17a8a7a92	query: Add support for result sets Add a query::result_set class that contains per-row cells that can be accessed by column name. Partition keys, clustering keys, and static values are duplicated for every row for convenience. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-04-28 15:49:34 +03:00

41 Commits