scylladb

Author	SHA1	Message	Date
Vlad Zolotarov	ff55b76562	cql3::query_processor: use weak_ptr for passing the prepared statements around Use seastar::checked_ptr<weak_ptr<pepared_statement>> instead of shared_ptr for passing prepared statements around. This allows an easy tracking and handling of statements invalidation. This implementation will throw an exception every time an invalidated statement reference is dereferenced. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-12 12:24:03 -04:00
Avi Kivity	27c42359bc	Merge seastar upstream * seastar 6b21197...2ebe842 (6): > Merge "Various improvements to execution stages" from Paweł > app-template: allow apps to specify a name for help message > bool_class: avoid initializing object of incomplete type > app-template: make sure we can still get help with required options > prometheus: Http handler that returns prometheus 0.4 protobuf or text format > Update DPDK to 17.02 Includes patch from Pawel to adjust to updated execution_stage interface.	2017-03-26 10:50:21 +03:00
Paweł Dziepak	dce785089a	cql3: make modification statement an execution stage	2017-03-09 09:27:43 +00:00
Calle Wilund	0a4edca756	counters/cql: allow wormholing actual counter values (with shards) via cql Adds yet another magic function "SCYLLA_COUNTER_SHARD_LIST", indicating that argument value, which must be a list of tuples <int, UUID, long, long>, should be inserted as an actual counter value, not update. This of course to allow counters to be read from sstable loader. Note that we also need to allow timestamps for counter mutations, as well as convince the counter code itself to treat the data as already baked. So ugly wormhole galore. v2: * Changed flag names * More explicit wormholing, bypassing normal counter path, to avoid read-before-write etc * throw exceptions on unhandled shard types in marshalling v3: * Added counter id ordering check * Added batch statement check for mixing normal and raw counter updates Message-Id: <1487683665-23426-2-git-send-email-calle@scylladb.com>	2017-02-22 09:19:46 +00:00
Paweł Dziepak	1e8814f5ce	storage_proxy: support counter updates	2017-02-02 10:35:14 +00:00
Duarte Nunes	65535b3444	modification_statement: Check access for tables with views This patch checks for additional permissions when modifying a table with views, since that update will require reading from the table and writing into its views. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Duarte Nunes	5187fdbb3a	modification_statement: Views aren't updated directly This patch ensures that views cannot be modified directly through an insert or update statement. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Asias He	937f28d2f1	Convert to use dht::partition_range_vector and dht::token_range_vector	2016-12-19 14:08:50 +08:00
Asias He	e5485f3ea6	Get rid of query::partition_range Use dht::partition_range instead	2016-12-19 08:09:25 +08:00
Vlad Zolotarov	6e1d27bed1	cql3::query_processor: add a counter for a number of CQL modification requests ("writes") - Add a inserts, updates, deletes members to cql_stats. - Store cql_stats& in a modification_statement and increment the corresponding counter according to the value of a "type" field. - Store cql_stats& in a batch_statement and increment the statistics for each BATCH member. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-11-03 11:49:15 -04:00
Vlad Zolotarov	7606588267	cql3::query_processor: add cql_stats - Add cql_stats member. - Pass it to cql3::raw::parsed_statement::prepare() virtual method. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-11-03 11:48:57 -04:00
Vlad Zolotarov	abae3b05e7	tracing: set table_name in a modification statement Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	baa6496816	service::storage_proxy: READ instrumentation: store trace state object in abstract_read_executor Having a trace_state_ptr in the storage_proxy level is needed to trace code bits in this level. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:59 +03:00
Vlad Zolotarov	4c16df9e4c	service: instrument MUTATE flow with tracing Store the trace state in the abstract_write_response_handler. Instrument send_mutation RPC to receive an additional rpc::optional parameter that will contain optional<trace_info> value. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:58 +03:00
Tomasz Grabiec	7328a8eff8	cql: modification_statement: Avoid copying keyspace and table names Message-Id: <1468574135-4701-1-git-send-email-tgrabiec@scylladb.com>	2016-07-15 10:36:53 +01:00
Avi Kivity	10213c4211	cql3: extract raw modification_statement into raw sub-namespace	2016-05-31 20:53:37 +03:00
Avi Kivity	c8f98c5981	cql3: move cf_statement into raw hierarchy cql3::statements::cf_statement -> cql3::statements::raw::cf_statement Message-Id: <1464609556-3756-3-git-send-email-avi@scylladb.com>	2016-05-31 09:09:21 +03:00
Avi Kivity	caf8d4f0e6	cql3: separate parsed_statement and parsed_statment::prepared cql3::statements::parsed_statement -> cql3::statements::raw::parsed_statement cql3::statements::parsed_statement::prepared -> cql3::statements::prepared_statement Message-Id: <1464609556-3756-2-git-send-email-avi@scylladb.com>	2016-05-31 09:09:10 +03:00
Calle Wilund	3906dc9f0d	cql3::statements: Change check_access to future<> + implement	2016-04-19 11:49:05 +00:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	c518e852ee	modificiation_statement: Use result_view::do_with() Reduces code duplication. Message-Id: <1458336592-22065-1-git-send-email-tgrabiec@scylladb.com>	2016-03-20 15:14:28 +02:00
Tomasz Grabiec	63006e5dd2	query: Serialize collection cells using CQL format We want the format of query results to be eventually defined in the IDL and be independent of the format we use in memory to represent collections. This change is a step in this direction. The change decouples format of collection cells in query results from our in-memory representation. We currently use collection_mutation_view, after the change we will use CQL binary protocol format. We use that because it requires less transformations on the coordinator side. One complication is that some list operations need to retrieve keys used in list cells, not only values. To satisfy this need, new query option was added called "collections_as_maps" which will cause lists and sets to be reinterpreted as maps matching their underlying representation. This allows the coordinator to generate mutations referencing existing items in lists.	2016-02-15 17:05:55 +01:00
Tomasz Grabiec	383296c05b	cql3: Fix handling of lists with static columns List operations and prefetching were not handling static columns correctly. One issue was that prefetching was attaching static column data to row data using ids which might overlap with clustered columns. Another problem was that list operations were always constructing clustering key even if they worked on a static column. For static columns the key would be always empty and lookup would fail. The effect was that list operations which depend on curent state had no effect. Similar problem could be observed on C* 2.1.9, but not on 2.2.3. Fixes #903.	2016-02-15 17:05:55 +01:00
Tomasz Grabiec	04eb58159a	query: Add schema_version field to read_command	2016-01-11 10:34:51 +01:00
Pekka Enberg	227e517852	cql3: Move modification_statement implementation to source file	2015-12-18 13:29:58 +02:00
Avi Kivity	79f7431a03	db: change collection_mutation::{one,view} not to use nested classes Nested classes cannot be forward-declared, so change the naming not to use them. Follows atomic_cell{,_view}.	2015-11-13 17:13:07 +02:00
Pekka Enberg	1890d276b9	cql3: Add depends_on_{keyspace\|column_family} helper to cql_statement Signed-off-by: Pekka Enberg <penberg@scylladb.com>	2015-10-15 09:18:52 +03:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Paweł Dziepak	39e5f92433	cql3: modification_statement: avoid copying mutations Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-09-03 10:30:18 +02:00
Glauber Costa	4e83530c3f	do not "throw new" This is how Java does. But in C++, "throw new", although valid, would require the catcher to catch a pointer to the exception - which isn't really what we do. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-07-23 07:07:17 +03:00
Tomasz Grabiec	ad99e84505	storage_proxy: Take schema_ptr in query() It will be needed for reconciliation.	2015-07-12 12:54:38 +02:00
Pekka Enberg	86d913954a	db/legacy_schema_tables: Store CF "is_dense" to system tables Persist column family's "is_dense" value to system tables. Please note that we throw an exception if "is_dense" is null upon read. That needs to be fixed later by inferring the value from other information like Origin does. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-07-07 12:36:50 +02:00
Calle Wilund	41cbd0d267	Implement modification_statement::execute_internal	2015-07-06 08:21:15 +02:00
Calle Wilund	72ebb6360f	modification_statement bugfix: don't move out of shared pointer in loop	2015-07-05 16:04:41 +03:00
Paweł Dziepak	290a7ca1bf	query: add timestamp to read_command Read command needs a timestamp in order to determine which cells have already expired. Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-07-02 17:01:19 +02:00
Tomasz Grabiec	3779506990	db: query: Make partition_range hold ring_position Current model was not really correct because Origin doesn't support querying of partition ranges by their value. We can query slices according to dht::decorated_key ordering, which orders partitions first by token then by key value. ring_position encapsulates range constraint. Key value is optional, in which case only token is constrained.	2015-06-18 15:47:40 +02:00
Gleb Natapov	a338407e29	make storage_proxy object distributed storage_proxy holds per cpu state now to track clustering, so it has to be distributed otherwise smp setup does not work.	2015-06-17 15:14:06 +02:00
Gleb Natapov	b7155ad862	pass partitions_ranges separately from from read_command partitions_ranges will be manipulated upon to be split for different destination, so provide it separately from read_command to not copy the later for each destination.	2015-06-11 15:18:07 +03:00
Calle Wilund	15b8267dab	Add thrift_schema and placeholder "has_compound_comparator()" thrift_schema == place to collect thrift compatibility aspects of schema definition.	2015-06-03 10:13:53 +02:00
Calle Wilund	1631ce132e	Add "storage_proxy&" argument to cql_statement::validate To make db, schemas etc reachable	2015-06-03 10:13:52 +02:00
Avi Kivity	750543fc04	cql3: fix shared_ptr misuse in modification_statement A shared_ptr is mutable, so it must be thread_local, not static.	2015-06-01 17:31:57 +02:00
Tomasz Grabiec	731a63e371	schema: Embed raw_schema inside schema Public fields got encapsulated.	2015-04-24 18:01:01 +02:00
Avi Kivity	3d38708434	cql3: pass a database& instance to most foo::raw::prepare() variants To prepare a user-defined type, we need to look up its name in the keyspace. While we get the keyspace name as an argument to prepare(), it is useless without the database instance. Fix the problem by passing a database reference along with the keyspace. This precolates through the class structure, so most cql3 raw types end up receiving this treatment. Origin gets along without it by using a singleton. We can't do this due to sharding (we could use a thread-local instance, but that's ugly too). Hopefully the transition to a visitor will clean this up.	2015-04-20 16:15:34 +03:00
Tomasz Grabiec	00f99cefd4	db: split query.hh to reduce header dependencies	2015-04-15 20:44:59 +02:00
Tomasz Grabiec	878a740b9d	db: Write query results in serialized form This gives about 30% increase in tps in: build/release/tests/perf/perf_simple_query -c1 --query-single-key This patch switches query result format from a structured one to a serialized one. The problems with structured format are: - high level of indirection (vector of vectors of vectors of blobs), which is not CPU cache friendly - high allocation rate due to fine-grained object structure On replica side, the query results are probably going to be serialized in the transport layer anyway, so this change only subtracts work. There is no processing of the query results on replica other than concatenation in case of range queries. If query results are collected in serialized form from different cores, we can concatenate them without copying by simply appending the fragments into the packet. This optimization is not implemented yet. On coordinator side, the query results would have to be parsed from the transport layer buffers anyway, so this also doesn't add work, but again saves allocations and copying. The CQL server doesn't need complex data structures to process the results, it just goes over it linearly consuming it. This patch provides views, iterators and visitors for consuming query results in serialized form. Currently the iterators assume that the buffer is contiguous but we could easily relax this in future so that we can avoid linearization of data received from seastar sockets. The coordinator side could be optimized even further for CQL queries which do not need processing (eg. select * from cf where ...) we could make the replica send the query results in the format which is expected by the CQL binary protocol client. So in the typical case the coordinator would just pass the data using zero-copy to the client, prepending a header. We do need structure for prefetched rows (needed by list manipulations), and this change adds query result post-processing which converts serialized query result into a structured one, tailored particularly for prefetched rows needs. This change also introduces partition_slice options. In some queries (maybe even in typical ones), we don't need to send partition or clustering keys back to the client, because they are already specified in the query request, and not queried for. The query results hold now keys as optional elements. Also, meta-data like cell timestamp and ttl is now also optional. It is only needed if the query has writetime() or ttl() functions in it, which it typically won't have.	2015-04-15 20:44:50 +02:00
Tomasz Grabiec	7ebc7830b7	db: Optimize column family lookup in query path	2015-04-15 20:33:48 +02:00
Avi Kivity	b3f3c76dd8	cql3: fix overzealous move in modification_statement::get_mutations() 'keys' and 'prefix' are used twice in the same expression, and as the language does not guarantee any ordering in this case, any moves are illegal. Get rid of them.	2015-03-26 12:14:01 +02:00
Avi Kivity	b650383d67	cql3: implement read_required_rows() Some modification statements require reading rows before modifying them; implement it.	2015-03-26 12:14:01 +02:00
Tomasz Grabiec	e3422525c0	Use column_definition via const reference	2015-03-24 12:03:00 +01:00
Tomasz Grabiec	bdbd5547e3	db: Cleanup key names clustering_key::one -> clustering_key clustering_key::prefix::one -> clustering_key_prefix partition_key::one -> partition_key clustering_prefix -> exploded_clustering_prefix	2015-03-20 18:59:29 +01:00

1 2

72 Commits