scylladb

Author	SHA1	Message	Date
Vlad Zolotarov	baa6496816	service::storage_proxy: READ instrumentation: store trace state object in abstract_read_executor Having a trace_state_ptr in the storage_proxy level is needed to trace code bits in this level. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:59 +03:00
Vlad Zolotarov	4c16df9e4c	service: instrument MUTATE flow with tracing Store the trace state in the abstract_write_response_handler. Instrument send_mutation RPC to receive an additional rpc::optional parameter that will contain optional<trace_info> value. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:58 +03:00
Tomasz Grabiec	7328a8eff8	cql: modification_statement: Avoid copying keyspace and table names Message-Id: <1468574135-4701-1-git-send-email-tgrabiec@scylladb.com>	2016-07-15 10:36:53 +01:00
Avi Kivity	10213c4211	cql3: extract raw modification_statement into raw sub-namespace	2016-05-31 20:53:37 +03:00
Avi Kivity	c8f98c5981	cql3: move cf_statement into raw hierarchy cql3::statements::cf_statement -> cql3::statements::raw::cf_statement Message-Id: <1464609556-3756-3-git-send-email-avi@scylladb.com>	2016-05-31 09:09:21 +03:00
Avi Kivity	caf8d4f0e6	cql3: separate parsed_statement and parsed_statment::prepared cql3::statements::parsed_statement -> cql3::statements::raw::parsed_statement cql3::statements::parsed_statement::prepared -> cql3::statements::prepared_statement Message-Id: <1464609556-3756-2-git-send-email-avi@scylladb.com>	2016-05-31 09:09:10 +03:00
Calle Wilund	3906dc9f0d	cql3::statements: Change check_access to future<> + implement	2016-04-19 11:49:05 +00:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	c518e852ee	modificiation_statement: Use result_view::do_with() Reduces code duplication. Message-Id: <1458336592-22065-1-git-send-email-tgrabiec@scylladb.com>	2016-03-20 15:14:28 +02:00
Tomasz Grabiec	63006e5dd2	query: Serialize collection cells using CQL format We want the format of query results to be eventually defined in the IDL and be independent of the format we use in memory to represent collections. This change is a step in this direction. The change decouples format of collection cells in query results from our in-memory representation. We currently use collection_mutation_view, after the change we will use CQL binary protocol format. We use that because it requires less transformations on the coordinator side. One complication is that some list operations need to retrieve keys used in list cells, not only values. To satisfy this need, new query option was added called "collections_as_maps" which will cause lists and sets to be reinterpreted as maps matching their underlying representation. This allows the coordinator to generate mutations referencing existing items in lists.	2016-02-15 17:05:55 +01:00
Tomasz Grabiec	383296c05b	cql3: Fix handling of lists with static columns List operations and prefetching were not handling static columns correctly. One issue was that prefetching was attaching static column data to row data using ids which might overlap with clustered columns. Another problem was that list operations were always constructing clustering key even if they worked on a static column. For static columns the key would be always empty and lookup would fail. The effect was that list operations which depend on curent state had no effect. Similar problem could be observed on C* 2.1.9, but not on 2.2.3. Fixes #903.	2016-02-15 17:05:55 +01:00
Tomasz Grabiec	04eb58159a	query: Add schema_version field to read_command	2016-01-11 10:34:51 +01:00
Pekka Enberg	227e517852	cql3: Move modification_statement implementation to source file	2015-12-18 13:29:58 +02:00
Avi Kivity	79f7431a03	db: change collection_mutation::{one,view} not to use nested classes Nested classes cannot be forward-declared, so change the naming not to use them. Follows atomic_cell{,_view}.	2015-11-13 17:13:07 +02:00
Pekka Enberg	1890d276b9	cql3: Add depends_on_{keyspace\|column_family} helper to cql_statement Signed-off-by: Pekka Enberg <penberg@scylladb.com>	2015-10-15 09:18:52 +03:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Paweł Dziepak	39e5f92433	cql3: modification_statement: avoid copying mutations Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-09-03 10:30:18 +02:00
Glauber Costa	4e83530c3f	do not "throw new" This is how Java does. But in C++, "throw new", although valid, would require the catcher to catch a pointer to the exception - which isn't really what we do. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-07-23 07:07:17 +03:00
Tomasz Grabiec	ad99e84505	storage_proxy: Take schema_ptr in query() It will be needed for reconciliation.	2015-07-12 12:54:38 +02:00
Pekka Enberg	86d913954a	db/legacy_schema_tables: Store CF "is_dense" to system tables Persist column family's "is_dense" value to system tables. Please note that we throw an exception if "is_dense" is null upon read. That needs to be fixed later by inferring the value from other information like Origin does. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-07-07 12:36:50 +02:00
Calle Wilund	41cbd0d267	Implement modification_statement::execute_internal	2015-07-06 08:21:15 +02:00
Calle Wilund	72ebb6360f	modification_statement bugfix: don't move out of shared pointer in loop	2015-07-05 16:04:41 +03:00
Paweł Dziepak	290a7ca1bf	query: add timestamp to read_command Read command needs a timestamp in order to determine which cells have already expired. Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-07-02 17:01:19 +02:00
Tomasz Grabiec	3779506990	db: query: Make partition_range hold ring_position Current model was not really correct because Origin doesn't support querying of partition ranges by their value. We can query slices according to dht::decorated_key ordering, which orders partitions first by token then by key value. ring_position encapsulates range constraint. Key value is optional, in which case only token is constrained.	2015-06-18 15:47:40 +02:00
Gleb Natapov	a338407e29	make storage_proxy object distributed storage_proxy holds per cpu state now to track clustering, so it has to be distributed otherwise smp setup does not work.	2015-06-17 15:14:06 +02:00
Gleb Natapov	b7155ad862	pass partitions_ranges separately from from read_command partitions_ranges will be manipulated upon to be split for different destination, so provide it separately from read_command to not copy the later for each destination.	2015-06-11 15:18:07 +03:00
Calle Wilund	15b8267dab	Add thrift_schema and placeholder "has_compound_comparator()" thrift_schema == place to collect thrift compatibility aspects of schema definition.	2015-06-03 10:13:53 +02:00
Calle Wilund	1631ce132e	Add "storage_proxy&" argument to cql_statement::validate To make db, schemas etc reachable	2015-06-03 10:13:52 +02:00
Avi Kivity	750543fc04	cql3: fix shared_ptr misuse in modification_statement A shared_ptr is mutable, so it must be thread_local, not static.	2015-06-01 17:31:57 +02:00
Tomasz Grabiec	731a63e371	schema: Embed raw_schema inside schema Public fields got encapsulated.	2015-04-24 18:01:01 +02:00
Avi Kivity	3d38708434	cql3: pass a database& instance to most foo::raw::prepare() variants To prepare a user-defined type, we need to look up its name in the keyspace. While we get the keyspace name as an argument to prepare(), it is useless without the database instance. Fix the problem by passing a database reference along with the keyspace. This precolates through the class structure, so most cql3 raw types end up receiving this treatment. Origin gets along without it by using a singleton. We can't do this due to sharding (we could use a thread-local instance, but that's ugly too). Hopefully the transition to a visitor will clean this up.	2015-04-20 16:15:34 +03:00
Tomasz Grabiec	00f99cefd4	db: split query.hh to reduce header dependencies	2015-04-15 20:44:59 +02:00
Tomasz Grabiec	878a740b9d	db: Write query results in serialized form This gives about 30% increase in tps in: build/release/tests/perf/perf_simple_query -c1 --query-single-key This patch switches query result format from a structured one to a serialized one. The problems with structured format are: - high level of indirection (vector of vectors of vectors of blobs), which is not CPU cache friendly - high allocation rate due to fine-grained object structure On replica side, the query results are probably going to be serialized in the transport layer anyway, so this change only subtracts work. There is no processing of the query results on replica other than concatenation in case of range queries. If query results are collected in serialized form from different cores, we can concatenate them without copying by simply appending the fragments into the packet. This optimization is not implemented yet. On coordinator side, the query results would have to be parsed from the transport layer buffers anyway, so this also doesn't add work, but again saves allocations and copying. The CQL server doesn't need complex data structures to process the results, it just goes over it linearly consuming it. This patch provides views, iterators and visitors for consuming query results in serialized form. Currently the iterators assume that the buffer is contiguous but we could easily relax this in future so that we can avoid linearization of data received from seastar sockets. The coordinator side could be optimized even further for CQL queries which do not need processing (eg. select * from cf where ...) we could make the replica send the query results in the format which is expected by the CQL binary protocol client. So in the typical case the coordinator would just pass the data using zero-copy to the client, prepending a header. We do need structure for prefetched rows (needed by list manipulations), and this change adds query result post-processing which converts serialized query result into a structured one, tailored particularly for prefetched rows needs. This change also introduces partition_slice options. In some queries (maybe even in typical ones), we don't need to send partition or clustering keys back to the client, because they are already specified in the query request, and not queried for. The query results hold now keys as optional elements. Also, meta-data like cell timestamp and ttl is now also optional. It is only needed if the query has writetime() or ttl() functions in it, which it typically won't have.	2015-04-15 20:44:50 +02:00
Tomasz Grabiec	7ebc7830b7	db: Optimize column family lookup in query path	2015-04-15 20:33:48 +02:00
Avi Kivity	b3f3c76dd8	cql3: fix overzealous move in modification_statement::get_mutations() 'keys' and 'prefix' are used twice in the same expression, and as the language does not guarantee any ordering in this case, any moves are illegal. Get rid of them.	2015-03-26 12:14:01 +02:00
Avi Kivity	b650383d67	cql3: implement read_required_rows() Some modification statements require reading rows before modifying them; implement it.	2015-03-26 12:14:01 +02:00
Tomasz Grabiec	e3422525c0	Use column_definition via const reference	2015-03-24 12:03:00 +01:00
Tomasz Grabiec	bdbd5547e3	db: Cleanup key names clustering_key::one -> clustering_key clustering_key::prefix::one -> clustering_key_prefix partition_key::one -> partition_key clustering_prefix -> exploded_clustering_prefix	2015-03-20 18:59:29 +01:00
Tomasz Grabiec	49b7a166a8	keys: Make key components non-optional	2015-03-19 14:54:41 +01:00
Avi Kivity	bdd188f459	Merge branch 'tgrabiec/select' of github.com:cloudius-systems/seastar-dev into db Preparation for range queries, from Tomasz: "This series adds static typic for different key variants. It also changes clustered row map to boost implementation which allows to use heterogenous keys, so that we can lookup a row by a full prefix without reserializing it. Similar change is made to row prefix tombstones."	2015-03-18 12:58:01 +02:00
Pekka Enberg	b40661c330	cql3: Use shared_ptr for prepared statements Query processor needs to store prepared statements as part of a client session for PREPARE and EXECUTE requests. Switch from unique_ptr to shared_ptr in preparation for that. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-03-18 10:49:41 +02:00
Tomasz Grabiec	1b1af8cdfd	db: Introduce types to hold keys Holding keys and their prefixes as "bytes" is error prone. It's easy to mix them up (or use wrong types). This change adds wrappers for keys with accessors which are meant to make misuses as difficult as possible. Prefix and full keys are now distinguished. Places which assumed that the representation is the same (it currently is) were changed not to do so. This will allow us to introduce more compact storage for non-prefix keys.	2015-03-17 15:56:29 +01:00
Tomasz Grabiec	1f6360ec3b	cql3: Drop redundant key validation Keys are also validated right before in build_partition_keys().	2015-03-17 15:56:28 +01:00
Tomasz Grabiec	1a0ffdfb99	schema: Encapsulate column sets	2015-02-27 10:48:56 +01:00
Tomasz Grabiec	b77367dabe	cql3: Simplify primary key membership checks	2015-02-27 10:48:56 +01:00
Tomasz Grabiec	609e893055	unimplemented: Separate subject from behavior You can now do: fail(unimplemented::cause::PAGING); and: warn(unimplemented::cause::PAGING);	2015-02-27 10:48:56 +01:00
Tomasz Grabiec	0293b151dc	cql3: Fix bug in modification_statement::process_where_clause()	2015-02-12 19:40:58 +01:00
Tomasz Grabiec	43300e9998	cql3: Use find() instead of [] when looking up processed keys find() is sufficient and it has less surprising side effects. This doesn't fix any issue.	2015-02-12 19:40:58 +01:00
Tomasz Grabiec	f3130d395f	cql3: Return shared_ptr<result_message> instead of optional It's polymorphic type in Origin.	2015-02-12 19:40:57 +01:00
Tomasz Grabiec	524c6a4e40	cql3: Implement modification_statement::validate()	2015-02-12 19:40:57 +01:00

1 2

60 Commits