scylladb

Author	SHA1	Message	Date
Paweł Dziepak	374c8a56ac	commitlog: avoid copying column_mapping It is safe to copy column_mapping accros shards. Such guarantee comes at the cost of performance. This patch makes commitlog_entry_writer use IDL generated writer to serialise commitlog_entry so that column_mapping is not copied. This also simplifies commitlog_entry itself. Performance difference tested with: perf_simple_query -c4 --write --duration 60 (medians) before after diff write 79434.35 89247.54 +12.3%	2017-02-27 17:05:58 +00:00
Paweł Dziepak	9989239c97	idl: add idl description of consistency level	2017-02-02 10:35:14 +00:00
Paweł Dziepak	9f1ebd4f7c	idl/mutation: add counter serialisation logic	2017-02-02 10:35:14 +00:00
Paweł Dziepak	b8e29cc99c	idl: is_short_read() was added in 1.6	2016-12-22 13:35:04 +01:00
Duarte Nunes	19a76a82e8	frozen_schema: Support view schemas This patch allows a view schema to be frozen. To unfreeze such a schema, we add an is_view attribute to the schema idl. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-20 13:06:11 +00:00
Paweł Dziepak	43fe3439ca	reconcilable_result: properly propagate short_read flag reconcilable_result can be merged with another or transformed into query::result. Make sure that short_read information is never lost. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-12-14 14:10:02 +00:00
Paweł Dziepak	da7ca85040	query: allow short reads When paging is used the cluster is allowed to return less rows than the client asked for. However, if such possibility is used we need a way of telling that to the coordinator and the paging implementation so that they can differentiate between short reads caused by the replica running out of data to sent and short reads caused by any other means. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-12-14 14:10:01 +00:00
Avi Kivity	18078bea9b	storage_proxy: avoid calculating digest when only one replica is contacted If we're talking to just one replica, the digest is not going to be used, so better not to calculate it at all. The optimization helps with LOCAL_ONE queries where the result is large, but does not contain large blobs (many small rows). This patch adds a digest_algorithm parameter to the READ_DATA verb that can take on two values: none and MD5 (default), and sets it to none when we're reading from one replica. In the future we may add other values for more hardware-friendly digest algorithms. Message-Id: <1479380600-19206-1-git-send-email-avi@scylladb.com>	2016-11-17 13:04:30 +02:00
Avi Kivity	a35136533d	Convert ring_position and token ranges to be nonwrapping Wrapping ranges are a pain, so we are moving wrap handling to the edges. Since cql can't generate wrapping ranges, this means thrift and the ring maintenance code; also range->ring transformations need to merge the first and last ranges. Message-Id: <1478105905-31613-1-git-send-email-avi@scylladb.com>	2016-11-02 21:04:11 +02:00
Vlad Zolotarov	a491ac0f18	tracing: introduce a log_slow_query logic The main idea is to log queries that take "too long" to complete. The "too long" is above the given threshold. To achieve the above this patch does the following: - Introduce two new properties to the tracing::trace_state: - "Full tracing": when the tracing of this query was explicitly requested. In this state we will record all possible traces related to this query: both on the coordinator and on any replica involved. - "Log slow query": when slow query logging is enabled. If slow query logging is enabled and a session's "duration" is above the specified threshold we will create a record in the "slow queries log" and write all trace records created on the coordinator and on a replica if a replica's session lasts longer than that threshold. (We will propagate the Coordinator's slow query logging threshold to replicas in the context of a specific tracing/logging session). The properties above are independent, namely they may be enabled and/or disabled independently and any combination of them is legal (naturally, creating a tracing session when both states above are disabled makes no sense). - Instrument the tracing::tracing service to allow the following: - Enable/disable slow query logging. - Set/get the slow query duration threshold (in microseconds). - Set/get the slow query log record TTL value (in seconds). - Instrument the trace_keyspace_helper to write a slow query log entry when requested. - The slow query logging is disabled by default and the threshold is set to half a second. - The TTL of a slow log record is set to 86400 seconds by default. - It makes sense to use the same "slow query logging threshold" and a "slow query record TTL" both on a coordinator and on a replica Nodes in a context of the same tracing session: - Pass both TTL and a threshold to the replica in a trace_info. This patch also implements the new slow query logging specific logic: - Don't write the pending tracing records before the end of a tracing session until "duration" reaches the logging threshold. - Don't build the parameters<sstring, sstring> map unless we know we will write it to I/O. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-28 18:28:44 +03:00
Vlad Zolotarov	8609900621	tracing: introduce trace_state capabilities bit field - Instead of keeping separate booleans introduce a trace_state_props_set enum_set and pass it around instead of separate booleans. - Change the trace_info to hold this value in addition to write_on_close. Initialize a corresponding bit in an enum_set based on a write_on_close value in a trace_info constructor for a backward compatibility. - Separate a trace_state constructor into two: - For a primary session object. - For a secondary session object. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 18:34:36 +03:00
Paweł Dziepak	dcf794b04d	idl: make bytes compatible with bytes_ostream This patch makes idl type "bytes" compatible with both bytes and bytes_ostream. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Duarte Nunes	5161ea283f	query: query::clustering_range can't wrap around This patch changes the type of query::clustering_range to express that ranges that wrap around are not allowed, and ranges that have the start bound after the end bound are considered empty. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-08-15 14:50:20 +00:00
Tomasz Grabiec	d3658b33da	tests: Add test for skip() not doing full deserialization	2016-07-25 17:35:42 +02:00
Vlad Zolotarov	a5022a09a4	tracing: use 'write' instead of 'flush' and 'store' for consistency with seastar's API In names of functions and variables: s/flush_/write_/ s/store_/write_/ In a i_tracing_backend_helper: s/flush()/kick()/ Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:57 +03:00
Duarte Nunes	21d0a2c764	query: Optionally send cell ttl This patch adds support to send a cell's ttl as part of a query's result. This is needed for thrift support. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-07-14 15:36:23 +02:00
Paweł Dziepak	7e06499458	repair: convert hashing to streamed_mutations This patch makes hashing for repair calculate checksums in a way that doesn't require rebuilding whole mutation. Unfortunately, such checksums are incompatible with the old ones so the old way for computing checksums is preserved for compatibility reasons. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-13 09:51:23 +01:00
Duarte Nunes	69798df95e	query: Limit number of partitions returned This is required to implement a thrift verb. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-22 09:48:13 +02:00
Duarte Nunes	01b18063ea	query: Add per-partition row limit This patch as a per-partition row limit. It ensures both local queries and the reconciliation logic abide by this limit. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-22 09:46:51 +02:00
Duarte Nunes	6a111fdd01	mutations: Introduce the range_tombstone class This patch introduces the range_tombstone class, composed of a [start, end] pair of clustering_key_prefixes, the type of inclusiveness of each bound, and a tombstone. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:58 +02:00
Duarte Nunes	7f8c35dd8c	idl: Add range tombstone IDL This patch adds the range tombstone IDL, preserving backwards compatibility. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:36 +02:00
Duarte Nunes	e2812c1b7a	idl: Rename range_tombstone::key to start ... and make it a clustering_key_prefix, in preparation of supporting not-whole-row range tombstones. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:36 +02:00
Vlad Zolotarov	6e26909b02	query::read_command: add an optional trace_info field Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-06-01 20:17:19 +03:00
Vlad Zolotarov	a53d329b25	tracing: add a serializable trace_info object tracing::trace_info is used to pass the tracing information between nodes. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-06-01 20:16:53 +03:00
Gleb Natapov	1e6f64f4ab	query: add latest modification timestamp to result structure	2016-05-24 13:27:34 +03:00
Asias He	a6080773b3	gossip: Add SUPPORTED_FEATURES application_state It is used to negotiate cluster wide features.	2016-04-06 07:12:34 +08:00
Asias He	39992dd559	gossip: Sync gossip_digest.idl.hh and application_state.hh We did the clean up in idl/gossip_digest.idl.hh, but the patch to clean up gms/application_state.hh was never merged. To maintain compatibility with previous version of scylla, we can not change application_state.hh, instead change idl to be sync with application_state.hh. Message-Id: <3a78b159d5cb60bc65b354d323d163ce8528b36d.1458557948.git.asias@scylladb.com>	2016-03-21 13:07:22 +02:00
Paweł Dziepak	3efb10bd08	result.idl: keep digest together with result Result digest is going to be computed in query result builder and require information not available in the query resylt. That's why the digest now needs to be sent to the other nodes together with the result as they won't be able compute it on their own. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-11 18:27:13 +00:00
Paweł Dziepak	e194835d8a	tests/idl: add test for stdx::optional<> serialization Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1456761055-23916-1-git-send-email-pdziepak@scylladb.com>	2016-02-29 18:12:59 +02:00
Tomasz Grabiec	6cec131432	query: Switch to IDL-generated views and writers The query result footprint for cassandra-stress mutation as reported by tests/memory-footprint increased by 18% from 285 B to 337 B. perf_simple_query shows slight regression in throughput (-8%): build/release/tests/perf/perf_simple_query -c4 -m1G --partitions 100000 Before: ~433k tps After: ~400k tps	2016-02-26 12:26:13 +01:00
Paweł Dziepak	351c69b476	frozen_schema: use IDL-based serialization Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 23:12:00 +00:00
Paweł Dziepak	81f42415d4	schema_mutations: prepare for auto-generated serializers Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 23:12:00 +00:00
Paweł Dziepak	6c8b298ccd	canonical_mutation: prepare for auto-generated serializers Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 23:12:00 +00:00
Paweł Dziepak	89b75a02d4	commitlog: use IDL-based serialization for entries Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 23:11:59 +00:00
Paweł Dziepak	5a353486c6	canonical_mutation: switch to IDL-based serialization Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 23:11:31 +00:00
Paweł Dziepak	186061adef	mutation_partition: switch serialization to IDL-based one Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 21:49:08 +00:00
Paweł Dziepak	597ed15dfd	tests: add idl unit test Test auto-generated and writer-based serialization as well as deserialization of simple compound type, vectors and variants. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 21:19:30 +00:00
Amnon Heiman	38cd55e9cf	Adding the mutation idl This adds the mutation definition idl and add it to the compilation. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-02-17 18:42:09 +02:00
Tomasz Grabiec	5f756fcbe5	query: Add cql_format property to partition_slice It will specify in which format CQL values should be serialized. Will allow for rolling out new CQL binary protocol versions without stalling reads.	2016-02-15 17:05:55 +01:00
Avi Kivity	1f752446d2	Merge "Truncation format & fixes" from Calle "Fixes #884 Fixes #895 Also at seastar-dev: calle/truncate_more 1.) Change truncation records to be stored with IDL serialization 2.) Fix db::serializers encoding of replay_position 3.) Detect attempted reading of Origin truncation records, and instead of crashing, ignore and warn. 4.) Change truncation time stamps to be generated per-shard, _after_ CF flush is done, otherwise data in memtables at flush would be retained/replayed on next start. Retain the highest time stamp generated. Note for (3): This patch set does _not_ clear out origin records automatically. This because I feel that is a somewhat drastic and irreversible thing to do. If we want to avail the user of a means to get rid of the (3) warning, we should probably tell him to either use cqlsh, or add an API call for this, so he can do it explicitly. "	2016-02-15 11:39:56 +02:00
Tomasz Grabiec	3e2c1840d8	idl: Make key definitions independent of in-memory representation	2016-02-10 15:22:56 +01:00
Calle Wilund	dff89fffcd	IDL: Add idl definitions for replay_position and truncation_record	2016-02-09 13:55:33 +00:00
Gleb Natapov	b4b560e0fc	change result_digest to hold std::array instead of a std::vector Digest size if fixed, so no need to use std::vector to hold it. Message-Id: <20160203102530.GU6705@scylladb.com>	2016-02-03 12:27:39 +02:00
Gleb Natapov	10cd4d948c	Move result_digest to idl	2016-02-02 12:15:50 +02:00
Gleb Natapov	e6f7b12b51	Move partition_checksum to use idl	2016-02-02 12:15:49 +02:00
Gleb Natapov	60e3637efc	Move frozen_schema to idl	2016-02-02 12:15:49 +02:00
Gleb Natapov	b065e2003f	Move paging_state to use idl	2016-01-27 18:39:43 +02:00
Asias He	e8b8b454df	streaming: Flatten streaming messages class namespace There are only two messages: prepare_message and outgoing_file_message. Actually only the prepare_message is the message we send on wire. Flatten the namespace.	2016-01-26 13:04:29 +08:00
Asias He	b299cc3bee	idl: Add streaming.idl.hh - stream_request - stream_summary - prepare_message	2016-01-25 22:29:25 +08:00
Asias He	d94b7e49d2	idl: Add gossip_digest_syn Added get_partioner and get_cluster_id	2016-01-25 11:28:28 +08:00

1 2

56 Commits