scylladb

Author	SHA1	Message	Date
Daniel Fiala	f5629b3a23	types: Use std::pair instead of std::tuple to avoid compile-time error with explicit constructor. Fixes #2895. Signed-off-by: Daniel Fiala <daniel@scylladb.com> Message-Id: <20171017071316.2836-1-daniel@scylladb.com>	2017-10-17 12:32:43 +01:00
Daniel Fiala	61570e4a73	types:: Add support for CAST AS functions. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-10-07 21:04:40 +02:00
Daniel Fiala	e2c0a57ecf	types: Moved code that implements conversion of types' values to string. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-10-07 21:04:40 +02:00
Daniel Fiala	1133838b9f	types: Add data_type_for for varint and decimal, data_value constructor for simple_date_type. Signed-off-by: Daniel Fiala <daniel@scylladb.com> Message-Id: <20171004044040.21631-1-daniel@scylladb.com>	2017-10-04 10:52:57 +03:00
Daniel Fiala	19b21a0ab2	types: Allow 'T' as a date-time separator in timestamps. * Letter 'T' is specified in ISO 8601 and also in Cassandra documentation. Signed-off-by: Daniel Fiala <daniel@scylladb.com> Message-Id: <20171003073558.19257-1-daniel@scylladb.com>	2017-10-03 11:10:11 +03:00
Duarte Nunes	20337053ad	Don't use literal lambdas These are only available in C++17. Fixes the build after `b5460c2`. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-08-11 13:08:42 +02:00
Jesse Haber-Kucharsky	509626fe08	Support `duration` CQL native type `duration` is a new native type that was introduced in Cassandra 3.10 [1]. Support for parsing and the internal representation of the type was added in `8fa47b74e8`. Important note: The version of cqlsh distributed with Scylla does not have support for durations included (it was added to Cassandra in [2]). To test this change, you can use cqlsh distributed with Cassandra. Duration types are useful when working with time-series tables, because they can be used to manipulate date-time values in relative terms. Two interesting applications are: - Aggregation by time intervals [3]: `SELECT * FROM my_table GROUP BY floor(time, 3h)` - Querying on changes in date-times: `SELECT ... WHERE last_heartbeat_time < now() - 3h` (Note: neither of these is currently supported, though columns with duration values are.) Internally, durations are represented as three signed counters: one for months, for days, and for nanoseconds. Each of these counters is serialized using a variable-length encoding which is described in version 5 of the CQL native protocol specification. The representation of a duration as three counters means that a semantic ordering on durations doesn't exist: Is `1mo` greater than `1mo1d`? We cannot know, because some months have more days than others. Durations can only have a concrete absolute value when they are "attached" to absolute date-time references. For example, `2015-04-31 at 12:00:00 + 1mo`. That duration values are not comparable presents some difficulties for the implementation, because most CQL types are. Like in Cassandra's implementation [2], I adopted a similar strategy to the way restrictions on the `counter` type are checked. A type "references" a duration if it is either a duration or it contains a duration (like a `tuple<..., duration, ...>`, or a UDT with a duration member). The following restrictions apply on durations. Note that some of these contexts are either experimental features (materialized views), or not currently supported at run-time (though support exists in the parser and code, so it is prudent to add the restrictions now): - Durations cannot appear in any part of a primary key, either for tables or materialized views. - Durations cannot be directly used as the element type of a `set`, nor can they be used as the key type of a `map`. Because internal ordering on durations is based on a byte-level comparison, this property of Cassandra was intended to help avoid user confusion around ordering of collection elements. - Secondary indexes on durations are not supported. - "Slice" relations (<=, <, >=, >) are not supported on durations with `WHERE` restrictions (like `SELECT ... WHERE span <= 3d`). Multi-column restrictions only work with clustering columns, which cannot be `duration` due to the first rule. - "Slice" relations are not supported on durations with query conditions (like `UPDATE my_table ... IF span > 5us`). Backwards incompatibility note: As described in the documentation [4], duration literals take one of two forms: either ISO 8601 formats (there are three), or a "standard" format. The ISO 8601 formats start with "P" (like "P5W"). Therefore, identifiers that have this form are no longer supported. Fixes #2240. [1] https://issues.apache.org/jira/browse/CASSANDRA-11873 [2] `bfd57d13b7` [3] https://issues.apache.org/jira/browse/CASSANDRA-11871 [4] http://cassandra.apache.org/doc/latest/cql/types.html#working-with-durations	2017-08-10 15:01:10 -04:00
Duarte Nunes	3bfcf47cc6	types: Implement hash() for collections This patch provides a rather trivial implementation of hash() for collection types. It is needed for view building, where we hold mutations in a map indexed by partition keys (and frozen collection types can be part of the key). Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170718192107.13746-1-duarte@scylladb.com>	2017-07-19 09:52:56 +03:00
Gleb Natapov	a032078410	intern also tuple and user defined types Currently each time UDT or tuple is parsed new object is created. If those objects are used to create container type repeatedly it will cause memory leak since container types are interned, but lookup in the cache is done using pointer to a contained type (which will be always different for UDT and tuples). This patches interns also UDT and tuple, so each type the same object is parsed same pointer is also returned. Refs #2469 Fixes #2487 Message-Id: <20170612142942.GO21915@scylladb.com>	2017-06-14 14:41:17 +03:00
Avi Kivity	f5dae826ce	Merge "Migrate schema tables to v3 format" from Calle "Defines origin v3-format for system/schema tables, and use them for schema storage/retrival. Includes a legacy_schema_migrator implementation/port from origin. Note that since we don't support features like triggers, functions and aggregates, it will bail if encountering such a feature used. Note also that this patch set does not convert the "hints" and "backlog" tables, even though these have changed in v3 as well. That will be a separate patch set. Tested against dtests. Note that patches for dtest + ccm will follow." * 'calle/systemtables' of github.com:cloudius-systems/seastar-dev: (36 commits) legacy_schema_migrator: Actually truncate legacy schema tables on finish database: Extract "remove" from "drop_columnfamily" v3 schema test fixes thrift: Update CQL mapping of static CFs schema_tables: Use v3 schema tables and formats type_parser: Origin expects empty string -> bytes_type cf_prop_defs: Add crc_check_chance as recognized (even if we don't use) types_test: v3 style schemas enforce explicit "frozen" in tupes/ut:s cql3_type: v3 to_string cql_types: Introduce cql3_type::empty and associate with empty data_type schema: rename column accessors to be in line with origin schema: Add "is_static_compact_table" schema_builder: Add helper to generate unique column names akin origin schema: Add utility functions for static columns schema: Use heterogeneous comparator for columns bounds cql3_type_parser: Resolve from cql3 names/expressions cql3_type: Add "prepare_interal" and "references_user_type" cql3::cql3_type: Add prepare_internal path using only "local" holders cql3_type: Add virtual destructor. database/main: encapsulate system CF dir touching ...	2017-05-17 11:25:52 +03:00
Vlad Zolotarov	494ea82a88	utils::UUID: align the UUID serialization API with the similar API of other classes in the project The standard serialization API (e.g. in data_value) includes the following methods: size_t serialized_size() const; void serialize(bytes::iterator& it) const; bytes serialize() const; Align the utils::UUID API with the pattern above. The only addition is that we are going to make an output iterator parameter of a second method above a template so that we may serialize into different output sources. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-05-16 15:56:03 -04:00
Calle Wilund	c572a8c83c	cql_types: Introduce cql3_type::empty and associate with empty data_type	2017-05-10 16:44:48 +00:00
Duarte Nunes	4e693383f7	mutation_partion: Use row_tombstone This patch replaces the current row tombstone representation by a row_tombstone. The intent of the patch is thus to reify the idea of shadowable tombstones, that up until now we considered all materialized view row tombstones to be. We need to distinguish shadowable from non-shadowable row tombstones to support scenarios such as, when inserting to a table with a materialzied view: 1. insert into base (p, v1, v2) values (3, 1, 3) using timestamp 1 2. delete from base using timestamp 2 where p = 3 3. insert into base (p, v1) values (3, 1) using timestamp 3 These should yield a view row where v2 is definitely null, but with the current implementation, v2 will pop back with its value v2=3@TS=1, even though its dead in the base row. This is because the row tombstone inserted at 2) is a shadowable one. This patch only addresses the memory representation of such row_tombstones. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-04-25 11:46:33 +02:00
Raphael S. Carvalho	a6f8f4fe24	compaction: do not write expired cell as dead cell if it can be purged right away When compacting a fully expired sstable, we're not allowing that sstable to be purged because expired cell is unconditionally converted into a dead cell. Why not check if the expired cell can be purged instead using gc before and max purgeable timestamp? Currently, we need two compactions to get rid of a fully expired sstable which cells could have always been purged. look at this sstable with expired cell: { "partition" : { "key" : [ "2" ], "position" : 0 }, "rows" : [ { "type" : "row", "position" : 120, "liveness_info" : { "tstamp" : "2017-04-09T17:07:12.702597Z", "ttl" : 20, "expires_at" : "2017-04-09T17:07:32Z", "expired" : true }, "cells" : [ { "name" : "country", "value" : "1" }, ] now this sstable data after first compaction: [shard 0] compaction - Compacted 1 sstables to [...]. 120 bytes to 79 (~65% of original) in 229ms = 0.000328997MB/s. { ... "rows" : [ { "type" : "row", "position" : 79, "cells" : [ { "name" : "country", "deletion_info" : { "local_delete_time" : "2017-04-09T17:07:12Z" }, "tstamp" : "2017-04-09T17:07:12.702597Z" }, ] now another compaction will actually get rid of data: compaction - Compacted 1 sstables to []. 79 bytes to 0 (~0% of original) in 1ms = 0MB/s. ~2 total partitions merged to 0 NOTE: It's a waste of time to wait for second compaction because the expired cell could have been purged at first compaction because it satisfied gc_before and max purgeable timestamp. Fixes #2249, #2253 Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20170413001049.9663-1-raphaelsc@scylladb.com>	2017-04-13 10:59:19 +03:00
Duarte Nunes	61741a69b6	collection_type_impl: Use set difference for tombstones This patch fixes collection_type_impl::difference() so it does set difference for tombstones instead of just returning the larger one, as difference() is supposed to return only the information in mutation A that supersedes that in B, given difference(A, B). Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-03-15 14:34:01 +01:00
Duarte Nunes	19fcd2d140	collection_type_impl: A mutation with a tombstone is not empty This patch changes the collection_type_impl::is_empty() function so that it doesn't consider empty a collection_mutation which has a tombstone. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-03-15 14:34:01 +01:00
Paweł Dziepak	53d9a6f220	types: make counter_type_impl report its cql3_type	2017-02-02 10:35:14 +00:00
Paweł Dziepak	0c93d01232	atomic_cell: make sure upper level tombstones cover counters Support for deletion of counters is limited in a way that once deleted they cannot be used again (i.e. tombstone always wins, regardless of the timestamp). Logic responsible for merging two counter cells already makes sure that tombstones are handled properly, but it is also necessary to ensure that higher level tombstones always cover counters.	2017-02-02 10:35:14 +00:00
Paweł Dziepak	8cdffd7c57	time_type_impl: value initialize result parse_time() adds hourse, minutes, etc to a final value 'result'. However, it is of type std::chrono::nanoseconds which means it is not zeroed at initialization unless it is explicitly asked to do so. Fixed debug mode failures in types_tyes and cql_query_test. Message-Id: <20170125155239.1253-1-pdziepak@scylladb.com>	2017-01-25 17:56:31 +02:00
Pekka Enberg	93e6592296	cql3: TIME data type support This adds support for the TIME data type introduced in CQL 3.3.1. Refs #1284	2017-01-09 10:42:20 +02:00
Pekka Enberg	9def7db381	cql3: DATE type support This adds support for the DATE type introduced in CQL 3.3.1. Refs #1284	2017-01-09 10:42:20 +02:00
Pekka Enberg	fcaa743e3d	cql3: TINYINT and SMALLINT data type support This adds support for the TINYINT and SMALLINT data types introduced in CQL 3.3.1. Refs #1284	2017-01-05 10:57:35 +02:00
Pekka Enberg	257fa541f1	types: Fix integer_type_impl::parse_int() for bytes The integer_type_impl::parse_int() function uses boost::lexical_cast() under the hood, which parses 8-bit numbers as characters. Fix the function to lexical cast to 64-bit integer and convert the result to integer_type_impl template type.	2017-01-05 10:57:35 +02:00
Tomasz Grabiec	804fe50b7f	types: fix uuid_type_impl::less timeuuid_type_impl::compare_bytes is a "trichotomic" comparator (-1, 0, 1) while less() is a "less" comparator (false, true). The code incorrectly returns c1 instead of c1 < 0 which breaks the ordering. Fixes #1196. Message-Id: <1473956716-5209-1-git-send-email-tgrabiec@scylladb.com>	2016-09-16 11:06:55 +01:00
Paweł Dziepak	c220c676c8	types: honour end of sstring_view There are several places in types.cc where we assume that sstring_view range is null terminated. That may be not true and we should always use either begin()/end() or data()/size() pairs. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-09-07 14:30:56 -07:00
Tomasz Grabiec	ce768858f5	types: Fix update_types() We should replace the old type, not insert the new type before the old type. Fixes #1465 Message-Id: <1468861076-20397-1-git-send-email-tgrabiec@scylladb.com>	2016-07-18 20:14:22 +03:00
Paweł Dziepak	10c144ffd4	types: fix type aliasing violation Any pointer can be casted to char*, but not the other way around. This causes GCC6 to misoptimize timestamp_type_impl::from_string(). Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1468413349-27267-1-git-send-email-pdziepak@scylladb.com>	2016-07-13 17:22:16 +03:00
Tomasz Grabiec	8c4b5e4283	db: Avoiding checking bloom filters during compaction Checking bloom filters of sstables to compute max purgeable timestamp for compaction is expensive in terms of CPU time. We can avoid calculating it if we're not about to GC any tombstone. This patch changes compacting functions to accept a function instead of ready value for max_purgeable. I verified that bloom filter operations no longer appear on flame graphs during compaction-heavy workload (without tombstones). Refs #1322.	2016-07-10 09:54:20 +02:00
Nadav Har'El	c4e871ea2d	Work around unexpected data_value constructor If someone tried to naively use utf8_type->decompose("18wX"), this would mysteriously fail, returning an empty key. decompose takes a data_value, so the compiler looked for an implict conversion from the string constant (const char) to data_value. We did not have such a conversion, only conversion from sstring. But the compiler chose (backed by the C++ standard, no doubt) to implicitly convert the const char to a bool (!), and then use data_value(bool). It did not convert the const char* to an sstring, nor did it warn about the possible ambiguity. So this patch adds a data_value(const char*) constructor, so people will not fall into the same trap that I fell into... Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1467643462-6349-1-git-send-email-nyh@scylladb.com>	2016-07-04 17:50:53 +03:00
Gleb Natapov	5fef0717cc	query: find latest modification timestamp while calculating result digest	2016-05-24 13:27:34 +03:00
Duarte Nunes	bc90d6a730	udt: type_parser handles user defined types This patch ensures type_parser can handle user defined types. It also prefixes user_type_impl::make_name() with org.apache.cassandra.db.marshal.UserType. Fixes #631 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 18:07:07 +02:00
Duarte Nunes	3e663cfa9a	udt: Add capability to replace a user_type This patch adds a function to abstract_type that locates the usage of a given user_type and recursively returns an updated version of the containing type containing the updated user type. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 18:07:06 +02:00
Duarte Nunes	66c60f03fe	udt: Add references_user_type to abstract_type This patch adds a virtual function to the abstract_type hierarchy to tell whether a given type references the specified type. Needed to implement the drop and alter type statements. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 09:54:07 +02:00
Duarte Nunes	ddb4a4b29b	udt: Implement as_cql3_type for user_type_impl Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 09:54:06 +02:00
Duarte Nunes	fdddcfb3ea	udt: Fix user type compatibility check A new user type is checked for compatibility against the previous version of that type, so as to ensure that an updated field type is compatible with the previous field type (e.g., altering a field type from text to blob is allowed, but not the other way around). However, it is also possible to add new fields to a user type. So, when comparing a user type against its previous version, we should also allow the current, new type to be longer than the previous one. The current code instead allows for the previous type to be longer, which this patch fixes. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 09:54:06 +02:00
Duarte Nunes	0aeb4dcaaf	udt: Implement equals() for user_type_impl Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-04-20 09:54:06 +02:00
Duarte Nunes	f8d8dbdeb7	types: Don't wrap tombstone in an std::optional All the callers of do_serialize_mutation_form pass a valid tombstone that is converted into a non-empty optional. This happens even if the tombstone is empty (tombstone::timestamp == api::missing_timestamp). This patch fixes this by passing in a reference to the tombstone which is convertible to bool, based on whether it is empty or not. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1460620528-3628-1-git-send-email-duarte@scylladb.com>	2016-04-20 09:22:01 +02:00
Duarte Nunes	40c1b29701	cql3: Implement contains relation Although it doesn't work in the absence of secondary indexes, now we provide the same error messages as origin when trying to use the contains relation. Fixes #1158 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1461088626-26958-1-git-send-email-duarte@scylladb.com>	2016-04-20 09:22:25 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Piotr Jastrzebski	d3f91eec61	Implement tuple_type_impl::from_string This is a fix for: https://github.com/scylladb/scylla/issues/574 It mirrors the behavior of: org.apache.cassandra.db.marshal.TupleType.java#fromString Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <24a7d6253727d0faebb1df117c2f52410523d42f.1459843091.git.piotr@scylladb.com>	2016-04-05 16:00:18 +03:00
Paweł Dziepak	23ee493d91	types: make collection_type_impl::deserialize_mutation_form static Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-11 18:27:13 +00:00
Pekka Enberg	ab502bcfa8	types: Implement to_string for timestamps and dates The to_string() function is used for logging purpose so use boost to_iso_extended_string() to format both timestamps and dates. Fixes #968 (showstopper) Message-Id: <1457528755-6164-1-git-send-email-penberg@scylladb.com>	2016-03-09 14:08:33 +01:00
Paweł Dziepak	e332f95960	types: make serialize_mutation_form() static Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 21:47:42 +00:00
Tomasz Grabiec	9d11968ad8	Rename serialization_format to cql_serialization_format	2016-02-15 16:53:56 +01:00
Paweł Dziepak	dbb878d16e	Revert "do not use boost::multiprecision::msb()" This reverts commit `dadd097f9c`. That commit caused serialized forms of varint and decimal to have some excess leading zeros. They didn't affect deserialization in any way but caused computed tokens to differ from the Cassandra ones. Fixes #898. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1455537278-20106-1-git-send-email-pdziepak@scylladb.com>	2016-02-15 14:24:37 +02:00
Paweł Dziepak	900f5338e7	types: make timestamp_type and date_type compatible Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-01-19 14:03:15 +01:00
Paweł Dziepak	a6171d3e99	types: add date type to parse_type() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-01-19 13:43:36 +01:00
Paweł Dziepak	f77ab67809	types: use correct name for date_type Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-01-19 13:42:53 +01:00
Paweł Dziepak	440b6d058e	types: fix compatibility for text types bytes_type is_compatible_with utf8_type and ascii_type utf8_type is_compatible_with ascii_type Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-01-19 09:39:16 +01:00
Avi Kivity	78429ad818	types: implement collection compatibility checks compatible: can be cast, keeps sort order value-compatible: can be cast, may change sort order frozen: values participate in sort order unfrozen: only sort keys participate in sort order Fixes #740.	2016-01-04 11:02:21 +01:00

1 2 3 4

165 Commits