scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-07 07:23:15 +00:00

Author	SHA1	Message	Date
Piotr Jastrzebski	96b880f81c	Add comment explaining tuple type name creation To keep format compatibiliti we never wrap tuple type name into "org.apache.cassandra.db.marshal.FrozenType(...)". Even when the tuple is frozen. This patch adds a comment in tuple_type_impl::make_name that explains the situation. For more details see #4087 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 12:14:26 +01:00
Piotr Jastrzebski	57e655d716	Add "FrozenType(...)" to UDT name only when it's frozen At the moment Scylla supports only frozen UDTs but the code should be able to handle non-frozen UDTs as well. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 12:08:02 +01:00
Piotr Jastrzebski	fc17bd376b	Move "FrozenType(...)" addition to UDT name to user_type_impl This logic belongs in types.hh/types.cc layer. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 12:07:47 +01:00
Piotr Jastrzebski	1fdfc461b8	Add "frozen<...>" to tuple CQL name only when it's frozen At the moment Scylla supports only frozen tuples but the code should be able to handle non-frozen tuples as well. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 11:14:30 +01:00
Piotr Jastrzebski	749eee2711	Move "frozen<...>" addition to tuple CQL name to tuple_type_impl This logic belongs in types.hh/types.cc layer. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 11:14:30 +01:00
Piotr Jastrzebski	7aba17de2c	Merge make_cql3_tuple_type into tuple_type_impl::as_cql3_type This logic belongs in types.hh/types.cc layer. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 11:14:30 +01:00
Piotr Jastrzebski	56060573bb	Add "frozen<...>" to UDT CQL name only when it's frozen At the moment Scylla supports only frozen UDTs but the code should be able to handle non-frozen UDTs as well. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 11:14:30 +01:00
Piotr Jastrzebski	a928c103c2	Move "frozen<...>" addition to UDT CQL name to user_type_impl This logic belongs in types.hh/types.cc layer. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-11 11:09:00 +01:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Yibo Cai (Arm Technology China)	422987ab04	utils: add fast ascii string validation Validate ascii string by ORing all bytes and check if 7-th bit is 0. Compared with original std::any_of(), which checks ascii string byte by byte, this new approach validates input in 8 bytes and two independent streams. Performance is much higher for normal cases, though slightly slower when string is very short. See table below. Speed(MB/s) of ascii string validation +---------------+-------------+---------+ \| String length \| std::any_of \| u64 x 2 \| +---------------+-------------+---------+ \| 9 bytes \| 1691 \| 1635 \| +---------------+-------------+---------+ \| 31 bytes \| 2923 \| 3181 \| +---------------+-------------+---------+ \| 129 bytes \| 3377 \| 15110 \| +---------------+-------------+---------+ \| 1039 bytes \| 3357 \| 31815 \| +---------------+-------------+---------+ \| 16385 bytes \| 3448 \| 47983 \| +---------------+-------------+---------+ \| 1048576 bytes \| 3394 \| 31391 \| +---------------+-------------+---------+ Signed-off-by: Yibo Cai <yibo.cai@arm.com> Message-Id: <1544669646-31881-1-git-send-email-yibo.cai@arm.com>	2018-12-24 09:58:08 +02:00
Yibo Cai (Arm Technology China)	6fadba56cc	utils: optimize UTF-8 validation UTF-8 string is now validated by boost::locale::conv::utf_to_utf, it actually does string conversions which is more than necessary. As observed on Arm server, UTF-8 validation can become bottleneck under heavy loads. This patch introduces a brand new SIMD implementation supporting both NEON and SSE, as well as a naive approach to handle short strings. The naive approach is 3x faster than boost utf_to_utf, whilst SIMD method outperforms naive approach 3x ~ 5x on Arm and x86. Details at https://github.com/cyb70289/utf8/. UTF-8 unit test is added to check various corner cases. Signed-off-by: Yibo Cai <yibo.cai@arm.com> Message-Id: <1543978498-12123-1-git-send-email-yibo.cai@arm.com>	2018-12-05 21:51:01 +02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	a71ab365e3	toplevel: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Avi Kivity	8db8c01fbe	types: get rid of PRId64 formatting It's not needed for out sprint() implementation, and gets in the way of converting all formatting to fmt.	2018-11-01 13:16:16 +00:00
Piotr Sarna	37a5c38471	types: enable deserializing varint from JSON string Previously deserialization failed because the JSON string representing a number was unnecessarily quoted. Fixes #3666 Message-Id: <a0a100dbac7c151d627522174303657d1da05c27.1534845398.git.sarna@scylladb.com>	2018-08-21 11:20:11 +01:00
Piotr Sarna	b3f438bfec	types: enable parsing numeric JSON values from string In order to be Cassandra-compatible, JSON values passed in INSERT JSON statement should accept string parameters for numeric types - int, double, etc. Fixes #3666 Message-Id: <4da9a2f68de31492a2e9432493663a62b138c2f2.1534153955.git.sarna@scylladb.com>	2018-08-13 23:57:37 +01:00
Piotr Sarna	9ba218c161	cql3: remove superfluous null conversions in to_json_string Some types checked when passed bytes argument was empty, and if so, returned "null" as a JSON string. Now, with to_json_string(bytes_opt) it's not needed anymore. Also, some types returned "null" instead of signaling a deserialization error.	2018-08-09 18:07:12 +02:00
Piotr Sarna	957cc712b6	cql3: enable parsing decimal JSON values from string In order to be Cassandra-compatible, decimal type should be parsable from both numeric values and strings. Fixes #3666	2018-08-09 18:07:12 +02:00
Piotr Sarna	d307b5712c	types: use value_to_quoted_string in JSON quoting In order to avoid regressions caused by external libraries, our own value_to_quoted_string implementation is used. Fixes #3622	2018-07-25 13:16:06 +02:00
Paweł Dziepak	a0c1c0c921	types: bytes_view: override fragmented validate() The default implementation linearises the buffer and calls validate(bytes_view). This is bad and not needed for bytes_type which doesn't do any validation anyway.	2018-07-18 12:28:06 +01:00
Piotr Sarna	90d323a522	types: add time_native_type CQL3's time_type didn't have any suitable native type, so time_native_type is introduced to serve that purpose.	2018-06-14 11:11:41 +02:00
Paweł Dziepak	e34ff8b4bf	treewide: require type for creating collection_mutation_view	2018-05-31 15:51:11 +01:00
Paweł Dziepak	aa25f0844f	atomic_cell: introduce fragmented buffer value interface As a prepratation for the switch to the new cell representation this patch changes the type returned by atomic_cell_view::value() to one that requires explicit linearisation of the cell value. Even though the value is still implicitly linearised (and only when managed by the LSA) the new interface is the same as the target one so that no more changes to its users will be needed.	2018-05-31 15:51:11 +01:00
Paweł Dziepak	418c159057	treewide: require type to copy atomic_cell	2018-05-31 15:51:11 +01:00
Paweł Dziepak	43b216b43d	types: provide information for IMR	2018-05-31 15:51:11 +01:00
Vladimir Krivopalov	3981dd6dd6	types: Treat byte_type as a variable-length type for compatibility reasons. Although values of the byte_type that corresponds to CQL TINYINT type always occupy only a single byte, Cassandra treats this it as a variable-length type for SSTables 3.0 reading and writing. While it is clearly a mistake at Cassandra side, we have to stay compatible. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-25 21:41:23 -07:00
Vladimir Krivopalov	24cb062834	types: Remove is_value_fixed() and use value_length_if_fixed() instead. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-25 21:41:23 -07:00
Piotr Jastrzebski	7a25819e5a	Add abstract_type::value_length_if_fixed This info is used by SSTable 3.x format to read column values without reading their lengths. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-23 19:54:16 +02:00
Paweł Dziepak	0b4c6b8938	types: make some collection_type_impl functions non-static The switch to the new in-memory representation will require a larger parts of the logic be aware of the type of the values they are dealing with. In most cases it is not a significant burden for the users.	2018-05-09 16:52:26 +01:00
Vladimir Krivopalov	36fe06fd3e	Make abstract_type::is_fixed_length() non-virtual. This method is called agressively through SSTable 3.0 read/write, we want to reasonably optimise it to no incur extra indirect calls. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <2d00ddecd112af867a30d3d6930c10165dd5af34.1524851530.git.vladimir@scylladb.com>	2018-04-27 20:57:46 +03:00
Vladimir Krivopalov	54bd74fda0	Add is_fixed_length() to data types. For any given CQL data type, this member returns whether its values are of fixed or variable length. This is used by SSTables 3.0 format to only store the length value for variable-length cells. For #1969. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-04-26 14:34:20 -07:00
Calle Wilund	b1edf75c8b	types: Make seastar::inet_address the "native" type for CQL inet. Fixes #3187 Requires seastar "inet_address: Add constructor and conversion function from/to IPv4" Implements support IPv6 for CQL inet data. The actual data stored will now vary between 4 and 16 bytes. gms::inet_address has been augumented to interop with seastar::inet_address, though of course actually trying to use an Ipv6 address there or in any of its tables with throw badly. Tests assuming ipv4 changed. Storing a ipv4_address should be transparent, as it now "widens". However, since all ipv4 is inet_address, but not vice versa, there is no implicit overloading on the read paths. I.e. tests and system_keyspace (where we read ip addresses from tables explicitly) are modified to use the proper type. Message-Id: <20180424161817.26316-1-calle@scylladb.com>	2018-04-24 23:12:07 +01:00
Vladimir Krivopalov	fc644a8778	Fix Scylla to compile with older versions of JsonCpp (<= 1.7.0). Old versions of JsonCpp declare the following typedefs for internally used aliases: typedef long long int Int64; typedef unsigned long long int UInt64; In newer versions (1.8.x), those are declared as: typedef int64_t Int64; typedef uint64_t UInt64; Those base types are not identical so in cases when a type has constructors overloaded only for specific integral types (such as Json::Value in JsonCpp or data_value in Scylla), an attempt to pack/unpack an integer from/to a JSON object causes ambiguous calls. Fixes #3208 Tests: unit {release}. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <e9fff9f41e0f34b15afc90b5439be03e4295623e.1524556258.git.vladimir@scylladb.com>	2018-04-24 10:58:38 +03:00
Piotr Sarna	1d40d2186e	cql3: add from_json_object function to types This commit adds a 'from_json_object' method which will be used for converting JSON representation of a value to raw bytes representing the same value. This functionality will be needed by 'INSERT JSON' clause implementation, which can turn these raw bytes into cql3::term. References #2058	2018-04-23 12:00:56 +02:00
Piotr Sarna	399ab1d455	cql3: add to_json_string function to types This commit adds a 'to_json_string' method which will be used for converting values to JSON strings. In several cases it's not sufficient to use 'to_string', e.g. actual strings need to be surrounded with double quotes. References #2058	2018-04-11 13:27:56 +02:00
Tomasz Grabiec	52c61df930	Relax includes To avoid unnecessary recompilations. Message-Id: <1522168295-994-1-git-send-email-tgrabiec@scylladb.com>	2018-03-28 10:49:07 +03:00
Avi Kivity	1193e7d2e2	Merge "CAST from integers to decimal" from Daniel "It turned out that decimal numbers that were obtained as cast from integers should always contain just one decimal place 0. This can be recognised especially when calculating avg(.) over such numbers because result contains just one decimal point. Fixes #3111." * 'danfiala/integers-to-decimal' of github.com:hagrid-the-developer/scylla: tests: Add test that decimal obtained as CAST from integer always contain one decimal place. types: Decimal that is obtained from integer always contain one decimal place.	2018-01-21 20:21:00 +02:00
Daniel Fiala	39a08cac6b	types: Decimal that is obtained from integer always contain one decimal place. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2018-01-21 17:37:24 +01:00
Daniel Fiala	0d71194da6	types: Added native types for timestamp and timeuuid. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2018-01-14 13:11:36 +01:00
Vladimir Krivopalov	6d76ac8043	Lift checks on list and map values to allow values of length > 64K. Fixes #3007 Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <7b232a655b5531d4bfa2be3d9611f8b1ba0349b0.1512021011.git.vladimir@scylladb.com>	2017-11-30 10:31:19 +02:00
Vladimir Krivopalov	61b1988aa1	Use meaningful error messages when throwing a marshal_exception Fixes #2977 Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <20171121005108.23074-1-vladimir@scylladb.com>	2017-11-21 16:05:43 +02:00
Daniel Fiala	f5629b3a23	types: Use std::pair instead of std::tuple to avoid compile-time error with explicit constructor. Fixes #2895. Signed-off-by: Daniel Fiala <daniel@scylladb.com> Message-Id: <20171017071316.2836-1-daniel@scylladb.com>	2017-10-17 12:32:43 +01:00
Daniel Fiala	61570e4a73	types:: Add support for CAST AS functions. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-10-07 21:04:40 +02:00
Daniel Fiala	e2c0a57ecf	types: Moved code that implements conversion of types' values to string. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-10-07 21:04:40 +02:00
Daniel Fiala	1133838b9f	types: Add data_type_for for varint and decimal, data_value constructor for simple_date_type. Signed-off-by: Daniel Fiala <daniel@scylladb.com> Message-Id: <20171004044040.21631-1-daniel@scylladb.com>	2017-10-04 10:52:57 +03:00
Daniel Fiala	19b21a0ab2	types: Allow 'T' as a date-time separator in timestamps. * Letter 'T' is specified in ISO 8601 and also in Cassandra documentation. Signed-off-by: Daniel Fiala <daniel@scylladb.com> Message-Id: <20171003073558.19257-1-daniel@scylladb.com>	2017-10-03 11:10:11 +03:00
Duarte Nunes	20337053ad	Don't use literal lambdas These are only available in C++17. Fixes the build after `b5460c2`. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-08-11 13:08:42 +02:00
Jesse Haber-Kucharsky	509626fe08	Support `duration` CQL native type `duration` is a new native type that was introduced in Cassandra 3.10 [1]. Support for parsing and the internal representation of the type was added in `8fa47b74e8`. Important note: The version of cqlsh distributed with Scylla does not have support for durations included (it was added to Cassandra in [2]). To test this change, you can use cqlsh distributed with Cassandra. Duration types are useful when working with time-series tables, because they can be used to manipulate date-time values in relative terms. Two interesting applications are: - Aggregation by time intervals [3]: `SELECT * FROM my_table GROUP BY floor(time, 3h)` - Querying on changes in date-times: `SELECT ... WHERE last_heartbeat_time < now() - 3h` (Note: neither of these is currently supported, though columns with duration values are.) Internally, durations are represented as three signed counters: one for months, for days, and for nanoseconds. Each of these counters is serialized using a variable-length encoding which is described in version 5 of the CQL native protocol specification. The representation of a duration as three counters means that a semantic ordering on durations doesn't exist: Is `1mo` greater than `1mo1d`? We cannot know, because some months have more days than others. Durations can only have a concrete absolute value when they are "attached" to absolute date-time references. For example, `2015-04-31 at 12:00:00 + 1mo`. That duration values are not comparable presents some difficulties for the implementation, because most CQL types are. Like in Cassandra's implementation [2], I adopted a similar strategy to the way restrictions on the `counter` type are checked. A type "references" a duration if it is either a duration or it contains a duration (like a `tuple<..., duration, ...>`, or a UDT with a duration member). The following restrictions apply on durations. Note that some of these contexts are either experimental features (materialized views), or not currently supported at run-time (though support exists in the parser and code, so it is prudent to add the restrictions now): - Durations cannot appear in any part of a primary key, either for tables or materialized views. - Durations cannot be directly used as the element type of a `set`, nor can they be used as the key type of a `map`. Because internal ordering on durations is based on a byte-level comparison, this property of Cassandra was intended to help avoid user confusion around ordering of collection elements. - Secondary indexes on durations are not supported. - "Slice" relations (<=, <, >=, >) are not supported on durations with `WHERE` restrictions (like `SELECT ... WHERE span <= 3d`). Multi-column restrictions only work with clustering columns, which cannot be `duration` due to the first rule. - "Slice" relations are not supported on durations with query conditions (like `UPDATE my_table ... IF span > 5us`). Backwards incompatibility note: As described in the documentation [4], duration literals take one of two forms: either ISO 8601 formats (there are three), or a "standard" format. The ISO 8601 formats start with "P" (like "P5W"). Therefore, identifiers that have this form are no longer supported. Fixes #2240. [1] https://issues.apache.org/jira/browse/CASSANDRA-11873 [2] `bfd57d13b7` [3] https://issues.apache.org/jira/browse/CASSANDRA-11871 [4] http://cassandra.apache.org/doc/latest/cql/types.html#working-with-durations	2017-08-10 15:01:10 -04:00
Duarte Nunes	3bfcf47cc6	types: Implement hash() for collections This patch provides a rather trivial implementation of hash() for collection types. It is needed for view building, where we hold mutations in a map indexed by partition keys (and frozen collection types can be part of the key). Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170718192107.13746-1-duarte@scylladb.com>	2017-07-19 09:52:56 +03:00
Gleb Natapov	a032078410	intern also tuple and user defined types Currently each time UDT or tuple is parsed new object is created. If those objects are used to create container type repeatedly it will cause memory leak since container types are interned, but lookup in the cache is done using pointer to a contained type (which will be always different for UDT and tuples). This patches interns also UDT and tuple, so each type the same object is parsed same pointer is also returned. Refs #2469 Fixes #2487 Message-Id: <20170612142942.GO21915@scylladb.com>	2017-06-14 14:41:17 +03:00

1 2 3 4 5

206 Commits