scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 20:46:56 +00:00

Author	SHA1	Message	Date
Nadav Har'El	de1171181c	user defined types: fix support for case-sensitive type names In the current code, support for case-sensitive (quoted) user-defined type names is broken. For example, a test doing: CREATE TYPE "PHone" (country_code int, number text) CREATE TABLE cf (pk blob, pn "PHone", PRIMARY KEY (pk)) Fails - the first line creates the type with the case-sensitive name PHone, but the second line wrongly ends up looking for the lowercased name phone, and fails with an exception "Unknown type ks.phone". The problem is in cql3_type_name_impl. This class is used to convert a type object into its proper CQL syntax - for example frozen<list<int>>. The problem is that for a user-defined type, we forgot to quote its name if not lowercase, and the result is wrong CQL; For example, a list of PHone will be written as list<PHone> - but this is wrong because the CQL parser, when it sees this expression, lowercases the unquoted type name PHone and it becomes just phone. It should be list<"PHone">, not list<PHone>. The solution is for cql3_type_name_impl to use for a user-defined type its get_name_as_cql_string() method instead of get_name_as_string(). get_name_as_cql_string() is a new method which prints the name of the user type as it should be in a CQL expression, i.e., quoted if necessary. The bug in the above test was apparently caused when our code serialized the type name to disk as the string PHone (without any quoting), and then later deserialized it using the CQL type parser, which converted it into a lowercase phone. With this patch, the type's name is serialized as "PHone", with the quotes, and deserialized properly as the type PHone. While the extra quotes may seem excessive, they are necessary for the correct CQL type expression - remember that the type expression may be significantly more complex, e.g., frozen<list<"PHone">> and all of this, including the quotes, is necessary for our parser to be able to translate this string back into a type object. This patch may cause breakage to existing databases which used case- sensitive user-defined types, but I argue that these use cases were already broken (as demonstrated by this test) so we won't break anything that actually worked before. Fixes #5544 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200101160805.15847-1-nyh@scylladb.com>	2020-01-03 15:48:20 +02:00
Rafael Ávila de Espíndola	5417c5356b	types: Move get_castas_fctn to cql3 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191120181213.111758-9-espindola@scylladb.com>	2019-11-21 12:08:50 +02:00
Rafael Ávila de Espíndola	f06d6df4df	types: Simplify casts to string These now just use the to_string member functions, which makes it possible to move the code to another file. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191120181213.111758-8-espindola@scylladb.com>	2019-11-21 12:08:50 +02:00
Rafael Ávila de Espíndola	786b1ec364	types: Move json code to its own file Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191120181213.111758-7-espindola@scylladb.com>	2019-11-21 12:08:49 +02:00
Rafael Ávila de Espíndola	af8e207491	types: Avoid using deserialize_value in json code This makes it independent of internal functions and makes it possible to move it to another file. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191120181213.111758-6-espindola@scylladb.com>	2019-11-21 12:08:49 +02:00
Rafael Ávila de Espíndola	ed65e2c848	types: Move cql3_kind to the cql3 directory Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191120181213.111758-5-espindola@scylladb.com>	2019-11-21 12:08:47 +02:00
Rafael Ávila de Espíndola	bd560e5520	types: Fix dynamic types of some data_value objects I found these mismatched types while converting some member functions to standalone functions, since they have to use the public API that has more type checks. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191120181213.111758-4-espindola@scylladb.com>	2019-11-21 12:08:46 +02:00
Rafael Ávila de Espíndola	9208b2f498	Lua: Implement support for returning inet Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	64be94ab01	Lua: Implement support for inet arguments Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	faf029d472	Lua: Implement support for returning time Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	484f498534	Lua: Implement support for returning timeuuid Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	9c2daf6554	Lua: Implement support for returning uuid Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	f8aeed5beb	Lua: Implement support for returning date Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	63bc960152	Lua: Implement support for returning timestamp Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	0d9d53b5da	Lua: Implement support for counter arguments Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:41:08 -08:00
Rafael Ávila de Espíndola	9bf9a84e4d	types: Move the data_value visitor to a header It will be used by the UDF implementation. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-11-07 08:19:52 -08:00
Rafael Ávila de Espíndola	c74864447b	types: Simplify validate_visitor for strings We have different types for ascii and utf8, so there is no need for an extra if. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191024232911.22700-1-espindola@scylladb.com>	2019-10-29 11:02:55 +02:00
Kamil Braun	612de1f4e3	types: handle trailing nulls in tuples/UDTs better. Comparing user types after adding new fields was bugged. In the following scenario: create type ut (a int); create table cf (a int primary key, b frozen<ut>); insert into cf (a, b) values (0, (0)); alter type ut add b int; select * from cf where b = {a:0,b:null}; the row with a = 0 should be returned, even though the value stored in the database is shorter (by one null) than the value given by the user. Until now it wouldn't have.	2019-10-25 12:04:44 +02:00
Kamil Braun	abe6c2d3d2	types: introduce to_bytes_opt_vec function. It converts a vector<bytes_view_opt> to a vector<bytes_opt>. Used in a bunch of places.	2019-10-25 12:04:44 +02:00
Kamil Braun	a8c7670722	types: add multi_cell field to user_type_impl. is_value_compatible_with_internal and update_user_type were generalized to the non-frozen case. For now, all user_type_impls in the code are non-multi-cell (frozen). This will be changed in future commits.	2019-10-25 12:04:44 +02:00
Kamil Braun	a3a2f65fbf	types: generalize serialize_for_cql to UDTs. Also introduces a helper "linearized" function, which implements a pattern occurring in all serialize_for_cql_aux functions.	2019-10-25 12:04:44 +02:00
Kamil Braun	4327bba0db	types: introduce `(de)serialize_field_index` functions. These functions are used to translate field indices, which are used to identify fields inside UDTs, from/to a serialized representation to be stored inside sstables and mutations. They do it in a way that is compatible with C*.	2019-10-25 10:49:19 +02:00
Kamil Braun	4374982de0	types: collection_type_impl::to_value becomes serialize_for_cql. The purpose of collection_type_impl::to_value was to serialize a collection for sending over CQL. The corresponding function in origin is called serializeForNativeProtocol, but the name is a bit lengthy, so I settled for serialize_for_cql. The method now became a free-standing function, using the visit function to perform a dispatch on the collection type instead of a virtual call. This also makes it easier to generalize it to UDTs in future commits. Remove the old serialize_for_native_protocol with a FIXME: implement inside. It was already implemented (to_value), just called differently. remove dead methods: enforce_limit and serialized_values. The corresponding methods in C* are auxiliary methods used inside serializeForNativeProtocol. In our case, the entire algorithm is wholly written in serialize_for_cql.	2019-10-25 10:49:19 +02:00
Kamil Braun	d8f8908d34	types: introduce user_type_impl::idx_of_field method. Each field of a user type has its index inside the type. This method allows to find it easily, which is needed in a bunch of places.	2019-10-25 10:42:58 +02:00
Kamil Braun	bbdb438d89	collection_mutation: easier (de)serialization of collection_mutation(s). `collection_type_impl::serialize_mutation_form` became `collection_mutation(_view)_description::serialize`. Previously callers had to cast their data_type down to collection_type to use serialize_mutation_form. Now it's done inside `serialize`. In the future `serialize` will be generalized to handle UDTs. `collection_type_impl::deserialize_mutation_form` became a free standing function `deserialize_collection_mutation` with similiar benefits. Actually, noone needs to call this function manually because of the next paragraph. A common pattern consisting of linearizing data inside a `collection_mutation_view` followed by calling `deserialize_mutation_form` has been abstracted out as a `with_deserialized` method inside collection_mutation_view. serialize_mutation_form_only_live was removed, because it hadn't been used anywhere.	2019-10-25 10:42:58 +02:00
Kamil Braun	b1d16c1601	types: move collection_type_impl::mutation(_view) out of collection_type_impl. collection_type_impl::mutation became collection_mutation_description. collection_type_impl::mutation_view became collection_mutation_view_description. These classes now reside inside collection_mutation.hh. Additional documentation has been written for these classes. Related function implementations were moved to collection_mutation.cc. This makes it easier to generalize these classes to non-frozen UDTs in future commits. The new names (together with documentation) better describe their purpose.	2019-10-25 10:19:45 +02:00
Kamil Braun	c0d3e6c773	atomic_cell: move collection_mutation(_view) to a new file. The classes 'collection_mutation' and 'collection_mutation_view' were moved to a separate header, collection_mutation.hh. Implementations of functions that operate on these classes, including some methods of collection_type_impl, were moved to a separate compilation unit, collection_mutation.cc. This makes it easier to modify these structures in future commits in order to generalize them for non-frozen User Defined Types. Some additional documentation has been written for collection_mutation.	2019-10-25 10:19:45 +02:00
Konstantin Osipov	a30c08e04e	lwt: support for multi-cell set & list value serialization	2019-10-22 17:40:42 +03:00
Konstantin Osipov	605755e3f6	lwt: support for multi-cell map & list comparison with literal values Multi-cell lists and maps may be stored in different formats: as sorted vectors of pairs of values, when retreived from storage, or as sorted vectors of values, when created from parser literals or supplied as parameter values. Implement a specialized compare for use when receiver and paramter representation don't match. Add helpers.	2019-10-22 17:07:33 +03:00
Rafael Ávila de Espíndola	1d9ba4c79b	types: Simplify and explain from_varint_to_integer This simplifies the implementation of from_varint_to_integer and avoids using the fact that a static_cast from cpp_int to uint64_t seems to just keep the low 64 bits. The boost release notes (https://www.boost.org/users/history/version_1_67_0.html) implies that the conversion function should return the maximum value a uint64_t can hold if the original value is too large. The idea of using a & with ~0 is a suggestion from the boost release notes. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-15 14:44:54 -07:00
Avi Kivity	5663218fac	Merge "types: Fix decimal to integer and varint to integer conversion" from Rafael " The release notes for boost 1.67.0 includes: Breaking Change: When converting a multiprecision integer to a narrower type, if the value is too large (or negative) to fit in the smaller type, then the result is either the maximum (or minimum) value of the target Since we just moved out of boost 1.66, we have to update our code. This fixes issue #4960 " * 'espindola/fix-4960' of https://github.com/espindola/scylla: types: fix varint to integer conversion types: extract a from_varint_to_integer from make_castas_fctn_from_decimal_to_integer types: fix decimal to integer conversion types: extract helper for converting a decimal to a cppint types: rename and detemplate make_castas_fctn_from_decimal_to_integer	2019-09-08 10:45:42 +03:00
Rafael Ávila de Espíndola	3bac4ebac7	types: Reduce duplication around date_type_impl According to the comments, the only different between date_type_impl and timestamp_type_impl is the comparison function. This patch makes that explicit by merging all code paths except: * The warning when converting between the two * The compare function The date_type_impl type can still be user visible via very old sstables or via the thrift protocol. It is not clear if we still need to support either, but with this patch it is easy to do so. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	36d40b4858	types: Don't use date_type_native_type when we want a timestamp In these cases it is pretty clear that the original code wanted to create a timestamp_type data_value but was creating a date_type one because of the old defaults. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	01cd21c04d	types: Remove timestamp_native_type Now that we know that anything expecting a date_type has been converted to date_type_native_type, switch to using db_clock::time_point when we want a timestamp_type. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	e09fa2dcff	types: Make it harder to create date_type date_type was replaced with timestamp_type, but it was very easy to create a date_type instead of a timestamp_type by accident. This patch changes the code so that a date_type is no longer implicitly used when constructing a data_value. All existing code that was depending on this is converted to explicitly using date_type_native_type. A followup patch will convert to timestamp_type when appropriate. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	dd81e94684	types: fix varint to integer conversion The previous code was using the boost::multiprecision::cpp_int to integer conversion, but that doesn't have the same semantics an cql for signed numbers. This fixes the dtest cql_cast_test.py:CQLCastTest.cast_varint_test. Fixes #4960 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:14 -07:00
Rafael Ávila de Espíndola	263e18b625	types: extract a from_varint_to_integer from make_castas_fctn_from_decimal_to_integer It will be used when converting varint to integer too. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:14 -07:00
Rafael Ávila de Espíndola	2d453b8e17	types: fix decimal to integer conversion The previous code was using the boost::multiprecision::cpp_rational to integer conversion, but that doesn't have the same semantics an cql. This patch avoids creating a cpp_rational in the first place and works just with integers. This fixes the dtest cql_cast_test.py:CQLCastTest.cast_decimal_test. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:14 -07:00
Rafael Ávila de Espíndola	fb760774dd	types: extract helper for converting a decimal to a cppint It will also be used in the decimal to integer conversion. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:07 -07:00
Rafael Ávila de Espíndola	40e6882906	types: rename and detemplate make_castas_fctn_from_decimal_to_integer It was only ever used for varint. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 14:54:47 -07:00
Rafael Ávila de Espíndola	b100f95adc	types: optimize type find implementation This turns find into a template so there is only one switch over the kind of each type in the search. To evaluate the change in code size sizes, I added [[noinline]] to find and obtained the following results. The release columns for release in the before case have an extra column because the functions are sufficiently complex to trigger gcc to split them in hot + cold. before: dev release (hot + cold split) find 0x35f = 863 0x3d5 + 0x112 = 1255 references_duration 0x62 + 0x22 + 0x8 = 140 0x55 + 0x1f + 0x2a + 0x8 = 166 references_user_type 0x6b + 0x26 + 0x111 = 418 0x65 + 0x1f + 0x32 + 0x11b = 465 after: dev release find 0xd6 + 0x1b4 = 650 0xd2 + 0x1f5 = 711 references_duration 0x13 = 19 0x13 = 19 references_user_type 0x1a = 26 0x21 = 33 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-03 08:23:21 -07:00
Rafael Ávila de Espíndola	86c29256eb	types: Fix references_user_type This was broken since the type refactoring. It was checking the static type, which is always abstract_type. Unfortunately we only had dtests for this. This can probably be optimized to avoid the double switch over kind, but it is probably better to do the simple fix first. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190821155354.47704-1-espindola@scylladb.com>	2019-08-21 19:13:59 +03:00
Rafael Ávila de Espíndola	7f0a434cfa	types: Move abstract_type visit to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00
Rafael Ávila de Espíndola	dccefd1ddb	types: Move uuid_type_impl to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00
Rafael Ávila de Espíndola	038728a381	types: Move inet_addr_type_impl to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00
Rafael Ávila de Espíndola	1966416cb3	types: Move varint_type_impl to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00
Rafael Ávila de Espíndola	9229f99c86	types: Move timeuuid_type_impl to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00
Rafael Ávila de Espíndola	993f132619	types: Move date_type_impl to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00
Rafael Ávila de Espíndola	a299ed3b9b	types: Move bytes_type_impl to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00
Rafael Ávila de Espíndola	09ac2a1bc6	types: Move utf8_type_impl to a header Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 16:25:43 -07:00

1 2 3 4 5 ...

361 Commits