scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 20:16:43 +00:00

Author	SHA1	Message	Date
Jesse Haber-Kucharsky	509626fe08	Support `duration` CQL native type `duration` is a new native type that was introduced in Cassandra 3.10 [1]. Support for parsing and the internal representation of the type was added in `8fa47b74e8`. Important note: The version of cqlsh distributed with Scylla does not have support for durations included (it was added to Cassandra in [2]). To test this change, you can use cqlsh distributed with Cassandra. Duration types are useful when working with time-series tables, because they can be used to manipulate date-time values in relative terms. Two interesting applications are: - Aggregation by time intervals [3]: `SELECT * FROM my_table GROUP BY floor(time, 3h)` - Querying on changes in date-times: `SELECT ... WHERE last_heartbeat_time < now() - 3h` (Note: neither of these is currently supported, though columns with duration values are.) Internally, durations are represented as three signed counters: one for months, for days, and for nanoseconds. Each of these counters is serialized using a variable-length encoding which is described in version 5 of the CQL native protocol specification. The representation of a duration as three counters means that a semantic ordering on durations doesn't exist: Is `1mo` greater than `1mo1d`? We cannot know, because some months have more days than others. Durations can only have a concrete absolute value when they are "attached" to absolute date-time references. For example, `2015-04-31 at 12:00:00 + 1mo`. That duration values are not comparable presents some difficulties for the implementation, because most CQL types are. Like in Cassandra's implementation [2], I adopted a similar strategy to the way restrictions on the `counter` type are checked. A type "references" a duration if it is either a duration or it contains a duration (like a `tuple<..., duration, ...>`, or a UDT with a duration member). The following restrictions apply on durations. Note that some of these contexts are either experimental features (materialized views), or not currently supported at run-time (though support exists in the parser and code, so it is prudent to add the restrictions now): - Durations cannot appear in any part of a primary key, either for tables or materialized views. - Durations cannot be directly used as the element type of a `set`, nor can they be used as the key type of a `map`. Because internal ordering on durations is based on a byte-level comparison, this property of Cassandra was intended to help avoid user confusion around ordering of collection elements. - Secondary indexes on durations are not supported. - "Slice" relations (<=, <, >=, >) are not supported on durations with `WHERE` restrictions (like `SELECT ... WHERE span <= 3d`). Multi-column restrictions only work with clustering columns, which cannot be `duration` due to the first rule. - "Slice" relations are not supported on durations with query conditions (like `UPDATE my_table ... IF span > 5us`). Backwards incompatibility note: As described in the documentation [4], duration literals take one of two forms: either ISO 8601 formats (there are three), or a "standard" format. The ISO 8601 formats start with "P" (like "P5W"). Therefore, identifiers that have this form are no longer supported. Fixes #2240. [1] https://issues.apache.org/jira/browse/CASSANDRA-11873 [2] `bfd57d13b7` [3] https://issues.apache.org/jira/browse/CASSANDRA-11871 [4] http://cassandra.apache.org/doc/latest/cql/types.html#working-with-durations	2017-08-10 15:01:10 -04:00
Avi Kivity	ebff739a84	Merge "use paging for compaction history" from Amnon "This series adds an option to use paging in internal query and use that for the get compaction history function. Internal paging will be done explicitly, to use paging, you first create a state object (that contains the query as well) and use that state to get the first page, the result will contain both the query result and a new state that can be used to get the next page. Fixes #2366" * 'amnon/paged_compaction_history_v5' of github.com:cloudius-systems/seastar-dev: system_keyspace: Use paging for get compaction history Add paging for internal queries query_options: Allows creating query_options from query_options	2017-08-02 18:15:58 +03:00
Duarte Nunes	64fcf0c642	alter_table_statement: Allow collection columns to replace normal ones Fixes #2632 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170725183811.5155-1-duarte@scylladb.com>	2017-07-26 13:24:03 +02:00
Duarte Nunes	7eecda3a61	schema: Support compaction enabled attribute Fixes #2547 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170721132206.3037-1-duarte@scylladb.com>	2017-07-21 15:38:45 +02:00
Amnon Heiman	08c81427b9	Add paging for internal queries Usually, internal queries are used for short queries. Sometimes though, like in the case of get compaction history, there could be a large amount of results. Without paging it will overload the system. This patch adds the ability to use paging internally. Using paging will be done explicitely, all the relevant information would be store in an internal_query_state, that would hold both the paging state but also the query so consecutive calls can be made. To use paging use the query method with a function. The function gets beside a statement and its parameters a function that will be used for each of the returned rows. For example if qp is a query_processor: qp.query("SELECT * from system.compaction_history", [] (const cql3::untyped_result_set::row& row) { .... // do something with row ... return stop_iteration::no; // keep on reading }); Will run the function on each of the compaction history table rows. To stop the iteration, the function can return stop_iteration::yes.	2017-07-20 17:43:51 +03:00
Calle Wilund	247c36e048	system_schema: Fix remaining places not handing two system keyspaces Some places remained where code looked directly at system_keyspace::NAME to determine iff a ks is considered special/system/protected. Including schema digest calculation. Export "is_system_keyspace" and use accordingly. Message-Id: <1500469809-23546-1-git-send-email-calle@scylladb.com>	2017-07-19 16:18:45 +03:00
Piotr Jastrzebski	05b56fcfb0	mutation_partition: Add support for specifying continuity This will allow expressing lack of information about certain ranges of rows (including the static row), which will be used in cache to determine if information in cache is complete or not. Continuity is represented internally using flags on row entries. The key range between two consecutive entries is continuous iff rows_entry::continuous() is true for the later entry. The range starting after the last entry is assumed to be continuous. The range corresponding to the key of the entry is continuous iff rows_entry::dummy() is false. [tgrabiec: - based on the following commits: 4a5bf75 - Piotr Jastrzebski : mutation_partition: introduce dummy rows_entry 773070e - Piotr Jastrzebski : mutation_partition: add continuity flag to rows_entry - documented that partition tombstone is always complete - require specifying the partition tombstone when creating an incomplete entry - replaced rows_entry(dummy_tag, ...) constructor with more general rows_entry(position_in_partition, ...) - documented continuity semantics on mutation_partition - fixed _static_row_cached being lost by mutation_partition copy constructors - fixed conversion to streamed_mutation to ignore dummy entries - fixed mutation_partition serializer to drop dummy entries - documented semantics of continuity on mutation_partition level - dropped assumptions that dummy entries can be only at the last position - changed equality to ignore continuity completely, rather than partially (it was not ignoring dummy entries, but ignoring continuity flag) - added printout of continuity information in mutation_partition - fixed handling of empty entries in apply_reversibly() with regards to continuity; we no longer can remove empty entries before merging, since that may affect continuity of the right-hand mutation. Added _erased flag. - fixed mutation_partition::clustered_row() with dummy==true to not ignore the key - fixed partition_builder to not ignore continuity - renamed dummy_tag_t to dummy_tag. _t suffix is reserved. - standardized all APIs on is_dummy and is_continuous bool_class:es - replaced add_dummy_entry() with ensure_last_dummy() with safer semantics - dropped unused remove_dummy_entry() - simplified and inlined cache_entry::add_dummy_entry() - fixed mutation_partition(incomplete_tag) constructor to mark all row ranges as discontinuous ]	2017-06-24 18:06:11 +02:00
Duarte Nunes	6ac73b57fb	cql3/statements/select_statement: Remove dead code Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170522100230.17393-1-duarte@scylladb.com>	2017-05-22 14:32:12 +03:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Duarte Nunes	99b234d717	cql3/statements: Prevent setting default ttl on view Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-17 10:33:19 +02:00
Avi Kivity	f5dae826ce	Merge "Migrate schema tables to v3 format" from Calle "Defines origin v3-format for system/schema tables, and use them for schema storage/retrival. Includes a legacy_schema_migrator implementation/port from origin. Note that since we don't support features like triggers, functions and aggregates, it will bail if encountering such a feature used. Note also that this patch set does not convert the "hints" and "backlog" tables, even though these have changed in v3 as well. That will be a separate patch set. Tested against dtests. Note that patches for dtest + ccm will follow." * 'calle/systemtables' of github.com:cloudius-systems/seastar-dev: (36 commits) legacy_schema_migrator: Actually truncate legacy schema tables on finish database: Extract "remove" from "drop_columnfamily" v3 schema test fixes thrift: Update CQL mapping of static CFs schema_tables: Use v3 schema tables and formats type_parser: Origin expects empty string -> bytes_type cf_prop_defs: Add crc_check_chance as recognized (even if we don't use) types_test: v3 style schemas enforce explicit "frozen" in tupes/ut:s cql3_type: v3 to_string cql_types: Introduce cql3_type::empty and associate with empty data_type schema: rename column accessors to be in line with origin schema: Add "is_static_compact_table" schema_builder: Add helper to generate unique column names akin origin schema: Add utility functions for static columns schema: Use heterogeneous comparator for columns bounds cql3_type_parser: Resolve from cql3 names/expressions cql3_type: Add "prepare_interal" and "references_user_type" cql3::cql3_type: Add prepare_internal path using only "local" holders cql3_type: Add virtual destructor. database/main: encapsulate system CF dir touching ...	2017-05-17 11:25:52 +03:00
Duarte Nunes	ec35cc33f1	update_statement: Reject empty values for dense clustering key Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-10 19:54:42 +02:00
Duarte Nunes	03f765c468	modification_statement: Fix detection of clustering keys Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-10 19:54:42 +02:00
Duarte Nunes	d7701087af	cql3/restrictions/statement_restrictions: Consider statement type Now that update_statement uses statement_restrictions, we need our validation logic to take the statement type into account, in particular to deal with insertion statements which only set static columns but specify clustering values. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-10 19:54:42 +02:00
Duarte Nunes	c2041753c9	cql3/statements/modification_statement: Extract statement_type This patch extracts the statement_type into its own file. The type will be later passed to statement_restrictions for validation purposes. Further along, we could add methods to it that currently live in other statements so we can move more validation into statement_restrictions. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-10 19:54:42 +02:00
Calle Wilund	6c8b5fc09d	schema_tables: Use v3 schema tables and formats Switches system/schema_* for system_schema/*, updates schema/schema builder and uses to hold/expect v3 style info (i.e. types & dropped).	2017-05-10 16:44:48 +00:00
Calle Wilund	97c54d254b	cf_prop_defs: Add crc_check_chance as recognized (even if we don't use)	2017-05-10 16:44:48 +00:00
Calle Wilund	0e6ae8dec2	schema: rename column accessors to be in line with origin More pointedly: Expose columns as is (currently all_columns_in_select_order), expose name->column mapping more appropriately named. Renaming like this is not strictly neccesary, but there is a point to trying to keep nomenclature similar-ish with origin, esp. when select order column need to become filtered (spoiler alert).	2017-05-10 16:44:48 +00:00
Avi Kivity	8c5c5d3004	Merge "CQL front-end for secondary indices" from Pekka "This patch series adds CQL front-end support for secondary indices. You can now execute CREATE INDEX and DROP INDEX statements, which will update the newly added "Indexes" system table. However, the indexes are not actually backed up by anything nor are they available for CQL queries. The feature is hidden behind a new cluster feature flag and enabled only with the "--experimental" flag." * 'penberg/cql-2i/v2' of github.com:cloudius-systems/seastar-dev: (34 commits) schema: Kill index_type enum schema: Kill index_info class cql3/statements/create_index_statement: Use database::existing_index_names() in validation cql3/statements: Use secondary index manager in alter_table_statement class index: Add secondary_index_manager thrift/handler: Use index_metadata db/schema_tables: Index persistence schema: Add all_indices() to schema class schema: Remove add_default_index_names() from schema_builder class db/schema_tables: Add system table for indices cql3/Cgl.g: DROP INDEX cql3/statements: Add drop_index_statement class database: Add find_indexed_table() to database class cql3: Return change event from announce_migration() cql3/statements: Multiple index targets for CREATE INDEX cql3/statements: Use index_metadata in create_index_statement class cql3/statements: Use feature flag in create_index_statement class service/storage_service: Add feature flag for secondary indices database: Add get_available_index_name() to database class schema: Add get_default_index_name() to index_metadata class ...	2017-05-08 17:04:40 +03:00
Pekka Enberg	dfee4d2bb0	cql3: Fix partition key bind indices for prepared statements Fix the CQL front-end to populate the partition key bind index array in result message prepared metadata, which is needed for CQL binary protocol v4 to function correctly. Fixes #2355. Message-Id: <1494247871-3148-1-git-send-email-penberg@scylladb.com>	2017-05-08 16:33:17 +03:00
Pekka Enberg	06564afedb	schema: Kill index_info class It's no longer used. Indices are managed by the index_metadata class.	2017-05-08 10:19:34 +03:00
Pekka Enberg	3f27d12e99	cql3/statements/create_index_statement: Use database::existing_index_names() in validation	2017-05-08 10:19:34 +03:00
Pekka Enberg	b87679821c	cql3/statements: Use secondary index manager in alter_table_statement class	2017-05-08 10:03:28 +03:00
Duarte Nunes	9e88b60ef5	mutation: Set cell using clustering_key_prefix Change the clustering key argument in mutation::set_cell from exploded_clustering_prefix to clustering_key_prefix, which allows for some overall code simplification and fewer copies. This mostly affects the cql3 layer. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-04 15:59:50 +02:00
Duarte Nunes	42873189d4	cql3: Partially support ranged deletions This patch introduces partial support for range deletions. This allows deletion operations such as delete from cf where p=1 and c > 0 and c <= 3. We enforce that both range bounds be specified, because we can't represent infinite bounds in the current sstable format. Such bounds are represented as a prefix with no components, with the bound_kind informing whether they are a bottom of top bound. We're currently unable to serialize an infinite bound in such a way that it would be correctly interpreted by Cassandra 2.2.x. A serialized bound is a composite with a (<length><value><EOC>)+ format. While we could technically represent the bottom bound, the top bound, if written as a single component with 0 bytes in size and some EOC, would always sort before other values. The same would happen if represented as an empty (no components) composite, because in Cassandra 2.2.x those always have EOC = NONE. This limitation should stay in place until we can properly represent range tombstones in the storage format. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-04 15:59:50 +02:00
Duarte Nunes	f7bc88734a	modification_statement: Use statement_restrictions for where clause This patch replaces the custom where clause processing by adding and using a statement_restrictions field to modification_statement. This improves code reuse and also moves some checks to prepare-time. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-04 15:59:49 +02:00
Pekka Enberg	a4ee9f9fa1	cql3/statements: Add drop_index_statement class	2017-05-04 14:59:12 +03:00
Pekka Enberg	14391a8ec8	cql3: Return change event from announce_migration() This changes announce_migration() to return a change event directory in schema_altering_statement base class. It's needed for drop index statement, which does not know the keyspace or column family until it looks up them based on the index. Two stage approach of announcing a migration and then creating the change event won't work because in the latter stage, the lookup will fail. The same change in announce_migration() has been applied to Apache Cassandra.	2017-05-04 14:59:12 +03:00
Pekka Enberg	82394debe6	cql3/statements: Multiple index targets for CREATE INDEX	2017-05-04 14:59:12 +03:00
Pekka Enberg	fe315bd31a	cql3/statements: Use index_metadata in create_index_statement class	2017-05-04 14:59:12 +03:00
Pekka Enberg	651af0f45a	cql3/statements: Use feature flag in create_index_statement class	2017-05-04 14:59:12 +03:00
Pekka Enberg	4391faaf45	cql3/statements: Add constants to index_target	2017-05-04 14:59:11 +03:00
Pekka Enberg	546d1e47dd	cql3/statements: Add as_cql_string() to index_target class	2017-05-04 14:59:11 +03:00
Pekka Enberg	58b90655d2	cql3/statements: Add to_string(target_type)	2017-05-04 14:59:11 +03:00
Pekka Enberg	1f5a52d03f	cql3/statements: Use namespaces in index_target.cc file	2017-05-04 14:59:11 +03:00
Duarte Nunes	4e693383f7	mutation_partion: Use row_tombstone This patch replaces the current row tombstone representation by a row_tombstone. The intent of the patch is thus to reify the idea of shadowable tombstones, that up until now we considered all materialized view row tombstones to be. We need to distinguish shadowable from non-shadowable row tombstones to support scenarios such as, when inserting to a table with a materialzied view: 1. insert into base (p, v1, v2) values (3, 1, 3) using timestamp 1 2. delete from base using timestamp 2 where p = 3 3. insert into base (p, v1) values (3, 1) using timestamp 3 These should yield a view row where v2 is definitely null, but with the current implementation, v2 will pop back with its value v2=3@TS=1, even though its dead in the base row. This is because the row tombstone inserted at 2) is a shadowable one. This patch only addresses the memory representation of such row_tombstones. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-04-25 11:46:33 +02:00
Duarte Nunes	e06bafdc6c	alter_type_statement: Fix signed to unsigned conversion This could allow us to alter a non-existing field of an UDT. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170419114254.5582-1-duarte@scylladb.com>	2017-04-19 14:48:12 +03:00
Avi Kivity	7d16cfa5f0	Merge branch 'penberg/create-index-stmt-cleanup/v1' of github.com:cloudius-systems/seastar-dev "The version of create_index_statement class that was translated to C++ is pretty old by now. This series of cleanups brings it closer to Apache Cassandra trunk to make it easier to bring over more secondary index code to Scylla." * 'penberg/create-index-stmt-cleanup/v1' of github.com:cloudius-systems/seastar-dev: cql3/statements/create_index_statement: Move target validation cql3/statements/create_index_statement: Remove static column validation cql3/statements/create_index_statement: Extract validations cql3/statements/create_index_statement: Kill bogus custom validation cql3/statements/create_index_statement: Add materialized view to validate() cql3/statements/create_index_statement: Remove validation	2017-04-13 13:27:53 +03:00
Pekka Enberg	8c45038729	cql3/statements/create_index_statement: Move target validation Move index target validation to preserve the same code structure as Apache Cassandra and simplify support for multiple index targets.	2017-04-12 20:50:09 +03:00
Pekka Enberg	528c33a05b	cql3/statements/create_index_statement: Remove static column validation Apache Cassandra supports secondary indices on static columns since commit 9e74891 ("Add support for secondary indexes on static columns").	2017-04-12 20:46:29 +03:00
Pekka Enberg	975e1c8fc6	cql3/statements/create_index_statement: Extract validations Extract specific validations to separate functions to preserve the same structure as Apache Cassandra code and make it easier to add support for multiple index targets.	2017-04-12 20:44:15 +03:00
Pekka Enberg	940d6de1b8	cql3/statements/create_index_statement: Kill bogus custom validation Rejecting custom indices is bogus because it's just a configuration mechanism like replication strategy, for example. Furthermore, it's needed for SASI indices, which we likely need to be compatible with.	2017-04-12 20:17:09 +03:00
Pekka Enberg	cfadb70565	cql3/statements/create_index_statement: Add materialized view to validate() Apache Cassandra does not support secondary indices on materialized views so neither should we.	2017-04-12 20:15:20 +03:00
Pekka Enberg	1706f346f2	cql3/statements/create_index_statement: Remove validation The validation was removed in Apacha Cassandra commit 0626be8 ("New 2i API and implementations for built in indexes"). Let's also remove it from our code so that we remove one dependency to the obsolete db/index/ code.	2017-04-12 20:14:52 +03:00
Vlad Zolotarov	75fbc7c558	cql3::statements::batch_statement: add a constructor that doesn't receive the "bound_terms" value This constructor should be used when we know that there are no bound terms in the current batch statement. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-12 12:24:08 -04:00
Vlad Zolotarov	ff55b76562	cql3::query_processor: use weak_ptr for passing the prepared statements around Use seastar::checked_ptr<weak_ptr<pepared_statement>> instead of shared_ptr for passing prepared statements around. This allows an easy tracking and handling of statements invalidation. This implementation will throw an exception every time an invalidated statement reference is dereferenced. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-12 12:24:03 -04:00
Pekka Enberg	ecb8ee4efd	cql3/statements: Cleanup create_index_statement.cc Use namespaces and fix formatting issues to make the source file easier on the eyes. Message-Id: <1491977964-26629-1-git-send-email-penberg@scylladb.com>	2017-04-12 16:48:45 +03:00
Avi Kivity	27c42359bc	Merge seastar upstream * seastar 6b21197...2ebe842 (6): > Merge "Various improvements to execution stages" from Paweł > app-template: allow apps to specify a name for help message > bool_class: avoid initializing object of incomplete type > app-template: make sure we can still get help with required options > prometheus: Http handler that returns prometheus 0.4 protobuf or text format > Update DPDK to 17.02 Includes patch from Pawel to adjust to updated execution_stage interface.	2017-03-26 10:50:21 +03:00
Duarte Nunes	e215f25b11	migration_manager: Atomically migrate table and views This patch changes the migration path for table updates such that the base table mutations are sent and applied atomically with the view schema mutations. This ensures that after schema merging, we have a consistent mapping of base table versions to view table versions, which will be used in later patches. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-03-15 16:03:56 +01:00
Duarte Nunes	bfb8a3c172	materialized views: Replace db::view::view class The write path uses a base schema at a particular version, and we want it to use the materialized views at the corresponding version. To achieve this, we need to map the state currently in db::view::view to a particular schema version, which this patch does by introducing the view_info class to hold the state previously in db::view::view, and by having a view schema directly point to it. The changes in the patch are thus: 1) Introduce view_info to hold the extra view state; 2) Point to the view_info from the schema; 3) Make the functions in the now stateless db::view::view non-member; 4) Remove the db::view::view class. All changes are structural and don't affect current behavior. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-03-15 15:50:05 +01:00

1 2 3 4 5 ...

443 Commits