scylladb

Author	SHA1	Message	Date
Pavel Solodovnikov	9aa4712270	lwt: introduce `paxos_grace_seconds` per-table option to set paxos ttl Previously system.paxos TTL was set as max(3h, gc_grace_seconds). Introduce new per-table option named `paxos_grace_seconds` to set the amount of seconds which are used to TTL data in paxos tables when using LWT queries against the base table. Default value is equal to `DEFAULT_GC_GRACE_SECONDS`, which is 10 days. This change allows to easily test various issues related to paxos TTL. Fixes #6284 Tests: unit (dev, debug) Co-authored-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200816223935.919081-1-pa.solodovnikov@scylladb.com>	2020-08-17 16:44:14 +02:00
Nadav Har'El	7e01ae089e	cdc: avoid including cdc/cdc_options.hh everywhere Before this patch, modifying cdc/cdc_options.hh required recompiling 264 source files. This is because this header file was included by a couple other header files - most notably schema.hh, where a forward declaration would have been enough. Only the handful of source files which really need to access the CDC options should include "cdc/cdc_options.hh" directly. After this patch, modifying cdc/cdc_options.hh requires only 6 source files to be recompiled. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200813070631.180192-1-nyh@scylladb.com>	2020-08-16 14:41:47 +03:00
Rafael Ávila de Espíndola	efeaded427	Everywhere: Add a make_shared_schema helper This replaces a lot of make_lw_shared(schema(...)) with make_shared_schema(...). This makes it easier to drop a dependency on the differences between seastar::make_shared and std::make_shared. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-21 10:33:49 -07:00
Pavel Emelyanov	f045cec586	snap: Get rid of storage_service reference in schema.cc Now when the snapshot stopping is correctly handled, we may pull the database reference all the way down to the schema::describe(). One tricky place is in table::napshot() -- the local db reference is pulled through an smp::submit_to call, but thanks to the shard checks in the place where it is needed the db is still "local" Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:28:25 +03:00
Glauber Costa	44a0e40cb2	compaction: move compaction_strategy_type to its own header I just hit a circularity in header inclusion that I traced back to the fact that schema.hh includes compaction_strategy.hh. schema.hh is in turn included in lots of places, so a circularity is not hard to come by. The schema header really only needs to know about the compaction_type, so it can inform schema users about it. Following the trend in header clenups, I am moving that to a separate header which will both break the circularity and make sure we are included less stuff that is not needed. With this change, Scylla fails to compile due to a new missing forward declaration at index/secondary_index_manager.hh, so this is fixed. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200527172203.915936-1-glauber@scylladb.com>	2020-05-29 08:14:27 +03:00
Pekka Enberg	ed0d00f51e	Revert "Revert "schema: Default dc_local_read_repair_chance to zero"" This reverts commit `43b488a7bc`. The commit was originally reverted because a dtest was sensitive to the value. The dtest is fixed now, so let's revert the revert as requested by Glauber.	2020-05-21 08:05:13 +03:00
Pavel Solodovnikov	f6e765b70f	cql3: pass `column_specification` via lw_shared_ptr `column_specification` class is marked as "final": it's safe to use non-polymorphic pointer "lw_shared_ptr" instead of a more generic "shared_ptr". tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200427084016.26068-1-pa.solodovnikov@scylladb.com>	2020-04-27 12:47:42 +03:00
Pekka Enberg	43b488a7bc	Revert "schema: Default dc_local_read_repair_chance to zero" This reverts commit `fdd2d9de3d` because it breaks one heat-weighted load balancing dtest: FAIL: heat_weighted_load_balancing_cl_QUORUM_test (heat_weighted_load_balancing_test.HeatWeightedLB) ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/penberg/src/scylla/scylla-dtest/heat_weighted_load_balancing_test.py", line 182, in heat_weighted_load_balancing_cl_QUORUM_test self.run_heat_weighted_load_balancing('QUORUM') File "/home/penberg/src/scylla/scylla-dtest/heat_weighted_load_balancing_test.py", line 165, in run_heat_weighted_load_balancing self.verify_metrics(metrics, cached=False) File "/home/penberg/src/scylla/scylla-dtest/heat_weighted_load_balancing_test.py", line 73, in verify_metrics mean_avg, node_mean_avg, key)) AssertionError: 19.0 not found in range(3, 13) : Cache difference between nodes is less then expected: 6469.6/328.2, metric scylla_storage_proxy_coordinator_reads_local_node I am reverting because it's a test issue, and we should bring this commit back once the test is fixed. Gleb Natapov explains: "dtest result directly depends on replicas we contact. Glauber's patch make us contacts less replicas, so numbers differ."	2020-04-02 13:43:29 +03:00
Glauber Costa	fdd2d9de3d	schema: Default dc_local_read_repair_chance to zero dc_local_read_repair_chance is a legacy of old times: Cassandra itself now defaults to zero, and we should look into that too. Most serious production clusters are either repaired through our asynchronous repair, or don't need repair at all. Synchronous read repair can help things converging, but it implies an impact at query time. For clusters that are on an asynchronous repair schedule this should not be needed. Fixes #6109 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200331183418.21452-1-glauber@scylladb.com>	2020-04-01 08:27:49 +02:00
Piotr Jastrzebski	e72696a8e6	sharding_info: rename the class to sharder Also rename all variables that were named si or sinfo to sharder. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-30 18:42:33 +02:00
Piotr Jastrzebski	92cdc21123	schema: remove incorrect comment partitioner is actually part of schema digest and is stored locally in internal tables. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-30 18:42:33 +02:00
Piotr Jastrzebski	7bd2b8d73f	schema: make it possible to set sharding_info per schema Previously schema::get_sharding_info was obtaining sharding_info from the partitioner but we want to remove sharding_info from the partitioner so we need a place in schema to store it there instead. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-30 18:42:33 +02:00
Piotr Jastrzebski	8d81a2498f	schema: add get_sharding_info At the moment, we have a single sharding logic per node but we want to be able to set it per table in the future. To make it easy to change in the future sharding_info will be managed inside schema and all the other code will access it through schema::get_sharding_info function. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-30 09:35:27 +02:00
Nadav Har'El	35d95d6887	merge: Add postimage implementation Merged pull request https://github.com/scylladb/scylla/pull/5996 from Calle Wilund: Fixes #4992 Implements post-image support by synthesizing it from pre-image + delta. Post-image data differs from the delta data in two ways: 1.) It merges non-atomics into an actual result value 2.) It contains all columns of the row, not just those affected by the update. For a non-atomic field, the post-image value of a column is either the pre-image or the delta (maybe null) Tested by adding post-image checks to pre-image test and collection/udt tests	2020-03-16 13:42:07 +02:00
Calle Wilund	ca7046256f	schema: Add "columns" accessor for columns by kind To prevent switch-code everywhere.	2020-03-16 09:21:06 +00:00
Piotr Jastrzebski	5bbb826c49	schema: drop optional from _partitioner field Always set the field to the default value if no table specific partitioner has been set. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-15 10:25:21 +01:00
Piotr Jastrzebski	22daa262ee	partitioner: move default_partitioner to schema.cc Make it inaccessible to other compilation units. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-15 10:25:20 +01:00
Piotr Jastrzebski	57b69fb804	schema: include partitioner name in scylla tables mutation There are two results of this patch: 1. New partitioner name column is persited on node's disk in scylla_tables 2. New partitioner name column is included into schema digest This is achieved by including this new column in scylla tables mutation. For that we: 1. Add partitioner name to the result of make_scylla_tables_mutation. If table does not have a specific partitioner set and uses default partitioner then we don't include the name of such default partitioner. Only the name of custom partitioner is added if a table has one. 2. In create_table_from_mutations we check whether scylla tables mutation has a partitioner name set. If so then we use it as a parameter for schema_builder. Note that previous patches have ensured that this new column will be included into schema digest only after the whole cluster supports per table partitioners. Before that, during rolling upgrade, new partitioner name column is hidden and not shared with other nodes. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-15 10:25:20 +01:00
Piotr Jastrzebski	1d6cec1b0a	schema: make it possible to set custom partitioner schema_builder::with_partitioner can be used now to set custom partitioner on a table. If no such partitioner is set, global partitioner is still used. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-15 10:25:20 +01:00
Piotr Jastrzebski	54d24553bb	schema: get_partitioner return const& Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-06 13:33:53 +01:00
Piotr Dulikowski	861c7b5626	schema: get cdc options from schema extensions Removes logic responsible for setting cdc_options from dedicated column in scylla_tables, and uses the "cdc" schema extension instead.	2020-03-05 16:11:21 +01:00
Rafael Ávila de Espíndola	151f5e723f	Pass string_view to the schema constructor This moves string copies from the callers of the constructor to the implementation. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-02-28 17:04:12 -08:00
Piotr Jastrzebski	9b95153136	schema: add get_partitioner() The plan is to remove dht::global_partitioner() and use schema::get_partitioner() instead. This will allow a usage of per schema/table partitioner instead of a single global partitioner everywhere. Initially schema::get_partitioner will call dht::global_partitioner. After all the calls to dht::global_partitioner are switched to schema::get_partitioner, the ability to set per schema partitioner will be implemented. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-17 10:04:41 +01:00
Nadav Har'El	9953a33354	merge "Adding a schema file when creating a snapshot" Merged pull request https://github.com/scylladb/scylla/pull/5294 from Amnon Heiman: To use a snapshot we need a schema file that is similar to the result of running cql DESCRIBE command. The DESCRIBE is implemented in the cql driver so the functionality needs to be re-implemented inside scylla. This series adds a describe method to the schema file and use it when doing a snapshot. There are different approach of how to handle materialize views and secondary indexes. This implementation creates each schema.cql file in its own relevant directory, so the schema for materializing view, for example, will be placed in the snapshot directory of the table of that view. Fixes #4192	2020-01-16 12:05:50 +02:00
Amnon Heiman	82367b325a	schema: Add a describe method This patch adds a describe method to a table schema. It acts similar to a DESCRIBE cql command that is implemented in a CQL driver. The method supports tables, secondary indexes local indexes and materialize views. relates to: #4192 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2020-01-15 15:06:00 +02:00
Gleb Natapov	16e0fc4742	schema: allow schema to be marked as 'always sync to commitlog' All writes that uses this schema will be immediately persisted on a storage.	2020-01-15 12:15:42 +02:00
Calle Wilund	2787b0c4f8	cdc: Move "options" to separate header to avoid to much header inclusion cdc should not contaminate the whole universe.	2019-12-09 12:12:09 +00:00
Konstantin Osipov	6159c012db	schema: pre-allocate the bitset of column_set The number of columns is usually small, and avoiding a resize speeds up bit manipulation functions.	2019-11-13 11:41:51 +03:00
Konstantin Osipov	e95d675567	schema: introduce schema::all_columns_count() schema::all_columns_count() will be used to reserve memory of the column_set bitmask.	2019-11-13 11:41:42 +03:00
Konstantin Osipov	191acec7ab	schema: rename column_mask to column_set Since it contains a precise set of columns, it's more accurate to call it a set, not a mask. Besides, the name column_mask is already used for column options on storage level.	2019-11-13 11:41:30 +03:00
Nadav Har'El	631846a852	CDC: Implement minimal version that logs only primary key of each change Merge a patch series from Piotr Jastrzębski (haaawk): This PR introduces CDC in it's minimal version. It is possible now to create a table with CDC enabled or to enable/disable CDC on existing table. There is a management of CDC log and description related to enabling/disabling CDC for a table. For now only primary key of the changed data is logged. To be able to co-locate cdc streams with related base table partitions it was needed to propagate the information about the number of shards per node. This was node through gossip. There is an assumption that all the nodes use the same value for sharding_ignore_msb_bits. If it does not hold we would have to gossip sharding_ignore_msb_bits around together with the number of shards. Fixes #4986. Tests: unit(dev, release, debug)	2019-10-20 11:41:01 +03:00
Piotr Jastrzebski	ca9536a771	schema: add _cdc_options field Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-10-17 10:55:31 +02:00
Konstantin Osipov	c0f0ab5edd	lwt: introduce column mask Introduce a bitset container which can be used to compute all columns used in a query. Add a partition_slice constructor which uses the bitset.	2019-10-16 22:40:55 +03:00
Konstantin Osipov	fa73421198	lwt: introduce column_definition::ordinal_id Make sure every column in the schema, be it a column of partition key, clustering key, static or regular one, has a unique ordinal identifier. This makes it easy to compute the set of columns used in a query, as well as index row cells. Allow to get column definition in schema by ordinal id.	2019-10-16 15:46:25 +03:00
Piotr Sarna	491b7a817f	schema: add computed info to column definition Some columns may represent not user-provided values, but ones computed from other columns. Currently an example is token column used in secondary indexes to provide proper ordering. In order to avoid hardcoding special cases in execution stage, optional additional information for computed columns is stored in column definition.	2019-07-19 11:47:46 +02:00
Tomasz Grabiec	f798f724c8	frozen_mutation: Guard against unfreezing using wrong schema Currently, calling unfreeze() using the wrong version of the schema results in undefined behavior. That can cause hard-to-debug problems. Better to throw in such cases. Refs #4549. Tests: - unit (dev) Message-Id: <1560459022-23786-1-git-send-email-tgrabiec@scylladb.com>	2019-06-17 15:23:24 +03:00
Dejan Mircevski	274a77f45e	Process GROUP BY columns into select_statement Validate raw GROUP BY identifiers and translate them into a select_statement member. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-08 10:10:10 -04:00
Piotr Sarna	90d47ca183	schema: add is_local_index cached value to index metadata In order to quickly distinguish global indexes from local ones, a cached boolean value is introduced.	2019-03-20 09:51:46 +01:00
Piotr Sarna	b0ab4c28cf	schema: add column_definition::is_hidden_from_cql Right now the only columns hidden from CQL are view virtual columns, but in case of expanding this set, a helper function is provided.	2019-02-27 15:07:54 +01:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Tomasz Grabiec	789fac9884	schema: Optimize column count getters	2018-11-21 14:04:27 +01:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	a71ab365e3	toplevel: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Vladimir Krivopalov	399f815a89	schema: Add helper method returning the count of columns of specified kind. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-10-25 17:07:20 -07:00
Nadav Har'El	0a1d93138d	schema: add "view virtual" flag to schema's column_definition In this patch we add a flag, "view virtual", that we can mark on on a column defined in a schema. In following patches, we will add such virtual columns to materialized views to allow view rows to remain alive despite having no data (refs #3362). After this patch, the "view virtual" flag exists in our in-memory representation of the schema, but not persisted to disk - we will fix this in the next patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-08-16 15:23:09 +03:00
Paweł Dziepak	6c54a97320	schema: column_mapping_entry: cache abstract_type::is_atomic() IDL deserialisation code calls is_atomic() for each cell. An additional indirection and a virtual call can be avoided by caching that value in column_mapping_entry. There is already very similar optimisation done for column_definitions.	2018-06-28 22:16:42 +01:00
Piotr Sarna	bc019205b3	schema: fix typos in a comment Message-Id: <2b2a169e8a511fa9e0e1556ac7559ce9bef896e1.1525431353.git.sarna@scylladb.com>	2018-05-04 15:26:51 +01:00
Calle Wilund	ff41f47a08	db::extensions: Allow extensions to modify (system) schemas Allows extensions/config listeners to potentially augument (system) schemas at boot time. This is only useful for schemas who do not pass through system_schema tables.	2018-03-26 11:58:28 +00:00
Calle Wilund	3ab760b375	schema: Add opaque type to represent extensions A virtual opaque object meant to represent the "extensions" mapping in schema_tables::tables/views	2018-02-07 10:11:45 +00:00
Duarte Nunes	fbb4c9edda	schema: Provide all-selecting partition slice This patch introduces schema::full_slice(), which returns a partition_slice selecting the full clustering range, as well as all static and regular columns. No options aside from the default are set in that partition_slice. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1507732800-9448-1-git-send-email-duarte@scylladb.com>	2017-10-17 11:25:35 +02:00

1 2 3 4 5

215 Commits