scylladb

Author	SHA1	Message	Date
Pavel Emelyanov	ffdafe4024	keyspace_metadata: Add default value for new_keyspace's durable_writes Almost all callers call new_keyspace with durable writes ON, so it's worth having default value for it Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-12-26 11:47:37 +03:00
Pavel Emelyanov	a67c535539	keyspace_metadata: Carry optional<initial_tablets> on board The object in question fully describes the keyspace to be created and, among other things, contains replication strategy options. Next patches move the "initial_tablets" option out of those options and keep it separately, so the ks metadata should also carry this option separately. This patch is _just_ extending the metadata creation API, in fact the new field is unused (write-only) so all the places that need to provide this data keep it disengaged and are explicitly marked with FIXME comment. Next patches will fix that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-12-25 15:58:05 +03:00
Patryk Jędrzejczak	dacec6374d	table_helper: fix indentation Broken in the previous commit.	2023-11-02 14:21:15 +01:00
Patryk Jędrzejczak	e10036babe	table_helper: retry in setup_keyspace on concurrent operation Currently, table_helper::setup_keyspace is used only for starting the system_traces keyspace. We need to handle concurrent group 0 operations possible during concurrent bootstrap in the Raft-based topology.	2023-11-02 14:21:15 +01:00
Patryk Jędrzejczak	e2894a081a	table_helper: add logger It will be used in the next commit to log information when a concurrent group 0 modification occurs.	2023-11-02 14:21:15 +01:00
Patryk Jędrzejczak	ba5275a6ae	table_helper: announce twice in setup_keyspace We refactor table_helper::setup_keyspace so that it calls migration_manager::announce at most twice. We achieve it by announcing all tables at once. The number of announcements should further be reduced to one, but it requires a big refactor. The CQL code used in parse_new_cf_statement assumes the keyspace has already been created. We cannot have such an assumption if we want to announce a keyspace and its tables together. However, we shouldn't touch the CQL code as it would impact user requests, too. One solution is using schema_builder instead of the CQL statements to create tables in table_helper. Another approach is removing table_helper completely. It is used only for the system_traces keyspace, which Scylla creates automatically. We could refactor the way Scylla handles this keyspace and make table_helper unneeded.	2023-10-31 12:08:04 +01:00
Patryk Jędrzejczak	bf15d5f7bb	table_helper: refactor setup_table In the following commit, we reduce migration_manager::announce calls in table_helper::setup_keyspace by announcing all tables together. To do it, we cannot use table_helper::setup_table anymore, which announces a single table itself. However, the new code still has to translate CQL statements, so we extract it to the new parse_new_cf_statement function to avoid duplication.	2023-10-31 12:08:04 +01:00
Avi Kivity	d450a145ce	Revert "Merge 'reduce announcements of the automatic schema changes ' from Patryk Jędrzejczak" This reverts commit `4b80130b0b`, reversing changes made to `a5519c7c1f`. It's suspected of causing dtest failures due to a bug in coroutine::parallel_for_each.	2023-10-29 18:32:06 +02:00
Patryk Jędrzejczak	7810e8d860	table_helper: announce twice in setup_keyspace We refactor table_helper::setup_keyspace so that it calls migration_manager::announce at most twice. We achieve it by announcing all tables at once. The number of announcements should further be reduced to one, but it requires a big refactor. The CQL code used in parse_new_cf_statement assumes the keyspace has already been created. We cannot have such an assumption if we want to announce a keyspace and its tables together. However, we shouldn't touch the CQL code as it would impact user requests, too. One solution is using schema_builder instead of the CQL statements to create tables in table_helper. Another approach is removing table_helper completely. It is used only for the system_traces keyspace, which Scylla creates automatically. We could refactor the way Scylla handles this keyspace and make table_helper unneeded.	2023-10-16 14:59:53 +02:00
Patryk Jędrzejczak	2b4e1e0f9c	table_helper: refactor setup_table In the following commit, we reduce migration_manager::announce calls in table_helper::setup_keyspace by announcing all tables together. To do it, we cannot use table_helper::setup_table anymore, which announces a single table itself. However, the new code still has to translate CQL statements, so we extract it to the new parse_new_cf_statement function to avoid duplication.	2023-10-16 14:59:53 +02:00
Gleb Natapov	4ffc39d885	cql3: Extend the scope of group0_guard during DDL statement execution Currently we hold group0_guard only during DDL statement's execute() function, but unfortunately some statements access underlying schema state also during check_access() and validate() calls which are called by the query_processor before it calls execute. We need to cover those calls with group0_guard as well and also move retry loop up. This patch does it by introducing new function to cql_statement class take_guard(). Schema altering statements return group0 guard while others do not return any guard. Query processor takes this guard at the beginning of a statement execution and retries if service::group0_concurrent_modification is thrown. The guard is passed to the execute in query_state structure. Fixes: #13942 Message-ID: <ZNsynXayKim2XAFr@scylladb.com>	2023-08-17 15:52:48 +03:00
Avi Kivity	d57a951d48	Revert "cql3: Extend the scope of group0_guard during DDL statement execution" This reverts commit `70b5360a73`. It generates a failure in group0_test .test_concurrent_group0_modifications in debug mode with about 4% probability. Fixes #15050	2023-08-15 00:26:45 +03:00
Gleb Natapov	70b5360a73	cql3: Extend the scope of group0_guard during DDL statement execution Currently we hold group0_guard only during DDL statement's execute() function, but unfortunately some statements access underlying schema state also during check_access() and validate() calls which are called by the query_processor before it calls execute. We need to cover those calls with group0_guard as well and also move retry loop up. This patch does it by introducing new function to cql_statement class take_guard(). Schema altering statements return group0 guard while others do not return any guard. Query processor takes this guard at the beginning of a statement execution and retries if service::group0_concurrent_modification is thrown. The guard is passed to the execute in query_state structure. Fixes: #13942 Message-ID: <ZNSWF/cHuvcd+g1t@scylladb.com>	2023-08-13 14:19:39 +03:00
Patryk Jędrzejczak	27ddf78171	migration_manager: announce: provide descriptions for all calls The system.group0_history table provides useful descriptions for each command committed to Raft group 0. One way of applying a command to group 0 is by calling migration_manager::announce. This function has the description parameter set to empty string by default. Some calls to announce use this default value which causes null values in system.group0_history. We want system.group0_history to have an actual description for every command, so we change all default descriptions to reasonable ones. We can't provide a reasonable description to announce in query_processor::execute_thrift_schema_command because this function is called in multiple situations. To solve this issue, we add the description parameter to this function and to handler::execute_schema_command that calls it.	2023-08-07 14:38:11 +02:00
Patryk Jędrzejczak	3468cbd66b	service: migration_manager: change the prepare_ methods to functions The migration_manager service is responsible for schema convergence in the cluster - pushing schema changes to other nodes and pulling schema when a version mismatch is observed. However, there is also a part of migration_manager that doesn't really belong there - creating mutations for schema updates. These are the functions with prepare_ prefix. They don't modify any state and don't exchange any messages. They only need to read the local database. We take these functions out of migration_manager and make them separate functions to reduce the dependency of other modules (especially query_processor and CQL statements) on migration_manager. Since all of these functions only need access to storage_proxy (or even only replica::database), doing such a refactor is not complicated. We just have to add one parameter, either storage_proxy or database and both of them are easily accessible in the places where these functions are called.	2023-07-28 13:55:27 +02:00
Kamil Braun	1b68e8582b	table_helper: remove `qp.get_migration_manager()` calls Push those calls up the call stack, to `trace_keyspace_helper` module. Pass `migration_manager` reference around together with `query_processor` reference.	2023-06-15 09:48:54 +02:00
Avi Kivity	4b53af0bd5	treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines coroutine::parallel_for_each avoids an allocation and is therefore preferred. The lifetime of the function object is less ambiguous, and so it is safer. Replace all eligible occurences (i.e. caller is a coroutine). One case (storage_service::node_ops_cmd_heartbeat_updater()) needed a little extra attention since there was a handle_exception() continuation attached. It is converted to a try/catch. Closes #10699	2022-05-31 09:06:24 +03:00
Kamil Braun	a664ac7ba5	treewide: require `group0_guard` when performing schema changes `announce` now takes a `group0_guard` by value. `group0_guard` can only be obtained through `migration_manager::start_group0_operation` and moved, it cannot be constructed outside `migration_manager`. The guard will be a method of ensuring linearizability for group 0 operations.	2022-01-24 15:20:35 +01:00
Kamil Braun	86762a1dd9	service: migration_manager: rename `schema_read_barrier` to `start_group0_operation` 1. Generalize the name so it mentions group 0, which schema will be a strict subset of. 2. Remove the fact that it performs a "read barrier" from the name. The function will be used in general to ensure linearizability of group0 operations - both reads and writes. "Read barrier" is Raft-specific terminology, so it can be thought of as an implementation detail.	2022-01-24 15:12:50 +01:00
Kamil Braun	283ac7fefe	treewide: pass mutation timestamp from call sites into `migration_manager::prepare_*` functions The functions which prepare schema change mutations (such as `prepare_new_column_family_announcement`) would use internally generated timestamps for these mutations. When schema changes are managed by group 0 we want to ensure that timestamps of mutations applied through Raft are monotonic. We will generate these timestamps at call sites and pass them into the `prepare_` functions. This commit prepares the APIs.	2022-01-24 15:12:50 +01:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Gleb Natapov	03184bd786	table_helper: move schema creation code to use raft	2022-01-12 16:40:06 +02:00
Gleb Natapov	e2a29d9239	table_helper: make setup_table() static It will make it easier to move schema creation to shard 0.	2022-01-12 16:40:06 +02:00
Gleb Natapov	3995f75b30	table_helper: co-routinize setup_keyspace() Also replace open-coded loops with more modern c++ alternatives.	2022-01-12 16:40:05 +02:00
Gleb Natapov	459539e812	migration_manager: do not allow creating keyspace with arbitrary timestamp This was needed to fix issue #2129 which was only manifest itself with auto_bootstrap set to false. The option is ignored now and we always wait for schema to synch during boot.	2022-01-12 16:33:15 +02:00
Avi Kivity	bfa4abaf6b	tracing: make sure keyspace and table names are available to static constructors Static constructors (specifically for the `system_keyspaces` global variable) need their dependencies to be already constructed when their own construction begins. Because tracing uses seastar::sstring, which is not constexpr, we must change it to std::string_view (which is). Change the type and perform the required adjustments. The definition is moved to the header file for simplicity.	2022-01-10 15:24:57 +02:00
Avi Kivity	ae3a360725	database: Move database, keyspace, table classes to replica/ directory The database, keyspace, and table classes represent the replica-only part of the objects after which they are named. Reading from a table doesn't give you the full data, just the replica's view, and it is not consistent since reconciliation is applied on the coordinator. As a first step in acknowledging this, move the related files to a replica/ subdirectory.	2022-01-06 17:07:30 +02:00
Avi Kivity	d768e9fac5	cql3, related: switch to data_dictionary Stop using database (and including database.hh) for schema related purposes and use data_dictionary instead. data_dictionary::database::real_database() is called from several places, for these reasons: - calling yet-to-be-converted code - callers with a legitimate need to access data (e.g. system_keyspace) but with the ::database accessor removed from query_processor. We'll need to find another way to supply system_keyspace with data access. - to gain access to the wasm engine for testing whether used defined functions compile. We'll have to find another way to do this as well. The change is a straightforward replacement. One case in modification_statement had to change a capture, but everything else was just a search-and-replace. Some files that lost "database.hh" gained "mutation.hh", which they previously had access to through "database.hh".	2021-12-15 13:54:23 +02:00
Pavel Solodovnikov	76bea23174	treewide: reduce header interdependencies Use forward declarations wherever possible. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Closes #8813	2021-06-07 15:58:35 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Emelyanov	6dc9a16b4e	table_helper: Use query_processor::get_migration_manager() After the migration manager can be obtained from the query processor the table heler can also benefit from it and not call for global migration manager instance any longer. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-15 19:35:53 +03:00
Pavel Emelyanov	26c115f379	cql3: Change execute()'s 1st arg to query_processor Currently the statement's execute() method accepts storage proxy as the first argument. This is enough for all of them but schema altering ones, because the latter need to call migration manager's announce. To provide the migration manager to those who need it it's needed to have some higher-level service that the proxy. The query processor seems to be good candidate for it. Said that -- all the .execute()s now accept the querty processor instead of the proxy and get the proxy itself from the query processor. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-15 19:00:33 +03:00
Gleb Natapov	d3aa17591c	migration_manager: drop announce_locally flag It looks like the history of the flag begins in Cassandra's https://issues.apache.org/jira/browse/CASSANDRA-7327 where it is introduced to speedup tests by not needing to start the gossiper. The thing is we always start gossiper in our cql tests, so the flag only introduce noise. And, of course, since we want to move schema to use raft it goes against the nature of the raft to be able to apply modification only locally, so we better get rid of the capability ASAP. Tests: units(dev, debug) Message-Id: <20201230111101.4037543-2-gleb@scylladb.com>	2021-01-03 13:58:09 +02:00
Pavel Emelyanov	b18522a7ab	table_helper: Require local query processor in calls Keeping the query processor reference on the table_helper in raii manner seems waistful, the only user of it -- the trace_keyspace_helper -- has a bunch of helpers on board, each would then keep its own copy for no gain. At the same time the trace_keyspace_helper already gets the query processor for its needs, so it can share one with table_helper-s. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:44:20 +03:00
Pavel Emelyanov	f5d39b9638	table_helper: Use local qp as setup_table argument The goal is to make table_helper API require the query_processor reference and use it where needed. The .setup_table() is private method, and still grabs the query processor reference itself. Since its futures do noth reshard, it's safe to carry the query processor reference through. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:44:00 +03:00
Pavel Emelyanov	2f69e90fc9	table_helper: Use local db variable The .setup_keyspace() method already has the db variable in this continuation lambda. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:43:54 +03:00
Wojciech Mitros	e79ad38425	tracing: add username to the session table In order to improve observability, add a username field to the the system_traces.sessions table. The system table should be change while upgrading by running the fix_system_distributed_tables.py script. Until the table is updated, the old behaviour is preserved. Fixes #6737.	2020-10-01 04:46:40 +02:00
Rafael Ávila de Espíndola	c5795e8199	everywhere: Replace engine().cpu_id() with this_shard_id() This is a bit simpler and might allow removing a few includes of reactor.hh. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200326194656.74041-1-espindola@scylladb.com>	2020-03-27 11:40:03 +03:00
Pavel Solodovnikov	adc6a98b59	cql3: return raw::parsed_statement as unique_ptr Change CQL parsing routine to return std::unique_ptr instead of seastar::shared_ptr. This can help reduce redundant shared_ptr copies even further. Make some supplementary changes necessary for this transition: * Remove enabled_shared_from_this base class from the following classes: truncate_statement, authorization_statement, authentication_statement: these were previously constructing prepared_statement instance in `prepare` method using `shared_from_this`. Make `prepare` methods implementation of inheriting classes mirror implementation from other statements (i.e. create a shallow copy of the object when prepairing into `prepared_statement`; this could be further refactored to avoid copies as much as possible). * Remove unused fields in create_role_statement which led to error while using compiler-generated copy ctor (copying uninitialied bool values via ctor). Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-03-23 23:19:21 +03:00
Pavel Emelyanov	b11cf6e950	cql3/query_processor.hh: Debloat from other headers This gives ~30% less (251 jobs -> 181 jobs) recompile when touching it Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200212225828.3374-1-xemul@scylladb.com>	2020-02-16 11:22:30 +02:00
Avi Kivity	b26ded8ec5	tracing: remove #include of modification_statement.hh from table_helper Replace with a forward declration to reduce #include bloat and dependencies.	2020-02-09 13:04:13 +02:00
Botond Dénes	136fc856c5	treewide: silence discarded future warnings for questionable discards This patches silences the remaining discarded future warnings, those where it cannot be determined with reasonable confidence that this was indeed the actual intent of the author, or that the discarding of the future could lead to problems. For all those places a FIXME is added, with the intent that these will be soon followed-up with an actual fix. I deliberately haven't fixed any of these, even if the fix seems trivial. It is too easy to overlook a bad fix mixed in with so many mechanical changes.	2019-08-26 19:28:43 +03:00
Avi Kivity	53a21c7787	table_helper: remove database.hh include	2019-01-05 16:39:26 +02:00
Avi Kivity	7534412071	table_helper: de-inline insert() and setup_keyspace() After previous patches de-templated these functions, we can de-inline them. This helps reduce compile time and prepares to reduce header dependencies.	2019-01-05 16:28:46 +02:00
Avi Kivity	30745eeb72	query_processor: replace sharded<database> with the local shard query_processor uses storage_proxy to access data, and the local database object to access replicated metadata. While it seems strange that the database object is not used to access data, it is logical when you consider that a sharded<database> only contain's this node's data, not the cluster data. Take advantage of this to replace sharded<database> with a single database shard.	2018-12-29 11:02:15 +02:00
Calle Wilund	dcc75263c6	cql: Add schema extensions processing to properties Automatically accept registered schema extensions into the properties set, and when building, generate the corresponding extension object into the resulting schema.	2018-02-07 10:11:46 +00:00
Piotr Jastrzebski	80f08921c4	Make table_helper independent from trace_keyspace_helper table_helper is a generic helper than can easily be used in other places. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <11e46dbc1c90d0273a41c8144e6f6013e21efcdb.1499077818.git.piotr@scylladb.com>	2017-07-03 15:55:00 +03:00
Avi Kivity	5bc13e4454	Revert "Make table_helper independent from trace_keyspace_helper" This reverts commit `db5bf363d0`. Causes errors of the sort Exiting on unhandled exception: exceptions::invalid_request_exception (Keyspace 'system_traces' does not exist)	2017-07-02 11:30:51 +03:00
Piotr Jastrzebski	db5bf363d0	Make table_helper independent from trace_keyspace_helper table_helper is a generic helper than can easily be used in other places. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <3e360a963d4a53de6d758ba8bada78fc572f001a.1498745600.git.piotr@scylladb.com>	2017-06-29 17:20:07 +03:00

49 Commits