scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Author	SHA1	Message	Date
Botond Dénes	f8a8fe41d6	cdc/log.hh: expose is_log_name() Allow outside code to use it to determine whether a table is cdc or not. This is currently the most reliable method if the custom partitioner is not set on the schema of the investigated table.	2022-06-10 10:57:12 +03:00
Calle Wilund	adda43edc7	CDC - do not remove log table on CDC disable Fixes #10489 Killing the CDC log table on CDC disable is unhelpful in many ways, partly because it can cause random exceptions on nodes trying to do a CDC-enabled write at the same time as log table is dropped, but also because it makes it impossible to collect data generated before CDC was turned off, but which is not yet consumed. Since data should be TTL:ed anyway, retaining the table should not really add any overhead beyond the compaction to eventually clear it. And user did set TTL=0 (disabled), then he is already responsible for clearing out the data. This also has the nice feature of meshing with the alternator streams semantics. Closes #10601	2022-05-31 19:07:07 +03:00
cvybhu	d85f680df3	cql3: Remove relation class Functionality of the relation class has been replaced by expr::to_restriction. Relation and all classes deriving from it can now be removed. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:58 +02:00
Piotr Sarna	eeec502aee	Merge 'gms: feature_service: reduce boilerplate to add a cluster feature' from Avi Kivity Currently, adding a cluster feature requires editing several files and repeating the new feature name several times. This series reduces the boilerplate to a single line (for non-experimental features), and perhaps three for experimental features. Closes #10488 * github.com:scylladb/scylla: gms: feature_service: remove variable/helper function duplication gms: feature: make `operator bool` implicit gms: feature_service: remove feature variable duplication in enable() gms: feature_service: remove feature variable declaration/definition duplication gms: features: de-quadruplicate active feature names gms: features: de-quadruplicate deprecated feature names gms: feature_service: avoid duplicating feature names when listing known features	2022-05-05 12:43:15 +02:00
Avi Kivity	19ab3edd77	gms: feature_service: remove variable/helper function duplication Each feature has a private variable and a public accessor. Since the accessor effectively makes the variable public, avoid the intermediary and make the variable public directly. To ease mechanical translation, the variable name is chosen as the function name (without the cluster_supports_ prefix). References throughout the codebase are adjusted.	2022-05-04 18:59:56 +03:00
Calle Wilund	78350a7e1b	cdc: Ensure columns removed from log table are registered as dropped If we are redefining the log table, we need to ensure any dropped columns are registered in "dropped_columns" table, otherwise clients will not be able to read data older than now. Includes unit test. Should probably be backported to all CDC enabled versions. Fixes #10473 Closes #10474	2022-05-04 14:19:39 +02:00
Pavel Emelyanov	c2cf4e3536	system_keyspace,cdc,storage_service: Make bootstrap manipulations non-static The users of get_/set_bootstrap_sate and aux helpers are CDC and storage service. Both have local system_keyspace references and can just use them. This removes some users of global system ks. cache and the qctx thing. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-03-25 15:08:13 +03:00
Pavel Emelyanov	62417577ab	cdc_generation_service: Add system keyspace dependency The service uses system keyspace to, e.g., manage the generation id, thus it depends on the system_keyspace instance and deserves the explicit reference. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-03-25 13:39:32 +03:00
Nadav Har'El	7be3129458	cdc: don't need current keyspace to create the log table CDC registers to the table-creation hook (before_create_column_family) to add a second table - the CDC log table - to the same keyspace. The handler function (on_before_update_column_family() in cdc/log.cc) wants to retrieve the keyspace's definition, but that does NOT WORK if we create the keyspace and table in one operation (which is exactly what we intend to do in Alternator to solve issue #9868) - because at the time of the hook, the keyspace does not yet exist in the schema. It turns out that on_before_update_column_family() does not REALLY need the keyspace. It needed it to pass it on to make_create_table_mutations() but that function doesn't use the keyspace parameter passed to it! All it needs is the keyspace's name - which is in the schema anyway and doesn't need to be looked up. So in this patch we fix make_create_table_mutations() to not require the unused keyspace parameter - and fix the CDC code not to look for the keyspace that is no longer needed. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220215162342.622509-1-nyh@scylladb.com>	2022-02-16 08:38:56 +02:00
Piotr Jastrzebski	09d4438a0d	cdc: Handle compact storage correctly in preimage Base tables that use compact storage may have a special artificial column that has an empty type. `c010cefc4d` fixed the main CDC path to handle such columns correctly and to not include them in the CDC Log schema. This patch makes sure that generation of preimage ignores such empty column as well. Fixes #9876 Closes #9910 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2022-01-20 13:23:38 +01:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Kamil Braun	fe0366f6bc	cdc: `check_and_repair_cdc_streams`: fix indentation	2022-01-13 23:10:18 +02:00
Juliusz Stasiewicz	ea46439858	cdc: `check_and_repair_cdc_streams`: regenerate if too many streams are present If the number of streams exceeds the number of token ranges it indicates that some spurious streams from decommissioned nodes are present. In such a situation - simply regenerate. Fixes #9772 Closes #9780	2022-01-13 23:10:18 +02:00
Pavel Solodovnikov	5dcfb94d5a	gms: i_endpoint_state_change_subscriber: make callbacks to return futures Coroutinize a few simple callbacks in the process. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2022-01-11 09:29:12 +03:00
Avi Kivity	bbad8f4677	replica: move ::database, ::keyspace, and ::table to replica namespace Move replica-oriented classes to the replica namespace. The main classes moved are ::database, ::keyspace, and ::table, but a few ancillary classes are also moved. There are certainly classes that should be moved but aren't (like distributed_loader) but we have to start somewhere. References are adjusted treewide. In many cases, it is obvious that a call site should not access the replica (but the data_dictionary instead), but that is left for separate work. scylla-gdb.py is adjusted to look for both the new and old names.	2022-01-07 12:04:38 +02:00
Avi Kivity	ae3a360725	database: Move database, keyspace, table classes to replica/ directory The database, keyspace, and table classes represent the replica-only part of the objects after which they are named. Reading from a table doesn't give you the full data, just the replica's view, and it is not consistent since reconciliation is applied on the coordinator. As a first step in acknowledging this, move the related files to a replica/ subdirectory.	2022-01-06 17:07:30 +02:00
Juliusz Stasiewicz	351f142791	cdc/check_and_repair_cdc_streams: ignore LEFT endpoints When `check_and_repair_cdc_streams` encountered a node with status LEFT, Scylla would throw. This behavior is fixed so that LEFT nodes are simply ignored. Fixes #9771 Closes #9778	2021-12-10 15:28:14 +01:00
Juliusz Stasiewicz	5a8741a1ca	cdc: Throw when ALTERing cdc options without "enabled":"..." The problem was that such a command: ``` alter table ks.cf with cdc={'ttl': 120}; ``` would assume that "enabled" parameter is the default ("false") and, in effect, disable CDC on that table. This commit forces the user to specify that key. Fixes #6475 Closes #9720	2021-12-07 17:37:44 +02:00
Avi Kivity	595cc328b1	Merge 'cql3: Remove term, replace with expression' from Jan Ciołek This PR finally removes the `term` class and replaces it with `expression`. * There was some trouble with `lwt_cache_id` in `expr::function_call`. The current code works the following way: * for each `function_call` inside a `term` that describes a pk restriction, `prepare_context::add_pk_function_call` is called. * `add_pk_function_call` takes a `::shared_ptr<cql3::functions::function_call>`, sets its `cache_id` and pushes this shared pointer onto a vector of all collected function calls * Later when some condiition is met we want to clear cache ids of all those collected function calls. To do this we iterate through shared pointers collected in `prepare_context` and clear cache id for each of them. This doesn't work with `expr::function_call` because it isn't kept inside a shared pointer. To solve this I put the `lwt_cache_id` inside a shared pointer and then `prepare_context` collects these shared pointers to cache ids. I also experimented with doing this without any shared pointers, maybe we could just walk through the expression and clear the cache ids ourselves. But the problem is that expressions are copied all the time, we could clear the cache in one place, but forget about a copy. Doing it using shared pointers more closely matches the original behaviour. The experiment is on the [term2-pr3-backup-altcache](https://github.com/cvybhu/scylla/tree/term2-pr3-backup-altcache) branch * `shared_ptr<term>` being `nullptr` could mean: * It represents a cql value `null` * That there is no value, like `std::nullopt` (for example in `attributes.hh`) * That it's a mistake, it shouldn't be possible A good way to distinguish between optional and mistake is to look for `my_term->bind_and_get()`, we then know that it's not an optional value. * On the other hand `raw_value` cased to bool means: * `false` - null or unset * `true` - some value, maybe empty I ran a simple benchmark on my laptop to see how performance is affected: ``` build/release/test/perf/perf_simple_query --smp 1 -m 1G --operations-per-shard 1000000 --task-quota-ms 10 ``` * On master (`a21b1fbb2f`) I get: ``` 176506.60 tps ( 77.0 allocs/op, 12.0 tasks/op, 45831 insns/op) median 176506.60 tps ( 77.0 allocs/op, 12.0 tasks/op, 45831 insns/op) median absolute deviation: 0.00 maximum: 176506.60 minimum: 176506.60 ``` * On this branch I get: ``` 172225.30 tps ( 75.1 allocs/op, 12.1 tasks/op, 46106 insns/op) median 172225.30 tps ( 75.1 allocs/op, 12.1 tasks/op, 46106 insns/op) median absolute deviation: 0.00 maximum: 172225.30 minimum: 172225.30 ``` Closes #9481 * github.com:scylladb/scylla: cql3: Remove remaining mentions of term cql3: Remove term cql3: Rename prepare_term to prepare_expression cql3: Make prepare_term return an expression instead of term cql3: expr: Add size check to evaluate_set cql3: expr: Add expr::contains_bind_marker cql3: expr: Rename find_atom to find_binop cql3: expr: Add find_in_expression cql3: Remove term in operations cql3: Remove term in relations cql3: Remove term in multi_column_restrictions cql3: Remove term in term_slice, rename to bounds_slice cql3: expr: Remove term in expression cql3: expr: Add evaluate_IN_list(expression, options) cql3: Remove term in column_condition cql3: Remove term in select_statement cql3: Remove term in update_statement cql3: Use internal cql format in insert_prepared_json_statement cache types: Add map_type_impl::serialize(range of <bytes, bytes>) cql3: Remove term in cql3/attributes cql3: expr: Add constant::view() method cql3: expr: Implement fill_prepare_context(expression) cql3: expr: add expr::visit that takes a mutable expression cql3: expr: Add receiver to expr::bind_variable	2021-11-30 16:39:39 +02:00
Piotr Jastrzebski	033a75ff96	cdc: Don't support "on" and "off" values for preimage any more This is an undocumented feature that causes confusion so let's get rid of it. tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Closes #9639	2021-11-17 11:54:11 +01:00
Jan Ciolek	e458340821	cql3: Remove term term isn't used anywhere now. We can remove it and all classes that derive from it. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Nadav Har'El	666017f2f0	Merge 'Convert last uses of sprint() to fmt::format()' from Avi Kivity sprint() uses the printf-style formatting language while most of our code uses the Python-derived format language from fmt::format(). The last mass conversion of sprint() to fmt (in `1129134a4a`) missed some callers (principally those that were on multiple lines, and so the automatic converter missed them). Convert the remainder to fmt::format(), and some sprintf() and printf() calls, so we have just one format language in the code base. Seastar::sprint() ought to be deprecated and removed. Test: unit (dev) Closes #9529 * github.com:scylladb/scylla: utils: logalloc: convert debug printf to fmt::print() utils: convert fmt::fprintf() to fmt::print() main: convert fprint() to fmt::print() compress: convert fmt::sprintf() to fmt::format() tracing: replace seastar::sprint() with fmt::format() thrift: replace seastar::sprint() with fmt::format() test: replace seastar::sprint() with fmt::format() streaming: replace seastar::sprint() with fmt::format() storage_service: replace seastar::sprint() with fmt::format() repair: replace seastar::sprint() with fmt::format() redis: replace seastar::sprint() with fmt::format() locator: replace seastar::sprint() with fmt::format() db: replace seastar::sprint() with fmt::format() cql3: replace seastar::sprint() with fmt::format() cdc: replace seastar::sprint() with fmt::format() auth: replace seastar::sprint() with fmt::format()	2021-10-28 22:33:23 +03:00
Avi Kivity	6b02aa72e2	cdc: replace seastar::sprint() with fmt::format() sprint() is obsolete.	2021-10-27 14:30:06 +03:00
Avi Kivity	e44057d5e1	cdc: don't allow background streams description rewrite to delay too far If we're upgrading from an older version with the previous CDC streams format, we'll upgrade it in the background. Background update is needed since we need the cluster to be available when performing the upgrade, but at this point we're just starting a node, and may not succeed in forming a cluster before we shut down. However, running in the background is dangerous since the objects we use may stop existing. The code is careful to use reference counting, but this does not guarantee that other dependencies are still alive, especially since not all dependencies are expressed via constructor parameters. Fix by waiting for the rewrite work in generation_service::stop(). As long as generation_service is up, the required dependencies should be working too. Note that there is another change here besides limiting the background work: checks that were previously done in the foreground (limited to local tables) are now also done in the background. I don't think this has any impact. Note: I expect this to have no real impact. Any CDC users will have long since ugpraded. This is just preparing for other patches that bring in other dependencies, which cannot be passed via reference counted pointers, so they expose the existing problem.	2021-10-18 16:56:59 +03:00
Avi Kivity	eac95e2370	cdc: adjust type of streams_count streams_count has signed type, but it's compared against an unsigned type, annoying gcc. Since a count should be positive, convert it to an unsigned type.	2021-10-06 14:56:00 +03:00
Pavel Emelyanov	db623c5f64	cdc: Replace db::config with generation_service::config This is to push the service towards general idea that each component should have its own config and db::config to stay in main. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 16:04:12 +03:00
Pavel Emelyanov	b879d3f3a5	cdc: Drop db::config from description_generator It only needs one for murmur3_partitioner_ignore_msb_bits value, provide it directly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 16:04:12 +03:00
Pavel Emelyanov	2e7364b94f	cdc: Remove all arguments from maybe_rewrite_streams_descriptions All of them are references taken from 'this', since the function is the generation_service method it can use 'this' directly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 16:04:12 +03:00
Pavel Emelyanov	6fe31d8eac	cdc: Move maybe_rewrite_streams_descriptions into after_join The generation service already has all it needs to do it. This keeps storage_service smaller and less aware about cdc internals. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 15:34:03 +03:00
Pavel Emelyanov	3b51c5c96a	cdc: Squash two methods into one The recently introduced make_new_generation() method just calls another one by passing more this->... stuff as arguments. Relax the flow by teaching the latter to use 'this' directly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 15:34:03 +03:00
Pavel Emelyanov	7a7a87f24a	cdc: Turn make_new_cdc_generation a service method It has everything needed onboard. Only two arguments are required -- the booststrap tokens and whether or not to inject a delay. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 15:34:03 +03:00
Pavel Emelyanov	b867a19da1	cdc: Remove ring-delay arg from make_new_cdc_generation It already has the db::config from where to get one (and even this will change soon). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 15:34:03 +03:00
Pavel Emelyanov	5e2a049266	cdc: Keep database reference on generation_service The service effectively depends on it when rewrites streams descriptions. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-30 15:34:03 +03:00
Avi Kivity	ed396a31f3	Merge "Remove global storage proxy from cdc" from Pavel E " There's a single call to get_local_storage_proxy in cdc code that needs to get database from. Furtunately, the database can be easily provided there via call argument. tests: unit(dev) " * 'br-remove-proxy-from-cdc' of https://github.com/xemul/scylla: cdc: Add database argument to is_log_for_some_table client_state: Pass database into has_access() client_state: Add database argument to has_schema_access client_state: Add database argument to has_keyspace_access() cdc: Add database argument to check_for_attempt_to_create_nested_cdc_log	2021-09-13 18:45:46 +03:00
Pavel Emelyanov	5515f7187d	range_tombstone, code: Add range_tombstone& getters Currently all the code operates on the range_tombstone class. and many of those places get the range tombstone in question from the range_tombstone_list. Next patches will make that list carry (and return) some new object called range_tombstone_entry, so all the code that expects to see the former one there will need to patched to get the range_tombstone from the _entry one. This patch prepares the ground for that by introdusing the range_tombstone& tombstone() { return *this; } getter on the range_tombstone itself and patching all future users of the _entry to call .tombstone() right now. Next patch will remove those getters together with adding the new range_tombstone_entry object thus automatically converting all the patched places into using the entry in a proper way. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-03 19:34:45 +03:00
Pavel Emelyanov	0fd00d7016	cdc: Add database argument to is_log_for_some_table All callers has been patched already. This argument can now be used to replace get_local_storage_proxy().get_db().local() call. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-08-27 14:07:26 +03:00
Pavel Emelyanov	fe8bc0757b	cdc: Add database argument to check_for_attempt_to_create_nested_cdc_log The only caller of it already has database argument, just pass it a bit further Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-08-27 14:07:18 +03:00
Benny Halevy	4439e5c132	everywhere: cleanup defer.hh includes Get rid of unused includes of seastar/util/{defer,closeable}.hh and add a few that are missing from source files. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-22 21:11:39 +03:00
Asias He	6350a19f73	compaction: Move compaction_strategy.hh to compaction dir The top dir is a mess. Move compaction_strategy.hh and compaction_strategy_type.hh to the new home.	2021-08-07 08:06:37 +08:00
Avi Kivity	e52ebe2da5	types: convert abstract_type::compare and related to std::strong_ordering Change comparators around types to std::strong_ordering. Ref #1449.	2021-07-28 13:19:24 +03:00
Calle Wilund	59555fa363	cdc: fix broken function signature in maybe_back_insert_iterator Fixes #9103 compare overload was declared as "bool" even though it is a tri-cmp. causes us to never use the speed-up shortcut (lessen search set), in turn meaning more overhead for collections. Closes #9104	2021-07-27 20:37:30 +03:00
Piotr Jastrzebski	c010cefc4d	cdc: Handle compact storage tables correctly When a table with compact storage has no regular column (only primary key columns), an artificial column of type empty is added. Such column type can't be returned via CQL so CDC Log shouldn't contain a column that reflects this artificial column. This patch does two things: 1. Make sure that CDC Log schema does not contain columns that reflect the artificial column from a base table. 2. When composing mutation to CDC Log, ommit the artificial column. Fixes #8410 Test: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Closes #8988	2021-07-12 12:17:35 +03:00
Tomasz Grabiec	06e373e272	sstables: index_reader: Keep index objects under LSA In preparation for caching index objects, manage them under LSA. Implementation notes: key_view was changed to be a view on managed_bytes_view instead of bytes, so it now can be fragmented. Old users of key_view now have to linearize it. Actual linearization should be rare since partition keys are typically small. Index parser is now not constructing the index_entry directly, but produces value objects which live in the standard allocator space: class parsed_promoted_index_entry; calss parsed_partition_index_entry; This change was needed to support consumers which don't populate the partition index cache and don't use LSA, e.g. sstable::generate_summary(). It's now consumer's responsibility to allocate index_entry out of parsed_partition_index_entry.	2021-07-02 19:02:14 +02:00
Kamil Braun	a3f3563828	storage_service: check for existing normal token owners before bootstrapping The bootstrap procedure starts by "waiting for range setup", which means waiting for a time interval specified by the `ring_delay` parameter (30s by default) so the node can receive the tokens of other nodes before introducing its own tokens. However it may sometimes happen that the node doesn't receive the tokens. There are no explicit checks for this. But the code may crash in weird ways if the tokens-received assuption is false, and we are lucky if it does crash (instead of, for example, allowing the node to incorrectly bootstrap, causing data loss in the process). Introduce an explicit check-and-throw-if-false: a bootstrapping node now checks that there's at least one NORMAL token in the token ring, which means that it had to have contacted at least one existing node in the cluster, which means that it received the gossip application states of all nodes from that node; in particular the tokens of all nodes. Also add an assert in CDC code which relies on that assumption (and would cause weird division-by-zero errors if the assumption was false; better to crash on assert than this). Ref #8889. Closes #8896	2021-06-24 13:19:08 +03:00
Pavel Solodovnikov	76bea23174	treewide: reduce header interdependencies Use forward declarations wherever possible. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Closes #8813	2021-06-07 15:58:35 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Solodovnikov	142d3b5ad9	cdc: self-sufficient headers fixup Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-06-06 19:18:49 +03:00
Kamil Braun	337a4ef8ad	cdc: when creating new generations, use format v2 if possible A node with this commit, when creating a new CDC generation (during bootstrap, upgrade, or when running checkAndRepairCdcStreams command) will check for the CDC_GENERATIONS_V2 feature and: - If the feature is enabled create the generation in the v2 format and insert it into the new internal table. This is safe because a node joins the feature only if it understands the new format. - Otherwise create it in the v1 format, limiting its size as before, and insert it into the old table. The second case should only happen if we perform bootstrap or run checkAndRepairCdcStreams in the middle of an upgrade procedure. On fully upgraded clusters the feature shall be enabled, causing all new generations to use the new format.	2021-05-25 16:07:23 +02:00
Kamil Braun	4d3870b24b	main: pass feature_service to cdc::generation_service	2021-05-25 16:07:23 +02:00
Kamil Braun	9c1a3180bb	cdc: introduce retrieve_generation_data This function given a generation ID retrieves its data from the internal table in which the data resides. This depends on the version of the ID: for _v1 we're using system_distributed.cdc_generation_descriptions, for _v2 we're using the better system_distributed_v2.cdc_generation_descriptions_v2 (see the previous commit for detailed explanation of the superiority of the new table).	2021-05-25 16:07:23 +02:00

1 2 3 4 5 ...

303 Commits