scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-27 11:55:15 +00:00

Author	SHA1	Message	Date
Piotr Sarna	5ef9dbfa8a	test: move config to heap in schema_registry_test ... in order to get rid of a large stack warning. Tests: unit(dev) Message-Id: <82b55e8440ade8a3d81880dd66127776b2661112.1585128726.git.sarna@scylladb.com>	2020-03-25 14:19:30 +01:00
Nadav Har'El	a0f025f4ce	sstable: LA format is the default, so ignore "LA_SSTABLE" feature flag The previous patch made the LA format the default. We no longer need to choose between writing the older KA format or LA, so the LA_SSTABLE cluster feature has became unnecessary. Unfortunately, we cannot completely remove this feature: Since commit `4f3ce42163` we cannot remove cluster features because this node will refuse to join a cluster which already agreed on features that it lacks - thinking it is an old node trying to join a new cluster. So the LA_SSTABLE feature flag remains, and we continue to advertise that our node supports it. We just no longer care about what other nodes advertised for it, so we can remove a bit of code that cared. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200324232607.4215-3-nyh@scylladb.com>	2020-03-25 13:00:28 +01:00
Nadav Har'El	91aba40114	sstable: default to LA format instead of KA format Over the years, Scylla updated the sstable format from the KA format to the LA format, and most recently to the MC format. On a mixed cluster - as occurs during a rolling upgrade - we want all the nodes, even new ones, to write sstables in the format preferred by the old version. The thinking is that if the upgrade fails, and we want to downgrade all nodes back to the older version, we don't want to lose data because we already have too-new sstables. So the current code starts by selecting the oldest format we ever had - KA, and only switching this choice to LA and MC after we verify that all the nodes in the cluster support these newer formats. But before an agreement is reached on the new format, sstables may already be created in the antique KA format. This is usually harmless - we can read this format just fine. However, the KA format has a problem that it is unable to represent table names or keyspaces with the "-" character in them, because this character is used to separate the keyspace and table names in the file name. For CQL, a "-" is not allowed anyway in keyspace or table names; But for Alternator, this character is allowed - and if a KA table happens to be created by accident (before the LA or MC formats are chosen), it cannot be read again during boot, and Scylla cannot reboot. The solution that this patch takes is to change Scylla's default sstable format to LA (and, as before, if the entire cluster agrees, the newer MC format will be used). From now on, new KA tables will never be written. But we still fully support reading the KA format - this is important in case some very old sstables never underwent compaction. The old code had, confusingly, two places where the default KA format was chosen. This patch fixes is so the new default (LA) is specified in only one place. Fixes #6071. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200324232607.4215-2-nyh@scylladb.com>	2020-03-25 13:00:28 +01:00
Rafael Ávila de Espíndola	eca0ac5772	everywhere: Update for deprecated apply functions Now apply is only for tuples, for varargs use invoke. This depends on the seastar changes adding invoke. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200324163809.93648-1-espindola@scylladb.com>	2020-03-25 08:49:53 +02:00
Avi Kivity	088660680c	Update seastar submodule * seastar 92c488706...c7b6b84e5 (6): > semaphore: Use futurize_invoke instead of futurize_apply > future: specify futurize::make_exception_future as noexcept > future: Move ignore out of line > future: Split then and then_impl to enable NRVO > semaphore_units: allow getting the number of units held > Merge "Split futurize::apply into invoke(...) and apply(tuple)" from Rafael	2020-03-25 08:48:00 +02:00
Botond Dénes	0418a74fa9	querier: consume_page(): resolve FIXME related to non-movable consumer Now that #3158 is fixed, we can move the consumer to its place after the `compaction_mutation_state::start_new_page()` call. No need to keep it as `std::unique_ptr<>`. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200310185147.207665-1-bdenes@scylladb.com>	2020-03-24 15:28:42 +02:00
Avi Kivity	a314283469	Merge "Minor cleanups to cql3 code regarding shared_ptr's" from Pavel S " This small series consists of several changes that aim to reduce the number of shared_ptr's in cql3 code. Also it contains a patch that makes CqlParser::query to return std::unique_ptr<> instead of seastar::shared_ptr<>, which leads to more understandable code and lays foundation for further optimizations (e.g. possibly eliminating shared_ptr's in `prepared_statement` and just moving raw statements in `prepare` without copying them). Tests: unit(dev, debug) " * 'feature/cql_cleanups_9' of https://github.com/ManManson/scylla: cql3: return raw::parsed_statement as unique_ptr cql3: de-pointerize arguments to some of CQL grammar rules and definitions. cql3: make abstract_marker::make_in_receiver accept cref to column_specification	2020-03-24 14:51:49 +02:00
Calle Wilund	9fee712d62	db::commitlog: Don't write trailing zero block unless needed Fixes #5899 When terminating (closing) a segment, we write a trailing block of zero so reader can have an empty region after last used chunk as end marker. This is due to using recycled, pre-allocated segments with potentially non-zero data extending over the point where we are ending the segment (i.e. we are not fully filling the segment due to a huge mutation or similar). However, if we reach end of segment writing the final block (typically many small mutations), the file will end naturally after the data written, and any trailing zero block would in fact just extend the file further. While this will only happen once per segment recycled (independent on how many times it is recycled), it is still both slightly breaking the disk usage contract and also potentially causing some disk stalls due to metadata changes (though of course very infrequent). We should only write trailing zero if we are below the max_size file size when terminating Adds a small size check to commitlog test to verify size bounds. (Which breaks without the patch) v2: - Fix test to take into account that files might be deleted behind our backs. v3: - Fix test better, by doing verification _before_ segments are queued for delete. Message-Id: <20200226121601.15347-2-calle@scylladb.com> Message-Id: <20200324100235.23982-1-calle@scylladb.com>	2020-03-24 11:31:55 +01:00
Pavel Solodovnikov	adc6a98b59	cql3: return raw::parsed_statement as unique_ptr Change CQL parsing routine to return std::unique_ptr instead of seastar::shared_ptr. This can help reduce redundant shared_ptr copies even further. Make some supplementary changes necessary for this transition: * Remove enabled_shared_from_this base class from the following classes: truncate_statement, authorization_statement, authentication_statement: these were previously constructing prepared_statement instance in `prepare` method using `shared_from_this`. Make `prepare` methods implementation of inheriting classes mirror implementation from other statements (i.e. create a shallow copy of the object when prepairing into `prepared_statement`; this could be further refactored to avoid copies as much as possible). * Remove unused fields in create_role_statement which led to error while using compiler-generated copy ctor (copying uninitialied bool values via ctor). Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-03-23 23:19:21 +03:00
Pavel Solodovnikov	df1d687fc6	cql3: de-pointerize arguments to some of CQL grammar rules and definitions. Make the following rules and definitions accept a reference instead of shared_ptr's: * cfamDefinition * cfamColumns * pkDef * typeColumns * ksName * cfName * idxName * properties * property This will reduce a bit the number of countless shared_ptr copies and moves all over the place in cql3 code. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-03-23 23:19:21 +03:00
Pavel Solodovnikov	279b52f275	cql3: make abstract_marker::make_in_receiver accept cref to column_specification These methods just extract some info out of column_specification, so no need have another copy of shared_ptr since it's not stored anywhere inside. Transform abstract_marker::in_raw::make_in_receiver as well following the call chain. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-03-23 23:19:21 +03:00
Nadav Har'El	f1aaa91e21	merge: add metrics Merged pull request https://github.com/scylladb/scylla/pull/6030 from Piotr Dulikowski: Adds CDC-related metrics. Following counters are added, both for total and failed operations: Total number of CDC operations that did/did not perform splitting, Total number of CDC operations that touched a particular mutation part. Total number of preimage selects. Fixes #6002. Tests: unit(dev, debug) * 'cdc-metrics' of github.com:piodul/scylla: storage_proxy: track CDC operations in LWT flow storage_proxy: track CDC operations in logged batches storage_proxy: track CDC operations in standard flow storage_proxy: add cdc tracker hooks to write response handlers storage_proxy: move "else if" remainder into "else" block cdc: create an operation_result_tracker object cdc: add an object for tracking progress of cdc mutations cdc: count touched mutation parts in transformer::transform cdc: track preimage selects in metrics cdc: register metric counters cdc: fix non-atomic updates in splitting	2020-03-23 21:55:58 +02:00
Botond Dénes	ec36c7cb2f	test: random_schema: remove redundant gc grace period from tombstone expiry Compaction automatically adds gc grace period to expiry times already, no need to add it when creating the tombstones. Remove the redundant additions form the code. The direct impact is really minor as this is only used in tests, but it might confuse readers who are looking at how tombstones are created across the codebase. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200323120948.92104-1-bdenes@scylladb.com>	2020-03-23 15:12:25 +02:00
Piotr Dulikowski	736c1c6056	storage_proxy: track CDC operations in LWT flow Register cdc operation result tracker during LWT flow.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	f7fd6f4607	storage_proxy: track CDC operations in logged batches Register cdc operation result tracker in logged batch flow.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	ef1c62aa04	storage_proxy: track CDC operations in standard flow Register cdc operation result tracker for write response handlers coming from the usual write requests.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	cccc33f0fd	storage_proxy: add cdc tracker hooks to write response handlers Adds a field to abstract_write_response_handler that points to the cdc operation result tracker, and a function for registering the tracker in the handlers that currently write to a CDC log table.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	dc05d30fd3	storage_proxy: move "else if" remainder into "else" block In the following commit, more code will be added to the newly created "else" block.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	5a5cc57878	cdc: create an operation_result_tracker object An `operation_result_tracker` object is now returned as a second return value from the `augment_mutation_call` function.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	1b92cbeabe	cdc: add an object for tracking progress of cdc mutations CDC metrics, apart from tracking "total" metrics for all performed CDC operations, also track metrics for "failed" operations. Because the result of the CDC operation depends on whether all CDC mutations were written successfully by storage_proxy, checking for failure and incrementing appropriate counters is deferred after all write response handlers finish. The `cdc::operation_result_tracker` object was created for that purpose. It contains all the details needed to accurately update the metrics based on what actually happened in the `augment_mutation_call` function, and holds a flag which tells if any of write response handlers failed. This object is supposed to be referenced by write response handlers for CDC mutations created after the same `augment_mutation_call`. After all write response handlers are destroyed, the destructor of `operation_result_tracker` will update appropriate metrics. Actual creating and attaching this object to write response handlers will be done in subsequent commits.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	98e5fdc7ac	cdc: count touched mutation parts in transformer::transform Modifies the transformer::transform so that it also returns a set of flags indicating what parts of the mutation (e.g. rows, tombstones, collections, etc.) were processed during transforming.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	53570d8657	cdc: track preimage selects in metrics This commit causes preimage select counter to be increased after performing this operation.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	e7062de02b	cdc: register metric counters This patch defines a CDC metrics object and registers all of its counters. storage_proxy is chosen as the owner of the metrics object. Because in subsequent commits it will become possible for CDC metrics to be updated after a write operation ends, and because the cdc_service has shorter lifetime than storage_proxy, we could risk a use-after-free if we placed this object inside cdc_service.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	338e473946	cdc: fix non-atomic updates in splitting This patch fixes a bug in mutation splitting logic of CDC. In the part that handles updates of non-atomic clustering columns, the column definition was fetched from a static column of the same id instead of the actual definition of the clustering column. It could cause the value to be written to a wrong column. Tests: unit(dev)	2020-03-23 13:47:23 +01:00
Ivan Prisyazhnyy	5ec7e77b2e	api: /column_family/major_compaction/{keyspace:table} implementation This implements support for triggering major compations through the REST API. Please note that "split_output" is not supported and Glauber Costa confirmed this this is fine: "We don't support splits, nor do I think we should." Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com>	2020-03-23 13:48:29 +02:00
Avi Kivity	0d885dbb00	Merge "Make all headers standalone" from Botond " Make sure all headers compile on their own, without requiring any additional includes externally. Even though this requirement is not documented in our coding guides it is still quasi enforced and we semi-regularly get and merge patches adding missing includes to headers. This patch-set fixes all headers and adds a `{mode}-headers` target that can be used to verify each header. This target should be built by promotion to ensure no new non-conforming code sneaks in. Individual headers can be verified using the `build/dev/path/to/header.hh.o` target, that is generated for every header. The majority of the headers was just missing `seastarx.hh`. I think we should just include this via a compiler flag to remove the noise from our code (in a followup). " * 'compiling-headers/v2' of https://github.com/denesb/scylla: configure.py: add {mode}-headers phony target treewide: add missing headers and/or forward declarations test/boost/sstable_test.hh: move generic stuff to test/lib/sstable_utils.hh sstables: size_tiered_backlog_tracker: move methods out-of-line sstables: date_tiered_compaction_strategy.hh: move methods out-of-line	2020-03-23 13:09:09 +02:00
Avi Kivity	c6a441f9c2	Update seastar submodule * seastar 3c498abcab...92c488706c (14): > dpdk: restore including reactor.hh > tests: distributed_test: add missing #include <mutex> > reactor: un-static-ify make_pollfn() > merge: Reduce inclusions of reactor.hh A few #includes added to compensate for this > sharded: delete move constructor > future: Avoid a move constructor call > future: Erase types a bit more in then_wrapped > memory: Drop a never nullopt optional > semaphore: specify get_units and with_semaphore as noexcept > spinlock.hh: Add include for <cassert> header > dpdk: Avoid a variable sized array > future: Add an explicit promise member to continuation > net: remove smart pointer wrappers around pollable_fd > Merge "cleanup reactor file functions" from Benny	2020-03-23 11:59:30 +02:00
Piotr Dulikowski	a693e6ff6c	cdc: fix non-atomic updates in splitting This patch fixes a bug in mutation splitting logic of CDC. In the part that handles updates of non-atomic clustering columns, the schema for serializing that column was looked up incorrectly in the table schema - instead of a `regular_column`, a `static_column` was looked up. Due to how the `column_at` function works, a correct schema was always returned if the table had no static columns. Therefore, in order for this bug to manifest, a table with a static column and a regular column with non-atomic collection was needed.	2020-03-23 10:20:24 +01:00
Piotr Sarna	602a771105	Merge 'utils: error injector API' from Alejo Closes #3295 The error_injection class allows injecting custom handlers into normal control flow at the pre-determined injection points. This is especially useful in various testing scenarios: * Throwing an exception at some rare and extreme corner-cases * Injecting a delay to test for timeouts to be handled correctly * More advanced uses with custom lambda as an injection handler Injection points are defined by `inject` calls. Enabling and disabling injections are done by the corresponding `enable` and `disable` calls. REST frontend APIs is provided for convenience. Branch URL: https://github.com/alecco/scylla/tree/as_error_injection Tests: unit {{dev}}, unit {{debug}} * 'as_error_injection' of github.com:alecco/scylla: api: add error injection to REST API utils: add error injection	2020-03-23 08:39:22 +01:00
Botond Dénes	5174acb359	configure.py: add {mode}-headers phony target	2020-03-23 09:29:45 +02:00
Botond Dénes	e0284bb9ee	treewide: add missing headers and/or forward declarations	2020-03-23 09:29:45 +02:00
Botond Dénes	575466b2cf	test/boost/sstable_test.hh: move generic stuff to test/lib/sstable_utils.hh sstable_test.hh started as collection of utilities shared between the various `_sstable_test.cc` files. Predictably other tests started using it as well, among them some that are non boost unit tests. This poses a problem as if we add the missing boost/test/unit_test.hpp include to sstable_test.hh these tests will suddenly have missing symbols from boost::test. To avoid linking boost::test into all these users, extract utilities more widely used into sstable_utils.hh	2020-03-23 09:29:45 +02:00
Botond Dénes	84329a16ee	sstables: size_tiered_backlog_tracker: move methods out-of-line	2020-03-23 09:29:45 +02:00
Botond Dénes	d58ec632e3	sstables: date_tiered_compaction_strategy.hh: move methods out-of-line	2020-03-23 09:26:19 +02:00
Glauber Costa	dd65f7dcbb	tests: move token_generation_for_shard to common code We now have a utils file for SSTables. This is potentially useful for other tests. As a matter of fact, this function is repeated right now for the resharding test. And to add insult to injury, the version in the resharding test has the parameters shard and number of tokens flipped, which although extremely confusing is the predictable outcome of such repetition Signed-off-by: Glauber Costa <glauber@scylladb.com>	2020-03-22 19:00:26 +02:00
Asias He	be1a196988	repair: Handle keyspace with zero table The following error was seen in materialized_views_test.py:TestMaterializedViews.decommission_node_during_mv_insert_4_nodes_test INFO [shard 0] repair - repair id 3 to sync data for keyspace=ks, status=started repair/repair.cc:662:36: runtime error: member call on null pointer of type 'const struct schema' Aborting on shard 0. The problem is in the test a keyspace was created without creating any table. Since db19a76b1f(selective_token_range_sharder: stop calling global_partitioner()), in get_partitioner_for_tables, we access nullptr when no table is present. schema_ptr last_s; for (auto t: tables) { // set last_s } last_s->get_partitione() To fix: 1) Skip the repair in sync_data_using_repair if there is no table in the keyspace 2) Throw if no schema_ptr is found in get_partitioner_for_tables. Be defensive. After: INFO [shard 0] repair - decommission_with_repair: started with keyspace=ks, leaving_node=127.0.0.2, nr_ranges=744 INFO [shard 0] repair - repair id 3 to sync data for keyspace=ks, status=started WARN [shard 0] repair - repair id 3 to sync data for keyspace=ks, no table in this keyspace INFO [shard 0] repair - repair id 3 completed successfully INFO [shard 0] repair - repair id 3 to sync data for keyspace=ks, status=succeeded Tests: materialized_views_test.py:TestMaterializedViews.decommission_node_during_mv_insert_4_nodes_test Fixes: #6022	2020-03-22 13:46:36 +02:00
Avi Kivity	d310e7c7ea	Merge 'repair: Ignore keyspace that is removed in sync_data_using_repair' from Asias repair: Ignore keyspace that is removed in sync_data_using_repair When a keyspace is removed during node operations, we should not fail the whole operation. Ignore the keyspace that is removed. Fixes #5942 * asias-repair_fix_5942: repair: Stop the nodes that have run repair_row_level_start repair: Ignore keyspace that is removed in sync_data_using_repair	2020-03-22 13:19:51 +02:00
Takuya ASADA	005211bad6	redis: add lolwut command Add lolwut command that shows redis version and ascii art. see: https://redis.io/commands/lolwut	2020-03-22 13:16:20 +02:00
Takuya ASADA	2ab366e653	install.sh: create user/group correctly on redhat variants Seems like adduser in redhat variants and deiban variants are incompatible, and there is no addgroup in redhat variants. Since adduser in install.sh is implemented on debian variants, does not work on redhat compatible. To fix this we need to use 'useradd' / 'groupadd' instead. Fixes #6018	2020-03-22 13:13:00 +02:00
Avi Kivity	7ed083a6a7	Merge "test.py: Allow to change the tests starting order" from Pavel E " In debug mode some tests take veeery looong time to finish, those tests are better to be started first. This set adds this by marking such long tests in suite.yaml files. Tests: unit(dev) " * 'br-split-unit-tests-sorting-2' of https://github.com/xemul/scylla: test.py: Mark some tests as "run_first" test.py: Generate list with short names test.py: Rename "long" to "skip_in_debug_mode"	2020-03-21 19:53:23 +02:00
Rafael Ávila de Espíndola	482fbfcfdb	build: Use more strict stack frame limits A recent seastar update has resolved the worse offenders, so we can lower the limit a bit to warn on the next set of functions. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200317183209.1664860-1-espindola@scylladb.com>	2020-03-21 19:51:57 +02:00
Rafael Ávila de Espíndola	01ac4aef3a	everywhere: Use futurize_apply instead of futurize<void>::apply No functionality change, just simpler. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200318234149.283090-1-espindola@scylladb.com>	2020-03-21 19:51:38 +02:00
Rafael Ávila de Espíndola	0d7281ca06	sstable: Move sstables_manager constructor out of line There is no reason to have it in a header. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200320005225.178381-1-espindola@scylladb.com>	2020-03-21 19:47:29 +02:00
Piotr Dulikowski	6c5c745e25	cdc: add cdc log schema test	2020-03-21 07:33:35 +01:00
Piotr Dulikowski	3bfb044bf1	cdc: do not create cdc$deleted columns for pks and cks Primary key and clustering key column should not have a corresponding "cdc$deleted_<name>" column in cdc log table, because it does not make sense to delete such a column from a row. Fixes: #6049 Tests: unit(dev)	2020-03-21 07:33:23 +01:00
Pekka Enberg	6b2cd1bd7d	Revert "db::commitlog: Don't write trailing zero block unless needed" This reverts commit `0b34d88957`. According to Rafael Avila de Espindola: "I have bisected the recent failures [in commitlog_test] on next to this patch."	2020-03-20 22:30:58 +02:00
Pekka Enberg	12b6092ac2	Revert "sstables: Fix incorrect calculation of Compaction Backlog" This reverts commit `458ef4bb06`. According to Glauber Costa: "It may give us the illusion that fixes something for a particular case but this fix is wrong. I am trying to help Raphael figure out why the backlog is wrong but this patch is not the answer."	2020-03-20 22:28:57 +02:00
Piotr Sarna	331ddf41e5	api: add error injection to REST API Simple REST API for error injection is implemented. The API allow the following operations: * injecting an error at given injection name * listing injections * disabling an injection * disabling all injections Currently the API enables/disables on all shards. Closes #3295 Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-03-20 20:49:03 +01:00
Pavel Solodovnikov	057adc8b4d	utils: add error injection Error injection class is implemented in order to allow injecting various errors (exceptions, stalls, etc.) in code for testing purposes. Error injection is enabled via compile flag SCYLLA_ENABLE_ERROR_INJECTION TODO: manage shard instances Enable error injection in debug/dev/sanitize modes. Unit tests for error injection class. Closes #3295 Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-03-20 19:37:48 +01:00
Rafael Ávila de Espíndola	9445608df6	gms: Add a default constructor to feature_config Also move it out of line while at it. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200316180321.45914-1-espindola@scylladb.com>	2020-03-20 13:34:26 +01:00

1 2 3 4 5 ...

21671 Commits