scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-30 05:07:05 +00:00

Author	SHA1	Message	Date
Kamil Braun	774ef653b1	test: raft: randomized_nemesis_test: move `ticker` to its own header	2021-07-13 11:15:25 +02:00
Kamil Braun	a45e8e0db0	test: raft: randomized_nemesis_test: ticker: take `logger` as a constructor parameter Remove the global dependency on `tlogger`.	2021-07-13 11:15:25 +02:00
Kamil Braun	21b5a6d9f7	test: raft: logical_timer: handle immediate timeout If the user calls `with_timeout` with a time point that's already been reached, we return `timed_out_error` immediately.	2021-07-13 11:15:25 +02:00
Kamil Braun	ed8e9a564a	test: raft: logical_timer: on timeout, return the original future in the exception More specifically, return a future which is equivalent to the original future (when the original future resolves, this future will contain its result). Thus we don't discard the future, the user gets it back. Let them decide what to do with it.	2021-07-13 11:15:25 +02:00
Kamil Braun	c86ff1eb7c	test: raft: logical_timer: add `schedule` member function It allows scheduling the given function to be called at the given logical time point.	2021-07-13 11:15:25 +02:00
Kamil Braun	cf0d503a92	test: raft: randomized_nemesis_test: move `logical_timer` to its own header	2021-07-13 11:15:25 +02:00
Kamil Braun	9f5eeec56a	test: raft: include the leader's ID in the `not_a_leader` exception's message	2021-07-13 11:15:25 +02:00
Tomasz Grabiec	e947fac74c	database: Fix cache metrics not being registered Introduced in `6a6403d`. The default constructor with dummy_app_stats is also used by production code. Fixes #9012 Message-Id: <20210712221447.71902-1-tgrabiec@scylladb.com>	2021-07-13 07:50:44 +03:00
Avi Kivity	058afbcee8	build: re-enable -Wmisleading-indentation This can catch mismatches between visual indication about control flow and what the compiler actually does. Looks like boost cleaned up its indentation since it was disabled in `7f38634080` ("dist/debian: Switch to g++-7/boost-1.63 on Ubuntu 14.04/16.04"). It's unlikely to pop back since modern compilers enable it by default. Closes #9015	2021-07-12 22:29:19 +03:00
Avi Kivity	8fb4fe2f24	Merge "reader_concurrency_semaphore: relax on destroy stop checks" from Botond " Currently we `assert(_stopped)` in the destructor, but this is too harsh, especially on freshly created semaphore instances that weren't even used yet. This basically mandates semaphores to be initialized at the end of the constructor body, which is very cumbersome. Further to that, this series relaxes the checks on destroying an unstopped previously (but not currently) used semaphore. As destroying such a semaphore without stop is risky an error is still logged. Tests: unit(dev) " * 'reader-concurrency-semaphore-relax-stop-check/v1' of https://github.com/denesb/scylla: reader_concurrency_semaphore: relax _stopped check when destroying a used semaphore reader_concurrency_semaphore: allow destroying without stop() when not used yet reader_concurrency_semaphore: add permit-stats	2021-07-12 20:07:01 +02:00
Nadav Har'El	f540a69a82	Update tools/java submodule * tools/java 5013321823...79a441972d (2): > Add Zstd compressor > Settings Schema: fix typo in settings printing Adding the Zstd compressor fixes #8887.	2021-07-12 20:07:00 +02:00
Avi Kivity	4d48e1e9e1	build: avoid sanitize/coverage builds in multi-mode targets The default target (i.e. what gets executed under "ninja") excludes sanitize and coverage modes (since they're useful in special cases only), but the other multi-mode targets such as "ninja build" do not. This means that two extra modes are built. Make things consistent by always using default_modes (which default to release,dev,debug). This can be overriden using the --mode switch to configure.py. Closes #8775	2021-07-12 20:07:00 +02:00
Botond Dénes	f8004c652b	reader_concurrency_semaphore: relax _stopped check when destroying a used semaphore Further relax the conditions under which we abort on destroying a unstopped semaphore. We already allow destroying completely unused semaphores, this patch further relaxes this to allow destroying formerly used but presently not used semaphores without stopping. We still call `on_internal_error_noexcept()` even if destroying the semaphore is safe, because without calling `stop()`, destroying the semaphore depends on luck, which we shouldn't rely on.	2021-07-12 15:53:00 +03:00
Botond Dénes	750b20fd85	reader_concurrency_semaphore: allow destroying without stop() when not used yet To make it easier to construct objects with semaphore members. When the construction of such object fails, they can now just destroy their semaphore member as usual, without having to employ tricks to make sure its is stopped before.	2021-07-12 15:53:00 +03:00
Botond Dénes	03959a332b	reader_concurrency_semaphore: add permit-stats Which stores permit related stats. For now only total number of permits is maintained which is useful to determine whether the semaphore was used already or not.	2021-07-12 15:53:00 +03:00
Nadav Har'El	3fda13e20e	cql-pytest: fix sporadic failure in over-zealous TTL test We have been seeing rare failures of the cql-pytest (translated from Cassandra's unit tests) for testing TTL in secondary indexes: cassandra_tests/validation/entities/secondary_index_test.py::testIndexOnRegularColumnInsertExpiringColumn The problem is that the test writes an item with 1 second TTL, and then sleeps exactly 1.0 seconds, and expects to see the item disappear by that time. Which doesn't always happen: The problem with that assumption stems from Scylla's TTL clock ("gc_clock") being based on Seastar's lowres clock. lowres_clock only has a 10ms "granularity": The time Scylla sees when deciding whether an item expires may be up to 10ms in the past - the arbitrary point when the lowres timer happened to last run. In rare overload cases, the inaccuracy may be even grater than 10ms (if the timer got delayed by other things running). So when Scylla is asked to expire an item in 1 second - we cannot be sure it will be expired in exactly 1 second or less - the expiration can be also around 10ms later. So in this patch we change the test to sleep with more than enough margin - 1.1 seconds (i.e., 100ms more than 1 second). By that time we're sure the item must have expired. Before this patch, I saw the test failing once every few hundred runs, after this patch I ran if 2,000 times without a single failure. Fixes #9008 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210712100655.953969-1-nyh@scylladb.com>	2021-07-12 13:48:21 +03:00
Yaron Kaikov	aa7c40ba50	dist: build docker based on ubuntu 20.04 OS Today our docker image is based on Centos7 ,Since centos will be EOL in 2024 and no longer has stable release stream. let's move our docker image to be based on ubuntu 20.04 Based on the work done in https://github.com/scylladb/scylla/pull/8730, let's build our docker image based on local packages using buildah Closes #8849	2021-07-12 13:32:03 +03:00
Piotr Jastrzebski	c010cefc4d	cdc: Handle compact storage tables correctly When a table with compact storage has no regular column (only primary key columns), an artificial column of type empty is added. Such column type can't be returned via CQL so CDC Log shouldn't contain a column that reflects this artificial column. This patch does two things: 1. Make sure that CDC Log schema does not contain columns that reflect the artificial column from a base table. 2. When composing mutation to CDC Log, ommit the artificial column. Fixes #8410 Test: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Closes #8988	2021-07-12 12:17:35 +03:00
Nadav Har'El	2cc8c40c07	Merge 'Fix some issues found by gcc 11' from Avi Kivity This series fixes some issues that gcc 11 complains about. I believe all are correct errors from the standard's view. Clang accepts the changed code. Note that this is not enough to build with gcc 11, but it's a start. Closes #9007 * github.com:scylladb/scylla: utils: compact-radix-tree: detemplate array_of<> utils: compact-radix-tree: don't redefine type as member raft: avoid changing meaning of a symbol inside a class cql3: lists: catch polymorphic exceptions by reference	2021-07-12 11:17:57 +03:00
Avi Kivity	4433178ccb	utils: exceptions: convert sprint() to format() sprint() is type-unsafe, and we are converging on format(). Convert exceptions.hh to format(). Closes #9006	2021-07-12 11:17:57 +03:00
Avi Kivity	29c9570556	utils: compact-radix-tree: detemplate array_of<> The radix tree template defines a nested class template array_of; both a generic template and a fully specialized version. However, gcc (I believe correctly) rejects the fully specialized template that happens to be a member of another class template. As it happens, we don't really need a template here at all. Define a non-template class for each of the cases we need, and use std::conditional_t to select the type we need.	2021-07-11 18:16:21 +03:00
Avi Kivity	f576ecb7cc	utils: compact-radix-tree: don't redefine type as member The `direct_layout` and `indirect_layout` template classes accept a template parameter named `Layout` of type `layout`, and re-export `Layout` as a static data member named `layout`. This redefinition of `layout` is disliked by gcc. Fix by renaming the static data member to `this_layout` and adjust all references.	2021-07-11 18:16:21 +03:00
Avi Kivity	332b5c395f	raft: avoid changing meaning of a symbol inside a class The construct struct q { a a; }; Changes the meaning of `a` from a type to a data member. gcc dislikes it and I agree. Fully qualify the type name to avoid an error.	2021-07-11 18:16:21 +03:00
Avi Kivity	cb8ef1489b	cql3: lists: catch polymorphic exceptions by reference gcc 11 notes that catching polymorphic exceptions is a bad idea; the resulting copy can slice the exception object. Fix by capturing by reference.	2021-07-11 17:34:43 +03:00
Avi Kivity	222ef17305	build, treewide: enable -Wredundant-move Returning a function parameter guarantees copy elision and does not require a std::move(). Enable -Wredundant-move to warn us that the move is unneeded, and gain slightly more readable code. A few violations are trivially adjusted. Closes #9004	2021-07-11 12:53:02 +03:00
Dejan Mircevski	7119730f2d	cql3: Don't look for indexed column in CK prefix When creating an index-table query, we form its clustering-key restrictions by picking the right restrictions from the WHERE clause. But we skip the indexed column, which isn't in the index-table clutering key. This is, however, both incorrect and unnecessary: It is incorrect because we compare the column IDs from different schemas (indexed table vs. base table). We should instead be comparing column names. It is unnecessary because this code is only executed when the whole partition key plus a clustering prefix is specified in the WHERE clause. In such cases, the index cannot possibly be on a member of the clustering prefix, as such a query would be satisfied out of the base table. Therefore, it is redundant to check for the indexed table among the CK prefix elements. A careful reader will note that this check was first introduced to fix the issue #7888 in commit `0bd201d`. But it now seems to me that that fix was misguided. The root problem was the old code miscalculating the clustering prefix by including too many columns in it; it should have stopped before reaching the indexed column. The new code, introduced by commit `845e36e76`, calculates the clustering prefix correctly, never reaching the indexed column. (Details, for the curious: the old code invoked clustering_key_restrictions::prefix_size(), which is buggy -- it doesn't check the restriction operator. It will, for instance, calculate the prefix of `c1=0 AND c2 CONTAINS 0 AND c3=0` as 3, because it restricts c1, c2, and c3. But the correct prefix is clearly 1, because c2 is not restricted by equality.) Tests: unit (dev, debug) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #8993	2021-07-08 21:39:38 +03:00
Avi Kivity	9059514335	build, treewide: enable -Wpessimizing-move warning This warning prevents using std::move() where it can hurt - on an unnamed temporary or a named automatic variable being returned from a function. In both cases the value could be constructed directly in its final destination, but std::move() prevents it. Fix the handful of cases (all trivial), and enable the warning. Closes #8992	2021-07-08 17:52:34 +03:00
Avi Kivity	fe4002c6c4	Update seastar submodule * seastar eaa00e761f...8ed9771ae9 (5): > *: drop prometheus protobuf support > reactor: Fix calculations of bandwidth in legacy mode > gate: add gate::holder > Revert "gate: add gate::holder", does not build. > gate: add gate::holder	2021-07-08 17:42:39 +03:00
Avi Kivity	f756f34392	Merge "Add scylla-bench datasets to perf_fast_forward" from Tomasz " After this series one can use perf_fast_forward to generate the data set. It takes a lot less time this way than to use scylla-bench. " * 'perf-fast-forward-scylla-bench-dataset' of github.com:tgrabiec/scylla: tests: perf_fast_forward: Use data_source::make_ck() tests: perf_fast_forward: Move declaration of clustered_ds up tests: perf_fast_forward: Make scylla_bench_small_part_ds1 not included by default tests: perf_fast_forward: Add data sets which conform to scylla-bench schema	2021-07-08 17:33:30 +03:00
Nadav Har'El	d0546a9bb5	cql-pytest: improve README This patch adds to cql-pytest/README.md a paragraph on where run / run-cassandra expect to find Scylla or Cassandra, and how to override that choice. Also make a couple of trivial formatting changes. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210708142730.813660-1-nyh@scylladb.com>	2021-07-08 17:29:20 +03:00
Avi Kivity	4f1e21ceac	Merge "reader_concurrency_semaphore: get rid of global semaphores" from Botond " When obtaining a valid permit was made mandatory, code which now had to create reader permits but didn't have a semaphore handy suddenly found itself in a difficult situation. Many places and most prominently tests solved the problem by creating a thread-local semaphore to source permits from. This was fine at the time but as usual, globals came back to haunt us when `reader_concurrency_semaphore::stop()` was introduced, as these global semaphores had no easy way to be stopped before being destroyed. This patch-set cleans up this wart, by getting rid of all global semaphores, replacing them with appropriately scoped local semaphores, that are stopped after being used. With that, the FIXME in `~reader_concurrency_semaphore()` can be resolved and we an finally `assert()` that the semaphore was stopped before being destroyed. This series is another preparatory one for the series which moves the semaphore in front of the cache. tests: unit(dev) " * 'reader-concurrency-semaphore-mandatory-stop/v2' of https://github.com/denesb/scylla: (26 commits) reader_concurrency_semaphore: assert(_stopped) in the destructor test/lib: remove now unused reader_permit.{hh,cc} test/boost: migrate off the global test reader semaphore test/manual: migrate off the global test reader semaphore test/unit: migrate off the global test reader semaphore test/perf: migrate off the global test reader semaphore test/perf: perf.hh: add reader_concurrency_semaphore_wrapper test/lib: migrate off the global test reader semaphore test/lib/simple_schema: migrate off the global test reader semaphore test/lib/sstable_utils: migrate off the global test reader semaphore test/lib/test_services: migrate off the global test reader semaphore test/lib/sstable_test_env: add reader_concurrency_semaphore member test/lib/cql_test_env: add make_reader_permit() test/lib: add reader_concurrency_semaphore.hh test/boost/sstable_test: migrate row counting tests to seastar thread test/boost/sstable_test: test_using_reusable_sst(): pass env to func test/lib/reader_lifecycle_policy: add permit parameter to factory function test/boost/mutation_reader_test: share permit between readers in a read memtable: migrate off the global reader concurrency semaphore mutation_writer: multishard_writer: migrate off the global reader concurrency semaphore ...	2021-07-08 17:28:13 +03:00
Botond Dénes	42bd5c980f	reader_concurrency_semaphore: assert(_stopped) in the destructor Now that there are no more global semaphore which are impossible to stop properly we can resolve the related FIXME and arm the assert in the semaphore destructor. We can also remove all the other cleanup code from the destructor as they are taken care of by stop(), which we now assert to have been run.	2021-07-08 16:53:38 +03:00
Botond Dénes	6b941c4d34	test/lib: remove now unused reader_permit.{hh,cc} Finally getting rid of the global test reader concurrency semaphore.	2021-07-08 16:53:38 +03:00
Botond Dénes	2d2b9e7b36	test/boost: migrate off the global test reader semaphore	2021-07-08 16:53:38 +03:00
Botond Dénes	0bf07cde7b	test/manual: migrate off the global test reader semaphore	2021-07-08 16:53:38 +03:00
Botond Dénes	18e0c40c5d	test/unit: migrate off the global test reader semaphore	2021-07-08 16:53:38 +03:00
Botond Dénes	37a1e506b1	test/perf: migrate off the global test reader semaphore	2021-07-08 16:53:38 +03:00
Botond Dénes	2454811dd6	test/perf: perf.hh: add reader_concurrency_semaphore_wrapper A convenience, self-closing wrapper for those perf tests that have no way to stop the semaphore and wait for it too.	2021-07-08 16:53:38 +03:00
Nadav Har'El	e22a52e69c	cql-pytest: fix tests on Cassandra 3 After commit `76227fa` ("cql-pytest: use NetworkTopologyStrategy, not SimpleStrategy"), the cql-pytest tests now NetworkTopologyStrategy instead of SimpleStrategy in the test keyspaces. The tests continued to use the "replication_factor" option. The support for this option is a relatively recent, and was only added to Cassandra in the 4.0 release series (see https://issues.apache.org/jira/browse/CASSANDRA-14303). So users who happen to have Cassandra 3 installed and want to run a cql-pytest against it will see the test failing when it can't create a keyspace. This patch trivially fixes the problem by using the name of the current DC (automatically determined) instead of the word 'replication_factor'. Almost all tests are fixed by a single fix to the test_keyspace fixture which creates one keyspace used by most tests. Additional changes were needed in test_keyspace.py, for tests which explicitly create keyspaces. I tested the result on Cassandra 3.11.10, Cassandra 4 (git master) and Scylla. Fixes #8990 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210708123428.811184-1-nyh@scylladb.com>	2021-07-08 15:35:21 +02:00
Nadav Har'El	eb11ce046c	cql-pytest: add reproducer for concurrent DROP KEYSPACE bug We know that today in Scylla concurrent schema changes done on different coordinators are not safe - and we plan to address this problem with Raft. However, the test in this patch - reproducing issue #8968 - demonstrates that even on a single node concurrent schema changes are not safe: The test involves one thread which constantly creates a keyspace and then a table in it - and a second thread which constantly deletes this keyspace. After doing this for a while, the schema reaches an inconsistent state: The keyspace is at a state of limbo where it cannot be dropped (dropping it succeeds, but doesn't actually drop it), and a new keyspace cannot be created under the same name). Note that to reproduce this bug, it was important that the test create both a keyspace and a table. Were the test to just create an empty keyspace, without a table in it, the bug would not be reproduced. Refs #8968. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210704121049.662169-1-nyh@scylladb.com>	2021-07-08 15:35:03 +02:00
Botond Dénes	0e78399051	test/lib: migrate off the global test reader semaphore	2021-07-08 15:28:39 +03:00
Botond Dénes	5fff314739	test/lib/simple_schema: migrate off the global test reader semaphore	2021-07-08 15:28:39 +03:00
Botond Dénes	d520655730	test/lib/sstable_utils: migrate off the global test reader semaphore	2021-07-08 15:28:39 +03:00
Botond Dénes	3679418e62	test/lib/test_services: migrate off the global test reader semaphore	2021-07-08 15:28:39 +03:00
Botond Dénes	0acc4d63da	test/lib/sstable_test_env: add reader_concurrency_semaphore member To enable tests using the test env to conveniently create permits for themselves, reducing the pain of migrating to local semaphores.	2021-07-08 15:28:39 +03:00
Botond Dénes	7174d1beee	test/lib/cql_test_env: add make_reader_permit() A convenience method, allowing tests using the cql test env to conveniently create a permit, reducing the pain of migrating to local semaphores.	2021-07-08 15:28:39 +03:00
Botond Dénes	b739525fb6	test/lib: add reader_concurrency_semaphore.hh Supplying a convenience semaphore wrapper, which stops the contained semaphore when destroyed. It also provides a more convenient `make_permit()`. This class is intended to make the migration to local semaphores less painful.	2021-07-08 15:28:36 +03:00
Benny Halevy	fa5d70da32	storage_proxy: abstract_read_resolver: handle semaphore_timed_out error semaphore_timed_out errors should be ignored, similar to rpc::timeout_error or seastar::timed_out_error, so that they eventually be converted to `read_timeout_exception` via the data/digest read resolver on_timeout() method. Otherwise, the semaphore timeout is mistranslated to read_failure_exception, via on_error(). Note that originally the intention was to change the exception thrown by the reader_concurrency_semaphore expiry_handler, but there are already several places in the code that catch and handle the semaphore_timed_out exception that would need to be changed, increasing the risk in this change. Fixes #8958 Test: unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210708083252.1934651-2-bhalevy@scylladb.com>	2021-07-08 15:23:30 +03:00
Benny Halevy	023d103fee	utils: exceptions: is_timeout_exception: add timed_out_error Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210708083252.1934651-1-bhalevy@scylladb.com>	2021-07-08 15:23:29 +03:00
Nadav Har'El	814c4ad4ce	cql-pytest: fix run-cassandra for older versions of Cassandra In older versions of Cassandra (such as 3.11.10 which I tried), the CQL server is not turned on by default, unless the configuration file explicitly has "start_native_transport: true" - without it only the Thrift server is started. So fix the cql-pytest/run-cassandra to pass this option. It also works correctly in Cassandra 4. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210708113423.804980-1-nyh@scylladb.com>	2021-07-08 14:59:09 +03:00

1 2 3 4 5 ...

27279 Commits