scylladb

Author	SHA1	Message	Date
Botond Dénes	aa18bb33b9	tests: add unit test for multishard reader correctly handling non-strictly monotonous positions	2019-04-29 10:24:14 +03:00
Benny Halevy	c8f239ff2b	tests: introduce sstables::test_env In preparation to adding sstables_manager we want to establish an environment for testing sstables. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-02-14 22:37:41 +02:00
Botond Dénes	9000626647	shard_reader: auto pause readers after being used Previously it was the responsibility of the layer above (multishard combining reader) to pause readers, which happened via an explicit `pause()` call. This proved to be a very bad design as we kept finding spots where the multishard reader should have paused the reader to avoid potential deadlocks (due to starved reader concurrency semaphores), but didn't. This commit moves the responsibility of pausing the reader into the shard reader. The reader is now kept in a paused state, except when it is actually used (a `fill_buffer()` or `fast_forward_to()` call is executing). This is fully transparent to the layer above. As a side note, the shard reader now also hides when the reader is created. This also used to be the responsibility of the multishard reader, and although it caused no problems so far, it can be considered a leak of internal details. The shard reader now automatically creates the remote reader on the first time it is attempted to be used. The code has been reorganized, such that there is now a clear separation of responsibilities. The multishard combining reader handles the combining of the output of the shard readers, as well as issuing read-aheads. The shard reader handles read-ahead and creating the remote reader when needed, as well as transferring the results of remote reads to the "home" shard. The remote reader (`shard_reader::remote_reader`, new in this patch) handles pausing-resuming as well as recreating the reader after it was evicted. Layers don't access each other's internals (like they used to). After this commit, the reader passed to `destroy_reader()` will always be in paused state.	2019-02-12 16:20:51 +02:00
Botond Dénes	37006135dc	shard_reader: make reader creation sync Reader creation happens through the `reader_lifecycle_policy` interface, which offers a `create_reader()` method. This method accepts a shard parameter (among others) and returns a future. Its implementation is expected to go to the specified shard and then return with the created reader. The method is expected to be called from the shard where the shard reader (and consequently the multishard reader) lives. This API, while reasonable enough, has a serious flaw. It doesn't make batching possible. For example, if the shard reader issues a call to the remote shard to fill the remote reader's buffer, but finds that it was evicted while paused, it has to come back to the local shard just to issue the recreate call. This makes the code both convoluted and slow. Change the reader creation API to be synchronous, that is, callable from the shard where the reader has to be created, allowing for simple call sites and batching. This change requires that implementations of the lifecycle policy update any per-reader data-structure they have from the remote shard. This is not a problem however, as these data-structures are usually partitioned, such that they can be accessed safely from a remote shard. Another, very pleasant, consequence of this change is that now all methods of the lifecycle interface are sync and thus calls to them cannot overlap anymore. This patch also removes the `test_multishard_combining_reader_destroyed_with_pending_create_reader` unit test, which is not useful anymore. For now just emulate the old interface inside shard reader. We will overhaul the shard reader after some further changes to minimize noise.	2019-02-12 16:20:51 +02:00
Botond Dénes	57d1f6589c	shard_reader: use semaphore directly to pause-resume The shard reader relies on the `reader_lifecycle_policy` for pausing and resuming the remote reader. The lifecycle policy's API was designed to be as general as possible, allowing for any implementation of pause/resume. However, in practice, we have a single implementation of pause/resume: registering/unregistering the reader with the relevant `reader_concurrency_semaphore`, and we don't expect any new implementations to appear in the future. Thus, the generic API of the lifecycle policy, is needlessly abstract making its implementations needlessly complex. We can instead make this very concrete and have the lifecycle policy just return the relevant semaphore, removing the need for every implementor of the lifecycle policy interface to have a duplicate implementation of the very same logic. For now just emulate the old interface inside shard reader. We will overhaul the shard reader after some further changes to minimize noise.	2019-02-12 16:20:51 +02:00
Paweł Dziepak	64b1a2caf9	tests: modernise tmpdir tmpdir is a helper class representing a temporary directory. Unfortunately, it suffers for some problems such as lack of proper encapsulation and weak typing. This has caused bugs in the past when the user code accidentally modified the member variable with the path to the directory. This patch modernises tmpdir and updates its users. The path is stored in a std::filesystem::path and available read-only to the class users. mkdtemp and boost are replaced by standard solution. The users are update to use path more (when it didn't involve too many changes to their code) and stop using lw_shared_ptr to store the tmpdir when it wasn't necessary. tmpdir intentionally doesn't provide any helpers for getting the path as a string in order to discourage weak types. Message-Id: <20190207145727.491-1-pdziepak@scylladb.com>	2019-02-07 20:18:14 +02:00
Jesse Haber-Kucharsky	b39eac653d	Switch to the the CMake-ified Seastar Committer: Avi Kivity <avi@scylladb.com> Branch: next Switch to the the CMake-ified Seastar This change allows Scylla to be compiled against the `master` branch of Seastar. The necessary changes: - Add `-Wno-error` to prevent a Seastar warning from terminating the build - The new Seastar build system generates the pkg-config files (for example, `seastar.pc`) at configure time, so we don't need to invoke Ninja to generate them - The `-march` argument is no longer inherited from Seastar (correctly), so it needs to be provided independently - Define `SEASTAR_TESTING_MAIN` so that the definition of an entry point is included for all unit test compilation units - Independently link Scylla against Seastar's compiled copy of fmt in its build directory - All test files use the (now public) Seastar testing headers - Add some missing Seastar headers to source files [avi: regenerate frozen toolchain, adjust seastar submoule] Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <02141f2e1ecff5cbcd56b32768356c3bf62750c4.1548820547.git.jhaberku@scylladb.com>	2019-01-30 11:17:38 +02:00
Paweł Dziepak	5db8dacd1f	tests/mutation_reader: reduce sleeping time It is a very bad taste to sleep anywhere in the code. The test should be fixed to explicitly test various orderings between concurrent operations, but before that happens let's at least readuce how much those sleeps slow it down by changing it from milliseconds to microseconds.	2018-12-20 13:27:25 +00:00
Paweł Dziepak	bcb5aed1ef	Revert "mutation_source_test: add option to skip intra-partition fast-forwarding tests" This reverts commit `b36733971b`. That commit made run_mutation_reader_tests() support mutation_sources that do not implement streamed_mutation::forwarding::yes. This is wrong since mutation_sources are not allowed to ignore or otherwise not support that mode. Moreover, there is absolutely no reason for them to do so since there is a make_forwardable() adapter that can make any mutation_reader a forwardable one (at the cost of performance, but that's not always important).	2018-12-20 13:27:25 +00:00
Paweł Dziepak	8706750b9b	tests/mutation_readers: do not ignore streamed_mutation::forwarding It is wrong to silently ignore streamed_mutation::forwarding option which completely changes how the reader is supposed to operate. The best solution is to use make_forwardable() adapter which changes non-forwardable reader to a forwardable one.	2018-12-20 13:27:25 +00:00
Botond Dénes	dcd2d116a3	tests/mutation_reader_test: add test_multishard_combining_reader_next_partition Test the interaction of the multishard reader with the foreign reader w.r.t next_partition(). next_partition() is a special operation, as it its execution is deferred until the next cross-shard operations. Give it some extra stress-testing.	2018-12-04 08:51:05 +02:00
Botond Dénes	20e994e526	tests/mutation_reader_test: restore indentation Left over from the previous patch.	2018-12-04 08:51:05 +02:00
Botond Dénes	a577ff97e9	tests/mutation_reader_test: enrich pause-related multishard reader test Enrich the existing test_multishard_combining_reader_as_mutation_source test case with delaying the pause/resume and eviction of paused readers.	2018-12-04 08:51:05 +02:00
Botond Dénes	a12fae366d	tests/mutation_reader_test: implement the pause-resume API	2018-12-04 08:51:05 +02:00
Botond Dénes	5f67a065c6	reader_lifecycle_policy: extend with a pause-resume API This API provides a way for the mulishard reader to pause inactive shard readers and later resume them when they are needed again. This allows for these paused shard readers to be evicted when the node is under pressure. How the readers are made evictable while paused is up to the clients. Using this API in the `multishard_combining_reader` and implementing it in the clients will be done in the next patches. Provide default implementation for the new virtual methods to facilitate gradual adoption.	2018-12-04 08:51:05 +02:00
Botond Dénes	007619de4c	multishard_combining_reader: use the reader lifecycle policy Refactor the multishard combining reader and its clients to use the reader lifecycle policy introduced in the previous patch.	2018-12-04 08:51:05 +02:00
Botond Dénes	5a4fd1abab	multishard_combining_reader: drop support for streamed_mutation fast-forwarding It doesn't make sense for the multishard reader anyway, as it's only used by the row-cache. We are about to introduce the pausing of inactive shard readers, and it would require complex data structures and code to maintain support for this feature that is not even used. So drop it.	2018-12-04 08:51:05 +02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	f70ece9f88	tests: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Paweł Dziepak	637b9a7b3b	atomic_cell_or_collection: make operator<< show cell content After the new in-memory representation of cells was introduced there was a regression in atomic_cell_or_collection::operator<< which stopped printing the content of the cell. This makes debugging more incovenient are time-consuming. This patch fixes the problem. Schema is propagated to the atomic_cell_or_collection printer and the full content of the cell is printed. Fixes #3571. Message-Id: <20181024095413.10736-1-pdziepak@scylladb.com>	2018-10-24 13:29:51 +03:00
George Kollias	c2343dc841	Make restricting reader fill_buffer more efficient Currently, restricting_mutation_reader::fill_buffer justs reads lower-layer reader's fragments one by one without doing any further transformations. This change just swaps the parent-child buffers in a single step, as suggested in #3604, and, hence, removing any possible per-fragment overhead. I couldn't find any test that exercises restricting_mutation_reader as a mutation source, so I added test_restricted_reader_as_mutation_source in mutation_reader_test. Tests: unit (release), though these 4 tests are failing regardless of my changes (they fail on master for me as well): snitch_reset_test, sstable_mutation_test, sstable_test, sstable_3_x_test. Fixes: #3604 Signed-off-by: George Kollias <georgioskollias@gmail.com> Message-Id: <1540052861-621-1-git-send-email-georgioskollias@gmail.com>	2018-10-22 11:36:54 +03:00
Botond Dénes	23f3831aaf	table::make_streaming_reader(): add forwarding parameter The single-range overload, when used by make_multishard_streaming_reader(), has to create a reader that is forwardable. Otherwise the multishard streaming reader will not produce any output as it cannot fast-forward its shard readers to the ranges produced by the generator. Also add a unit test, that is based on the real-life purpose the multishard streaming reader was designed for - serving partition from a shard, according to a sharding configuration that is different than the local one. This is also the scenario that found the buf in the first place. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <bf799961bfd535882ede6a54cd6c4b6f92e4e1c1.1539235034.git.bdenes@scylladb.com>	2018-10-11 10:59:18 +03:00
Botond Dénes	eb357a385d	flat_mutation_reader: make timeout opt-out rather than opt-in Currently timeout is opt-in, that is, all methods that even have it default it to `db::no_timeout`. This means that ensuring timeout is used where it should be is completely up to the author and the reviewrs of the code. As humans are notoriously prone to mistakes this has resulted in a very inconsistent usage of timeout, many clients of `flat_mutation_reader` passing the timeout only to some members and only on certain call sites. This is small wonder considering that some core operations like `operator()()` only recently received a timeout parameter and others like `peek()` didn't even have one until this patch. Both of these methods call `fill_buffer()` which potentially talks to the lower layers and is supposed to propagate the timeout. All this makes the `flat_mutation_reader`'s timeout effectively useless. To make order in this chaos make the timeout parameter a mandatory one on all `flat_mutation_reader` methods that need it. This ensures that humans now get a reminder from the compiler when they forget to pass the timeout. Clients can still opt-out from passing a timeout by passing `db::no_timeout` (the previous default value) but this will be now explicit and developers should think before typing it. There were suprisingly few core call sites to fix up. Where a timeout was available nearby I propagated it to be able to pass it to the reader, where I couldn't I passed `db::no_timeout`. Authors of the latter kind of code (view, streaming and repair are some of the notable examples) should maybe consider propagating down a timeout if needed. In the test code (the wast majority of the changes) I just used `db::no_timeout` everywhere. Tests: unit(release, debug) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <1edc10802d5eb23de8af28c9f48b8d3be0f1a468.1536744563.git.bdenes@scylladb.com>	2018-09-20 11:31:24 +02:00
Botond Dénes	a84c26799d	tests/mutation_reader_test: fix flaky restricted reader timeout test The test in question is `restricted_reader_timeout`. Use `eventually_true()` instead of `sleep()` to wait on the timeout expiring, making the test more robust on overloaded machines. Also fix graceful failing, another longstanding issue with this test. The readers created for the test need different destruction logic depending whether the test failed or succeeded. Previously this was dealt with by using the logic that worked in case of success and using asserts to abort when the test failed, thus avoiding developers investigating the invalid memory accesses happening due to the wrong destruction logic. The solution is to use BOOST_CHECK() macro in the check that validates whether timeout works as expected. This allows for execution to continue even if the test failed, and thus allows for running the proper cleanup code even when the test failed. Fixes: #3719 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <911921dffc924f1b0a3e86408757467e9be2b65b.1537169933.git.bdenes@scylladb.com>	2018-09-17 09:40:45 +01:00
Botond Dénes	6779b63dfe	tests: add unit test for multishard_mutation_query()	2018-09-03 10:31:44 +03:00
Botond Dénes	f13b878a94	mutation_reader: pass all standard reader params to `remote_reader_factory` Extend `remote_reader_factory` interface so that it accepts all standard mutation reader creation parameters. This allows factory lambdas to be truly stateless, not having to capture any standard parameters that is needed for creating the reader. Standard parameters are those accepted by `mutation_source::make_reader()`.	2018-09-03 10:31:44 +03:00
Duarte Nunes	b89fa0d67b	tests/mutation_reader_test: Extract eventually_true() to eventually.hh Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-08-27 19:24:05 +01:00
Botond Dénes	b32f94d31e	tests/mutation_reader_test: combined_mutation_reader_test: use SEASTAR_THREAD_TEST_CASE	2018-07-04 17:42:37 +03:00
Botond Dénes	77ad085393	tests/mutation_reader_test: refactor combined_mutation_reader_test Make combined_mutation_reader_test more interesting: * Set the levels on the sstables * Arrange the sstables so that they test for the "jump over sstables" bug. * Arrange the sstables so that they test for the "gap between sstables". While at it also make the code more compact.	2018-07-04 17:42:37 +03:00
Botond Dénes	4b57fc9aea	tests/mutation_reader_test: fix reader_selector related tests Don't assume the partition keys use lexical ordering. Add some additional checks.	2018-07-04 17:42:37 +03:00
Botond Dénes	81a03db955	mutation_reader: reader_selector: use ring_position instead of token sstable_set::incremental selector was migrated to ring position, follow suit and migrate the reader_selector to use ring_position as well. Above correctness this also improves efficiency in case of dense tables, avoiding prematurely selecting sstables that share the token but start at different keys, altough one could argue that this is a niche case.	2018-07-04 17:42:37 +03:00
Botond Dénes	a8e795a16e	sstables_set::incremental_selector: use ring_position instead of token Currently `sstable_set::incremental_selector` works in terms of tokens. Sstables can be selected with tokens and internally the token-space is partitioned (in `partitioned_sstable_set`, used for LCS) with tokens as well. This is problematic for severeal reasons. The sub-range sstables cover from the token-space is defined in terms of decorated keys. It is even possible that multiple sstables cover multiple non-overlapping sub-ranges of a single token. The current system is unable to model this and will at best result in selecting unnecessary sstables. The usage of token for providing the next position where the intersecting sstables change [1] causes further problems. Attempting to walk over the token-space by repeatedly calling `select()` with the `next_position` returned from the previous call will quite possibly lead to an infinite loop as a token cannot express inclusiveness/exclusiveness and thus the incremental selector will not be able to make progress when the upper and lower bounds of two neighbouring intervals share the same token with different inclusiveness e.g. [t1, t2](t2, t3]. To solve these problems update incremental_selector to work in terms of ring position. This makes it possible to partition the token-space amoing sstables at decorated key granularity. It also makes it possible for select() to return a next_position that is guaranteed to make progress. partitioned_sstable_set now builds the internal interval map using the decorated key of the sstables, not just the tokens. incremental_selector::select() now uses `dht::ring_position_view` as both the selector and the next_position. ring_position_view can express positions between keys so it can also include information about inclusiveness/exclusiveness of the next interval guaranteeing forward progress. [1] `sstable_set::incremental_selector::selection::next_position`	2018-07-04 17:42:33 +03:00
Botond Dénes	5fd9c3b9d4	tests/mutation_reader_test: require min shard-count for multishard tests Tests testing different aspects of `foreign_reader` and `multishard_combining_reader` are designed to run with a certain minimum shard count. Running them with any shard count below this minimum makes them useless at best but can even fail them. Refuse to run these tests when the shard count is below the required minimum to avoid an accidental and unnecessary investigation into a false-positive test failure. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <d24159415b6a9d74eafb8355b6e3fba98c1ff7ff.1530274392.git.bdenes@scylladb.com>	2018-07-01 12:44:41 +03:00
Paweł Dziepak	fde9e1d55f	tests/mutation_reader: disambiguate freeze() overload freeze() is about to get overloaded so make sure we don't get any ambiguities.	2018-05-25 10:15:10 +01:00
Paweł Dziepak	7c5c77369a	tests/mutation_reader: do not apply mutations created on another shard Scylla uses shared-nothing architecture and communication between the shards is supposed to be very restricted. Applying to a memtable mutations created on another shard is way to complex operation to be allowed. Using frozen mutations is a much safer option.	2018-05-09 16:52:26 +01:00
Botond Dénes	777f3c7dc2	mutation_reader_test: don't lock up with smp=1 test_foreign_reader_destroyed_with_pending_read_ahead lock up completely when run with SMP=1. As a solution skip the test-case when SMP < 2. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <815585c40a65a66f3b03e6393b46fbd6849c8ef5.1525866777.git.bdenes@scylladb.com>	2018-05-09 15:10:18 +03:00
Botond Dénes	5d5bc0e1ab	mutation_reader_test: fix multishard-reader test with smp > 3 test_multishard_combining_reader_destroyed_with_pending_create_reader was failing because it relied on smp == 3 and thus the shard on which the reader creation is blocked being shard-2. Since the test requires to be run with smp >= 3 we can hardcode this shard to be 2 because if the test runs at all we are guaranteed to have at least smp >= 3. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <38883a1f4c18ca0cd065aa13826a4f1858353289.1525328233.git.bdenes@scylladb.com>	2018-05-03 10:30:21 +03:00
Botond Dénes	efa08f623a	mutation_reader_test: add description to multishard-tests These tests are quite complicated and require intimate knowledge of how foreign_reader and multishard_combining_reader operates. Knowing these two objects is still required to understand the tests but make it that much easier by explaining how they were designed to test what they test. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <8de580131a8652924de920c2bc68a98e579398ee.1525328226.git.bdenes@scylladb.com>	2018-05-03 10:30:20 +03:00
Paweł Dziepak	bfc017daa8	tests/mutation_reader: do not capture on-stack variable by reference 'shard' is a short-lived on-stack variable that gets captured by reference by continuation that gets executed on another shard. Fixes a race condition that leads to an heap-use-after-free. Message-Id: <20180502150507.2776-1-pdziepak@scylladb.com>	2018-05-02 18:07:37 +03:00
Botond Dénes	d80e586ccb	mutation_reader_test: remove leftover comments Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <580dcf664fc4fc84f3a29137fba5c982f57d7601.1525269726.git.bdenes@scylladb.com>	2018-05-02 17:03:50 +03:00
Botond Dénes	e14b0ca13e	mutation_reader_test: fix possible use-after-free The test_foreign_reader_destroyed_with_pending_read_ahead test currently doesn't ensure that the objects in it's scope are destroyed in the correct order. This is necessary as there are severeal foreign pointers to objects that live on remote shards and use each other. Since foreign pointers destory their managed object in the background we cannot rely on the to reliably destroy objects in order, nor can we be sure when the object they manage is actually destroy. So to work around that ensure that the puppet_reader is destroyed before the remote_control it references even has a chance of being destroyed. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <232eaa899878b03fb2a765c2916e4f05841472a3.1525269726.git.bdenes@scylladb.com>	2018-05-02 17:03:49 +03:00
Botond Dénes	79684eff8e	mutation_reader_test: add read-ahead related multishard reader tests Add tests for foreign_reader and multishard_combining_reader that check that readers destroyed while there is pending read-head will not result in use-after-free. Specifically check that: * multishard_combining_reader destroyed with pending reader creation * foreign_reader destroyed with pending read-ahead * multishard_combining_reader destroyed with pending read-ahead does not result in use-after-free or SEGFAULT. These tests try to do their best to check for correct behaviour with various BOOST_REQUIRE* checks but they still heavily rely on ASAN to detect any use-after-free, SEGFAULT or similar errors.	2018-04-30 17:17:45 +03:00
Botond Dénes	cb25afa8bf	tests/mutation_reader_test: change recommented smp to 3 Of the test_multishard_combining_reader_reading_empty_table test. Running this test with smp=3 instead of smp=2 helps detecting additional read-ahead related memory problems.	2018-04-30 17:17:45 +03:00
Botond Dénes	78266f11c4	mutation_reader_test: fix name of existing multishard reader tests s/multishard_combined_reader/multishard_combining_reader/	2018-04-30 17:17:44 +03:00
Botond Dénes	ff3982a817	Add unit tests for multishard_combined_reader	2018-04-11 10:03:50 +03:00
Botond Dénes	de4a3c8bdb	Add unit tests for foreign_reader	2018-04-11 09:22:49 +03:00
Botond Dénes	341ddd096a	Modify unit tests so that they test the dual-limits	2018-03-08 14:12:12 +02:00
Botond Dénes	1259031af3	Use the reader_concurrency_semaphore to limit reader concurrency	2018-03-08 14:12:12 +02:00
Avi Kivity	1dae29b48d	test: mutation_reader_test: fix no-timeout case in reader_wrapper reader_wrapper's _timeout defaults to now(), which means to time out immediately rather than no timeout. Fix by switching to a time_point, defaulting to no_timeout, and provide a compatible constructor (with a duration parameter) for callers that do want a duration-based timeout. Tests: mutation_reader_test (debug, release) Message-Id: <20180305111739.31972-1-avi@scylladb.com>	2018-03-05 12:40:07 +01:00
Paweł Dziepak	ea50806172	tests/mutation_reader: avoid static local lw_shared_ptr Shared pointer don't like being shared across shards. Fixes assertion failure in build/debug/tests/mutation_reader_test. Message-Id: <20180201125017.30259-1-pdziepak@scylladb.com>	2018-02-01 13:53:55 +01:00

1 2 3

112 Commits