scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 20:27:03 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	7fa4e10aa0	row_cache: Use generic LRU for eviction In preparation for tracking different kinds of objects, not just rows_entry, in the LRU, switch to the LRU implementation form utils/lru.hh which can hold arbitrary element type.	2021-07-02 10:25:58 +02:00
Nadav Har'El	029991bfc2	test/cql-pytest: test that SSL CQL port doesn't accept unencrypted connections Scylla doesn't allow unencrypted connections over encrypted CQL ports (Cassandra does allow this, by setting "optional: true", but it's not secure and not recommended). Here we add a test that in indeed, we can't connect to an SSL port using an unencrypted connection. The test passes on Scylla, and also on Cassandra (run it on Cassandra with "test/cql-pytest/run-cassandra --ssl" - for which we added support in a recent patch). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210629121514.541042-1-nyh@scylladb.com>	2021-06-29 16:42:22 +03:00
Nadav Har'El	dc4c05b2e3	test/cql-pytest: switch some fixture scopes from "session" to "module" Fixtures in conftest.py (e.g., the test_keyspace fixture) can be shared by all tests in all source files, so they are marked with the "session" scope: All the tests in the testing session may share the same instance. This is fine. Some of test files have additional fixtures for creating special tables needed only in those files. Those were also, unnecessarily, marked "session" scope as well. This means that these temporary tables are only deleted at the very end of test suite, event though they can be deleted at the end of the test file which needed them - other test source files don't have access to it anyway. This is exactly what the "module" fixture scope is, so this patch changes all the fixtures that are private to one test file to use the "module" scope. After this patch, the teardown of the last test in the suite goes down from 0.26 seconds to just 0.06 seconds. Another benefit is that the peak disk usage of the test suite is lower, because some of the temporary tables are deleted sooner. This patch does not change any test functionality, and also does not make any test faster - it just changes the order of the fixture teardowns. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #8932	2021-06-29 16:10:47 +03:00
Nadav Har'El	7e4bef96af	test/cql-pytest: support "--ssl" option in run-cassandra This patch adds support for the "--ssl" option in run-cassandra, which will now be able, like run (which runs Scylla), to run Cassandra with listening to a SSL-encrypted CQL connection. The "--ssl" option is also passed to the tests, so they know to encrypt their CQL connections. We already had support for this feature in the test/cql-pytest/run script - which runs Scylla. Adding this also to the run-cassandra script can help verify that a behavior we notice in Scylla's SSL support and we want to add to a test - is also shared by Cassandra. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210629082532.535229-1-nyh@scylladb.com>	2021-06-29 12:05:40 +03:00
Piotr Jastrzebski	430fd5cfa9	sstables: move sstable_writer to separate header This class is used in only few places and does not have to be included everywhere sstable class is needed. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-27 15:12:31 +02:00
Piotr Jastrzebski	2d6608bb88	sstables: stop including metadata_collector.hh in sstables.hh metadata collector is rarely used so it's better to include it only in those few places. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-27 15:12:31 +02:00
Piotr Jastrzebski	314bc0e8a5	sstable_datafile_test: switch tests to use latest sstables format instead of LA. Ability to write LA and KA sstables will be removed by the following patches so we need to switch all the tests to write newer sstables. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-27 15:12:30 +02:00
Piotr Jastrzebski	f03ed9b9a7	sstable_datafile_test: switch compaction_with_fully_expired_table to latest sstable version Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-27 15:12:30 +02:00
Piotr Jastrzebski	1ed298b08b	test_offstrategy_sstable_compaction: test all writable sstables Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-27 15:12:12 +02:00
Piotr Jastrzebski	995eb8c274	compaction_with_fully_expired_table: Remove some LA specific code Following patches will switch all sstable writing tests to use the latest sstables format. compaction_with_fully_expired_table contains some test for a LA specific behaviour so let's remove it to make the switch possible. For more context see https://github.com/scylladb/scylla/issues/2620 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-25 10:12:00 +02:00
Piotr Jastrzebski	8ff37bec17	sstable_mutation_test: test latest sstable format instead of LA Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-25 10:12:00 +02:00
Piotr Jastrzebski	80f8f970e9	sstable_test: Test MX sstables instead of KA/LA Replace calls to make_compressed_file_k_l_format_input_stream with calls to make_compressed_file_m_format_input_stream. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-25 10:12:00 +02:00
Piotr Jastrzebski	131a0babc0	sstable_datafile_test: Fix schema used by check_compacted_sstables check_compacted_sstables is used in compact_02 test which uses sstables created by compact_sstables. The problem is that schema used in check_compacted_sstables and compact_sstables is not the same. The type of r1 column is different. This was not a problem when the test was running on LA sstables but following patches will switch all the tests to use MC and then sstable schema becomes validated when reading the sstable and the test will fail such validation. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-25 10:12:00 +02:00
Piotr Jastrzebski	680e341f54	sstables: Remove LA/KA sstable writting tests that check exact format Those tests check that created sstables have exactly the expected bytes inside. This won't work with other sstable formats and writting LA/KA sstables will be removed by the following patches so there's nothing we can do with those tests but to remove them. Otherwise they will be failing after LA/KA writting capability is removed. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-25 10:12:00 +02:00
Piotr Jastrzebski	2bd6ad1e2f	sstables: define writable_sstable_versions and use it instead of all_sstable_versions in tests that check writting of sstables. Following patches remove LA/KA writer so we want tests to be ready for that and not break by trying to write LA/KA sstables. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-25 10:12:00 +02:00
Piotr Jastrzebski	1bdcef6890	features: assume MC_SSTABLE and UNBOUNDED_RANGE_TOMBSTONES are always enabled These features have been around for over 2 years and every reasonable deployment should have them enabled. The only case when those features could be not enabled is when the user has used enable_sstables_mc_format config flag to disable MC sstable format. This case has been eliminated by removing enable_sstables_mc_format config flag. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2021-06-25 10:12:00 +02:00
Pavel Emelyanov	3552e99ce7	scylla-gdb: Bring scylla netw back to work The netw command tries to access the netw::_the_messaging_service that was removed long ago. The correct place for the messaging service is in debug:: namespace. The scylla-gdb test checks that, but the netw command sees that the ptr in question is not initialized, thinks it's not yet sharded::start()-ed and exits without errors. tests: unit(gdb) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20210624135107.12375-1-xemul@scylladb.com>	2021-06-24 20:59:27 +03:00
Avi Kivity	14252c8b71	Merge 'Commitlog: Handle disk usage and disk footprint discrepancies, ensuring we flush when needed (#8695 ) (v3)' from Calle Wilund Fixes #8270 If we have an allocation pattern where we leave large parts of segments "wasted" (typically because the segment has empty space, but cannot hold the mutation being added), we can have a disk usage that is below threshold, yet still get a disk footprint that is over limit causing new segment allocation to stall. We need to take a few things into account: 1.) Need to include wasted space in the threshold check. Whether or not disk is actually used does not matter here. 2.) If we stall a segment alloc, we should just flush immediately. No point in waiting for the timer task. 3.) Need to adjust the thresholds a bit. Depending on sizes, we should probably consider start flushing once we've used up space enough to be in the last available segment, so a new one is hopefully available by the time we hit the limit. 4.) (v2) Must ensure discard/delete routines are executed. Because we can race with background disk syncs, we may need to issue segment prunes from end_flush() so we wake up actual file deletion/recycling 5.) (v2) Shutdown must ensure discard/delete is run after we've disabled background task etc, otherwise we might fail waking up replenish and get stuck in gate 6.) (v2) Recycling or deleting segments must be consistent, regardless of shutdown. For same reason as above. 7.) (v3) Signal recycle/delete queues/promise on shutdown (with recognized marker) to handle edge case where we only have a single (allocating) segment in the list, and cannot wake up replenisher in any more civilized way. Also fix edge case (for tests), when we have too few segment to have an active one (i.e. need flush everything). New attempt at this, should fix intermittent shutdown deadlocks in commitlog_test. Closes #8764 * github.com:scylladb/scylla: commitlog_test: Add test case for usage/disk size threshold mismatch commitlog_test: Improve test assertion commitlog: Add waitable future for background sync/flush commitlog: abort queues on shutdown commitlog: break out "abort" calls into member functions commitlog: Do explicit discard+delete in shutdown commitlog: Recycle or not should not depend on shutdown state commitlog: Issue discard_unused_segments on segment::flush end IFF deletable commitlog: Flush all segments if we only have one. commitlog: Always force flush if segment allocation is waiting commitlog: Include segment wasted (slack) size in footprint check commitlog: Adjust (lower) usage threshold	2021-06-24 12:03:26 +03:00
Tomasz Grabiec	ee28eb4100	Merge "test: raft: move some tests to `raft` folder" from Pavel Solodovnikov Move `raft_sys_table_storage_test` and `raft_address_map_test` to `test/raft` folder since they naturally belong here, not in `test/boost` folder. Tests: unit(dev) * manmanson/move_some_raft_tests_to_raft_folder: test: raft: move `raft_address_map_test` to `raft` folder test: raft: move `raft_sys_table_storage_test` to `raft` folder configure: add extended raft testing dependencies	2021-06-24 12:03:26 +03:00
Tomasz Grabiec	a60e73fe14	Merge "raft: allow to initiate leader stepdown process explicitly" from Gleb Sometimes an ability to force a leader change is needed. For instance if a node that is currently serving as a leader needs to be brought down for maintenance. If it will be shutdown without leadership transfer the cluster will be unavailable for leader election timeout at least. * scylla-dev/raft-stepdown-v4: raft: test: test leadership transfer timeout raft: allow to initiate leader stepdown process	2021-06-23 00:14:46 +02:00
Pavel Solodovnikov	a96ddbec35	test: raft: move `raft_address_map_test` to `raft` folder Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-06-22 23:33:22 +03:00
Pavel Solodovnikov	cf5025c44e	test: raft: move `raft_sys_table_storage_test` to `raft` folder Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-06-22 23:31:41 +03:00
Avi Kivity	d27e88e785	Merge "compaction: prevent broken_promise or dangling reader errors" from Benny " This series prevents broken_promise or dangling reader errors when (resharding) compaction is stopped, e.g. during shutdown. At the moment compaction just closes the reader unilaterally and this yanks the reader from under the queue_reader_handle feet, causing dangling queue reader and broken_promise errors as seen in #8755. Instead, fix queue_reader::close to set value on the _full/_not_full promises and detach from the handle, and return _consume_fut from bucket_writer::consume if handle is terminated. Fixes #8755 Test: unit(dev) DTest: materialized_views_test.py:TestMaterializedViews.interrupt_build_process_and_resharding_half_to_max_test(debug) " * tag 'propagate-reader-abort-v3' of github.com:bhalevy/scylla: mutation_writer: bucket_writer: consume: propagate _consume_fut if queue_reader_handle is_terminated queue_reader_handle: add get_exception method queue_reader: close: set value on promises on detach from handle	2021-06-22 18:52:11 +03:00
Gleb Natapov	09528b8671	raft: test: test leadership transfer timeout Test that if leadership transfer cannot be done in configured time frame fsm cancels the leadership transfer process. Also check that timeout_now message is resent on each tick while leadership transfer is in progress.	2021-06-22 14:42:50 +03:00
Konstantin Osipov	bd410da77a	raft: (service) rename raft_services service to raft_group_registry This is a more informative name. Helps see that, say, group0 is a separate service and not bundle all raft services together. Message-Id: <20210619211412.3035835-3-kostja@scylladb.com>	2021-06-21 14:53:54 +03:00
Konstantin Osipov	025f18325e	raft: (service) move raft service to namespace service Message-Id: <20210619211412.3035835-2-kostja@scylladb.com>	2021-06-21 14:53:54 +03:00
Calle Wilund	0a7823e683	commitlog_test: Add test case for usage/disk size threshold mismatch Refs #8270 Tries to simulate case where we mismatch segments usage with actual disk footprint and fail to flush enough to allow segment recycling	2021-06-21 06:01:19 +00:00
Calle Wilund	954da1f0a9	commitlog_test: Improve test assertion Changes it so actual data is printed, not just error.	2021-06-21 06:01:19 +00:00
Nadav Har'El	a9b383f423	cql-pytest: improve test for SSL/TLS versions The existing test_ssl.py which tests for Scylla's support of various TLS and SSL versions, used a deprecated and misleading Python API for choosing the protocol version. In particular, the protocol version ssl.PROTOCOL_SSLv23 is not, despite it's name, SSL versions 2 or 3, or SSL at all - it is in fact an alias for the latest TLS version :-( This misunderstanding led us to open the incorrect issue #8837. So in this patch, we avoid the old Python APIs for choosing protocols, which were gradually deprecated, and switch to the new API introduced in Python 3.7 and OpenSSL 1.1.0g - supplying the minimum and maximum desired protocol version. With this new API, we can correctly connect with various versions of the SSL and TLS protocol - between SSLv3 through TLSv1.3. With the fixed test, we confirm that Scylla does not allow SSLv3 - as desired - so issue #8837 is a non-issue. Moreover, after issue #8827 was already fixed, this test now passes, so the "xfail" mark is removed. Refs #8837. Refs #8827. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210617134305.173034-1-nyh@scylladb.com>	2021-06-17 17:06:31 +03:00
Tomasz Grabiec	6d8440fe70	Merge "raft: (testing) leadership transfer tests" from Pavel Solodovnikov The patch set introduces a few leadership transfer tests, some of them are adaptations of corresponding etcd tests (e.g. `test_leader_transfer_ignore_proposal` and `test_transfer_non_member`). Others test different scenarios ensuring that pending leadership transfer doesn't disrupt the rest of the cluster from progressing: Lost `timeout_now` messages` (`test_leader_transfer_lost_timeout_now` and `test_leader_transferee_dies_upon_receiving_timeout_now`) as well as lost `vote_request(force)` from the new candidate (test_leader_transfer_lost_force_vote_request) don't impact the election process following that and the leader is elected as normal. * manmanson/leadership_transfer_tests_v3: raft: etcd_test: test_transfer_non_member raft: etcd_test: test_leader_transfer_ignore_proposal raft: fsm_test: test_leader_transfer_lost_force_vote_request raft: fsm_test: test_leader_transfer_lost_timeout_now raft: fsm_test: test_leader_transferee_dies_upon_receiving_timeout_now	2021-06-17 13:58:31 +02:00
Piotr Sarna	8cca68de75	cql3: add USING TIMEOUT support for deletes Turns out the DELETE statement already supports attributes like timestamp, so it's ridiculously easy to add USING TIMEOUT support - it's just the matter of accepting it in the grammar. Fixes #8855 Closes #8876	2021-06-17 14:21:01 +03:00
Avi Kivity	00ff3c1366	Merge 'treewide: add support for snapshot skip-flush option' from Benny Halevy The option is provided by nodetool snapshot https://docs.scylladb.com/operating-scylla/nodetool-commands/snapshot/ ``` nodetool [(-h <host> \| --host <host>)] [(-p <port> \| --port <port>)] [(-pp \| --print-port)] [(-pw <password> \| --password <password>)] [(-pwf <passwordFilePath> \| --password-file <passwordFilePath>)] [(-u <username> \| --username <username>)] snapshot [(-cf <table> \| --column-family <table> \| --table <table>)] [(-kc <kclist> \| --kc.list <kclist>)] [(-sf \| --skip-flush)] [(-t <tag> \| --tag <tag>)] [--] [<keyspaces...>] -sf / –skip-flush Do not flush memtables before snapshotting (snapshot will not contain unflushed data) ``` But is currently ignored by scylla-jmx (scylladb/scylla-jmx#167) and not supported at the api level. This patch adds support for the option in advance from the api service level down via snapshot_ctl to the table class and snapshot implementation. In addition, a corresponding unit test was added to verify that taking a snapshot with `skip_flush` does not flush the memtable (at the table::snapshot level). Refs #8725 Closes #8726 * github.com:scylladb/scylla: test: database_test: add snapshot_skip_flush_works api: storage_service/snapshots: support skip-flush option snapshot: support skip_flush option table: snapshot: add skip_flush option api: storage_service/snapshots: add sf (skip_flush) option	2021-06-17 13:32:23 +03:00
Nadav Har'El	7fd7e90213	cql-pytest: translate Cassandra's tests for static columns This is a translation of Cassandra's CQL unit test source file validation/entities/StaticColumnsTest.java into our our cql-pytest framework. This test file checks various features of static columns. All these tests pass on Cassandra, and all but one pass on Scylla. The xfailing test, testStaticColumnsWithSecondaryIndex, exposes a query that Cassandra allows but we don't. The new issue about that is: Refs #8869. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210616141633.114325-1-nyh@scylladb.com>	2021-06-17 11:08:28 +02:00
Tomasz Grabiec	6bdf8c4c46	Merge "raft: second series of preparatory patches for group 0 discovery" from Kostja Miscellaneous preparatory patches for group 0 discovery. * scylla-dev/raft-group-0-part-2-v4: raft: (service) servers map is gid -> server, not sid -> server system_keyspace: raft.group_id and raft_snapshots.group_id are TIMEUUID raft: (server) wait for configuration transition to complete raft: (server) implement raft::server::get_configuration() raft: (service) don't throw from schema state machine raft: (service) permit some scylla.raft cells to be empty raft: (service) properly handle failure to add a server raft: implement is_transient_error()	2021-06-17 00:15:40 +02:00
Konstantin Osipov	18e3fcdbf1	raft: (service) servers map is gid -> server, not sid -> server Raft Group registry should map Raft Group Id to Raft Server, not Raft Server ID (which is identical for all groups) to Raft server. Raft Group 0 ID works as a cluster identifier, so is generated when a new cluster is created and is shared by all nodes of the same cluster. Implement a helper to get raft::server by group id. Consistently throw a new raft_group_not_found exception if there is no server or rpc for the specified group id.	2021-06-16 19:05:50 +03:00
Avi Kivity	f05ddf0967	Merge "Improve LSA descriptor encoding" from Pavel " The LSA small objects allocation latency is greatly affected by the way this allocator encodes the object descriptor in front of each allocated slot. Nowadays it's one of VLE variants implemented with the help of a loop. Re-implementing this piece with less instructions and without a loop allows greatly reducing the allocation latency. The speed-up mostly comes from loop-less code that doesn't confuse branch predictor. Also the express encoder seems to benefit from writing 8 bytes of the encoded value in one go, rather than byte- -by-byte. Perf measurements: 1. (new) logallog test shows ~40% smaller times 2. perf_mutation in release mode shows ~2% increase in tps 3. the encoder itself is 2 - 4 times faster on x86_64 and 1.05 - 3 times faster on aarch64. The speed-up depends on the 'encoded length', old encoder has linear time, the new one is constant tests: unit(dev), perf(release), just encoder on Aarch64 " * 'br-lsa-alloc-latency-4' of https://github.com/xemul/scylla: lsa: Use express encoder uleb64: Add express encoding lsa: Extract uleb64 code into header test: LSA allocation perf test	2021-06-16 18:07:13 +03:00
Avi Kivity	0948908502	Merge "mutation_reader: multishard_combining_reader clean-up close path" from Botond " The close path of the multishard combining reader is riddled with workarounds the fact that the flat mutation reader couldn't wait on futures when destroyed. Now that we have a close() method that can do just that, all these workarounds can be removed. Even more workarounds can be found in tests, where resources like the reader concurrency semaphore are created separately for each tested multishard reader and then destroyed after it doesn't need it, so we had to come up with all sorts of creative and ugly workarounds to keep these alive until background cleanup is finished. This series fixes all this. Now, after calling close on the multishard reader, all resources it used, including the life-cycle policy, the semaphores created by it can be safely destroyed. This greatly simplifies the handling of the multishard reader, and makes it much easier to reason about life-cycle dependencies. Tests: unit(dev, release:v2, debug:v2, mutation_reader_test:debug -t test_multishard, multishard_mutation_query_test:debug, multishard_combining_reader_as_mutation_source:debug) " * 'multishard-combining-reader-close-cleanup/v3' of https://github.com/denesb/scylla: mutation_reader: reader_lifecycle_policy: remove convenience methods mutation_reader: multishard_combining_reader: store shard_reader via unique ptr test/lib/reader_lifecycle_policy: destroy_reader: cleanup context test/lib/reader_lifecycle_policy: get rid of lifecycle workarounds test/lib/reader_lifecycle_policy: destroy_reader(): stop the semaphore test/lib/reader_lifecycle_policy: use a more robust eviction mechanism reader_concurrency_semaphore: wait for all permits to be destroyed in stop() test/lib/reader_lifcecycle_policy: fix indentation mutation_reader: reader_lifecycle_policy::destroy_reader(): require to be called on native shard reader_lifecycle_policy implementations: fix indentation mutation_reader: reader_lifecycle_policy::destroy_reader(): de-futurize reader parameter mutation_reader: shard_reader::close(): wait on the remote reader multishard_mutation_query: destroy remote parts in the foreground mutation_reader: shard_reader::close(): close _reader mutation_reader: reader_lifcecycle_policy::destroy_reader(): remove out-of-date comment	2021-06-16 17:25:50 +03:00
Benny Halevy	4830b6647c	queue_reader: close: set value on promises on detach from handle To prevent broken_promise exception. Since close() is manadatory the queue_reader destructor, that just detaches the reader from the handle, is not needed anymore, so remove it. Adjust the test_queue_reader unit test accordingly. Test: test_queue_reader(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-06-16 17:25:14 +03:00
Konstantin Osipov	9c93d77e74	system_keyspace: raft.group_id and raft_snapshots.group_id are TIMEUUID Fix a bug in definitions of system.raft, system.raft_snapshots, group_id is TIMEUUID, not long.	2021-06-16 16:52:43 +03:00
Pavel Emelyanov	1e67361267	test: LSA allocation perf test The test measures the time it takes to allocate a bunch of small objects on LSA inside single segment. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-06-16 13:40:44 +03:00
Botond Dénes	b4e69cf63d	test/lib/test_utils: require(): also log failed conditions Currently `require()` throws an exception when the condition fails. The problem with this is that the error is only printed at the end of the test, with no trace in the logs on where exactly it happened, compared to other logged events. This patchs also adds an error-level log line to address this. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20210616065711.46224-1-bdenes@scylladb.com>	2021-06-16 12:05:25 +03:00
Botond Dénes	a69db31b5c	test/lib/reader_lifecycle_policy: destroy_reader: cleanup context Now that we don't rely on any external machinery to keep the relevant parts of the context alive until needed as its life-cycle is effectively enclosed in that of the life-cycle policy itself, we can cleanup the context in `destroy_reader()` itself, avoiding a background trip back to this shard.	2021-06-16 11:29:36 +03:00
Botond Dénes	d2ddaced4e	test/lib/reader_lifecycle_policy: get rid of lifecycle workarounds The lifecycle of the reader lifecycle policy and all the resources the reads use is now enclosed in that of the multishard reader thanks to its close() method. We can now remove all the workarounds we had in place to keep different resources as long as background reader cleanup finishes.	2021-06-16 11:29:36 +03:00
Botond Dénes	5a271e42a5	test/lib/reader_lifecycle_policy: destroy_reader(): stop the semaphore So that when this method returns the semaphore is safe to destroy. This in turn will enable us to get rid of all the machinery we have in place to deal with the semaphore having to out-live the lifecycle policy without a clear time as to when it can be safe to destroy.	2021-06-16 11:29:36 +03:00
Botond Dénes	c09c62a0fb	test/lib/reader_lifecycle_policy: use a more robust eviction mechanism The test reader lifecycle policy has a mode in which it wants to ensure all inactive readers are evicted, so tests can stress reader recreation logic. For this it currently employs a trick of creating a waiter on the semaphore. I don't even know how this even works (or if it even does) but it sure complicates the lifecycle policy code a lot. So switch to the much more reliable and simple method of creating the semaphore with a single count and no memory. This ensures that all inactive reads are immediately evicted, while still allows a single read to be admitted at all times.	2021-06-16 11:29:36 +03:00
Botond Dénes	578a092e4a	reader_concurrency_semaphore: wait for all permits to be destroyed in stop() To prevent use-after-free resulting from any permit out-living the semaphore.	2021-06-16 11:29:36 +03:00
Botond Dénes	a10a6e253e	test/lib/reader_lifcecycle_policy: fix indentation Left broken from the previous patch.	2021-06-16 11:29:36 +03:00
Botond Dénes	8c7447effd	mutation_reader: reader_lifecycle_policy::destroy_reader(): require to be called on native shard Currently shard_reader::close() (its caller) goes to the remote shard, copies back all fragments left there to the local shard, then calls `destroy_reader()`, which in the case of the multishard mutation query copies it all back to the native shard. This was required before because `shard_reader::stop()` (`close()`'s) predecessor) couldn't wait on `smp::submit_to()`. But close can, so we can get rid of all this back-and-forth and just call `destroy_reader()` on the shard the reader lives on, just like we do with `create_reader()`.	2021-06-16 11:29:35 +03:00
Botond Dénes	4ecf061c90	reader_lifecycle_policy implementations: fix indentation Left broken from the previous patch.	2021-06-16 11:21:38 +03:00
Botond Dénes	a7e59d3e2c	mutation_reader: reader_lifecycle_policy::destroy_reader(): de-futurize reader parameter The shard reader is now able to wait on the stopped reader and pass the already stopped reader to `destroy_reader()`, so we can de-futurize the reader parameter of said method. The shard reader was already patched to pass a ready future so adjusting the call-site is trivial. The most prominent implementation, the multishard mutation query, can now also drop its `_dismantling_gate` which was put in place so it can wait on the background stopping if readers. A consequence of this move is that handling errors that might happen during the stopping of the reader is now handled in the shard reader, not all lifecycle policy implementations.	2021-06-16 11:21:38 +03:00

1 2 3 4 5 ...

1888 Commits