scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-29 19:21:01 +00:00

Author	SHA1	Message	Date
Benny Halevy	f5ef687acd	perf: perf_mutation_readers: add one_mutation case Measure performance of the single-mutation reader: make_flat_mutation_reader_from_mutation_v2. Comparable to the `one_row` case that consumes the single mutation using the multi-mutatio reader: make_flat_mutation_reader_from_mutations_v2 perf_mutation_readers shows ~20-30% improvement of make_flat_mutation_reader_from_mutation_v2 the same single mutation, just given as a single-item vector to make_flat_mutation_reader_from_mutations_v2. test iterations median mad min max Before: combined.one_row 720118 825.668ns 1.020ns 824.648ns 827.750ns After: combined.one_mutation 881482 751.157ns 0.397ns 750.211ns 751.912ns combined.one_row 843270 756.553ns 0.303ns 755.889ns 757.911ns Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-14 11:39:05 +03:00
Benny Halevy	a4b69fe7b6	test: mutation_query_test: make make_source static No need for it to be public. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-14 11:15:19 +03:00
Benny Halevy	ddb5166b82	mutation readers: refactor make_flat_mutation_reader_from_mutation*_v2 Extract the common parts of the single mutation reader and the vector-based variant into mutation_reader_base and reuse from both readers. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-14 11:15:17 +03:00
Benny Halevy	e85241d5b6	mutation readers: add make_flat_mutation_reader_from_mutation_v2 Optimize reading from a single partition. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-14 11:14:43 +03:00
Benny Halevy	394eb1271d	readers: delete slice_mutation.hh slice_mutations() is currently used only by readers/mutation_readers.cc so there's no need to expose it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-14 08:41:31 +03:00
Benny Halevy	ee2c7948f3	test: flat_mutation_reader_test: mock_consumer: add debug logging Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-14 08:41:31 +03:00
Benny Halevy	38cdfca824	test: flat_mutation_reader_test: mock_consumer: make depth counter signed We want to return stop_iteration::yes once we crossed the initial depth threshold, with an unsigned depth counter, it might wraparound and look > 1. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-14 08:41:31 +03:00
Tomasz Grabiec	d293bd9579	Merge "Enable forwarding in raft randomized_nemesis_test" from Kamil The test will now, with probability 1/2, enable forwarding of entries by followers to leaders. This is possible thanks to the new abort_source& APIs which we use to ensure that no operations are running on servers before we destroy them. Some adjustments were required to the server abort procedure in order to prevent rare hangs (see first patch). We also translate some low-level exceptions coming from seastar primitives to high-level Raft API exceptions (second patch). * kbr/nemesis-enable-fd-v1: test: raft: randomized_nemesis_test: enable entry forwarding test: raft: randomized_nemesis_test: increase logging level on some rare operations raft: server: translate abort_requested_exception to raft::request_aborted raft: fsm: when stopping, become follower to reject new requests	2022-04-13 18:40:23 +02:00
Piotr Sarna	61057446f7	Merge 'forward_service: retry failed forwarder call' from Michał Sala This pull request adds support for retrying failed forwarder calls (currently used to parallelize `select count() from ...` queries). Failed-to-forward sub-queries will be executed locally (on a super-coordinator). This local execution is meant as a fallback for a forward_requests that could not be sent to its destined coordinator (e.g. due gossiper not reacting fast enough). Local execution was chosen as the safest one - it does not require sending data to another coordinator. Due to problems with misscompilations, some parts of the `forward_service` were uncoroutinized. Fixes: #10131 Closes #10329 github.com:scylladb/scylla: forward_service: uncoroutinize dispatch method forward_service: uncoroutinize retrying_dispatcher forward_service: rety a failed forwarder call forward_service: copy arguments/captured vars to local variables	2022-04-13 09:41:35 +02:00
Nadav Har'El	6cafffe281	test/cql-pytest: reproduce internal server error on null subscript The restriction "WHERE m[NULL] = 2" should result in an invalid request error, but currently results in an ugly internal server error. This test reproduces it, and since the bug is still in the code - is marked as xfail. Refs #10361 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220412134118.829671-1-nyh@scylladb.com>	2022-04-13 08:49:48 +03:00
Nadav Har'El	ae0e1574dc	test/cql-pytest: reproducer for CONTAINS NULL bug This is a reproducer for issue #10359 that a "CONTAINS NULL" and "CONTAINS KEY NULL" restrictions should not match any set, but currently do match non-empty or all sets. The tests currently fail on Scylla, so marked xfail. They also fails on Cassandra because Cassandra considers such a request an error, which we consider a mistake (see #4776) - so the tests are marked "cassandra_bug". Refs #10359. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220412130914.823646-1-nyh@scylladb.com>	2022-04-13 08:49:23 +03:00
Nadav Har'El	5d87ead9f1	test/cql-pytest: add more tests comparing against NULL We already have a test showing that WHERE v=NULL ALLOW FILTERING is allowed in Scylla (unlike Cassandra), and matches nothing. Here we add two further tests that confirm that: 1. Not only is v=NULL allowed - v<NULL, v<=NULL, and so on, is also allowed and matches nothing. 2. The ALLOW FILTERING is required in in those requests. Without it, both Scylla and Cassandra generate the same "ALLOW FILTERING is required" error. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220411214503.770413-1-nyh@scylladb.com>	2022-04-13 08:48:55 +03:00
Avi Kivity	987e6533d2	transport: return correct error codes when downgrading v4 {WRITE,READ}_FAILURE to {WRITE,READ}_TIMEOUT Protocol v4 added WRITE_FAILURE and READ_FAILURE. When running under v3 we downgrade these exceptions to WRITE_TIMEOUT and READ_TIMEOUT (since the client won't understand the v4 errors), but we still send the new error codes. This causes the client to become confused. Fix by updating the error codes. A better fix is to move the error code from the constructor parameter list and hard-code it in the constructor, but that is left for a follow-up after this minimal fix. Fixes #5610. Closes #10362	2022-04-12 19:19:52 +03:00
Avi Kivity	8aec146dec	Merge "Remove qctx from repair" from Pavel E " Repair code keeps its history in system keyspace and uses the qctx global thing to update and query it. This set replaces the qctx with the explicit reference on the system_keyspace object. tests: unit(dev), dtest.repair_test(dev) " * 'br-repair-vs-qctx' of https://github.com/xemul/scylla: repair, system_keyspace: Query repair_history with a helper repair: Update loader code to use system_keyspace entry repair, system_keyspace: Update repair_history with a helper repair: Keep system keyspace reference	2022-04-12 17:08:41 +03:00
Tomasz Grabiec	0c365818c3	utils/chunked_managed_vector: Fix sigsegv during reserve() Fixes the case of make_room() invoked with last_chunk_capacity_deficit but _size not in the last reserved chunk. Found during code review, no user impact. Fixes #10364. Message-Id: <20220411224741.644113-1-tgrabiec@scylladb.com>	2022-04-12 16:37:11 +03:00
Tomasz Grabiec	01eeb33c6e	utils/chunked_vector: Fix sigsegv during reserve() Fixes the case of make_room() invoked with last_chunk_capacity_deficit but _size not in the last reserved chunk. Found during code review, no known user impact. Fixes #10363. Message-Id: <20220411222605.641614-1-tgrabiec@scylladb.com>	2022-04-12 16:35:17 +03:00
Avi Kivity	546ee814dd	Merge 'schema_tables, sstables: return instead of throwing' from Piotr Sarna This miniseries rewrites a few unnecessary throws into forwarding the exception directly. It's partially possible thanks to the new `co_await coroutine::return_exception` mechanism which allows returning from a coroutine early, without explicitly calling co_return (`d5843f6e88`). Closes #10360 * github.com:scylladb/scylla: sstables: : remove unnecessary throws schema_tables: remove unnecessary throws	2022-04-12 15:18:14 +03:00
Piotr Sarna	bce2933d99	sstables: : remove unnecessary throws Throws are translated to passing the exceptions directly.	2022-04-12 13:09:54 +02:00
Piotr Sarna	91f130bd9c	schema_tables: remove unnecessary throws Throws are translated to passing the exception directly.	2022-04-12 13:09:27 +02:00
Pavel Emelyanov	05eb9c9416	repair, system_keyspace: Query repair_history with a helper Querying the table is now done with the help of qctx directly. This patch replaces it with a querying helper that calls the consumer function with the entry struct as the argument. After this change repair code can stop including query_context and mess with untyped_result_set. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-04-12 14:04:21 +03:00
Pavel Emelyanov	59f4aa0934	repair: Update loader code to use system_keyspace entry Patch the history entry loader to use the recently introduced history entry. This is just to reduce the churn in the next patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-04-12 13:59:55 +03:00
Pavel Emelyanov	9940016e05	repair, system_keyspace: Update repair_history with a helper Current code works directly on the qctx which is not nice. Instead, make it use the system keyspace reference. To make it work, the patch adds a helper method and introduces a helper struct for the table entry. This struct will also be used to query the table (next patch). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-04-12 13:57:57 +03:00
Pavel Emelyanov	e501ebd6c2	repair: Keep system keyspace reference Repair updates (and queries on start) the system.repair_history table and thus depends on the system_keyspace object Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-04-12 13:57:08 +03:00
Avi Kivity	aa7c6dfaa9	Merge 'Commitlog: refactor file handling - simplify file management + make bookkeep safer' from Calle Wilund Adds a "named_file" wrapper type in commitlog, encapsulating file and disk size, the latter being updated automatically on write/truncate/allocate/delete operations. Use this instead of loose vars in segments, and also in recycle/delete lists. Having the data propagate with the objects means we can dispose of re-reading sizes from disk, which in turn means we know what "our" view of the file sizes is when we try to delete/recycle them -> we can bookkeep accurately (from our view point) without having to resort to the rather horrible recalculation of disk footprint. This series also drops non-recycled segment handling, since it is not used anywhere, and just makes things harder. It also adds a parameter to set flush threshold. These two first patches could be broken out into separate PR:s if need be. Closes #10084 * github.com:scylladb/scylla: commitlog: Fold named_file continuations into caller coroutine frame commitlog: Use named named_file objects in delete/dispose/recycle lists commitlog: Use named_file size tracking instead of segment var commitlog: Use named_file in segment commitlog: Add "named_file" file wrapping type commitlog: Make flush threshold a config parameter commitlog: kill non-recycled segment management	2022-04-12 11:28:36 +03:00
Raphael S. Carvalho	f05ae92849	compaction: move compaction::enable_garbage_collected_sstable_writer() into protected namespace Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220411181322.192830-2-raphaelsc@scylladb.com>	2022-04-12 11:21:18 +03:00
Raphael S. Carvalho	3741e7fb6d	compaction: LCS: kill unused bootstrapping code With off-strategy, we no longer need LCS explicitly switching to STCS mode, and even without off-strategy, the dynamic fan-in approach in compaction manager will cause LCS to automatically switch to STCS under heavy write load. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220411181322.192830-1-raphaelsc@scylladb.com>	2022-04-12 11:21:18 +03:00
Calle Wilund	0e2a3e02ae	commitlog: Fold named_file continuations into caller coroutine frame Saves a continuation. That matters very little. But... Uses a special awaiter type on returns from the "then(...)"-wrapping named_file methods (which use a then([...update]) to keep internal size counters up-to-date, making the continuation instead a stored func into the returned awaiter, executed on successul resume of the caller co_await.	2022-04-11 16:34:00 +00:00
Calle Wilund	ed8f0df105	commitlog: Use named named_file objects in delete/dispose/recycle lists Changes delete/close queue, as well as deletetion queue into one, using named_file objects + marker. Recycle list now also contains said named file type. This removes the need to re-eval file sizes on disk when deleting etc, which in turn means we can dispose of recalculate_footprint on errors, thus making things simpler and safer.	2022-04-11 16:34:00 +00:00
Calle Wilund	cdd4066006	commitlog: Use named_file size tracking instead of segment var I.e. "auto-keep-track" of disk footprint	2022-04-11 16:34:00 +00:00
Calle Wilund	320c49e8d3	commitlog: Use named_file in segment Uses named_file instead of file+string in segments. Does not do anything particularly useful with it.	2022-04-11 16:34:00 +00:00
Calle Wilund	97bf7b1fc8	commitlog: Add "named_file" file wrapping type For keeping track of file, name and size, even across close/rename/delete.	2022-04-11 16:34:00 +00:00
Calle Wilund	7dd7760e8d	commitlog: Make flush threshold a config parameter	2022-04-11 16:34:00 +00:00
Calle Wilund	d478896d46	commitlog: kill non-recycled segment management It has been default for a while now. Makes no sense to not do it. Even hints can use it (even if it makes no difference there)	2022-04-11 16:34:00 +00:00
Raphael S. Carvalho	8427ec056c	gms: gossiper: don't duplicate knowledge of minimum time for gossip to settle Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220409022435.58070-2-raphaelsc@scylladb.com>	2022-04-11 19:19:02 +03:00
cvybhu	5c199cad45	cql3: expr: possible_lhs_values: Handle subscript This commit makes subscript an invalid argument to possible_lhs_values. Previously this function simply ignored subscripts and behaved as if it was called on the subscripted column without a subscript. This behaviour is unexpected and potentially dangerous so it would be better to forbid passing subscript to possible_lhs_values entirely. Trying to handle subscript correctly is impossible without refactoring the whole function. The first argument is a column for which we would like to know the possible values. What are possible values of a subscripted column c where c[0] = 1? All lists that have 1 on 0th position? If we wanted to handle this nicely we would have to change the arguments. Such refectoring is best left until the time when this functionality is actually needed, right now it's hard to predict what interface will be needed then. Signed-off-by: cvybhu <jan.ciolek@scylladb.com> Closes #10228	2022-04-11 19:05:09 +03:00
Gleb Natapov	a3e8ae0979	storage_proxy: fix silencing of remote read errors Filtering remote rpc errors based on exception type did not work because the remote errors were reported as std::runtime_error and all rpc exceptions inherit from it. New rpc propagates remote errors using special type rpc::remote_verb_error now, so we can filter on that instead. Fixes #10339 Message-Id: <YlQYV5G6GksDytGp@scylladb.com>	2022-04-11 18:53:25 +03:00
Botond Dénes	08bcbd25e7	Merge 'toolchain: speed up prepare' from Avi Kivity This series speeds up tools/toolchain/prepare in a few ways: - builds images in parallel - allows running on any arch as host - reduces work in building the image - removes unneeded layers Closes #10348 * github.com:scylladb/scylla: tools: toolchain: prepare: sqush intermediate container layers tools: toolchain: update container image first thing tools: toolchain: prepare: build arch images in parallel tools: toolchain: prepare: aloow running on non-x86	2022-04-11 15:47:10 +03:00
Avi Kivity	fda99de15b	Update seastar submodule * seastar 05cdfc2d30...acf7e3523b (3): > http reply: avoid copying content > rpc: deliver remote verb exceptions as rpc::remote_verb_error instead of std::runtime_error > rpc: drop unneeded code	2022-04-11 15:12:43 +03:00
Botond Dénes	270aba0f51	Merge "Abort database stopping barriers on exception" by Pavel Emelyanov " The database::shutdown() and ::drain() methods are called inside the invoke_on_all()s synchronizing with each other via the cross-shard _stop_barrier. If either shard throws in between all others may get stuck waiting for the barrier to collect all arrivals. To fix it the throwing shard should wake up others, resolving the wait somehow. The fix is actually patch #4, the first and the second are the abort() method for the barrier itself. Fixes: #10304 tests: unit(dev), manual " * 'br-barrier-exception-2' of https://github.com/xemul/scylla: database: Abort barriers on exception database: Coroutinize close_tables test: Add test for cross_shard_barrier::abort() cross-shard-barrier: Add .abort() method	2022-04-11 13:48:43 +03:00
Pavel Emelyanov	f63f1c3d69	database: Abort barriers on exception The database::shutdown() and ::drain() methods are called inside the container().invoke_on_all() and synchronize with each other via the cross-shard _stop_barrier. If either shard throws in between all others may get stuck waiting for the barrier to collect all arrivals. The fix is to abort the barrier on exception thus making all the shards sitting in shutdown or drain to bail out with exceptions too. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-04-11 13:47:02 +03:00
Piotr Sarna	6d937f26ba	Update seastar submodule * seastar 2a2a1305...05cdfc2d (5): > Revert "core: reactor: fix a typo in `smp_pollfn::poll()`" > core: reactor: fix a typo in `smp_pollfn::poll()` > coroutine/exception: make it work with co_await > perftune.py: arfs: allow toggling on/off and allow auto-detection > coroutine: introduce as_future	2022-04-11 12:18:10 +02:00
Nadav Har'El	d9ec5ed46c	test/cql-pytest: add test for blobAsInt() et al for various blob lengths Recently I added a test that verified that blobAsInt() accepts a zero- byte blob and return an "empty" integer. I was asked by one of the reviewers - what happens if we try to pass a three byte blob to blobAsInt()? Here is a new test that demonstrates that the answer is: Besides the 0-byte blob, blobAsInt() only allows a 4-byte blob. Trying 3 or 5 bytes will result in an invalid query error being returned. The test passes on both Cassandra and Scylla, confirming their behavior is the same. The test checks all fixed-sized integer types - int (4 bytes), bigint (8 bytes), smallint (2 bytes) and tinyint (1 byte). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220411093803.651881-1-nyh@scylladb.com>	2022-04-11 12:44:22 +03:00
Raphael S. Carvalho	5cc46b3691	compaction: STCS: kill unused avg_size() Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220408184419.100827-3-raphaelsc@scylladb.com>	2022-04-11 11:24:07 +03:00
Raphael S. Carvalho	6ab570d115	compaction: STCS: only proceed to trim bucket if interesting In practice, a bucket that needs trimming will be interesting, but this could be made clearer in the code. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220408184419.100827-2-raphaelsc@scylladb.com>	2022-04-11 11:24:07 +03:00
Raphael S. Carvalho	4f6003d335	compaction: STCS: simplify most_interesting_bucket() Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220408184419.100827-1-raphaelsc@scylladb.com>	2022-04-11 11:24:07 +03:00
Nadav Har'El	84143c2ee5	alternator: implement Select option of Query and Scan This patch implements the previously-unimplemented Select option of the Query and Scan operators. The most interesting use case of this option is Select=COUNT which means we should only count the items, without returning their actual content. But there are actually four different Select settings: COUNT, ALL_ATTRIBUTES, SPECIFIC_ATTRIBUTES, and ALL_PROJECTED_ATTRIBUTES. Five previously-failing tests now pass, and their xfail mark is removed: * test_query.py::test_query_select * test_scan.py::test_scan_select * test_query_filter.py::test_query_filter_and_select_count * test_filter_expression.py::test_filter_expression_and_select_count * test_gsi.py::test_gsi_query_select_1 These tests cover many different cases of successes and errors, including combination of Select and other options. E.g., combining Select=COUNT with filtering requires us to get the parts of the items needed for the filtering function - even if we don't need to return them to the user at the end. Because we do not yet support GSI/LSI projection (issue #5036), the support for ALL_PROJECTED_ATTRIBUTES is a bit simpler than it will need to be in the future, but we can only finish that after #5036 is done. Fixes #5058. The most intrusive part of this patch is a change from attrs_to_get - a map of top-level attributes that a read needs to fetch - to an optional<attrs_to_get>. This change is needed because we also need to support the case that we want to read no attributes (Select=COUNT), and attrs_to_get.empty() used to mean that we want to read all attributes, not no attributes. After this patch, an unset optional<attrs_to_get> means read all attributes, a set but empty attrs_to_get means read no attributes, and a set and non-empty attrs_to_get means read those specific attributes. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220405113700.9768-2-nyh@scylladb.com>	2022-04-11 10:04:32 +02:00
Nadav Har'El	9c1ebdceea	alternator: forbid empty AttributesToGet In DynamoDB one can retrieve only a subset of the attributes using the AttributesToGet or ProjectionExpression paramters to read requests. Neither allows an empty list of attributes - if you don't want any attributes, you should use Select=COUNT instead. Currently we correctly refuse an empty ProjectionExpression - and have a test for it: test_projection_expression.py::test_projection_expression_toplevel_syntax However, Alternator is missing the same empty-forbidding logic for AttributesToGet. An empty AttributesToGet is currently allowed, and basically says "retrieve everything", which is sort of unexpected. So this patch adds the missing logic, and the missing test (actually two tests for the same thing - one using GetItem and the other Query). Fixes #10332 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220405113700.9768-1-nyh@scylladb.com>	2022-04-11 10:21:02 +03:00
Nadav Har'El	86d01542de	test/alternator: test another example of nested function calls In the existing test we noticed that list_append(if_not_exists(...)) is allowed, but list_append(list_append(...)) is not. I wasn't sure whether if_not_exists(if_not_exists(..)) will be allowed - and this test verifies that it is - it works on both Scylla and DynamoDB, and gives the same results on both. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220407122729.155648-1-nyh@scylladb.com>	2022-04-11 09:56:02 +03:00
Nadav Har'El	3456cbcfcf	test/cql-pytest: split test_null.py into test_null and test_empty We had in test_null.py a mixture of tests for null values and the "null" CQL keyword - and tests for empty values. Null and empty values are not the same thing, and there is no reason to keep the tests for the two things in the same file and further confuse these two distinct concepts. This patch just moves code from test_null.py into a new test_empty.py - there are no functional changes. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220407090348.137583-2-nyh@scylladb.com>	2022-04-11 09:54:54 +03:00
Nadav Har'El	cf79d84efa	test/cql-pytest: add regression test for "empty" integer In https://github.com/scylladb/scylla-rust-driver/issues/278 we noted that beyond the concept of a null integer value (which has size -1), there is also an empty integer value (size 0). This patch adds a test that it works as expected. And we see that it does - Scylla stores such a value fine, and the Python driver retrieves it the same as a null (arguably, this is fine - the important point is to see that we don't get a crash or an error). The test passes - I just added it as a regression test for the future. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220407090348.137583-1-nyh@scylladb.com>	2022-04-11 09:54:53 +03:00

1 2 3 4 5 ...

30871 Commits