scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 23:13:15 +00:00

Author	SHA1	Message	Date
Alejo Sanchez	f72e89fcfe	raft: replication test: parametrize drop_replication Pass drop_replication down instead of keeping it global. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2021-03-09 11:58:20 -04:00
Alejo Sanchez	5a03670f91	raft: replication test: remove unused configuration Remove test case configuration as it's not implemented yet. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2021-03-09 11:58:20 -04:00
Alejo Sanchez	efc6681cd6	raft: replication test: add license Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2021-03-09 11:58:20 -04:00
Avi Kivity	9038a81317	treewide: drop SEASTAR_CONCEPT Since Scylla requires C++20, there is no need to protect concept definitions or usages with SEASTAR_CONCEPT; it just clutters the code. This patch therefore removes all uses. Closes #8236	2021-03-08 16:04:20 +01:00
Asias He	dc40184faa	gossip: Handle timeout error in gossiper::do_shadow_round Currently, the rpc timeout error for the GOSSIP_GET_ENDPOINT_STATES verb is not handled in gossiper::do_shadow_round. If the GOSSIP_GET_ENDPOINT_STATES rpc call to any of the remote nodes goes timeout, gossiper::do_shadow_round will throw an exception and fail the whole boot up process. It is fine that some of the remote nodes timeout in shadow round. It is not a must to talk to all nodes. This patch fixes an issue we saw recently in our sct tests: ``` INFO \| scylla[1579]: [shard 0] init - Shutting down gossiping INFO \| scylla[1579]: [shard 0] gossip - gossip is already stopped INFO \| scylla[1579]: [shard 0] init - Shutting down gossiping was successful ... ERR \| scylla[1579]: [shard 0] init - Startup failed: seastar::rpc::timeout_error (rpc call timed out) ``` Fixes #8187 Closes #8213	2021-03-08 13:03:41 +01:00
Nadav Har'El	28804a50f7	alternator-test: test that index can't be a name reference (#xyz) We already have a test which shows verify DynamoDB and Alternator do not allow an index in an attribute path - like a[0].b - to be a value reference - a[:xyz].b. We forgot to verify that the index also can't be a name reference - a[#xyz].b is a syntax error. So here we add a test which confirms that this is indeed the case - DynamoDB doesn't allow it, and neither does Alternator. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210219123310.1240271-1-nyh@scylladb.com>	2021-03-08 10:17:19 +01:00
Avi Kivity	938761f49f	types.cc: drop unused #include "compaction_garbage_collector.hh" Garbage-collect unused #includes. Closes #8232	2021-03-08 06:44:03 +01:00
Takuya ASADA	2d9feaacea	scylla_raid_setup: don't abort using raiddev when array_state is 'clear' On Ubuntu 20.04 AMI, scylla_raid_setup --raiddev /dev/md0 causes '/dev/md0 is already using' (issue #7627). So we merged the patch to find free mdX (`587b909`). However, look into /proc/mdstat of the AMI, it actually says no active md device available: ubuntu@ip-10-0-0-43:~$ cat /proc/mdstat Personalities : unused devices: <none> We currently decide mdX is used when os.path.exists('/sys/block/mdX/md/array_state') == True, but according to kernel doc, the file may available even array is STOPPED: clear No devices, no size, no level Writing is equivalent to STOP_ARRAY ioctl https://www.kernel.org/doc/html/v4.15/admin-guide/md.html So we should also check array_state != 'clear', not just array_state existance. Fixes #8219 Closes #8220	2021-03-07 18:30:11 +02:00
Avi Kivity	1287a5e1d0	test: index_reader_assertions: fix misuse of trichotomic comparator in has_monotonic_positions has_monotonic_positions() wants to check for a greater-than-or-equal-to relation, but actually tests for not-equal, since it treats a trichotomic comparator as a less-than comparator. This is clearly seen in the BOOST_FAIL message just below. Fix by aligning the test with the intended invariant. Luckily, the tests still pass. Ref #1449. Closes #8222	2021-03-07 13:44:37 +02:00
Takuya ASADA	53c7600da8	dist: increase fs.aio-max-nr value for other apps Current fs.aio-max-nr value cpu_count() * 11026 is exact size of scylla uses, if other apps on the environment also try to use aio, aio slot will be run out. So increase value +65536 for other apps. Related #8133 Closes #8228	2021-03-07 12:11:36 +02:00
Piotr Sarna	7106ca27e6	service: reduce continuation length for paxos pruning A pair of (finally, handle_exception) is reduced to a single use of then_wrapped(), which saves an allocation. Message-Id: <01949e286db93397209435a85fcc46a8beef6d24.1614937462.git.sarna@scylladb.com>	2021-03-07 11:59:10 +02:00
Nadav Har'El	ad563c6279	Update tools/java submodule Fixes an sstableloader bug where we quoted twice column names that had to be quoted, and therefore failed on such tables - and in particular Alternator tables which always have a column called ":attrs". Fixes #8229 * tools/java 142f517a23...c5d9e8513e (1): > sstableloader: Only escape column names once	2021-03-07 10:33:49 +02:00
Botond Dénes	debaae41f9	mutation_partition: operator<<(mutation_partition::printer) Include row tombstones in the row printout. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20210305094106.210249-1-bdenes@scylladb.com>	2021-03-05 14:39:39 +02:00
Botond Dénes	45471419d0	multishard_mutation_query: re-enable reverse queries `034cb81323` and `0f0c3be` disallowed reverse partition-range scans based on the observation that the CQL frontend disallows them, assuming that other client APIs also disallow them. As it turns out this is not true and there it at least one client API (Thrift) which does allows reverse range scans. So re-enable them. Fixes: #8211 Tests: unit(release), dtest(thrift_tests.py) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20210304142249.164247-1-bdenes@scylladb.com>	2021-03-04 17:06:16 +02:00
Nadav Har'El	acfa180766	cql-pytest: recognize when Scylla crashes Before this patch, if Scylla crashes during some test in cql-pytest, all tests after it will fail because they can't connect to Scylla - and we can get a report on hundreds of failures without a clear sign of where the real problem was. This patch introduces an autouse fixture (i.e., a fixture automatically used by every test) which tries to run a do-nothing CQL command after each test. If this CQL command fails, we conclude that Scylla crashed and report the test in which this happened - and exist pytest instead of failing a hundred more tests. Fixes #8080 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210304132804.1527977-1-nyh@scylladb.com>	2021-03-04 16:00:00 +02:00
Raphael S. Carvalho	1226fc755f	compaction_manager: Increase cleanup compaction resilience when low on disk space In a scenario where node is running out of disk space, which is a common cause of cluster expansion, it's very important to clean up the smallest files first to increase the chances of success when the biggest files are reached down the road. That's possible given that cleanup operates on a single file at a time, and that the smaller the file the smaller the space requirement. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20210303165520.55563-1-raphaelsc@scylladb.com>	2021-03-04 15:11:06 +02:00
Botond Dénes	25367deb01	mutation_partition: make row::consume_with() exception safe This function currently eagerly decrements `_size`, before `func()` is invoked. If `func()` throws the consumption fails but the size remains decremented. If this happens right at the last element in the row, the `row::empty()` will incorrectly return `true`, even though there is still one cell left in it. Move the decrement after the `func()` invocation to avoid this by only decrementing if the consumption was successful. Fixes: #8154 Tests: unit(mutation_test:release) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20210304125318.143323-1-bdenes@scylladb.com>	2021-03-04 15:07:15 +02:00
Piotr Sarna	added53b7d	Merge 'hints: use a soft disk space limit in hints commitlog' from Piotr Dulikowski A recent change to the commitlog (`4082f57`) caused its configurable size limit to be strictly enforced - after reaching the limit, new segments wouldn't be allocated until some of the previous segments are freed. This flow can work for the regular commitlog, however the hints commitlog does not delete the segments itself - instead, hints manager recreates its commitlog every 10 seconds, picks up segments left by the previous instance and deletes each segment manually only after all hints are sent out from a segment. Because of the non-standard flow, it is possible that the hints commitlog fills up and stops accepting more hints. Hints manager uses a relatively low limit for each commitlog instance (128MB divided by shard count), so it's not hard to fill it up. What's worse, hints manager tries to acquire file_update_mutex in exclusive mode before re-creating the commitlog, while hints waiting to be written acquire this lock in shared mode - which causes hints flushing to completely deadlock and no more hints be admitted to the commitlog. The queue of hints waiting to be admitted grows very quickly and soon all writes which could result in a hint being generated are rejected with OverloadedException. To solve this problem, it is now possible to bring back the soft disk space limit by setting a flag in commitlog's configuration. Tests: - unit(dev) - wrote hints for 15 minutes in order to see if it gets stuck again Fixes #8137 Closes #8206 * github.com:scylladb/scylla: hints_manager: don't use commitlog hard space limit commitlog: add an option to allow going over size limit	2021-03-04 12:24:05 +01:00
Tomasz Grabiec	d6a94a7db1	Merge 'Make dht::token tri_compare safer' from Avi Kivity tri_compare() returns an int, which is dangerous as a tri_compare can be misused where a less_compare is expected. To prevent such misuse, convert the interval<> template to accept comparators that return std::strong_ordering, and then convert dht::token's comparator to do the same. Ref #1449. Closes #8181 * github.com:scylladb/scylla: dht: convert token tri_compare to std::strong_ordering interval: support C++20 three-way comparisons	2021-03-04 11:55:08 +01:00
Nadav Har'El	3e66a5cd43	Merge 'More Redis cleanups' from Pekka Enberg This pull request removes seastar namespace imports from the header files. There are some additional cleanups to make that easier and to remove some commented out code. Closes #8202 * github.com:scylladb/scylla: redis: Remove seastar namespace import from query_processor.hh redis: Switch to seastar::sharded<> in query_procesor.hh redis: Remove seastar namespace import from query_utils.hh redis: Remove seastar namespace import from reply.hh redis: Remove commented out code from options.hh redis: Remove seastar namespace import from options.hh redis: Remove seastar namespace import from service.hh redis: Switch to seastar::sharded<> in service.{hh,cc} redis: Remove unneeded include from keyspace_utils.hh redis: Remove seastar namespace import from keyspace_utils.hh redis: Remove seastar namespace import from command_factory.hh redis: Fix include path in command_factory.hh redis: Remove unneeded includes from command_factory.hh	2021-03-04 11:08:24 +02:00
Pekka Enberg	6066db7c90	Update tools/jmx submodule * tools/jmx bac7d0b...15c1d4f (2): > StorageService: Add a method to return the uptime > Bump Jackson version in scylla-apiclient	2021-03-04 10:56:37 +02:00
Nadav Har'El	e12e57c915	Merge 'Fix alternator streams management regression' from Calle Wilund Refs: #8012 Fixes: #8210 With the update to CDC generation management, the way we retrieve and process these changed. One very bad bug slipped through though; the code for getting versioned streams did not take into account the late-in-pr change to make clustering of CDC gen timestamps reversed. So our alternator shard info became quite rump-stumped, leading to more or less no data depending on when generations changed w.r. data. Also, the way we track the above timestamps changed, so we should utilize this for our end-of-iterator check. Closes #8209 * github.com:scylladb/scylla: alternator::streams: Use better method for generation timestamp system_distributed_keyspace: Add better routine to get latest cdc gen. timestamp system_distributed_keyspace: Fix cdc_get_versioned_streams timestamp range	2021-03-04 09:43:56 +02:00
Pekka Enberg	1d8a94f941	Update tools/jmx submodule * tools/jmx c2fc96b...bac7d0b (1): > Merge 'Fix locking in APIBuilder.remove()' from Pekka Enberg	2021-03-03 18:30:48 +02:00
Calle Wilund	8bbc976ff1	alternator::streams: Use better method for generation timestamp Get timestamp via system_distributed, instead of local gen.	2021-03-03 15:46:38 +00:00
Calle Wilund	5da0129775	system_distributed_keyspace: Add better routine to get latest cdc gen. timestamp Since we have a table of cdc version timestamps, conviniently sorted reversed, we can just query this and get the latest known gen ts.	2021-03-03 15:44:54 +00:00
Calle Wilund	5a69250d7e	system_distributed_keyspace: Fix cdc_get_versioned_streams timestamp range With the new scheme for cdc generation management, one of the last changes was to make the time ordering of the stream timestamps reversed. However, cdc_get_versioned_streams forgot to take this into account when sifting out timestamp ranges for stream retrieval (based on low mark). Fixed by doing reverse iteration.	2021-03-03 15:41:42 +00:00
Tomasz Grabiec	3cb01f218f	Merge "raft: add unit tests for log, tracker, votes and fix found bugs" from Kostja Test log consistency after apply_snapshot() is called. Ensure log::last_term() log::last_conf_index() and log::size() work as expected. Misc cleanups. * scylla-dev.git/raft-confchange-test-v4: raft: fix spelling raft: add a unit test for voting raft: do not account for the same vote twice raft: remove fsm::set_configuration() raft: consistently use configuration from the log raft: add ostream serialization for enum vote_result raft: advance commit index right after leaving joint configuration raft: add tracker test raft: tidy up follower_progress API raft: update raft::log::apply_snapshot() assert raft: add a unit test for raft::log raft: rename log::non_snapshoted_length() to log::in_memory_size() raft: inline raft::log::truncate_tail() raft: ignore AppendEntries RPC with a very old term raft: remove log::start_idx() raft: return a correct last term on an empty log raft: do not use raft::log::start_idx() outside raft::log() raft: rename progress.hh to tracker.hh raft: extend single_node_is_quiet test	2021-03-03 16:29:40 +01:00
Tomasz Grabiec	0dc57db248	Revert "Merge "raft: add unit tests for log, tracker, votes and fix found bugs" from Kostja" This reverts commit `f94f70cda8`, reversing changes made to `5206a97915`. Not the latest version of the series was merged. Rvert prior to merging the latest one.	2021-03-03 16:29:02 +01:00
Avi Kivity	facc7c370e	Update tools/jmx submodule * tools/jmx 8073af6...c2fc96b (1): > APIBuilder: Remove RW-lock in JMX server repository wrapper Fixes #7991.	2021-03-03 15:41:09 +02:00
Avi Kivity	aae43e1a20	Merge 'Untyped_result_set: make non-copying and fragment retaining' from Calle Wilund Refs #7961 Fixes #8014 The "untyped_result_set" object was created for small, internal access to cql-stored metadata. It is nowadays used for rather more than that (cdc). This has the potential of mixing badly with the fact that the type does deep copying of data and linearizes all (not to mention handles multiple rows rather inefficiently). Instead of doing a deep copy of input, we keep assume ownership and build rows of the views therein, potentially retaining fragmented data as-is avoiding premature linearization. Note that this is not all sugar and flowers though. Any data access will by nature be more expensive, and the view collections we create are potentially just as expensive as copying for small cells. Otoh, it allows writing code using this that avoids data copying, depending on destination. v2: * Fixed wrong collection reserved in visitor * Changed row index from shared ptr to ref * Moved typedef * Removed non-existing constructors * Added const ref to index build * Fixed raft usage after rebase v3: * Changed shared_ptr to unique Closes #8015 * github.com:scylladb/scylla: untyped_result_set: Do not copy data from input store (retain fragmented views) result_generator: make visitor callback args explicit optionals listlike_partial_deserializing_iterator: expose templated collection routines	2021-03-03 13:13:18 +02:00
Nadav Har'El	4e3db5297a	cql-pytest: rework tests for filtering leaving out most rows Previously, we had two tests demonstrating issue #7966. But since then, our understanding of this issue has improved which resulted in issue #8203, so this patch improves those tests and makes them reproduce the new issue. Importantly, we now know that this problem is not specific to a full-table scan, and also happens in a single-partition scan, so we fix the test to demonstrate this (instead of the old test, which missed the problem so the test passed). Both tests pass on Cassandra, and fail on Scylla. Refs #8203. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210302224020.1498868-1-nyh@scylladb.com>	2021-03-03 11:22:08 +01:00
Calle Wilund	e4d6c8904f	untyped_result_set: Do not copy data from input store (retain fragmented views) Refs #7961 Fixes #8014 Instead of doing a deep copy of input, we keep assume ownership and build rows of the views therein, potentially retaining fragmented data as-is avoiding premature linearization. Note that this is not all sugar and flowers though. Any data access will by nature be more expensive, and the view collections we create are potentially just as expensive as copying for small cells. Otoh, it allows writing code using this that avoids data copying, depending on destination. v2: * Fixed wrong collection reserved in visitor * Changed row index from shared ptr to ref * Moved typedef * Removed non-existing constructors * Added const ref to index build * Fixed raft usage after rebase v3: * Changed shared_ptr to unique	2021-03-03 10:19:46 +00:00
Calle Wilund	353730d4bb	result_generator: make visitor callback args explicit optionals This allows a visitor to separate temporaries (non-optional views) from store backed views (optionals) when traversing.	2021-03-03 10:19:46 +00:00
Calle Wilund	bba43ce31a	listlike_partial_deserializing_iterator: expose templated collection routines To allow using fragmented types as input.	2021-03-03 10:19:46 +00:00
Nadav Har'El	0fea089b37	Merge 'Fix reading whole requests during shedding' from Piotr Sarna When shedding requests (e.g. due to their size or number exceeding the limits), errors were returned right after parsing their headers, which resulted in their bodies lingering in the socket. The server always expects a correct request header when reading from the socket after the processing of a single request is finished, so shedding the requests should also take care of draining their bodies from the socket. Fixes #8193 Closes #8194 * github.com:scylladb/scylla: cql-pytest: add a shedding test transport: return error on correct stream during size shedding transport: return error on correct stream during shedding transport: skip the whole request if it is too large transport: skip the whole request during shedding	2021-03-03 08:52:48 +02:00
Piotr Sarna	4499f89916	cql-pytest: add a shedding test This scylla-only test case tries to push a too-large request to Scylla, and then retries with a smaller request, expecting a success this time. Refs #8193	2021-03-03 07:08:55 +01:00
Pekka Enberg	310b5c9592	redis: Fix license text in server.hh The search and replace pattern went bit overboard. Let's fix up the license text. Message-Id: <20210302171150.3346-1-penberg@scylladb.com>	2021-03-03 07:06:45 +01:00
Dejan Mircevski	05497fe14d	cql3/maps: Drop redundant if condition Accidentally introduced in `9eed26ca3d`, it can never be true due to code above it. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #8201	2021-03-03 07:06:45 +01:00
Nadav Har'El	d6335b7fda	test/alternator: better tests of oversized requests Like DynamoDB, Alternator rejects requests larger than some fixed maximum size (16MB). We had a test for this feature - test_too_large_request, but it was too blunt, and missed two issues: Refs #8195 Refs #8196 So this patch adds two better tests that reproduce these two issues: First, test_too_large_request_chunked verifies that an oversized request is detected even if the body is sent with chunked encoding. Second, both tests - test_too_large_request_chunked and test_too_large_request_content_length - verify that the rather limited (and arguably buggy) Python HTTP client is able to read the 413 status code - and doesn't report some generic I/O error. Both tests pass on DynamoDB, but fail on Alternator because of these two open issues. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210302154555.1488812-1-nyh@scylladb.com>	2021-03-03 07:06:45 +01:00
Nadav Har'El	c6ca1ec643	cql-pytest: add reproducers for two filtering-related issues The main goal of this patch is to add a reproducer for issue #7966, where partition-range scan with filtering that begins with a long string of non-matches aborts the query prematurely - but the same thing is fine with a single-partition scan. The test, test_filtering_with_few_matches, is marked as "xfail" because it still fails on Scylla. It passes on Cassandra. I put a lot of effort into making this reproducer fast - the dev-build test takes 0.4 seconds on my laptop. Earlier reproducers for the same problem took as much as 30 seconds, but 0.4 seconds turns this test into a viable regression test. We also add a test, test_filter_on_unset, reproduces issue #6295 (or the duplicate #8122), which was already solved so this test passes. Refs #6295 Refs #7966 Refs #8122 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210301170451.1470824-1-nyh@scylladb.com>	2021-03-03 07:06:45 +01:00
Calle Wilund	58489dc003	cql3::restrictions: Add SCYLLA_CLUSTERING_BOUND keyword for sstableloader Refs #8093 Refs /scylladb/scylla-tools-java#218 Adds keyword that can preface value tuples in (a, b, c) > (1, 2, 3) expressions, forcing the restriction to bypass column sort order treatment, and instead just create the raw ck bounds accordningly. This is a very limited, and simple version, but since we only need to cover this above exact syntax, this should be sufficient. v2: * Add small cql test v3: * Added comment in multi_column_restriction::slice, on what "mode" means and is for * Added small document of our internal CQL extension keywords, including this. v4: * Added a few more cases to tests to verify multi-column restrictions * Reworded docs a bit v5: * Fixed copy-paste error in comment v6: * Added negative (error) test cases v7: * Added check + reject of trying to combine SCYLLA_CLUST... slice and normal one Closes #8094	2021-03-03 07:06:45 +01:00
Pekka Enberg	9d54a3e743	redis: Remove seastar namespace import from query_processor.hh	2021-03-02 18:39:30 +02:00
Pekka Enberg	27c5041c86	redis: Switch to seastar::sharded<> in query_procesor.hh	2021-03-02 18:38:41 +02:00
Pekka Enberg	ee8fe53b3c	redis: Remove seastar namespace import from query_utils.hh	2021-03-02 18:37:31 +02:00
Pekka Enberg	c90c1ccd44	redis: Remove seastar namespace import from reply.hh	2021-03-02 18:36:30 +02:00
Pekka Enberg	1075d72780	redis: Remove commented out code from options.hh	2021-03-02 18:34:46 +02:00
Pekka Enberg	1c222fda65	redis: Remove seastar namespace import from options.hh	2021-03-02 18:34:30 +02:00
Pekka Enberg	d452c8f42e	redis: Remove seastar namespace import from service.hh	2021-03-02 18:33:31 +02:00
Pekka Enberg	d0594a86aa	redis: Switch to seastar::sharded<> in service.{hh,cc}	2021-03-02 18:30:39 +02:00
Avi Kivity	ee9db75210	Merge 'Clean up Redis transport layer' from Pekka Enberg The Redis transport layer seems to have originated as a copy-paste of the CQL transport layer. This pull request removes bunch of unused and commented out bits of code, and also does some minor cleanups like organizing includes, to make the code more readable. Closes #8198 * github.com:scylladb/scylla: redis: Remove unused to_bytes_view() function from server.cc redis: Remove unused tracing_request_type enum redis: Remove unneeded connection friend declaration redis: Remove unused process_request_executor friend declaration redis: Remove unused _request_cpu class member redis: Remove commented out code from server.hh redis: Remove duplicate request.hh include redis: Remove unused db::config forward declaration redis: Remove unused fmt_visitor forward declaration redis: Organize includes in server.{cc,hh} redis: Switch to seastar::sharded<> redis: Remove redundant access modifiers from server.hh	2021-03-02 18:27:38 +02:00

1 2 3 4 5 ...

25415 Commits