scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 10:41:12 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	fa59ccb89d	snitch: Declare snitch_ptr peering and rework container() method This patch makes the snitch base class reference local snitch_ptr, not its sharded<> container and, respectively, makes the base container() method return _backreference->container() instead. The motivation of this change is, again, in the next patch, which will move snitch_ptr<->driver_object linkage into snitch_ptr constructor. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-04-11 14:38:32 +03:00
Pavel Emelyanov	552a08ecd0	snitch: Introduce container() method Some snitch drivers want the peering_sharded_service::container() functionality, but they can't directly use it, because the driver class is in fact the pimplification behind the sharded<snitch_ptr> service. To overcome this there's a _my_distributed pointer on the driver base class that points back to sharded<snitch_ptr> object. This patch replaces the direct _my_distributed usage with the container() method that does it and also asserts that the pointer in question is initialized (some drivers already do it, some don't). Other than making the code more peering_sharded_service-like, this patch allows changing _my_distributed into _backreference that points to this shard's snitch_ptr, see next patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-04-11 14:38:27 +03:00
Avi Kivity	59d56a3fd7	Merge 'Add keyspace storage options' from Piotr Sarna This series is part of the shared storage project. The STORAGE option is designed to hold a map of options used for customizing storage for given keyspace. The option is kept in a system_schema.scylla_keyspaces table. This option is guarded with a schema feature, because it's kept in a new schema table: `system_schema.scylla_keyspaces`. Example of the contents of the new table: ```cql cassandra@cqlsh> select * from system_schema.scylla_keyspaces; keyspace_name \| storage_options \| storage_type ---------------+------------------------------------------------+-------------- ksx \| {'bucket': '/tmp/xx', 'endpoint': 'localhost'} \| S3 ``` Native storage options are not kept in the table, as this format doesn't hold any extra options and it would therefore just be a waste of storage. Closes #10144 * github.com:scylladb/scylla: test: regenerate schema_change_test for storage options case test: improve output of schema_change_test regeneration docs: add a paragraph on keyspace storage options test: add test cases for keyspace storage options database,cql3: add STORAGE option to keyspaces db: add keyspace-storage-options experimental feature db,schema_tables: add scylla_keyspaces table db,gms: add SCYLLA_KEYSPACE schema feature db,gms: add KEYSPACE_STORAGE_OPTIONS feature	2022-04-10 17:23:56 +03:00
Avi Kivity	379892142d	Merge 'Coroutinize view_update_builder::build_some' from Benny Halevy Simplify view_update_builder::build_some by turning it into a coroutine, and make view_updates::move_to async (also using a coroutine) so it may yield in-between building the updates, since freezing each mutation can be cpu intensive and preparing many updates synchronously may cause reactor stalls. Test: unit(dev) DTest: materialized_views_test.py(dev) Closes #10344 * github.com:scylladb/scylla: db: view_updates: coroutinize move_to db: view_update_builder: build_some: maybe yield between updates db: view_update_builder: build_some: fixup indentation db: view_update_builder: coroutinize build_some	2022-04-10 16:13:58 +03:00
Raphael S. Carvalho	7b1589cb3d	tests: chunked_managed_vector_test: Test correctness when crossing chunk boundary While reviewing "utils/chunked_managed_vector: Fix corruption in case there is more than one chunk", I was worried that there could be a correctness issue when pop_back() pops off the first element of the last chunk, but turns out I made an off-by-one error in my theory. Anyway, I wrote a unit test to verify my assumption and I found worth submitting it upstream. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220408133555.12397-2-raphaelsc@scylladb.com>	2022-04-08 16:44:16 +02:00
Raphael S. Carvalho	2c11673246	utils/chunked_managed_vector: expose max_chunk_capacity() That's useful for tests which want to verify correctness when the vector is performing operations across the chunk boundary. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220408133555.12397-1-raphaelsc@scylladb.com>	2022-04-08 16:44:00 +02:00
Benny Halevy	6454c8d67f	db: view_updates: coroutinize move_to And allow yielding in-between freezing each update mutation. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-08 11:29:25 +03:00
Benny Halevy	0e570d6ffa	db: view_update_builder: build_some: maybe yield between updates `update.move_to` freezes the mutation Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-08 11:22:41 +03:00
Benny Halevy	243ba2e976	db: view_update_builder: build_some: fixup indentation Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-08 11:21:42 +03:00
Benny Halevy	3e376155ef	db: view_update_builder: coroutinize build_some Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-04-08 11:20:35 +03:00
Piotr Sarna	151d8f7c58	test: regenerate schema_change_test for storage options case Keyspace storage options series adds a new schema table: system_schema.scylla_keyspaces. The regenerated cases ensure that this new table is taken into account when the schema feature is available.	2022-04-08 09:17:01 +02:00
Piotr Sarna	4705a5fa42	test: improve output of schema_change_test regeneration Schema change test operates on pre-generated sstables, and sometimes this set of sstables needs to be regenerated. In order to make the regeneration process more ergonomic, the output is now directly copyable as valid C++ representation of UUIDs.	2022-04-08 09:17:01 +02:00
Piotr Sarna	20de52d96c	docs: add a paragraph on keyspace storage options A new CQL extension: allowing to specify keyspace storage options, is now described in our design notes.	2022-04-08 09:17:01 +02:00
Piotr Sarna	97c9729487	test: add test cases for keyspace storage options The test cases check if it's possible to set and/or alter storage options for keyspaces with CQL, and whether the changes are reflected in the schema tables.	2022-04-08 09:17:01 +02:00
Piotr Sarna	58529591a9	database,cql3: add STORAGE option to keyspaces The STORAGE option is designed to hold a map of options used for customizing storage for given keyspace. The option is kept in a system_schema.scylla_keyspaces table. The option is only available if the whole cluster is aware of it - guarded by a cluster feature. Example of the table contents: ``` cassandra@cqlsh> select * from system_schema.scylla_keyspaces; keyspace_name \| storage_options \| storage_type ---------------+------------------------------------------------+-------------- ksx \| {'bucket': '/tmp/xx', 'endpoint': 'localhost'} \| S3 ```	2022-04-08 09:17:01 +02:00
Piotr Sarna	3272b4826f	db: add keyspace-storage-options experimental feature Specifying non-standard keyspace options is experimental, so it's going to be protected by a configuration flag.	2022-04-08 09:17:01 +02:00
Piotr Sarna	7f02b188b7	db,schema_tables: add scylla_keyspaces table The table holds scylla-specific information on keyspaces. The first columns include storage_type and storage_options, which will be used later to store storage information.	2022-04-08 09:17:00 +02:00
Piotr Sarna	120980ac8e	db,gms: add SCYLLA_KEYSPACE schema feature This schema feature will be used to guard the upcoming system_schema.scylla_keyspaces schema table.	2022-04-08 09:17:00 +02:00
Piotr Sarna	567c0d0368	db,gms: add KEYSPACE_STORAGE_OPTIONS feature The feature represents the ability to store storage options in keyspace metadata: represented as a map of options, e.g. storage type, bucket, authentication details, etc.	2022-04-08 09:17:00 +02:00
Tomasz Grabiec	41fe01ecff	utils/chunked_managed_vector: Fix corruption in case there is more than one chunk If reserve() allocates more than one chunk, push_back() should not work with the last chunk. This can result in items being pushed to the wrong chunk, breaking internal invariants. Also, pop_back() should not work with the last chunk. This breaks when there is more than one chunk. Currently, the container is only used in the sstable partition index cache. Manifests by crashes in sstable reader which touch sstables which have partition index pages with more than 1638 partition entries. Introduced in `78e5b9fd85` (4.6.0) Fixes #10290 Message-Id: <20220407174023.527059-1-tgrabiec@scylladb.com>	2022-04-07 21:26:35 +03:00
Benny Halevy	40ad057b6c	database: delete db_apply_executor forward declaration The class is long gone, since version 3.0. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220407094632.2647967-1-bhalevy@scylladb.com>	2022-04-07 17:11:38 +03:00
Wojciech Mitros	97408078a1	dependencies: add rust The main reason for adding rust dependency to scylla is the wasmtime library, which is written in rust. Although there exist c++ bindings, they don't expose all of its features, so we want to do that ourselves using rust's cxx. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com> [avi: update toolchain] [avi: remove example, saving for a follow-on]	2022-04-07 12:26:05 +03:00
Botond Dénes	ad075b27a4	test/lib/mutation_diff: s/colordiff/diff/ Colordiff is problematic when writing the diff into a file for later examination. Use regular diff instead. One can still get syntax highlighting by writing the output into `.diff` file (which most editors will recognize). Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20220407080944.324108-1-bdenes@scylladb.com>	2022-04-07 12:07:24 +03:00
Michael Livshin	da7c7fd3dc	delete code of the unused normalizing_reader class Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20220406161107.2376568-3-michael.livshin@scylladb.com>	2022-04-07 09:29:41 +03:00
Michael Livshin	d8598d048a	enormous_table_reader: inherit from flat_mutation_reader_v2::impl (completely mechanical change) Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20220406161107.2376568-2-michael.livshin@scylladb.com>	2022-04-07 09:29:41 +03:00
Michael Livshin	702ad7447a	enormous_table_reader: remove the duplicate _schema field flat_mutation_reader{,_v2}::impl already contains one, which makes for very exciting debugging experience (and no, clang does not mind at all). Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20220406161107.2376568-1-michael.livshin@scylladb.com>	2022-04-07 09:29:41 +03:00
Pavel Emelyanov	9066224cf4	table: Don't export compaction manager reference There's a public call on replica::table to get back the compaction manager reference. It's not needed, actually. The users of the call are distributed loader which already has database at hand, and a test that creates itw own instance of compaction manager for its testing tables and thus also has it available. tests: unit(dev) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20220406171351.3050-1-xemul@scylladb.com>	2022-04-07 09:27:45 +03:00
Botond Dénes	18be2e9faf	Merge "Remove gossiper->snitch kicking" from Pavel Emelyanov " Gossiper calls snitch->gossiper_starting() when being enabled. This generates a dependency loop -- snitch needs gossiper to gossip its states and get DC/RACK, gossiper needs snitch to do this kick. This set removes this notification. The new approach is to kick the snitch to gossip its states in the same places where gossiper is enabled() so that only the snitch->gossiper dependency remains. As a side effect the set ditches a bunch of references to global snitch instance. tests: unit(dev) " * 'br-snitch-gossiper-starting' of https://github.com/xemul/scylla: snitch: Remove gossiper_starting() snitch: Remove gossip_snitch_info() property-file snitch: Re-gossip states with the help of .get_app_states() property-file snitch: Reload state in .start() ec2 multi-region snitch: Register helper in .start() snitch, storage service: Gossip snitch info once snitch: Introduce get_app_states() method property-file snitch: Use _my_distributed to re-shard storage service: Shuffle snitch name gossiping	2022-04-06 17:41:36 +03:00
Piotr Sarna	2683b54402	Merge 'CQL3: Optional FINALFUNC and INITCOND for UDA' from Michał Jadwiszczak Makes final function and initial condition to be optional while creating UDA. No final function means UDA returns final state and default initial condition is `null`. Both items were optional in cql's grammar but they were treated as required in code. Additionally I've added check if state function returns state. Fixes #10324 Closes #10331 * github.com:scylladb/scylla: CQL3: check sfunc return type in UDA cql-pytest: UDA no final_func/initcond tests cql3: allow no final_func and no initcond in UDA	2022-04-06 16:04:47 +02:00
Michael Livshin	a90e02c302	skeleton_reader: inherit from flat_mutation_reader_v2::impl (completely mechanical change) Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20220406122912.2248111-1-michael.livshin@scylladb.com>	2022-04-06 16:55:54 +03:00
Michael Livshin	6001a0fef1	multi_partition_reader: inherit from flat_mutation_reader_v2::impl (completely mechanical change) Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20220406122122.2246058-1-michael.livshin@scylladb.com>	2022-04-06 16:55:07 +03:00
Benny Halevy	17358ac2a0	cmake: CMakeLists.txt: rename flat_mutation_reader.cc to readers/mutation_readers.cc It was moved in 31d84a254c00b36dc2576e06ee288e28a13238195. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220406110512.3731011-3-bhalevy@scylladb.com>	2022-04-06 14:10:34 +03:00
Benny Halevy	4b3d0643a8	cmake: CMakeLists.txt: remove conncetion_notifier.cc It was removed in `3aa05f7f03`. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220406110512.3731011-2-bhalevy@scylladb.com>	2022-04-06 14:10:33 +03:00
Benny Halevy	8d95e12ecd	cmake: CMakeLists.txt: update source paths Those were moved to subdirectories. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220406110512.3731011-1-bhalevy@scylladb.com>	2022-04-06 14:10:32 +03:00
Avi Kivity	82733aeadb	Merge 'Perf: Add extended template version of timed_perf + use in CL perf' from Calle Wilund Adds sub-template for time_parallel with templated result type + optional per-iteration post-process func. Idea is that Res may be a subtype of perf_result, with additional stats, initiated on init, and post-process function can fix up and apply stats -> we can add stats to result. Then uses this mighty construct to add some IO stats to CL perf. Closes #10334 * github.com:scylladb/scylla: perf_commitlog: Add bytes + bytes written stats perf: Add aio_writes mixin for perf_results test/perf/perf.hh: Make templated version of test routine to allow extended stats	2022-04-06 12:52:53 +03:00
Nadav Har'El	0f3cd6ad18	test/cql-pytest: fix fails_without_raft tests on Cassandra We had a Python typo ("false" instead of "False") which prevented tests with the fails_without_raft marker for running on Cassandra. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220405170337.36321-1-nyh@scylladb.com>	2022-04-06 11:20:25 +03:00
Jadw1	b560286ffe	CQL3: check sfunc return type in UDA Thre return type of state function is now checked while creating UDA. Appropriate test added to cql-pytest.	2022-04-06 09:25:17 +02:00
Jadw1	977d6ac8b0	cql-pytest: UDA no final_func/initcond tests Cql-pytests to check if UDA works properly without final function or initial condition.	2022-04-06 09:25:12 +02:00
Jadw1	c921efd1b3	cql3: allow no final_func and no initcond in UDA Makes final function and initial condition to be optional while creating UDA. No final function means UDA returns final state and defeult initial condition is `null`. Fixes: #10324	2022-04-06 09:08:50 +02:00
Nadav Har'El	cfe04e6437	test/cql-pytest: nicer error message if a test can't find nodetool When testing Scylla, cql-pytest does not need an external nodetool command - it uses the REST API instead because it is much faster and there is no need to install anything. However, if cql-pytest is run against Cassandra, the tests do want to use the "nodetool" utility and want to know what it is. The tests use either the NODETOOL environment variable, or if that doesn't exist, look for "nodetool" in the path. If nodetool wasn't found in that way, before this patch, we got an ugly error message with long irrelevant Python backtraces. It wasn't easy to understand that what actually happened was that the user forgot to set the NODETOOL environment variable. This patch cleans up this error handling. Now, if nodetool cannot be found, every test that tries to run nodetool will report just a one- line error message, clearly explaining what went wrong and how to fix it: Error: Can't find nodetool. Please set the NODETOOL environment variable to the path of the nodetool utility. To reiterate, when testing Scylla, nodetool is not needed even after this patch. These errors will not happen even if you don't have the nodetool utility. You only need nodetool if you plan to test Cassandra. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220405171835.43992-1-nyh@scylladb.com>	2022-04-05 20:29:02 +03:00
Calle Wilund	af28fb6d94	perf_commitlog: Add bytes + bytes written stats Used extended perf_result used with aio_writes + aio_write_bytes to include some IO stats for the benchmark.	2022-04-05 13:43:57 +00:00
Calle Wilund	5b60a6cf7c	perf: Add aio_writes mixin for perf_results Can be used with time_parallel_ex. Adds measurements for aio writes/aio written bytes.	2022-04-05 13:42:36 +00:00
Calle Wilund	12ab34a3d9	test/perf/perf.hh: Make templated version of test routine to allow extended stats Adds sub-template for time_parallel with templated result type + optional per-iteration post-process func. Idea is that Res may be a subtype of perf_result, with additional stats, initiated on init, and post-process function can fix up and apply stats -> we can add stats to result.	2022-04-05 13:30:42 +00:00
Avi Kivity	0d5fd526a5	Merge "tools/scylla-sstable alternative schema load method for system tables" from Botond " Examining sstables of system tables is quite a common task. Having to dump the schemas of such tables into a schema.cql is annoying knowing that these schemas are readily available in scylla, as they are hardcoded. This mini-series adds a method to make use of this fact, by adding a new option: `--system-schema`, which takes the name of a system table and looks up its schema. Tests: unit(dev) " * 'scylla-sstable-system-schema/v1' of https://github.com/denesb/scylla: tools/scylla-sstable: add alternative schema load method for system tables tools/schema_loader: add load_system_schema() db/system_distributed_keyspace: add all tables methods tools/scylla-sstable: reorganize main help text	2022-04-05 15:48:29 +03:00
Avi Kivity	6cfc1d6f6a	Update seastar submodule * seastar 798ec50701...2a2a13058e (2): > condition_variable: Add "has_waiters()" accessor + test > Merge "RPC tester" from Pavel E	2022-04-05 13:47:51 +03:00
Gleb Natapov	7bf557332f	storage_service: remove maybe from maybe_start_sys_dist_ks There is nothing "maybe" about it now. Message-Id: <Ykv/bj8MvKh0UU23@scylladb.com>	2022-04-05 12:49:56 +03:00
Benny Halevy	abbf5de68c	frozen_mutation: introduce consume method Allowing to consume the frozen_mutation directly to a stream rather than unfreezing it first and then consuming the unfrozen mutation. Streaming directly from the frozen_mutation saves both cpu and memory, and will make it easier to be made async as a follow, to allow yielding, e.g. between rows. This is used today only in to_data_query_result which is invoked on the read-repair path. Refs #10038 Fixes #10021 Test: unit(release) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220405055807.1834494-1-bhalevy@scylladb.com>	2022-04-05 10:51:21 +03:00
Nadav Har'El	67e0590bbc	alternator: remove old TODO (with test verifying it) We had an old TODO in the Alternator "Scan" operation code which suggested that we may need to do something to limit the size of pages when a row limit ("Limit") isn't given. But we do already have a built-in limit on page sizes (1 MB), so this TODO isn't needed and can be removed. But I also wanted to make sure we have a test that this limit works: We already had a test that this 1 MB limit works for a single-partition Query (test_query.py::test_query_reverse_longish - tested both forward and reversed queries). In this patch I add a similar test for a whole- table Scan. It turns out that although page size is limited in this case as well, it's not exactly 1 MB... For small tables can even reach 3 MB. I consider this "good enough" and that we can drop the TODO, but also opened issue #10327 to document this surprising (for me) finding. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220404145240.354198-1-nyh@scylladb.com>	2022-04-05 09:23:23 +03:00
Nadav Har'El	56936d3c16	test/alternator: add reproducers for scan of long string of tombstones This patch adds two xfailing tests for issue #7933. That issue is about what Scan or Query paging does when encountering a very long string of consecutive tombstones (partition or row tombstones). Ideally, in that case the scan could stop on one of these tombstones after already processing too many. But as these two tests demonstrate, the scan can't stop in the middle of a long string of tombstones - and as a result retrieving a single page can take an unbounded amount of time, which is wrong. Currently the tests are marked `@veryslow` (they each take more than a minute) because they each create a huge number of tombstones to demonstrate a huge amount of work for a single page. When we fix issue #7933 and have a much smaller limit on the number of tombstones processed in a single page, we can hopefully make these tests much shorter and remove the `@veryslow` tag. The `@veryslow` tags means that although these tests can be used manually (with `--runveryslow`) they will not yet be run as part of the usual regression tests. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220403070706.250147-1-nyh@scylladb.com>	2022-04-05 09:11:38 +03:00
Raphael S. Carvalho	840500fc4d	compaction: Make cleanup for Leveled strategy bucket-aware Bucket awareness in cleanup was introduced in `a69d98c3d0`. STCS and TWCS already support it, and now LCS will receive it. The goal of bucket awareness is to reduce writeamp in cleanup, therefore reducing operation time. Additionally, garbage collection becomes more efficient as shadowed data can now be potentially compacted with the data that shadows it, assuming they're on the same level. The implementation for LCS is simple. Will reuse the procedure for STCS for returning jobs in level 0. And one job will be returned for each non-empty level > 0. What allows us to do it is our incremental selection approach used in compaction, that sets a limit on memory usage and disk space requirement. Fixes #10097. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220331173417.211257-1-raphaelsc@scylladb.com>	2022-04-05 09:10:21 +03:00

1 2 3 4 5 ...

30808 Commits