scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 05:53:13 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	689a4c1e54	view: Use proxy to get token metadata from The mutate_MV() call needs token metadata and it gets them from global storage service. Fixing it not to use globals is a huge refactoring, so for now just get the tokens from global storage proxy. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Pavel Emelyanov	5a13031ce8	thrift: Use local storage service in handlers Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Pavel Emelyanov	f2992f4e32	thrift: Carry sharded<storage_service>& down to handler The thrift_handler class' methods need storage service. This patch makes sure this class has sharded storage service reference on board. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Pavel Emelyanov	df285fca7a	api: Capture and use sharded<storage_service>& in handlers The reference in question is already there, handlers that need storage service can capture it and use. These handlers are not yet stopped, but neither is the storage service itself, so the potentially dangling reference is not being set up here. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Pavel Emelyanov	2e50ba7079	api: Carry sharded<storage_service>& down to some handlers Both set_server_storage_service and set_server_storage_proxy set up API handlers that need storage service to work. Now they all call for global storage service instance, but it's better if they receive one from main. This patch carries the sharded storage service reference down to handlers setting function, next patch will make use of it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Pavel Emelyanov	a965a742fc	alternator: Take token metadata from server's storage_proxy There's a local_nodelist_handler serving API requests that calls for global storage service to get token metadata from. Now it can get storage proxy reference from server upon construction and use it for tokens. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Pavel Emelyanov	ba10e96c75	alternator: Keep storage_proxy on server It's already available on controller and will be needed by API handlers in the next patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Gleb Natapov	4764028cb3	raft: Remove leader_id from append_request The filed is not used anywhere. Message-Id: <YP0khmjK2JSp77AG@scylladb.com>	2021-07-28 20:30:07 +02:00
Avi Kivity	42e1f318d7	Merge "Respect "bypass cache" in sstable index caching" from Tomasz " This series changes the behavior of the system when executing reads annotated with "bypass cache" clause in CQL. Such reads will not use nor populate the sstable partition index cache and sstable index page cache. " * 'bypass-cache-in-sstable-index-reads' of github.com:tgrabiec/scylla: sstables: Do not populate page cache when searching in promoted index for "bypass cache" reads sstables: Do not populate partition index cache for "bypass cache" reads	2021-07-28 18:45:39 +03:00
Avi Kivity	331eb57e17	Revert "compression: define 'class' attribute for compression and deprecate 'sstable_compression'" This reverts commit `5571ef0d6d`. It causes rolling upgrade failures. Fixes #9055. Reopens #8948.	2021-07-28 14:14:22 +03:00
Avi Kivity	12f9a5462d	Merge 'repair: Drop unused partition level repair related code' from Asias He This series removes unused partition level repair related code. Closes #9105 * github.com:scylladb/scylla: repair: Drop stream plan related code locator: Add missing file.hh include in production_snitch_base repair: Drop request_transfer_ranges and do_streaming repair: Drop parallelism_semaphore	2021-07-28 11:24:30 +03:00
Benny Halevy	67d5addc09	test: mutation_reader_test: clustering_order_merger_test_generator: use explicit type for num_ranges gcc 10.3.1 spews the following error: ``` _test_generator::generate_scenario(std::mt19937&) const’: test/boost/mutation_reader_test.cc:3731:28: error: comparison of integer expressions of different signedness: ‘int’ and ‘long unsigned int’ [-Werror=sign-compare] 3731 \| for (auto i = 0; i < num_ranges; ++i) { \| ~~^~~~~~~~~~~~ ``` Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210728073538.2467040-1-bhalevy@scylladb.com>	2021-07-28 11:22:59 +03:00
Asias He	91bcfba3f7	repair: Drop stream plan related code We do not use stream plan to sync data in repair anymore.	2021-07-28 11:23:42 +08:00
Asias He	860109daca	locator: Add missing file.hh include in production_snitch_base ``` clang++ build/dev/locator/production_snitch_base.o locator/production_snitch_base.cc In file included from locator/production_snitch_base.cc:41: In file included from ./locator/production_snitch_base.hh:41: In file included from /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/unordered_map:38: /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/type_traits:1329:23: error: incomplete type 'seastar::file' used in type trait expression __bool_constant<__has_trivial_destructor(_Tp)>> ^ ``` This code compiles now due to indirect include from repair.hh.	2021-07-28 11:22:37 +08:00
Asias He	f274ed6512	repair: Drop request_transfer_ranges and do_streaming They are used by parititon level repair. With row level repair, we do not need them anymore.	2021-07-28 10:54:18 +08:00
Asias He	f1c08f121a	repair: Drop parallelism_semaphore It is used by partition level repair. With row level repair, we do not need it anymore.	2021-07-28 10:54:15 +08:00
Avi Kivity	77a2b4b520	test: perf: perf_simple_query: add instructions_per_op to the json-result output It's in text output, but `863b49af03` forgot to add it to the machine readable results. Closes #9017	2021-07-27 20:26:19 +02:00
Calle Wilund	59555fa363	cdc: fix broken function signature in maybe_back_insert_iterator Fixes #9103 compare overload was declared as "bool" even though it is a tri-cmp. causes us to never use the speed-up shortcut (lessen search set), in turn meaning more overhead for collections. Closes #9104	2021-07-27 20:37:30 +03:00
Avi Kivity	0a4884e87a	Merge "Expose immutable rows and tombstones collections" from Pavel E " While working on evicting range tombstones one of the nastiest difficulties is that mutation_partition has very loose control over adding and removing of rows and range tombstones. This is because it exposes both collections via public methods, so it's pretty easy to grab a non-const reference on either of it and modify the collection. At the same time restricting the API with returning only const reference on the collection is not possible either, since finding in or iterating over a const-referenced collection would expose the const-reference element as well, while it can be perfectly valid to modify the single row/tombstone without touching the whole collection. In other words there's the need for an access method that both guarantees that no new elements are added to the collection, nor existing ones are removed from it, AND doesn't impose const on the obtained elements. The solution proposed here is the immutable_collection<> template that wraps a non-const collection reference and gives the caller only reading methods (find, lower_bound, begin, etc) so that it's guaranteed that the external user of mutation_partition won't be able to modify the collections. Those that already use the const reference on the mutation_partition itself are OK, they also use const-referenced everything. The places than do need to modify the partition's collections are thus made explicit. tests: unit(dev) " * 'br-mutation-partition-collection-view-3' of https://github.com/xemul/scylla: mutation_partition: Return immutable collection for range tombstones mutation_partition: Pin mutable access to range tombstones mutation_partition: Return immutable collection for rows mutation_partition: Pin mutable access to rows utils: Introduce immutable_collection<> btree: Generalize some iterator methods btree: Make iterators not modify the tree itself btree tests: Dont use iterator erase mutation_partition: Shuffle declarations range_tombstone_list: Mark more methods noexcept range_tombstone_list, code: Mark external_memory_usage noexcept	2021-07-27 20:34:04 +03:00
Pavel Emelyanov	b3c89787be	mutation_partition: Return immutable collection for range tombstones Patch the .row_tombstones() to return the range_tombstone_list wrapped into the immutable_collection<> so that callers are guaranteed not to touch the collection itself, but still can modify the tombstones. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	1bf643d4fd	mutation_partition: Pin mutable access to range tombstones Some callers of mutation_partition::row_tomstones() don't want (and shouldn't) modify the list itself, while they may want to modify the tombstones. This patch explicitly locates those that need to modify the collection, because the next patch will return immutable collection for the others. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	05b8cdfd24	mutation_partition: Return immutable collection for rows Patch the .clustered_rows() method to return the btree of rows wrapped into the immutable_collection<> so that callers are guaranteed not to touch the collection itself, but still can modify the elements in it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	ad27bf40e6	mutation_partition: Pin mutable access to rows Some callers of mutation_partition::clustered_rows() don't want (and shouldn't) modify the tree of rows, while they may want to modify the rows themselves. This patch explicitly locates those that need to modify the collection, because the next patch will return immutable collection for the others. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	c2a36f5668	utils: Introduce immutable_collection<> Wokring with collections can be done via const- and non-const references. In the former case the collection can only be read from (find, iterate, etc) in the latter it's possible to alter the collection (erase elements from or insert them into). Also the const-ness of the collection refernece is transparently inherited by the returned _elements_ of the collection, so when having a const reference on a collection it's impossible to modify the found element. This patch introduces a immutable_collection -- a wrapper over a random collection that makes sure the collection itself is not modified, but the obtained from it elements can be non-const. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	d1c693473a	btree: Generalize some iterator methods The non-const iterator has constructor from key pointer and the tree_if_singular method. There's no reasons why these two are absent in the const_iterator. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	6ef27c9fa1	btree: Make iterators not modify the tree itself The const_iterator cannot modify anything, but the plain iterator has public methods to remove the key from the tree. To control how the tree is modified this method must be marked private and modification by iterator should come from somewhere else. This somewhere else is the existing key_grabber that's already used to move keys between trees. Generalize this ability to move a key out of a tree (i.e. -- erase). Once done -- mark the iterator::erase_and_dispose private. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	e652b03b4e	btree tests: Dont use iterator erase Next patches will mark btree::iterator methods that modify the tree itself as private, so stop using them in tests. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	a9b4fa9db3	mutation_partition: Shuffle declarations Its methods that provide access to enclosed collections of rows and range tombstones are intermixed, so group them for smoother next patching and mark noexcept while at it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	9122f4129d	range_tombstone_list: Mark more methods noexcept Those returning iterators and size for the underlying collection of range tombstones are all non-throwing. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	0f53e83a8e	range_tombstone_list, code: Mark external_memory_usage noexcept The range_tombstone_list's method is at the top of the stack of calls each not throwing anything, so do the deep-dive noexcept marking. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Benny Halevy	3a4e4f9914	compaction: to_string: handle invalid values as internal error Although the switch in `to_string(compaction_options::scrub::mode)` covers all possible cases, gcc 10.3.1 warns about: ``` sstables/compaction.cc: In function ‘std::string_view sstables::to_string(sstables::compaction_options::scrub::mode)’: sstables/compaction.cc:95:1: error: control reaches end of non-void function [-Werror=return-type] ``` Adding __builtin_unreachable(), as in `to_string(compaction_type)` does calm the compiler down, but it might cause undefined behavior in the future in case the switch won't cover all cases, or the passed value is corrupt somehow. Instead, call on_internal_error_noexcept to report the error and abort if configure to do so, otherwise, just return an "(invalid)" string. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210727130251.2283068-1-bhalevy@scylladb.com>	2021-07-27 16:04:09 +03:00
Avi Kivity	df4d77e857	table: simplify generate_and_propagate_view_updates exception handling We have both try/catch and handle_exception() to ignore exceptions. Try/catch is enough, so remove handle_exception(). Closes #9011	2021-07-27 14:08:30 +02:00
Avi Kivity	f86e65b4e7	Merge "Fix quadratic behavior in memtable/row_cache with lots of range tombstones" from Tomasz " This series fixes two issues which cause very poor efficiency of reads when there is a lot of range tombstones per live row in a partition. The first issue is in the row_cache reader. Before the patch, all range tombstones up to the next row were copied into a vector, and then put into the buffer until it's full. This would get quadratic if there is much more range tombstones than fit in a buffer. The fix is to avoid the accumulation of all tombstones in the vector and invoke the callback instead, which stops the iteration as soon as the buffer is full. Fixes #2581. The second, similar issue was in the memtable reader. Tests: - unit (dev) - perf_row_cache_update (release) " * tag 'no-quadratic-rt-in-reads-v1' of github.com:tgrabiec/scylla: test: perf_row_cache_update: Uncomment test case for lots of range tombstones row_cache: Consume range tombstones incrementally partition_snapshot_reader: Avoid quadratic behavior with lots of range tombstones tests: mvcc: Relax monotonicity check range_tombstone_stream: Introduce peek_next()	2021-07-27 14:39:13 +03:00
Avi Kivity	05d22d27a8	Merge "Cut repair->storage-service link" from Pavel E " It exists in the node-ops handler which is registered by repair code, but is handled by storage service. Probably, the whole node-ops handler should instead be moved into repair, but this looks like rather huge rework. So instead -- put the node-ops verb registration inside the storage-service. This removes some more calls for global storage service instance and allows slight optimization of node-ops cross-shards calls. tests: unit(dev), start-stop " * 'br-remove-storage-service-from-nodeops' of https://github.com/xemul/scylla: storage_service: Replace globals with locals storage_service: Remove one extra hop of node-ops handler storage_service: Fix indentation after previous patch storage_service: Move cross-shard hop up the stack repair: Drop empty verbs reg/unreg methods repair, storage_service: Move nodeops reg/unreg to storage service repair: Coroutinize row-level start/stop	2021-07-27 13:27:27 +03:00
Takuya ASADA	fdc786b451	install.sh: add supervisor support Bring supervisor support from dist/docker to install.sh, make it installable from relocatable package. This enables to use supervisor with nonroot / offline environment, and also make relocatable package able to run in Docker environment. Related #8849 Closes #8918	2021-07-27 12:51:29 +03:00
Takuya ASADA	42fd73d033	scylla_setup: add RAID5 support This supports optional RAID5 support on scylla_setup. Fixes #9076 Closes #9093	2021-07-27 12:49:29 +03:00
Avi Kivity	2cca461652	Merge 'sstables: merge row consumer interfaces with implementations' from Wojciech Mitros This patch follows #9002, further reducing the complexity of the sstable readers. The split between row consumer interfaces and implementations has been first added in 2015, and there is no reason to create new implementations anymore. By merging those classes, we achieve a sizeable reduction in sstable reader length and complexity. Refs #7952 Tests: unit(dev) Closes #9073 * github.com:scylladb/scylla: sstables: merge row_consumer into mp_row_consumer_k_l sstables: move kl row_consumer sstables: merge consumer_m into mp_row_consumer_m sstables: move mp_row_consumer_m	2021-07-27 12:23:29 +03:00
Benny Halevy	424c53d5b1	mutation_fragment_stream_validator: disambiguate schema member definition gcc 10.3.1 complains that: ``` ./mutation_fragment_stream_validator.hh:39:21: error: declaration of ‘const schema& mutation_fragment_stream_validator::schema() const’ changes meaning of ‘schema’ [-fpermissive] 39 \| const ::schema& schema() const { return _schema; } \| ^~~~~~ ``` Defining the _schama member as `::schema` rather than just `schema` calms the compiler down. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210727073941.1999909-1-bhalevy@scylladb.com>	2021-07-27 11:55:42 +03:00
Nadav Har'El	8030461a2c	cql-pytest: translate Cassandra's misc. type tests This is a translation of Cassandra's CQL unit test source file validation/entities/TypeTest.java into our our cql-pytest framework. This is a tiny test file, with only four test which apparently didn't find their place in other source files. All four tests pass on Cassandra, and all but one pass on Scylla - the test marked xfail discovered one previously-unknown incompatibility with Cassandra: Refs #9082: DROP TYPE IF EXISTS shouldn't fail on non-existent keyspace Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210726140934.1479443-1-nyh@scylladb.com>	2021-07-27 08:28:16 +03:00
Tomasz Grabiec	7578cef0a4	test: perf_row_cache_update: Uncomment test case for lots of range tombstones	2021-07-26 21:38:00 +02:00
Gleb Natapov	56d0f711e8	serialize: allow use non copyable types with std::variant Message-Id: <20210720120935.710549-3-gleb@scylladb.com>	2021-07-26 19:09:19 +03:00
Gleb Natapov	63025a75b2	serialize: allow use non copyable types with std::optional Message-Id: <20210720120935.710549-2-gleb@scylladb.com>	2021-07-26 19:09:19 +03:00
Avi Kivity	8a80e455fb	sstables: keys: convert trichotomic comparisons to std::strong_ordering Prevent accidental conversions to bool from yielding the wrong results. Unprepared users (that converted to bool, or assigned to int) are adjusted. Ref #1449 Test: unit (dev) Closes #9088	2021-07-26 19:09:19 +03:00
Nadav Har'El	d3a715e0ff	Update seastar submodule * seastar 93d053cd...ce3cc268 (4): > doc: update coroutine exception paragraph with make_exception > coroutine: add make_exception helper > coroutine: use std::move for forwarding exception_ptr > doc: tutorial: document direct exception propagation With the new throw-less coroutine exception support, we can modify some of Scylla's new coroutine code to generate exceptions a bit more efficiently, without actually thowing an exception.	2021-07-26 19:09:19 +03:00
Tomasz Grabiec	2d18360157	row_cache: Consume range tombstones incrementally Before the patch, all range tombstones up to the next row were copied into a vector, and then put into the buffer until it's full. This would get quadratic if there is much more range tombstones than fit in a buffer. The fix is to avoid the accumulation of all tombstones in the vector and invoke the callback instead, which stops the iteartion as soon as the buffer is full. Fixes #2581.	2021-07-26 17:48:05 +02:00
Tomasz Grabiec	e74c3c885e	partition_snapshot_reader: Avoid quadratic behavior with lots of range tombstones next_range_tombstone() was populating _rt_stream on each invocation from the current iterator ranges in _range_tombstones. If there is a lot of range tombstones, all would be put into _rt_stream. One problem is that this can cause a reactor stall. Fix by more incremental approach where we populate _rt_stream with minimal amount on each invocation of next_range_tombstone(). Another problem is that this can get quadratic. The iterators in _range_tombstones are advanced, but if lsa invalidates them across calls they can revert back to the front since they go back to _last_rt, which is the last consumed range tombstone, and if the buffer fills up, not all tombstones from _rt_stream could be consumed. The new code doesn't have this problem because everything which is produced out of the iterators in _range_tombstones is produced only once. What we put into _rt_stream is consumed first before we try to feed the _rt_stream with more data.	2021-07-26 17:48:05 +02:00
Tomasz Grabiec	0d7b3f9463	tests: mvcc: Relax monotonicity check Consecutive range tombstones can have the same position. They will, in one of the test cases, after the range tombstone merger in partition_snapshot_flat_reader no longer uses range_tombstone_list to merge data form multiple versions, which deoverlaps, but rather merges the streams corresponding to each version, which interleaves range tombstones from different versions.	2021-07-26 17:27:03 +02:00
Tomasz Grabiec	91868cf0cd	range_tombstone_stream: Introduce peek_next()	2021-07-26 13:33:34 +02:00
Pavel Emelyanov	11a2709f10	storage_service: Replace globals with locals The node-ops verb handler is the lambda of storage-service and it can stop using global storage service instance for no extra charge. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-26 14:21:30 +03:00
Pavel Emelyanov	6e56671d9e	storage_service: Remove one extra hop of node-ops handler It's now clear that the verb handler goes to some "random" shard, then immediatelly switches to shard-0 and then does the handling. Avoid the extra hop and go to shard-0 right at once. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-26 14:21:30 +03:00

1 2 3 4 5 ...

27614 Commits