Push down reversing to the mutation sources proper, instead of doing it
at the querier level. This will allow us to test reverse reads at the
mutation-source level.
The `max_size` parameter of `consume_page()` is now unused, but it is
not removed in this patch; it will be removed in a follow-up to reduce
churn.
`time_series_sstable_set` uses `clustering_combined_reader` to implement
efficient single-partition reads. It provides a `position_reader_queue`
to the reader. This queue returns readers over the sstables from the set
in order of the sstables' lower bounds, and with each reader it provides
an upper bound for the positions-in-partition returned by that reader.
Until now we assumed non-reversed queries only. Reversed queries
were implemented by performing a forward query in the lower layers
and reversing the results at the uppermost layer of the reader stack.
Before pushing the reversing down to the sources (in particular,
to sstable readers), we need to support the reverse mode in
`time_series_sstable_set` and the queue it provides to
`clustering_combined_reader`.
This requires using different lower and upper bounds in the queue.
For non-reversed reads we used `sstable::min_position()` as the lower
bound and `sstable::max_position()` as the upper bound. For reversed
reads, all comparisons performed by `clustering_combined_reader` are
reversed, since it uses a reversed schema. We can therefore use
`sstable::max_position().reversed()` for the lower bound and
`sstable::min_position().reversed()` for the upper bound.
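A minimal sketch of the resulting bound selection, assuming the `sstable` accessors named above (the helper functions themselves are illustrative, not part of the patch):

```c++
// Pick the queue bounds for an sstable depending on the read direction.
position_in_partition lower_bound(const sstable& sst, bool reversed) {
    return reversed ? sst.max_position().reversed() : sst.min_position();
}

position_in_partition upper_bound(const sstable& sst, bool reversed) {
    return reversed ? sst.min_position().reversed() : sst.max_position();
}
```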
08042c1688 added the query max result size
to the permit but only set it for single-partition queries. This patch
does the same for range scans, in preparation for `query::consume_page()`
soon not propagating the max size.
The two might not be the same in case the schema was upgraded (unlikely
for virtual tables) or if we are reading in reverse. It is important to
use the passed-in query schema consistently during a read.
This serves as convenient slice storage for reads that have to
store an edited slice somewhere. This is common for reads that work
with a native-reversed slice and so have to convert the slice used in the
query, which is in the half-reversed format.
The evictable reader has to be made aware of reverse reads, as it
checks/edits the slice. Normally this wouldn't require reverse awareness;
it is only required because we still use the half-reversed (legacy) slice
format for reversed reads. Once we switch to the native format this
commit can be reverted.
It transforms the position from a forward-clustering-order schema
domain to a reversed-clustering-order schema domain.
The object still refers to the same element of the space of keys under
this transformation. However, the identification of the position, i.e.
the position_in_partition object, is schema-dependent: it is always
interpreted relative to some schema. Hence the need to transform it
when switching schema domains.
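A hedged illustration of the intended invariant; the comparator spelling and the `make_reversed()` helper are assumed here, not taken from this commit:

```c++
// Reversing both positions preserves which key-space elements they denote,
// but their relative order flips when compared under the reversed schema.
bool reversal_flips_order(const schema& s,
                          const position_in_partition& a,
                          const position_in_partition& b) {
    position_in_partition::tri_compare fwd(s);
    position_in_partition::tri_compare rev(*s.make_reversed());
    return (fwd(a, b) < 0) == (rev(a.reversed(), b.reversed()) > 0);
}
```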
Message-Id: <20210917102612.308149-1-tgrabiec@scylladb.com>
If a locally taken snapshot cannot be installed because a newer one was
received in the meantime, it should be dropped; otherwise it will take up
space needlessly.
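A hedged sketch of the intended behaviour, with illustrative names for the snapshot bookkeeping:

```c++
// After taking a local snapshot, install it only if it is still the latest;
// otherwise delete its files so it doesn't take up space needlessly.
future<> maybe_install_snapshot(snapshot_descriptor snp) {
    if (snp.id != _latest_snapshot_id) {
        co_await delete_snapshot_files(snp.id); // superseded: drop it
        co_return;
    }
    co_await install_snapshot(std::move(snp));
}
```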
Message-Id: <YUrWXxVfBjEio1Ol@scylladb.com>
Consider:
- n1, n2 in the cluster
- n2 shutdown
- n2 sends gossip shutdown message to n1
- n1 delays processing of the shutdown message handler
- n2 restarts
- n1 learns new gossip state of n2
- n1 resumes handling the shutdown message
- n1 incorrectly marks n2 with shutdown status until n2 restarts again
To prevent this, we can send the gossip generation number along with the
shutdown message. If the generation number does not match the local
generation number for the remote node, the shutdown message will be
ignored.
Since we use rpc::optional to send the generation number, this works
in a mixed cluster.
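A hedged sketch of the handler-side check; the helper names are illustrative, only `rpc::optional` comes from the description above:

```c++
// Ignore shutdown messages from a previous incarnation of the sender.
void on_shutdown(gms::inet_address from, rpc::optional<int64_t> generation) {
    if (generation && *generation != local_generation_for(from)) {
        return; // stale message from before the node restarted
    }
    mark_as_shutdown(from);
}
```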
Fixes #8597
Closes #9381
This PR adds the function:
```c++
constant evaluate(const expression&, const query_options&);
```
which evaluates the given expression to a constant value.
It binds all the bound values, calls functions, and reduces the whole expression to just raw bytes and `data_type`, just like `bind()` and `get()` did for `term`.
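A hypothetical usage sketch; `prepared` and `options` are illustrative names assumed to exist at the call site:

```c++
// Evaluating yields a terminal value: raw serialized bytes plus their
// data_type, ready to be used wherever bind()+get() produced a terminal.
expr::constant c = expr::evaluate(prepared, options);
```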
The code is often similar to the original `bind()` implementation in `lists.cc`, `sets.cc`, etc.
* For some reason in the original code, when a collection contains `unset_value`, the whole collection is evaluated to `unset_value`. I'm not sure why this is the case, considering it's impossible to have `unset_value` inside a collection, because we forbid bind markers inside collections. For example here: cc8fc73761/cql3/lists.cc (L134)
This seems to have been introduced by Pekka Enberg in 50ec81ee67, but he has left the company.
I didn't change the behaviour; maybe there is a reason behind it, although it might be better to just throw `invalid_request_exception`.
* There was a strange limitation on map key size that seems incorrect: cc8fc73761/cql3/maps.cc (L150), but I left it in.
* When evaluating a `user_type` value, the old code tolerated `unset_value` in a field, but it was later converted to NULL. This means that `unset_value` doesn't work inside a `user_type`; I didn't change it and will do so in another PR.
* We can't fully get rid of `bind()` yet, because it's used in `prepare_term` to return a `terminal`. It will be removed in the next PR, where we finally get rid of `term`.
Closes#9353
* github.com:scylladb/scylla:
cql3: types: Optimize abstract_type::contains_collection
cql3: expr: Convert evaluate_IN_list to use evaluate(expression)
cql3: expr: Use only evaluate(expression) to evaluate term
cql3: expr: Implement evaluate(expr::function_call)
cql3: expr: Implement evaluate(expr::usertype_constructor)
cql3: expr: Implement evaluate(expr::collection_constructor)
cql3: expr: Implement evaluate(expr::tuple_constructor)
cql3: expr: Implement evaluate(expr::bind_variable)
cql3: Add contains_collection/set_or_map to abstract_type
cql3: expr: Add evaluate(expression, query_options)
cql3: Implement term::to_expression for function_call
cql3: Implement term::to_expression for user_type
cql3: Implement term::to_expression for collections
cql3: Implement term::to_expression for tuples
cql3: Implement term::to_expression for marker classes
cql3: expr: Add data_type to *_constructor structs
cql3: Add term::to_expression method
cql3: Reorganize term and expression includes
Some future and enterprise features require us to inherit from
reader_concurrency_semaphore; this might require additional
"wrap up" operations to be done on stop, which serves as a barrier
for the semaphore. Here we simply make stop virtual so it can be
overridden and augmented.
This change has no significant impact on performance, since stop
is called at most once in the lifetime of a semaphore.
The approach is to add two extension points to the
reader_concurrency_semaphore class, one just before the stop code is
executed and one just after.
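A minimal sketch of that shape; the hook names and coroutine structure are assumed, not taken from the patch:

```c++
#include <seastar/core/future.hh>
#include <seastar/core/coroutine.hh>

class reader_concurrency_semaphore {
public:
    virtual ~reader_concurrency_semaphore() = default;
    virtual seastar::future<> stop() {
        co_await before_stop(); // extension point: runs before the stop code
        co_await do_stop();     // the pre-existing stop logic
        co_await after_stop();  // extension point: runs after the stop code
    }
protected:
    virtual seastar::future<> before_stop() { return seastar::make_ready_future<>(); }
    virtual seastar::future<> after_stop() { return seastar::make_ready_future<>(); }
private:
    seastar::future<> do_stop();
};
```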
Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>
Closes#9373
There's a circular dependency:
- query processor needs database
- database owns large_data_handler and compaction_manager
- those two need qctx
- qctx owns a query_processor
Additionally, the latter hidden dependency is not "tracked" by
constructor arguments: the query processor is started after
the database and its stop is deferred to before the database's.
This works in scylla, because the query processor doesn't really
stop there, but in cql_test_env it's problematic, as it stops
everything, including the qctx.
Recent database start-stop sanitation revealed this problem:
on database stop, either l.d.h. or the compaction manager tries to
start (or continue) messing with the query processor. One problem
was faced immediately and plugged with the 75e1d7ea safety check
inside l.d.h., but cql_test_env tests continued suffering
from use-after-free on the stopped query processor.
The fix is to partially revert 4b7846da by making the tests
stop some pieces of the database (including l.d.h. and the compaction
manager) as they used to before. In scylla this is probably not
needed, at least for now: the database shutdown code was and still
is run right before the stopping code.
tests: unit(debug)
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Message-Id: <20210924080248.11764-1-xemul@scylladb.com>
The Raft PhD presents the following scenario.
When we remove a server from the cluster configuration, it does not
receive the configuration entry which removes it (because the leader
appending this entry uses that entry's configuration to decide which
servers to send the entry to, and that configuration does not contain
the removed server). Therefore the server keeps believing it is a member,
but does not receive heartbeats from leaders in the new configuration.
It will therefore keep becoming a candidate, causing existing leaders to
step down and harming availability. With many such candidates the cluster
may even stop being able to proceed at all. We call such servers
"disruptive".
More concretely, consider the following example, adapted from the PhD for
joint configuration changes (the original PhD considered a different
algorithm which can only add/remove one server at a time):
Let C_old = {A, B, C, D}, C_new = {B, C, D}, and C_joint be the joint
configuration (C_old, C_new). D is the leader. D managed to append
C_joint to every server and commit it. D appends C_new. At this point, D
stops sending heartbeats to A because C_new does not contain A, but A's
last entry is still C_joint, so it still has the ability to become a
candidate. A can now become a candidate and cause D, or any other leader
in C_new, to step down. Even if D manages to commit C_new, A can keep
disrupting the cluster until it is shut down.
Prevoting changes the situation, which the authors admit. The "even if"
above no longer applies: if D manages to commit C_new, or just append it
to a majority of C_new, then A won't be able to succeed in the prevote
phase because a majority of servers in C_new has a longer log than A
(and A must obtain a prevote from a majority of servers in C_new because
A is in C_joint which contains C_new). But the authors continue to argue
that disruptions can still occur during the small period where C_new is
only appended on D but not yet on a majority of C_new. As they say:
"we also did not want to assume that a leader will reliably replicate
entries fast enough to move past the scenario (...) quickly; that might
have worked in practice, but it depends on stronger assumptions that we
prefer to avoid about the performance (...) of replicating log entries".
One could probably try debunking this by saying that if entries take
longer to replicate than the election timeout, we're in much bigger
trouble anyway; but never mind.
In any case, the authors propose a solution which we call "sticky
leadership". A server will not grant a vote to a candidate if it has
recently received a heartbeat from the currently known leader, even if
the candidate's term is higher. In the above example, servers in C_new
would not grant votes to A as long as D keeps sending them heartbeats,
thus A is no longer disruptive.
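For reference, a hedged sketch of the sticky-leadership rule being discussed; this is the extension the commit removes, and the names here are illustrative:

```c++
// Refuse to grant a (pre)vote if we recently heard from a live leader,
// even if the candidate's term is higher.
bool may_grant_vote(term_t candidate_term) const {
    if (_current_leader && clock::now() - _last_leader_contact < _election_timeout) {
        return false; // sticky leadership: stay with the known leader
    }
    return candidate_term >= _current_term; // plus the usual log checks
}
```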
In our case the situation is a bit
different: in original Raft, "heartbeats" have a very specific meaning
- they are append_entries requests (possibly empty) sent by leaders.
Thus if a node stops being a leader it stops sending heartbeats;
similarly, if a node leaves the configuration, it stops receiving
heartbeats from others still in the configuration. We instead use a
"shared failure detector" interface, where nodes may still consider
other nodes alive regardless of their configuration/leadership
situation, as part of the general "MultiRaft" framework.
This pretty much invalidates the original argument, as seen in
the above example: A will still consider D alive, thus it won't become
a candidate.
A shared failure detector combined with sticky leadership actually makes
the situation worse: it may cause cluster unavailability in certain
scenarios (fortunately not permanent; it can be solved with server
restarts, for example). Randomized nemesis testing with reconfigurations
found the following scenario:
Let C1 = {A, B, C}, C2 = {A}, C3 = {B, C}. We start from configuration
C1, B is the leader. B commits joint (C1, C2), then new C2
configuration. Note that C does not learn about the last entry
(since it's not part of C2) but it keeps believing that B is alive,
so it keeps believing that B is the leader.
We then partition {A} from {B, C}. A appends (C2, C3) joint
configuration to its log. It's not able to append it to B or C due to
the partition. The partition holds long enough for A to revert to
candidate state (or we may restart A at this point). Eventually the
partition resolves. The only node which can become a candidate now is A:
C does not become a candidate because it keeps believing that B is the
leader, and B does not become a candidate because it saw the C2
non-joint entry being committed. However, A won't become a leader
because C won't grant it a vote due to the sticky leadership rule.
The cluster will remain unavailable until e.g. C is restarted.
Note that this scenario requires allowing configuration changes which
remove and then re-add the same servers to the configuration. One may
wonder if such reconfigurations should be allowed, but there doesn't
seem to be any example of them breaking safety of Raft (and the PhD
doesn't seem to mention them at all; perhaps it implicitly accepts
them). It is unknown whether a similar scenario may be produced without
such reconfigurations.
In any case, disabling sticky leadership resolves the problem, and it is
the last currently known availability problem found in randomized
nemesis testing. There is no reason to keep this extension, both because
the original Raft authors' argument does not apply with a shared failure
detector, and because one may even argue with the authors in vanilla
Raft given that prevoting is enabled (see the end of the third paragraph
of this commit message).
Message-Id: <20210921153741.65084-1-kbraun@scylladb.com>
contains_collection() and contains_set_or_map() used to be recalculated
on each call. Now the result is calculated only once, during type creation.
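An illustrative shape of the optimization (simplified; not the actual class layout): the recursive walk over subtypes happens once, at type-construction time, and call sites read a cached flag.

```c++
class abstract_type {
    bool _contains_collection; // computed once, at construction
protected:
    explicit abstract_type(bool contains_collection)
        : _contains_collection(contains_collection) {}
public:
    // previously a recursive walk over subtypes on every call; now O(1)
    bool contains_collection() const { return _contains_collection; }
};

class list_type final : public abstract_type {
public:
    list_type() : abstract_type(true) {} // a list is itself a collection
};
```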
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
evaluate_IN_list used term::bind(), but now it's possible
to make it use term::to_expression() and then evaluate(expression).
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Finally we don't need term::bind() to evaluate a term.
We can just convert the term to an expression and call evaluate(expression).
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
function_call can be evaluated now.
The code matches the one from functions::function_call::bind.
I needed to add a cache id to function_call in order for it to work properly.
See the blurb in struct function_call for more information.
The new code corresponds to bind() in cql3/functions/functions.cc.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
usertype_constructor can now be evaluated.
To evaluate a usertype_constructor we need to know the type,
because the fields have to be in the correct order.
The type has been added to usertype_constructor.
The new code corresponds to the old bind() of user_types::delayed_value in cql3/user_types.cc.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
collection_constructor can now be evaluated.
There is a bit of a problem because we don't know the type of an empty collection_constructor,
but luckily empty collection constructors get converted to constants during preparation.
For some reason, in the original code, when a collection contains unset_value,
the whole collection is evaluated to unset_value. I didn't change this behaviour.
The new code corresponds to the old bind() of lists::delayed_value in cql3/lists.cc, sets::delayed_value, etc.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Tuple constructors can now be evaluated.
The new code corresponds to the old bind() of tuples::delayed_value::marker in cql3/tuples.cc.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Implement evaluating a bind_variable.
To be able to evaluate a bind_variable we need to know the type of the
bound value; this is why a data_type has been added to the bind_variable struct.
There are some quirks when evaluating a bind_variable.
The first problem occurs when the variable has been sent with an older cql
serialization format and contains collections. In that case the value has
to be reserialized to use the newest cql serialization format.
The second problem occurs when there is a set or a map in the value.
The set value sent by the driver might not have the elements in the correct
order, might contain duplicates, etc. When a set or map is detected in the
value, it is reserialized as well.
collection_type_impl::reserialize doesn't work for this purpose, because it
uses data_value, which does not perform sorting or duplicate removal.
The new code corresponds to the old bind() of lists::marker in cql3/lists.cc, sets::marker, etc.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Sometimes we need to know whether some type contains
a collection, set, or map inside.
Introduce two functions that provide this information.
Information about collections is useful for reserializing
values sent with the old serialization format.
Information about sets/maps is useful for reserializing
sets and maps to remove duplicates.
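A hedged sketch of how these predicates feed the reserialization decision described in the bind_variable commit above; the helper itself is illustrative:

```c++
// Decide whether a bound value must be reserialized before use.
bool needs_reserialization(const data_type& t, cql_serialization_format sf) {
    // old wire formats encode nested collections differently
    if (sf != cql_serialization_format::latest() && t->contains_collection()) {
        return true;
    }
    // drivers may send sets/maps unsorted or with duplicates
    return t->contains_set_or_map();
}
```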
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Add a function that takes an expression and evaluates it to a constant.
Evaluating specific expression variants will be implemented in the following commits.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Implement to_expression for non-terminals that represent a bind marker.
For now each bind marker has a shape describing where it is used, but hopefully this can be removed in the future.
In order to evaluate a bind_variable we need to know its type.
The type is needed to pass to constant and to validate the value.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
It is useful to have a data_type in the *_constructor structs when evaluating:
the resulting constant has a data_type, so we have to find it somehow.
For tuple_constructor, we don't have to create a separate tuple_type_impl instance.
For collection_constructor, we know what the type is even in the case of an empty collection.
For usertype_constructor, we know the name, type and order of fields in the user type.
Additionally, without a data_type we wouldn't know whether the type is reversed or not.
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Add a method that converts a given term to the matching expression.
It will be used as an intermediate step when implementing evaluate(expression):
evaluate(term) will convert the term to an expression and then call evaluate(expression).
For terminals this is simply calling get() to serialize the value.
For non-terminals the implementation is more complicated and will be implemented in the following commits.
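A sketch of the bridging step (simplified; the exact signature of the term-level evaluate is assumed):

```c++
// Evaluate a legacy term by round-tripping through the expression AST.
expr::constant evaluate(const ::shared_ptr<term>& t, const query_options& options) {
    return expr::evaluate(t->to_expression(), options);
}
```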
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
Make term.hh include expression.hh instead of the other way around.
expression can't be forward declared.
expression is needed in term.hh to declare term::to_expression().
Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>
The AppendReg state machine stores a sequence of integers. It supports
`append` inputs which append a single integer to the sequence and return
the previous state (before appending).
The implementation uses the `append_seq` data structure
representing an immutable sequence that uses a vector underneath
which may be shared by multiple instances of `append_seq`.
Appending to the sequence appends to the underlying vector,
but there is no observable effect on the other instances since
they use only the prefix of the sequence that wasn't changed.
If two instances sharing the same vector try to append,
the later one must perform a copy.
This allows efficient appends if only one instance is appending (see the
sketch after the list below), which is useful in the following context:
- a Raft server stores a copy in the underlying state machine replica
and appends to it,
- clients send append operations to the server; the server returns the
state of the sequence before it was appended to,
- thanks to the sharing, we don't need to copy all elements when
returning the sequence to the client, and only one instance (the
server) is appending to the shared vector,
- summarizing, all operations have amortized O(1) complexity.
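A hedged, self-contained sketch of the idea (simplified; not the actual test code):

```c++
#include <memory>
#include <vector>

// Instances share the underlying vector but only ever read their own prefix,
// so appends are O(1) amortized as long as a single instance keeps appending.
template <typename T>
class append_seq {
    std::shared_ptr<std::vector<T>> _v; // possibly shared between instances
    size_t _len;                        // this instance's prefix length
    append_seq(std::shared_ptr<std::vector<T>> v, size_t len)
        : _v(std::move(v)), _len(len) {}
public:
    append_seq() : _v(std::make_shared<std::vector<T>>()), _len(0) {}
    append_seq appended(T x) const {
        if (_v->size() == _len) {
            _v->push_back(std::move(x)); // we own the tail: cheap append
            return append_seq(_v, _len + 1);
        }
        // another instance appended past our prefix: copy before appending
        auto v = std::make_shared<std::vector<T>>(_v->begin(), _v->begin() + _len);
        v->push_back(std::move(x));
        return append_seq(std::move(v), _len + 1);
    }
    size_t size() const { return _len; }
    const T& operator[](size_t i) const { return (*_v)[i]; }
};
```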
We use AppendReg instead of ExReg in `basic_generator_test`
with a generator which generates a sequence of append operations with
unique integers.
This implies that the result of every operation uniquely identifies the
operation (since it contains the appended integer, and different
operations use different integers) and all operations that must have
happened before it (since it contains the previous state of the append
register). This allows us to reconstruct the "current state" of the
register from the results of operations coming from Raft calls,
giving us an on-line serializability checker with O(1) amortized
complexity per operation completion.
We also enforce linearizability by checking that every
completed operation was previously invoked.
We also perform a simple liveness check at the end of the test by
ensuring that a leader is eventually elected and that we can
successfully execute a call.
* kbr/linearizability-v2:
test: raft: randomized_nemesis_test: check consistency and liveness in basic_generator_test
test: raft: randomized_nemesis_test: introduce append register
"This series removes layer violation in compaction, and also
simplifies compaction manager and how it interacts with compaction
procedure."
* 'compaction_manager_layer_violation_fix/v3' of github.com:raphaelsc/scylla:
compaction: split compaction info and data for control
compaction_manager: use task when stopping a given compaction type
compaction: remove start_size and end_size from compaction_info
compaction_manager: introduce helpers for task
compaction_manager: introduce explicit ctor for task
compaction: kill sstables field in compaction_info
compaction: kill table pointer in compaction_info
compaction: simplify procedure to stop ongoing compactions
compaction: move management of compaction_info to compaction_manager
compaction: move output run id from compaction_info into task
compaction_info must only contain informational data to be exported to
the outside world, whereas compaction_data will contain data for
controlling compaction behavior, plus stats which change as
compaction progresses.
This separation makes the interface clearer, and also allows for
future improvements like removing direct references to the table
in compaction.
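A hedged sketch of the split; the field names are illustrative, not the actual struct layout:

```c++
struct compaction_info {
    // read-only facts exported to the outside world
    utils::UUID run_id;
    sstring ks_name;
    sstring cf_name;
    sstables::compaction_type type;
};

struct compaction_data {
    // mutable control state and progress, owned by the ongoing compaction
    bool stop_requested = false;
    sstring stop_reason;
    uint64_t total_partitions = 0;
    uint64_t total_keys_written = 0;
};
```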
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>