scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-09 16:33:35 +00:00

Author	SHA1	Message	Date
Kefu Chai	d1e8d89ae2	doc: topology-over-raft: add transition_state to node state diagram in order to help the developers to understand the transitions of `node_state` and the `transition_state` on each of the `node_state`, in this change, the nested state machine diagram is added to the node state diagram. please note, instead of trying to merge similar states like bootstrapping and replacing into a single state, we keep them as separate ones, and replicate the nested state machine diagram in them as well, to be more clear. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18025	2024-03-27 12:16:35 +01:00
Kefu Chai	8af9c735f2	docs/operating-scylla: document nodetool sstableinfo Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-27 07:29:24 +08:00
Kefu Chai	da90e368dc	docs/operating-scylla: document nodetool getsstables Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-27 07:29:24 +08:00
Tzach Livyatan	6702ba3664	Docs: Add link from migration tools page to nodetool refresh load and stream Closes scylladb/scylladb#18006	2024-03-25 17:47:05 +02:00
Kamil Braun	69bf962522	Merge 'allow changing snitch with topology over raft' from Gleb Fixes scylladb/scylladb#17513 * 'gleb/raft-snitch-change-v3' of github.com:scylladb/scylla-dev: doc: amend snitch changing procedure to work with raft test: add test to check that snitch change takes effect. raft topology: update rack/dc info in topology state on reboot if changed	2024-03-25 10:41:39 +01:00
Gleb Natapov	3b272c5650	doc: amend snitch changing procedure to work with raft To change snitch with raft all nodes need to be started simultaneously since each node will try to update its state in the raft and for that quorum is required.	2024-03-25 11:31:30 +02:00
David Garcia	0375faa6aa	docs: add experimental tag Closes scylladb/scylladb#17633	2024-03-22 09:53:30 +02:00
David Garcia	559dc9bb27	docs: Implement relative link support for configuration properties Introduces relative link support for individual properties listed on the configuration properties page. For instance, to link to a property from a different document, use the syntax :ref:`memtable_flush_static_shares <confprop_memtable_flush_static_shares>`. Additionally, it also adds support for linking groups. For example, :ref:`Ungrouped properties <confgroup_ungrouped_properties>`. Closes scylladb/scylladb#17753	2024-03-20 11:39:30 +02:00
Piotr Dulikowski	70cb1dc8fe	doc: describe upgrade and recovery for raft topology Document the manual upgrade procedure that is required to enable consistent cluster management in clusters that were upgraded from an older version to ScyllaDB Open Source 6.0. This instruction is placed in previously placeholder "Enable Raft-based Topology" page which is a part of the upgrade instructions to ScyllaDB Open Source 6.0. Add references to the new description in the "Raft Consensus Algorithm in ScyllaDB" document in relevant places. Extend the "Handling Node Failures" document so that it mentions steps required during recovery of a ScyllaDB cluster running version 6.0. Fixes: scylladb/scylladb#17341 Closes scylladb/scylladb#17624	2024-03-19 14:59:14 +01:00
Anna Stuchlik	a13694daea	doc: fix the image upgrade page This commit updates the Upgrade ScyllaDB Image page. - It removes the incorrect information that updating underlying OS packages is mandatory. - It adds information about the extended procedure for non-official images. Closes scylladb/scylladb#17867	2024-03-18 18:27:46 +02:00
Avi Kivity	72bbe75d5b	Merge 'Fix node replace with tablets for RF=N' from Tomasz Grabiec This PR fixes a problem with replacing a node with tablets when RF=N. Currently, this will fail because tablet replica allocation for rebuild will not be able to find a viable destination, as the replacing node is not considered to be a candidate. It cannot be a candidate because replace rolls back on failure and we cannot roll back after tablets were migrated. The solution taken here is to not drain tablet replicas from replaced node during topology request but leave it to happen later after the replaced node is in left state and replacing node is in normal state. The replacing node waits for this draining to be complete on boot before the node is considered booted. Fixes https://github.com/scylladb/scylladb/issues/17025 Nodes in the left state will be kept in tablet replica sets for a while after node replace is done, until the new replica is rebuilt. So we need to know about those nodes' locations (dc, rack) for two reasons: 1) algorithms which work with replica sets filter nodes based on their location. For example materialized views code which pairs base replicas with view replicas filters by datacenter first. 2) tablet scheduler needs to identify each node's location in order to make decisions about new replica placement. It's ok to not know the IP, and we don't keep it. Those nodes will not be present in the IP-based replica sets, e.g. those returned by get_natural_endpoints(), only in host_id-based replica sets. storage_proxy request coordination is not affected. Nodes in the left state are still not present in token ring, and not considered to be members of the ring (datacenter endpoints exclude them). In the future we could make the change even more transparent by only loading locator::node* for those nodes and keeping node* in tablet replica sets. Currently left nodes are never removed from topology, so will accumulate in memory.
We could garbage-collect them from topology coordinator if a left node is absent in any replica set. That means we need a new state - left_for_real. Closes scylladb/scylladb#17388 * github.com:scylladb/scylladb: test: py: Add test for view replica pairing after replace raft, api: Add RESTful API to query current leader of a raft group test: test_tablets_removenode: Verify replacing when there is no spare node doc: topology-on-raft: Document replace behavior with tablets tablets, raft topology: Rebuild tablets after replacing node is normal tablets: load_balancer: Access node attributes via node struct tablets: load_balancer: Extract ensure_node() mv: Switch to using host_id-based replica set effective_replication_map: Introduce host_id-based get_replicas() raft topology: Keep nodes in the left state to topology tablets: Introduce read_required_hosts()	2024-03-18 16:16:08 +02:00
Tomasz Grabiec	1d01b4ca20	doc: topology-on-raft: Document replace behavior with tablets	2024-03-15 13:20:08 +01:00
Pavel Emelyanov	6a77f36519	doc: Add tablets migration state diagram Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#17790	2024-03-14 20:29:21 +01:00
Yaniv Kaul	a2ac80340f	Typo: pint -> print Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> Closes scylladb/scylladb#17804	2024-03-14 15:50:35 +02:00
Kefu Chai	15bea069a9	docs: use less slangy language this is a follow-up change of `1519904fb9`, to incorporate the comment from Anna Stuchlik. Signed-off-by: Anna Stuchlik <anna.stuchlik@scylladb.com> Closes scylladb/scylladb#17671	2024-03-13 13:33:37 +02:00
Botond Dénes	8e90b856b5	Merge 'Extend test.py's ability to select test cases' from Pavel Emelyanov This PR fixes comments left from #17481 , namely - adds case selection to boost suite - describes the case selection in documentation Closes scylladb/scylladb#17721 * github.com:scylladb/scylladb: docs: Add info about the ability to run specific test case test.py: Support case selection for boost tests	2024-03-12 13:21:50 +02:00
Mikołaj Grzebieluch	cb17b4ac59	docs: maintenance socket: add section about accessing maintenance socket Closes scylladb/scylladb#17701	2024-03-11 20:25:00 +02:00
Pavel Emelyanov	3453a934ba	docs: Add info about the ability to run specific test case The test.py usage is documented, the ability to run a specific test by its name is described in doc. Extend it with the new ability to run specific test case as well. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-03-11 09:10:20 +03:00
Tzach Livyatan	a245c0bb98	Docs: Remove 3rd party Rust Driver from the driver list The 3rd party Rust https://github.com/AlexPikalov/cdrs is not maintained, and we have a better internal alternative. Closes scylladb/scylladb#15815	2024-03-06 10:34:43 +02:00
Botond Dénes	f164ed8bae	Merge 'docs: fix the formattings in operating-scylla/nodetool-commands/info.rst' from Kefu Chai couple minor formatting fixes. Closes scylladb/scylladb#17518 * github.com:scylladb/scylladb: docs: remove leading space in table element docs: remove space in words	2024-03-06 10:33:21 +02:00
Tzach Livyatan	dafc83205b	Docs: rename the select-from-mutation-fragments page name Closes scylladb/scylladb#17456	2024-03-06 10:32:56 +02:00
David Garcia	d27d89fd34	docs: add collapsible for images Introduces collapsible dropdowns for images reference docs. With this update, only the latest version's details will be displayed open by default. Information about previous versions will be hidden under dropdowns, which users can expand as needed. This enhancement aims to make pages shorter and easier to navigate. Closes scylladb/scylladb#17492	2024-03-06 10:32:35 +02:00
David Garcia	847882b981	docs: add dynamic substitutions This pull request adds dynamic substitutions for the following variables: * `.. \|CURRENT_VERSION\| replace:: {current_version}` * `.. \|UBUNTU_SCYLLADB_LIST\| replace:: scylla-{current_version}.list` * `.. \|CENTOS_SCYLLADB_REPO\| replace:: scylla-{current_version}.repo` As a result, it is no longer needed to update the "Installation on Linux" page manually after every new release. Closes scylladb/scylladb#17544	2024-03-06 10:25:57 +02:00
comsky	48ad1b3d20	Update stats-output.rst I read this doc to learn how to use nodetool commands, and I eventually found some typos in the docs. 😄 Closes scylladb/scylladb#15771	2024-03-06 10:25:06 +02:00
Avi Kivity	6383aa1e3c	docs: maintainer.md: add exceptions to the don't-commit-your-own-code rules Submodule and toolchain updates aren't original code and so are exempt from the don't-commit-own-code rule. Closes scylladb/scylladb#17534	2024-03-06 10:19:46 +02:00
Tzach Livyatan	04b483e286	Docs: fix RF type in the consistency-calculator Closes scylladb/scylladb#17557	2024-03-06 10:18:29 +02:00
Tzach Livyatan	1edce9f4b6	Improve the frozen vs. non-frozen doc section, removing false claims Closes scylladb/scylladb#17556	2024-03-06 10:16:33 +02:00
Anna Stuchlik	a024c2d692	doc: remove Membership changes vs LWT page This commit removes the redundant "Cluster membership changes and LWT consistency" page. The page is no longer useful because the Raft algorithm serializes topology operations, which results in consistent topology updates. Closes scylladb/scylladb#17523	2024-03-06 10:10:01 +02:00
Botond Dénes	6f374aa7d6	Merge 'doc: update procedures following the introduction of Raft-based topology' from Anna Stuchlik This PR updates the procedures that changed as a result of introducing Raft-based topology. Refs https://github.com/scylladb/scylladb/issues/15934 Applied the updates from https://docs.google.com/document/d/1BgZaYtKHs2GZKAxudBZv4G7uwaXcRt2jM6TK9dctRQg/edit In addition, it adds a placeholder for the 5.4-to-6.0 upgrade guide, as a file included in that guide, Enable Raft topology, is referenced from other places in the docs. Closes scylladb/scylladb#17500 * github.com:scylladb/scylladb: doc: replace "Raft Topology" with "Consistent Topology" doc: (Raft topology) update Removenode doc: (Raft topology) update Upscale a Cluster doc:(Raft topology)update Membership Change Failures doc: doc: (Raft topology) update Replace Dead Node doc: (Raft topology) update Remove a Node doc: (Raft topology) update Add a New DC doc: (Raft topology) update Add a New Node doc: (Raft topology) update Create Cluster (EC2) doc: (Raft topology) update Create Cluster (n-DC) doc: (Raft topology) update Create Cluster (1DC) doc: include the quorum requirement file doc: add the quorum requirement file doc: add placeholder for Enable Raft topology page	2024-03-06 10:05:47 +02:00
Kefu Chai	1519904fb9	docs: quote CQL keywords this "misspelling" was identified by codespell. actually, it's not quite a misspelling, as "UPDATE" and "INSERT" are keywords in CQL. so we intended to emphasize them, so to make codespell more useful, and to preserve the intention, let's quote the keywords with backticks. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17391	2024-03-06 09:57:07 +02:00
Anna Stuchlik	85cfc6059b	doc: replace "Raft Topology" with "Consistent Topology" This commit replaces "Raft-based Topology" with "Consistent Topology Updates" in the 5.4-to-6.0 upgrade guide and all the links to it.	2024-02-29 14:42:30 +01:00
Anna Stuchlik	9250e0d8e0	doc: (Raft topology) update Removenode This commit updates the Nodetool Removenode page with reference to the Raft-related topology. Specifically, it removes outdated warnings, and adds the information about banning removed and ignored nodes from the cluster.	2024-02-29 14:40:19 +01:00
Anna Stuchlik	d59f38a6ad	doc: (Raft topology) update Upscale a Cluster This commit updates the Upscale a Cluster page with reference to the Raft-related topology. Specifically, it adds a note with the quorum requirement.	2024-02-29 14:40:11 +01:00
Anna Stuchlik	5bece99d4d	doc:(Raft topology)update Membership Change Failures This commit updates the Handling Cluster Membership Change Failures page with reference to the Raft-related topology. Specifically, it adds a note that the page only applies when Raft-based topology is not enabled. In addition, it removes the Raft-enabled option.	2024-02-29 14:38:45 +01:00
Anna Stuchlik	48dd7021a7	doc: doc: (Raft topology) update Replace Dead Node This commit updates the Replace a Dead Node page with reference to the Raft-related topology. Specifically, it removes the previous pre-Raft limitation to replace the nodes one by one and the requirement to ensure that the replaced node will never come back to the cluster. In addition, a warning is added to indicate the limitations when Raft-based topology is not enabled upon upgrade from 5.4.	2024-02-29 14:38:45 +01:00
Anna Stuchlik	a390ce9e6b	doc: (Raft topology) update Remove a Node This commit updates the Remove a Node page with reference to the Raft-related topology. Specifically, it removes the previous pre-Raft limitation to remove the nodes one by one and the requirement to ensure that the removed node will never come back to the cluster. In addition, a warning is added to indicate the limitations when Raft-based topology is not enabled upon upgrade from 5.4.	2024-02-29 14:38:45 +01:00
Anna Stuchlik	59f890c0ef	doc: (Raft topology) update Add a New DC This commit updates the Add a New DC page with reference to the Raft-related topology. Specifically, it removes the previous pre-Raft limitation to bootstrap the nodes one by one. In addition, a warning is added to indicate the limitations when Raft-based topology is not enabled upon upgrade from 5.4.	2024-02-29 14:38:36 +01:00
Anna Stuchlik	5a3a720b82	doc: (Raft topology) update Add a New Node This commit updates the Add a New Node (Out Scale) page with reference to the Raft-related topology. Specifically, it removes the previous pre-Raft limitation to bootstrap the nodes one by one. In addition, a warning is added to indicate the limitations when Raft-based topology is not enabled upon upgrade from 5.4.	2024-02-29 14:35:03 +01:00
Anna Stuchlik	631fcebe12	doc: (Raft topology) update Create Cluster (EC2) This commit updates the Create Cluster (EC2) page with reference to the Raft-related topology. Specifically, it removes the previous pre-Raft limitation to bootstrap the nodes one by one. In addition, it updates the concept of the seed node.	2024-02-29 14:30:00 +01:00
Anna Stuchlik	b6b610c16e	doc: (Raft topology) update Create Cluster (n-DC) This commit updates the Create Cluster (Multi DC) page with reference to the Raft-related topology. Specifically, it removes the previous pre-Raft limitation to bootstrap the nodes one by one. In addition, it updates the concept of the seed node.	2024-02-29 14:30:00 +01:00
Anna Stuchlik	cbf054f2b9	doc: (Raft topology) update Create Cluster (1DC) This commit updates the Create Cluster (Single DC) page with reference to the Raft-related topology. Specifically, it removes the previous pre-Raft limitation to bootstrap the nodes one by one. In addition, it updates the concept of the seed node.	2024-02-29 14:30:00 +01:00
Anna Stuchlik	57e0f15c7c	doc: include the quorum requirement file Include the file to avoid repetition.	2024-02-29 14:29:39 +01:00
Anna Stuchlik	b02f8a0759	doc: add the quorum requirement file	2024-02-28 13:21:11 +01:00
Kefu Chai	cd228f4d6c	docs: remove leading space in table element otherwise sphinx would consider "Within which Data Center the" as the "term" part of an entry in a definition list, and "node is located" as the definition part of this entry. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-02-26 13:03:26 +08:00
Kefu Chai	d12655ff46	docs: remove space in words * remove space in "Exceptions", otherwise it renders like "Except" "tions", which does not look right. * remove space in "applicable". * remove space in "Transport". Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-02-26 13:03:26 +08:00
Anna Stuchlik	14a4fa16a8	doc: add placeholder for Enable Raft topology page This commit adds a placeholder for the Enable Raft-based Topology page in the 5.4-to-6.0 upgrade guide. This page needs to be referenced from other pages in the docs.	2024-02-22 16:02:06 +01:00
Kamil Braun	3ee56e1936	Merge 'raft topology: enable writes to previous CDC generations' from Patryk Jędrzejczak When we create a CDC generation and ring-delay is non-zero, the timestamp of the new generation is in the future. Hence, we can have multiple generations that can be written to. However, if we add a new node to the cluster with the Raft-based topology, it receives only the last committed generation. So, this node will be rejecting writes considered correct by the other nodes until the last committed generation starts operating. In scylladb/scylladb#17134, we have allowed sending writes to the previous CDC generations. So, the situation became even more complicated. This PR adjusts the Raft-based topology to ensure all required generations are loaded into memory and their data isn't cleared too early. To load all required generations into memory, we replace `current_cdc_generation_{uuid, timestamp}` with the set containing IDs of all committed generations - `committed_cdc_generations`. To ensure this set doesn't grow endlessly, we remove an entry from this set together with the data in CDC_GENERATIONS_V3. Currently, we may clear a CDC generation's data from CDC_GENERATIONS_V3 if it is not the last committed generation and it is at least 24 hours old (according to the topology coordinator's clock). However, after allowing writes to the previous CDC generations, this condition became incorrect. We might clear data of a generation that could still be written to. The new solution introduced in this PR is to clear data of the generations that finished operating more than 24 hours ago. Apart from the changes mentioned above, this PR hardens `test_cdc_generation_clearing.py`. 
Fixes scylladb/scylladb#16916 Fixes scylladb/scylladb#17184 Fixes scylladb/scylladb#17288 Closes scylladb/scylladb#17374 * github.com:scylladb/scylladb: test: harden test_cdc_generation_clearing test: test clean-up of committed_cdc_generations raft topology: clean committed_cdc_generations raft topology: clean only obsolete CDC generations' data storage_service: topology_state_load: load all committed CDC generations system_keyspace: load_topology_state: fix indentation raft topology: store committed CDC generations' IDs in the topology	2024-02-22 11:41:25 +01:00
Anna Stuchlik	37237407f6	doc: remove info about outdated versions This PR removes information about outdated versions, including disclaimers and information when a given feature was added. Now that the documentation is versioned, information about outdated versions is unnecessary (and makes the docs harder to read). Fixes https://github.com/scylladb/scylladb/issues/12110 Closes scylladb/scylladb#17430	2024-02-20 19:32:13 +02:00
Avi Kivity	93af3dd69b	Merge 'Maintenance socket: set filesystem permissions to 660' from Mikołaj Grzebieluch Set filesystem permissions for the maintenance socket to 660 (previously it was 755) to allow a scyllaadm's group to connect. Split the logic of creating sockets into two separate functions, one for each case: when it is a regular cql controller or used by maintenance_socket. Fixes https://github.com/scylladb/scylladb/issues/16487. Closes scylladb/scylladb#17113 * github.com:scylladb/scylladb: maintenance_socket: add option to set owning group transport/controller: get rid of magic number for socket path's maximal length transport/controller: set unix_domain_socket_permissions for maintenance_socket transport/controller: pass unix_domain_socket_permissions to generic_server::listen transport/controller: split configuring sockets into separate functions	2024-02-20 15:09:54 +02:00
Patryk Jędrzejczak	e145e758eb	raft topology: store committed CDC generations' IDs in the topology When we create a CDC generation and ring-delay is non-zero, the timestamp of the new generation is in the future. Hence, we can have multiple generations that can be written to. However, if we add a new node to the cluster with the Raft-based topology, it receives only the last committed generation. So, this node will be rejecting writes considered correct by the other nodes until the last committed generation starts operating. In scylladb/scylladb#17134, we have allowed sending writes to the previous CDC generations. So, the situation became even more complicated. We need to adjust the Raft-based topology to ensure all required generations are loaded into memory and their data isn't cleared too early. This patch is the first step of the adjustment. We replace `current_cdc_generation_{uuid, timestamp}` with the set containing IDs of all committed generations - `committed_cdc_generations`. This set is sorted by timestamps, just like `unpublished_cdc_generations`. This patch is mostly refactoring. The last generation in `committed_cdc_generations` is the equivalent of the previous `current_cdc_generation_{uuid, timestamp}`. The other generations are irrelevant for now. They will be used in the following patches. After introducing `committed_cdc_generations`, a newly committed generation is also unpublished (it was current and unpublished before the patch). We introduce `add_new_committed_cdc_generation`, which updates both sets of generations so that we don't have to call `add_committed_cdc_generation` and `add_unpublished_cdc_generation` together. It's easy to forget that both of them are necessary. Before this patch, there was no call to `add_unpublished_cdc_generation` in `topology_coordinator::build_coordinator_state`. It was a bug reported in scylladb/scylladb#17288. This patch fixes it. 
This patch also removes "the current generation" notion from the Raft-based topology. For the Raft-based topology, the current generation was the last committed generation. However, for the `cdc::metadata`, it was the generation operating now. These two generations could be different, which was confusing. For the `cdc::metadata`, the current generation is relevant as it is handled differently, but for the Raft-based topology, it isn't. Therefore, we change only the Raft-based topology. The generation called "current" is called "the last committed" from now.	2024-02-20 12:35:16 +01:00
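
The CDC clean-up rule described in the two commits above (keep the last committed generation, and clear only generations that finished operating more than 24 hours ago) can be illustrated with a minimal Python sketch. This is not ScyllaDB code: the function name `obsolete_generations`, the `(id, start_timestamp)` tuple layout, and the assumption that a generation finishes operating at the moment the next committed generation's timestamp is reached are all hypothetical choices made for illustration.

```python
from datetime import datetime, timedelta

# Retention window taken from the commit message: 24 hours.
RETENTION = timedelta(hours=24)


def obsolete_generations(committed, now):
    """Return IDs of generations whose data could be cleared.

    `committed` is a list of (generation_id, start_timestamp) pairs sorted
    by timestamp, loosely mirroring `committed_cdc_generations`.
    Assumption (not from the source): a generation finishes operating when
    the next committed generation starts operating.
    """
    obsolete = []
    # Pair every generation with its successor. The last committed
    # generation has no successor, so it is never reported as obsolete.
    for (gen_id, _start), (_next_id, next_start) in zip(committed, committed[1:]):
        finished_at = next_start
        if finished_at <= now - RETENTION:
            obsolete.append(gen_id)
    return obsolete


now = datetime(2024, 2, 20, 12, 0, 0)
committed = [
    ("g1", now - timedelta(hours=72)),
    ("g2", now - timedelta(hours=48)),   # g1 finished 48 hours ago
    ("g3", now - timedelta(hours=1)),    # g2 finished 1 hour ago
    ("g4", now + timedelta(seconds=30)), # committed, starts in the future
]
print(obsolete_generations(committed, now))
```

Under these assumptions only `g1` qualifies: `g2` stopped operating an hour ago and may still receive writes, `g3` is operating now, and `g4` has a timestamp in the future because ring-delay is non-zero. The older rule criticized in the commit message (clear anything at least 24 hours old that is not the last committed generation) would also have cleared `g2`.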


1219 Commits