scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 03:30:49 +00:00

Author	SHA1	Message	Date
Avi Kivity	5e4941a74b	Merge '[Backport 2025.2] sstables/mx/writer: handle non-full prefix row keys' from Scylladb[bot] Although valid for compact tables, non-full (or empty) clustering key prefixes are not handled for row keys when writing sstables. Only the present components are written, consequently if the key is empty, it is omitted entirely. When parsing sstables, the parsing code unconditionally parses a full prefix. This mis-match results in parsing failures, as the parser parses part of the row content as a key resulting in a garbage key and subsequent mis-parsing of the row content and maybe even subsequent partitions. Introduce a new system table: `system.corrupt_data` and infrastructure similar to `large_data_handler`: `corrupt_data_handler` which abstracts how corrupt data is handled. The sstable writer now passes rows such corrupt keys to the corrupt data handler. This way, we avoid corrupting the sstables beyond parsing and the rows are also kept around in system.corrupt_data for later inspection and possible recovery. Add a full-stack test which checks that rows with bad keys are correctly handled. Fixes: https://github.com/scylladb/scylladb/issues/24489 The bug is present in all versions, has to be backported to all supported versions. - (cherry picked from commit `92b5fe8983`) - (cherry picked from commit `0753643606`) - (cherry picked from commit `b0d5462440`) - (cherry picked from commit `093d4f8d69`) - (cherry picked from commit `678deece88`) - (cherry picked from commit `64f8500367`) - (cherry picked from commit `b931145a26`) - (cherry picked from commit `3e1c50e9a7`) - (cherry picked from commit `46ff7f9c12`) - (cherry picked from commit `ebd9420687`) - (cherry picked from commit `aae212a87c`) - (cherry picked from commit `592ca789e2`) - (cherry picked from commit `edc2906892`) Parent PR: #24492 Closes scylladb/scylladb#24744 * github.com:scylladb/scylladb: test/boost/sstable_datafile_test: add test for corrupt data sstables/mx/writer: handler rows with empty keys test/lib/cql_assertions: introduce columns_assertions sstables: add corrupt_data_handler to sstables::sstables tools/scylla-sstable: make large_data_handler a local db: introduce corrupt_data_handler mutation: introduce frozen_mutation_fragment_v2 mutation/mutation_partition_view: read_{clustering,static}_row(): return row type mutation/mutation_partition_view: extract de-ser of {clustering,static} row idl-compiler.py: generate skip() definition for enums serializers idl: extract full_position.idl from position_in_partition.idl db/system_keyspace: add apply_mutation() db/system_keyspace: introduce the corrupt_data table	2025-07-01 12:27:01 +03:00
Anna Stuchlik	3d8368cacb	doc: remove OSS mention from the SI notes This commit removes a confusing reference to an Open Source version form the Local Secondary Indexes page. Fixes https://github.com/scylladb/scylladb/issues/24668 Closes scylladb/scylladb#24673 (cherry picked from commit `2367330513`) Closes scylladb/scylladb#24723	2025-06-30 18:53:48 +03:00
Botond Dénes	43eb3bcf91	db/system_keyspace: introduce the corrupt_data table To serve as a place to store corrupt mutation fragments. These fragments cannot be written to sstables, as they would be spread around by compaction and/or repair. They even might make parsing the sstable impossible. So they are stored in this special table instead, kept around to be inspected later and possibly restored if possible. (cherry picked from commit `92b5fe8983`)	2025-06-30 12:44:28 +00:00
Patryk Jędrzejczak	b1bfa4b115	docs: rely on the Raft-based topology being enabled In 2025.2, we don't force enabling the Raft-based topology in the code, but we stated in the upgrade guides that it's a mandatory step of the upgrade to 2025.1. We also remind users to enable the Raft-based topology in the upgrade guides to 2025.2. Hence, we can rely in the the documentation on the Raft-based topology being enabled. If it is still disabled, we can just send the user to the upgrade guides. Hence: - we remove all documentation related to enabling the Raft-based topology, enabling the Raft-based schema (enabled Raft-based topology implies enabled Raft-based schema), and the gossip-based topology, - we can replace the documentation of the old manual recovery procedure with the documentation of the new manual recovery procedure (done in the previous commit). (cherry picked from commit `203ea5d8f9`)	2025-06-26 22:18:56 +00:00
Patryk Jędrzejczak	f052af6c45	docs: handling-node-failures: document the new recovery procedure We replace the documentation of the old recovery procedure with the documentation of the new recovery procedure. We can get rid of the old procedure from the documentation because we requested users to enable the Raft-based topology during upgrades to 2025.1 and 2025.2. We leave the note that enabling the Raft-based topology is required to use the new recovery procedure just in case, since we didn't force enabling the Raft-based topology in the code. (cherry picked from commit `4e256182a0`)	2025-06-26 22:18:56 +00:00
Anna Stuchlik	b469158418	doc: improve the tablets limitations section This PR improves the Limitations and Unsupported Features section for tablets, as it has been confusing to the customers. Refs https://github.com/scylladb/scylla-enterprise/issues/5465 Fixes https://github.com/scylladb/scylladb/issues/24562 Closes scylladb/scylladb#24563 (cherry picked from commit `17eabbe712`) Closes scylladb/scylladb#24588	2025-06-24 10:06:21 +03:00
Karol Nowacki	76bd23cddd	cql, schema: Extend name length limit from 48 to 192 bytes This commit increases the maximum length of names for keyspaces, tables, materialized views, and indexes from 48 to 192 bytes. The previous 48-bytes limit was inherited from Cassandra 3 for compatibility. However, this validation was removed in Cassandra 4 and 5 (see CASSANDRA-20389) and some usage scenarios (such as some feature store workflows generating long table names) now depend on this relaxed constraint. This change brings ScyllaDB's behavior in line with modern Cassandra versions and better supports these use cases. The new limit of 192 bytes is derived from underlying filesystem limitations to prevent runtime errors when creating directories for table data. When a new table is created, ScyllaDB generates a directory for its SSTables. The directory name is constructed from the table name, a dash, and a 32-character UUID. For a CDC-enabled table, an associated log table is also created, which has the suffix `_scylla_cdc_log` appended to its name. The directory name for this log table becomes the longest possible representation. Additionally we reserve 15 bytes for future use, allowing for potential future extensions without breaking existing schemas. To guarantee that directory creation never fails due to exceeding filesystem name limits, the maximum name length is calculated as follows: 255 bytes (common filesystem limit for a path component) - 32 bytes (for the 32-character UUID string) - 1 byte (for the '-' separator) - 15 bytes (for the '_scylla_cdc_log' suffix) - 15 bytes (reserved for future use) ---------- = 192 bytes (Maximum allowed name length) This calculation is similar in principle to the one proposed for Cassandra to fix related directory creation failures (see apache/cassandra/pull/4038). This patch also updates/adds all associated tests to validate the new 192-byte limit. The documentation has been updated accordingly. (cherry picked from commit `4577c66a04`)	2025-06-22 17:38:30 +00:00
Anna Stuchlik	01d3b504d1	doc: add support for z3 GCP This commit adds support for z3-highmem-highlssd instance types to Cloud Instance Recommendations for GCP. Fixes https://github.com/scylladb/scylladb/issues/24511 Closes scylladb/scylladb#24533 (cherry picked from commit `648d8caf27`) Closes scylladb/scylladb#24545	2025-06-17 23:40:47 +03:00
Anna Stuchlik	baa2592299	doc: remove the limitation for disabling CDC This commit removes the instruction to stop all writes before disabling CDC with ALTER. Fixes https://github.com/scylladb/scylla-docs/issues/4020 Closes scylladb/scylladb#24406 (cherry picked from commit `b0ced64c88`) Closes scylladb/scylladb#24476	2025-06-13 14:07:38 +03:00
Robert Bindar	a926cba476	Add support for nodetool refresh --skip-reshape This patch adds the new option in nodetool, patches the load_new_ss_tables REST request with a new parameter and skips the reshape step in refresh if this flag is passed. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com> Closes scylladb/scylladb#24409 Fixes: #24365 (cherry picked from commit `ca1a9c8d01`) Closes scylladb/scylladb#24472	2025-06-13 14:06:19 +03:00
Anna Stuchlik	4ebae7ae62	doc: add the upgrade guide from 2025.1 to 2025.2 This commit adds the upgrade guide from version 2025.1 to 2025.2. Also, it removes the upgrade guides existing for the previous version that are irrelevant in 2025.2 (upgrade from OSS 6.2 and Enterprise 2024.x). Note that the new guide does not include the "Enable Consistent Topology Updates" page, as users upgrading to 2025.2 have consistent topology updates already enabled. Fixes https://github.com/scylladb/scylladb/issues/24133 Fixes https://github.com/scylladb/scylladb/issues/24265 Closes scylladb/scylladb#24266 (cherry picked from commit `8b989d7fb1`) Closes scylladb/scylladb#24391	2025-06-06 08:48:31 +03:00
Pavel Emelyanov	024af57bd5	nodetool: Add refresh --skip-cleanup option The option "conflicts" with load-and-stream. Tests and doc included. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> (cherry picked from commit `c0796244bb`)	2025-06-05 17:52:13 +03:00
Robert Bindar	b62264e1d9	Add nodetool refresh --scope option This change adds the --scope option to nodetool refresh. Like in the case of nodetool restore, you can pass either of: * node - On the local node. * rack - On the local rack. * dc - In the datacenter (DC) where the local node lives. * all (default) - Everywhere across the cluster. as scope. The feature is based on the existing load_and_stream paths, so it requires passing --load-and-stream to the refresh command. Also, it is not compatible with the --primary-replica-only option. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com> Closes scylladb/scylladb#23861 (cherry picked from commit `c570941692`)	2025-06-04 11:59:17 +03:00
Anna Stuchlik	12596a8eca	doc: add OS support for ScyllaDB 2025.2 This commit adds the information about support for platforms in ScyllaDB version 20252. Fixes https://github.com/scylladb/scylladb/issues/24180 Closes scylladb/scylladb#24263 (cherry picked from commit `28cb5a1e02`) Closes scylladb/scylladb#24335	2025-06-03 10:07:28 +03:00
Anna Stuchlik	be3f50b658	doc: update migration tools overview This commit updates the migration overview page: - It removes the info about migration from SSTable to CQL. - It updates the link to the migrator docs. Fixes https://github.com/scylladb/scylladb/issues/24247 Refs https://github.com/scylladb/scylladb/pull/21775 Closes scylladb/scylladb#24258 (cherry picked from commit `b197d1a617`) Closes scylladb/scylladb#24282	2025-06-03 10:06:42 +03:00
Anna Stuchlik	cc299e335d	doc: remove copyright from Cassandra Stress This commit removes the Apache copyright note from the Cassandra Stress page. It's a follow up to https://github.com/scylladb/scylladb/pull/21723, which missed that update (see https://github.com/scylladb/scylladb/pull/21723#discussion_r1944357143). Cassandra Stress is a separate tool with separate repo with the docs, so the copyright information on the page is incorrect. Fixes https://github.com/scylladb/scylladb/issues/23240 Closes scylladb/scylladb#24219 (cherry picked from commit `d303edbc39`) Closes scylladb/scylladb#24256	2025-06-02 14:41:34 +03:00
David Garcia	a7b34a54bc	docs: fix \t (tab) is not rendered correctly Closes scylladb/scylladb#24096 (cherry picked from commit `bf9534e2b5`) Closes scylladb/scylladb#24257	2025-06-02 14:40:54 +03:00
Anna Stuchlik	20602b6a8b	doc: clarify RF increase issues for tablets vs. vnodes This commit updates the guidelines for increasing the Replication Factor depending on whether tablets are enabled or disabled. To present it in a clear way, I've reorganized the page. Fixes https://github.com/scylladb/scylladb/issues/23667 Closes scylladb/scylladb#24221 (cherry picked from commit `efce03ef43`) Closes scylladb/scylladb#24284	2025-05-30 15:16:17 +03:00
Anna Stuchlik	70d9352cec	doc: remove the redundant pages This commit removes two redundant pages and adds the related redirections. - The Tutorials page is a duplicate and is not maintained anymore. Having it in the docs hurts the SEO of the up-to-date Tutorias page. - The Contributing page is not helpful. Contributions-related information should be maintained in the project README file. Fixes https://github.com/scylladb/scylladb/issues/17279 Fixes https://github.com/scylladb/scylladb/issues/24060 Closes scylladb/scylladb#24090 (cherry picked from commit `eed8373b77`) Closes scylladb/scylladb#24220	2025-05-26 10:30:03 +03:00
Anna Stuchlik	ab8d50b5e7	doc: fix the product name for version 2025.1 Starting with 2025.1, ScyllaDB versions are no longer called "Enterprise", but the OS support page still uses that label. This commit fixes that by replacing "Enterprise" with "ScyllaDB". This update is required since we've removed "Enterprise" from everywhere else, including the commands, so having it here is confusing. Fixes https://github.com/scylladb/scylladb/issues/24179 Closes scylladb/scylladb#24181 (cherry picked from commit `2d7db0867c`) Closes scylladb/scylladb#24204	2025-05-19 12:03:35 +03:00
David Garcia	b1ee0e2a6a	docs: fix AttributeError with 'myst_enable_extensions' in publication workflow Rolled back some dependencies in `poetry.lock` to previous versions while we investigate how to make the extension `sphinx_scylladb_markdown` compatible with the latest versions. This should fix the error in https://github.com/scylladb/scylladb/actions/runs/14708656912/job/41275115239, which currently prevents publishing new versions of https://opensource.docs.scylladb.com/ Closes scylladb/scylladb#23969	2025-05-06 16:33:00 +03:00
Anna Stuchlik	851a433663	doc: add a link to the previous Enterprise documentation This commit adds a link to the docs for previous Enterprise versions at https://enterprise.docs.scylladb.com/ to the left menu. As we still support versions 2024.1 and 2024.2, we need to ensure easier access to those docs sets. Fixes https://github.com/scylladb/scylladb/issues/23870 Closes scylladb/scylladb#23945	2025-05-05 12:16:47 +03:00
David Garcia	4ba7182515	docs: fix md redirections for multiversion support This change resolves an issue where selecting a version from the multiversion dropdown on Markdown pages (e.g. https://docs.scylladb.com/manual/stable/alternator/getting-started.html) incorrectly redirected users to the main page instead of the corresponding versioned page. The underlying cause was that the `multiversion` extension relies on `source_suffix` to identify available pages for URL mapping. Without this configuration, proper redirection fails for `.md` files. This fix should be backported to `2025.1` to ensure correct behavior. Otherwise, the fix will only take effect in future releases. Testing locally is non-trivial: clone the repository, apply the changes to each relevant branch, set `smv_remote_whitelist` to "", then run `make multiversionpreview`. Afterward, switch between versions in the dropdown to verify behavior. I've tested it locally, so the best next step is to merge and confirm that it works as expected in the live environment. Closes scylladb/scylladb#23957	2025-05-05 10:39:39 +03:00
David Garcia	cf7d846b9e	docs: update dependencies This is a mandatory dependency update to resolve a critical Dependabot alert. For more details, see the [Dependabot alerts](https://docs.github.com/en/code-security/dependabot/dependabot-alerts/viewing-and-updating-dependabot-alerts). Closes scylladb/scylladb#23918 Fixes #23935	2025-04-27 18:45:11 +03:00
Pavel Emelyanov	eb5b52f598	Merge 'main: make DC and rack immutable after bootstrap' from Piotr Dulikowski Changing DC or rack on a node which was already bootstrapped is, in case of vnodes, very unsafe (almost guaranteed to cause data loss or unavailability), and is outright not supported if the cluster has a tablet-backed keyspaces. Moreover, the possibility of doing that makes it impossible to uphold some of the invariants promised by the RF-rack-valid flag, which is eventually going to become unconditionally enabled. Get rid of the above problems by removing the possibility of changing the DC / rack of a node. A node will now fail to start if its snitch reports a different DC or rack than the one that was reported during the first boot. Fixes: scylladb/scylladb#23278 Fixes: scylladb/scylladb#22869 Marking for backport to 2025.1, as this is a necessary part of the RF-rack-valid saga Closes scylladb/scylladb#23800 * github.com:scylladb/scylladb: doc: changing topology when changing snitches is no longer supported test: cluster: introduce test_no_dc_rack_change storage_service: don't update DC/rack in update_topology_with_local_metadata main: make dc and rack immutable after bootstrap test: cluster: remove test_snitch_change	2025-04-21 15:52:55 +03:00
Piotr Dulikowski	325a89638c	doc: changing topology when changing snitches is no longer supported Update the "How to Switch Snitches" document to indicate that changing topology (i.e. changing node's DC or rack) while changing the snitch is no longer supported. Remove a note which said that switching snitches is not supported with tablets. It was introduced because of the concern that switching a snitch might change DC or rack of the node, for which our current tablet load balancer is completely unprepated. Now that changing DC/rack is forbidden, there doesn't seem to be anything related to snitches which could cause trouble for tablets.	2025-04-17 16:22:58 +02:00
Anna Stuchlik	0b4740f3d7	doc: add info about Scylla Doctor Automation to the docs Fixes https://github.com/scylladb/scylladb/issues/23642 Closes scylladb/scylladb#23745	2025-04-16 11:44:35 +03:00
Kefu Chai	3e3f583b84	docs/dev/tombstone.md: fix a typo s/alwas/always/ Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#23734	2025-04-15 10:54:42 +03:00
Avi Kivity	5e1cf90a51	build: replace tools/java submodule with packaged cassandra-stress We no longer use tools/java (scylladb/scylla-tools-java.git) for nodetool or cqlsh; only cassandra-stress. Since that is available in package form install that and excise the tools/java submodule from the source tree. pgo/ is adjusted to use the packaged cassandra-stress (and the cqlsh submodule). A few jmx references are dropped as well. Frozen toolchain regenerated. Optimized clang from https://devpkg.scylladb.com/clang/clang-19.1.7-Fedora-41-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-19.1.7-Fedora-41-x86_64.tar.gz Closes scylladb/scylladb#23698	2025-04-15 10:11:28 +03:00
Karol Baryła	df64985a4e	Docs: Describe driver issue with tablet RF increase Current protocol extension that sends tablet info to drivers only does that if the driver selects a non-replica coordinator for a routable request. It works well if some node on the replica list is replaced by other node, or if some replicas are removed from the list. Driver will at some point send a request to stale replica, and receive new list in response. The issue is with extending the list with new replicas. In that case old replicas are all still correct, so driver will not select any wrong replica, and will not receive the new list. As far as I know that only scenario where this could happen is RF increase. It could be to some degree worked around in the drivers, but it would add significant complexity (definitely more than any other invalidations we introduced) while still not being ideal solution. This scenario should be rare enough, and the consequences of not handling it minor enough (new replicas not being used as coordinators) that it does not warrant driver-side solution. Instead this commit adds info about this to documentation, advising users to restart applications after replica lists are extended. It is worth noting that if new tablet feedback protocol extension is implemented then this problem goes away. See issue #21664. Closes scylladb/scylladb#23447	2025-04-11 13:48:40 +02:00
David Garcia	cf11d5eb69	fix: openapi not rendering in docs.scylladb.com/manual Closes scylladb/scylladb#23686	2025-04-10 17:47:58 +03:00
Avi Kivity	9559e53f55	Merge 'Adjust tablet-mon.py for capacity-aware load balancing' from Tomasz Grabiec After load-balancer was made capacity-aware it no longer equalizes tablet count per shard, but rather utilization of shard's storage. This makes the old presentation mode not useful in assessing whether balance was reached, since nodes with less capacity will get fewer tablets when in balanced state. This PR adds a new default presentation mode which scales tablet size by its storage utilization so that tablets which have equal shard utilization take equal space on the graph. To facilitate that, a new virtual table was added: system.load_per_node, which allows the tool to learn about load balancer's view on per-node capacity. It can also serve as a debugging interface to get a view of current balance according to the load-balancer. Closes scylladb/scylladb#23584 * github.com:scylladb/scylladb: tablet-mon.py: Add presentation mode which scales tablet size by its storage utilization tablet-mon.py: Center tablet id text properly in the vertical axis tablet-mon.py: Show migration stage tag in table mode only when migrating virtual-tables: Introduce system.load_per_node virtual_tables: memtable_filling_virtual_table: Propagate permit to execute() docs: virtual-tables: Fix instructions service: tablets: Keep load_stats inside tablet_allocator	2025-04-10 14:59:08 +03:00
Tomasz Grabiec	b5211cca85	Merge 'tablets: rebuild: use repair for tablet rebuild' from Aleksandra Martyniuk Currently, when we rebuild a tablet, we stream data from all replicas. This creates a lot of redundancy, wastes bandwidth and CPU resources. In this series, we split the streaming stage of tablet rebuild into two phases: first we stream tablet's data from only one replica and then repair the tablet. Fixes: https://github.com/scylladb/scylladb/issues/17174. Needs backport to 2025.1 to prevent out of space during streaming Closes scylladb/scylladb#23187 * github.com:scylladb/scylladb: test: add test for rebuild with repair locator: service: move to rebuild_v2 transition if cluster is upgraded locator: service: add transition to rebuild_repair stage for rebuild_v2 locator: service: add rebuild_repair tablet transition stage locator: add maybe_get_primary_replica locator: service: add rebuild_v2 tablet transition kind gms: add REPAIR_BASED_TABLET_REBUILD cluster feature	2025-04-09 21:35:37 +02:00
Tomasz Grabiec	0b9a75d7b6	virtual-tables: Introduce system.load_per_node Can be used to query per-node stats about load as seen by the load balancer. In particular, node's capacity will be used by tablet-mon.py to scale tablet columns so that equal height is equal node utilization.	2025-04-09 20:21:51 +02:00
Tomasz Grabiec	34beaa30b5	docs: virtual-tables: Fix instructions	2025-04-09 20:21:51 +02:00
Botond Dénes	b65a76ab6f	Merge 'nodetool: cluster repair: add a command to repair tablet keyspaces' from Aleksandra Martyniuk Add a new nodetool cluster super-command. Add nodetool cluster repair command to repair tablet keyspaces. It uses the new /storage_service/tablets/repair API. The nodetool cluster repair command allows you to specify the keyspace and tables to be repaired. A cluster repair of many tables will request /storage_service/tablets/repair and wait for the result synchronously for each table. The nodetool repair command, which was previously used to repair keyspaces of any type, now repairs only vnode keyspaces. Fixes: https://github.com/scylladb/scylladb/issues/22409. Needs backport to 2025.1 that introduces the new tablet repair API Closes scylladb/scylladb#22905 * github.com:scylladb/scylladb: docs: nodetool: update repair and add tablet-repair docs test: nodetool: add tests for cluster repair command nodetool: add cluster repair command nodetool: repair: extract getting hosts and dcs to functions nodetool: repair: warn about repairing tablet keyspaces nodetool: repair: move keyspace_uses_tablets function	2025-04-09 08:20:34 +03:00
Botond Dénes	583a813d17	docs/dev/tombstone.md: fix link to ddl.html Closes scylladb/scylladb#23622	2025-04-08 16:18:50 +03:00
Anna Stuchlik	93a7b3ac1d	doc: add enabling consistent topology updates to the 2025.1 upgrade guide-from-2024 This commit adds the procedure to enable consistent topology updates for upgrades from 2024.1 to 2025.1 (or from 2024.2 to 2025.1 if the feature wasn't enabled after upgrading from 2024.1 to 2024.2). Fixes https://github.com/scylladb/scylladb/issues/23650 Closes scylladb/scylladb#23651	2025-04-08 15:38:00 +03:00
Aleksandra Martyniuk	eb17af6143	locator: service: add transition to rebuild_repair stage for rebuild_v2 Modify write_both_read_old and streaming stages in rebuild_v2 transition kind: write_both_read_old moves to rebuild_repair stage and streaming stage streams data only from one replica.	2025-04-08 10:42:02 +02:00
Aleksandra Martyniuk	ed7b8bb787	locator: service: add rebuild_v2 tablet transition kind Currently, in the streaming stage of rebuild tablet transition, we stream tablet data from all replicas. This patch series splits the streaming stage into two phases: - repair phase, where we repair the tablet; - streaming phase, where we stream tablet data from one replica. To differentiate the two streaming methods, a new tablet transition kind - rebuild_v2 - is added. The transtions and stages for rebuild_v2 transition kind will be added in the following patches.	2025-04-08 10:42:01 +02:00
Aleksandra Martyniuk	9769d7a564	docs: nodetool: update repair and add tablet-repair docs	2025-04-08 09:13:14 +02:00
dependabot[bot]	a899cae158	build(deps): bump sphinx-scylladb-theme from 1.8.5 to 1.8.6 in /docs Bumps [sphinx-scylladb-theme](https://github.com/scylladb/sphinx-scylladb-theme) from 1.8.5 to 1.8.6. - [Release notes](https://github.com/scylladb/sphinx-scylladb-theme/releases) - [Commits](https://github.com/scylladb/sphinx-scylladb-theme/compare/1.8.5...1.8.6) --- updated-dependencies: - dependency-name: sphinx-scylladb-theme dependency-version: 1.8.6 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Closes scylladb/scylladb#23537	2025-04-07 13:42:19 +03:00
Botond Dénes	fcdae20fd1	Merge 'Add tablet enforcing option' from Benny Halevy This series add a new config option: `tablets_mode_for_new_keyspaces` that replaces the existing `enable_tablets` option. It can be set to the following values: disabled: New keyspaces use vnodes by default, unless enabled by the tablets={'enabled':true} option enabled: New keyspaces use tablets by default, unless disabled by the tablets={'disabled':true} option enforced: New keyspaces must use tablets. Tablets cannot be disabled using the CREATE KEYSPACE option `tablets_mode_for_new_keyspaces=disabled` or `tablets_mode_for_new_keyspaces=enabled` control whether tablets are disabled or enabled by default for new keyspaces, respectively. In either cases, tablets can be opted-in or out using the `tablets={'enabled':...}` keyspace option, when the keyspace is created. `tablets_mode_for_new_keyspaces=enforced` enables tablets by default for new keyspaces, like `tablets_mode_for_new_keyspaces=enabled`. However, it does not allow to opt-out when creating new keyspaces by setting `tablets = {'enabled': false}` Refs scylladb/scylla-enterprise#4355 * Requires backport to 2025.1 Closes scylladb/scylladb#22273 * github.com:scylladb/scylladb: boost/tablets_test: verify failure to create keyspace with tablets and non network replication strategy tablets: enforce tablets using tablets_mode_for_new_keyspaces=enforced config option db/config: add tablets_mode_for_new_keyspaces option	2025-04-03 16:32:19 +03:00
Radosław Cybulski	c36614e16d	alternator: add size check to BatchItemWrite Add a size check for BatchItemWrite command - if the item count is bigger than configuration value `alternator_maximum_batch_write_size`, an error will be raised and no modification will happen. This is done to synchronize with DynamoDB, where maximum size of BatchItemWrite is 25. To avoid complaints from clients, who use our feature of BatchWriteItem being limitless we set default value to 100. Fixes #5057 Closes scylladb/scylladb#23232	2025-04-02 14:48:00 +03:00
Botond Dénes	3bad46a6e2	docs/dev: add tombstone.md An exhaustive document on the tombstone related internal logic as well as the user-facing aspects. Closes scylladb/scylladb#23454	2025-04-01 20:17:57 +03:00
Pavel Emelyanov	2ee9cec1d3	Merge 'Remove object_storage.yaml and move the endpoints to scylla.yaml' from Robert Bindar Move `object_storage.yaml` endpoints to `scylla.yaml` This change also removes the `object_storage.yaml` file altogether and adds tests for fetching the endpoints via the `v2/config/object_storage_endpoints` REST api. Also, `object_storage_config_file` options is moved to a deprecated state as it's no longer needed. This PR depends on #22951, the reviewers should review patch 393e1ac0ec066475ca94094265a5f88dbbdb1a1f Refs https://github.com/scylladb/scylladb/issues/22428 Closes scylladb/scylladb#22952 * github.com:scylladb/scylladb: Remove db::config::object_storage_config Move `object_storage.yaml` endpoints to `scylla.yaml`	2025-04-01 16:01:44 +03:00
Avi Kivity	69684e16d8	Merge 'sstables: add SSTable compression with shared dictionaries ' from Michał Chojnowski This PR extends Scylla's SSTable compression with the ability to use compression dictionaries shared across compression chunks. This involves several changes: - We refactor `compression_parameters` and friends (`compressor`, `sstables::local_compression`, `sstables::compression`) to prepare for making the construction of `compressor`s asynchronous, to enable sharing pieces of compressors (the dictionaries) across shards. - We introduce the notion of "hidden compression options" which are written to `CompressionInfo.db` and used to construct decompressors, like regular options, but don't appear in the schema. (We later stuff the SSTable's dictionary into `CompressionInfo.db` using a sequence of such options). - We add a cluster feature which guards the creation of dictionary-compressed SSTables. - We introduce a central "compressor factory" (one instance shared by all shards), which from this point onward is used to construct all `compressor` objects (one per SSTable) used to process the SSTables. When constructing a compressor for writing, it uses the "current"/"recommended" dictionary (which is passed to the factory from the actively-observed contents of the group0-managed `system.dicts`). When constructing a compressor for reading, it uses the dictionary written in the hidden compression options in CompressionInfo.db. And it keeps dictionaries deduplicated, so that each unique live dictionary blob has only one instance in memory, shared across shards. - We teach the relevant `lz4` and `zstd` compressor wrappers about the dictionaries. - We add a HTTP API call which samples pieces of the given table (i.e. the Data.db files) from across the cluster, trains a dictionary on it, and publishes it via `system.dicts` as the new current dictionary for that table. (And we add some RPC verbs to support that). - We add a HTTP API call which estimates the impact of various available compression configurations on the compression ratio. - We add an autotrainer fiber which periodically retrains dicts for dict-aware tables and publishes them if they seem to be a significant improvement. Known imperfections: - The factory currently keeps one dictionary instance on the entire node, but we probably want one copy per NUMA node. I didn't do that because exposing NUMA knowledge to Scylla seems to require some changes in Seastar first. New feature, no backporting involved. Closes scylladb/scylladb#23025 * github.com:scylladb/scylladb: docs: add user-facing documentation for SSTable compression with shared dicts docs/dev: add sstable-compression-dicts.md test: add test_sstable_compression_dictionaries_autotrain.py test: add test_sstable_compression_dictionaries_basic.py test/pylib/rest_client: add `keyspace_upgrade_sstables` helper main: run a sstable_dict_autotrainer api: add the estimate_compression_ratios API call dict_autotrainer: introduce sstable_dict_autotrainer db/system_keyspace: add query_dict_timestamp compress: add ZstdWithDictsCompressor and LZ4WithDictsCompressor main: clean up sstable compression dicts after table drops sstables/compress: discard hidden compression options after the decompressor is created compress: change compressor_ptr from shared_ptr to unique_ptr api: add the retrain_dict API call storage_service: add some dict-related routines main: in compression_dict_updated_callback, recognize and use SSTable compression dicts storage_service: add do_sample_sstables() messaging_service: add SAMPLE_SSTABLES and ESTIMATE_SSTABLE_VOLUME verbs db/system_keyspace: let `system.dicts` helpers be used for dicts other than the RPC compression dict raft/group0_state_machine: on `system.dicts` mutations, pass the affected partitition keys to the callback database: add sample_data_files() database: add take_sstable_set_snapshot() compress: teach `lz4_processor` about dictionaries compress: teach `zstd_processor` about dictionaries sstables: delegate compressor creation to the compressor factory sstables: plug an `sstable_compressor_factory` into `sstables_manager` sstables: introduce sstable_compressor_factory utils/hashers: add get_sha256() gms/feature_service: add the SSTABLE_COMPRESSION_DICTS cluster feature compress: add hidden dictionary options compress: remove `compression_parameters::get_compressor()` sstables/compress: remove get_sstable_compressor() sstables/compress: move ownership of `compressor` to `sstable::compression` compress: remove compressor::option_names() compress: clean up the constructor of zstd_processor compress: squash zstd.cc into compress.cc sstables/compress: break the dependency of `compression_parameters` on `compressor` compress.hh: switch compressor::name() from an instance member to a virtual call bytes: adapt fmt_hex to std::span<const std::byte>	2025-04-01 12:47:34 +03:00
David Garcia	6e61fc323b	docs: redirect to docs.scylladb.com/manual/ Define a custom alert to redirect users to the latest version of the docs in https://docs.scylladb.com/manual/ Closes scylladb/scylladb#22636	2025-04-01 09:22:56 +03:00
Michał Chojnowski	36be9d1c9b	docs: add user-facing documentation for SSTable compression with shared dicts	2025-04-01 00:07:31 +02:00
Michał Chojnowski	d33ffb221b	docs/dev: add sstable-compression-dicts.md	2025-04-01 00:07:31 +02:00

1 2 3 4 5 ...

1649 Commits