scylladb

Author	SHA1	Message	Date
Avi Kivity	bd08b6e5b2	Merge 'Unify configuration of object storage endpoints (take 2)' from Pavel Emelyanov To configure S3 storage, one needs to do ``` object_storage_endpoints: - name: s3.us-east-1.amazonaws.com port: 443 https: true aws_region: us-east-1 ``` and for GCS it's ``` object_storage_endpoints: - name: https://storage.googleapis.com:433 type: gs credentials_file: <gcp account credentials json file> ``` This PR updates the S3 part to look like ``` object_storage_endpoints: - name: https://s3.us-east-1.amazonaws.com:443 aws_region: us-east-1 ``` fixes: #26570 This is 2nd attempt, previous one (#27360) was reverted because it reported endpoint configs in new format via API and CQL always, even if the endpoint was configured in the old way. This "broke" scylla manager and some dtests. This version has this bug fixed, and endpoints are reported in the same format as they were configured with. About correctness of the changes. No modifications to existing tests are made here, so old format is respected correctly (as far as it's covered by tests). To prove the new format works the the test_get_object_store_endpoints is extended to validate both options. Some preparations to this test to make this happen come on their own with the PR #28111 to show that they are valid and pass before changing the core code. Enhancing the way configuration is made, likely no need to backport. Closes scylladb/scylladb#28112 * github.com:scylladb/scylladb: test: Validate S3 endpoints new format works docs: Update docs according to new endpoints config option format object_storage: Create s3 client with "extended" endpoint name s3/storage: Tune config updating sstable: Shuffle args for s3_client_wrapper test: Rename badconf variable into objconf test: Split the object_store/test_get_object_store_endpoints test	2026-01-14 18:29:03 +02:00
Botond Dénes	551eecab63	Merge 'EAR: deprecate the replicated key provider' from Calle Wilund Refs #22733. Adds runtime warning and docs info that replicated provider is deprecated and will be removed. Fixes #27292 Closes scylladb/scylladb#27270 * github.com:scylladb/scylladb: docs::encryption: Add warning that replicated provider is deprecated ent::encryption: Switch default key provider from replicated to local replicated_key_provider: Add deprecation warning on usage	2026-01-14 13:47:23 +02:00
Botond Dénes	122b7847e5	Merge 'index: Accept view properties in CREATE INDEX' from Dawid Mędrek Problem ------- Secondary indexes are implemented via materialized views under the hood. The way an index behaves is determined by the configuration of the view. Currently, it can be modified by performing the CQL statement `ALTER MATERIALIZED VIEW` on it. However, that raises some concerns. Consider, for instance, the following scenario: 1. The user creates a secondary index on a table. 2. In parallel, the user performs writes to the base table. 3. The user modifies the underlying materialized view, e.g. by setting the `synchronous_updates` to `true` [1]. Some of the writes that happened before step 3 used the default value of the property (which is `false`). That had an actual consequence on what happened later on: the view updates were performed asynchronously. Only after step 3 had finished did it change. Unfortunately, as of now, there is no way to avoid a situation like that. Whenever the user wants to configure a secondary index they're creating, they need to do it in another schema change. Since it's not always possible to control how the database is manipulated in the meantime, it leads to problems like the one described. That's not all, though. The fact that it's not possible to configure secondary indexes is inconsistent with other schema entities. When it comes to tables or materialized views, the user always have a means to set some or even all of the properties during their creation. Solution -------- The solution to this problem is extending the `CREATE INDEX` CQL statement by view properties. The syntax is of form: ``` > CREATE INDEX <index name> > .. ON <keyspace>.<table> (<columns>) > .. WITH <properties> ``` where `<properties>` corresponds to both index-specific and view properties [2, 3]. View properties can only be used with indexes implemented with materialized views; for example, it will be impossible to create a vector index when specifying any view property (see examples below). When a view property is provided, it will be applied when creating the underlying materialized view. The behavior should be similar to how other CQL statements responsible for creating schema entities work. High-level implementation strategy ---------------------------------- 1. Make auxiliary changes. 2. Introduce data structures representing the new set of index properties: both index-specific and those corresponding to the underlying view. 3. Extend `CREATE INDEX` to accept view properties. 4. Extend `DESCRIBE INDEX` and other `DESCRIBE` statements to include view properties in their output. User documentation is also updated at the steps to reflect the corresponding changes. Implementation considerations ----------------------------- There are a number of schema properties that are now obsolete. They're accepted by other CQL statements, but they have no effect. They include: * `index_interval` * `replicate_on_write` * `populate_io_cache_on_flush` * `read_repair_chance` * `dclocal_read_repair_chance` If the user tries to create a secondary index specifying any of those keywords, the statement will fail with an appropriate error (see examples below). Unlike materialized views, we forbid specifying the clustering order when creating a secondary index [4]. This limitation may be lifted later on, but it's a detail that may or may not prove troublesome. It's better to postpone covering it to when we have a better perspective on the consequences it would bring. Examples -------- Good examples ``` > CREATE INDEX idx ON ks.t (v); > CREATE INDEX idx ON ks.t (v) WITH comment = 'ok view property'; > CREATE INDEX idx ON ks.t (v) .. WITH comment = 'multiple view properties are ok' .. AND synchronous_updates = true; > CREATE INDEX idx ON ks.t (v) .. WITH comment = 'default value ok' .. AND synchronous_updates = false; ``` Bad examples ``` > CREATE INDEX idx ON ks.t (v) WITH replicate_on_write = true; SyntaxException: Unknown property 'replicate_on_write' > CREATE INDEX idx ON ks.t (v) .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="Cannot specify options for a non-CUSTOM index" > CREATE CUSTOM INDEX idx ON ks.t (v) .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="CUSTOM index requires specifying the index class" > CREATE CUSTOM INDEX idx ON ks.t (v) .. USING 'vector_index' .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="You cannot use view properties with a vector index" > CREATE INDEX idx ON ks.t (v) WITH CLUSTERING ORDER BY (v ASC); InvalidRequest: Error from server: code=2200 [Invalid query] message="Indexes do not allow for specifying the clustering order" ``` and so on. For more examples, see the relevant tests. References: [1] https://docs.scylladb.com/manual/branch-2025.4/cql/cql-extensions.html#synchronous-materialized-views [2] https://docs.scylladb.com/manual/branch-2025.4/cql/secondary-indexes.html#create-index [3] https://docs.scylladb.com/manual/branch-2025.4/cql/mv.html#mv-options [4] https://docs.scylladb.com/manual/branch-2025.4/cql/dml/select.html#ordering-clause Fixes scylladb/scylladb#16454 Backport: not needed. This is an enhancement. Closes scylladb/scylladb#24977 * github.com:scylladb/scylladb: cql3: Extend DESC INDEX by view properties cql3: Forbid using CLUSTERING ORDER BY when creating index cql3: Extend CREATE INDEX by MV properties cql3/statements/create_index_statement: Allow for view options cql3/statements/create_index_statement: Rename member cql3/statements/index_prop_defs: Re-introduce index_prop_defs cql3/statements/property_definitions: Add extract_property() cql3/statements/index_prop_defs.cc: Add namespace cql3/statements/index_prop_defs.hh: Rename type cql3/statements/view_prop_defs.cc: Move validation logic into file cql3/statements: Introduce view_prop_defs.{hh,cc} cql3/statements/create_view_statement.cc: Move validation of ID schema/schema.hh: Do not include index_prop_defs.hh	2026-01-14 09:54:27 +02:00
Nadav Har'El	fc6fff61d1	docs/alternator: add document on reducing Alternator network costs This patch adds a new document, docs/alternator/network.md, explaining the various mechanisms that can be used to reduce network usage in Alternator. It explains compression of requests and responses, header reduction, rack-aware routing, and RPC compression. Many of these topics - especially support in the client libraries - are work in progress, so some details are still missing in the new document. Still, I think it is a good start that can be improved later. Fixes #27915. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27927	2026-01-13 14:29:01 +02:00
Pavel Emelyanov	bd225784bd	docs: Update docs according to new endpoints config option format Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:24:06 +03:00
Anna Stuchlik	8141283262	doc: add the version name to the Install ScyllaDB page Fixes https://github.com/scylladb/scylladb/issues/28021 Closes scylladb/scylladb#28022	2026-01-13 10:01:48 +02:00
Avi Kivity	66aee0fb5e	alternator: add optional listeners for proxy protocol v2 Following `954f2cbd2f`, which added proxy protocol v2 listeners for CQL, we do the same for alternator. We add two optional ports for plain and TLS-wrapped HTTP. We test each new port, that the old ports still work, and that mixing up a port with no proxy protocol and a connection with proxy protocol (or the opposite) fails. The latter serves to show that the testing strategy is valid and doesn't just pass whatever happens. We also verify that the correct addresses (and TLS mode) show up in system.clients. Closes scylladb/scylladb#27889	2026-01-13 09:59:24 +02:00
tomek7667	19313d67e3	docs/cql/ddl.rst: fix formatting of deprecated initial sub-option Closes scylladb/scylladb#26852	2026-01-13 08:55:24 +02:00
Anna Stuchlik	14cadcbc18	doc: remove references to Open Source Fixes https://github.com/scylladb/scylladb/issues/28118 Closes scylladb/scylladb#28119	2026-01-13 08:43:26 +02:00
Michał Jadwiszczak	649efd198f	docs/dev/service_levels: update docs to service levels on raft Since Scylla 6.0, service levels are manged by Raft group0. This patch updates table name used by service levels and adds a paragraph describing service levels on raft. Fixes scylladb/scylladb#18177 Closes scylladb/scylladb#26556	2026-01-13 06:49:18 +02:00
Anna Stuchlik	791ab4ed02	doc: clarify the information about SSTable version support Fixes https://github.com/scylladb/scylladb/issues/27765 Closes scylladb/scylladb#27835	2026-01-13 06:17:37 +02:00
Botond Dénes	6bcc18e5c6	erge 'test.py: integrate python tests to be executed with pytest runner' from Andrei Chekun This will move responsibility for running tests with pytest in the same manner as it was done with boost tests. From this commit, test.py is not responsible anymore for running python tests and relies completely on pytest. This is another step for unification of test execution. Convert skip_mode function to `pytest.mark` to be able to use to annotate the whole module instead of each test explicitly. NOTE: this is a breaking change. From this commit, several directories with tests will require a path to the file to launch the test. Affected directories test/alternator test/broadcast_tables test/cql test/cqlpy test/rest_api Changes only in framework, so no backport. This PR will increase the amount of the tests by 30 test, due to the fact that how test.py and pytest discover tests. test.py count a file as a test, and when skip used in suite.yaml it will exclude the tests from discovery completely. While the pytest count test funstion as a test and uses skip_mode mark and will discover the tests, but it will skip them during execution, hence the difference test.py output before PR: ```bash > ./test.py --mode=release rest_api/test_compaction_task rest_api/test_task_manager --list --no-gather-metrics ``` test.py output in this PR: ```bash > ./test.py --mode=release test/rest_api/test_compaction_task.py test/rest_api/test_task_manager.py --list rest_api/test_compaction_task.py::test_global_major_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_major_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_cleanup_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_offstrategy_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_rewrite_sstables_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_reshaping_compaction_task.release.1 rest_api/test_compaction_task.py::test_resharding_compaction_task.release.1 rest_api/test_compaction_task.py::test_regular_compaction_task.release.1 rest_api/test_compaction_task.py::test_compaction_task_abort.release.1 rest_api/test_compaction_task.py::test_major_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_cleanup_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_offstrategy_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_rewrite_sstables_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_compaction_progress[major_keyspace_compaction_task_impl_run_fail].release.1 rest_api/test_compaction_task.py::test_compaction_progress[shard_major_keyspace_compaction_task_impl_run_fail].release.1 rest_api/test_compaction_task.py::test_compaction_progress[table_major_keyspace_compaction_task_impl_run_fail].release.1 rest_api/test_task_manager.py::test_task_manager_modules.release.1 rest_api/test_task_manager.py::test_task_manager_tasks.release.1 rest_api/test_task_manager.py::test_task_manager_status_running.release.1 rest_api/test_task_manager.py::test_task_manager_status_done.release.1 rest_api/test_task_manager.py::test_task_manager_status_failed.release.1 rest_api/test_task_manager.py::test_task_manager_not_abortable.release.1 rest_api/test_task_manager.py::test_task_manager_wait.release.1 rest_api/test_task_manager.py::test_task_manager_ttl.release.1 rest_api/test_task_manager.py::test_task_manager_user_ttl.release.1 rest_api/test_task_manager.py::test_task_manager_sequence_number.release.1 rest_api/test_task_manager.py::test_task_manager_recursive_status.release.1 rest_api/test_task_manager.py::test_module_not_exists.release.1 rest_api/test_task_manager.py::test_task_folding.release.1 rest_api/test_task_manager.py::test_abort_on_unregistered_task.release.1 ``` Fixes: https://github.com/scylladb/scylladb/issues/27716 Closes scylladb/scylladb#26395 * github.com:scylladb/scylladb: test.py: fix test_vector_similarity.py docs: add directories excluded from test.py test.py: prevent file descriptors leaking test.py: capture print inside the test test.py: do not print header for collection with test.py test.py: remove not supported functionality test.py: switch of execution of several test directories by test.py runner test.py: integrate python tests to be executed with pytest runner test.py: fix test/vector_search_validator to be able to run with pytest test.py: prepare base class for migration test.py: move environment preparation to one method test.py: introduce new environment variable TESTPY_PREPARED_ENVIRONMENT	2026-01-12 14:17:19 +02:00
Marcin Maliszkiewicz	3c9f52e709	Merge 'doc: update the Web Installer instructions' from Anna Stuchlik This PR: - Replaces a fixed version name with the variable for the current version in the instructions for installing a non-default version with Web Installer. This will make using the installer more user-friendly. - Removes the instruction for Open Source from the Web Installer docs. Fixes https://github.com/scylladb/scylladb/issues/28005 Fixes https://github.com/scylladb/scylladb/issues/28079 Closes scylladb/scylladb#28046 * github.com:scylladb/scylladb: doc: remove the instruction for Open Source from the Web Installer docs doc: add the version variable to the Web Installer instructions	2026-01-12 11:10:04 +01:00
Botond Dénes	7e1c8776b7	docs: remove sstabledump and sstablemetadata These tools are deprecated and no longer shipped by ScyllaDB packages. They no longer support the latest SSTable versions and ScyllaDB-only features, like encryption and dictionary based compression. Remove them from the documentation. Closes scylladb/scylladb#27608	2026-01-09 17:31:54 +01:00
Ferenc Szili	0ede8d154b	docs: add docs for size based load balancing This patch updates the documentation for size based load balancing. Closes scylladb/scylladb#27616	2026-01-09 16:25:25 +02:00
Anna Stuchlik	396093ff60	doc: remove the instruction for Open Source from the Web Installer docs Fixes https://github.com/scylladb/scylladb/issues/28079	2026-01-09 14:07:32 +01:00
Andrei Chekun	82e81a8664	docs: add directories excluded from test.py Add new directories that are excluded from the test.py executor and will be fully managed by pytest	2026-01-09 11:59:25 +01:00
Botond Dénes	60570d7114	Merge 'topology coordinator: restrict node join/remove to preserve RF-rack validity' from Michael Litvak Allow creating materialized views and secondary indexes in a tablets keyspace only if it's RF-rack-valid, and enforce RF-rack-validity while the keyspace has views by restricting some operations: * Altering a keyspace's RF if it would make the keyspace RF-rack-invalid * Adding a node in a new rack * Removing / Decommissioning the last node in a rack Previously the config option `rf_rack_valid_keyspaces` was required for creating views. We now remove this restriction - it's not needed because we always maintain RF-rack-validity for keyspaces with views. The restrictions are relevant only for keyspaces with numerical RF. Keyspace with rack-list-based RF are always RF-rack-valid. Fixes scylladb/scylladb#23345 Fixes https://github.com/scylladb/scylladb/issues/26820 backport to relevant versions for materialized views with tablets since it depends on rf-rack validity Closes scylladb/scylladb#26354 * github.com:scylladb/scylladb: docs: update RF-rack restrictions cql3: don't apply RF-rack restrictions on vector indexes cql3: add warning when creating mv/index with tablets about rf-rack service/tablet_allocator: always allow tablet merge of tables with views locator: extend rf-rack validation for rack lists test: test rf-rack validity when creating keyspace during node ops locator: fix rf-rack validation during node join/remove test: test topology restrictions for views with tablets test: add test_topology_ops_with_rf_rack_valid topology coordinator: restrict node join/remove to preserve RF-rack validity topology coordinator: add validation to node remove locator: extend rf-rack validation functions view: change validate_view_keyspace to allow MVs if RF=Racks db: enforce rf-rack-validity for keyspaces with views replica/db: add enforce_rf_rack_validity_for_keyspace helper db: remove enforce parameter from check_rf_rack_validity test: adjust test to not break rf-rack validity	2026-01-09 10:01:23 +02:00
Anna Stuchlik	f614482e66	doc: add the patch release upgrade procedure for version 2025.4 Adds the patch upgrade guide based on previous upgrade guides. Fixes https://github.com/scylladb/scylladb/issues/27982 Closes scylladb/scylladb#27985	2026-01-08 21:55:18 +02:00
Anna Stuchlik	3f1c7c70f5	doc: remove the link to the Download Center ... from the OS support page. Fixes https://github.com/scylladb/scylladb/issues/28047 Closes scylladb/scylladb#28048	2026-01-08 21:55:18 +02:00
Anna Stuchlik	1b653166f1	doc: add the version variable to the Web Installer instructions This commit replaces a fixed version name with the variable for the current version in the instructions for installing a non-default version with Web Installer. This will make using the installer more user-friendly. Fixes https://github.com/scylladb/scylladb/issues/28005	2026-01-08 10:12:21 +01:00
Nadav Har'El	384e394ff0	Merge 'Add similarity functions to calculate similarity of given vectors' from Dawid Pawlik It should be possible to return the similarity of vectors in CQL statements following the [Cassandra compatible syntax](https://cassandra.apache.org/doc/latest/cassandra/getting-started/vector-search-quickstart.html#query-vector-data-with-cql): ``` SELECT comment, similarity_cosine(comment_vector, [0.1, 0.15, 0.3, 0.12, 0.05]) FROM cycling.comments_vs; ``` Although the calculations are slow, and we already have calculated results returned via Vector Store API, we need the functionality as it allows us to calculate similarity of vectors not stored in vector indexes. It will be needed for [quantization and rescoring](https://scylladb.atlassian.net/wiki/spaces/RND/pages/195985800/Quantization+and+Rescoring). The feature is also a nice-to-have in testing as requested many times by testing and CX teams. The optimized version utilizing already calculated distances from Vector Store without a need of rescoring will be coming soon after via https://github.com/scylladb/scylladb/pull/27991. --- The patch adds functions: - `similarity_cosine(<vector>, <vector>)`, - `similarity_euclidean(<vector>, <vector>)`, - `similarity_dot_product(<vector>, <vector>)` Where `<vector>` is either a column of type `VECTOR<FLOAT, N>` or a vector of floats literal. These functions can be called with every `SELECT` query, not only ANN vector queries as opposed to https://github.com/scylladb/scylladb/pull/25993. The similarity calculations are implemented inspired by [USearch's implementation]( `a2f1759910/include/usearch/index_plugins.hpp (L1304-L1385)`) and made compatible with [Cassandra's documentation](https://cassandra.apache.org/doc/5.0/cassandra/developing/cql/functions.html#vector-similarity-functions). That would guarantee the results in ScyllaDB are calculated using the exact same algorithms as used in Vector Store indexes. --- Fixes: SCYLLADB-88 Fixes: SCYLLADB-89 New feature, should land into 2026.1 Closes scylladb/scylladb#27524 * github.com:scylladb/scylladb: docs: add vector similarity functions documentation test/cqlpy: add similarity functions correctness tests test/cqlpy: add similarity functions invalid call tests cql3: introduce similarity functions syntax vector_similarity_fcts: introduce similarity functions vector_similarity_fcts: retrieve similarity function argument types vector_similarity_fcts: add calculating similarity between vectors	2026-01-05 18:28:10 +02:00
Anna Stuchlik	375479d96c	doc: fix the syntax of internal links Some internal links had the wrong syntax: they were formatted as external links. As a result, they redirected the user to the outdated Open Source documentation. This commit fixes that bug. Fixes https://github.com/scylladb/scylladb/issues/25899 Closes scylladb/scylladb#27905	2026-01-05 10:44:58 +01:00
Avi Kivity	0df85c8ae8	Revert "Merge 'Unify configuration of object storage endpoints' from Pavel Emelyanov" This reverts commit `1bb897c7ca`, reversing changes made to `954f2cbd2f`. It makes incompatible changes to the object storage configuration format, breaking tests [1]. It's likely that it doesn't break any production configuration, but we can't be sure. Fixes #27966 Closes scylladb/scylladb#27969	2026-01-05 08:53:41 +02:00
Dawid Pawlik	c0b06a7fc6	docs: add vector similarity functions documentation Add documentation in `functions.rst` as the CQL reference for a vector similarity functions. This includes the syntax, example usage, and prerequisites for the parameters.	2026-01-02 13:02:59 +01:00
Anna Stuchlik	624869de86	doc: remove cassandra-stress from installation instructions The cassandra-stress tool is no longer part of the default package and cannot be run in the way described. This commit removes the instruction to run cassandra-stress. Fixes https://github.com/scylladb/scylladb/issues/24994 Closes scylladb/scylladb#27726	2026-01-01 14:20:58 +02:00
Nadav Har'El	80e5860a8c	docs/alternator: document that Streams needs vnodes The current state (after PR #26836) is that Alternator tables are created by default using tablets. But due to issue #23838, Alternator Streams cannot be enabled on a table that uses tablets... An attempt to enable Streams on such a table results in a clear error: "Streams not yet supported on a table using tablets (issue #23838). If you want to use streams, create a table with vnodes by setting the tag 'system:initial_tablets' set to 'none'." But users should be able to learn this fact from the documentation - not just retroactively from an error message. This is especially important because a user might create and fill a table using tablets, and only get this error when attempting to enable Streams on the existing table - when it is too late to change anything. So this patch adds a paragraph on this to compatibility.md, where several other requirements of Alternator Streams are already mentioned. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27000	2025-12-30 10:45:34 +03:00
Avi Kivity	853f3dadda	Merge 'treewide: fix some spelling errors' from Piotr Smaron Irritated by prevailing spellchecker comments attached to every PR, I aim to fix them all. No need to backport, just cosmetic changes. Closes scylladb/scylladb#27897 * github.com:scylladb/scylladb: treewide: fix some spelling errors codespell: ignore `iif` and `tread`	2025-12-29 20:45:31 +02:00
Avi Kivity	9927c6a3d4	Merge 'Reapply "audit: enable some subset of auditing by default"' from Piotr Smaron This reverts commit a5edbc7d612df237a1dd9d46fd5cecf251ccfd13. <h3>Why re-enabling table audit</h3> Audit has been disabled (scylladb/scylla-enterprise/pull/3094) over many concerns raised against the table implementation, e.g. scylladb/scylla-enterprise/issues/2939 / scylladb/scylla-enterprise/issues/2759 + there's whole outstanding backlog of issues . One of the concerns was also a possible loss of availability, and since then we migrated audit keyspace from SimpleStrategy RF=1 to NetworkTopologyStrategy RF=3 (scylladb/scylla-enterprise/pull/3399) and stopped failing queries when auditing fails (scylladb/scylla-enterprise/pull/3118 & scylladb/scylla-enterprise/pull/3117), which improves the situation but doesn't address all the concerns. Eventually we want to use syslog as audit's sink, but it's not fully ready just yet, and so we'll restore table audit for now to increase the security, but later switch to syslog. BTW. cloud will enable table audit for AUTH category scylladb/sre-ops-automation/issues/2970 separately from this effort. <h3>Performance considerations</h3> We are assuming that the events for the enabled categories, i.e. DCL, DDL, AUTH & ADMIN, should appear at about the same, low cadence, with AUTH perhaps having the biggest impact of them all under some workloads. The performance penalty of enabling just the AUTH category [has been measured](https://scylladb.atlassian.net/wiki/spaces/RND/pages/148308005/Audit+performance+impact+test) and while authentication throughput and read/write throughput remain stable, the queries' P99 latency may decrease by a couple of % in the most hardcore scenarios. Fixes: https://github.com/scylladb/scylladb/issues/26020 Gradually re-enabling audit feature, no need to backport. Closes scylladb/scylladb#27262 * github.com:scylladb/scylladb: doc: audit: set audit as enabled by default Reapply "audit: enable some subset of auditing by default"	2025-12-29 16:41:04 +02:00
Tomasz Grabiec	bbf9ce18ef	Merge 'load_balancer: compute node load based on tablet sizes' from Ferenc Szili Currently, the tablet load balancer performs capacity based balancing by collecting the gross disk capacity of the nodes, and computes balance assuming that all tablet sizes are the same. This change introduces size-based load balancing. The load balancer does not assume identical tablet sizes any more, and computes load based on actual tablet sizes. The size-based load balancer computes the difference between the most and least loaded nodes in the balancing set (nodes in DC, or nodes in a rack in case of `rf-rack-valid-keyspaces`) and stops further balancing if this difference is bellow the config option `size_based_balance_threshold_percentage`. This config option does not apply to the absolute load, but instead to the percentage of how much the most loaded node is more loaded than the least loaded node: `delta = (most_loaded - least_loaded) / most_loaded` If this delta is smaller then the config threshold, the balancer will consider the nodes balanced. This PR is a part of a series of PRs which are based on top of each other. - First part for tablet size collection via load_stats: #26035 - Second part reconcile load_stats: #26152 - The third part for load_sketch changes: #26153 - The fourth part which performs tablet load balancing based on tablet size: #26254 - The fifth part changes the load balancing simulator: #26438 This is a new feature, backport is not needed. Fixes #26254 Closes scylladb/scylladb#26254 * github.com:scylladb/scylladb: test, load balancing: add test for table balance load_balancer: add cluster feature for size based balancing load_balancer: implement size-based load balancing config: add size based load balancing config params load_stats: use trinfo to decide how to reconcile tablet size load_sketch: use tablet sizes in load computation load_stats: add get_tablet_size_in_transition()	2025-12-29 15:01:38 +01:00
Piotr Smaron	fb4d89f789	treewide: fix some spelling errors	2025-12-29 13:53:56 +01:00
Radosław Cybulski	5e1254eef0	Update documentation	2025-12-29 08:33:08 +01:00
Ferenc Szili	10eb364821	load_balancer: implement size-based load balancing This changes introduces tablet size based load balancing. It is an extension of capacity based balancing with the addition of actual tablet sizes. It computes the difference between the most and least loaded nodes in the DC and stops further balancing if this difference is bellow the config option size_based_balance_threshold_percentage. This config option does not apply to the absolute load, but instead to the percentage of how much the most loaded node is more loaded than the least loaded node: delta = (most_loaded - least_loaded) / most_loaded If this delta is smaller then the config threshold, the balancer will consider the nodes balanced.	2025-12-27 11:20:20 +01:00
Ferenc Szili	621cb19045	load_sketch: use tablet sizes in load computation This commit changes load_sketch so that it computes node and shard load based on tablet sizes instead of tablet count.	2025-12-27 10:37:23 +01:00
Piotr Szymaniak	9c5b4e74c3	doc: Correct reference in dev/audit.md Closes scylladb/scylladb#27832	2025-12-24 15:25:15 +02:00
Nadav Har'El	8df5189f9c	Merge 'docs: scylla-sstable.rst: extract script API to separate document' from Botond Dénes The script API is 500+ lines long in an already too long and hard to navigate document. Extract it to a separate document, making both documents shorter and easier to navigate. Documentation refactoring, no backport needed. Closes scylladb/scylladb#27609 * github.com:scylladb/scylladb: docs: scylla-sstable-script-api.rst: add introduction and title docs: scylla-sstable.rst: extract script API to separate document docs: scylla-sstable: prepare for script API extract	2025-12-24 15:02:57 +02:00
Andrzej Jackowski	632ff66897	doc: audit: mention double audit sink in Enabling Audit section Configuration of both table and syslog audit is possible since scylladb/scylladb#26613 was implemented. However, the "Enabling Audit" section of the documentation wasn't updated, which can be misleading. Ref: scylladb/scylladb#26613 Closes scylladb/scylladb#27790	2025-12-24 13:20:03 +02:00
Botond Dénes	1bb897c7ca	Merge 'Unify configuration of object storage endpoints' from Pavel Emelyanov To configure S3 storage, one needs to do ``` object_storage_endpoints: - name: s3.us-east-1.amazonaws.com port: 443 https: true aws_region: us-east-1 ``` and for GCS it's ``` object_storage_endpoints: - name: https://storage.googleapis.com:433 type: gs credentials_file: <gcp account credentials json file> ``` This PR updates the S3 part to look like ``` object_storage_endpoints: - name: https://s3.us-east-1.amazonaws.com:443 aws_region: us-east-1 ``` fixes: #26570 Not-yet released feature, no need to backport. Old configs are not accepted any longer. If it's needed, then this decision needs to be revised. Closes scylladb/scylladb#27360 * github.com:scylladb/scylladb: object_storage: Temporarily handle pure endpoint addresses as endpoints code: Remove dangling mentions of s3::endpoint_config docs: Update docs according to new endpoints config option format object_storage: Create s3 client with "extended" endpoint name test: Add named constants for test_get_object_store_endpoints endpoint names s3/storage: Tune config updating sstable: Shuffle args for s3_client_wrapper	2025-12-24 06:59:02 +02:00
Anna Stuchlik	7198191aa9	doc: fix the license information on DockerHub This commit removes the OSS-related information from DockerHub. It adds the link to the Source Available license. Fixes https://github.com/scylladb/scylladb/issues/22440 Closes scylladb/scylladb#27706	2025-12-23 15:53:06 +02:00
Avi Kivity	7586c5ccbd	Merge 'system.clients: add `client_options` map column' from Vladislav Zolotarov This pull request introduces a new caching mechanism for client options in the Alternator and transport layers, refactors how client metadata is stored and accessed, and extends the `system.clients` virtual table to surface richer client information. The changes improve efficiency by deduplicating commonly used strings (like driver names/versions and client options), and ensure that client data is handled in a way that's safe for cross-shard access. Additionally, the test suite and virtual table schema are updated to reflect the new client options data. Caching and client metadata refactoring: * The largest and most repeatable items in the connection state before this PR were a `driver_name` and a `driver_version` which were stored as an `sstring` object which means that the corresponding memory consumption was 16 bytes per each such value at least (the smallest size of the `seastar`'s `sstring` object) per-connection. In reality the driver name is usually longer than 15 characters, e.g. "ScyllaDB Python Driver" is 23 characters and this is not the longest driver name there is. In such cases the actual memory usage of a corresponding `sstring` object jumps to 8 + 4 + 1 + (string length, 23 in our example) + 1. So, for "ScyllaDB Python Driver" it would be 37 bytes (in reality it would be a bit more due to natural alignment of other allocations since the `contents` size is not well aligned (13 bytes), but let's ignore this for now). * These bytes add up quickly as there are more connections and, sometimes we are talking about millions of connections per-shard. * Using a smart pointer (`lw_shared_ptr`) referencing a corresponding cached value will effectively reduce the per-connection memory usage to be 8 bytes (a size of a pointer on 64-bit CPU platform) for each such value. While storing a corresponding `sstring` value only once. * This will would reduce the "variable" (per-connection) memory usage by at least 50%. And in case of "ScyllaDB Python Driver" driver version - by 78%! * And all this for a price of a single `loading_shared_values` object per-shard (implements a hash table) and a minor overhead for each value stored in it. * Introduced a new cache type (`client_options_cache_type`) for deduplicating and sharing client option strings, and refactored `client_data`, `client_state`, and related classes to use `foreign_ptr<std::unique_ptr<client_data>>` and cached entry types for fields like driver name, driver version, and client options. (`client_data.hh`, `service/client_state.hh`, `alternator/server.hh`, `alternator/controller.hh`, `transport/controller.hh`, `transport/protocol_server.hh`) [[1]](diffhunk://#diff-664a3b19e905481bdf8eb3843fc4d34691067bb97ab11cfd6e652e74aac51d9fR33-R36) [[2]](diffhunk://#diff-664a3b19e905481bdf8eb3843fc4d34691067bb97ab11cfd6e652e74aac51d9fL40-R56) [[3]](diffhunk://#diff-daadce1a2de3667511e59558f3a8f077b5ee30a14bcc6a99d588db90d0fcd2bdL105-R107) [[4]](diffhunk://#diff-daadce1a2de3667511e59558f3a8f077b5ee30a14bcc6a99d588db90d0fcd2bdL154-R182) [[5]](diffhunk://#diff-5fce246edf5abffb2351bd02e2eb1e9850880f7a00607ccaa90c3eee7ef57c6bL91-R92) [[6]](diffhunk://#diff-5fce246edf5abffb2351bd02e2eb1e9850880f7a00607ccaa90c3eee7ef57c6bL110-R111) [[7]](diffhunk://#diff-31730ba8e7374f784a88dc27c1512291cf73b7f24e08768f7466a3c8cfcc7a1aL96-R96) [[8]](diffhunk://#diff-19a97c0247cc08155ee49b277e43859ca32d6ef8cbff0ed7368ec5fa19e0a11eL172-R172) [[9]](diffhunk://#diff-eea7e2db5d799a25e717a72ac8ce5842bd4adb72b694d38d8f47166d9cd926faL356-R356) [[10]](diffhunk://#diff-d0b4ec3a144bbc5dc993866cf0b940850a457ff6156064f7e2b4b10ad0a95fefL80-R80) [[11]](diffhunk://#diff-4293b94c444d9bd5ecd17ce7eda8c00685d35ecf6e07f844efc91a91bbe85be1L46-R48) * Updated the methods for setting and getting driver name, driver version, and client options in `client_state` to be asynchronous and use the new cache. (`service/client_state.hh`, `service/client_state.cc`) [[1]](diffhunk://#diff-daadce1a2de3667511e59558f3a8f077b5ee30a14bcc6a99d588db90d0fcd2bdL154-R182) [[2]](diffhunk://#diff-99634aae22e2573f38b4e2f050ed2ac4f8173ff27f0ae8b3609d1f0cc1aeb775R347-R362) Virtual table and API enhancements: * Extended the `system.clients` virtual table schema and implementation to include a new `client_options` column (a map of option key/value pairs), and updated the table population logic to use the new cached types and foreign pointers. (`db/virtual_tables.cc`) [[1]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1R752) [[2]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1L769-R770) [[3]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1L809-R816) [[4]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1L828-R879) API and interface changes: * Changed the signatures of `get_client_data` methods throughout the codebase to return vectors of `foreign_ptr<std::unique_ptr<client_data>>` instead of plain `client_data` objects, to ensure safe cross-shard access. (`alternator/controller.hh`, `alternator/controller.cc`, `alternator/server.hh`, `alternator/server.cc`, `transport/controller.hh`, `transport/protocol_server.hh`) [[1]](diffhunk://#diff-31730ba8e7374f784a88dc27c1512291cf73b7f24e08768f7466a3c8cfcc7a1aL96-R96) [[2]](diffhunk://#diff-19a97c0247cc08155ee49b277e43859ca32d6ef8cbff0ed7368ec5fa19e0a11eL172-R172) [[3]](diffhunk://#diff-5fce246edf5abffb2351bd02e2eb1e9850880f7a00607ccaa90c3eee7ef57c6bL110-R111) [[4]](diffhunk://#diff-a7e2cda866c03a75afcf3b087de1c1dcd2e7aa996214db67f9a11ed6451e596dL988-R995) [[5]](diffhunk://#diff-eea7e2db5d799a25e717a72ac8ce5842bd4adb72b694d38d8f47166d9cd926faL356-R356) [[6]](diffhunk://#diff-d0b4ec3a144bbc5dc993866cf0b940850a457ff6156064f7e2b4b10ad0a95fefL80-R80) [[7]](diffhunk://#diff-4293b94c444d9bd5ecd17ce7eda8c00685d35ecf6e07f844efc91a91bbe85be1L46-R48) Testing and validation: * Updated the Python test for the `system.clients` table to verify the new `client_options` column and its contents, ensuring that driver name and version are present in the options map. (`test/cqlpy/test_virtual_tables.py`) [[1]](diffhunk://#diff-6dd8bd4a6a82cd642252a29dc70726f89a46ceefb991c3e63fc67e283f323f03R79) [[2]](diffhunk://#diff-6dd8bd4a6a82cd642252a29dc70726f89a46ceefb991c3e63fc67e283f323f03R88-R90) Closes scylladb/scylladb#25746 * github.com:scylladb/scylladb: transport/server: declare a new "CLIENT_OPTIONS" option as supported service/client_state and alternator/server: use cached values for driver_name and driver_version fields system.clients: add a client_options column controller: update get_client_data to use foreign_ptr for client_data	2025-12-22 20:02:40 +02:00
Nikos Dragazis	20ff2fcc18	docs: Amend limitations for keyspace RF changes The doc about DDL statements claims that an `ALTER KEYSPACE` will fail in the presence of an ongoing global topology operation. This limitation was specifically referring to RF changes, which Scylla implements as global topology requests (`keyspace_rf_change`), and it was true when it was first introduced (`1b913dd880`) because there was no global topology request queue at that time, so only one ongoing global request was allowed in the cluster. This limitation was lifted with the introduction of the global topology request queue (`6489308ebc`), and it was re-introduced again very recently (`2e7ba1f8ce`) in a slightly different form; it now applies only to RF changes (not to any request type) and only those that affect the same keyspace. None of these two changes were ever reflected in the doc. Synchronize the doc with the current state. Fixes #27776. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> Closes scylladb/scylladb#27786	2025-12-22 20:02:40 +02:00
Anna Stuchlik	4c247a5d08	doc: document support for i8g and i8ge instances Fixes https://github.com/scylladb/scylladb/issues/27703 Closes scylladb/scylladb#27754	2025-12-22 20:02:40 +02:00
Pavel Emelyanov	e304d912b4	Merge 'db/view/view_building_worker: follow-ups' from Michał Jadwiszczak This patch consists of a few smaller follow-ups to the view building worker: - catch general execption in staging task registrator - remove unnecessary CV broadcast - don't pollute function context with conditionally compiled variable - avoid creating a copy of tasks map - fix some typos Refs https://github.com/scylladb/scylladb/issues/25929 Refs https://github.com/scylladb/scylladb/pull/26897 This PR doesn't fix any bugs but recently we're backporting some PRs to 2025.4, so let's also backport this one to avoid painful conflicts. Closes scylladb/scylladb#26558 * github.com:scylladb/scylladb: docs/dev/view-building-coordinator: fix typos db/view/view_building_worker: remove unnnecessary empty lines db/view/view_building_worker: fix typo db/view/view_building_worker: avoid creating a copy of tasks map db/view/view_building_worker: wrap conditionally compiled code in a scope db/view/view_building_worker: remove unnecessary CV broadcast db/view/view_building_worker: catch general execption in staging task registrator	2025-12-22 20:02:40 +02:00
Anna Stuchlik	9793a45288	doc: add a Vector Search page under Features This commit adds a page with an overview of Vector Search under the Features section. It includes a link to the VS documentation in ScyllaDB Cloud, as the feature is only available in ScyllaDB Cloud. The purpose of the page is to raise awareness of the feature. Fixes https://scylladb.atlassian.net/browse/VECTOR-215 Closes scylladb/scylladb#27787	2025-12-22 15:29:45 +02:00
Michael Litvak	9f8aea21e3	docs: update RF-rack restrictions Update the documentation about restrictions to tablets keyspaces related to RF-rack. * MV/SI require the keyspace to be RF-rack-valid * topology operations are restricted if a keyspace has views to preserve RF-rack-validity	2025-12-22 09:21:07 +01:00
Vlad Zolotarov	ea95cdaaec	transport/server: declare a new "CLIENT_OPTIONS" option as supported Declare support for a 'CLIENT_OPTIONS' startup key. This key is meant to be used by drivers for sending client-specific configurations like request timeouts values, retry policy configuration, etc. The value of this key can be any string in general (according to the CQL binary protocol), however, it's expected to be some structured format, e.g. JSON. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2025-12-20 12:26:22 -05:00
Anna Stuchlik	f65db4e8eb	doc: remove the links to the Download Center This commit removes the remaining links to the Download Center on the website. We no longer use it for installation, and we don't want users to infer that something like that still exists. Fixes https://github.com/scylladb/scylladb/issues/27753 Closes scylladb/scylladb#27756	2025-12-19 12:53:40 +01:00
Michael Litvak	33f7bc28da	docs: document restrictions of colocated tables Currently some things are not supported for colocated tables: it's not possible to repair a colocated table, and due to this it's also not possible to use the tombstone_gc=repair mode on a colocated table. Extend the documentation to explain what colocated tables are and document these restrictions. Fixes scylladb/scylladb#27261 Closes scylladb/scylladb#27516	2025-12-18 15:38:29 +01:00
Patryk Jędrzejczak	d5c205194b	Merge 'topology: Make removenode use left_token_ring state for global barrier' from Emil Maskovsky Make the removenode operation go through the `left_token_ring` state, similar to decommission. This ensures that when removenode completes, all nodes in the cluster are aware of the topology change through a global token metadata barrier. Previously, removenode would skip the `left_token_ring` state and go directly from `write_both_read_new` to `left` state. This meant that when the operation completed, some nodes might not yet know about the topology change, potentially causing issues with subsequent data plane requests. Key changes: - Both decommission and removenode now transition to `left_token_ring` state in the `write_both_read_new` handler - In `left_token_ring` state, only decommissioning nodes receive the shutdown RPC (removed nodes are already dead) - Updated documentation to reflect that both operations use this state This change improves consistency guarantees for removenode operations by ensuring cluster-wide awareness before completion. The change is protected by "REMOVENODE_WITH_LEFT_TOKEN_RING" feature flag to also support mixed clusters during e.g. upgrade. Fixes: scylladb/scylladb#25530 No backport: This fixes and issue found in tests. It can theoretically happen in production too, but wasn't reported in any customer issue, so a backport is not needed. Closes scylladb/scylladb#26931 * https://github.com/scylladb/scylladb: topology: make removenode use left_token_ring state for global barrier topology: allow removing nodes not having tokens features: add feature flag for removenode via left token ring	2025-12-18 09:34:38 +01:00
Emil Maskovsky	1642c686c2	topology: make removenode use left_token_ring state for global barrier Make the removenode operation go through the `left_token_ring` state, similar to decommission. This ensures that when removenode completes, all nodes in the cluster are aware of the topology change through a global token metadata barrier. Previously, removenode would skip the `left_token_ring` state and go directly from `write_both_read_new` to `left` state. This meant that when the operation completed, some nodes might not yet know about the topology change, potentially causing issues with subsequent data plane requests. Key changes: - Both decommission and removenode now transition to `left_token_ring` state in the `write_both_read_new` handler - In `left_token_ring` state, only decommissioning nodes receive the shutdown RPC (removed nodes may already be dead) - Updated documentation to reflect that both operations use this state This change improves consistency guarantees for removenode operations by ensuring cluster-wide awareness before completion. Fixes: scylladb/scylladb#25530	2025-12-17 13:31:11 +01:00

1 2 3 4 5 ...

1970 Commits