scylladb

Author	SHA1	Message	Date
Nadav Har'El	15c252fd8f	Merge 'docs: Update documentation on CREATE ROLE WITH HASHED PASSWORD' from Dawid Mędrek As part of #18750, we added a CQL statement CREATE ROLE WITH SALTED HASH that prevented hashing a password when creating a role, effectively leading to inserting a hash given by the user directly into the database. In #21350, we noticed that Cassandra had implemented a CQL statement of similar semantics but different syntax. We decided to rename Scylla's statement to be compatible with Cassandra. Unfortunately, we didn't notice one more difference between what we had in Scylla and what was part of Cassandra. Scylla's statement was originally supposed to only be used when restoring the schema and the user needn't have to be aware of its existence at all: the database produced a sequence of CQL statements that the user saved to a file and when a need to restore the schema arose, they would execute the contents of the file. That's why that although we documented the feature, it was only done in the necessary places. Those that weren't related to the backup & restore procedure were deliberately skipped. Cassandra, on the other hand, added the statement for a different purpose (for details, see the relevant issue) and it was supposed to be used by the user by design. The statement is also documented as such. Since we want to preserve compatibility with Cassandra, we document the statement and its semantics in the user documentation, explicitly implying that it can be used by the user. We also add a test verifying that logging in works correctly. Fixes scylladb/scylladb#21691 Backport: not needed. The relevant code didn't make it to 6.2 or any previous version of OSS. Closes scylladb/scylladb#21752 * github.com:scylladb/scylladb: docs: Update documentation on CREATE ROLE WITH HASHED PASSWORD test/boost: Add test for creating roles with hashed passwords	2025-01-14 15:33:30 +02:00
Kefu Chai	f8885a4afd	dist/docker,docs: replace "--experimental" with "--experimental-features" The "--experimental" option was removed in commit `f6cca741ea`. Using this deprecated option now causes Scylla to fail with the error: ``` error: the argument ('on') for option '--experimental-features' is invalid ``` So, in this change, let's update the docker entry point script to use `--experimental-features` command line option instead. The related document is updated accordingly. Fixes scylladb/scylladb#22207 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22283	2025-01-14 07:56:38 -05:00
Geoff Montee	25e8478051	docs: rest.rst: use latest docker tag to view Swagger UI for REST API Closes scylladb/scylladb#21681	2025-01-14 07:56:38 -05:00
Botond Dénes	686a997c04	Merge 'Complete implementation of configuring IO bandwidth limits' from Pavel Emelyanov In Scylla there are two options that control IO bandwidth limit -- the /storage_service/(compaction\|stream)_throughput REST API endpoints. The endpoints are partially implemented and have no counterparts in the nodetool. This set implements the missing bits and adds tests for new functionality. Closes scylladb/scylladb#21877 * github.com:scylladb/scylladb: nodetool: Implement [gs]etstreamthroughput commands nodetool: Implement [gs]etcompationthroughput commands test: Add validation of how IO-updating endpoints work api: Implement /storage_service/(stream\|compaction)_throughput endpoints api: Disqualify const config reference api: Implement /storage_service/stream_throughput endpoint api: Move stream throughput set/get endpoints from storage service block api: Move set_compaction_throughput_mb_per_sec to config block util: Include fmt/ranges.h in config_file.hh	2025-01-14 07:56:38 -05:00
Geoff Montee	c8ca2bd212	docs: operating-scylla/admin-tools/virtual-tables.rst: fix link to virtual tables Closes scylladb/scylladb#22198	2025-01-14 08:45:49 +02:00
Botond Dénes	f899f0e411	tools/scylla-sstable: dump-statistics: fix handling of {min,max}_column_names Said fields in statistics are of type `disk_array<uint32_t, disk_string<uint16_t>>` and currently are handled as array of regular strings. However these fields store exploded clustering keys, so the elements store binary data and converting to string can yield invalid UTF-8 characters that certain JSON parsers (jq, or python's json) can choke on. Fix this by treating them as binary and using `to_hex()` to convert them to string. This requires some massaging of the json_dumper: passing field offset to all visit() methods and using a caller-provided disk-string to sstring converter to convert disk strings to sstring, so in the case of statistics, these fields can be intercepted and properly handled. While at it, the type of these fields is also fixed in the documentation. Before: "min_column_names": [ "��Z��\u0011�\u0012ŷ4^��<", "�2y\u0000�}\u007f" ], "max_column_names": [ "��Z��\u0011�\u0012ŷ4^��<", "}��B\u0019l%^" ], After: "min_column_names": [ "9dd55a92bc8811ef12c5b7345eadf73c", "80327900e2827d7f" ], "max_column_names": [ "9dd55a92bc8811ef12c5b7345eadf73c", "7df79242196c255e" ], Fixes: #22078 Closes scylladb/scylladb#22225	2025-01-13 09:19:04 +03:00
Avi Kivity	814942505f	Merge 'Introduce Encryption-at-Rest (EAR) for sstables and commitlog' from Calle Wilund Fixes https://github.com/scylladb/scylla-enterprise/issues/5016#issuecomment-2558464631 EAR - encryption at rest. Allows on-disk file encryption of sstables and commitlog data. Introduces OpenSSL based file level encrypted storage, managed via a set of providers ranging from local files to cloud KMS providers. For a more comprehensive explanation, see the included docs (or if possible, original source tree). Manual bulk merge of EAR feature from enterprise repo to main scylla repo. Breaks some features apart, but main EAR is still a humongous commit, because to separate this I would have to mess with code incrementally, adding time and risk. This PR includes the local file gen tool, tests and also p11 validation. Note: CI will not execute the full tests unless master CI is set to provide the same environment as the enterprise one. Not sure about the status of this ATM. Note: Includes code to compile against cryptsoft kmipc SDK, but not the SDK. If you happen to check out this tree in the scylla folder and configure, it will be linked against and KMIP functionality will be enabled, otherwise not. Closes scylladb/scylladb#22233 * github.com:scylladb/scylladb: docs: Add EAR docs main/build: Add p11-kit and initialize tools: Add local-file-key-generator tool tests: Add EAR tests tmpdir: shorten test tempdir path EAR: port the ear feature from enterprise cql_test_env: Add optional query timeout schema/migration_manager: Add schema validate sstables: add get_shared_components accessor config/config_file: Add exports and definitions of config_type_for<>	2025-01-12 16:10:46 +02:00
Piotr Smaron	288f9b2b15	Introduce LDAP role manager & saslauthd authenticator This PR extends authentication with 2 mechanisms: - a new role_manager subclass, which allows managing users via LDAP server, - a new authenticator, which delegates plaintext authentication to a running saslauthd daemon. The features have been ported from the enterprise repository with their test.py tests and the documentation as part of changing license to source available. Fixes: scylladb/scylla-enterprise#5000 Fixes: scylladb/scylla-enterprise#5001 Closes scylladb/scylladb#22030	2025-01-12 14:50:29 +02:00
Calle Wilund	8e828f608d	docs: Add EAR docs Merge docs relating to EAR.	2025-01-09 10:40:47 +00:00
Botond Dénes	a2436f139f	docs/dev: review-checklist.md: expand the guide for good commit log Closes scylladb/scylladb#22214	2025-01-08 13:01:35 +02:00
Kefu Chai	23729beeb5	docs: remove "ScyllaDB Enterprise" labels remove the "ScyllaDB Enterprise" labels in document. because there is no need to differentiate ScyllaDB Enterprise from its OSS variant, let's stop adding the "ScyllaDB Enterprise" labels to enterprise-only features. this helps to reduce the confusion. as we are still in the process of porting the enterprise features to this repo, this change does not fix scylladb/scylladb#22175. we will review the document again when completing the migration. we also take this opportunity to stop referencing "Enterprise" in the changed paragraph. Refs scylladb/scylladb#22175 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22177	2025-01-08 09:02:52 +02:00
Kefu Chai	e51b2075da	docs/kb: correct referenced git sha1 and version number in `047ce136`, we cherry-picked the change adding garbage-collection-ics.rst to the document. but it was still referencing the git sha1 and version number in enterprise. this change updates kb/garbage-collection-ics.rst, so that it * references the git commit sha1 in this repo * do not reference the version introducing this feature, as per Anna Stuchlik > As a rule, we should avoid documenting when something was > introduced or set as a default because our documentation > was versioned. Per-version information should be listed in > the release notes. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22195	2025-01-08 07:08:15 +02:00
Anna Stuchlik	8d824a564f	doc: add troubleshooting removal with --autoremove-ubuntu This commit adds a troubleshooting article on removing ScyllaDB with the --autoremove option. Fixes https://github.com/scylladb/scylladb/issues/21408 Closes scylladb/scylladb#21697	2025-01-07 13:35:13 +01:00
David Garcia	66a5e7f672	docs: update Sphinx configuration for unified repository publishing This change is related to the unification of enterprise and open-source repositories. The Sphinx configuration is updated to build documentation either for `docs.scylladb.com/manual` or `opensource.docs.scylladb.com`, depending on the flag passed to Sphinx. By default, it will build docs for `docs.scylladb.com/manual`. If the `opensource` flag is passed, it will build docs for `opensource.docs.scylladb.com`, with a different set of versions. This change will prepare the configuration to publish to `docs.scylladb.com/manual` while allowing the option to keep publishing and editing docs with a different multiversion configuration. Note that this change will continue publishing docs to `opensource.docs.scylladb.com` for now since the `opensource` flag is being passed in the `gh-pages.yml` branch. chore: remove comment chore: update project name Closes scylladb/scylladb#22089	2025-01-07 12:54:51 +02:00
Anna Stuchlik	047ce13641	doc: add a new KB article about tombstone garbage collection in ICS Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22174	2025-01-06 16:48:50 +02:00
Raphael S. Carvalho	c973254362	Introduce incremental compaction strategy (ICS) ICS is a compaction strategy that inherits size tiered properties -- therefore it's write optimized too -- but fixes its space overhead of 100% due to input files being only released on completion. That's achieved with the concept of sstable run (similar in concept to LCS levels) which breaks a large sstable into fixed-size chunks (1G by default), known as run fragments. ICS picks similar-sized runs for compaction, and fragments of those runs can be released incrementally as they're compacted, reducing the space overhead to about (number_of_input_runs * 1G). This allows user to increase storage density of nodes (from 50% to ~80%), reducing the cost of ownership. NOTE: test_system_schema_version_is_stable adjusted to account for batchlog using IncrementalCompactionStrategy contains: compaction/: added incremental_compaction_strategy.cc (.hh), incremental_backlog_tracker.cc (.hh) compaction/CMakeLists.txt: include ICS cc files configure.py: changes for ICS files, includes test db/legacy_schema_migrator.cc / db/schema_tables.cc: fallback to ICS when strategy is not supported db/system_keyspace: pick ICS for some system tables schema/schema.hh: ICS becomes default test/boost: Add incremental_compaction_test.cc test/boost/sstable_compaction_test.cc: ICS related changes test/cqlpy/test_compaction_strategy_validation.py: ICS related changes docs/architecture/compaction/compaction-strategies.rst: changes to ICS section docs/cql/compaction.rst: changes to ICS section docs/cql/ddl.rst: adds reference to ICS options docs/getting-started/system-requirements.rst: updates sentence mentioning ICS docs/kb/compaction.rst: changes to ICS section docs/kb/garbage-collection-ics.rst: add file docs/kb/index.rst: add reference to <garbage-collection-ics> docs/operating-scylla/procedures/tips/production-readiness.rst: add ICS section some relevant commits throughout the ICS history: commit 434b97699b39c570d0d849d372bf64f418e5c692 Merge: 105586f747 30250749b8 Author: Paweł Dziepak <pdziepak@scylladb.com> Date: Tue Mar 12 12:14:23 2019 +0000 Merge "Introduce Incremental Compaction Strategy (ICS)" from Raphael " Introduce new compaction strategy which is essentially like size tiered but will work with the existing incremental compaction. Thus incremental compaction strategy. It works like size tiered, but each element composing a tier is a sstable run, meaning that the compaction strategy will look for N similar-sized sstable runs to compact, not just individual sstables. Parameters: * "sstable_size_in_mb": defines the maximum sstable (fragment) size composing a sstable run, which impacts directly the disk space requirement which is improved with incremental compaction. The lower the value the lower the space requirement for compaction because fragments involved will be released more frequently. * all others available in size tiered compaction strategy HOWTO ===== To change an existing table to use it, do: ALTER TABLE mykeyspace.mytable WITH compaction = {'class' : 'IncrementalCompactionStrategy'}; Set fragment size: ALTER TABLE mykeyspace.mytable WITH compaction = {'class' : 'IncrementalCompactionStrategy', 'sstable_size_in_mb' : 1000 } " commit 94ef3cd29a196bedbbeb8707e20fe78a197f30a1 Merge: dca89ce7a5 e08ef3e1a3 Author: Avi Kivity <avi@scylladb.com> Date: Tue Sep 8 11:31:52 2020 +0300 Merge "Add feature to limit space amplification in Incremental Compaction" from Raphael " A new option, space_amplification_goal (SAG), is being added to ICS. This option will allow ICS user to set a goal on the space amplification (SA). It's not supposed to be an upper bound on the space amplification, but rather, a goal. This new option will be disabled by default as it doesn't benefit write-only (no overwrites) workloads and could hurt severely the write performance. The strategy is free to delay triggering this new behavior, in order to increase overall compaction efficiency. The graph below shows how this feature works in practice for different values of space_amplification_goal: https://user-images.githubusercontent.com/1409139/89347544-60b7b980-d681-11ea-87ab-e2fdc3ecb9f0.png When strategy finds space amplification crossed space_amplification_goal, it will work on reducing the SA by doing a cross-tier compaction on the two largest tiers. This feature works only on the two largest tiers, because taking into account others, could hurt the compaction efficiency which is based on the fact that the more similar-sized sstables are compacted together the higher the compaction efficiency will be. With SAG enabled, min_threshold only plays an important role on the smallest tiers, given that the second-largest tier could be compacted into the largest tier for a space_amplification_goal value < 2. By making the options space_amplification_goal and min_threshold independent, user will be able to tune write amplification and space amplification, based on the needs. The lower the space_amplification_goal the higher the write amplification, but by increasing the min threshold, the write amplification can be decreased to a desired amount. " commit 7d90911c5fb3fa891ad64a62147c3a6ca26d61b1 Author: Raphael S. Carvalho <raphaelsc@scylladb.com> Date: Sat Oct 16 13:41:46 2021 -0300 compaction: ICS: Add garbage collection Today, ICS lacks an approach to persist expired tombstones in a timely manner, which is a problem because accumulation of tombstones are known to affecting latency considerably. For an expired tombstone to be purged, it has to reach the top of the LSM tree and hope that older overlapping data wasn't introduced at the bottom. The condition are there and must be satisfied to avoid data resurrection. STCS, today, has an inefficient garbage collection approach because it only picks a single sstable, which satisfies the tombstone density threshold and file staleness. That's a problem because overlapping data either on same tier or smaller tiers will prevent tombstones from being purged. Also, nothing is done to push the tombstones to the top of the tree, for the conditions to be eventually satisfied. Due to incremental compaction, ICS can more easily have an effecient GC by doing cross-tier compaction of relevant tiers. The trigger will be file staleness and tombstone density, which threshold values can be configured by tombstone_compaction_interval and tombstone_threshold, respectively. If ICS finds a tier which meets both conditions, then that tier and the larger[1] and closest-in-size[2] tier will be compacted together. [1]: A larger tier is picked because we want tombstones to eventually reach the top of the tree. [2]: It also has to be the closest-in-size tier as the smaller the size difference the higher the efficiency of the compaction. We want to minimize write amplification as much as possible. The staleness condition is there to prevent the same file from being picked over and over again in a short interval. With this approach, ICS will be continuously working to purge garbage while not hurting overall efficiency on a steady state, as same-tier compactions are prioritized. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211016164146.38010-1-raphaelsc@scylladb.com> Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#22063	2025-01-04 15:43:52 +02:00
Avi Kivity	202f16e799	Merge 'Introduce workload prioritization for service levels' from Piotr Dulikowski This series introduces workload prioritization: an extension of the service levels feature which allows specifying "shares" per service level. The number of shares determines the priority of the user which has this service level attached (if multiple are attached then the one with the lowest shares wins). Different service levels will be isolated in the following way: - Each service level gets its own scheduling group with the number of shares (corresponding to the service level's number of shares), which controls the priority of the CPU and I/O used for user operations running on that service level. - Each service level gets two reader concurrency semaphores, one for user reads and the other for read-before-write done for view updates. - Each service level gets its own TCP connections for RPC to prevent priority inversion issues. Because of the mandatory use of scheduling groups, which are a globally limited resource, the number of service levels is now limited to 7 user created service levels + 1 created by default that cannot be removed. This feature has been previously only available in ScyllaDB Enterprise but has been made available for the source available ScyllaDB. The series was created by comparing the master branch with source-available-workbranch / enterprise branch and taking the workload prioritization related parts from the diff, then molding the resulting diff into a proper series. Some very minor changes were made such as fixing whitespace, removing unused or unnecessary code, adding some boilerplate (in api/) which was missing, but otherwise no major changes have been made. No backport is required. Closes scylladb/scylladb#22031 * github.com:scylladb/scylladb: tracing: record scheduling group in trace event record qos: un-shared-from-this standard_service_level_distributed_data_accessor alternator: execute under scheduling group for service level test.py: support multiple commands in prepare_cql in suite.yml docs: add documentation for workload prioritization docs/dev: describe workload prioritization features in service_levels test/auth_cluster: test workload prioritization in service level tests cqlpy/test_service_levels: add workload prioritization tests api: introduce service levels specific API api/cql_server_test: add information about scheduling group db/virtual_tables: add scheduling group column to system.clients test/boost: update service_level_controller_test for workload prio qos: include number of shares in DESCRIBE cql3/statements: update SL statements for workload prioritization transport/server: use scheduling group assigned to current user messaging_service: use separate set of connections per service levels replica/database: add reader concurrency semaphore groups qos: manage and assign scheduling groups to service levels qos: use the shares field in service level reads/writes qos: add shares to service_level_options qos: explicitly specify columns when querying service level tables db/system_distributed_keyspace: add shares column and upgrade code db/system_keyspace: adjust SL schema for workload prioritization gms: introduce WORKLOAD_PRIORITIZATION cluster feature build: increase the max number of scheduling groups qos: return correct error code when SL does not exist	2025-01-02 20:05:36 +02:00
Kefu Chai	233e3969c4	utils: correct misspellings these misspellings were identified by codespell. let's fix them. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22143	2025-01-02 16:47:57 +02:00
Piotr Dulikowski	b23bc3a5d5	alternator: execute under scheduling group for service level Now, the Alternator API requests are executed under the correct scheduling group of the service level assigned to the currently logged in user.	2025-01-02 07:13:34 +01:00
Piotr Dulikowski	07b162fb5b	docs: add documentation for workload prioritization The doc pages were slightly adjusted during migration not to mention Scylla Enterprise and to fix some whitespace issues.	2025-01-02 07:13:34 +01:00
Piotr Dulikowski	241e710c19	docs/dev: describe workload prioritization features in service_levels The concept of shares, and some helper HTTP APIs, are now described in the developer documentation for service levels.	2025-01-02 07:13:34 +01:00
Avi Kivity	727f68e0f5	Merge 'cql3: allow SELECT of specific collection element' from Michael Litvak This adds to the grammar the option to SELECT a specific element in a collection (map/set/list). For example: `SELECT map['key'] FROM table` `SELECT map['key1']['key2'] FROM table` This feature was implemented in Cassandra 4.0 and was requested by scylla users. The behavior is mostly compatible with Cassandra, except: 1. in SELECT, we allow list subscript in a selector, while cassandra allows only map and set. 2. in UPDATE, we allow set subscript in a column condition, while cassandra allows only map and list. 3. the slice syntax `SELECT m[a..b]` is not implemented yet 4. null subscript - `SELECT m[null]` returns null in scylla, while cassandra returns error Fixes #7751 backport was requested for a user to be able to use it Closes scylladb/scylladb#22051 * github.com:scylladb/scylladb: cql3: allow SELECT of specific collection key cql3: allow set subscript	2025-01-01 14:48:40 +02:00
Avi Kivity	76cf5148e1	Merge 'message: introduce advanced rpc compression' from Michał Chojnowski This is a forward port (from scylla-enterprise) of additional compression options (zstd, dictionaries shared across messages) for inter-node network traffic. It works as follows: After the patch, messaging_service (Scylla's interface for all inter-node communication) compresses its network traffic with compressors managed by the new advanced_rpc_compression::tracker. Those compressors compress with lz4, but can also be configured to use zstd as long as a CPU usage limit isn't crossed. A precomputed compression dictionary can be fed to the tracker. Each connection handled by the tracker will then start a negotiation with the other end to switch to this dictionary, and when it succeeds, the connection will start being compressed using that dictionary. All traffic going through the tracker is passed as a single merged "stream" through dict_sampler. dictionary_service has access to the dict_sampler. On chosen nodes (in the "usual" configuration: the Raft leader), it uses the sampler to maintain a random multi-megabyte sample of the sampler's stream. Every several minutes, it copies the sample, trains a compression dictionary on it (by calling zstd's training library via the alien_worker thread) and publishes the new dictionary to system.dicts via Raft's write_mutation command. This update triggers (eventually) a callback on all nodes, which feeds the new dictionary to advanced_rpc_compression::tracker, and this switches (eventually) all inter-node connections to this dictionary. Closes scylladb/scylladb#22032 * github.com:scylladb/scylladb: messaging_service: use advanced_rpc_compression::tracker for compression message/dictionary_service: introduce dictionary_service service: make Raft group 0 aware of system.dicts db/system_keyspace: add system.dicts utils: add advanced_rpc_compressor utils: add dict_trainer utils: introduce reservoir_sampling utils: introduce alien_worker utils: add stream_compressor	2024-12-31 15:02:57 +02:00
Michael Litvak	5ef7afb968	cql3: allow SELECT of specific collection key This adds to the grammar the option to SELECT a specific key in a collection column using subscript syntax. For example: SELECT map['key'] FROM table SELECT map['key1']['key2'] FROM table The key can also be parameterized in a prepared query. For this we need to pass the query options to result_set_builder where we process the selectors. Fixes scylladb/scylladb#7751	2024-12-30 17:05:20 +02:00
Michał Chojnowski	fdb2d2209c	messaging_service: use advanced_rpc_compression::tracker for compression This patch sets up an `alien_worker`, `advanced_rpc_compression::tracker`, `dict_sampler` and `dictionary_service` in `main()`, and wires them to each other and to `messaging_service`. `messaging_service` compresses its network traffic with compressors managed by the `advanced_rpc_compression::tracker`. All this traffic is passed as a single merged "stream" through `dict_sampler`. `dictionary_service` has access to `dict_sampler`. On chosen nodes (by default: the Raft leader), it uses the sampler to maintain a random multi-megabyte sample of the sampler's stream. Every several minutes, it copies the sample, trains a compression dictionary on it (by calling zstd's training library via the `alien_worker` thread) and publishes the new dictionary to `system.dicts` via Raft. This update triggers a callback into `advanced_rpc_compression::tracker` on all nodes, which updates the dictionary used by the compressors it manages.	2024-12-27 10:17:58 +01:00
Michał Chojnowski	0fd1050784	utils: add advanced_rpc_compressor Adds glue needed to pass lz4 and zstd with streaming and/or dictionaries as the network traffic compressors for Seastar's RPC servers. The main jobs of this glue are: 1. Implementing the API expected by Seastar from RPC compressors. 2. Expose metrics about the effectiveness of the compression. 3. Allow dynamically switching algorithms and dictionaries on a running connection, without any extra waits. The biggest design decision here is that the choice of algorithm and dictionary is negotiated by both sides of the connection, not dictated unilaterally by the sender. The negotiation algorithm is fairly complicated (a TLA+ model validating it is included in the commit). Unilateral compression choice would be much simpler. However, negotiation avoids re-sending the same dictionary over every connection in the cluster after dictionary updates (with one-way communication, it's the only reliable way to ensure that our receiver possesses the dictionary we are about to start using), lets receivers ask for a cheaper compression mode if they want, and lets them refuse to update a dictionary if they don't think they have enough free memory for that. In hindsight, those properties probably weren't worth the extra complexity and extra development effort. Zstd can be quite expensive, so this patch also includes a mechanism which temporarily downgrades the compressor from zstd to lz4 if zstd has been using too much CPU in a given slice of time. But it should be noted that this can't be treated as a reliable "protection" from negative performance effects of zstd, since a downgrade can happen on the sender side, and receivers are at the mercy of senders.	2024-12-23 23:37:02 +01:00
Pavel Emelyanov	a24dc02255	api: New "scope" API param to load-and-stream calls There are two of those -- the POST /storage_service/keyspace that loads and streams new sstables from /upload and POST /storage_service/restore that does the same, but gets sstables from object store. The new optional parameter allow users to tun the streaming phase behavior. The test/pylib client part is also updated here. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-12-23 19:28:05 +03:00
Avi Kivity	f8ce49ebe9	cql3: implement NOT IN Where the grammar supports IN, we add NOT IN. This includes the WHERE clause and LWT IF clause. Evaluation of NOT IN follows from IN. In statement_restrictions analysis, they are different, as NOT IN doesn't enable any clever query plan and must filter. Some tests are added. An error message was changed ('in' changed to 'IN'), so some tests are adjusted. Closes scylladb/scylladb#21992	2024-12-22 15:15:23 +02:00
Dawid Mędrek	461a6b129c	docs: Update documentation on CREATE ROLE WITH HASHED PASSWORD As part of #18750, we added a CQL statement CREATE ROLE WITH SALTED HASH that prevented hashing a password when creating a role, effectively leading to inserting a hash given by the user directly into the database. In #21350, we noticed that Cassandra had implemented a CQL statement of similar semantics but different syntax. We decided to rename Scylla's statement to be compatible with Cassandra. Unfortunately, we didn't notice one more difference between what we had in Scylla and what was part of Cassandra. Scylla's statement was originally supposed to only be used when restoring the schema and the user needn't have to be aware of its existence at all: the database produced a sequence of CQL statements that the user saved to a file and when a need to restore the schema arose, they would execute the contents of the file. That's why that although we documented the feature, it was only done in the necessary places. Those that weren't related to the backup & restore procedure were deliberately skipped. Cassandra, on the other hand, added the statement for a different purpose (for details, see the relevant issue) and it was supposed to be used by the user by design. The statement is also documented as such. Since we want to preserve compatibility with Cassandra, we document the statement and its semantics in the user documentation, explicitly implying that it can be used by the user. Fixes scylladb/scylladb#21691	2024-12-17 13:43:36 +01:00
Pavel Emelyanov	3081ce24cd	nodetool: Implement [gs]etstreamthroughput commands They exist in the original documentation, but are not yet implemented. Now it's possible to do it. It slightly more complex that its compaction counterpart in a sense than get method reports megabits/s by default and has an option to convert to MiBs. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-12-13 14:39:47 +03:00
Pavel Emelyanov	67089fd5a1	nodetool: Implement [gs]etcompationthroughput commands They exist in the original documentation, but are not yet implemented. Now it's possible to do it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-12-13 14:39:47 +03:00
Anna Stuchlik	98860905d8	doc: remove wrong image upgrade info (5.2-to-2023.1) This commit removes the information about the recommended way of upgrading ScyllaDB images - by updating ScyllaDB and OS packages in one step. This upgrade procedure is not supported (it was implemented, but then reverted). Refs https://github.com/scylladb/scylladb/issues/15733 Closes scylladb/scylladb#21876	2024-12-11 14:00:30 +02:00
Tomasz Grabiec	8e60a0b831	Merge 'truncate: make TRUNCATE TABLE safe with tablets' from Ferenc Szili Currently truncating a table works by issuing an RPC to all the nodes which call `database::truncate_table_on_all_shards()`, which makes sure that older writes are dropped. It works with tablets, but is not safe. A concurrent replication process may bring back old data. This change makes makes TRUNCATE TABLE a topology operation, so that it excludes with other processes in the system which could interfere with it. More specifically, it makes TRUNCATE a global topology request. Backporting is not needed. Fixes #16411 Closes scylladb/scylladb#19789 * github.com:scylladb/scylladb: docs: docs: topology-over-raft: Document truncate_table request storage_proxy: fix indentation and remove empty catch/rethrow test: add tests for truncate with tablets storage_proxy: use new TRUNCATE for tablets truncate: make TRUNCATE a global topology operation storage_service: move logic of wait_for_topology_request_completion() RPC: add truncate_with_tablets RPC with frozen_topology_guard feature_service: added cluster feature for system.topology schema change system.topology_requests: change schema storage_proxy: propagate group0 client and TSM dependency	2024-12-10 17:50:50 +01:00
Ferenc Szili	49cc771bda	docs: docs: topology-over-raft: Document truncate_table request	2024-12-09 16:38:50 +01:00
Botond Dénes	2491a31f4c	docs: cql/ddl.rst: document {min,max}_index_interval Closes scylladb/scylladb#21795	2024-12-09 13:45:20 +03:00
Tomasz Grabiec	7e2875d648	Merge 'Add tablet merge support' from Raphael Raph Carvalho The goal of merge is to reduce the tablet count for a shrinking table. Similar to how split increases the count while the table is growing. The load balancer decision to merge is implemented today (came with infrastructure introduced for split), but it wasn't handled until now. Initial tablet count is respected while the table is in "growing mode". For example, the table leaves it if there was a need to split above the initial tablet count. After the table leaves the mode, the average size can be trusted to determine that the table is shrinking. Merge decision is emitted if the average tablet size is 50% of the target. Hysteresis is applied to avoid oscillations between split and merges. Similar to split, the decision to merge is recorded in tablet map's resize_type field with the string "merge". This is important in case of coordinator failover, so new coordinator continues from where the old left off. Unlike split, the preparation phase during merge is not done by the replica (with split compactions), but rather by the coordinator by co-locating sibling tablets in the same node's shard. We can define sibling tablets as tablets that have contiguous range and will become one after merge. The concept is based on the power-of-two constraint and token contiguity. For example, in a table with 4 tablets, tablets of ids 0 and 1 are siblings, 2 and 3 are also siblings. The algorithm for co-locating sibling tablets is very simple. The balancer is responsible for it, and it will emit migrations so that "odd" tablet will follow the "even" one. For example, tablet 1 will be migrated to where tablet 0 lives. Co-location is low in priority, it's not the end of the world to delay merge, but it's not ideal to delay e.g. decommission or even regular load balancing as that can translate into temporary unbalancing, impacting the user activities. So co-location migrations will happen when there is no more important work to do. While regular balancing is higher in priority, it will not undo the co-location work done so far. It does that by treating co-located tablets as if they were already merged. The load inversion convergence check was adjusted so balancer understand when two tablets are being migrated instead of one, to avoid oscillations. When balancer completes co-location work for a table undergoing merge, it will put the id of the table into the resize_plan, which is about communicating with the topology coordinator that a table is ready for it. With all sibling tablets co-located, the coordinator can resize the tablet map (reduce it by a factor of 2) and record the new map into group0. All the replicas will react to it (on token metadata update) by merging the storage (memtable(s) + sstables) of sibling tablets into one. Fixes #18181. system test details: test: https://github.com/pehala/scylla-cluster-tests/blob/tablets_split_merge/tablets_split_merge_test.py yaml file: https://github.com/pehala/scylla-cluster-tests/blob/tablets_split_merge/test-cases/features/tablets/tablets-split-merge-test.yaml instance type: i3.8xlarge nodes: 3 target tablet size: 0.5G (scaled down by 10, to make it easier to trigger splits and merges) description: multiple cycles of growing and shrinking the data set in order to trigger splits and merges. data_set_size: ~100G initial_tablets: 64, so it grew to 128 tablets on split, and back to 64 on merge. latency of reads and writes that happened in parallel to split and merge: ``` $ for i in scylla-bench; do cat $i \| grep "Mode\\|99th:\\|99\.9th:"; done Mode: write 99.9th: 3.145727ms 99th: 1.998847ms 99.9th: 3.145727ms 99th: 2.031615ms Mode: read 99.9th: 3.145727ms 99th: 2.031615ms 99.9th: 3.145727ms 99th: 2.031615ms Mode: write 99.9th: 3.047423ms 99th: 1.933311ms 99.9th: 3.047423ms 99th: 1.933311ms Mode: read 99.9th: 3.145727ms 99th: 1.900543ms 99.9th: 3.145727ms 99th: 1.900543ms Mode: write 99.9th: 5.079039ms 99th: 3.604479ms 99.9th: 35.389439ms 99th: 25.624575ms Mode: write 99.9th: 3.047423ms 99th: 1.998847ms 99.9th: 3.047423ms 99th: 1.998847ms Mode: read 99.9th: 3.080191ms 99th: 2.031615ms 99.9th: 3.112959ms 99th: 2.031615ms ``` Closes scylladb/scylladb#20572 github.com:scylladb/scylladb: docs: Document tablet merging tests/boost: Add test to verify correctness of balancer decisions during merge tests/topology_experimental_raft: Add tablet merge test service: Handle exception when retrying split service: Co-locate sibling tablets for a table undergoing merge gms: Add cluster feature for tablet merge service: Make merge of resize plan commutative replica: Implement merging of compaction groups on merge completion replica: Handle tablet merge completion service: Implement tablet map resize for merge locator: Introduce merge_tablet_info() service: Rename topology::transition_state::tablet_split_finalization service: Respect initial_tablet_count if table is in growing mode service: Wire migration_tablet_set into the load balancer locator: Add tablet_map::sibling_tablets() service: Introduce sorted_replicas_for_tablet_load() locator/tablets: Extend tablet_replica equality comparator to three-way service: Introduce alias to per-table candidate map type service: Add replication constraint check variant for migration_tablet_set service: Add convergence check variant for migration_tablet_set service: Add migration helpers for migration_tablet_set service/tablet_allocator: Introduce migration_tablet_set service: Introduce migration_plan::add(migrations_vector) locator/tablets: Introduce tablet_map::for_each_sibling_tablets() locator/tablets: Introduce tablet_map::needs_merge() locator/tablets: Introduce resize_decision::initial_decision() locator/tablets: Fix return type of three-way comparison operators service: Extract update of node load on migrations service: Extract converge check for intra-node migration service: Extract erase of tablet replicas from candidate list scripts/tablet-mon: Allow visualization of tablet id	2024-12-06 18:06:20 +01:00
Kefu Chai	37c49acbac	docs/cql/ddl: Clarify crc_check_chance option behavior Although `crc_check_chance` is accepted as a configuration option in ScyllaDB, the value is currently ignored during runtime. This change makes this behavior explicit in the documentation to prevent potential user misunderstandings. Changes: - Explicitly document that the option is currently a no-op - Provide clear guidance on the current implementation - Prevent confusion about the option's actual functionality Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21794	2024-12-06 13:48:03 +02:00
Emil Maskovsky	2b07d93bea	raft: clean up the documentation Small adjustments and improvements to the documentation in the raft section. Fixing Markdown lint warnings: - MD004/ul-style: Unordered list style [Expected: dash; Actual: asterisk] - MD007/ul-indent: Unordered list indentation [Expected: 0; Actual: 2] - MD032/blanks-around-lists: Lists should be surrounded by blank lines - MD036/no-emphasis-as-heading: Emphasis used instead of a heading - MD046/code-block-style: Code block style [Expected: fenced; Actual: indented] Closes scylladb/scylladb#21780	2024-12-05 13:44:11 +01:00
Raphael S. Carvalho	d93a0040e5	docs: Document tablet merging Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2024-12-04 13:11:11 -03:00
Botond Dénes	f55dc71c3f	Merge 'Use checksummed input streams in `validate_checksums()`' from Nikos Dragazis With commits `ed7d352e7d` and `bb1867c7c7`, we now have input streams for both compressed and uncompressed SSTables that provide seamless checksum and digest checking. The code for these was based on `validate_checksums()`, which implements its own validation logic over raw streams. This has led to some duplicate code. This PR deduplicates the uncompressed case by modifying `validate_checksums()` to use a checksummed input stream instead of a raw stream. The same cannot be done for compressed SSTables though. The reason is that `validate_checksums()` needs to examine the whole data file, even if an invalid chunk is encountered. In the checksummed case we support that by offloading the error handling logic from the data source via a function parameter. In the compressed data source we cannot do that because it needs to return decompressed data and decompression may fail if the data are invalid. This PR also enables `validate_checksums()` to partially verify SSTables with just the per-chunk checksums if the digest is missing. In more detail, this PR consists of: * Port of some integrity checks from `do_validate_uncompressed()` to the checksummed data source. It should now be able to detect corruption due to truncated or appended chunks (expected number of chunks is retrieved from the CRC component). * Introduction of `error_handler` parameter in checksummed data source and `data_stream()`. * Refactoring of `validate_checksums()`. The JSON response of `sstable validate-checksums` was also modified to report a missing digest. * Tests for `validate_checksums()` against SSTables with truncated data, appended data, invalid digests, or no digest. Refs #19058. This PR is a hybrid of cleanup and feature. No backport is needed. Closes scylladb/scylladb#20933 * github.com:scylladb/scylladb: tools/scylla-sstable: Rename valid_checksums -> valid test: Check validate_checksums() with missing digest sstables: Allow validate_checksums() to report missing digests sstables: Refactor validate_checksums() to use checksummed data stream sstables: Add error_handler parameter to data_stream() sstables: Add error handler in checksummed data source sstables: Check for excessive chunks in checksummed data source sstables: Check for premature EOF in checksummed data source test: test_validate_checksums: Check SSTable with invalid digest test: test_validate_checksums: Check SSTable with appended data test: test_validate_checksums: Complement test for truncated SSTable	2024-12-04 10:46:18 +02:00
Raphael S. Carvalho	e00798f1b1	service: Rename topology::transition_state::tablet_split_finalization This transition state will be reused by merge completion, so let's rename it to tablet_resize_finalization. The completion handling path will also be reused, so let's rename functions involved similarly. The old name "tablet split finalization" is deprecated but still recognized and points to the correct transition. Otherwise, the reverse lookup would fail when populating topology system table which last state was split finalization. NOTE: I thought of adding a new tablet_merge_finalization, but it would complicate things since more than one table could be ready for either split or merge, so you need a generic transition state for handling resize completion. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2024-12-03 20:45:20 -03:00
Kefu Chai	afeff0a792	docs: explain task status retention and one-time query behavior Task status information from nodetool commands is not retained permanently: - Status of completed tasks is only kept for `task_ttl_in_seconds` - Status is removed after being queried, making it a one-time operation This behavior is important for users to understand since subsequent queries for the same completed task will not return any information. Add documentation to make this clear to users. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21386	2024-11-29 16:36:27 +01:00
Botond Dénes	055a36ae55	main: dump diagnostics on SIGQUIT Dump a diagnostics report on each shard when receiving a SIGQUIT. The report is logged with a dedicated logger, called diagnostics. The report has multiple parts: * seastar memory diagnostics, similar to that printed by the scylla memory command (from scylla-gdb.py). * reader concurrency semaphore diagnostics for each semaphore. Example report: INFO 2024-11-27 01:31:55,882 [shard 0:main] diagnostics - Diagnostics dump requested via SIGQUIT: Dumping seastar memory diagnostics Used memory: 3988M Free memory: 58M Total memory: 4G Hard failures: 0 LSA allocated: 4M used: 16 free: 4G Cache: total: 1M used: 642K free: 398K Memtables: total: 3M Regular: real dirty: 0B virt dirty: 0B System: real dirty: 3M virt dirty: 3M Replica: Read Concurrency Semaphores: user: 0/100, 0B/81M, queued: 0 streaming: 0/10, 0B/81M, queued: 0 system: 0/10, 0B/81M, queued: 0 compaction: 0/unlimited, 0B/unlimited view update: 0/50, 0B/40M, queued: 0 Execution Stages: apply stage: Total: 0 Tables - Ongoing Operations: Pending writes (top 10): 0 Total (all) Pending reads (top 10): 0 Total (all) Pending streams (top 10): 0 Total (all) Small pools: objsz spansz usedobj memory unused wst% 8 4K 858 16K 9K 58 10 4K 5 8K 8K 99 12 4K 5 8K 8K 99 14 4K 0 0B 0B 0 16 4K 2k 44K 15K 35 32 4K 4k 136K 16K 11 32 4K 8k 280K 24K 8 32 4K 3k 92K 6K 6 32 4K 4k 140K 21K 14 48 4K 3k 180K 25K 14 48 4K 2k 120K 27K 22 64 4K 2k 156K 18K 11 64 4K 19k 1M 11K 0 80 4K 3k 236K 16K 6 96 4K 6k 572K 49K 8 112 4K 2k 276K 72K 25 128 4K 477 80K 20K 25 160 4K 194 60K 30K 49 192 4K 1k 232K 39K 16 224 4K 2k 468K 15K 3 256 4K 182 100K 55K 54 320 8K 349 152K 43K 28 384 8K 332 288K 164K 56 448 4K 243 180K 74K 40 512 4K 256 244K 116K 47 640 16K 185 192K 76K 39 768 16K 394 432K 137K 31 896 8K 54 192K 144K 75 1024 4K 288 432K 144K 33 1280 32K 92 256K 140K 54 1536 32K 11 128K 111K 86 1792 16K 10 144K 126K 87 2048 8K 487 1M 90K 8 2560 64K 113 384K 100K 26 3072 64K 9 256K 228K 89 3584 32K 3 288K 277K 96 4096 16K 129 912K 396K 43 5120 128K 21 384K 275K 71 6144 128K 4 512K 486K 94 7168 64K 3 576K 553K 96 8192 32K 373 3M 56K 1 10240 64K 6 832K 770K 92 12288 64K 17 960K 756K 78 14336 128K 2 1M 1M 97 16384 64K 14 1M 992K 81 Page spans: index size free used spans 0 4K 4K 5M 1k 1 8K 8K 2M 213 2 16K 16K 2M 106 3 32K 64K 6M 200 4 64K 64K 4M 71 5 128K 384K 3934M 31k 6 256K 1M 256K 5 7 512K 512K 512K 2 8 1M 2M 0B 2 9 2M 2M 2M 2 10 4M 4M 0B 1 11 8M 16M 0B 2 12 16M 32M 0B 2 13 32M 0B 32M 1 14 64M 0B 0B 0 15 128M 0B 0B 0 16 256M 0B 0B 0 17 512M 0B 0B 0 18 1G 0B 0B 0 19 2G 0B 0B 0 20 4G 0B 0B 0 21 8G 0B 0B 0 22 16G 0B 0B 0 23 32G 0B 0B 0 24 64G 0B 0B 0 25 128G 0B 0B 0 26 256G 0B 0B 0 27 512G 0B 0B 0 28 1T 0B 0B 0 29 2T 0B 0B 0 30 4T 0B 0B 0 31 8T 0B 0B 0 INFO 2024-11-27 01:31:55,882 [shard 0:main] diagnostics - Diagnostics dump requested via SIGQUIT: Semaphore user with 0/100 count and 0/84850769 memory resources: user request, dumping permit diagnostics: permits count memory table/operation/state 0 0 0B total Stats: permit_based_evictions: 0 time_based_evictions: 0 inactive_reads: 0 total_successful_reads: 0 total_failed_reads: 0 total_reads_shed_due_to_overload: 0 total_reads_killed_due_to_kill_limit: 0 reads_admitted: 0 reads_enqueued_for_admission: 0 reads_enqueued_for_memory: 0 reads_admitted_immediately: 0 reads_queued_because_ready_list: 0 reads_queued_because_need_cpu_permits: 0 reads_queued_because_memory_resources: 0 reads_queued_because_count_resources: 0 reads_queued_with_eviction: 0 total_permits: 0 current_permits: 0 need_cpu_permits: 0 awaits_permits: 0 disk_reads: 0 sstables_read: 0 INFO 2024-11-27 01:31:55,882 [shard 0:main] diagnostics - Diagnostics dump requested via SIGQUIT: Semaphore streaming with 0/10 count and 0/84850769 memory resources: user request, dumping permit diagnostics: permits count memory table/operation/state 0 0 0B total Stats: permit_based_evictions: 0 time_based_evictions: 0 inactive_reads: 0 total_successful_reads: 6 total_failed_reads: 0 total_reads_shed_due_to_overload: 0 total_reads_killed_due_to_kill_limit: 0 reads_admitted: 6 reads_enqueued_for_admission: 0 reads_enqueued_for_memory: 0 reads_admitted_immediately: 6 reads_queued_because_ready_list: 0 reads_queued_because_need_cpu_permits: 0 reads_queued_because_memory_resources: 0 reads_queued_because_count_resources: 0 reads_queued_with_eviction: 0 total_permits: 6 current_permits: 0 need_cpu_permits: 0 awaits_permits: 0 disk_reads: 0 sstables_read: 0 INFO 2024-11-27 01:31:55,882 [shard 0:main] diagnostics - Diagnostics dump requested via SIGQUIT: Semaphore compaction with 0/2147483647 count and 0/9223372036854775807 memory resources: user request, dumping permit diagnostics: permits count memory table/operation/state 0 0 0B total Stats: permit_based_evictions: 0 time_based_evictions: 0 inactive_reads: 0 total_successful_reads: 0 total_failed_reads: 0 total_reads_shed_due_to_overload: 0 total_reads_killed_due_to_kill_limit: 0 reads_admitted: 0 reads_enqueued_for_admission: 0 reads_enqueued_for_memory: 0 reads_admitted_immediately: 0 reads_queued_because_ready_list: 0 reads_queued_because_need_cpu_permits: 0 reads_queued_because_memory_resources: 0 reads_queued_because_count_resources: 0 reads_queued_with_eviction: 0 total_permits: 27 current_permits: 0 need_cpu_permits: 0 awaits_permits: 0 disk_reads: 0 sstables_read: 0 INFO 2024-11-27 01:31:55,882 [shard 0:main] diagnostics - Diagnostics dump requested via SIGQUIT: Semaphore system with 0/10 count and 0/84850769 memory resources: user request, dumping permit diagnostics: permits count memory table/operation/state 1 0 0B ./view_builder/active 1 0 0B total Stats: permit_based_evictions: 0 time_based_evictions: 0 inactive_reads: 0 total_successful_reads: 234 total_failed_reads: 0 total_reads_shed_due_to_overload: 0 total_reads_killed_due_to_kill_limit: 0 reads_admitted: 234 reads_enqueued_for_admission: 154 reads_enqueued_for_memory: 0 reads_admitted_immediately: 80 reads_queued_because_ready_list: 154 reads_queued_because_need_cpu_permits: 0 reads_queued_because_memory_resources: 0 reads_queued_because_count_resources: 0 reads_queued_with_eviction: 0 total_permits: 235 current_permits: 1 need_cpu_permits: 0 awaits_permits: 0 disk_reads: 0 sstables_read: 0 INFO 2024-11-27 01:31:55,882 [shard 0:main] diagnostics - Diagnostics dump requested via SIGQUIT: Semaphore view_update with 0/50 count and 0/42425384 memory resources: user request, dumping permit diagnostics: permits count memory table/operation/state 0 0 0B total Stats: permit_based_evictions: 0 time_based_evictions: 0 inactive_reads: 0 total_successful_reads: 0 total_failed_reads: 0 total_reads_shed_due_to_overload: 0 total_reads_killed_due_to_kill_limit: 0 reads_admitted: 0 reads_enqueued_for_admission: 0 reads_enqueued_for_memory: 0 reads_admitted_immediately: 0 reads_queued_because_ready_list: 0 reads_queued_because_need_cpu_permits: 0 reads_queued_because_memory_resources: 0 reads_queued_because_count_resources: 0 reads_queued_with_eviction: 0 total_permits: 0 current_permits: 0 need_cpu_permits: 0 awaits_permits: 0 disk_reads: 0 sstables_read: 0 Fixes: scylladb/scylladb#7400 Closes scylladb/scylladb#21692	2024-11-28 18:52:29 +02:00
Botond Dénes	ff90a77f5b	scylla-sstable: revamp schema sources Demote --scylla-data-dir and --scylla-yaml-file to schema source helpers, rather than schema source in themselves. This practically means that when these options are used, they won't define where the tool will attempt to load the schema from, they will just be helpers to help locate the schema, for whichever schema source the tool was instructed to use (or left to choose). --scylla-data-dir and --scylla-yaml-file being schema sources were problematic with encryption at rest and for S3 support (not yet implemented). With encryption, the tool needs access to the configuration, so --scylla-yaml-file is often used to provide the path to the configuration file, which contains encryption configuration, needed for the tool to decrypt the sstable. Currently, using this option implies forcing the tool to read the schema from the schema tables, which is a problematic option for tests -- Scylla might be compacting a schema sstable and this will make the tool fail to load the schema. Demoting these options the schema helpers, allows providing them, while at the same time having the option to use a different schema-source. To allow the user to force the tool to load the schema from the schema tables, a new --schema-tables option is added. Similarly, a --sstable-schema option is introduced to force the tool to load the schema from the sstable itself. With this, each 4 schema source now has an option to force the use of said schema source. There are various helper options to be used along with these. The documentation as well as the tests are updated with the changes. The schema related documentation gets an rather extensive facelift because it was a bit out-of-date and incomplete. Fixes: scylladb/scylladb#20534 Closes scylladb/scylladb#21678	2024-11-28 18:36:09 +02:00
Kefu Chai	23a7e9a6d0	docs: align tablestats documentation with actual output Update the tablestats documentation to correctly describe the "Number of partitions" metric. The previous documentation incorrectly referred to "estimated row count" when the command actually shows estimated partition count. Before: ``` Number of keys (estimate) \| The estimated row count ``` After: ``` Number of partitions (estimate) \| The estimated partition count ``` This distinction is important since a partition (identified by its partition key) can contain multiple rows in ScyllaDB. The updated format also matches Cassandra's nodetool output for better compatibility. Fixes scylladb/scylladb#21586 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21598	2024-11-28 09:36:21 +02:00
Botond Dénes	87bdfb80aa	docs/dev/reader-concurrency-semaphore.md: fix formatting of diagnostics dump Indent the whole thing so it is formatted as code, not as text. Closes scylladb/scylladb#21693	2024-11-27 12:13:16 +03:00
Botond Dénes	ccb433d767	Merge 'tasks: add api_task_ttl for tasks started with API' from Aleksandra Martyniuk When users start an operation asynchronously with API, they are expected to check the operation's status. Hence, the status should be kept in task manager for reasonable time after the operation is done. The operations that are started internally usually don't need to stay in task manager for that long. Add api_task_ttl that will be used for tasks started with API. By default it's 1 hour. The time for which non-API tasks stay in task manager isn't changed. Fixes: #21499. Refs: #21425. No backport needed - previous versions may use task_ttl Closes scylladb/scylladb#21505 * github.com:scylladb/scylladb: test: add test to check user_task_ttl tasks: api: move make_task method docs: nodetool: update backup and restore commands docs docs: update task manager docs nodetool: add nodetool tasks user-ttl command node_ops: use user task ttl for node ops virtual task tasks: use user_task_ttl for tasks started by user api: task_manager: add /task_manager/user_ttl to get and set user task ttl tasks: add task_manager::task::is_user_task method tasks: keep updateable_value of task_ttl in task manager db: config: add user_task_ttl_seconds named value	2024-11-27 09:57:57 +02:00
Kamil Braun	1f5b83dc56	Merge 'docs: update admin-tools docs with deprecation and removal notice for java tools' from Botond Dénes Java tools are deprecated and slated for removal in the next ScyllaDB release. Update the admin-tools docs and make sure all java tool documentation pages have a notice reflecting this fact. Fixes: https://github.com/scylladb/scylladb/issues/21149 Should be backported to 6.2, so users of the latest stable version can see the notice. Closes scylladb/scylladb#21522 * github.com:scylladb/scylladb: docs: sstableloader.rst: add deprecation notice docs: admin-tools: update deprecation notice for sstable{dump,metadata} docs: tools_index.rst: remove deprecated sstablereset and sstablerepairedset tools	2024-11-26 17:03:56 +01:00
Ernest Zaslavsky	793f2c95d1	snapshots: Stop taking snapshots of MVs Stop taking snapshots of MVs and allow taking snapshot of individual tables, now one can take a snapshot of any base table, any view or index. Also add tests to cover new cases both boost test (using cc code) and pytest (using the API) Also, update documentation to reflect the change fixes: #21339 fixes: #20760 Closes scylladb/scylladb#21433	2024-11-26 15:27:30 +02:00
Aleksandra Martyniuk	1244982071	docs: nodetool: update backup and restore commands docs	2024-11-26 09:57:41 +01:00

1 2 3 4 5 ...

1525 Commits