scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 17:40:34 +00:00

Author	SHA1	Message	Date
Łukasz Paszkowski	5089ffe06f	tools: toolchain: add e2fsprogs, fuse3 to the dependencies The packages contain filesystem utilities to create volumes such that sudo/unshare are not required. Closes #26135 [avi: regenerate frozen toolchain with optimized clang from https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-x86_64.tar.gz ] Closes scylladb/scylladb#26165	2025-09-23 18:49:37 +03:00
Pavel Emelyanov	ce8dd798a2	Merge 'tools/scylla-sstable-scripts: introduce purgeable.lua and writetime-histogram.lua' from Botond Dénes `purgeable.lua` was written for a specific investigation a few years ago. `writetime-histogram.lua` is an sstable script transcription of the former scylla-sstable writetime-histogram command. This was also written for an investigation (before script command existed) and is too specific to be a native command, so was removed by `edaf67edcb`. Add both scripts to the sample script library, they can be useful, either for a future investigation, or as samples to copy+edit to write new scripts (and train AI). New sstable scripts, no backport Closes scylladb/scylladb#26137 * github.com:scylladb/scylladb: tools/scylla-sstable-scripts: introduce writetime-histogram.lua tools/scylla-sstable-scripts: introduce purgable.lua	2025-09-22 15:27:49 +03:00
Botond Dénes	92f614dc5a	tools/scylla-sstable-scripts: introduce writetime-histogram.lua Produces a histogram with the writetime (timestamp) of the data in the sstable(s). The histogram is printed to the output, along with general stats about the processed data.	2025-09-19 11:54:01 +03:00
Botond Dénes	d298da5410	tools/scylla-sstable-scripts: introduce purgable.lua Collects and prints statistics about how much data is purgeable in an sstable. Works only with tombstone_gc = {'mode': 'timeout'}; Can help diagnosing the efficiency (or lack of) tombstone-gc.	2025-09-19 11:53:30 +03:00
Botond Dénes	37e46f674d	Merge 'nodetool: ignore repair request error of colocated tables' from Michael Litvak when cluster repair is run for an entire keyspace, nodetool makes a repair api request for each table. if the keyspace contains colocated tables, then the api request for the colocated tables will fail, because currently scylla doesn't allow making repair requests for specific colocated tables, but only for base tables. if the request is to repair an entire keyspace then we can ignore this, because we will make a repair request for all base tables, and this in turn will repair also all the colocated tables in the keyspace. however if specific tables are requested and some of them are colocated then we should propagate the error to let the user know the request is invalid. Refs https://github.com/scylladb/scylladb/issues/24816 no backport - no colocated tablets in previous releases Closes scylladb/scylladb#26051 * github.com:scylladb/scylladb: nodetool: ignore repair request error of colocated tables storage_service: improve error message on repair of colocated tables	2025-09-19 06:44:23 +03:00
Karol Nowacki	b5f3f2f4c5	tools: Fix missing source file in CMake target The `json_mutation_stream_parser.cc` file was not included in the `scylla-tools` CMake target. This could lead to "undefined reference" linker errors when building with CMake. This commit adds the missing source file to the target's source list. Closes scylladb/scylladb#26108	2025-09-18 19:44:53 +03:00
Botond Dénes	edaf67edcb	tools/scylla-sstable: remove writetime-histogram command This command was written for an investigation and was used exactly once. This would have been a perfect candidate for the (also rarely used) scylla-sstable script command, but it didn't exist yet. Drop this command from the tool, such super-specific commands should be written as sstable-scripts nowadays, which is what we will do if we ever need this again. Closes scylladb/scylladb#26062	2025-09-18 12:05:54 +03:00
Michael Litvak	aae91330b0	nodetool: ignore repair request error of colocated tables when cluster repair is run for an entire keyspace, nodetool makes a repair api request for each table. if the keyspace contains colocated tables, then the api request for the colocated tables will fail, because currently scylla doesn't allow making repair requests for specific colocated tables, but only for base tables. if the request is to repair an entire keyspace then we can ignore this, because we will make a repair request for all base tables, and this in turn will repair also all the colocated tables in the keyspace. however if specific tables are requested and some of them are colocated then we should propagate the error to let the user know the request is invalid. Refs scylladb/scylladb#24816	2025-09-18 09:35:53 +02:00
Avi Kivity	3acfc577d8	Merge 'tools/scylla-sstable: extract json mutation stream parser into own hh,cc' from Botond Dénes tools/scylla-sstable.cc has 3.5k SLOC, out of which this class alone is 1K. Extract into own hh and cc. Since this class was already using pimpl, the header remains nice and small. Code cleanup, no backport needed. Closes scylladb/scylladb#26064 * github.com:scylladb/scylladb: tools: extract json_mtuation_stream_parser to its own hh,cc files tools/scylla-sstable: fix indentation tools/scylla-sstable: prepare for extracting json_mutation_stream_parser	2025-09-17 18:30:30 +03:00
Botond Dénes	2fa0f82910	tools: extract json_mtuation_stream_parser to its own hh,cc files tools/scylla-sstable.cc has 3.5k SLOC, out of which this class alone is 1K. Extract into own hh and cc, just a copy-paste after the preparation commit.	2025-09-17 12:18:07 +03:00
Botond Dénes	ffe8918522	tools/scylla-sstable: fix indentation Left broken by previous patch.	2025-09-17 12:16:22 +03:00
Botond Dénes	8c36a983cc	tools/scylla-sstable: prepare for extracting json_mutation_stream_parser Make methods out-of-line, so class declaration stands on its own, without definition of impl. Move auxiliary structures, used only by impl, out of the class scope. Move parser to tools namespace, and auxiliaries to anonymous namespace within the tools one. Pass down logger ref to parser impl and below, to prepare for sst_log not being available in scope. Add comment to parser class explaining what it does.	2025-09-17 12:16:21 +03:00
Asias He	54162a026f	scylla-nodetool: Add --incremental-mode option to cluster repair The `--incremental-mode` option specifies the incremental repair mode. Can be 'disabled', 'regular', or 'full'. 'regular': The incremental repair logic is enabled. Unrepaired sstables will be included for repair. Repaired sstables will be skipped. The incremental repair states will be updated after repair. 'full': The incremental repair logic is enabled. Both repaired and unrepaired sstables will be included for repair. The incremental repair states will be updated after repair. 'disabled': The incremental repair logic is disabled completely. The incremental repair states, e.g., repaired_at in sstables and sstables_repaired_at in the system.tablets table, will not be updated after repair. When the option is not provided, it defaults to regular. Fixes #25931 Closes scylladb/scylladb#25969	2025-09-16 10:23:22 +03:00
Yaron Kaikov	902d139c80	tools: toolchain: dbuild: add setuptools_scm as dependency this package was added as a dependnancy to `cqlsh` in `216d8b0658` Fixes: https://github.com/scylladb/scylladb/issues/25613 [Yaron: regenerate frozen toolchain with optimized clang from https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-x86_64.tar.gz ] Closes scylladb/scylladb#25932	2025-09-11 08:51:28 +03:00
Botond Dénes	514f59d157	tools/scylla-sstable: write: move to UUID generation We are moving away from integer generations, so stop using them. Also drop the --generation command-line parameter, UUID generations don't have be provided by the caller, because random UUIDs will not collide with each other. To help the caller still know what generation the output sstable has (previously they provided it via --generation), print the generation to stdout. Closes scylladb/scylladb#25166	2025-09-10 13:47:26 +03:00
Andrei Chekun	ea4cd431c9	test.py: add pytest-sugar plugin to the dependencies This plugin allows having better terminal output with progress bar for the tests. Closes scylladb/scylladb#25845 [avi: regenerate frozen toolchain] Closes scylladb/scylladb#25860	2025-09-08 20:50:02 +03:00
Dawid Mędrek	bb0255b2fb	tools/scylla-sstable: Enable rf_rack_valid_keyspaces Enabling the configuration option should have no negative impact on how the tool behaves. There is no topology and we do not create any keyspaces (except for trivial ones using `SimpleStrategy` and RF=1), only their metadata. Thanks to that, we don't go through validation logic that could fail in presence of an RF-rack-invalid keyspace. On the other hand, enabling `rf_rack_valid_keyspaces` lets the tool access code hidden behind that option. While that might not be of any consequence right now, in the future it might be crucial (for instance, see: scylladb/scylladb#23030). Note that other tools don't need an adjustment: * scylla-types: it uses schema_builder, but it doesn't reuse any other relevant part of Scylla. * nodetool: it manages Scylla instances but is not an instance itself, and it does not reuse any codepaths. * local-file-key-generator: it has nothing to do with Scylla's logic. Other files in the `tools` directory are auxiliary and are instructed with an already created instance of `db::config`. Hence, no need to modify them either. Fixes scylladb/scylladb#25792 Closes scylladb/scylladb#25794	2025-09-08 11:52:43 +03:00
Pavel Emelyanov	b26816f80d	s3: Export memory usage gauge (metrics) The memory usage is tracked with the help of a semaphore, so just export its "consumed" units. One tricky place here is the need to skip metrics registration for scylla-sstable tool. The thing is that the tools starts the storage manager and sstables manager on start and then some of tool's operations may want to start both managers again (via cql environment) causing double metrics registration exception. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#25769	2025-09-05 18:25:34 +03:00
Radosław Cybulski	c242234552	Revert "build: add precompiled headers to CMakeLists.txt" This reverts commit `01bb7b629a`. Closes scylladb/scylladb#25735	2025-09-03 09:46:00 +03:00
Radosław Cybulski	01bb7b629a	build: add precompiled headers to CMakeLists.txt Add precompiled header support to CMakeLists.txt and configure.py - it improves compilation time by approximately 10%. New header `stdafx.hh` is added, don't include it manually - the compiler will include it for you. The header contains includes from external libraries used by Scylla - seastar, standard library, linux headers and zlib. The feature is enabled by default, use CMake option `Scylla_USE_PRECOMPILED_HEADER` or configure.py --disable-precompiled-header to disable. The feature should be disabled, when trying to check headers - otherwise you might get false negatives on missing includes from seastar / abseil and so on. Note: following configuration needs to be added to ccache.conf: sloppiness = pch_defines,time_macros Closes #25182	2025-08-27 21:37:54 +03:00
Avi Kivity	352cda4467	treewide: avoid including gms/feature_service.hh from headers To avoid dependency proliferation, switch to forward declarations. In one case, we introduce indirection via std::unique_ptr and deinline the constructor and destructor. Ref #1 Closes scylladb/scylladb#25584	2025-08-20 10:30:27 +03:00
Avi Kivity	611918056a	Merge 'repair: Add tablet incremental repair support' from Asias He The central idea of incremental repair is to allow repair participants to select and repair only a portion of the dataset to speed up the repair process. All repair participants must utilize an identical selection method to repair and synchronize the same selected dataset. There are two primary selection methods: time-based and file-based. The time-based method selects data within a specified time frame. It is versatile but it is less efficient because it requires reading all of the dataset and omitting data beyond the time frame. The file-based method selects data from unrepaired SSTables and is more efficient because it allows the entire SSTable to be omitted. This document patch implements the file-based selection method. Incremental repair will only be supported for tablet tables; it will not be supported for vnode tables. On one hand, the legacy vnode is less important to support. On the other hand, the incremental repair for vnode is much harder to implement. With vnodes, a SSTalbe could contain data for multiple vnode ranges. When a given vnode range is repaired, only a portion of the SSTable is repaired. This complicates the manipulation of SSTables significantly during both repair and compaction. With tablets, an entire tablet is repaired so that a sstable is either fully repaired or not repaired which is a huge simplification. This patch uses the repaired_at from sstables::statistics component to mark a sstable as repaired. It uses a virtual clock as the repair timestamp, i.e., using a monotonically increasing number for the repaired_at field of a SSTable and sstables_repaired_at column in system.tablets table. Notice that when a sstable is not repaired, the repaired_at field will be set to the default value 0 by default. The being_repaired in memory field of a SSTable is used to explicitly mark that a SSTable is being selected. The following variables are used for incremental repair: The repaired_at on disk field of a SSTable is used. - A 64-bit number increases sequentially The sstables_repaired_at is added to the system.tablets table. - repaired_at <= sstables_repaired_at means the sstable is repaired The being_repaired in memory field of a SSTable is added. - A repair UUID tells which sstable has participated in the repair Initial test results: 1) Medium dataset results Node amount: 3 Instance type: i4i.2xlarge Disk usage per node: ~500GB Cluster pre-populated with ~500GB of data before starting repairs job. Results for Repair Timings: The regular repair run took 210 mins. Incremental repair 1st run took 183 mins, 2nd and 3rd runs took around 48s The speedup is: 183 mins / 48s = 228X 2) Small dataset results Node amount: 3 Instance type: i4i.2xlarge Disk usage per node: ~167GB Cluster pre-populated with ~167GB of data before starting the repairs job. Regular repair 1st run took 110s, 2nd and 3rd runs took 110s. Incremental repair 1st run took 110 seconds, 2nd and 3rd run took 1.5 seconds. The speedup is: 110s / 1.5s = 73X 3) Large dataset results Node amount: 6 Instance type: i4i.2xlarge, 3 racks 50% of base load, 50% read/write Dataset == Sum of data on each node Dataset Non-incremental repair (minutes) 1.3 TiB 31:07 3.5 TiB 25:10 5.0 TiB 19:03 6.3 TiB 31:42 Dataset Incremental repair (minutes) 1.3 TiB 24:32 3.0 TiB 13:06 4.0 TiB 5:23 4.8 TiB 7:14 5.6 TiB 3:58 6.3 TiB 7:33 7.0 TiB 6:55 Fixes #22472 Closes scylladb/scylladb#24291 * github.com:scylladb/scylladb: replica: Introduce get_compaction_reenablers_and_lock_holders_for_repair compaction: Move compaction_reenabler to compaction_reenabler.hh topology_coordinator: Make rpc::remote_verb_error to warning level repair: Add metrics for sstable bytes read and skipped from sstables test.py: Disable incremental for test_tombstone_gc_for_streaming_and_repair test.py: Add tests for tablet incremental repair repair: Add tablet incremental repair support compaction: Add tablet incremental repair support feature_service: Add TABLET_INCREMENTAL_REPAIR feature tablet_allocator: Add tablet_force_tablet_count_increase and decrease repair: Add incremental helpers sstable: Add being_repaired to sstable sstables: Add set_repaired_at to metadata_collector mutation_compactor: Introduce add operator to compaction_stats tablet: Add sstables_repaired_at to system.tablets table test: Fix drain api in task_manager_client.py	2025-08-19 13:13:22 +03:00
Asias He	f9021777d8	compaction: Add tablet incremental repair support This patch addes incremental_repair support in compaction. - The sstables are split into repaired and unrepaired set. - Repaired and unrepaired set compact sperately. - The repaired_at from sstable and sstables_repaired_at from system.tablets table are used to decide if a sstable is repaired or not. - Different compactions tasks, e.g., minor, major, scrub, split, are serialized with tablet repair.	2025-08-18 11:01:21 +08:00
Avi Kivity	66173c06a3	Merge 'Eradicate the ability to create new sstables with numerical sstable generation' from Benny Halevy Remove support for generating numerical sstable generation for new sstables. Loading such sstables is still supported but new sstables are always created with a uuid generation. This is possible since: * All live versions (since 5.4 / `f014ccf369`) now support uuid sstable generations. * The `uuid_sstable_identifiers_enabled` config option (that is unused from version 2025.2 / `6da758d74c`) controls only the use of uuid generations when creating new sstables. SSTables with uuid generations should still be properly loaded by older versions, even if `uuid_sstable_identifiers_enabled` is set to `false`. Fixes #24248 * Enhancement, no backport needed Closes scylladb/scylladb#24512 * github.com:scylladb/scylladb: streaming: stream_blob: use the table sstable_generation_generator replica: distributed_loader: process_upload_dir: use the table sstable_generation_generator sstables: sstable_generation_generator: stop tracking highest generation replica: table: get rid of update_sstables_known_generation sstables: sstable_directory: stop tracking highest_generation replica: distributed_loader: stop tracking highest_generation sstables: sstable_generation: get rid of uuid_identifiers bool class sstables_manager: drop uuid_sstable_identifiers feature_service: move UUID_SSTABLE_IDENTIFIERS to supported_feature_set test: cql_query_test: add test_sstable_load_mixed_generation_type test: sstable_datafile_test: move copy_directory helper to test/lib/test_utils test: database_test: move table_dir helper to test/lib/test_utils	2025-08-14 11:54:33 +03:00
Israel Fruchter	2da26d1fc1	Update tools/cqlsh submodule (v6.0.26) * tools/cqlsh 02ec7c57...aa1a52c1 (6): > build-push.yaml: upgrade cibuildwheel to latest > build-push.yml: skip python 3.8 and PyPy builds > cqlshlib: make NetworkTopologyStrategy default for autocomplete > default to setuptools_scm based version when not packaged > chore(deps): update pypa/cibuildwheel action to v2.23.0 Closes scylladb/scylladb#25420	2025-08-11 13:07:47 +03:00
Avi Kivity	f49b63f696	tools: toolchain: dbuild: forward container registry credentials Docker hub rate-limits unauthenticated image pulls, so forward the host's credentials to the container. This prevents rate limit errors when running nested containers. Try the locations for the credentials in order and bind-mount the first that exists to a location that gets picked up. Verified with `podman login --get-login docker.io` in the container. Closes scylladb/scylladb#25354	2025-08-11 09:05:57 +03:00
Benny Halevy	6cc964ef16	sstables: sstable_generation: get rid of uuid_identifiers bool class Now that all call sites enable uuid_identifiers. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-08-08 11:46:21 +03:00
Raphael S. Carvalho	9d3755f276	replica: Futurize retrieval of sstable sets in compaction_group_view This will allow upcoming work to gently produce a sstable set for each compaction group view. Example: repaired and unrepaired. Locking strategy for compaction's sstable selection: Since sstable retrieval path became futurized, tasks in compaction manager will now hold the write lock (compaction_state::lock) when retrieving the sstable list, feeding them into compaction strategy, and finally registering selected sstables as compacting. The last step prevents another concurrent task from picking the same sstable. Previously, all those steps were atomic, but we have seen stall in that area in large installations, so futurization of that area would come sooner or later. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2025-08-08 06:58:00 +03:00
Raphael S. Carvalho	2c4a9ba70c	treewide: Rename table_state to compaction_group_view Since table_state is a view to a compaction group, it makes sense to rename it as so. With upcoming incremental repair, each replica::compaction_group will be actually two compaction groups, so there will be two views for each replica::compaction_group. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2025-08-08 06:51:28 +03:00
Benny Halevy	5e5e63af10	scylla-sstable: print_query_results_json: continue loop if row is disengaged Otherwise it is accessed right when exiting the if block. Add a unit test reproducing the issue and validating the fix. Fixes #25325 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#25326	2025-08-06 16:44:51 +03:00
Pavel Emelyanov	0616407be5	Merge 'rest_api: add endpoint which drops all quarantined sstables' from Taras Veretilnyk Added a new POST endpoint `/storage_service/drop_quarantined_sstables` to the REST API. This endpoint allows dropping all quarantined SSTables either globally or for a specific keyspace and tables. Optional query parameters `keyspace` and `tables` (comma-separated table names) can be provided to limit the scope of the operation. Fixes scylladb/scylladb#19061 Backport is not required, it is new functionality Closes scylladb/scylladb#25063 * github.com:scylladb/scylladb: docs: Add documentation for the nodetool dropquarantinedsstables command nodetool: add command for dropping quarantine sstables rest_api: add endpoint which drops all quarantined sstables	2025-08-06 11:55:15 +03:00
Andrei Chekun	4c33ff791b	build: add pytest-timeout to the toolchain Adding this plugin allows using timeout for a test or timeout for the whole session. This can be useful for Unit Test Custom task in the pipeline to avoid running tests is batches, that will mess with the test names later in Jenkins. Closes #25210 [avi: regenerate frozen toolchain with optimized clang from https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-x86_64.tar.gz ] Closes scylladb/scylladb#25243	2025-07-30 12:53:10 +03:00
Avi Kivity	d3cdb88fe7	tools: toolchain: dbuild: increase depth of nested podman configuration coverage The initial support for nested containers (`2d2a2ef277`) worked on my machine (tm) and even laptop, but does not work on fresh installs. This is likely due to changes in where persistent configuration is stored on the host between various podman versions; even though my podman is fully updated, it uses configuration created long ago. Make nested containers work on fresh installs by also configuring /etc/containers/storage.conf. The important piece is to set graphroot to the same location as the host. Verified both on my machine and on a fresh install. Closes scylladb/scylladb#25156	2025-07-29 08:23:41 +03:00
Botond Dénes	f3ed27bd9e	Merge 'Move feature-service config creation code out of feature-service itself' from Pavel Emelyanov Nowadays the way to configure an internal service is 1. service declares its config struct 2. caller (main/test/tool) fills the respective config with values it wants 3. the service is started with the config passed by value The feature service code behaves likewise, but provides a helper method to create its config out of db::config. This PR moves this helper out of gms code, so that it doesn't mess with system-wide db::config and only needs its own small struct feature_config. For the reference: similar changes with other services: #23705 , #20174 , #19166 Closes scylladb/scylladb#25118 * github.com:scylladb/scylladb: gms,init: Move get_disabled_features_from_db_config() from gms code: Update callers generating feature service config gms: Make feature_config a simple struct gms: Split feature_config_from_db_config() into two	2025-07-29 08:17:49 +03:00
Taras Veretilnyk	3bc9ee10d1	nodetool: add command for dropping quarantine sstables - Add dropquarantinedsstables command to remove quarantined SSTables - Support both flag-based (--keyspace, --table) and positional arguments - Allow targeting all keyspaces, specific keyspace, or keyspace with specified tables Fixes scylladb/scylladb#19061	2025-07-28 16:55:17 +02:00
Robert Bindar	d921a565de	Add open-coredump script depndencies to install-dependencies.sh Whilst the coredump script checks for prerequisites, the user experience is not ideal because you either have to go in the script and get the list of deps and install them or wait for the script to complain about lacking dependencies one by one. This commit completes the list of dependencies in the install script (some of them were already there for Fedora), so you already have them installed by the time you get to run the coredump script. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com> [avi: - remove trailing whitespace - regenerate frozen toolchain Optimized clang binaries generated and stored in https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-x86_64.tar.gz ] Closes #22369 Closes scylladb/scylladb#25203	2025-07-28 06:45:01 +03:00
Avi Kivity	1930f3e67f	Merge 'sstables/mx/reader: accommodate inexact partition indexes' from Michał Chojnowski Unlike the currently-used sstable index files, BTI indexes don't store the entire partition keys. They only store prefixes of decorated keys, up to the minimum length needed to differentiate a key from its neighbours in the sstable. This saves space. However, it means that a BTI index query might be off by one partition (on each end of the queried partition range) with respect to the optimal Data position. For example, if the index stores prefixes `a`, `b`, `c`, the index has no way to know if the first index entry after key `bb` is `b` (which might correspond to `ba` as well as `bc`), or `c`. So the index reader conservatively has to pick the wider Data range, and the Data reader must ignore the superfluous partitions. (And there's no way around that.) Before this patch, the sstable reader expects the index query to return an exact (optimal) Data range. This patch adjusts the logic of the sstable reader to allow for inexact ranges. Note: the patch is more complicated that it looks. The logic of the sstable reader was already fairly hard to follow and this adds even more flags, more weird special states and more edge cases. I think I managed to write a decent test and it did find three or four edge cases I wouldn't have noticed otherwise. I think it should cover all the added logic, but I didn't verify code coverage. (Do our scripts for that even work nowadays)? Simplification ideas are welcome. Preparation for new functionality, no backporting needed. Closes scylladb/scylladb#25093 * github.com:scylladb/scylladb: sstables/index_reader: weaken some exactness guarantees in abstract_index_reader test/boost: add a test for inexact index lookups sstables/mx/reader: allow passing a custom index reader to the constructor sstables/index_reader: remove advance_to sstables/mx/reader: handle inexact lookups in `advance_context()` sstables/mx/reader: handle inexact lookups in `advance_to_next_partition()` sstables/index_reader: make the return value of `get_partition_key` optional sstables/mx/reader: handle "backward jumps" in forward_to sstables/mx/reader: filter out partitions outside the queried range sstables/mx/reader: update _pr after `fast_forward_to`	2025-07-27 19:39:36 +03:00
Michał Chojnowski	141895f9eb	sstables/index_reader: make the return value of `get_partition_key` optional BTI indexes only store encoded prefixes of partition keys, not the whole keys. They can't reliably implement `get_partition_key`. The index reader interface must be weakened and callers must be adapted.	2025-07-25 11:00:18 +02:00
Ernest Zaslavsky	d2c5765a6b	treewide: Move keys related files to a new keys directory As requested in #22102, #22103 and #22105 moved the files and fixed other includes and build system. Moved files: - clustering_bounds_comparator.hh - keys.cc - keys.hh - clustering_interval_set.hh - clustering_key_filter.hh - clustering_ranges_walker.hh - compound_compat.hh - compound.hh - full_position.hh Fixes: #22102 Fixes: #22103 Fixes: #22105 Closes scylladb/scylladb#25082	2025-07-25 10:45:32 +03:00
Ran Regev	dd67d22825	nodetool restore: sstable list from a file Fixes: #25045 added the ability to supply the list of files to restore from the a given file. mainly required for local testing. Signed-off-by: Ran Regev <ran.regev@scylladb.com> Closes scylladb/scylladb#25077	2025-07-22 09:11:02 +03:00
Pavel Emelyanov	8220974e76	code: Update callers generating feature service config Instead of requesting it from gms code, create it "by hand" with the help of get_disabled_features_from_db_config() method. This is how other services are configured by main/tools/testing code. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-07-21 19:19:09 +03:00
Botond Dénes	20693edb27	Merge 'sstables: put index_reader behind a virtual interface' from Michał Chojnowski This is a refactoring patch in preparation for BTI indexes. It contains no functional changes (or at least it's not intended to). In this patch, we modify the sstable readers to use index readers through a new virtual `abstract_index_readers` interface. Later, we will add BTI indexes which will also implement this interface. This interface contains the methods of `index_reader` which are needed by sstable readers, and leaves out all other methods, such as `current_clustered_cursor`. Not all methods of this interface will be implementable by a trie-based index later. For example, a trie-based index can't provide a reliable `get_partition_key()`, because — unlike the current index — it only stores partition keys for partitions which have a row index. So the interface will have to be further restricted later. We don't do that in this patch because that will require changes to sstable reader logic, and this patch is supposed to only include cosmetic changes. No backports needed, this is a preparation for new functionality. Closes scylladb/scylladb#25000 * github.com:scylladb/scylladb: sstables: add sstable::make_index_reader() and use where appropriate sstables/mx: in readers, use abstract_index_reader instead of index_reader sstables: in validate(), use abstract_index_reader instead of index_reader where possible test/lib/index_reader_assertions: accept abstract_index_reader instead of index_reader sstables/index_reader: introduce abstract_index_reader sstables/index_reader: extract a prefetch_lower_bound() method	2025-07-17 14:32:08 +03:00
Michał Chojnowski	4e4a4b6622	sstables: add sstable::make_index_reader() and use where appropriate If we add multiple index implementations, users of index readers won't easily know which concrete index reader type is the right one to construct. We also don't want pieces of code to depend on functionality specific to certain concrete types, if that's not necessary. So instead of constructing the readers by themselves, they can use a helper function, which will return an abstract (virtual) index reader. This patch adds such a function, as a method of `sstable`.	2025-07-17 10:32:57 +02:00
Botond Dénes	a26b6a3865	Merge 'storage: add `make_data_or_index_source` to the storages' from Ernest Zaslavsky Add `make_data_or_index_source` to the storages to utilize new S3 based data source which should improve restore performance * Introduce the `encrypted_data_source` class that wraps an existing data source to read and decrypt data on the fly using block encryption. Also add unit tests to verify correct decryption behavior. * Add `make_data_or_index_source` to the `storage` interface, implement it for `filesystem_storage` storage which just creates `data_source` from a file and for the `s3_storage` create a (maybe) decrypting source from s3 make_download_source. This change should solve performance improvement for reading large objects from S3 and should not affect anything for the `filesystem_storage` No backport needed since it enhances functionality which has not been released yet fixes: https://github.com/scylladb/scylladb/issues/22458 Closes scylladb/scylladb#23695 * github.com:scylladb/scylladb: sstables: Start using `make_data_or_index_source` in `sstable` sstables: refactor readers and sources to use coroutines sstables: coroutinize futurized readers sstables: add `make_data_or_index_source` to the `storage` encryption: refactor key retrieval encryption: add `encrypted_data_source` class	2025-07-15 13:32:13 +03:00
Ernest Zaslavsky	dff9a229a7	sstables: refactor readers and sources to use coroutines Refactor readers and sources to support coroutine usage in preparation for integration with `make_data_or_index_source`. Move coroutine-based member initialization out of constructors where applicable, and defer initialization until first use.	2025-07-15 10:10:23 +03:00
Marcin Maliszkiewicz	19bc6ffcb0	replica: make truncate_table_on_all_shards get whole schema from table_shards Before for views and indexes it was fetching base schema from db (and couple other properties). This is a problem once we introduce atomic tables and views deletion (in the following commit). Because once we delete table it can no longer be fetched from db object, and truncation is performed after atomically deleting all relevant tables/views/indexes. Now the whole relevant schema will be fetched via global_table_ptr (table_shards) object.	2025-07-10 10:40:43 +02:00
Marcin Maliszkiewicz	2e69016c4f	db: access types during schema merge via special storage Once we create types atomically the code which is before commit may depend on newly added types, so it has to access both old and new types. New storage called in_progress_types_storage was added.	2025-07-10 10:40:42 +02:00
Avi Kivity	1e0b015c8b	Merge 'cql3: Represent create_statement using managed_bytes' from Dawid Mędrek When describing a table, we need to do it carefully: if some columns were dropped, we must specify that explicitly by ``` ALTER TABLE {table} DROP {column} USING TIMESTAMP ... ``` in the result of the DESCRIBE statement. Failing to do so could lead to data resurrection. However, if a table has been altered many, many times, we might end up with a huge create statement. Constructing it could, in turn, trigger an oversized allocation. Some tests ran into that very problem in fact. In this commit, we want to mitigate the problem: instead of allocating a contiguous chunk of memory for the create statement, we use `bytes_ostream` and `managed_bytes` to possibly keep data scattered in memory. It makes handling `cql3::description` less convenient in the code, but since the struct is pretty much immediately serialized after creating it, it's a very good trade-off. A reproducer is intentionally not provided by this commit: it's easy to test the change, but adding and dropping a huge number of columns would take a really long amount of time, so we need to omit it. Fixes scylladb/scylladb#24018 Backport: all of the supported versions are affected, so we want to backport the changes there. Closes scylladb/scylladb#24151 * github.com:scylladb/scylladb: cql3/description: Serialize only rvalues of description cql3: Represent create_statement using managed_string cql3/statements/describe_statement.cc: Don't copy descriptions cql3: Use managed_bytes instead of bytes in DESCRIBE utils/managed_string.hh: Introduce managed_string and fragmented_ostringstream	2025-07-01 21:59:38 +03:00
Avi Kivity	609cc20d22	build: update toolchain to Fedora 42 with clang 20.1 and libstdc++ 15 Rebase to Fedora 42 with clang 20.1 and libstdc++ 15. JAVA8_HOME environment variable dropped since we no longer use it. cassandra-stress package updated with version that doesn't depend on no-longer-available Java 11. Optimized clang binaries generates and stored in https://devpkg.scylladb.com/clang/clang-20.1.7-Fedora-42-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-20.1.7-Fedora-42-x86_64.tar.gz Closes scylladb/scylladb#23978	2025-07-01 14:39:47 +03:00
Dawid Mędrek	ac9062644f	cql3: Represent create_statement using managed_string When describing a table, we need to do it carefully: if some columns were dropped, we must specify that explicitly by ``` ALTER TABLE {table} DROP {column} USING TIMESTAMP ... ``` in the result of the DESCRIBE statement. Failing to do so could lead to data resurrection. However, if a table has been altered many, many times, we might end up with a huge create statement. Constructing it could, in turn, trigger an oversized allocation. Some tests ran into that very problem in fact. In this commit, we want to mitigate the problem: instead of allocating a contiguous chunk of memory for the create statement, we use `fragmented_ostringstream` and `managed_string` to possibly keep data scattered in memory. It makes handling `cql3::description` less convenient in the code, but since the struct is pretty much immediately serialized after creating it, it's a very good trade-off. We provide a reproducer. It consistently passes with this commit, while having about 50% chance of failure before it (based on my own experiments). Playing with the parameters of the test doesn't seem to improve that chance, so let's keep it as-is. Fixes scylladb/scylladb#24018	2025-07-01 12:58:02 +02:00

1 2 3 4 5 ...

1139 Commits