scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 17:40:34 +00:00

Author	SHA1	Message	Date
Nadav Har'El	32afcdbaf0	test/alternator: enable, and add, tests for gzip'ed requests After in the previous patch we implemented support in Alternator for gzip-compressed requests ("Content-Encoding: gzip"), here we enable an existing xfail-ing test for this feature, and also add more tests for more cases: * A test for longer compressed requests, or a short compressed request which expands to a longer request. Since the decompression uses small buffers, this test reaches additional code paths. * Check for various cases of a malformed gzip'ed request, and also an attempt to use an unsupported Content-Encoding. DynamoDB returns error 500 for both cases, so we want to test that we do to - and not silently ignore such errors. * Check that two concatenated gzip'ed streams is a valid request, and check that garbage at the end of the gzip - or a missing character at the end of the gzip - is recognized as an error. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-27 09:42:47 +02:00
Nadav Har'El	4c7c5f4af7	alternator: implement gzip-compressed requests In this patch we implement Alternator's support for gzip-compressed requests, i.e., requests with the "Content-Encoding: gzip" header, other uncompressed headers, and a gzip-compressed body. The server needs to verify the signature of the compressed content, and then uncompress the body before running the request. We only support gzip compression because this is what DynamoDB supports. But in the future we can easily add support for other compression algorithms like lz4 or zstd. This patch Refs #5041 but doesn't "Fixes" it because it only implements compressed requests (Content-Encoding), not compressed responses (Accept-Encoding). The next patch will enable several tests for this feature and make sure it behaves like DynamoDB. Note that while we will have now support in our server for compressed requests, just like DynamoDB does, the clients (AWS SDKs) will probably NOT make use of it because they do not enable request compression by default. For example, see the tests for some hoops one needs to jump through in boto3 (the Python SDK) to send compressed requests. However, we are hoping that in the future Alternator's modified clients will use compressed requests and enjoy this feature. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-25 17:46:44 +02:00
dependabot[bot]	b911a643fd	build(deps): bump sphinx-scylladb-theme from 1.8.8 to 1.8.9 in /docs Bumps [sphinx-scylladb-theme](https://github.com/scylladb/sphinx-scylladb-theme) from 1.8.8 to 1.8.9. - [Release notes](https://github.com/scylladb/sphinx-scylladb-theme/releases) - [Commits](https://github.com/scylladb/sphinx-scylladb-theme/commits) --- updated-dependencies: - dependency-name: sphinx-scylladb-theme dependency-version: 1.8.9 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Closes scylladb/scylladb#27169	2025-11-25 11:01:37 +02:00
Botond Dénes	1263e1de54	Merge 'docs: modify debian/ubutnu installation instructions' from Yaron Kaikov To support debian13, we need to modify the installation instructions since `apt-key` command is no longer available Also updated installation instruction to match the latest release Fixes: https://github.com/scylladb/scylladb/issues/26673 No need for backport since we added debian13 only in master for now Closes scylladb/scylladb#27205 * github.com:scylladb/scylladb: install-on-linux.rst: update installation example to supported release docs: modify debian/ubutnu installation instructions	2025-11-25 10:53:11 +02:00
Nadav Har'El	bcd1758911	Merge 'vector_search: add validator tests' from Pawel Pery The vector-search-validator is a binary tool which do functional and integration tests between scylla and vector-store. It is build in Rust mainly in vector-store repository. This patch adds possibility to write tests on scylladb repository side, compile them together with vector-store tests and run them in `test.py` environment. There are three parts of the change: - add sources of validator to the `test/vector_search_validator` directory - add support for building validator and vector-store in `build/vector-search-validator/bin` directory with or without cmake - add support for `pytest` and `test.py` to run validator test locally and in the CI environment; this part adds also README to the `test/vector_search_validator` directory Design for validator integration tests: https://scylladb.atlassian.net/wiki/spaces/RND/pages/39518215/Vector+Search+Core+Test+Plan+Document References: VECTOR-50 No backport needed as this is a new functionality. Closes scylladb/scylladb#26653 * github.com:scylladb/scylladb: vector_search: add vector-search-validator tests vector_search: implement building vector-search-validator vector_search: add vector-search-validator sources	2025-11-25 10:34:33 +02:00
Amnon Heiman	b2c2a99741	index/vector_index.cc: Don't allow zero as an index option This patch forces vector_index option value to be real-positive numbers as zero would make no senese. Fixes https://scylladb.atlassian.net/browse/VECTOR-249 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Closes scylladb/scylladb#27191	2025-11-25 10:05:44 +02:00
Karol Nowacki	ca62effdd2	vector_search: Restrict vector index tests to tablets only Vector indexes are going to be supported only for tablets (see VECTOR-322). As a result, tests using vector indexes will be failing when run with vnodes. This change ensures tests using vector indexes run exclusively with tablets. Fixes: VECTOR-49 Closes scylladb/scylladb#26843	2025-11-25 09:26:16 +02:00
Pawel Pery	9f10aebc66	vector_search: add vector-search-validator tests The commit adds a functionality for `pytest` and `test.py` to run `vector-search-validator` in `sudo unshare` environment. There are already two tests - first parametrized `test_validator.py::test_validator[test-case-name]` (run validator) and second `test_cargo_toml.py::test_cargo_toml` (check if the current `Cargo.toml` for validator is correct). Documentation for these tests are provided in `README.md`.	2025-11-24 17:26:04 +01:00
Pawel Pery	3702e982b9	vector_search: implement building vector-search-validator The commit adds targets building `build/vector-search-validator/bin/{vector-store,vector-search-validator}. The targets must be build for tests. They don't depend on build mode. The commit adds target in `configure.py` and also in `cmake`.	2025-11-24 17:26:04 +01:00
Pawel Pery	e569a04785	vector_search: add vector-search-validator sources The commit adds validator sources uses combination of local files and vector-store's files. In `build-env` there are definition of vector-store git repository and revision on which validator will be built. `cargo-toml-template` is script for printing current `Cargo.toml` to the stdout. After updating `build-env` developer needs to update new configuration with `./cargo-toml-template > Cargo.toml`. Git revision is used in several places in `Cargo.toml` and will be used for building `vector-store`, so for better handling git revision it should be setup only in one place. The validator is divided into several crates to be able to built it within scylladb and vector-store repositories. Here we need to create a new validator crate with simple `main` function and call `validator_engine::main` there. We provide tests written in scylladb repo in `validator-scylla` crate. The commit provides empty `cql` test case, which should be filled in the future.	2025-11-24 17:26:04 +01:00
Gleb Natapov	39cec4ae45	topology: let banned node know that it is banned Currently if a banned node tries to connect to a cluster it fails to create connections, but has no idea why, so from inside the node it looks like it has communication problems. This patch adds new rpc NOTIFY_BANNED which is sent back to the node when its connection is dropped. On receiving the rpc the node isolates itself and print an informative message about why it did so. Closes scylladb/scylladb#26943	2025-11-24 17:12:13 +01:00
Tomasz Grabiec	d4b77c422f	Merge 'load_stats: leaving replica could be std::nullopt' from Ferenc Szili When migrating tablet size during the end_migration tablet transition stage, we need the pending and leaving replica hosts. The leaving and pending replicas are gathered in objects of type std::optional<tablet_replica> and are not checked if they contain a value before dereferencing which could cause an exception in the topology coordinator. This patch adds a check for leaving and pending replicas, and only performs the tablet size migration if neither are empty. This bug was introduced in `10f07fb95a` This change also adds the ability to create a tablet size in load_stats during end_migration stage of a tablet rebuild. We compute the new tablet size from by averaging the tablet sizes of the existing replicas. This change also adds the virtual table tablet_sizes which contains tablet sizes of all the replicas of all the tablets in the cluster. A version containing this bug has not yet been released, so a backport is not needed. Closes scylladb/scylladb#27118 * github.com:scylladb/scylladb: test: add tests for tablet size migration during end_migration virtual_table: add tablet_sizes virtual table load_stats: update tablet sizes after migration or rebuild	2025-11-24 15:31:30 +01:00
Yaron Kaikov	13eca61d41	install-on-linux.rst: update installation example to supported release Example of installation is out of date, since scylla-5.2 is EOL for long time upding the example for more recent release (together with packages update)	2025-11-24 16:22:17 +02:00
Anna Stuchlik	724dc1e582	doc: fix the info about object storage This commit fixes the information about object storage: - Object storage configuration is no longer marked as experimental. - Redundant information has been removed from the description. - Information related to object storage for SStabels has been removed as the feature is not working. Fixes https://github.com/scylladb/scylladb/issues/26985 Closes scylladb/scylladb#26987	2025-11-24 17:16:33 +03:00
Yaron Kaikov	5541f75405	docs: modify debian/ubutnu installation instructions To support debian13, we need to modify the installation instructions since `apt-key` command is no longer available Fixes: https://github.com/scylladb/scylladb/issues/26673	2025-11-24 13:33:17 +02:00
Avi Kivity	eb5e9f728c	build: lock cxxbridge-cmd version to the rest of the cxx packages rust/Cargo.toml locks the cxx packages to version 1.0.83, but install-dependencies.sh does not lock cxxbridge-cmd, part of that ecosystem. Since cxx 1.0.189 broke compatibility with 1.0.83 (understandable, as these are all sub-packages of a single repository), builds with newer cxxbridge-cmd are broken. Fix by locking cxxbridge-cmd to the same version as the other cxx subpackages. Regenerated frozen toolchain with optimized clang from https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-20.1.8-Fedora-42-x86_64.tar.gz Probably better done by building cxxbridge-cmd during the build itself, but that is a deeper change. Fixes #27176 Closes scylladb/scylladb#27177	2025-11-24 07:04:53 +02:00
Avi Kivity	d6ef5967ef	tools: toolchain: prepare: replace 'reg' with 'skopeo' The prepare scripts uses 'reg' to verify we're not going to overwrite an existing image. The 'reg' command is not available in Fedora 43. Use 'skopeo' instead. Skopeo is part of the podman ecosystem so hopefully will live longer. Fixes #27178. Closes scylladb/scylladb#27179	2025-11-24 06:59:34 +02:00
Aleksandra Martyniuk	19a7d8e248	replica: database: change type of tables_metadata::_ks_cf_to_uuid If there is a lot of tables, a node reports oversized allocation in _ks_cf_to_uuid of type flat_hash_map. Change the type to std::unordered_map to prevent oversized allocations. Fixes: https://github.com/scylladb/scylladb/issues/26787. Closes scylladb/scylladb#27165	2025-11-24 06:42:40 +02:00
Botond Dénes	296d7b8595	Merge 'Enable digest+checksum verification for file based streaming' from Taras Veretilnyk This patch enables integrity check in 'create_stream_sources()' by introducing a new 'sstable_data_stream_source_impl' class for handling the Data component of SSTables. The new implementation uses 'sstable::data_stream()' with 'integrity_check::yes' instead of the raw input_stream. These additional checks require reading the digest and CRC components from disk, which may introduce some I/O overhead. For uncompressed SSTables, this involves loading and computing checksums and digest from the data. For compressed SSTables - where checksums are already embedded - the cost comes from reading, calculating and verifying the diges. New test cases were added to verify that the integrity checks work correctly, detecting both data and digest mismatches. Backport is not required, since it is a new feature Fixes #21776 Closes scylladb/scylladb#26702 * github.com:scylladb/scylladb: file_stream_test: add sstable file streaming integrity verification test cases streaming: prioritize sender-side errors in tablet_stream_files sstables: enable integrity check for data file streaming sstables: Add compressed raw streaming support sstables: Allow to read digest and checksum from user provided file instance sstables: add overload of data_stream() to accept custom file_input_stream_options	2025-11-24 06:37:27 +02:00
Aleksandra Martyniuk	76174d1f7a	cql3: reject ALTER KEYSPACE if rf of datacenter with tablets is omitted In ALTER KEYSPACE, when a datacenter name is omitted, its replication factor is implicitly set to zero with vnodes, while with tablets, it remains unchanged. ALTER KEYSPACE should behave the same way for tablets as it does for vnodes. However, this can be dangerous as we may mistakenly drop the whole datacenter. Reject ALTER KEYSPACE if it changes replication factor, but omits a datacenter that currently contains tablet replicas. Fixes: https://github.com/scylladb/scylladb/issues/25549. Closes scylladb/scylladb#25731	2025-11-24 06:36:51 +02:00
Avi Kivity	85db7b1caf	Merge 'address_map: Use more efficient and reliable replication method' from Tomasz Grabiec Primary issue with the old method is that each update is a separate cross-shard call, and all later updates queue behind it. If one of the shards has high latency for such calls, the queue may accumulate and system will appear unresponsive for mapping changes on non-zero shards. This happened in the field when one of the shards was overloaded with sstables and compaction work, which caused frequent stalls which delayed polling for ~100ms. A queue of 3k address updates accumulated, because we update mapping on each change of gossip states. This made bootstrap impossible because nodes couldn't learn about the IP mapping for the bootstrapping node and streaming failed. To protect against that, use a more efficient method of replication which requires a single cross-shard call to replicate all prior updates. It is also more reliable, if replication fails transiently for some reason, we don't give up and fail all later updates. Fixes #26865 Closes scylladb/scylladb#26941 * github.com:scylladb/scylladb: address_map: Use barrier() to wait for replication address_map: Use more efficient and reliable replication method utils: Introduce helper for replicated data structures	2025-11-23 19:15:12 +02:00
Avi Kivity	b0643f8959	Merge 'db/config: enable `ms` sstable format by default' from Michał Chojnowski Trie-based sstable indexes are supposed to be (hopefully) a better default than the old BIG indexes. Make them the new default. If we change our mind, this change can be reverted later. New functionality, and this is a drastic change. No backport needed. Closes scylladb/scylladb#26377 * github.com:scylladb/scylladb: db/config: enable `ms` sstable format by default cluster/dtest/bypass_cache_test: switch from highest_supported_sstable_format to chosen_sstable_format api/system: add /system/chosen_sstable_version test/cluster/dtest: reduce num_tokens to 16	2025-11-23 13:52:57 +02:00
Piotr Dulikowski	e8b0f8faa9	Merge 'vector search: Add HTTPS requests support' from Karol Nowacki vector_search: Add HTTPS support for vector store connections This commit introduces TLS encryption support for vector store connections. A new configuration option is added: - vector_store_encryption_options.truststore: path to the trust store file To enable secure connections, use the https:// scheme in the vector_store_primary_uri/vector_store_secondary_uri configuration options. Fixes: VECTOR-327 Backport to 2025.4 as this feature is expected to be available in 2025.4. Closes scylladb/scylladb#26935 * github.com:scylladb/scylladb: test: vector_search: Ensure all clients are stopped on shutdown vector_search: Add HTTPS support for vector store connections	2025-11-22 14:58:06 +01:00
Karol Nowacki	58456455e3	test: vector_search: Ensure all clients are stopped on shutdown A flaky test revealed that after `clients::stop()` was called, the `old_clients` collection was sometimes not empty, indicating that some clients were not being stopped correctly. This resulted in sanitizer errors when objects went out of scope at the end of the test. This patch modifies `stop()` to ensure all clients, including those in `old_clients`, are stopped, guaranteeing a clean shutdown.	2025-11-22 08:18:45 +01:00
Karol Nowacki	c40b3ba4b3	vector_search: Add HTTPS support for vector store connections This commit introduces TLS encryption support for vector store connections. A new configuration option is added: - vector_store_encryption_options.truststore: path to the trust store file To enable secure connections, use the https:// scheme in the vector_store_primary_uri/vector_store_secondary_uri configuration options. Fixes: VECTOR-327	2025-11-22 08:18:45 +01:00
Ferenc Szili	39711920eb	test: add tests for tablet size migration during end_migration This change adds tests for the correctness of tablet size migration during the end_migrations stage. This size migration can happend for tablet migrations and for tablet rebuild.	2025-11-21 16:58:11 +01:00
Ferenc Szili	e96863be0c	virtual_table: add tablet_sizes virtual table This change adds the tablet_sizes virtual table. The contents of this table are gathered from the current load_stats data structure.	2025-11-21 16:53:28 +01:00
Ferenc Szili	cede4f66af	load_stats: update tablet sizes after migration or rebuild When migrating tablet size during the end_migration tablet transition stage, we need the pending and leaving replica hosts. The leaving and pending replicas are gathered in objects of type std::optional<tablet_replica> and are not checked if they contain a value before dereferencing which could cause an exception in the topology coordinator. This patch adds a check for leaving and pending replicas, and only perfoms the tablet size migration if neither are empty. This bug was introduced in `10f07fb95a` This change also adds the functionality to add the tablet size to load_stats after a tablet rebuild. We compute the average tablet size from the existing replicas, and add the new size to the pending replica.	2025-11-21 16:22:20 +01:00
Botond Dénes	38a1b1032a	Merge 'doc: update Cloud Instance Recommendations for GCP' from Anna Stuchlik This PR: - Removes n1-highmem instances from Recommended Instances. - Adds missing support for n2-highmem-96. - Updates the reference to n2 instances in the Google Cloud docs (fixes a broken link to GCP). - Adds the missing information about processors for n2-highmem-instance - Ice Lake and Cascade Lake (requested by CX). Fixes https://github.com/scylladb/scylladb/issues/25946 Fixes https://github.com/scylladb/scylladb/issues/24223 Fixes https://github.com/scylladb/scylladb/issues/23976 No backport needed if this PR is merged before 2025.4 branching. Closes scylladb/scylladb#26182 * github.com:scylladb/scylladb: doc: update information for n2-highmem instances doc: remove n1-highmem instances from Recommended Instances	2025-11-21 16:28:54 +02:00
Anna Stuchlik	dab74471cc	doc: update information for n2-highmem instances This commit updates the section for n2-highmem instances on the Cloud Instance Recommendations page - Added missing support for n2-highmem-96 - Update the reference to n2 instances in the Google Cloud docs. - Added the missing information about processors for this instance type (Ice Lake and Cascade Lake).	2025-11-21 15:13:36 +01:00
Taras Veretilnyk	3003669c96	file_stream_test: add sstable file streaming integrity verification test cases Add 'test_sstable_stream' to verify SSTable file streaming integrity check. The new tests cover both compressed and uncompressed SSTables and includes: - Checksum mismatch detection verification - Digest mismatch detection verifivation	2025-11-21 12:52:35 +01:00
Taras Veretilnyk	77dcad9484	streaming: prioritize sender-side errors in tablet_stream_files When 'send_data_to_peer' throws and closes the sink, the peer later reports its own error, masking the original sender failure. This commit preserves the original sender exception. If the status-retrieval task throws its own error before sender task rethrows its exception, we can still propagate the original exception later.	2025-11-21 12:52:31 +01:00
Taras Veretilnyk	c8d2f89de7	sstables: enable integrity check for data file streaming This patch enables integrity check in 'create_stream_sources()' by introducing a new 'sstable_data_stream_source_impl' class for handling the Data component of SSTables. The new implementation uses 'sstable::data_stream()' with 'integrity_check::yes' instead of the raw input_stream. These additional checks require reading the digest and CRC components from disk, which may introduce some I/O overhead. For uncompressed SSTables, this involves loading and computing checksums and digest from the data. For compressed SSTables - where checksums are already embedded - the cost comes from reading, calculation and verifying the digest.	2025-11-21 12:52:26 +01:00
Taras Veretilnyk	18e1dbd42e	sstables: Add compressed raw streaming support Implement compressed_raw_file_data_source that streams compressed chunks without decompression while verifying checksums and calculating digests. Extends raw_stream enum to support compressed_chunks mode. This data_source implementation will be used in the next commits for file based streaming.	2025-11-21 12:52:04 +01:00
Taras Veretilnyk	c32e9e1b54	sstables: Allow to read digest and checksum from user provided file instance Add overloaded methods to read digest and checksum from user-provided file handles: - 'read_digest(file f)' - 'read_checksum(file f) This will be useful for tablet file-based streaming to enable integrity verification, as the streaming code uses SSTable snapshots with open files to prevent missing components when SSTables are unlinked.	2025-11-21 12:51:40 +01:00
Michał Chojnowski	da51a30780	db/config: enable `ms` sstable format by default Trie-based sstable indexes are supposed to be (hopefully) a better default than the old BIG indexes. Make them the new default. If we change our mind, this change can be reverted later.	2025-11-21 12:39:46 +01:00
Michał Chojnowski	73090c0d27	cluster/dtest/bypass_cache_test: switch from highest_supported_sstable_format to chosen_sstable_format Trie-based indexes and older indexes have a difference in metrics, and the test uses the metrics to check for bypass cache. To choose the right metrics, it uses highest_supported_sstable_format, which is inappropriate, because the sstable format chosen for writes by Scylla might be different than highest_supported_sstable_format. Use chosen_sstable_format instead.	2025-11-21 12:39:46 +01:00
Michał Chojnowski	38e14d9cd5	api/system: add /system/chosen_sstable_version Returns the sstable version currently chosen for use in for new sstables. We are adding it because some tests want to know what format they are writing (tests using upgradesstable, tests which check stats that only apply to one of the index types, etc). (Currently they are using `highest_supported_sstable_format` for this purpose, which is inappropriate, and will become invalid if a non-latest format is the default).	2025-11-21 12:39:46 +01:00
Botond Dénes	5c6813ccd0	test/cluster/test_repair.py: add test_repair_timestamp_difference Add a test which verifies that if two nodes have the same data, with different timestamps, repair will detect and fix the diverging timestamps. All our repair tests focus on difference in data and I remember writing this test multiple times in the past to quickly verify whether this works. Time to upstream this test. Closes scylladb/scylladb#26900	2025-11-21 14:19:51 +03:00
Botond Dénes	6f79fcf4d5	tools/scylla-nodetool: dump request history on json assert A JSON assert happens when a JSON member is either missing or has unexpected type. rapidjson has a very unhelpful "json assert failed" message for this, with a backtrace (somewhat helpful), with no other context. To help debug such errors, collect all request sent to the API and dump them when such errors happen. The backtrace with the full request history should be enough to debug any such issues. Refs CUSTOMER-17 Closes scylladb/scylladb#26899	2025-11-21 14:17:53 +03:00
Gautam Menghani	939fcc0603	db/system_keyspace: Remove the FIXME related to caching of large tables Remove the FIXME comment for re-enabling caching of the large tables since the tables are used infrequently [1]. [1] : github.com/scylladb/scylladb/pull/26789#issuecomment-3477540364 Fixes #26032 Signed-off-by: Gautam Menghani <gautam.opensource@gmail.com> Closes scylladb/scylladb#26789	2025-11-21 12:34:34 +02:00
Radosław Cybulski	d589e68642	Add precompiled headers to CMakeLists.txt Add precompiled header support to CMakeLists.txt and configure.py - it improves compilation time by approximately 10%. New header `stdafx.hh` is added, don't include it manually - the compiler will include it for you. The header contains includes from external libraries used by Scylla - seastar, standard library, linux headers and zlib. The feature is enabled by default, use CMake option `Scylla_USE_PRECOMPILED_HEADER` or configure.py --disable-precompiled-header to disable. The feature should be disabled, when trying to check headers - otherwise you might get false negatives on missing includes from seastar / abseil and so on. Note: following configuration needs to be added to ccache.conf: sloppiness = pch_defines,time_macros,include_file_mtime,include_file_ctime Closes scylladb/scylladb#26617	2025-11-21 12:27:41 +02:00
Nadav Har'El	64a075533b	alternator: fix update of stats from wrong shard In commit `51186b2` (PR #25457) we introduced new statistics for authentication errors, and among other places we modified executor::create_table() to update them when necessary. This function runs its real work (create_table_on_shard0()) on shard 0, but incorrectly updates "_stats" from the original shard. It doesn't really matter which shard's stats we update - but it does matter that code running on shard 0 shouldn't touch some other shard's objects. Since all we do on these stats is to increment an integer, the risk of updating it on the wrong shard is minimal to non-existant, but it's still wrong and can cause bigger trouble in the future as the code continues to evolve. The fix is simple - we should pass to create_table_on_shard0() the _stats object from the acutal shard running it (shard 0). Fixes #26942 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#26944	2025-11-21 11:53:06 +02:00
Calle Wilund	3c4546d839	messaging_service: Add internode_compression=rack as option Fixes #27085 Adds a "rack" option to enum/config and handles in connection setup in messaging_service. Closes scylladb/scylladb#27099	2025-11-21 11:50:55 +02:00
Nadav Har'El	66bd3dc22c	test/alternator: tests for request compression DynamoDB's documentation https://docs.aws.amazon.com/sdkref/latest/guide/feature-compression.html suggests that DynamoDB allows request bodies to be compressed (currently only by gzip). The purpose of patch is to have a test reproducing this feature. The test shows us that indeed DynamoDB understands compressed requests using the "gzip" encoding, but Alternator does not, so the new test is xfail. As you can see in the test code, although the low-level SDK (botocore) can send compress requests, this is not actually enabled for DynamoDB and we need to resort to some trickery to send compressed requests. But the point is that once we do manage to send compressed requests, the test shows us that they work properly on AWS, but fail on Alternator. The failure of the compressed requests on Alternator is reported like: An error occurred (ValidationException) when calling the PutItem operation: Parsing JSON failed: Invalid value. at 70459088 This error message should probably be improved (what is that high number?!) but of course even better would be to make it really work. By enabling tracing on alternator-server (e.g., edit test/cqlpy/run.py and add `'--logger-log-level', 'alternator-server=trace',`) we can see exactly what request the SDK sends Alternator. What we can see in the request is: 1. The request headers are uncompressed (this is expected in HTTP) 2. There is a header "Content-Encoding: gzip" 3. The request's body is binary, a full-fleged gzip output complete with a gzip magic in the beginning. Refs #5041 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27049	2025-11-21 10:48:33 +02:00
Shreyas Ganesh	4488a4fb06	docs: document sstables quarantine subdirectory Add documentation for the quarantine/ subdirectory that holds SSTables isolated due to validation failures or corruption. Document the scrub operation's quarantine_mode parameter options and the drop_quarantined_sstables API operation. Also update the directory hierarchy example to include the quarantine directory. Fixes #10742 Signed-off-by: Shreyas Ganesh <vansi.ganeshs@gmail.com> Closes scylladb/scylladb#27023	2025-11-21 10:45:33 +02:00
Ernest Zaslavsky	825d81dde2	cmake: dont complain about deprecated builtins On clang 21.1.4 (Fedora 43) the abseil compilation started to fail with `builtin XXX is deprecated use YYY instead`. Suppress this for abseil compilation only Closes scylladb/scylladb#27098	2025-11-21 10:31:54 +02:00
Botond Dénes	0cc5208f8e	Merge 'Add sstables_manager::config' from Pavel Emelyanov Currently sstables_manager keeps a reference on global db::config to configure itself. Most of other services use their own specific configs with much less data on-board for the same purposes (e.g. #24841, #19051 and #23705 did same for other services) This PR applies this approach to sstables_manager as well. Mostly it moves various values from db::config onto newly introduced struct sstables_manager::config, but it also adds specific tracking of sstable_file_io_extensions and patches tools/scylla-sstable not to use sstables_manager as "proxy" object to get db::config from along its calls. Shuffling components dependencies, no need to backport Closes scylladb/scylladb#27021 * github.com:scylladb/scylladb: sstables_manager: Drop db::config from sstables_manager tools/sstable: Make shard_of_with_tablets use db::config argument tools/sstable: Add db::config& to all operations tools/sstable: Get endpoints from storage manager sstables_manager: Hold sstable IO extensions on it sstables: Manager helper to grab file io extensions sstables_manager: Move default format on config sstables_manager: Move enable_sstable_data_integrity_check on config sstables_manager: Move data_file_directories on config sstables_manager: Move components_memory_reclaim_threshold on config sstables_manager: Move column_index_auto_scale_threshold on config sstables_manager: Move column_index_size on config sstables_manager: Move sstable_summary_ratio on config sstables_manager: Move enable_sstable_key_validation on config sstables_manager: Move available_memory on config code: Introduce sstables_manager::config sstables: Patch get_local_directories() to work on vector of paths code: Rename sstables_manager::config() into db_config()	2025-11-21 10:21:41 +02:00
Botond Dénes	f89bb68fe2	Merge 'cdc: Preserve properties when reattaching log table' from Dawid Mędrek When we enable CDC on a table, Scylla creates a log table for it. It has default properties, but the user may change them later on. Furthermore, it's possible to detach that log table by simply disabling CDC on the base table: ```cql /* Create a table with CDC enabled. The log table is created. / CREATE TABLE ks.t (pk int PRIMARY KEY) WITH cdc = {'enabled': true}; / Detach the log table. / ALTER TABLE ks.t WITH cdc = {'enabled': false}; / Modify a property of the log table. / ALTER TABLE ks.t_scylla_cdc_log WITH bloom_filter_fp_chance = 0.13; ``` The log table can also be reattached by enabling CDC on the base table again: ```cql / Reattach the log table / ALTER TABLE ks.t WITH cdc = {'enabled': true}; ``` However, because the process of reattachment goes through the same code that created it in the first place, the properties of the log table are rolled back to their default values. This may be confusing to the user and, if unnoticed, also have other consequences, e.g. affecting performance. To prevent that, we ensure that the properties are preserved. A reproducer test, `test_log_table_preserves_properties_after_reattachment`, has been provided to verify that the changes are correct. Another test, `test_log_table_preserves_id_after_reattachment`, has also been added because the current implementation sets properties and the ID separately. Fixes scylladb/scylladb#25523 Backport: not necessary. Although the behavior may be unexpected, it's not a bug per se. Closes scylladb/scylladb#26443 github.com:scylladb/scylladb: cdc: Preserve properties when reattaching log table cdc: Extract creating columns in CDC log table to dedicated function cdc: Extract default properties of CDC log tables to dedicated function schema/schema_builder.hh: Add set_properties schema: Add getter for schema::user_properties schema: Remove underscores in fields of schema::user_properties schema: Extract user properties out of raw_schema	2025-11-21 10:06:05 +02:00
Calle Wilund	03408b185e	utils::gcp::object_storage: Fix buffer alignment reordering trailing data Fixes #26874 Due to certain people (me) not being able to tell forward from backward, the data alignment to ensure partial uploads adhere to the 256k-align rule would potentially _reorder_ trailing buffers generated iff the source buffers input into the sink are small enough. Which, as a fun fact, they are in backup upload. Change the unit test to use raw sink IO and add two unit tests (of which the smaller size provokes the bug) that checks the same 64k buf segmented upload backup uses. Closes scylladb/scylladb#26938	2025-11-21 09:36:13 +02:00

1 2 3 4 5 ...

50671 Commits