scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-27 20:05:10 +00:00

Author	SHA1	Message	Date
Benny Halevy	4a3d14a031	test: cluster: test_tablets_merge: add test_tablet_split_merge_with_many_tables Reproduces #23284 Currently skipped in release mode since it requires the `short_tablet_stats_refresh_interval` interval. Ref #24641 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-07-06 15:07:31 +03:00
Benny Halevy	2c0bafb934	token_metadata: clear_and_destroy_impl when destroyed We have a lot of places in the code where a token_metadata_ptr is kept in an automatic variable and destroyed when it leaves the scope. since it's a referenced counted lw_shared_ptr, the token_metadata object is rarely destroyed in those cases, but when it is, it doesn't go through clear_gently, and in particular its tablet_metadata is not cleared gently, leading to inefficient destruction of potentially many foreign_ptr:s. This patch calls clear_and_destroy_impl that gently clears and destroys the impl object in the background using the shared_token_metadata. Fixes #13381 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-07-06 15:07:31 +03:00
Benny Halevy	2b2cfaba6e	token_metadata: keep a reference to shared_token_metadata To be used by a following patch to gently clean and destroy the token_data_impl in the background. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-07-06 15:07:31 +03:00
Benny Halevy	e0a19b981a	token_metadata: move make_token_metadata_ptr into shared_token_metadata class So we can use the local shared_token_metadata instance for safe background destroy of token_metadata_impl:s. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-07-06 14:22:20 +03:00
Benny Halevy	493a2303da	replica: database: get and expose a mutable locator::shared_token_metadata Prepare for next patch, the will use this shared_token_metadata to make mutable_token_metadata_ptr:s Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-07-06 14:22:20 +03:00
Benny Halevy	3acca0aa63	locator: tablets: tablet_metadata: clear_gently: optimize foreign ptr destruction Sort all tablet_map_ptr:s by shard_id and then destroy them on each shard to prevent long cross-shard task queues for foreign_ptr destructions. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-07-06 14:20:46 +03:00
Avi Kivity	33225b730d	Merge 'Do not reference db::config by transport::server' from Pavel Emelyanov The db::config is top-level configuration class that includes options for pretty much everything in Scylla. Instead of messing with this large thing, individual services have their own smaller configs, that are initialized with values from db::config. This PR makes it for transport::server (transport::controller will be next) and its cql_server_config. One bad thing not to step on is that updateable_value is not shard-safe (#7316), but the code in controller that creates cql_server_config is already taking care. Closes scylladb/scylladb#24841 * github.com:scylladb/scylladb: transport: Stop using db::config by transport::server transport: Keep uninitialized_connections_semaphore_cpu_concurrency on cql_server_config transport: Move cql_duplicate_bind_variable_names_refer_to_same_variable to cql_server_config transport: Move max_concurrent_requests to struct config transport: Use cql_server_config::max_request_size	2025-07-05 18:39:01 +03:00
Pavel Emelyanov	9b178df7dd	transport: Stop using db::config by transport::server Now the server is self-contained in the way it is being configured by the controller. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-07-04 15:40:20 +03:00
Pavel Emelyanov	e2c1484d8d	transport: Keep uninitialized_connections_semaphore_cpu_concurrency on cql_server_config This also repeats previous patch for another updateable_value. The thing here is that this config option is passed further to generic_server, but not used by transport::server itslef. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-07-04 15:40:20 +03:00
Pavel Emelyanov	64ffe67cbd	transport: Move cql_duplicate_bind_variable_names_refer_to_same_variable to cql_server_config Similarly to previous patch -- move yet another updateable_value to let transport::server eventually stop messing with db::config. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-07-04 15:40:14 +03:00
Pavel Emelyanov	b6546ed5ff	transport: Move max_concurrent_requests to struct config This is updateable_value that's initialized from db::config named_value to tackle its shard-unsafety. However, the cql_server_config is created by controller using sharded_parameter() helper, so that is can be safely passed to server. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-07-04 15:35:55 +03:00
Pavel Emelyanov	6075eca168	transport: Use cql_server_config::max_request_size It's duplicated on config and the transport::server that aggregates the config itself. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-07-04 15:34:53 +03:00
Pavel Emelyanov	4d4406c5bc	Merge 'test.py: dtest: port next_gating tests from auth_test.py' from Evgeniy Naydanov Copy `auth_test.py` from scylla-dtest test suite, remove all not next_gating tests from it, and make it works with `test.py` As a part of the porting process, remove unused imports and markers, remove non-next_gating tests and tests marked with `required_features("!consistent-topology-changes")` marker. Remove `test_permissions_caching` test because it's too flaky when running using test.py Also, make few time execution optimizations: - remove redundant `time.sleep(10)` - use smaller timeouts for CQL sessions Enable the test in `suite.yaml` (run in dev mode only.) Additional modifications to test.py/dtest shim code: - Modify ManagerClient.server_update_config() method to change multiple config options in one call in addition to one `key: value` pair. - Implement the method using slightly modified `set_configuration_options()` method of `ScyllaCluster`. - Copy generate_cluster_topology() function from tools/cluster_topology.py module. - Add support for `bootstrap` parameter for `new_node()` function. - Rework `wait_for_any_log()` function. Closes scylladb/scylladb#24648 * github.com:scylladb/scylladb: test.py: dtest: make auth_test.py run using test.py test.py: dtest: rework wait_for_any_log() test.py: dtest: add support for bootstrap parameter for new_node test.py: dtest: add generate_cluster_topology() function test.py: dtest: add ScyllaNode.set_configuration_options() method test.py: pylib/manager_client: support batch config changes test.py: dtest: copy unmodified auth_test.py test.py: dtest: add missed markers to pytest.ini	2025-07-04 10:51:52 +03:00
Botond Dénes	258bf664ee	scylla-gdb.py: sstable-summary: adjust for raw-tokens `01466be7b9` changed the summary entries, storing raw tokens in them, instead of dht::token. Adjust the command so that it works with both pre- and post- versions. Also make it accept pointers to sstables as arguments, this is what scylla sstables listing provides. Closes scylladb/scylladb#24759	2025-07-04 10:44:25 +03:00
Patryk Jędrzejczak	8d925b5ab4	test: increase the default timeout of graceful shutdown Multiple tests are currently flaky due to graceful shutdown timing out when flushing tables takes more than a minute. We still don't understand why flushing is sometimes so slow, but we suspect it is an issue with new machines spider9 and spider11 that CI runs on. All observed failures happened on these machines, and most of them on spider9. In this commit, we increase the timeout of graceful shutdown as a temporary workaround to improve CI stability. When we get to the bottom of the issue and fix it, we will revert this change. Ref #12028 It's a temporary workaround to improve CI stability, we don't have to backport it. Closes scylladb/scylladb#24802	2025-07-04 10:43:38 +03:00
Avi Kivity	60f407bff4	storage_proxy: avoid large allocation when storing batch in system.batchlog Currently, when computing the mutation to be stored in system.batchlog, we go through data_value. In turn this goes through `bytes` type (#24810), so it causes a large contiguous allocation if the batch is large. Fix by going through the more primitive, but less contiguous, atomic_cell API. Fixes #24809. Closes scylladb/scylladb#24811	2025-07-04 10:43:05 +03:00
Avi Kivity	5cbeae7178	sstables: drop minimum_key(), maximum_key() Not used. Closes scylladb/scylladb#24825	2025-07-04 10:42:44 +03:00
Dawid Mędrek	a151944fa6	treewide: Replace __builtin_expect with (un)likely C++20 introduced two new attributes--likely and unlikely--that function as a built-in replacement for __builtin_expect implemented in various compilers. Since it makes code easier to read and it's an integral part of the language, there's no reason to not use it instead. Closes scylladb/scylladb#24786	2025-07-03 13:34:04 +03:00
dependabot[bot]	59cc496757	build(deps): bump sphinx-scylladb-theme from 1.8.6 to 1.8.7 in /docs Bumps [sphinx-scylladb-theme](https://github.com/scylladb/sphinx-scylladb-theme) from 1.8.6 to 1.8.7. - [Release notes](https://github.com/scylladb/sphinx-scylladb-theme/releases) - [Commits](https://github.com/scylladb/sphinx-scylladb-theme/compare/1.8.6...1.8.7) --- updated-dependencies: - dependency-name: sphinx-scylladb-theme dependency-version: 1.8.7 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Closes scylladb/scylladb#24805	2025-07-03 12:04:24 +03:00
Gleb Natapov	ca7837550d	topology coordinator: do not set request_type field for truncation command if topology_global_request_queue feature is not enabled yet Old nodes do not expect global topology request names to be in request_type field, so set it only if a cluster is fully upgraded already. Closes scylladb/scylladb#24731	2025-07-02 17:09:29 +02:00
Pavel Emelyanov	fa0077fb77	Merge 'S3 chunked download source bug fixes' from Ernest Zaslavsky - Fix missing negation in the `if` in the background downloading fiber - Add test to catch this case - Improve the s3 proxy to inject errors if the same resource requested more than once - Suppress client retry since retrying the same request when each produces multiple buffers may lead to the same data appear more than once in the buffer deque - Inject exception from the test to simulate response callback failure in the middle No need to backport anything since this class in not used yet Closes scylladb/scylladb#24657 * github.com:scylladb/scylladb: s3_test: Add s3_client test for non-retryable error handling s3_test: Add trace logging for default_retry_strategy s3_client: Fix edge case when the range is exhausted s3_client: Fix indentation in try..catch block s3_client: Stop retries in chunked download source s3_client: Enhance test coverage for retry logic s3_client: Add test for Content-Range fix s3_client: Fix missing negation s3_client: Refine logging s3_client: Improve logging placement for current_range output	2025-07-02 14:45:10 +03:00
Patryk Jędrzejczak	fa982f5579	docs: handling-node-failures: fix typo Replacing "from" is incorrect. The typo comes from recently merged #24583. Fixes #24732 Requires backport to 2025.2 since #24583 has been backported to 2025.2. Closes scylladb/scylladb#24733	2025-07-02 12:22:01 +03:00
Konstantin Osipov	37fc4edeb5	test.py: add a way to provide pytest arguments via test.py Now that we use a single pytest.ini for all tests, different developer preferences collide. There should be an easy way to override pytest.ini defaults from the command line. Fixes https://github.com/scylladb/scylladb/issues/21800 Closes scylladb/scylladb#24573	2025-07-02 12:20:43 +03:00
Avi Kivity	dfaed80f55	Merge 'types: add byte-comparable format support for native cql3 types' from Lakshmi Narayanan Sreethar This PR introduces a new `comparable_bytes` class to add byte-comparable format support for all the [native cql3 data types](https://opensource.docs.scylladb.com/stable/cql/types.html#native-types) except `counter` type as that is not comparable. The byte-comparable format is a pre-requisite for implementing the trie based index format for our sstables(https://github.com/scylladb/scylladb/issues/19191). This implementation adheres to the byte-comparable format specification in https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/utils/bytecomparable/ByteComparable.md Note that support for composite data types like lists, maps, and sets has not been implemented yet and will be made available in a separate PR. Refs https://github.com/scylladb/scylladb/issues/19407 New feature - backport not required. Closes scylladb/scylladb#23541 * github.com:scylladb/scylladb: types/comparable_bytes: add testcase to verify compatibility with cassandra types/comparable_bytes: support variable-length natively byte-ordered data types types/comparable_bytes: support decimal cql3 types types/comparable_bytes: introduce count_digits() method types/comparable_bytes: support uuid and timeuuid cql3 types types/comparable_bytes: support varint cql3 type types/comparable_bytes: support skipping sign byte write in decode_signed_long_type types/comparable_bytes: introduce encode/decode_varint_length types/comparable_bytes: support float and double cql3 types types/comparable_bytes: support date, time and timestamp cql3 types types/comparable_bytes: support bigint cql3 type types/comparable_bytes: support fixed length signed integers types/comparable_bytes: support boolean cql3 type types: introduce comparable_bytes class bytes_ostream: overload write() to support writing from FragmentedView docs: fix minor typo in docs/dev/cql3-type-mapping.md	2025-07-02 11:58:32 +03:00
Avi Kivity	1e0b015c8b	Merge 'cql3: Represent create_statement using managed_bytes' from Dawid Mędrek When describing a table, we need to do it carefully: if some columns were dropped, we must specify that explicitly by ``` ALTER TABLE {table} DROP {column} USING TIMESTAMP ... ``` in the result of the DESCRIBE statement. Failing to do so could lead to data resurrection. However, if a table has been altered many, many times, we might end up with a huge create statement. Constructing it could, in turn, trigger an oversized allocation. Some tests ran into that very problem in fact. In this commit, we want to mitigate the problem: instead of allocating a contiguous chunk of memory for the create statement, we use `bytes_ostream` and `managed_bytes` to possibly keep data scattered in memory. It makes handling `cql3::description` less convenient in the code, but since the struct is pretty much immediately serialized after creating it, it's a very good trade-off. A reproducer is intentionally not provided by this commit: it's easy to test the change, but adding and dropping a huge number of columns would take a really long amount of time, so we need to omit it. Fixes scylladb/scylladb#24018 Backport: all of the supported versions are affected, so we want to backport the changes there. Closes scylladb/scylladb#24151 * github.com:scylladb/scylladb: cql3/description: Serialize only rvalues of description cql3: Represent create_statement using managed_string cql3/statements/describe_statement.cc: Don't copy descriptions cql3: Use managed_bytes instead of bytes in DESCRIBE utils/managed_string.hh: Introduce managed_string and fragmented_ostringstream	2025-07-01 21:59:38 +03:00
Lakshmi Narayanan Sreethar	5f5a8cf54c	types/comparable_bytes: add testcase to verify compatibility with cassandra	2025-07-01 22:19:08 +05:30
Lakshmi Narayanan Sreethar	6c1853a830	types/comparable_bytes: support variable-length natively byte-ordered data types The following cql3 data types - ascii, blob, duration, inet, and text - are natively byte-ordered in their serialized forms. To encode them into a byte-comparable format, zeros are escaped, and since these types have variable lengths, the encoded form is terminated in an escaped state to mark its end. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:08 +05:30
Lakshmi Narayanan Sreethar	5c77d17834	types/comparable_bytes: support decimal cql3 types The decimal cql3 type is internally stored as a scale and an unscaled integer. To convert them into a byte comparable format, they are first normalized into a base-100 exponent and a mantissa that lies in [0.01, 1) and then encoded into a byte sequence that preserves the numerical order. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:08 +05:30
Lakshmi Narayanan Sreethar	832236d044	types/comparable_bytes: introduce count_digits() method Implemented a method `count_digits()` to return the number of significant digits in a given boost::multiprecision:cpp_int. This is required to convert big_decimal to a byte comparable format. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:08 +05:30
Lakshmi Narayanan Sreethar	a00c5d3899	types/comparable_bytes: support uuid and timeuuid cql3 types The uuid type values are composed of two fixed-length unsigned integers: an msb and an lsb. The msb contains a version digit, which must be pulled first in a byte-comparable representation. For version 1 uuids, in addition to extracting the version digit first, the msb must be rearranged to make it byte comparable. The lsb is written as is. For the timeuuid type, the msb is handled simliar to the version 1 uuid values. The lsb however is treated differently - the sign bits of all bytes are inverted to preserve the legacy comparison order, which compared individual bytes as signed values. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:08 +05:30
Lakshmi Narayanan Sreethar	4592b9764c	types/comparable_bytes: support varint cql3 type Any varint value less than 7 bytes is encoded using the signed long encoding format and remaining values are all encoded using the full form encoding : <signbyte><length as unsigned integer - 7><7 or more bytes>, where <signbyte> is 00 for negative numbers and FF for positive ones, and the length's bytes are inverted if the number is negative (so that longer length sorts smaller). Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	1b6b0a665d	types/comparable_bytes: support skipping sign byte write in decode_signed_long_type The decode_signed_long_type() method writes leading sign bytes when decoding a byte-comparable encoded signed long value. The varint decoder depends on this method to decode values up to a certain length and expects the decoded form to include sign-only bytes only when necessary. Update the decode_signed_long_type() code to allow skipping the write of sign-only bytes based on the caller's request. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	ad45a19373	types/comparable_bytes: introduce encode/decode_varint_length The length of a varint value is encoded separately as an unsigned variable-length integer. For negative varint values, the encoded bytes are flipped to ensure that longer lengths sort smaller. This patch implements both encoding and decoding logic for varint lengths and will be used by the subsequent patch. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	7af153c237	types/comparable_bytes: support float and double cql3 types The sign bit is flipped for positive values to ensure that they are ordered after negative values. For negative values, all the bytes are inverted, allowing larger negative values to be ordered before smaller ones. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	0145c1d705	types/comparable_bytes: support date, time and timestamp cql3 types Both the date and time cql3 types are internally unsigned fixed length integers. Their serialized form is already byte comparable, so the encoder and decoder return the serialized bytes as it is. The timestamp type is encoded using the fixed length signed integer encoding. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	b6ff3f5304	types/comparable_bytes: support bigint cql3 type The bigint type, internally implemented as a long data type, is encoded using a variable-length encoding similar to UTF-8. This enables a significant amount of space to be saved when smaller numbers are frequently used, while still permitting large values to be efficiently encoded. The first bit of the encoding represents the inverted sign (i.e., 1 for positive, 0 for negative), followed by length encoded as a sequence of bits matching the inverted sign. This is then followed by a differing bit (except for 9-byte encodings) and the bits of the number's two's complement. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	c0d25060bd	types/comparable_bytes: support fixed length signed integers To encode fixed-length signed integers in a byte-comparable format, the first bit of each value is inverted. This ensures that negative numbers are ordered before positive ones during comparison. This patch adds support for the data types : byte_type (tinyint), short_type (smallint), and int32_type (int). Although long_type (bigint) is a fixed length integer type, it has different byte comparable encoding and will be handled separately in another patch. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	8572afca2b	types/comparable_bytes: support boolean cql3 type Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	74c556a33d	types: introduce comparable_bytes class This patch implements a new class, `comparable_bytes`, designed to implement methods for converting data values to and from byte-comparable formats. The class stores the comparable bytes as `managed_bytes` and currently provides the structure for all required methods. The actual logic for converting various data types will be implemented in subsequent patches. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	e4c7cb7834	bytes_ostream: overload write() to support writing from FragmentedView Overloaded write() method to support writing a FragmentedView into bytes_ostream. Also added a testcase to verify the implementation. The new helper will be used by the byte_comparable implementation during the encode/decode process. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Lakshmi Narayanan Sreethar	068e74b457	docs: fix minor typo in docs/dev/cql3-type-mapping.md Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Ernest Zaslavsky	acf15eba8e	s3_test: Add s3_client test for non-retryable error handling Introduce a test that injects a non-retryable error and verifies that the chunked download source throws an exception as expected.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	a5246bbe53	s3_test: Add trace logging for default_retry_strategy Introduce trace-level logging for `default_retry_strategy` in `s3_test` to improve visibility into retry logic during test execution.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	49e8c14a86	s3_client: Fix edge case when the range is exhausted Handle case where the download loop exits after consuming all data, but before receiving an empty buffer signaling EOF. Without this, the next request is sent with a non-zero offset and zero length, resulting in "Range request cannot be satisfied" errors. Now, an empty buffer is pushed to indicate completion and exit the fiber properly.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	e50f247bf1	s3_client: Fix indentation in try..catch block Correct indentation in the `try..catch` block to improve code readability and maintain consistent formatting.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	d2d69cbc8c	s3_client: Stop retries in chunked download source Disable retries for S3 requests in the chunked download source to prevent duplicate chunks from corrupting the buffer queue. The response handler now throws an exception to bypass the retry strategy, allowing the next range to be attempted cleanly. This exception is only triggered for retryable errors; unretryable ones immediately halt further requests.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	c75acd274c	s3_client: Enhance test coverage for retry logic Extend the S3 proxy to support error injection when the client makes multiple requests to the same resource—useful for testing retry behavior and failure handling.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	ec59fcd5e4	s3_client: Add test for Content-Range fix Introduce a test that accurately verifies the Content-Range behavior, ensuring the previous fix is properly validated.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	6d9cec558a	s3_client: Fix missing negation Restore a missing `not` in a conditional check that caused incorrect behavior during S3 client execution.	2025-07-01 18:45:17 +03:00
Ernest Zaslavsky	e73b83e039	s3_client: Refine logging Fix typo in log message to improve clarity and accuracy during S3 operations.	2025-07-01 18:45:17 +03:00

1 2 3 4 5 ...

48380 Commits