scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-21 23:32:15 +00:00

Author	SHA1	Message	Date
Andrzej Jackowski	f39c48648a	test: perf: add --audit-rules option to perf-simple-query Allow perf-simple-query to compare audit-rule matching with the category/keyspace/table audit filters under the same workload. Register a hardcoded "tester" role with the audit cache so rules targeting that role exercise the preprocessed fast path. The new option was used to measure audit-rules performance against the category/keyspace/table audit config. The results are as follows: =============================================================================================================================================================================== Configuration \| Binary \| throughput (tps) \| insns/op \| cpu_cycles/op \| alloc/op \| logal/op \| task/op =============================================================================================================================================================================== audit=none [1] \| baseline \| 206922.4 \| 36591.6 \| 15348.3 \| 58.1 \| 0.0 \| 14.1 audit=none [1] \| this PR \| 207856.4 (+0.5%) \| 36544.9 (-0.1%) \| 15274.0 (-0.5%) \| 58.1 \| 0.0 \| 14.1 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- audit=syslog keyspaces=ks [2] \| baseline \| 94871.8 \| 54163.0 \| 27172.4 \| 72.0 \| 0.0 \| 24.0 audit=syslog keyspaces=ks [2] \| this PR \| 96138.4 (+1.3%) \| 54072.3 (-0.2%) \| 26699.3 (-1.7%) \| 72.0 \| 0.0 \| 24.0 audit=syslog audit-rules=ks [3] \| this PR \| 95142.1 (+0.3%) \| 54457.8 (+0.5%) \| 26953.8 (-0.8%) \| 72.0 \| 0.0 \| 24.0 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- audit=syslog keyspaces=ks-non-existent [4] \| baseline \| 213997.8 \| 36735.6 \| 14848.1 \| 58.1 \| 0.0 \| 14.1 audit=syslog keyspaces=ks-non-existent [4] \| this PR \| 219297.2 (+2.5%) \| 36667.3 (-0.2%) \| 14500.1 (-2.3%) \| 58.1 \| 0.0 \| 14.1 audit=syslog audit-rules=ks-non-existent [5] \| this PR \| 211038.7 (-1.4%) \| 36999.7 (+0.7%) \| 15048.6 (+1.4%) \| 58.1 \| 0.0 \| 14.1 =============================================================================================================================================================================== [1] ./scylla perf-simple-query --smp 1 --duration 100 --audit "none" [2] ./scylla perf-simple-query --smp 1 --duration 100 --audit "syslog" --audit-keyspaces "ks" --audit-categories "DCL... [3] ./scylla perf-simple-query --smp 1 --duration 100 --audit "syslog" --audit-rules '[{"sinks":["syslog"],"categorie... [4] ./scylla perf-simple-query --smp 1 --duration 100 --audit "syslog" --audit-keyspaces "ks-non-existent" --audit-ca... [5] ./scylla perf-simple-query --smp 1 --duration 100 --audit "syslog" --audit-rules '[{"sinks":["syslog"],"categorie... audit-null.sock was created with `socat -u UNIX-RECV:/tmp/audit-null.sock,type=2 OPEN:/dev/null` Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	f4acf91949	docs: add audit rules section to the auditing guide Operators need a reference for the new rule schema, its relationship to audit_categories/audit_tables/ audit_keyspaces, and the live-update path so they can adopt the feature without reading the source. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	f03398fdba	test: audit: cover role and schema cache notifications Verify on a multi-node cluster that role creation/alter/ drop and table/materialized-view create/drop trigger updates to the preprocessed audit-rules cache on every node, and that a matching DML on the newly created table is audited via the cache. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	7f61d7662d	test: audit: cover audit rules cluster behavior Cluster-level tests should validate rule matching, live updates, sink routing, role filtering, and error handling without rerunning the broader audit suite. Add audit_rules to LIVE_AUDIT_KEYS so the test framework tracks it as a live-updatable config key. Test that rules with empty categories or roles match nothing, that DML rules coexist with legacy audit config, AUTH rules fire on login events, CQL and REST API update paths reject invalid JSON, per-rule sink routing works for table and syslog, role-based filtering works across sessions, and sink mismatch produces a warning in server logs. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	c810bb48f4	audit: rebuild rule caches on group0 snapshot and role changes Nodes can join or reload snapshots after roles and tables already exist, so the cache cannot rely only on incremental notifications. Bulk-load all known roles and tables into the rule cache on Raft state reload and snapshot transfer. Detect incremental role creates and drops in reload_modules() by comparing the loaded roles against the auth cache, and forward the changes to every shard. Each shard rebuilds the fnmatch cache locally from its own rules to avoid cross-shard races when rules are updated concurrently with entity sync. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	78bd361919	audit: refresh rule caches on schema, role, and config changes Schema, role, and config changes must refresh the preprocessed rule cache, otherwise the fast path serves stale matches after reconfiguration or metadata changes. Register a migration listener for table/view create/drop. Observe audit_rules config changes through a serialized action so concurrent rebuilds collapse. Add hooks for role create/drop and a set_known_entities() bulk-load method. Implement real cleanup in shutdown() (previously a no-op) and roll back cleanly on start failure. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	465f8f4d8d	audit: route matching rules to configured sinks Rule-based routing must coexist with legacy category/keyspace/table filtering so operators who have not opted into rules keep their existing behavior. Merge rule-matched sinks into the event's sink set alongside legacy matches. Add a username parameter to should_log_login/sinks_for_login so rules can match the authenticated role. Use a conservative over-approximation for the fast will-log check since the role is not yet known at that call site. Log an error at startup when rules reference sinks not enabled globally. Log a warning when rules are configured but audit is disabled. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	7afb90aa6f	test: cover preprocessed audit rule cache The rule cache is the fast path for matching, so its hit, fallback, refresh, and category-bypass behavior needs focused unit coverage. Test transparent hash consistency, cached and uncached lookup paths, incremental entity add/remove, rule refresh, and empty-rules short circuit. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	97fb2f01ff	audit: add preprocessed rule matching cache Running fnmatch on every audit event would hurt hot-path latency. Precompute per-role and per-table bitsets and intersect them at query time. Rebuild from snapshots with a generation counter to avoid partial state after yielding. Unknown roles/tables fall back to linear fnmatch until metadata notifications populate the cache. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	6354daa8d7	audit: pass sink targets to storage helpers Per-rule routing needs each audit event to carry its target sinks so storage helpers can self-filter without duplicating writes. Replace should_log() with sinks_for() returning an audit_sink_set and add sinks_for_login() for the login path. Move the early-return filtering check from the static inspect() caller into audit::log() so it uses the new sinks_for() directly. Pass the sink set to storage_helper::write() so each helper only fires when its sink is included. Rename parse_audit_modes to parse_audit_sinks. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	67ecdba456	test: audit: cover rule matching semantics Rule matching is reused by both the preprocessed cache and the fallback path -- unit-test it separately so coupling failures do not mask matching bugs. Cover category bitmask, glob patterns for tables and roles, AUTH/ADMIN/DCL table bypass, empty-keyspace batch bypass, and sink bitmask conversion. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	65dd103f74	audit: add rule matching and sink helpers Rule matching must be shared between the preprocessed cache and the fallback path to avoid divergent semantics. Introduce audit_sink enum and audit_sink_set bitmask for routing. Match categories via bitmask, tables and roles via fnmatch with extended globs. AUTH/ADMIN/DCL bypass table matching. Empty category or role lists match nothing. Empty keyspace (e.g. cross-table batches) bypasses table matching for table-scoped categories. Convert validated sink names to an audit_sink_set bitmask for routing. Refs SCYLLADB-1430	2026-05-20 06:55:15 +02:00
Andrzej Jackowski	762fd5d455	test: audit: cover audit_rules configuration Audit rules enter through three paths (YAML, CQL, CLI), each with its own parsing and tracking -- cover all entry points before routing can depend on them. Test loading from YAML, live update via CQL and server API, CLI parsing, invalid value rejection at each path, and observer notification on live update. Refs SCYLLADB-1430	2026-05-20 06:55:14 +02:00
Andrzej Jackowski	f3a7e2e3dc	config: add live audit_rules option Operators need to configure audit rules through YAML, CQL, and CLI with live-update support so routing can be reconfigured without restart. Add audit_rules as a LiveUpdate config option with YAML decoding, JSON parsing for CQL updates, CLI --audit-rules flag, and a custom serializer that avoids double-quoting the JSON array. Refs SCYLLADB-1430	2026-05-20 06:55:14 +02:00
Andrzej Jackowski	3cc55dd6eb	test: cover audit rule parsing and validation Parsing and validation are the first consumer-visible surface of audit rules -- cover them before building higher layers. Test JSON parsing (valid, malformed, missing fields), rule validation (unknown sinks, invalid categories), and JSON round-trip serialization. Refs SCYLLADB-1430	2026-05-20 06:55:14 +02:00
Andrzej Jackowski	32cfa778f7	audit: define audit_rule type with parsing and validation Audit rules provide more granular control over which statements are audited, filtering by tables, roles, and categories. Typos in sink or category names should be caught at parse time rather than silently disabling rules at runtime. Define the audit_rule struct with JSON parsing, validation of sink and category names, serialization, and fmt support. Move statement_category, category_set, and category_to_string out of audit.hh/audit.cc so the rule type is self-contained. Refs SCYLLADB-1430	2026-05-20 06:55:14 +02:00
Nadav Har'El	a91d8aeb63	Merge '`CREATE INDEX` validation for Fulltext Indexes' from Dawid Pawlik This PR adds the schema-level validation required for `CREATE INDEX` and `DROP INDEX` on fulltext indexes, mirroring what vector indexes already enforce. Fulltext indexes are viewless custom indexes (no backing materialized view) that rely on CDC for change tracking. The validation ensures these prerequisites are met at index creation time and cannot be violated afterwards via `ALTER TABLE`. Tablet storage: Fulltext indexes require the keyspace to use tablet storage. Creation is rejected otherwise. CDC requirements: Fulltext indexes need a CDC log with a minimum TTL of 24 hours and either `delta = 'full'` or `postimage = true`. The PR enforces this in three places: - `CREATE INDEX` rejects creation when existing CDC options don't meet the requirements. - auto-enables CDC for tables with a fulltext index (same as vector indexes) and validates CDC options on schema updates. - `ALTER TABLE` blocks disabling CDC while a fulltext index exists. Viewless index generalization: The `vector_index`-specific checks in `create_index_statement` (rejecting `WITH` view properties, name-based duplicate detection for issue #26672) are replaced with a generic `is_viewless_custom_class()` helper that queries the index factory. This automatically covers both vector and fulltext indexes without duplicating logic. DROP INDEX reuses the existing path with no changes needed - the standard drop logic works for viewless indexes as-is. Added tests covering all validation paths above. All existing tests are updated to require the `skip_without_tablets` fixture. Fixes: SCYLLADB-1516 Closes scylladb/scylladb#29739 * github.com:scylladb/scylladb: external_index: fix require CDC options for disabled CDC test/cqlpy: add duplicate and view tests for fulltext index cql3: generalize viewless index handling in CREATE INDEX statement test/cqlpy: add CDC validation tests for fulltext index fulltext_index: enforce CDC requirements for fulltext indexes test/cqlpy: add tablet requirement test for fulltext index fulltext_index: require tablet storage for fulltext indexes index: introduce `external_index` base class for VS/FTS indexes	2026-05-20 01:10:56 +03:00
Piotr Szymaniak	f6d4d8abc0	test/cluster: don't advance to SERVING before CQL/Alternator connections are up Don't honor sd_notify SERVING until CQL/Alternator ports are verified reachable. Fixes a race introduced in `af03f0e8c4` (PR #29758). Refs: #29929 Fixes: SCYLLADB-2065 Closes scylladb/scylladb#29964	2026-05-19 21:06:04 +03:00
Michael Litvak	eecbead541	test: wait for others_not_see_server before exclude Between stopping a server and excluding it, wait for other nodes to see the server as down, otherwise exclude may see the server as alive and fail. Fixes SCYLLADB-2110 Closes scylladb/scylladb#29966	2026-05-19 19:36:54 +02:00
Patryk Jędrzejczak	b7fc661fa9	Merge 'raft: fix send_snapshot abort_source lifetime' from Emil Maskovsky Fix a lifetime bug where `send_snapshot()` captured `abort_source` by reference and the referenced object could be destroyed before the continuation ran. Use a gate-tracked background coroutine for each snapshot transfer: - keep abort_source on the coroutine frame (stable lifetime) - store a raw abort_source* in _snapshot_transfers for synchronous abort - erase transfer slots immediately on abort to allow same-batch reuse - close _snapshot_gate during abort() to wait for all in-flight transfers This removes the need for extra aborted-transfer bookkeeping and makes snapshot transfer shutdown and ownership semantics explicit. Fixes: SCYLLADB-1234 Refs: https://github.com/scylladb/scylladb/pull/29092 No backport: Currently the abort source parameter is not being actually used, so this doesn't cause any problems in the current and older branches. So no backport is needed (the using of abort source parameter will be eventually implemented on master afterwards). Closes scylladb/scylladb#29913 * https://github.com/scylladb/scylladb: raft: fix send_snapshot abort_source lifetime raft: fix parameter name mismatch in `send_snapshot()`	2026-05-19 10:15:13 +02:00
Szymon Malewski	6b2fce03f9	alternator: optional stripping of http response headers In Alternator's HTTP API, response headers can dominate bandwidth for small payloads. The Server, Date, and Content-Type headers were sent on every response but many clients never use them. This patch introduces three Alternator config options: - alternator_http_response_server_header, - alternator_http_response_disable_date_header, - alternator_http_response_disable_content_type_header, which allow customizing or suppressing the respective HTTP response headers. All three options support live update (no restart needed). The Server header is no longer sent by default; the Date and Content-Type defaults preserve the existing behavior. The Server and Date header suppression uses Seastar's set_server_header() and set_generate_date_header() APIs added in https://github.com/scylladb/seastar/pull/3217. This patch also fixes deprecation warnings from older Seastar HTTP APIs. Tests are in test/alternator/test_http_headers.py. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-70 Closes scylladb/scylladb#28288	2026-05-19 10:47:13 +03:00
Benny Halevy	97e03762c5	test/cluster/test_keyspace_rf: extend test_create_keyspace_with_default_replication_factor for tablets rack lists Add more racks to dc2 to verify that the default replication factor covers all available racks (rather than e.g. limited to 3). With tablets and rf_rack_valid_keyspaces, verify also the automatically selected rack list. Restrict the extension to non-debug build modes to prevent running out of memory with --repeat=100. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#29931	2026-05-19 10:44:24 +03:00
Nadav Har'El	cd61a44ab8	test/alternator: test response compression of tiny responses This patch adds to the existing collection of tests for Alternator response compression another test with a tiny response being compressed. This test serves two purposes: 1. It verifies setting alternator_response_compression_threshold_in_bytes to a tiny number like 1 really means that tiny responses would be compressed. 2. It verifies that our compression code, which has a special code path for the small chunk at the end of the compression, works correctly. The original motivation for writing this test was a false alarm by Claude Code which claimed that Alternator's response compression code has a serious, exploitable, memory overrun bug, because it set the wrong size limit on that last chunk. Claude was wrong, there is no such bug. We did set an oversized limit on the last chunk (so this patch fixes this typo), but it didn't matter - because the code used deflateBound - the guaranteed maximum size of the uncompressed data - for the buffer's size, so the buffer was unconditionally big enough, no matter which avail_out limit we passed to delate() it could never overflow. The included test passes even before this patch, even with ASAN enabled to detect memory overflows - no overflow was happening. It also passes after the typo correction in this patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#29718	2026-05-19 10:02:26 +03:00
Dawid Pawlik	a631123c06	external_index: fix require CDC options for disabled CDC Since we want to remove the requirement of disallowing "explicitly disabled" CDC table when creating external index (#29894), we still need to check other CDC required parameters to be set properly. Before this commit, once we auto-enable CDC which was "explicitly disabled", we would never run the `check_cdc_options()`. This patch adjusts the check to happen not only when the CDC enabled is true.	2026-05-19 08:53:15 +02:00
Dawid Pawlik	6387c61506	test/cqlpy: add duplicate and view tests for fulltext index Verify that fulltext indexes, which have no backing materialized view, correctly reject duplicate index creation and respect IF NOT EXISTS semantics. Named indexes must not be created twice under the same name; unnamed indexes on the same column must be detected as duplicates. IF NOT EXISTS must silently succeed rather than create a second index, including the known edge cases where the same name is reused across different tables or columns in the same keyspace (VECTOR-641).	2026-05-19 08:52:47 +02:00
Dawid Pawlik	232b1a3725	cql3: generalize viewless index handling in CREATE INDEX statement Replace the `vector_index`-specific checks in `create_index_statement` with a generic `is_viewless_custom_class()` helper that queries the index factory to determine whether an index type creates a backing materialized view. This covers both existing (`vector_index`) and new (`fulltext_index`) viewless index types: - Reject view properties (WITH clause) for any viewless index - Use name-based duplicate detection for named viewless indexes, since they have no backing view table for `has_schema()` to find (issue #26672)	2026-05-19 08:52:47 +02:00
Dawid Pawlik	215a1e3f00	test/cqlpy: add CDC validation tests for fulltext index Verify that fulltext index creation and ALTER TABLE enforce the CDC requirements: creation is rejected when TTL is below the 24-hour minimum, or when the delta mode is neither 'full' nor compensated by postimage. Also verify that enabling postimage or full delta mode allows index creation to succeed, that DROP INDEX works, and that ALTER TABLE cannot disable CDC while a fulltext index is present.	2026-05-19 08:52:47 +02:00
Dawid Pawlik	9e02e11ea8	fulltext_index: enforce CDC requirements for fulltext indexes Fulltext indexes rely on CDC to track changes for asynchronous index building. Enforce the following CDC constraints during CREATE INDEX: - CDC TTL must be at least 86400 seconds (24 hours) - CDC delta mode must be 'full' or postimage must be enabled Add `has_fulltext_index()` and `check_cdc_options()` so that other modules can detect fulltext indexes and validate CDC settings: - include fulltext indexes in `cdc_enabled()` so the CDC log is auto-created, and validate CDC options in `on_before_update_column_family()` - block `ALTER TABLE ... WITH cdc = {'enabled': false}` when a fulltext index exists on the table	2026-05-19 08:52:47 +02:00
Dawid Pawlik	558de64773	test/cqlpy: add tablet requirement test for fulltext index Add `test_create_fulltext_index_requires_tablets` to verify that creating a fulltext index on a keyspace with tablets disabled is rejected.	2026-05-19 08:52:47 +02:00
Dawid Pawlik	69dc62c373	fulltext_index: require tablet storage for fulltext indexes Fulltext indexes, like vector indexes, require the base table's keyspace to use tablets. Add `check_uses_tablets()` validation to `fulltext_index::validate()` that rejects index creation when the keyspace does not use tablet storage. Also add `skip_without_tablets` fixture to all existing fulltext index tests so they are skipped in environments where tablets are not available.	2026-05-19 08:52:47 +02:00
Dawid Pawlik	61d658106a	index: introduce `external_index` base class for VS/FTS indexes Add `external_index` as a common base for `vector_index` and `fulltext_index`, both of which are backed by an external Vector Store engine and share CDC requirements.	2026-05-19 08:52:47 +02:00
Emil Maskovsky	e0f58d1e81	raft: fix send_snapshot abort_source lifetime Fix a lifetime bug where `send_snapshot()` captured `abort_source` by reference and the referenced object could be destroyed before the continuation ran. Use a gate-tracked background coroutine for each snapshot transfer: - keep abort_source on the coroutine frame (stable lifetime) - store a raw abort_source* in _snapshot_transfers for synchronous abort - erase transfer slots immediately on abort to allow same-batch reuse - close _snapshot_gate during abort() to wait for all in-flight transfers This removes the need for extra aborted-transfer bookkeeping and makes snapshot transfer shutdown and ownership semantics explicit. Fixes: SCYLLADB-1234	2026-05-18 21:49:37 +00:00
Avi Kivity	85374207ca	Merge 'test.py: rewrite gather metrics' from Andrei Chekun Rewrite gather metrics to be able to gather metrics for python tests correctly. Python tests require different handling of metrics gathering from cgroup than C++ tests. pytest do not execute each python tests in a separate process, so we can't put it there and get the metrics. The idea is to put the whole pytest process to the cgroup and get the metrics. This will work because pytest runs the threads as a completely separate processes and inside the thread it will run tests consequently. Additionally, to simplify system resource monitor moved to pytest main thread. Change the behavior of the gathering metrics. From this PR some data will be collected even with `--no-gather-metrics`. This data do not need any configuration and just metadata of the tests: test name, time of execution, status of the test. When `--gather-metrics` provided additionally will be written the data gathered from the cgroups about the memory for each specific test and system CPU/RAM utilization. Backport is not needed, because it's a framework change only. Fixes: https://scylladb.atlassian.net/browse/SCYLLADB-575 ~Blocked by: https://github.com/scylladb/scylladb/pull/27618~ Now python tests have metrics gathered from the cgroups as well with their own Scylla instances. ```bash $ sqlite3 --header testlog/sqlite_af8cb.db 'select tst.path, tst.file, tst.test_name, user_sec,system_sec,usage_sec,memory_peak /1024/1024 as memory_peak_mb from test_metrics join tests as tst where tst.id = test_metrics.test_id order by memory_peak_mb desc limit 10;' path\|file\|test_name\|user_sec\|system_sec\|usage_sec\|memory_peak_mb test/cluster/dtest\|limits_test.py\|test_max_cells\|489.468174\|27.6638949999999\|517.132069\|4241 test/cluster/dtest\|rebuild_test.py\|test_rebuild_stream_abort_repro\|93.6400869999998\|28.9843249999999\|122.624412\|4241 test/cluster/dtest\|schema_management_test.py\|test_prepared_statements_work_after_node_restart_after_altering_schema_without_changing_columns\|6.8933219999999\|3.63569899999993\|10.5290209999994\|4241 test/cluster/dtest\|schema_management_test.py\|test_dropping_keyspace_with_many_columns\|1.31770999999981\|0.754742999999962\|2.07245299999977\|4241 test/cluster/dtest\|schema_management_test.py\|test_multiple_create_table_in_parallel\|5.48435300000028\|2.72915200000011\|8.21350499999971\|4241 test/cluster/dtest\|schema_management_test.py\|test_alter_table_in_parallel_to_read_and_write[write]\|80.687293\|18.5562\|99.2434920000005\|4241 test/cluster/dtest\|schema_management_test.py\|test_alter_table_in_parallel_to_read_and_write[read]\|79.1984790000001\|18.0969829999999\|97.2954609999997\|4241 test/cluster/dtest\|schema_management_test.py\|test_alter_table_in_parallel_to_read_and_write[mixed]\|85.332915\|18.9321070000001\|104.265022\|4241 test/cluster/dtest\|schema_management_test.py\|test_update_schema_while_node_is_killed[create_table]\|10.5875369999999\|5.67954400000008\|16.267081\|4241 test/cluster/dtest\|schema_management_test.py\|test_update_schema_while_node_is_killed[alter_table]\|11.3801709999998\|6.54689099999996\|17.9270630000001\|4241 ``` Closes scylladb/scylladb#28206 * github.com:scylladb/scylladb: test.py: Add host hardware info test.py: rewrite resource gather	2026-05-18 20:35:14 +03:00
Dawid Pawlik	c2d27d1a50	index: remove Chinese, Japanese, and Korean language analyzers Remove "chinese", "japanese", and "korean" from the list of accepted full-text search analyzer options. Exposing these options commits ScyllaDB to supporting them long-term — if we ever switch from one backend search engine to another, CJK analyzers are the most likely to lose out-of-the-box support, unlike the popular European languages that are broadly available across text analysis libraries. Restrict the accepted set now, while FTS is still new, to avoid a future compatibility burden. Add a test to check if the CJK language analyzer options are rejected. Fixes: VECTOR-672 Closes scylladb/scylladb#29877	2026-05-18 18:20:47 +03:00
Szymon Malewski	15493872b2	vector_search: fix decimal/varint precision loss in filter value_to_json() value_to_json() converts CQL values to JSON for vector search filters. For decimal and varint types, it used rjson::parse() on the JSON string, which parses through a double and silently loses precision for values exceeding ~15 significant digits — producing wrong filter results. Additionally, for decimal type we need an exact string representation that preserves the original (unscaled, scale) pair, because partition keys use byte-level identity: different serialized representations of the same numeric value are distinct rows, so the filter must reproduce the exact representation stored in the key. Add big_decimal::to_string_canonical() which follows the Java BigDecimal toString() spec (JDK 8+), producing a bijective string representation that uses exponential notation for extreme scales instead of expanding trailing zeros (which could cause OOM). This could replace to_string(), but doing so has wider consequences (e.g. hash/equality contract for decimal_type) described in SCYLLADB-1574. Use it in value_to_json() for decimal_type, and use rjson::from_string() for varint_type, both bypassing the lossy double parse path. Tests cover the new to_string_canonical() and the filter fix, as well as existing decimal type behavior (key representation, clustering order, toJson) that we rely on and must not break. The CQL decimal type tests (test_type_decimal.py) also pass against Cassandra. Fixes: https://scylladb.atlassian.net/browse/SCYLLADB-1583 Refs: https://scylladb.atlassian.net/browse/SCYLLADB-1574 Closes scylladb/scylladb#29505	2026-05-18 17:07:26 +03:00
Piotr Dulikowski	26671d4d5f	Merge 'Refactor view_update_builder' from Wojciech Mitros This series improves the readability and structure of view_update_builder, the component that generates materialized view updates from base-table mutations. The first four patches are pure renames and refactoring with no semantic changes: 1. Document that the builder operates on a single base partition. 2. Rename member fields to clearly distinguish readers (the mutation_reader streams) from the cached fragments (the last mutation_fragment_v2 read from each stream). 3. Rename advance/on_results methods to names that describe what they actually do: read the next fragment, or generate view updates. 4. Extract partition-start handling into its own method. The next two patches are minor optimizations: 5. Simplify clustering-row handling by moving the row out of the fragment before applying the tombstone, avoiding an unnecessary memory-usage recalculation in the reader permit. 6. Replace deep copies with moves in the existing-only tail path, matching the pattern used everywhere else. Finally, patch 7 deduplicates the fragment-consuming logic by extracting the three repeated blocks into consume_both_fragments(), consume_update_fragment(), and consume_existing_fragment(). Code reorganization - no backport needed Closes scylladb/scylladb#29497 * github.com:scylladb/scylladb: mv: deduplicate code for consuming fragments in view_update_builder mv: avoid unnecessary copies of existing rows in generate_updates() mv: simplify clustering row handling in generate_updates() mv: rename methods in view_update_builder for clarity mv: rename view_update_builder readers and cached fragments mv: drop redundant std::move from partition key extraction mv: document single-partition builder scope	2026-05-18 15:52:26 +02:00
Piotr Dulikowski	5efb43195e	Merge 'db/schema_tables: don't emit empty view_building_tasks mutation on ALTER TABLE' from Michał Jadwiszczak After recent change (`1a32ccd`) `make_update_indices_mutations()` is unconditionally adding a mutation for `system.view_building_tasks`, even when no indices were being dropped. In a mixed-version cluster, the older node may not have this table, causing the Raft schema applier to fail with 'Can't find a column family with UUID ...'. This patch fixes the bug by emitting the mutation when indices are actually dropped (i.e., when the view building cleanup code path was entered). Fixes: SCYLLADB-2026 Refs: scylladb#26557 scylladb#26557 wasn't backported, so this patch also doesn't need to be. Closes scylladb/scylladb#29908 * github.com:scylladb/scylladb: db/schema_tables: don't emit empty view_building_tasks mutation on ALTER TABLE db/view_building_task_mutation_builder: add `empty()` method	2026-05-18 15:37:02 +02:00
Nadav Har'El	5dbd0d71d5	Merge 'test/pylib: test/pylib: Cached Scylla package resolver' from Alex Dathskovsky This series adds a shared helper for resolving, downloading, unpacking, and installing Scylla relocatable packages for test.py. The first patch introduces `version_fetch_utils`, which can resolve public Scylla artifacts from the downloads bucket by version, architecture, package variant, or direct URL. It also centralizes the local cache/install flow using retry handling, marker files, and file locking so repeated or concurrent test runs can safely reuse an existing installation. The second patch wires this helper into the existing Scylla executable setup paths. This removes the hard-coded 2025.1 package URL and replaces the local download/unpack/install logic in `scylla_cluster.py` with the shared resolver. It also makes `--exe-url` use the same cached installer path. Together, these changes make upgrade-test executable selection less brittle, avoid duplicated install logic, and provide a reusable foundation for fetching other Scylla versions in test.py. Closes scylladb/scylladb#29855 * github.com:scylladb/scylladb: test/pylib: use version fetcher for Scylla executable setup test/pylib: add cached Scylla package installer	2026-05-18 16:32:47 +03:00
Yaniv Michael Kaul	5d8d158bdd	call_backport_with_jira.yaml: add missing workflow permissions Add explicit permissions block matching the requirements of the called reusable workflow (contents: read, pull-requests: write, issues: write). Fixes code scanning alerts #181, #182, #183. Closes scylladb/scylladb#29182	2026-05-18 15:50:00 +03:00
Yaniv Michael Kaul	fbf5be5587	docs: update Python deps Ran 'make update' to get the latest version of all dependencies needed to build docs. Tested with 'make test' only. Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> AI-Assisted: no, to my surprise. Backport: not sure. Closes scylladb/scylladb#29909	2026-05-18 15:45:59 +03:00
Andrei Chekun	a03c4fd754	test.py: Add host hardware info Gather additional information about the running host for better metrics analysis	2026-05-18 12:23:40 +02:00
Andrei Chekun	6414c48fc2	test.py: rewrite resource gather Python tests requires different handling of metrics gathering from cgroup than C++ tests. pytest do not execute each python tests in a separate process, so we can't put it there and get the metrics. The idea is to put the whole pytest process to the cgroup and get the metrics. This will work because pytest runs the threads as as completely separate processes and inside the thread it will run tests consequently. Additionally, to simplify system resource monitor moved to pytest main thread.	2026-05-18 12:23:40 +02:00
Marcin Maliszkiewicz	628e1ef2de	Merge 'Introduce auth::config to decouple auth modules from db::config' from Pavel Emelyanov Auth modules (authenticators, role managers, and auth::service) access their configuration options by reaching into db::config through the query processor. This abuses database as proxy object to get configuration. This series introduces a dedicated auth::config struct that carries the configuration options used by auth modules.The config is populated in main.cc and delivered to each shard via sharded_parameter. This makes auth service conform to the overall design, where db::config is split into smaller per-service configs on start, thus decoupling individual components/services from global configuration. Cleaning components dependencies, not backporting. Closes scylladb/scylladb#29870 * github.com:scylladb/scylladb: auth: Remove unused default_superuser() function auth: Switch role managers to use auth::config auth: Switch authenticators to use auth::config auth: Introduce auth::config and wire it through service	2026-05-18 11:32:11 +02:00
Patryk Jędrzejczak	c9592a495e	Merge 'cql: fix missing TABLETS_ROUTING_V1 payload after CAS shard bounce' from Petr Gusev After an internal CAS shard bounce, check_locality() was evaluating against this_shard_id() of the post-bounce shard — which is the correct tablet shard — so it returned nullopt, and LWT/SERIAL responses omitted the tablets-routing-v1 custom payload. The client never learned the correct tablet map. Fix by recording the original entry shard in client_state (initialized to this_shard_id() at construction, preserved across shard bounces via client_state_for_another_shard) and passing it to check_locality() so it compares against the client's actual routing decision. No host_id tracking or forwarded_client_state IDL changes are needed because CAS shard bounces are always intra-node. Fixes SCYLLADB-2041 backport: need to backport to all versions with LWT over tablets Closes scylladb/scylladb#29910 * https://github.com/scylladb/scylladb: cql: refactor add_tablet_info to take tablet_routing_info directly cql: fix UB dereference of nullopt tablet_info in execute_with_condition test/boost: add regression test for missing tablet routing after CAS bounce cql: fix missing TABLETS_ROUTING_V1 payload after CAS shard bounce	2026-05-18 11:19:04 +02:00
Yehuda Lebi	6307e17795	fix: raise scylla-helper.slice CPUWeight from 10 to 100 to prevent node_exporter CPU starvation Closes scylladb/scylladb#29839	2026-05-18 11:55:14 +03:00
Yaniv Michael Kaul	f047e6fd5c	trigger_jenkins.yaml: add missing permissions and fix script injection Add explicit empty permissions block (permissions: {}) since this workflow only triggers Jenkins and sends Slack notifications using its own secrets. Also move expression interpolations into env vars to prevent potential script injection. Fixes code scanning alert #147. Also remove the pre-existing 'permissions: contents: read' block, which would result in duplicate YAML keys (invalid per the YAML spec). Closes scylladb/scylladb#29186	2026-05-18 11:39:39 +03:00
Botond Dénes	cc210813c8	Merge 'cmake: add IDL comparison to build system tool and fix PCH propagation' from Ernest Zaslavsky This series adds IDL file comparison to the build system comparison tool and fixes CMake PCH propagation. 1. `scripts/compare_build_systems.py` only compared compilation flags, link targets, and linker settings — it did not compare IDL-generated file sets. This allowed PR #28843 to pass CI despite adding `strong_consistency/groups_manager.idl.hh` to `configure.py` but not to `idl/CMakeLists.txt`. 2. CMake's `scylla-main` target was not using the precompiled header (`stdafx.hh`), even though configure.py applies it to every source file via `-include-pch`. This caused compilation failures for files relying on transitive includes from the PCH — e.g., `sstables_loader.cc` failed with `no member named 'read_entire_stream' in namespace 'seastar::util'`. Add a 4th comparison check to the build system comparison script: extract IDL-generated file sets from both build systems' ninja files and compare them. The extractors parse ninja build statements — configure.py side filters by build mode, CMake side handles the `\|` separator for implicit outputs — and normalize to a canonical relative path for comparison. Add the missing `strong_consistency/groups_manager.idl.hh` to `idl/CMakeLists.txt`. Add `target_precompile_headers(scylla-main REUSE_FROM scylla-precompiled-header)` so that all sources compiled under `scylla-main` benefit from the PCH, matching configure.py's behavior. Update documentation to reflect the new IDL comparison check. Refs: https://github.com/scylladb/scylladb/pull/29901 Refs: https://github.com/scylladb/scylladb/pull/28843 No backport needed — these are build system improvements only. Closes scylladb/scylladb#29912 * github.com:scylladb/scylladb: cmake: reuse precompiled header in scylla-main target idl: add missing groups_manager.idl.hh to CMakeLists.txt scripts: add IDL-generated file comparison to compare_build_systems	2026-05-18 11:38:14 +03:00
Yaniv Michael Kaul	34aac2030c	paxos: enable paging for internal paxos state queries The paxos state queries (load_paxos_state, save_paxos_promise, etc.) were using page_size=-1 (no paging). While each query returns at most one row and paging never actually kicks in, the lack of paging causes these internal queries to be counted as non-paged reads in the metrics, which can be confusing to users monitoring their cluster. Add LIMIT 1 to the SELECT query so that may_need_paging() short-circuits to false (row_limit <= 1), avoiding pager allocation overhead entirely. Set page_size=1000 so these queries are no longer reported as non-paged reads. Refs: https://scylladb.atlassian.net/browse/CUSTOMER-372 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> Backport: no, improvement Closes scylladb/scylladb#29852	2026-05-18 11:35:55 +03:00
Michał Jadwiszczak	a9b2baf36b	db/schema_tables: don't emit empty view_building_tasks mutation on ALTER TABLE After recent change (`1a32ccd`) `make_update_indices_mutations()` is unconditionally adding a mutation for `system.view_building_tasks`, even when no indices were being dropped. In a mixed-version cluster, the older node may not have this table, causing the Raft schema applier to fail with 'Can't find a column family with UUID ...'. This patch fixes the bug by emitting the mutation when indices are actually dropped (i.e., when the view building cleanup code path was entered). Fixes: SCYLLADB-2026 Refs: scylladb#26557	2026-05-18 10:01:21 +02:00
Michał Jadwiszczak	82eb5611ab	db/view_building_task_mutation_builder: add `empty()` method The method allows to check if the builder contains any changes, so it will allow to skip emitting empty mutation.	2026-05-18 09:54:26 +02:00

1 2 3 4 5 ...

53975 Commits