scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 20:46:56 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	3775593e53	test: perf_simple_query: Add 'sstable-format' command-line option	2026-03-18 16:25:20 +01:00
Tomasz Grabiec	6ee9bc63eb	test: perf_simple_query: Add 'sstable-summary-ratio' command-line option	2026-03-18 16:25:20 +01:00
Tomasz Grabiec	38d130d9d0	test: perf-simple-query: Add option to disable index cache	2026-03-18 16:25:20 +01:00
Marcin Maliszkiewicz	9318c80203	perf: add abort_source support to wait-for-port loops Check abort_source on each retry iteration in wait_for_alternator and wait_for_cql so the wait can be interrupted on shutdown. Didn't use sleep_abortable as the sleep is very short anyway.	2026-03-16 16:14:10 +01:00
Marcin Maliszkiewicz	edf0148bee	perf-alternator: wait for alternator port before running workload This patch is mostly for the purpose of running pgo CI job. We may receive connection error if asyncio.sleep(5) in pgo.py is not sufficient waiting time. In pgo.py we do wait for port but only for cql, anyway it's better to have high level check than trying to wait for alternator port there.	2026-03-16 16:07:52 +01:00
Botond Dénes	6004e84f18	test: move away from tombstone_gc_state(nullptr) ctor Use for_tests() instead (or no_gc() where approriate).	2026-03-03 14:09:28 +02:00
Avi Kivity	27a5502f14	Merge 'Reapply "main: test: add future and abort_source to after_init_func"' from Marcin Maliszkiewicz The patchset fixes abort_source implementation for perf-alternator and perf-cql-raw. It moves run_standalone function to common code in perf.hh with necessary templating. We also add extensive testing so that it's more difficult to break the tooling in the future. Fixes SCYLLADB-560 Backport: no, internal tooling improvement Closes scylladb/scylladb#28541 * github.com:scylladb/scylladb: test: cluster: add tests for perf tools test: perf: fix port race condition on startup in connect workload test: perf: prepare benchmarks to bind to custom host test: perf: make perf-alterantor remote port configurable test: perf: fix ASAN leak warnings in perf-alternator Reapply "main: test: add future and abort_source to after_init_func"	2026-02-19 19:12:46 +02:00
Ernest Zaslavsky	45d824e0fe	s3_perf: fix upload operation flow Correct the upload operation logic. The previous flow incorrectly checked for the test file on S3 even when performing operations that do not download the file, such as uploads.	2026-02-19 11:14:59 +02:00
Marcin Maliszkiewicz	c69534504c	test: perf: fix port race condition on startup in connect workload Other workloads at startup call prepopulate() which connects with retry loop therefore it waits until cql port is open. This commit adds a single place where we will wait for port for all workloads. Timeout is set to 5 minutes so that even slowest machines are able to start.	2026-02-19 09:33:10 +01:00
Marcin Maliszkiewicz	828f2fbdb1	test: perf: prepare benchmarks to bind to custom host This is usefull for tests where we use local networks like 127.5.5.5 to avoid port and host collisions.	2026-02-19 09:33:10 +01:00
Marcin Maliszkiewicz	9f2b97bef4	test: perf: make perf-alterantor remote port configurable It could be a usefull option to have.	2026-02-19 09:33:10 +01:00
Marcin Maliszkiewicz	f5a212e91e	test: perf: fix ASAN leak warnings in perf-alternator Those were intentional as test process is short lived but when we add automated tests in the following commits we expect clean exit, with 0 exit code.	2026-02-19 09:33:10 +01:00
Marcin Maliszkiewicz	0c76c73e34	Reapply "main: test: add future and abort_source to after_init_func" This reverts commit `ceec703bb7`. The commit was fixed with abort source handling for alternator standalone path so it's safe to reapply.	2026-02-19 09:33:10 +01:00
Ernest Zaslavsky	4026b54a5e	s3_perf: fix the CMake build Fix the CMake build of the perf_s3_client by adding the necessary linkage with the jsoncpp library.	2026-02-18 17:12:08 +02:00
Ferenc Szili	d7cfaf3f84	test, simulator: compute load based on tablet size instead of count This patch changes the load balancing simulator so that it computes table load based on tablet sizes instead of tablet count. best_shard_overcommit measured minimal allowed overcommit in cases where the number of tablets can not be evenly distributed across all the available shards. This is still the case, but instead of computing it as an integer div_ceil() of the average shard load, it is now computed by allocating the tablet sizes using the largest-tablet-first method. From these, we can get the lowest overcommit for the given set of nodes, shards and tablet sizes.	2026-02-12 12:54:55 +01:00
Ferenc Szili	216443c050	test, simulator: generate tablet sizes and update load_stats This change adds a random tablet size generator. The tablet sizes are created in load_stats. Further changes to the load balance simulator: - apply_plan() updates the load_stats after a migration plan is issued by the load balancer, - adds the option to set a command line option which controls the tablet size deviation factor.	2026-02-12 12:54:55 +01:00
Ferenc Szili	e31870a02d	test, simulator: postpone creation of load_stats_ptr With size based load balancing, we will have to move the tablet size in load_stats after each internode migration issued by balance_tablets(). This will be done in a subsequent commit in apply_plan() which is called from rebalance_tablets(). Currently, rebalance_tablets() is passed a load_stats_ptr which is defined as: using load_stats_ptr = lw_shared_ptr<const load_stats>; Because this is a pointer to const, apply_plan() can't modify it. So, we pass a reference to load_stats to rebalance_tablets() and create a load_stats_ptr from it for each call to balance_tablets().	2026-02-12 12:54:55 +01:00
Avi Kivity	ceec703bb7	Revert "main: test: add future and abort_source to after_init_func" This reverts commit `7bf7ff785a`. The commit tried to add clean shutdown to `scylla perf` paths, but forgot at least `scylla perf-alternator --workload wr` which now crashes on uninitialized `c.as`. Fixes #28473 Closes scylladb/scylladb#28478	2026-02-02 09:22:24 +01:00
Avi Kivity	6676953555	Merge 'test: perf: add option to write results to json in perf-cql-raw and perf-alternator' from Marcin Maliszkiewicz Adds --json-result option to perf-cql-raw and perf-alternator, the same as perf-simple-query has. It is useful for automating test runs. Related: https://scylladb.atlassian.net/browse/SCYLLADB-434 Bacport: no, original benchmark is not backported Closes scylladb/scylladb#28451 * github.com:scylladb/scylladb: test: perf: add example commands to perf-alternator and perf-cql-raw test: perf: add option to write results to json in perf-cql-raw test: perf: add option to write results to json in perf-alternator test: perf: move write_json_result to a common file	2026-02-01 13:57:10 +02:00
Marcin Maliszkiewicz	80e627c64b	test: perf: add example commands to perf-alternator and perf-cql-raw	2026-01-30 08:48:19 +01:00
Marcin Maliszkiewicz	ea29e4963e	test: perf: add option to write results to json in perf-cql-raw	2026-01-29 10:56:03 +01:00
Marcin Maliszkiewicz	d974ee1e21	test: perf: add option to write results to json in perf-alternator	2026-01-29 10:55:52 +01:00
Marcin Maliszkiewicz	a74b442c65	test: perf: move write_json_result to a common file The implementation is going to be shared with perf-alternator and perf-cql-raw.	2026-01-29 10:54:11 +01:00
Tomasz Grabiec	0d090aa47b	tablets: Cache pointer to stats during plan-making Saves on lookup cost, esp. for candidate evaluation. This showed up in perf profile in the past. Also, lays the ground for splitting stats per rack.	2026-01-28 01:32:00 +01:00
Marcin Maliszkiewicz	32543625fc	test: perf: reuse stream id When one request is super slow and req/s high in theory we have a collision on id, this patch avoids that by reusing id and aborting when there is no free one (unlikely).	2026-01-22 12:26:50 +01:00
Marcin Maliszkiewicz	7bf7ff785a	main: test: add future and abort_source to after_init_func This commit avoids leaking seastar::async future from two benchmark tools: perf-alternator and perf-cql-raw. Additionally it adds abort_source for fast and clean shutdown.	2026-01-22 12:26:50 +01:00
Marcin Maliszkiewicz	0d20300313	test: perf: add option to stress multiple tables in perf-cql-raw	2026-01-22 12:26:50 +01:00
Marcin Maliszkiewicz	a033b70704	test: perf: add perf-cql-raw benchmarking tool The tool supports: - auth or no auth modes - simple read and write workloads - connection pool or connection per request modes - in-process or remote modes, remote may be usefull to assess tool's overhead or use it as bigger scale benchmark - uses prepared statements by default - connection only mode, for testing storms It could support in the future: - TLS mode - different workloads - shard awareness Example usage: > build/release/scylla perf-cql-raw --workdir /tmp/scylla-data --smp 2 --cpus 0,1 \ --developer-mode 1 --workload read --duration 5 2> /dev/null Running test with config: {workload=read, partitions=10000, concurrency=100, duration=5, ops_per_shard=0} Pre-populated 10000 partitions 97438.42 tps (269.2 allocs/op, 1.1 logallocs/op, 35.2 tasks/op, 113325 insns/op, 80572 cycles/op, 0 errors) 102460.77 tps (261.1 allocs/op, 0.0 logallocs/op, 31.7 tasks/op, 108222 insns/op, 75447 cycles/op, 0 errors) 95707.93 tps (261.0 allocs/op, 0.0 logallocs/op, 31.7 tasks/op, 108443 insns/op, 75320 cycles/op, 0 errors) 102487.87 tps (261.0 allocs/op, 0.0 logallocs/op, 31.7 tasks/op, 107956 insns/op, 74320 cycles/op, 0 errors) 100409.60 tps (261.0 allocs/op, 0.0 logallocs/op, 31.7 tasks/op, 108337 insns/op, 75262 cycles/op, 0 errors) throughput: mean= 99700.92 standard-deviation=3039.28 median= 100409.60 median-absolute-deviation=2759.85 maximum=102487.87 minimum=95707.93 instructions_per_op: mean= 109256.53 standard-deviation=2281.39 median= 108337.37 median-absolute-deviation=1034.83 maximum=113324.69 minimum=107955.97 cpu_cycles_per_op: mean= 76184.36 standard-deviation=2493.46 median= 75320.20 median-absolute-deviation=922.09 maximum=80572.19 minimum=74320.00	2026-01-22 12:26:50 +01:00
Marcin Maliszkiewicz	1318ff5a0d	test: perf: move cut_arg helper func to common code It will be reused later.	2026-01-19 14:33:10 +01:00
Ferenc Szili	621cb19045	load_sketch: use tablet sizes in load computation This commit changes load_sketch so that it computes node and shard load based on tablet sizes instead of tablet count.	2025-12-27 10:37:23 +01:00
Aleksandra Martyniuk	d66a36058b	service: pass topology and system_keyspace to load_balancer ctor Pass a pointer to service::topology and db::system_keyspace to load balancer. It will be used in the following patches to create rack_list_colocation plan.	2025-12-16 13:25:38 +01:00
Tomasz Grabiec	721434054b	perf-row-cache-update: Add scenario with large tombstone covering many rows Fills memtable with rows and a tombstone which deletes all rows which are already in cache. Similar to raft log workload, but more extreme. With -c1 -m4G, observed really bad performance: update: 1711.976196 [ms], preemption: {count: 22603, 99%: 0.943127 [ms], max: 1494.571776 [ms]}, cache: 2148/2906 [MB], alloc/comp: 1334/869 [MB] (amp: 0.651), pr/me/dr 1062186/0/1062187 cache: 2148/2906 [MB], memtable: 738/1024 [MB], alloc/comp: 993/0 [MB] (amp: 0.000) Which means that max reactor stall during cache update was 1.5 [s] 0.7 GB memtables. 2.1 GB in cache.	2025-12-06 01:03:09 +01:00
Botond Dénes	6ee0f1f3a7	Merge 'replica/table: add a metric for hypothetical total file size without compression' from Michał Chojnowski This patch adds a metric for pre-compression size of sstable files. This patch adds a per-table metric `scylla_column_family_total_disk_space_before_compression`, which measures the hypothetical total size of sstables on disk, if Data.db was replaced with an uncompressed equivalent. As for the implementation: Before the patch, tables and sstable sets are already tracking their total physical file size. Whenever sstables are added or removed, the size delta is propagated from the sstable up through sstable sets into table_stats. To implement the new metric, we turn the size delta that is getting passed around from a one-dimensional to a two-dimensional value, which includes both the physical and the pre-compression size. New functionality, no backport needed. Closes scylladb/scylladb#26996 * github.com:scylladb/scylladb: replica/table: add a metric for hypothetical total file size without compression replica/table: keep track of total pre-compression file size	2025-11-20 09:10:38 +02:00
Michał Chojnowski	1cfce430f1	replica/table: keep track of total pre-compression file size Every table and sstable set keeps track of the total file size of contained sstables. Due to a feature request, we also want to keep track of the hypothetical file size if Data files were uncompressed, to add a metric that shows the compression ratio of sstables. We achieve this by replacing the relevant `uint_64 bytes_on_disk` counters everywhere with a struct that contains both the actual (post-compression) size and the hypothetical pre-compression size. This patch isn't supposed to change any observable behavior. In the next patch, we will use these changes to add a new metric.	2025-11-13 00:49:57 +01:00
Dario Mirovic	549e6307ec	audit: unify `create_audit` and `start_audit` There is no need to have `create_audit` separate from `start_audit`. `create_audit` just stores the passed parameters, while `start_audit` does the actual initialization and startup work. Refs #26022	2025-11-06 03:05:06 +01:00
Botond Dénes	24c6476f73	mutation/mutation_compactor: add tombstone_gc_state to query ctor So tombstones can be purged correctly based on the tombstone gc mode. Currently if repair-mode is used, tombstones are not purged at all, which can lead to purged tombstone being re-replicated to replicas which already purged them via read-repair. This is not a correctness problem, tombstones are not included in data query resutl or digest, these purgable tombstone are only a nuissance for read repair, where they can create extra differences between replicas. Note that for the read repair to trigger, some difference other than in purgable tombstones has to exist, because as mentioned above, these are not included in digets. Fixes: scylladb/scylladb#24332 Closes scylladb/scylladb#26351	2025-10-12 17:48:15 +03:00
Andrzej Jackowski	14081d0727	generic_server: transport: start using `sl:driver` for new connections Before this change, new connections were handled in a default scheduling group (`main`), because before the user is authenticated we do not know which service level should be used. With the new `sl:driver` service level, creation of new connections can be moved to `sl:driver`. We switch the service level as early as possible, in `do_accepts`. There is a possibility, that `sl:driver` will not exist yet, for instance, in specific upgrade cases, or if it was removed. Therefore, we also switch to `sl:driver` after a connection is accepted. Refs: scylladb/scylladb#24411	2025-10-08 08:25:12 +02:00
Pavel Emelyanov	6ad8dc4a44	Merge 'root,replica: mv querier to replica/' from Botond Dénes The querier object is a confusing one. Based on its name it should be in the query/ module and it is already in the query namespace. The query namespace is used for symbols which span the coordinator and replica, or that are mostly coordinator side. The querier is mainly in this namespace due to its similar name and because at the time it was introduced, namespace replica didn't exist yet. But this is a mistake which confuses people. The querier is actually a completely replica-side logic, implementing the caching of the readers on the replica. Move it to the replica module and namespace to make this more clear. Code cleanup, no backport. Closes scylladb/scylladb#26280 * github.com:scylladb/scylladb: replica: move querier code to replica namespace root,replica: mv querier to replica/	2025-10-06 08:26:05 +03:00
Avi Kivity	15fa1c1c7e	Merge 'sstables/trie: translate all key cells in one go, not lazily' from Michał Chojnowski Applying lazy evaluation to the BTI encoding of clustering keys was probably a bad default. The possible benefits are dubious (because it's quite likely that the laziness won't allow us to avoid that much work), but the overhead needed to implement the laziness is large and immediate. In this patch we get rid of the laziness. We rewrite lazy_comparable_bytes_from_clustering_position and lazy_comparable_bytes_from_ring_position so that they performs the key translation eagerly, all components to a single bytes_ostream in one synchronous call. perf_bti_key_translation (microbenchmark added in this series, 1 iteration is 100 translations of a clustering key with 8 cells of int32_type): ``` Before: test iterations median mad min max allocs tasks inst cycles lcb_mismatch_test.lcb_mismatch 9233 109.930us 0.000ns 109.930us 109.930us 4356.000 0.000 2615394.3 614709.6 After: test iterations median mad min max allocs tasks inst cycles lcb_mismatch_test.lcb_mismatch 50952 19.487us 0.000ns 19.487us 19.487us 198.000 0.000 603120.1 109042.9 ``` Enhancement, backport not required. Closes scylladb/scylladb#26302 * github.com:scylladb/scylladb: sstables/trie: BTI-translate the entire partition key at once sstables/trie: avoid an unnecessary allocation of std::generator in last_block_offset() sstables/trie: perform the BTI-encoding of position_in_partition eagerly types/comparable_bytes: add comparable_bytes_from_compound test/perf: add perf_bti_key_translation	2025-10-01 14:59:06 +03:00
Benny Halevy	b17a36c071	tablets: read_tablet_mutations: use unfreeze_and_split_gently Split the tablets mutations by number of rows, based on `min_tablets_in_mutation` (currently calibrated to 1024), similar to the splitting done in `storage_service::merge_topology_snapshot`. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-09-30 17:15:41 +03:00
Benny Halevy	3c07e0e877	perf-tablets: change default tables and tablets-per-table tablets-per-table must be a power of 2, so round up 10000 to 16K. also, reduce number of tables to have a total of about 100K tablets, otherwise we hit the maximum commitlog mutation size limit in save_tablet_metadata. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-09-30 17:07:06 +03:00
Benny Halevy	2c3fb341e9	perf-tablets: abort on unhandled exception Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-09-30 17:07:06 +03:00
Ernest Zaslavsky	debc756794	treewide: Move transport related files to a `transport` directory As requested in #22112 , moved the files and fixed other includes and build system. Moved files: - generic_server.hh - generic_server.cc - protocol_server.hh Fixes: #22112 This is a cleanup, no need to backport Closes scylladb/scylladb#25090	2025-09-29 11:46:06 +03:00
Botond Dénes	2b4a140610	replica: move querier code to replica namespace The query namespace is used for symbols which span the coordinator and replica, or that are mostly coordinator side. The querier is mainly in this namespace due to its similar name, but this is a mistake which confuses people. Now that the code was moved to replica/, also fix the namespace to be namespace replica.	2025-09-29 06:44:52 +03:00
Botond Dénes	ee3d2f5b43	root,replica: mv querier to replica/ The querier object is a confusing one. Based on its name it should be in the query/ module and it is already in the query namespace. But this is actually a completely replica-side logic, implementing the caching of the readers on the replica. Move it to the replica module to make this more clear.	2025-09-29 06:33:53 +03:00
Michał Chojnowski	3703197c4c	test/perf: add perf_bti_key_translation Add a microbenchmark for translating keys to BTI encoding.	2025-09-29 04:08:00 +02:00
Botond Dénes	1999d8e3d3	compaction: remove using namespace {compaction,sstables} Some files in compaction/ have using namespace {compaction,sstables} clauses, some even in headers. This is considered bad practice and muddies the namespace use. Remove them.	2025-09-25 15:03:57 +03:00
Botond Dénes	86ed627fc4	compaction: move code to namespace compaction The namespace usage in this directory is very inconsistent, with files and classes scattered in: * global namespace * namespace compaction * namespace sstables With cases, where all three used in the same file. This code used to live in sstables/ and some of it still retains namespace sstables as a heritage of that time. The mismatch between the dir (future module) and the namespace used is confusing, so finish the migration and move all code in compaction/ to namespace compaction too. This patch, although large, is mechanic and only the following kind of changes are made: * replace namespace sstable {} with namespace compaction {} * add namespace compaction {} * drop/add sstables:: * drop/add compaction:: * move around forward-declarations so they are in the correct namespace context This refactoring revealed some awkward leftover coupling between sstables and compaction, in sstables/sstable_set.cc, where the make_sstable_set() methods of compaction strategies are implemented.	2025-09-25 15:03:56 +03:00
Tomasz Grabiec	2b03a69065	test: perf: perf-load-balancing: Add parallel-scaleout scenario Simulates reblancing on a single scale-out involving simultaneous addition of multiple nodes per rack. Default parameters create a cluster with 2 racks, 70 tables, 256 tablets/table, 10 nodes, 88 shards/node. Adds 6 nodes in parallel (3 per rack). Current result on my laptop: testlog - Rebalance took 21.874 [s] after 82 iteration(s)	2025-09-23 00:31:31 +02:00
Tomasz Grabiec	0dcaaa061e	test: perf: perf-load-balancing: Convert to tool_app_template To support sub-commands for testing different scenarios. The current scenario is given the name "rolling-add-dec".	2025-09-23 00:30:38 +02:00

1 2 3 4 5 ...

509 Commits