mirror of https://github.com/scylladb/scylladb.git synced 2026-05-12 19:02:12 +00:00

Avi Kivity 1f7dca0225 Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

This applies to small-partition workloads where index pages have a high partition count and the index doesn't fit in cache. The count was observed to be on the order of hundreds. In such a workload, pages undergo constant population, LSA compaction, and LSA eviction, which has a severe impact on CPU utilization.

Refs https://scylladb.atlassian.net/browse/SCYLLADB-620

This PR reduces the impact through several changes:

  - Reducing the memory footprint of the partition index. Assuming a partition key size of 16 bytes, the cost dropped from 96 bytes to 36 bytes per partition.

  - Flattening the object graph and amortizing storage: entries are stored directly in the vector, all key values are stored in a single managed_bytes, and index_entry is now a trivial struct.

  - Index entries and key storage are now trivially moveable and batched inside vector storage,
    so LSA migration can use memcpy(), which amortizes the cost per key. This reduces the cost of LSA segment compaction.

  - LSA eviction is now close to constant time for the whole page,
    regardless of the number of entries, because elements are trivial and batched inside vectors.
    Page eviction cost dropped from 50 us to 1 us.
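
The layout idea in the list above can be illustrated with a minimal sketch. It is not the actual Scylla code: the names index_entry_sketch, index_page_sketch and relocate() are hypothetical, std::vector stands in for the LSA-managed vector and managed_bytes, and the field set is an assumption chosen only to show a trivial entry type plus a single shared key buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <string_view>
#include <type_traits>
#include <vector>

// Hypothetical, simplified entry: fixed-size fields only, no owned pointers.
struct index_entry_sketch {
    std::int64_t raw_token;     // token kept as a plain integer
    std::uint64_t data_offset;  // position of the partition in the data file
    std::uint32_t key_offset;   // start of this entry's key in the shared buffer
};
static_assert(std::is_trivially_copyable_v<index_entry_sketch>);

// Hypothetical page: entries and key bytes each live in one contiguous buffer.
struct index_page_sketch {
    std::vector<index_entry_sketch> entries;
    std::vector<char> keys;  // all key bytes, stored back to back

    void add(std::int64_t token, std::uint64_t data_offset, std::string_view key) {
        entries.push_back({token, data_offset, static_cast<std::uint32_t>(keys.size())});
        keys.insert(keys.end(), key.begin(), key.end());
    }

    // A key ends where the next one starts, or at the end of the buffer.
    std::string_view key_at(std::size_t i) const {
        std::size_t begin = entries[i].key_offset;
        std::size_t end = i + 1 < entries.size() ? entries[i + 1].key_offset : keys.size();
        return std::string_view(keys.data() + begin, end - begin);
    }
};

// Moving the page contents takes two bulk copies, however many entries there are.
inline index_page_sketch relocate(const index_page_sketch& src) {
    index_page_sketch dst;
    dst.entries.resize(src.entries.size());
    dst.keys.resize(src.keys.size());
    if (!src.entries.empty()) {
        std::memcpy(dst.entries.data(), src.entries.data(),
                    src.entries.size() * sizeof(index_entry_sketch));
    }
    if (!src.keys.empty()) {
        std::memcpy(dst.keys.data(), src.keys.data(), src.keys.size());
    }
    return dst;
}
```

Because no entry owns a separate allocation, copying or dropping the page touches two buffers rather than one object per partition, which is the property the list above relies on for cheaper compaction and eviction.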

Performance evaluated with:

   scylla perf-simple-query -c1 -m200M --partitions=1000000

Before:

```
7774.96 tps (166.0 allocs/op, 521.7 logallocs/op,  54.0 tasks/op,  802428 insns/op,  430457 cycles/op,        0 errors)
7511.08 tps (166.1 allocs/op, 527.2 logallocs/op,  54.0 tasks/op,  804185 insns/op,  430752 cycles/op,        0 errors)
7740.44 tps (166.3 allocs/op, 526.2 logallocs/op,  54.2 tasks/op,  805347 insns/op,  432117 cycles/op,        0 errors)
7818.72 tps (165.2 allocs/op, 517.6 logallocs/op,  53.7 tasks/op,  794965 insns/op,  427751 cycles/op,        0 errors)
7865.49 tps (165.1 allocs/op, 513.3 logallocs/op,  53.6 tasks/op,  788898 insns/op,  425171 cycles/op,        0 errors)
```

After (+318%):

```
32492.40 tps (130.7 allocs/op,  12.8 logallocs/op,  36.1 tasks/op,  109236 insns/op,  103203 cycles/op,        0 errors)
32591.99 tps (130.4 allocs/op,  12.8 logallocs/op,  36.0 tasks/op,  108947 insns/op,  102889 cycles/op,        0 errors)
32514.52 tps (130.6 allocs/op,  12.8 logallocs/op,  36.0 tasks/op,  109118 insns/op,  103219 cycles/op,        0 errors)
32491.14 tps (130.6 allocs/op,  12.8 logallocs/op,  36.0 tasks/op,  109349 insns/op,  103272 cycles/op,        0 errors)
32582.90 tps (130.5 allocs/op,  12.8 logallocs/op,  36.0 tasks/op,  109269 insns/op,  102872 cycles/op,        0 errors)
32479.43 tps (130.6 allocs/op,  12.8 logallocs/op,  36.0 tasks/op,  109313 insns/op,  103242 cycles/op,        0 errors)
32418.48 tps (130.7 allocs/op,  12.8 logallocs/op,  36.1 tasks/op,  109201 insns/op,  103301 cycles/op,        0 errors)
31394.14 tps (130.7 allocs/op,  12.8 logallocs/op,  36.1 tasks/op,  109267 insns/op,  103301 cycles/op,        0 errors)
32298.55 tps (130.7 allocs/op,  12.8 logallocs/op,  36.1 tasks/op,  109323 insns/op,  103551 cycles/op,        0 errors)
```
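
For reference, the +318% figure can be reproduced from the tps columns of the two runs above. The sketch below assumes the percentage is the relative gain of the mean throughput; the helper names mean() and gain_percent() are illustrative only.

```cpp
#include <cassert>
#include <cmath>
#include <numeric>
#include <vector>

// Arithmetic mean of a non-empty sample.
inline double mean(const std::vector<double>& v) {
    return std::accumulate(v.begin(), v.end(), 0.0) / static_cast<double>(v.size());
}

// Relative throughput gain in percent: (mean(after) / mean(before) - 1) * 100.
inline double gain_percent(const std::vector<double>& before,
                           const std::vector<double>& after) {
    return (mean(after) / mean(before) - 1.0) * 100.0;
}

// tps samples copied from the "Before" and "After" runs above.
inline const std::vector<double> before_tps = {
    7774.96, 7511.08, 7740.44, 7818.72, 7865.49};
inline const std::vector<double> after_tps = {
    32492.40, 32591.99, 32514.52, 32491.14, 32582.90,
    32479.43, 32418.48, 31394.14, 32298.55};
```

With these samples the mean rises from about 7742 tps to about 32363 tps, a ratio of roughly 4.18, which rounds to the quoted +318%.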

When the workload is miss-only, with both row cache and index cache disabled (no cache maintenance cost):

  perf-simple-query -c1 -m200M --duration 6000 --partitions=100000 --enable-index-cache=0 --enable-cache=0

Before:

```
9124.57 tps (146.2 allocs/op, 789.0 logallocs/op,  45.3 tasks/op,  889320 insns/op,  357937 cycles/op,        0 errors)
9437.23 tps (146.1 allocs/op, 789.3 logallocs/op,  45.3 tasks/op,  889613 insns/op,  357782 cycles/op,        0 errors)
9455.65 tps (146.0 allocs/op, 787.4 logallocs/op,  45.2 tasks/op,  887606 insns/op,  357167 cycles/op,        0 errors)
9451.22 tps (146.0 allocs/op, 787.4 logallocs/op,  45.3 tasks/op,  887627 insns/op,  357357 cycles/op,        0 errors)
9429.50 tps (146.0 allocs/op, 787.4 logallocs/op,  45.3 tasks/op,  887761 insns/op,  358148 cycles/op,        0 errors)
9430.29 tps (146.1 allocs/op, 788.2 logallocs/op,  45.3 tasks/op,  888501 insns/op,  357679 cycles/op,        0 errors)
9454.08 tps (146.0 allocs/op, 787.3 logallocs/op,  45.3 tasks/op,  887545 insns/op,  357132 cycles/op,        0 errors)
```

After (+55%):

```
14484.84 tps (150.7 allocs/op,   6.5 logallocs/op,  44.7 tasks/op,  396164 insns/op,  229490 cycles/op,        0 errors)
14526.21 tps (150.8 allocs/op,   6.5 logallocs/op,  44.8 tasks/op,  396401 insns/op,  228824 cycles/op,        0 errors)
14567.53 tps (150.7 allocs/op,   6.5 logallocs/op,  44.7 tasks/op,  396319 insns/op,  228701 cycles/op,        0 errors)
14545.63 tps (150.6 allocs/op,   6.5 logallocs/op,  44.7 tasks/op,  395889 insns/op,  228493 cycles/op,        0 errors)
14626.06 tps (150.5 allocs/op,   6.5 logallocs/op,  44.7 tasks/op,  395254 insns/op,  227891 cycles/op,        0 errors)
14593.74 tps (150.5 allocs/op,   6.5 logallocs/op,  44.7 tasks/op,  395480 insns/op,  227993 cycles/op,        0 errors)
14538.10 tps (150.8 allocs/op,   6.5 logallocs/op,  44.8 tasks/op,  397035 insns/op,  228831 cycles/op,        0 errors)
14527.18 tps (150.8 allocs/op,   6.5 logallocs/op,  44.8 tasks/op,  396992 insns/op,  228839 cycles/op,        0 errors)
```

Same as above, but with summary ratio increased from 0.0005 to 0.005 (smaller pages):

Before:

```
33906.70 tps (146.1 allocs/op,  83.6 logallocs/op,  45.1 tasks/op,  170553 insns/op,   98104 cycles/op,        0 errors)
32696.16 tps (146.0 allocs/op,  83.5 logallocs/op,  45.1 tasks/op,  170369 insns/op,   98405 cycles/op,        0 errors)
33889.05 tps (146.1 allocs/op,  83.6 logallocs/op,  45.1 tasks/op,  170551 insns/op,   98135 cycles/op,        0 errors)
33893.24 tps (146.1 allocs/op,  83.5 logallocs/op,  45.1 tasks/op,  170488 insns/op,   98168 cycles/op,        0 errors)
33836.73 tps (146.1 allocs/op,  83.6 logallocs/op,  45.1 tasks/op,  170528 insns/op,   98226 cycles/op,        0 errors)
33897.61 tps (146.0 allocs/op,  83.5 logallocs/op,  45.1 tasks/op,  170428 insns/op,   98081 cycles/op,        0 errors)
33834.73 tps (146.1 allocs/op,  83.5 logallocs/op,  45.1 tasks/op,  170438 insns/op,   98178 cycles/op,        0 errors)
33776.31 tps (146.3 allocs/op,  83.9 logallocs/op,  45.2 tasks/op,  170958 insns/op,   98418 cycles/op,        0 errors)
33808.08 tps (146.3 allocs/op,  83.9 logallocs/op,  45.2 tasks/op,  170940 insns/op,   98388 cycles/op,        0 errors)
```

After (+18%):

```
40081.51 tps (148.2 allocs/op,   4.4 logallocs/op,  45.0 tasks/op,  121047 insns/op,   82231 cycles/op,        0 errors)
40005.85 tps (148.6 allocs/op,   4.4 logallocs/op,  45.2 tasks/op,  121327 insns/op,   82545 cycles/op,        0 errors)
39816.75 tps (148.3 allocs/op,   4.4 logallocs/op,  45.1 tasks/op,  121067 insns/op,   82419 cycles/op,        0 errors)
39953.11 tps (148.1 allocs/op,   4.4 logallocs/op,  45.0 tasks/op,  121027 insns/op,   82258 cycles/op,        0 errors)
40073.96 tps (148.2 allocs/op,   4.4 logallocs/op,  45.0 tasks/op,  121006 insns/op,   82313 cycles/op,        0 errors)
39882.25 tps (148.2 allocs/op,   4.4 logallocs/op,  45.0 tasks/op,  120925 insns/op,   82320 cycles/op,        0 errors)
39916.08 tps (148.3 allocs/op,   4.4 logallocs/op,  45.1 tasks/op,  121054 insns/op,   82393 cycles/op,        0 errors)
39786.30 tps (148.2 allocs/op,   4.4 logallocs/op,  45.0 tasks/op,  121027 insns/op,   82465 cycles/op,        0 errors)
38662.45 tps (148.3 allocs/op,   4.4 logallocs/op,  45.0 tasks/op,  121108 insns/op,   82312 cycles/op,        0 errors)
39849.42 tps (148.3 allocs/op,   4.4 logallocs/op,  45.1 tasks/op,  121098 insns/op,   82447 cycles/op,        0 errors)
```

Closes scylladb/scylladb#28603

* github.com:scylladb/scylladb:
  sstables: mx: index_reader: Optimize parsing for no promoted index case
  vint: Use std::countl_zero()
  test: sstable_partition_index_cache_test: Validate scenario of pages with sparse promoted index placement
  sstables: mx: index_reader: Amoritze partition key storage
  managed_bytes: Hoist write_fragmented() to common header
  utils: managed_vector: Use std::uninitialized_move() to move objects
  sstables: mx: index_reader: Keep promoted_index info next to index_entry
  sstables: mx: index_reader: Extract partition_index_page::clear_gently()
  sstables: mx: index_reader: Shave-off 16 bytes from index_entry by using raw_token
  sstables: mx: index_reader: Reduce allocation_section overhead during index page parsing by batching allocation
  sstables: mx: index_reader: Keep index_entry directly in the vector
  dht: Introduce raw_token
  test: perf_simple_query: Add 'sstable-format' command-line option
  test: perf_simple_query: Add 'sstable-summary-ratio' command-line option
  test: perf-simple-query: Add option to disable index cache
  test: cql_test_env: Respect enable-index-cache config

(cherry picked from commit 5e7fb08bf3)

Closes scylladb/scylladb#29136

Closes scylladb/scylladb#29140

2026-03-20 10:58:26 +02:00

.github

ci: harden trigger-scylla-ci workflow against credential leaks and untrusted PRs

2026-02-27 11:28:11 +02:00

abseil @ d7aaad83b4

…

alternator

Merge '[Backport 2025.4] alternator: fix batch writes during intranode tablet migrations' from Scylladb[bot]

2026-01-08 16:40:20 +02:00

api

api: storage_service/tablets/repair: disable incremental repair by default

2025-12-11 23:49:07 +00:00

audit

Revert "build: add precompiled headers to CMakeLists.txt"

2025-09-03 09:46:00 +03:00

auth

auth: add possibilty to check for any permission in set

2025-10-30 10:13:16 +00:00

bin

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

cdc

tombstone_gc: don't use 'repair' mode for colocated tables

2025-11-26 08:36:52 +01:00

cmake

cmake: introduce Scylla_WITH_DEBUG_INFO option

2025-09-25 12:49:36 +03:00

compaction

compaction_manager: fix maybe_wait_for_sstable_count_reduction() hanging forever

2026-02-27 01:39:05 +02:00

conf

Fix comment for tablets_mode_for_new_keyspaces

2025-11-26 20:42:06 +02:00

cql3

cql3/statements/describe_statement: hide paxos state tables

2026-02-02 23:31:17 +00:00

data_dictionary

Revert "build: add precompiled headers to CMakeLists.txt"

2025-09-03 09:46:00 +03:00

Merge '[Backport 2026.1] mv: allow skipping view updates when a collection is unmodified' from Scylladb[bot]

2026-03-18 10:41:50 +01:00

debug

…

dht

Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

2026-03-20 10:58:26 +02:00

dist

packaging: add systemctl command to dependencies

2026-03-05 14:52:36 +02:00

docs

doc: fix the unified installer instructions

2026-03-09 21:40:55 +02:00

ent

replicated_key_provider: make KSNAME public

2026-01-19 14:59:26 +00:00

exceptions

exceptions.hh: fix message argument passing

2025-08-13 13:39:52 +02:00

gms

tablets: Make balancing disabling call preempt tablet transitions

2026-02-02 21:25:11 +01:00

idl

direct_failure_detector: pass timeout to direct_fd_ping verb

2025-12-07 14:57:50 +00:00

index

vector_index: require tablets for vector indexes

2025-12-19 09:12:18 +02:00

keys

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

lang

treewide: Move type related files to a type directory As requested in #22110 , moved the files and fixed other includes and build system.

2025-09-17 17:32:19 +03:00

licenses

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

locator

load_sketch: Allow populating load_sketch with normalized current load

2026-01-09 18:42:03 +01:00

message

direct_failure_detector: run direct failure detector in the gossiper scheduling group

2025-12-09 17:19:31 +02:00

mutation

compaction: move code to namespace compaction

2025-09-25 15:03:56 +03:00

mutation_writer

replica: Fix split compaction when tablet boundaries change

2025-09-07 05:20:23 -03:00

node_ops

storage_service: change node_ops_info::ignore_nodes to host id

2025-09-15 10:18:24 +02:00

pgo

Update pgo profiles - aarch64

2026-03-15 04:44:34 +02:00

query

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

raft

raft: refactor can_vote logic and type

2025-09-24 13:55:05 +02:00

readers

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

reloc

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

repair

service: pass topology guard to RBNO

2026-02-02 17:37:37 +01:00

replica

distributed_loader: system_replicated_keys as system keyspace

2026-01-19 14:59:26 +00:00

rust

Revert "build: add precompiled headers to CMakeLists.txt"

2025-09-03 09:46:00 +03:00

schema

Merge 'service: assert that tables updated via group0 use schema commitlog' from Aleksandra Martyniuk

2026-03-17 17:29:36 +01:00

scripts

docs: expose alternator metrics

2025-08-22 09:49:52 +03:00

seastar @ 72df0a926d

Update seastar submodule (assorted fixes for S3 client update)

2026-02-03 11:41:30 +03:00

service

Merge 'service: assert that tables updated via group0 use schema commitlog' from Aleksandra Martyniuk

2026-03-17 17:29:36 +01:00

sstables

Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

2026-03-20 10:58:26 +02:00

streaming

repair: Fix sstable_list_to_mark_as_repaired with multishard writer

2026-01-19 09:39:49 +02:00

swagger-ui @ 12f1da1082

…

tasks

Merge 'compaction: handle exception in expected_total_workload' from Aleksandra Martyniuk

2025-09-17 15:10:19 +03:00

test

Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

2026-03-20 10:58:26 +02:00

tools

schema: Add initializer for compression defaults

2026-01-28 12:42:10 +02:00

tracing

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

transport

transport: fix connection code to consume only initially taken semaphore units

2026-02-19 16:23:47 +01:00

types

Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

2026-03-20 10:58:26 +02:00

unified

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

utils

Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

2026-03-20 10:58:26 +02:00

vector_search

vector_search: Fix missing timeout on TLS handshake

2026-02-13 21:24:06 +00:00

.clang-format

…

.dockerignore

…

.gitattributes

configure.py: prepare the build for a default PGO profile in version control

2024-12-27 16:16:04 +08:00

.gitignore

.gitignore: add rust target

2025-08-19 13:09:18 +03:00

.gitmodules

Update seastar submodule (iotune fixes for i7i/i4i)

2025-10-16 20:30:37 +03:00

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

absl-flat_hash_map.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

amplify.yml

…

backlog_controller.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

build_mode.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_fwd.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_ostream.hh

treewide: Replace __builtin_expect with (un)likely

2025-07-03 13:34:04 +03:00

bytes.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes.hh

bytes: adapt fmt_hex to std::span<const std::byte>

2025-04-01 00:07:27 +02:00

cartesian_product.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.hh

transport/server: use scheduling group assigned to current user

2025-01-02 07:13:34 +01:00

clocks-impl.cc

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

clocks-impl.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

CMakeLists.txt

configure: increase SCHEDULING_GROUPS_COUNT to 20

2025-09-30 12:41:28 +03:00

configure.py

utils: Introduce helper for replicated data structures

2025-11-23 21:05:33 +00:00

CONTRIBUTING.md

Fix typos

2025-02-11 00:17:43 +02:00

coverage_excludes.txt

…

coverage_sources.list

…

db_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

debug.cc

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

debug.hh

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

default.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

Doxyfile

…

encoding_stats.hh

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

enum_set.hh

auth: add possibilty to check for any permission in set

2025-10-30 10:13:16 +00:00

fix_system_distributed_tables.py

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

flake.lock

…

flake.nix

…

gc_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gdbinit

…

gen_segmented_compress_params.py

compress: move compress.cc/hh to sstables/compressor

2025-07-31 13:10:41 +03:00

HACKING.md

build: replace tools/java submodule with packaged cassandra-stress

2025-04-15 10:11:28 +03:00

hashing_partition_visitor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

idl-compiler.py

idl-compiler.py: generate skip() definition for enums serializers

2025-06-24 11:05:31 +03:00

inet_address_vectors.hh

storage_proxy: handle node_local_only in mutate

2025-07-24 19:48:08 +02:00

init.cc

view: Stop requiring experimental feature

2025-10-06 13:19:54 +00:00

init.hh

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

install-dependencies.sh

install-dependencies.sh: update node_exporter to 1.10.2

2025-11-12 13:04:39 +02:00

install.sh

install.sh: simplify check_usermode_support()

2025-02-24 11:29:30 +03:00

LICENSE-ScyllaDB-Source-Available.md

Fix typos

2025-02-13 01:54:08 +02:00

main.cc

schema: Add initializer for compression defaults

2026-01-28 12:42:10 +02:00

marshal_exception.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_query.cc

schema: deinline some speculative_retry methods

2025-01-02 12:28:33 +01:00

mutation_query.hh

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

NOTICE.txt

PowerPC: remove ppc stuff

2025-07-08 10:38:23 +03:00

ORIGIN

…

partition_builder.hh

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

partition_range_compat.hh

treewide: Move misc files to utils directory

2025-07-21 11:56:40 +03:00

partition_slice_builder.cc

tree: Remove unused boost headers

2025-02-25 10:32:32 +03:00

partition_slice_builder.hh

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

partition_snapshot_reader.hh

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

querier.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

querier.hh

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

query_ranges_to_vnodes.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

query_ranges_to_vnodes.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

reader_concurrency_semaphore_group.cc

treewide: fix misspellings

2025-01-05 16:13:09 +02:00

reader_concurrency_semaphore_group.hh

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: improve handling of base resources

2026-01-21 06:37:38 +02:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: use named gate

2025-04-12 11:28:48 +03:00

reader_permit.hh

reader_permit: mark check_abort() as const

2025-02-07 01:32:35 -05:00

README.md

README: adjust to reflect license change

2025-01-30 10:28:32 +03:00

real_dirty_memory_accounter.hh

moved cache files to db

2025-02-04 12:21:31 +03:00

release.cc

release: adjust doc_link() for the post source-available world

2025-10-03 14:29:19 +00:00

release.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

reversibly_mergeable.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_upgrader.hh

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

scylla_post_install.sh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scylla-gdb.py

sstables: introduce ms sstable format version

2025-09-29 22:15:24 +02:00

SCYLLA-VERSION-GEN

Update ScyllaDB version to: 2025.4.6

2026-03-04 11:45:48 +02:00

seastarx.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serialization_visitors.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer_impl.hh

serializer_impl.hh: add as_input_stream(managed_bytes_view) overload

2025-05-13 10:32:32 +02:00

serializer.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer.hh

treewide: include boost headers as "system" headers

2025-08-22 17:21:24 +03:00

service_permit.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

shell.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sstable_dict_autotrainer.cc

storage_service: hold group0 gate in publish_new_sstable_dict

2025-07-28 12:42:37 +02:00

sstable_dict_autotrainer.hh

dict_autotrainer: introduce sstable_dict_autotrainer

2025-04-01 00:07:30 +02:00

sstables_loader.cc

streaming:: add more logging

2025-12-21 17:49:10 +02:00

sstables_loader.hh

streaming:: add more logging

2025-12-21 17:49:10 +02:00

supervisor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

table_helper.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

table_helper.hh

audit: Add the audit subsystem

2025-01-15 11:10:35 +01:00

test.py

vector_store_client_test: Relocate to a dedicated directory

2025-09-25 14:04:28 +02:00

timeout_config.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

timeout_config.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_extension.hh

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

tombstone_gc_options.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_options.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc-internals.hh

treewide: Add missing #pragma once

2025-09-01 14:58:21 +03:00

tombstone_gc.cc

tombstone_gc: don't use 'repair' mode for colocated tables

2025-11-26 08:36:52 +01:00

tombstone_gc.hh

tombstone_gc: don't use 'repair' mode for colocated tables

2025-11-26 08:36:52 +01:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

unimplemented.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

validation.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

validation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

version.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

view_info.hh

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

vint-serialization.cc

Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

2026-03-20 10:58:26 +02:00

vint-serialization.hh

Merge 'Fix bad performance for densely populated partition index pages' from Tomasz Grabiec

2026-03-20 10:58:26 +02:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring a very recent C++23 compiler and very recent versions of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start the Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode option is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.5%

Python 26.2%

CMake 0.4%

GAP 0.3%

Shell 0.3%