mirror of https://github.com/scylladb/scylladb.git synced 2026-06-03 21:47:10 +00:00

Go to file

Avi Kivity fd1dd0eac7 Merge "Track the memory consumption of reader buffers" from Botond

"
The last major untracked area of the reader pipeline is the reader
buffers. These scale with the number of readers as well as with the size
and shape of data, so their memory consumption is unpredictable varies
wildly. For example many small rows will trigger larger buffers
allocated within the `circular_buffer<mutation_fragment>`, while few
larger rows will consume a lot of external memory.

This series covers this area by tracking the memory consumption of both
the buffer and its content. This is achieved by passing a tracking
allocator to `circular_buffer<mutation_fragment>` so that each
allocation it makes is tracked. Additionally, we now track the memory
consumption of each and every mutation fragment through its whole
lifetime. Initially I contemplated just tracking the `_buffer_size` of
`flat_mutation_reader::impl`, but concluded that as our reader trees are
typically quite deep, this would result in a lot of unnecessary
`signal()`/`consume()` calls, that scales with the number of mutation
fragments and hence adds to the already considerable per mutation
fragment overhead. The solution chosen in this series is to instead
track the memory consumption of the individual mutation fragments, with
the observation that these are typically always moved and very rarely
copied, so the number of `signal()`/`consume()` calls will be minimal.

This additional tracking introduces an interesting dilemma however:
readers will now have significant memory on their account even before
being admitted. So it may happen that they can prevent their own
admission via this memory consumption. To prevent this, memory
consumption is only forwarded to the semaphore upon admission. This
might be solved when the semaphore is moved to the front -- before the
cache.
Another consequence of this additional, more complete tracking is that
evictable readers now consume memory even when the underlying reader is
evicted. So it may happen that even though no reader is currently
admitted, all memory is consumed from the semaphore. To prevent any such
deadlocks, the semaphore now admits a reader unconditionally if no
reader is admitted -- that is if all count resources all available.

Refs: #4176

Tests: unit(dev, debug, release)
"

* 'track-reader-buffers/v2' of https://github.com/denesb/scylla: (37 commits)
  test/manual/sstable_scan_footprint_test: run test body in statement sched group
  test/manual/sstable_scan_footprint_test: move test main code into separate function
  test/manual/sstable_scan_footprint_test: sprinkle some thread::maybe_yield():s
  test/manual/sstable_scan_footprint_test: make clustering row size configurable
  test/manual/sstable_scan_footprint_test: document sstable related command line arguments
  mutation_fragment_test: add exception safety test for mutation_fragment::mutate_as_*()
  test: simple_schema: add make_static_row()
  reader_permit: reader_resources: add operator==
  mutation_fragment: memory_usage(): remove unused schema parameter
  mutation_fragment: track memory usage through the reader_permit
  reader_permit: resource_units: add permit() and resources() accessors
  mutation_fragment: add schema and permit
  partition_snapshot_row_cursor: row(): return clustering_row instead of mutation_fragment
  mutation_fragment: remove as_mutable_end_of_partition()
  mutation_fragment: s/as_mutable_partition_start/mutate_as_partition_start/
  mutation_fragment: s/as_mutable_range_tombstone/mutate_as_range_tombstone/
  mutation_fragment: s/as_mutable_clustering_row/mutate_as_clustering_row/
  mutation_fragment: s/as_mutable_static_row/mutation_as_static_row/
  flat_mutation_reader: make _buffer a tracked buffer
  mutation_reader: extract the two fill_buffer_result into a single one
  ...

2020-09-29 16:08:16 +03:00

.github

Additional entries in CODEOWNERS

2020-08-04 21:03:23 +03:00

abseil @ 2069dc796a

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

alternator

alternator streams: fix NextShardIterator for closed shard

2020-09-23 09:25:10 +02:00

api

Merge "Free compaction from storage service" from Pavel E

2020-08-23 17:58:32 +03:00

auth

auth: Inline standard_role_manager_name into only use

2020-08-26 11:33:23 +03:00

cdc

cdc: sprinkle parentheses in EntryContainer concept

2020-09-21 16:32:53 +03:00

conf

transport: Allow user to disable unencrypted native transport

2020-08-11 13:15:17 +03:00

cql3

cql3: select_statement: fix incorrect implicit conversion of bool_class to bool

2020-09-21 16:32:53 +03:00

data

data/cell: don't overshoot target allocation sizes

2020-09-14 14:21:46 +03:00

Merge "Track the memory consumption of reader buffers" from Botond

2020-09-29 16:08:16 +03:00

debug

…

dht

range_streamer: keep a const token_metadata&

2020-08-20 16:20:34 +03:00

dist

scylla_setup: skip offline warnings on nonroot mode

2020-09-29 13:30:13 +03:00

docs

docs/docker-hub.md: add quickstart section with --smp 1

2020-09-22 10:18:01 +02:00

exceptions

exceptions: make a single-param constructor explicit

2020-09-28 09:16:31 +02:00

gms

Merge "Gossip echo message improvement" from Asias

2020-09-24 15:13:55 +02:00

idl

Merge "Get rid of seed concept in gossip" from Asias

2020-08-17 09:50:51 +03:00

imr

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

index

flat_mutation_reader: impl: add reader_permit parameter

2020-09-28 10:53:48 +03:00

interface

thrift: switch csharp backend to netstd

2020-06-23 19:40:18 +03:00

libdeflate @ e7e54eab42

…

licenses

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

locator

token_metadata: set_pending_ranges: prep new interval_map out of line

2020-09-16 15:28:42 +03:00

message

Merge "Gossip echo message improvement" from Asias

2020-09-24 15:13:55 +02:00

mutation_writer

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

redis

redis: add hexists command

2020-09-21 12:32:33 +03:00

reloc

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

repair

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

scripts

scripts/refresh-submodules.sh: Add python3 submodule

2020-09-29 16:06:32 +03:00

seastar @ 292ba734bc

Update seastar submodule

2020-09-25 21:54:44 +03:00

service

gossip: Reduce unncessary VIEW_BACKLOG updates

2020-09-29 13:37:37 +03:00

sstables

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

streaming

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

swagger-ui @ 12f1da1082

…

test

Merge "Track the memory consumption of reader buffers" from Botond

2020-09-29 16:08:16 +03:00

thrift

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

tools

Update tools/jmx submodule

2020-09-29 13:32:45 +03:00

tracing

tracing: Fix error on slow batches

2020-09-29 13:24:39 +02:00

transport

transport: make max_concurrent_requests_per_shard reloadable

2020-09-29 10:11:36 +02:00

types

cql3: pass column_specification via lw_shared_ptr

2020-04-27 12:47:42 +03:00

unified

install.sh: stop using symlinks for systemd units on nonroot mode

2020-09-29 12:20:41 +03:00

utils

utils: utf8: avoid harmless integer overflow

2020-09-22 17:24:33 +03:00

.dockerignore

.dockerignore: add testlog

2020-02-07 08:59:39 +01:00

.gitattributes

…

.gitignore

.gitignore: add .vscode to the list

2020-07-30 16:35:06 +03:00

.gitmodules

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

.gitorderfile

…

absl-flat_hash_map.cc

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

absl-flat_hash_map.hh

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

atomic_cell_hash.hh

…

atomic_cell_or_collection.hh

…

atomic_cell.cc

data/cell: don't overshoot target allocation sizes

2020-09-14 14:21:46 +03:00

atomic_cell.hh

atomic_cell.hh: forward-declare atomic_cell_or_collection

2020-09-21 16:32:53 +03:00

backlog_controller.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

bytes_ostream.hh

…

bytes.cc

mp_row_consumer: Provide hex-formatting wrapper for bytes_view

2020-08-26 20:44:11 +03:00

bytes.hh

bytes: define contructor for fmt_hex

2020-09-21 16:32:53 +03:00

cache_flat_mutation_reader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

cache_temperature.hh

…

caching_options.hh

treewide: replace libjsoncpp usage with rjson

2020-07-03 10:27:23 +02:00

canonical_mutation.cc

everywhere: Use uninitialized_string instead of sstring::initialized_later

2020-03-10 13:17:49 -07:00

canonical_mutation.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

cartesian_product.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

cell_locking.hh

…

checked-file-impl.hh

treewide: replace calls to engine().some_api() with some_api()

2020-04-05 12:46:04 +03:00

clocks-impl.cc

clocks-impl: switch to thread-safe time conversion

2020-05-04 14:11:38 +03:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

clustering_interval_set.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

clustering_key_filter.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

clustering_ranges_walker.hh

…

CMakeLists.txt

CMakeLists.txt: Add abseil to include directories

2020-07-31 12:15:23 +02:00

collection_mutation.cc

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

collection_mutation.hh

collection_mutation_view: add type-aware pretty printer

2020-01-07 12:06:29 +02:00

column_computation.hh

treewide: replace libjsoncpp usage with rjson

2020-07-03 10:27:23 +02:00

combine.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

compaction_garbage_collector.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

compaction_strategy_type.hh

compaction_strategy: add method to reshape SSTables

2020-06-18 09:37:18 -04:00

compaction_strategy.hh

distributed_loader: reshard before the node is made online

2020-06-18 09:37:18 -04:00

compatible_ring_position.hh

…

compound_compat.hh

bytes: compare_unsigned: do not pass nullptr to memcmp

2020-07-09 17:54:46 +03:00

compound.hh

compound_type: implement validate()

2020-05-07 16:19:56 +03:00

compress.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

compress.hh

…

concrete_types.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

configure.py

configure.py: build python3, jmx, tools and unified-tar only in relevant dist-{mode}

2020-09-29 15:41:52 +03:00

connection_notifier.cc

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

connection_notifier.hh

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

CONTRIBUTING.md

Fix a link to contributor-agreement in the CONTRIBUTING page

2020-05-17 14:15:49 +03:00

converting_mutation_partition_applier.cc

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

converting_mutation_partition_applier.hh

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

counters.cc

counters: remove unused 1.7.4 counter order code

2020-09-29 12:16:58 +03:00

counters.hh

counters: remove unused 1.7.4 counter order code

2020-09-29 12:16:58 +03:00

cql_serialization_format.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

database_fwd.hh

…

database.cc

Merge 'sstables: make sstable_manager control the lifetime of the sstables it manages' from Avi Kivity

2020-09-24 13:54:38 +03:00

database.hh

flat_mutation_reader: impl: add reader_permit parameter

2020-09-28 10:53:48 +03:00

db_clock.hh

clocks: add printing functions

2020-01-30 11:10:08 +01:00

debug.hh

…

digest_algorithm.hh

digest: add null values to row digest

2020-09-10 13:16:44 +02:00

digester.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

dirty_memory_manager.hh

…

distributed_loader.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

distributed_loader.hh

distributed_loader: remove declaration of inexistent do_populate_column_family()

2020-06-29 14:23:42 -03:00

Doxyfile

…

duration.cc

duration: adjust for C++20 char8_t type

2020-05-12 20:40:30 +02:00

duration.hh

…

encoding_stats.hh

…

enum_set.hh

…

fix_system_distributed_tables.py

…

flat_mutation_reader.cc

mutation_fragment: memory_usage(): remove unused schema parameter

2020-09-28 11:27:47 +03:00

flat_mutation_reader.hh

mutation_fragment: memory_usage(): remove unused schema parameter

2020-09-28 11:27:47 +03:00

frozen_mutation.cc

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

frozen_mutation.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

frozen_schema.cc

…

frozen_schema.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

gc_clock.hh

…

gen_segmented_compress_params.py

…

HACKING.md

README: better explanation of dependencies and build

2020-06-16 13:26:04 +02:00

hashers.cc

hashers: convert illegal contraint to static_assert

2020-09-21 16:32:10 +03:00

hashers.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

hashing_partition_visitor.hh

…

hashing.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

idl-compiler.py

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

init.cc

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

init.hh

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

install-dependencies.sh

Add support passing python3 dependencies from main repo to scylla-python3 script

2020-09-08 23:39:34 +03:00

install.sh

install.sh: stop using symlinks for systemd units on nonroot mode

2020-09-29 12:20:41 +03:00

interval.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

intrusive_set_external_comparator.hh

…

keys.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

keys.hh

partition_key_view: add validate method

2020-05-12 12:07:00 +03:00

LICENSE.AGPL

…

lister.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

lister.hh

Update seastar submodule

2020-08-19 17:18:57 +03:00

log.hh

…

lua.cc

utf8: Print invalid UTF-8 character position

2020-09-07 18:11:21 +03:00

lua.hh

lua: Handle nil returns correctly

2020-01-29 14:05:01 -08:00

main.cc

sstables: remove background_jobs(), await_background_jobs()

2020-09-23 20:55:17 +03:00

map_difference.hh

…

marshal_exception.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable-sstable.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable.cc

flat_mutation_reader: impl: add reader_permit parameter

2020-09-28 10:53:48 +03:00

memtable.hh

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

multishard_mutation_query.cc

mutation_fragment: memory_usage(): remove unused schema parameter

2020-09-28 11:27:47 +03:00

multishard_mutation_query.hh

storage_proxy: use read_command::max_result_size to pass max result size around

2020-07-28 18:00:29 +03:00

mutation_cleaner.hh

…

mutation_compactor.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_fragment.cc

mutation_fragment: track memory usage through the reader_permit

2020-09-28 11:27:29 +03:00

mutation_fragment.hh

mutation_fragment: memory_usage(): remove unused schema parameter

2020-09-28 11:27:47 +03:00

mutation_partition_serializer.cc

sstables: drop checks for correct counter order support

2020-09-14 12:05:11 +02:00

mutation_partition_serializer.hh

…

mutation_partition_view.cc

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

mutation_partition_view.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

mutation_partition_visitor.hh

…

mutation_partition.cc

Merge "Unfriend rows_entry, cache_tracker and mutation_partition" from Pavel Emelyanov

2020-09-22 21:18:14 +02:00

mutation_partition.hh

Merge "Unfriend rows_entry, cache_tracker and mutation_partition" from Pavel Emelyanov

2020-09-22 21:18:14 +02:00

mutation_query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_query.hh

reconcilable_result_builder: don't aggrevate out-of-memory condition during recovery

2020-09-15 19:53:05 +02:00

mutation_reader.cc

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

mutation_reader.hh

flat_mutation_reader: make _buffer a tracked buffer

2020-09-28 10:53:56 +03:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

…

mutation.cc

mutation: Improve log print of mutations

2020-09-04 16:33:25 +02:00

mutation.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

noexcept_traits.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

NOTICE.txt

tests: port Cassandra CQL tests to cql repl

2020-03-26 15:19:38 +02:00

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

partition_slice_builder: add with_option()

2020-07-28 18:00:29 +03:00

partition_snapshot_reader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

partition_snapshot_row_cursor.hh

partition_snapshot_row_cursor: row(): return clustering_row instead of mutation_fragment

2020-09-28 10:53:56 +03:00

partition_version_list.hh

…

partition_version.cc

partition_version: Remove dead code

2020-09-01 10:19:47 +03:00

partition_version.hh

partition_version: Remove dead code

2020-09-01 10:19:47 +03:00

position_in_partition.hh

position_in_partition_view: add position_in_partition_view before_key() overload

2020-09-25 12:09:00 +03:00

querier.cc

querier: move common stuff into querier_base

2020-06-03 18:45:33 +03:00

querier.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query_class_config.hh

query: query_class_config: use max_result_size for the max_memory_for_unlimited_query field

2020-07-28 18:00:29 +03:00

query_result_merger.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-request.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-reader.hh

results-view: Abort early if messing with empty vector

2020-09-22 10:18:01 +02:00

query-result-set.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-set.hh

mutation_partition: Debloat header form others

2020-03-18 11:53:36 +02:00

query-result-writer.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

range_tombstone_list.cc

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

range_tombstone_list.hh

range_tombstone_list: Do not expose internal collection

2020-09-07 23:17:41 +03:00

range_tombstone.cc

…

range_tombstone.hh

…

range.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

read_context.hh

row_cache: pass a valid permit to underlying read

2020-05-28 11:34:35 +03:00

reader_concurrency_semaphore.cc

reader_permit: only forward resource consumption to semaphore after admission

2020-09-28 08:46:22 +03:00

reader_concurrency_semaphore.hh

reader_permit::resource_units: store permit instead of semaphore

2020-09-28 08:46:22 +03:00

reader_permit.hh

reader_permit: reader_resources: add operator==

2020-09-28 11:27:49 +03:00

README.md

Improve build documentation

2020-09-07 10:51:31 +03:00

real_dirty_memory_accounter.hh

…

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

flat_mutation_reader: impl: add reader_permit parameter

2020-09-28 10:53:48 +03:00

row_cache.hh

Merge "Unfriend rows_entry, cache_tracker and mutation_partition" from Pavel Emelyanov

2020-09-22 21:18:14 +02:00

schema_builder.hh

schema: Pass an rvalue to set_compaction_strategy_options

2020-08-19 14:02:35 -07:00

schema_fwd.hh

…

schema_mutations.cc

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_mutations.hh

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_registry.cc

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_registry.hh

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_upgrader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

schema.cc

Merge "materialized views: Fix undefined behavior on base table schema changes" from Tomasz

2020-08-26 17:37:52 +03:00

schema.hh

lwt: introduce paxos_grace_seconds per-table option to set paxos ttl

2020-08-17 16:44:14 +02:00

scylla_post_install.sh

scylla_post_install.sh: generate memory.conf for CentOS7

2020-07-29 14:10:16 +03:00

scylla-gdb.py

scylla-gdb.py: histogram: don't use shared default argument

2020-09-15 10:09:15 +02:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN: skip updating version files when git hash unchanged

2020-02-06 18:36:46 +02:00

seastarx.hh

Everywhere: Explicitly instantiate make_shared

2020-07-21 10:33:49 -07:00

serialization_visitors.hh

…

serializer_impl.hh

repair: Switch to btree_set for repair_hash.

2020-07-09 11:35:18 +03:00

serializer.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

service_permit.hh

Everywhere: Explicitly instantiate make_lw_shared

2020-07-21 10:33:49 -07:00

setup.py

…

supervisor.hh

supervisor: drop unused Upstart code, always use libsystemd

2020-06-10 08:17:35 +03:00

table_helper.cc

everywhere: Replace engine().cpu_id() with this_shard_id()

2020-03-27 11:40:03 +03:00

table_helper.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

table.cc

flat_mutation_reader: impl: add reader_permit parameter

2020-09-28 10:53:48 +03:00

test.py

test.py: Add "--list" option to show a list of tests

2020-09-16 16:02:48 +02:00

timeout_config.cc

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timeout_config.hh

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timestamp.hh

add missing include to timestamp.hh

2020-02-05 19:42:18 +02:00

to_string.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

tombstone.hh

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

tox.ini

…

types.cc

types: time_point_to_string: prevent overflow of nanoseconds

2020-09-08 10:02:02 +03:00

types.hh

types, compound: pass std::current_exception() to on_internal_error()

2020-05-07 11:25:25 +02:00

unimplemented.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

unimplemented.hh

…

user_types_metadata.hh

…

validation.cc

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

validation.hh

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

version.hh

…

view_info.hh

db: view: Refactor view_info::initialize_base_dependent_fields()

2020-08-20 14:53:07 +02:00

vint-serialization.cc

…

vint-serialization.hh

…

xx_hasher.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

zstd.cc

build: remove zstd submodule

2020-06-11 17:12:49 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found in ./docs and on the wiki. There is currently no clear definition of what goes where, so when looking for something be sure to check both. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%