mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Avi Kivity 86bbf1763d Merge "reader concurrency semaphore: dump permit diagnostics on timeout or queue overflow" from Botond

"
The reader concurrency semaphore timing out or its queue overflowing
are fairly common events both in production and in testing. At the same
time, it is a hard-to-diagnose problem that often has a benign cause
(especially during testing), but it is equally possible that it points
to something serious. So when this error starts to appear in logs,
we usually want to investigate, and the investigation is lengthy: it
involves looking at metrics, coredumps, or both.

This patch intends to jumpstart this process by dumping diagnostics on
semaphore timeout or queue overflow. The diagnostics are printed to the
log at debug level to avoid excessive spamming. They contain a
histogram of all the permits associated with the problematic semaphore,
organized by table, operation and state.

Example:

DEBUG 2020-10-08 17:05:26,115 [shard 0] reader_concurrency_semaphore -
Semaphore _read_concurrency_sem: timed out, dumping permit diagnostics:
Permits with state admitted, sorted by memory
memory  count   name
3499M   27      ks.test:data-query

3499M   27      total

Permits with state waiting, sorted by count
count   memory  name
1       0B      ks.test:drain
7650    0B      ks.test:data-query

7651    0B      total

Permits with state registered, sorted by count
count   memory  name

0       0B      total

Total: permits: 7678, memory: 3499M

This allows determining several things at a glance:
* What are the tables involved
* What are the operations involved
* Where is the memory

This can speed up a follow-up investigation greatly, or it can even be
enough on its own to determine that the issue is benign.

Tests: unit(dev, debug)
"

* 'dump-diagnostics-on-semaphore-timeout/v2' of https://github.com/denesb/scylla:
  reader_concurrency_semaphore: dump permit diagnostics on timeout or queue overflow
  utils: add to_hr_size()
  reader_concurrency_semaphore: link permits into an intrusive list
  reader_concurrency_semaphore: move expiry_handler::operator()() out-of-line
  reader_concurrency_semaphore: move constructors out-of-line
  reader_concurrency_semaphore: add state to permits
  reader_concurrency_semaphore: name permits
  querier_cache_test: test_immediate_evict_on_insert: use two permits
  multishard_combining_reader: reader_lifecycle_policy: add permit param to create_reader()
  multishard_combining_reader: add permit parameter
  multishard_combining_reader: shard_reader: use multishard reader's permit

2020-10-13 12:44:23 +03:00

.github

Additional entries in CODEOWNERS

2020-08-04 21:03:23 +03:00

abseil @ 2069dc796a

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

alternator

alternator::streams: Set dynamodb data TTL explicitly in cdc options

2020-10-07 08:43:39 +00:00

api

api/storage_service.cc: Add the get_range_to_endpoint_map

2020-10-08 12:09:09 +03:00

auth

auth: Inline standard_role_manager_name into only use

2020-08-26 11:33:23 +03:00

cdc

alternator::streams: Improve paging and fix parent-child calculation

2020-10-07 08:43:39 +00:00

conf

transport: Allow user to disable unencrypted native transport

2020-08-11 13:15:17 +03:00

cql3

cql3/statements/batch_statement.cc: improve batch size warning message

2020-10-13 09:02:51 +03:00

data

data/cell: fix value_writer use before definition

2020-10-12 13:41:09 +03:00

db

Merge "reader concurrency semaphore: dump permit diagnostics on timeout or queue overflow" from Botond

2020-10-13 12:44:23 +03:00

debug

…

dht

range_streamer: keep a const token_metadata&

2020-08-20 16:20:34 +03:00

dist

scylla_setup: skip iotune when developer_mode is enabled

2020-10-12 11:08:10 +03:00

docs

docs: fix typos in docs/alternator/alternator.md

2020-10-01 04:46:40 +02:00

exceptions

exceptions: make a single-param constructor explicit

2020-09-28 09:16:31 +02:00

gms

Merge "Gossip echo message improvement" from Asias

2020-09-24 15:13:55 +02:00

idl

Merge "Get rid of seed concept in gossip" from Asias

2020-08-17 09:50:51 +03:00

imr

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

index

flat_mutation_reader: impl: add reader_permit parameter

2020-09-28 10:53:48 +03:00

interface

thrift: switch csharp backend to netstd

2020-06-23 19:40:18 +03:00

libdeflate @ e7e54eab42

…

licenses

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

locator

token_metadata: get rid of unused calculate_pending_ranges_for_* methods

2020-09-30 23:16:23 +03:00

message

Merge "Gossip echo message improvement" from Asias

2020-09-24 15:13:55 +02:00

mutation_writer

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

raft

raft: declarative tests

2020-10-09 15:50:31 +02:00

redis

redis: pass request as a reference

2020-10-04 14:58:00 +03:00

reloc

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

repair

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

scripts

scripts/refresh-submodules.sh: Add python3 submodule

2020-09-29 16:06:32 +03:00

seastar @ 35c255dcd3

Update seastar submodule

2020-10-11 16:49:03 +03:00

service

main: Start tracing in main

2020-10-06 15:44:59 +03:00

sstables

Merge "reader concurrency semaphore: dump permit diagnostics on timeout or queue overflow" from Botond

2020-10-13 12:44:23 +03:00

streaming

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

swagger-ui @ 12f1da1082

…

test

Merge "reader concurrency semaphore: dump permit diagnostics on timeout or queue overflow" from Botond

2020-10-13 12:44:23 +03:00

thrift

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

tools

Update tools/jmx submodule

2020-10-08 12:09:24 +03:00

tracing

tracing: Keep qp anchor on backend

2020-10-06 15:45:19 +03:00

transport

transport: Delay NEW_NODE until CQL listen started

2020-10-07 09:57:27 +03:00

types

types: Work around a clang thread-local code generation bug (user_type)

2020-10-11 12:36:38 +03:00

unified

install.sh: stop using symlinks for systemd units on nonroot mode

2020-09-29 12:20:41 +03:00

utils

utils: add to_hr_size()

2020-10-13 12:32:14 +03:00

.dockerignore

.dockerignore: add testlog

2020-02-07 08:59:39 +01:00

.gitattributes

…

.gitignore

.gitignore: add .vscode to the list

2020-07-30 16:35:06 +03:00

.gitmodules

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

.gitorderfile

…

absl-flat_hash_map.cc

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

absl-flat_hash_map.hh

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

atomic_cell_hash.hh

collection_mutation: easier (de)serialization of collection_mutation(s).

2019-10-25 10:42:58 +02:00

atomic_cell_or_collection.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

atomic_cell.cc

data/cell: don't overshoot target allocation sizes

2020-09-14 14:21:46 +03:00

atomic_cell.hh

atomic_cell.hh: forward-declare atomic_cell_or_collection

2020-09-21 16:32:53 +03:00

backlog_controller.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

bytes_ostream.hh

bytes_ostream: make it a FragmentRange

2019-12-02 10:10:31 +02:00

bytes.cc

mp_row_consumer: Provide hex-formatting wrapper for bytes_view

2020-08-26 20:44:11 +03:00

bytes.hh

bytes: define contructor for fmt_hex

2020-09-21 16:32:53 +03:00

cache_flat_mutation_reader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

cache_temperature.hh

…

caching_options.hh

treewide: replace libjsoncpp usage with rjson

2020-07-03 10:27:23 +02:00

canonical_mutation.cc

everywhere: Use uninitialized_string instead of sstring::initialized_later

2020-03-10 13:17:49 -07:00

canonical_mutation.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

cartesian_product.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

cell_locking.hh

mutation_partition: make static_row optional to reduce memory footprint

2019-10-15 15:42:05 +03:00

checked-file-impl.hh

treewide: replace calls to engine().some_api() with some_api()

2020-04-05 12:46:04 +03:00

clocks-impl.cc

clocks-impl: switch to thread-safe time conversion

2020-05-04 14:11:38 +03:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

clustering_interval_set.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

clustering_key_filter.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

clustering_ranges_walker.hh

…

CMakeLists.txt

CMakeLists.txt: Add raft directory to source code directories

2020-10-01 19:38:39 +03:00

collection_mutation.cc

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

collection_mutation.hh

collection_mutation_view: add type-aware pretty printer

2020-01-07 12:06:29 +02:00

column_computation.hh

treewide: replace libjsoncpp usage with rjson

2020-07-03 10:27:23 +02:00

combine.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

compaction_garbage_collector.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

compaction_strategy_type.hh

compaction_strategy: add method to reshape SSTables

2020-06-18 09:37:18 -04:00

compaction_strategy.hh

distributed_loader: reshard before the node is made online

2020-06-18 09:37:18 -04:00

compatible_ring_position.hh

Introduce compatible_ring_position and compatible_ring_position_or_view

2019-06-23 16:29:12 +03:00

compound_compat.hh

bytes: compare_unsigned: do not pass nullptr to memcmp

2020-07-09 17:54:46 +03:00

compound.hh

compound_type: implement validate()

2020-05-07 16:19:56 +03:00

compress.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

compress.hh

…

concrete_types.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

configure.py

Merge "reader concurrency semaphore: dump permit diagnostics on timeout or queue overflow" from Botond

2020-10-13 12:44:23 +03:00

connection_notifier.cc

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

connection_notifier.hh

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

CONTRIBUTING.md

Fix a link to contributor-agreement in the CONTRIBUTING page

2020-05-17 14:15:49 +03:00

converting_mutation_partition_applier.cc

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

converting_mutation_partition_applier.hh

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

counters.cc

counters: remove unused 1.7.4 counter order code

2020-09-29 12:16:58 +03:00

counters.hh

counters: Avoid signed integer overflow

2020-10-05 20:04:09 +02:00

cql_serialization_format.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

database_fwd.hh

…

database.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

database.hh

sstable_directory: use a external load_semaphore

2020-10-08 11:57:06 +03:00

db_clock.hh

clocks: add printing functions

2020-01-30 11:10:08 +01:00

debug.hh

…

digest_algorithm.hh

digest: add null values to row digest

2020-09-10 13:16:44 +02:00

digester.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

dirty_memory_manager.hh

commitlog+region_group: timeout exceptions with names

2019-12-03 19:07:19 +01:00

distributed_loader.cc

sstable_directory: use a external load_semaphore

2020-10-08 11:57:06 +03:00

distributed_loader.hh

distributed_loader: remove declaration of inexistent do_populate_column_family()

2020-06-29 14:23:42 -03:00

Doxyfile

…

duration.cc

duration: adjust for C++20 char8_t type

2020-05-12 20:40:30 +02:00

duration.hh

…

encoding_stats.hh

…

enum_set.hh

lwt: ensure enum_set::of is constexpr.

2019-10-01 19:45:56 +02:00

fix_system_distributed_tables.py

tracing: add username to the session table

2020-10-01 04:46:40 +02:00

flat_mutation_reader.cc

flat_mutation_reader: de-virtualize buffer_size()

2020-10-06 08:22:56 +03:00

flat_mutation_reader.hh

flat_mutation_reader: de-virtualize buffer_size()

2020-10-06 08:22:56 +03:00

frozen_mutation.cc

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

frozen_mutation.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

frozen_schema.cc

frozen_schema: order idl implementations correctly

2020-10-03 19:56:28 +03:00

frozen_schema.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

gc_clock.hh

gc_clock, serialization: define new serialization for gc_clock::duration (aka TTLs)

2019-10-23 18:36:33 +03:00

gen_segmented_compress_params.py

…

HACKING.md

README: better explanation of dependencies and build

2020-06-16 13:26:04 +02:00

hashers.cc

hashers: convert illegal contraint to static_assert

2020-09-21 16:32:10 +03:00

hashers.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

hashing_partition_visitor.hh

…

hashing.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

idl-compiler.py

idl-compiler: generate views after serializers

2020-10-03 19:56:25 +03:00

init.cc

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

init.hh

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

install-dependencies.sh

Add support passing python3 dependencies from main repo to scylla-python3 script

2020-09-08 23:39:34 +03:00

install.sh

install.sh: set LC_ALL=en_US.UTF-8 on python3 thunk

2020-10-13 09:38:25 +03:00

interval.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

intrusive_set_external_comparator.hh

…

keys.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

keys.hh

partition_key_view: add validate method

2020-05-12 12:07:00 +03:00

LICENSE.AGPL

…

lister.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

lister.hh

Update seastar submodule

2020-08-19 17:18:57 +03:00

log.hh

…

lua.cc

lua: expect overflow when selecting lua types

2020-10-11 15:38:07 +03:00

lua.hh

lua: Handle nil returns correctly

2020-01-29 14:05:01 -08:00

main.cc

sstable_directory: use a external load_semaphore

2020-10-08 11:57:06 +03:00

map_difference.hh

…

marshal_exception.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable-sstable.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

memtable.hh

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

multishard_mutation_query.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

multishard_mutation_query.hh

storage_proxy: use read_command::max_result_size to pass max result size around

2020-07-28 18:00:29 +03:00

mutation_cleaner.hh

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

mutation_compactor.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_fragment.cc

mutation_fragment: track memory usage through the reader_permit

2020-09-28 11:27:29 +03:00

mutation_fragment.hh

mutation_fragment: memory_usage(): remove unused schema parameter

2020-09-28 11:27:47 +03:00

mutation_partition_serializer.cc

sstables: drop checks for correct counter order support

2020-09-14 12:05:11 +02:00

mutation_partition_serializer.hh

…

mutation_partition_view.cc

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

mutation_partition_view.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

mutation_partition_visitor.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

mutation_partition.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

mutation_partition.hh

Merge "Unfriend rows_entry, cache_tracker and mutation_partition" from Pavel Emelyanov

2020-09-22 21:18:14 +02:00

mutation_query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_query.hh

reconcilable_result_builder: don't aggrevate out-of-memory condition during recovery

2020-09-15 19:53:05 +02:00

mutation_reader.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

mutation_reader.hh

multishard_combining_reader: reader_lifecycle_policy: add permit param to create_reader()

2020-10-12 15:56:56 +03:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

Add mutation_source_metadata

2019-06-26 15:45:59 +03:00

mutation.cc

mutation: Improve log print of mutations

2020-09-04 16:33:25 +02:00

mutation.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

noexcept_traits.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

NOTICE.txt

tests: port Cassandra CQL tests to cql repl

2020-03-26 15:19:38 +02:00

ORIGIN

…

partition_builder.hh

collection_mutation: generalize constructor of collection_mutation to abstract_type.

2019-10-25 10:42:58 +02:00

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

partition_slice_builder: add with_option()

2020-07-28 18:00:29 +03:00

partition_snapshot_reader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

partition_snapshot_row_cursor.hh

partition_snapshot_row_cursor: row(): return clustering_row instead of mutation_fragment

2020-09-28 10:53:56 +03:00

partition_version_list.hh

…

partition_version.cc

partition_version: Remove dead code

2020-09-01 10:19:47 +03:00

partition_version.hh

partition_version: Remove dead code

2020-09-01 10:19:47 +03:00

position_in_partition.hh

position_in_partition_view: add position_in_partition_view before_key() overload

2020-09-25 12:09:00 +03:00

querier.cc

querier: move common stuff into querier_base

2020-06-03 18:45:33 +03:00

querier.hh

querier_cache: use the reader permit for memory accounting

2020-10-06 08:22:56 +03:00

query_class_config.hh

query: query_class_config: use max_result_size for the max_memory_for_unlimited_query field

2020-07-28 18:00:29 +03:00

query_result_merger.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-request.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-reader.hh

query-result-reader: order idl implementations correctly

2020-10-03 19:56:29 +03:00

query-result-set.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-set.hh

mutation_partition: Debloat header form others

2020-03-18 11:53:36 +02:00

query-result-writer.hh

query-result-writer: fix idl definition order related failures with clang

2020-10-11 17:57:12 +03:00

query-result.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

range_tombstone_list.cc

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

range_tombstone_list.hh

range_tombstone_list: Do not expose internal collection

2020-09-07 23:17:41 +03:00

range_tombstone.cc

…

range_tombstone.hh

…

range.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

read_context.hh

row_cache: pass a valid permit to underlying read

2020-05-28 11:34:35 +03:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: dump permit diagnostics on timeout or queue overflow

2020-10-13 12:32:14 +03:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: dump permit diagnostics on timeout or queue overflow

2020-10-13 12:32:14 +03:00

reader_permit.hh

reader_concurrency_semaphore: dump permit diagnostics on timeout or queue overflow

2020-10-13 12:32:14 +03:00

README.md

Improve build documentation

2020-09-07 10:51:31 +03:00

real_dirty_memory_accounter.hh

…

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

flat_mutation_reader: de-virtualize buffer_size()

2020-10-06 08:22:56 +03:00

row_cache.hh

Merge "Unfriend rows_entry, cache_tracker and mutation_partition" from Pavel Emelyanov

2020-09-22 21:18:14 +02:00

schema_builder.hh

schema: Pass an rvalue to set_compaction_strategy_options

2020-08-19 14:02:35 -07:00

schema_fwd.hh

collection_type_impl::mutation: compact_and_expire() add collector parameter

2019-07-15 17:37:55 +03:00

schema_mutations.cc

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_mutations.hh

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_registry.cc

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_registry.hh

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_upgrader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

schema.cc

Merge "materialized views: Fix undefined behavior on base table schema changes" from Tomasz

2020-08-26 17:37:52 +03:00

schema.hh

lwt: introduce paxos_grace_seconds per-table option to set paxos ttl

2020-08-17 16:44:14 +02:00

scylla_post_install.sh

scylla_post_install.sh: generate memory.conf for CentOS7

2020-07-29 14:10:16 +03:00

scylla-gdb.py

scylla-gdb.py: histogram: don't use shared default argument

2020-09-15 10:09:15 +02:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN: skip updating version files when git hash unchanged

2020-02-06 18:36:46 +02:00

seastarx.hh

Everywhere: Explicitly instantiate make_shared

2020-07-21 10:33:49 -07:00

serialization_visitors.hh

…

serializer_impl.hh

repair: Switch to btree_set for repair_hash.

2020-07-09 11:35:18 +03:00

serializer.hh

serializer.hh: remove unneeded semicolon after function definition

2020-10-11 22:12:04 +03:00

service_permit.hh

Everywhere: Explicitly instantiate make_lw_shared

2020-07-21 10:33:49 -07:00

setup.py

…

supervisor.hh

supervisor: drop unused Upstart code, always use libsystemd

2020-06-10 08:17:35 +03:00

table_helper.cc

table_helper: Require local query processor in calls

2020-10-06 15:44:20 +03:00

table_helper.hh

table_helper: Require local query processor in calls

2020-10-06 15:44:20 +03:00

table.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

test.py

test: suppress ubsan true-positive on rapidjson

2020-10-07 19:27:49 +03:00

timeout_config.cc

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timeout_config.hh

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timestamp.hh

add missing include to timestamp.hh

2020-02-05 19:42:18 +02:00

to_string.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

tombstone.hh

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

tox.ini

…

types.cc

types: don't linearize ascii during validation

2020-10-12 13:15:24 +03:00

types.hh

types: validate(): linearize values lazily

2020-10-07 11:00:18 +03:00

ubsan-suppressions.supp

test: suppress ubsan true-positive on rapidjson

2020-10-07 19:27:49 +03:00

unimplemented.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

unimplemented.hh

…

user_types_metadata.hh

user_types_metadata: don't implement enable_lw_shared_from_this

2019-12-11 10:44:40 -08:00

validation.cc

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

validation.hh

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

version.hh

…

view_info.hh

db: view: Refactor view_info::initialize_base_dependent_fields()

2020-08-20 14:53:07 +02:00

vint-serialization.cc

…

vint-serialization.hh

…

xx_hasher.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

zstd.cc

build: remove zstd submodule

2020-06-11 17:12:49 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

* Developer documentation for more information on building Scylla.
* Build documentation on how to build Scylla binaries, tests, and packages.
* Docker image build documentation for information on how to build Docker images.

Running Scylla

To start the Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode option is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found in ./docs and on the wiki. There is currently no clear definition of what goes where, so when looking for something be sure to check both. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

* The users mailing list and Slack channel are for users to discuss configuration, management, and operations of open-source ScyllaDB.
* The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%