mirror of https://github.com/scylladb/scylladb.git synced 2026-06-02 13:06:57 +00:00

Nadav Har'El 5e8bdf6877 alternator: fix corruption of PutItem operation in case of contention

This patch fixes a bug noted in issue #7218, where PutItem operations
sometimes lost part of the item's data: some attributes were lost,
and the names of other attributes were replaced by empty strings. The problem
happened when the write-isolation policy was LWT and there was contention
between writes to the same partition (not necessarily the same item).

To use CAS (a.k.a. LWT), Alternator builds an alternator::rmw_operation
object with an apply() function which takes the old contents of the item
(if needed) and a timestamp, and builds a mutation that the CAS should
apply. In the case of the PutItem operation, we wrongly assumed that apply()
will be called only once - so as an optimization the strings saved in the
put_item_operation were moved into the returned mutation. But this
optimization is wrong - when there is contention, apply() may be called
again when the change proposed by the previous call was not accepted by
the Paxos protocol.
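
The failure mode described above can be illustrated with a minimal sketch. The types below (toy_mutation, buggy_put_item, apply_with_one_retry) are hypothetical stand-ins invented for this illustration and are not the actual Alternator classes; the sketch only assumes that apply() moves its saved strings into the result and that a rejected proposal leads to a second call.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the mutation that the CAS should apply.
struct toy_mutation {
    std::vector<std::pair<std::string, std::string>> cells;
};

// Hypothetical stand-in for the pre-fix put_item_operation:
// apply() is non-const and moves the saved strings into the mutation.
struct buggy_put_item {
    std::vector<std::pair<std::string, std::string>> attrs;

    toy_mutation apply() {
        toy_mutation m;
        for (auto& a : attrs) {
            // After this move, the saved name and value are in a
            // valid-but-unspecified state (in practice, usually empty).
            m.cells.push_back(std::move(a));
        }
        return m;
    }
};

// Simulates contention: the first proposal is rejected, so apply()
// is called a second time on the same operation object.
inline toy_mutation apply_with_one_retry(buggy_put_item& op) {
    assert(!op.attrs.empty());
    toy_mutation rejected = op.apply();
    (void)rejected;          // the rejected proposal is simply dropped
    return op.apply();       // built from already moved-from strings
}
```

In this sketch the mutation returned by apply_with_one_retry() still has one cell per attribute, but its strings come from moved-from objects, which mirrors the "attribute names replaced by empty strings" symptom reported in the issue.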

The fix is to change the one place where put_item_operation *moved* strings
out of the saved operations into the mutations, to be a copy. But to prevent
this sort of bug from recurring in future code, this patch enlists the
compiler to help us verify that it can't happen: The apply() function is
marked "const" - it can use the information in the operation to build the
mutation, but it can never modify this information or move things out of it,
so it will be fine to call this function twice.

The single output field that apply() does write (_return_attributes) is
marked "mutable" to allow the const apply() to write to it anyway. Because
apply() might be called twice, it is important that if some apply()
implementation sometimes sets _return_attributes, then it must always
set it (even if to the default, empty, value) on every call to apply().
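
A minimal sketch of the const apply() and mutable output field pattern described above follows. All names here (toy_rmw_operation, toy_put_item, check_repeatable) are hypothetical and only loosely modelled on the description in this message; the real alternator::rmw_operation interface is different and richer.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the mutation that the CAS should apply.
struct toy_mutation {
    std::vector<std::pair<std::string, std::string>> cells;
    std::int64_t timestamp = 0;
};

// Hypothetical base class: apply() is const, so it may read the saved
// operation but cannot modify it or move data out of it.
class toy_rmw_operation {
public:
    virtual ~toy_rmw_operation() = default;
    virtual toy_mutation apply(const std::string* previous_item,
                               std::int64_t ts) const = 0;
    const std::string& return_attributes() const { return _return_attributes; }
protected:
    // The single output field; "mutable" lets the const apply() write it.
    mutable std::string _return_attributes;
};

class toy_put_item : public toy_rmw_operation {
    std::vector<std::pair<std::string, std::string>> _attrs;
    bool _return_old;
public:
    toy_put_item(std::vector<std::pair<std::string, std::string>> attrs,
                 bool return_old)
        : _attrs(std::move(attrs)), _return_old(return_old) {}

    toy_mutation apply(const std::string* previous_item,
                       std::int64_t ts) const override {
        // Set the output on every call, even if only to the empty default,
        // so a retry never sees a stale value from an earlier call.
        _return_attributes = (_return_old && previous_item)
                                 ? *previous_item : std::string();
        toy_mutation m;
        m.timestamp = ts;
        // Copy, not move. Inside a const member, _attrs is const, so even
        // a stray std::move(_attrs) could only copy and never empty it.
        m.cells = _attrs;
        return m;
    }
};

// Calls apply() twice, as happens under contention, and checks that
// both calls build the same cells.
inline void check_repeatable(const toy_rmw_operation& op) {
    toy_mutation a = op.apply(nullptr, 1);
    toy_mutation b = op.apply(nullptr, 2);
    assert(a.cells == b.cells);
}
```

Because the saved attributes are never modified, calling apply() any number of times yields the same cells; only the timestamp and the explicitly reset _return_attributes differ between calls.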

The const apply() means that the compiler verifies for us that I didn't
forget to fix additional wrong std::move()s. Additionally, a test I wrote
to easily reproduce issue #7218 (which I will submit as a dtest later)
passes after this fix.

Fixes #7218.

Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20200916064906.333420-1-nyh@scylladb.com>

2020-09-16 10:30:19 +02:00

.github

Additional entries in CODEOWNERS

2020-08-04 21:03:23 +03:00

abseil @ 2069dc796a

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

alternator

alternator: fix corruption of PutItem operation in case of contention

2020-09-16 10:30:19 +02:00

api

Merge "Free compaction from storage service" from Pavel E

2020-08-23 17:58:32 +03:00

auth

auth: Inline standard_role_manager_name into only use

2020-08-26 11:33:23 +03:00

cdc

cdc: Add setter for delta mode

2020-09-07 14:14:04 +00:00

conf

transport: Allow user to disable unencrypted native transport

2020-08-11 13:15:17 +03:00

cql3

roles: drop checks for roles schema support

2020-09-14 12:17:26 +02:00

data

data/cell: don't overshoot target allocation sizes

2020-09-14 14:21:46 +03:00

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

debug

…

dht

range_streamer: keep a const token_metadata&

2020-08-20 16:20:34 +03:00

dist

scylla_setup: drop hugepages package installation

2020-09-14 17:05:09 +03:00

docs

redis: add hgetall and hdel commands

2020-09-08 11:59:52 +03:00

exceptions

cql3: avoid using shared_ptr's in unrecognized_entity_exception

2020-05-06 19:02:36 +03:00

gms

gms: add comments for deprecated features

2020-09-14 12:59:19 +02:00

idl

Merge "Get rid of seed concept in gossip" from Asias

2020-08-17 09:50:51 +03:00

imr

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

index

Merge 'Replace operator_type with an enum' from Dejan

2020-08-18 13:45:20 +03:00

interface

thrift: switch csharp backend to netstd

2020-06-23 19:40:18 +03:00

libdeflate @ e7e54eab42

…

licenses

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

locator

Merge 'Remove _pending_ranges and _pending_ranges_map in token_metadata' from Asias

2020-09-15 17:16:35 +03:00

message

messaging_service: Unglobal messaging service instance

2020-08-19 20:50:53 +03:00

mutation_writer

codebase wide: use try_emplace when appropriate

2020-08-16 14:41:09 +03:00

redis

redis: remove lambda in command_factory

2020-09-14 11:30:20 +03:00

reloc

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

repair

Merge 'Clean up old cluster features' from Piotr Sarna

2020-09-16 10:53:25 +03:00

scripts

scripts: pull_pr.sh: auto-close pull request after merge

2020-09-16 10:23:34 +02:00

seastar @ dc06cd1f0f

Update seastar submodule

2020-09-15 17:33:24 +03:00

service

Merge 'Clean up old cluster features' from Piotr Sarna

2020-09-16 10:53:25 +03:00

sstables

sstables: drop checks for non-compound range tombstones support

2020-09-14 12:09:51 +02:00

streaming

streaming: drop checks for RPC stream support

2020-09-14 12:18:13 +02:00

swagger-ui @ 12f1da1082

…

test

Merge "Some optimizations on cache entry lookup" from Pavel Emelyanov

2020-09-15 17:49:47 +02:00

thrift

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

tools

Update tools/jmx submodule

2020-09-16 10:30:33 +03:00

tracing

migration_manager: Remove db/schema_tables.hh inclustion into header

2020-07-17 17:54:43 +03:00

transport

utf8: Print invalid UTF-8 character position

2020-09-07 18:11:21 +03:00

types

cql3: pass column_specification via lw_shared_ptr

2020-04-27 12:47:42 +03:00

unified

Merge "Add unified tarball to build "dist" target" from Pekka

2020-09-14 11:29:28 +03:00

utils

Merge "Fix race in schema version recalculation leading to stale schema version in gossip" from Tomasz

2020-09-14 12:37:46 +03:00

.dockerignore

.dockerignore: add testlog

2020-02-07 08:59:39 +01:00

.gitattributes

…

.gitignore

.gitignore: add .vscode to the list

2020-07-30 16:35:06 +03:00

.gitmodules

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

.gitorderfile

…

absl-flat_hash_map.cc

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

absl-flat_hash_map.hh

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

atomic_cell_hash.hh

collection_mutation: easier (de)serialization of collection_mutation(s).

2019-10-25 10:42:58 +02:00

atomic_cell_or_collection.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

atomic_cell.cc

data/cell: don't overshoot target allocation sizes

2020-09-14 14:21:46 +03:00

atomic_cell.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

backlog_controller.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

bytes_ostream.hh

bytes_ostream: make it a FragmentRange

2019-12-02 10:10:31 +02:00

bytes.cc

mp_row_consumer: Provide hex-formatting wrapper for bytes_view

2020-08-26 20:44:11 +03:00

bytes.hh

utf8: Print invalid UTF-8 character position

2020-09-07 18:11:21 +03:00

cache_flat_mutation_reader.hh

code: Force formatting of pointer in .debug and .trace

2020-08-26 20:44:11 +03:00

cache_temperature.hh

…

caching_options.hh

treewide: replace libjsoncpp usage with rjson

2020-07-03 10:27:23 +02:00

canonical_mutation.cc

everywhere: Use uninitialized_string instead of sstring::initialized_later

2020-03-10 13:17:49 -07:00

canonical_mutation.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

cartesian_product.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

cell_locking.hh

mutation_partition: make static_row optional to reduce memory footprint

2019-10-15 15:42:05 +03:00

checked-file-impl.hh

treewide: replace calls to engine().some_api() with some_api()

2020-04-05 12:46:04 +03:00

clocks-impl.cc

clocks-impl: switch to thread-safe time conversion

2020-05-04 14:11:38 +03:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

clustering_interval_set.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

clustering_key_filter.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

clustering_ranges_walker.hh

…

CMakeLists.txt

CMakeLists.txt: Add abseil to include directories

2020-07-31 12:15:23 +02:00

collection_mutation.cc

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

collection_mutation.hh

collection_mutation_view: add type-aware pretty printer

2020-01-07 12:06:29 +02:00

column_computation.hh

treewide: replace libjsoncpp usage with rjson

2020-07-03 10:27:23 +02:00

combine.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

compaction_garbage_collector.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

compaction_strategy_type.hh

compaction_strategy: add method to reshape SSTables

2020-06-18 09:37:18 -04:00

compaction_strategy.hh

distributed_loader: reshard before the node is made online

2020-06-18 09:37:18 -04:00

compatible_ring_position.hh

Introduce compatible_ring_position and compatible_ring_position_or_view

2019-06-23 16:29:12 +03:00

compound_compat.hh

bytes: compare_unsigned: do not pass nullptr to memcmp

2020-07-09 17:54:46 +03:00

compound.hh

compound_type: implement validate()

2020-05-07 16:19:56 +03:00

compress.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

compress.hh

…

concrete_types.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

configure.py

configure.py: Build unified tarball as part of "dist" target

2020-09-11 12:38:47 +03:00

connection_notifier.cc

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

connection_notifier.hh

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

CONTRIBUTING.md

Fix a link to contributor-agreement in the CONTRIBUTING page

2020-05-17 14:15:49 +03:00

converting_mutation_partition_applier.cc

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

converting_mutation_partition_applier.hh

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

counters.cc

…

counters.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

cql_serialization_format.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

database_fwd.hh

…

database.cc

db, schema: Hide update_schema_version_and_announce()

2020-09-11 14:42:48 +02:00

database.hh

db, schema: Hide update_schema_version_and_announce()

2020-09-11 14:42:48 +02:00

db_clock.hh

clocks: add printing functions

2020-01-30 11:10:08 +01:00

debug.hh

…

digest_algorithm.hh

digest: add null values to row digest

2020-09-10 13:16:44 +02:00

digester.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

dirty_memory_manager.hh

commitlog+region_group: timeout exceptions with names

2019-12-03 19:07:19 +01:00

distributed_loader.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

distributed_loader.hh

distributed_loader: remove declaration of inexistent do_populate_column_family()

2020-06-29 14:23:42 -03:00

Doxyfile

…

duration.cc

duration: adjust for C++20 char8_t type

2020-05-12 20:40:30 +02:00

duration.hh

…

encoding_stats.hh

encoding_stats.hh: add missing include

2019-05-14 13:27:30 +03:00

enum_set.hh

lwt: ensure enum_set::of is constexpr.

2019-10-01 19:45:56 +02:00

fix_system_distributed_tables.py

fix_system_distributed_tables.py: declare the 'port' argument as 'int'

2019-06-06 20:19:57 +03:00

flat_mutation_reader.cc

range_tombstone_list: Introduce and use pop-and-lock helper

2020-09-07 23:17:41 +03:00

flat_mutation_reader.hh

query: query_class_config: use max_result_size for the max_memory_for_unlimited_query field

2020-07-28 18:00:29 +03:00

frozen_mutation.cc

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

frozen_mutation.hh

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

frozen_schema.cc

…

frozen_schema.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

gc_clock.hh

gc_clock, serialization: define new serialization for gc_clock::duration (aka TTLs)

2019-10-23 18:36:33 +03:00

gen_segmented_compress_params.py

…

HACKING.md

README: better explanation of dependencies and build

2020-06-16 13:26:04 +02:00

hashers.cc

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

hashers.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

hashing_partition_visitor.hh

…

hashing.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

idl-compiler.py

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

init.cc

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

init.hh

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

install-dependencies.sh

Add support passing python3 dependencies from main repo to scylla-python3 script

2020-09-08 23:39:34 +03:00

install.sh

unified/install.sh: set default python3/sysconfdir smartly

2020-08-31 15:54:51 +03:00

interval.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

intrusive_set_external_comparator.hh

…

keys.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

keys.hh

partition_key_view: add validate method

2020-05-12 12:07:00 +03:00

LICENSE.AGPL

…

lister.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

lister.hh

Update seastar submodule

2020-08-19 17:18:57 +03:00

log.hh

…

lua.cc

utf8: Print invalid UTF-8 character position

2020-09-07 18:11:21 +03:00

lua.hh

lua: Handle nil returns correctly

2020-01-29 14:05:01 -08:00

main.cc

Storage proxy: add a dedicated smp group for hints

2020-09-07 15:46:12 +03:00

map_difference.hh

…

marshal_exception.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable-sstable.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable.cc

memtable: Switch onto B+ rails

2020-07-14 16:30:02 +03:00

memtable.hh

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

multishard_mutation_query.cc

multishard_mutation_query: fix a typo in variable name

2020-08-09 12:52:40 +03:00

multishard_mutation_query.hh

storage_proxy: use read_command::max_result_size to pass max result size around

2020-07-28 18:00:29 +03:00

mutation_cleaner.hh

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

mutation_compactor.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_fragment.cc

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

mutation_fragment.hh

clustering_row: Do not re-implement deletable_row

2020-09-08 22:21:15 +03:00

mutation_partition_serializer.cc

sstables: drop checks for correct counter order support

2020-09-14 12:05:11 +02:00

mutation_partition_serializer.hh

…

mutation_partition_view.cc

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

mutation_partition_view.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

mutation_partition_visitor.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

mutation_partition.cc

mutation_partition: use proper hasher in row hashing

2020-09-14 14:17:36 +03:00

mutation_partition.hh

mutation_partition: Fix typo

2020-09-15 10:09:15 +02:00

mutation_query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_query.hh

reconcilable_result_builder: don't aggrevate out-of-memory condition during recovery

2020-09-15 19:53:05 +02:00

mutation_reader.cc

mutation_reader: make_combined_reader(): return empty reader when combining 0 readers

2020-08-22 20:47:49 +03:00

mutation_reader.hh

Merge "Don't depend on seastar::make_(lw_)?shared idiosyncrasies" from Rafael

2020-08-02 19:51:24 +03:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

Add mutation_source_metadata

2019-06-26 15:45:59 +03:00

mutation.cc

mutation: Improve log print of mutations

2020-09-04 16:33:25 +02:00

mutation.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

noexcept_traits.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

NOTICE.txt

tests: port Cassandra CQL tests to cql repl

2020-03-26 15:19:38 +02:00

ORIGIN

…

partition_builder.hh

collection_mutation: generalize constructor of collection_mutation to abstract_type.

2019-10-25 10:42:58 +02:00

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

partition_slice_builder: add with_option()

2020-07-28 18:00:29 +03:00

partition_snapshot_reader.hh

partition_snapshot_reader: Do not fill buffer in constructor

2020-09-14 12:18:03 +02:00

partition_snapshot_row_cursor.hh

deletable_row: Do not mess with clustering_row

2020-09-08 22:18:15 +03:00

partition_version_list.hh

…

partition_version.cc

partition_version: Remove dead code

2020-09-01 10:19:47 +03:00

partition_version.hh

partition_version: Remove dead code

2020-09-01 10:19:47 +03:00

position_in_partition.hh

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

querier.cc

querier: move common stuff into querier_base

2020-06-03 18:45:33 +03:00

querier.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query_class_config.hh

query: query_class_config: use max_result_size for the max_memory_for_unlimited_query field

2020-07-28 18:00:29 +03:00

query_result_merger.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-request.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-reader.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-set.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-set.hh

mutation_partition: Debloat header form others

2020-03-18 11:53:36 +02:00

query-result-writer.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

range_tombstone_list.cc

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

range_tombstone_list.hh

range_tombstone_list: Do not expose internal collection

2020-09-07 23:17:41 +03:00

range_tombstone.cc

…

range_tombstone.hh

…

range.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

read_context.hh

row_cache: pass a valid permit to underlying read

2020-05-28 11:34:35 +03:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: make inactive read handles unique across semaphores

2020-07-23 16:43:33 +03:00

reader_concurrency_semaphore.hh

Merge "messaging: make verb handler registering independent of current scheduling group" from Botond

2020-07-27 13:56:52 +03:00

reader_permit.hh

reader_permit: reader_resources: add operator- and operator+

2020-07-20 11:23:39 +03:00

README.md

Improve build documentation

2020-09-07 10:51:31 +03:00

real_dirty_memory_accounter.hh

…

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

row_cache: Save one key compare on direct hit

2020-09-03 21:13:21 +03:00

row_cache.hh

row_cache: Kill incomplete_tag

2020-09-03 21:13:21 +03:00

schema_builder.hh

schema: Pass an rvalue to set_compaction_strategy_options

2020-08-19 14:02:35 -07:00

schema_fwd.hh

collection_type_impl::mutation: compact_and_expire() add collector parameter

2019-07-15 17:37:55 +03:00

schema_mutations.cc

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_mutations.hh

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_registry.cc

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_registry.hh

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_upgrader.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

schema.cc

Merge "materialized views: Fix undefined behavior on base table schema changes" from Tomasz

2020-08-26 17:37:52 +03:00

schema.hh

lwt: introduce paxos_grace_seconds per-table option to set paxos ttl

2020-08-17 16:44:14 +02:00

scylla_post_install.sh

scylla_post_install.sh: generate memory.conf for CentOS7

2020-07-29 14:10:16 +03:00

scylla-gdb.py

scylla-gdb.py: histogram: don't use shared default argument

2020-09-15 10:09:15 +02:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN: skip updating version files when git hash unchanged

2020-02-06 18:36:46 +02:00

seastarx.hh

Everywhere: Explicitly instantiate make_shared

2020-07-21 10:33:49 -07:00

serialization_visitors.hh

…

serializer_impl.hh

repair: Switch to btree_set for repair_hash.

2020-07-09 11:35:18 +03:00

serializer.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

service_permit.hh

Everywhere: Explicitly instantiate make_lw_shared

2020-07-21 10:33:49 -07:00

setup.py

…

supervisor.hh

supervisor: drop unused Upstart code, always use libsystemd

2020-06-10 08:17:35 +03:00

table_helper.cc

everywhere: Replace engine().cpu_id() with this_shard_id()

2020-03-27 11:40:03 +03:00

table_helper.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

table.cc

Merge "Update log messages to {fmt} rules" from Pavel E

2020-09-03 15:10:09 +03:00

test.py

Use detect_stack_use_after_return=1

2020-08-04 11:00:09 +03:00

timeout_config.cc

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timeout_config.hh

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timestamp.hh

add missing include to timestamp.hh

2020-02-05 19:42:18 +02:00

to_string.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

tombstone.hh

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

tox.ini

…

types.cc

types: time_point_to_string: prevent overflow of nanoseconds

2020-09-08 10:02:02 +03:00

types.hh

types, compound: pass std::current_exception() to on_internal_error()

2020-05-07 11:25:25 +02:00

unimplemented.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

unimplemented.hh

…

user_types_metadata.hh

user_types_metadata: don't implement enable_lw_shared_from_this

2019-12-11 10:44:40 -08:00

validation.cc

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

validation.hh

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

version.hh

…

view_info.hh

db: view: Refactor view_info::initialize_base_dependent_fields()

2020-08-20 14:53:07 +02:00

vint-serialization.cc

…

vint-serialization.hh

…

xx_hasher.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

zstd.cc

build: remove zstd submodule

2020-06-11 17:12:49 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring a very recent C++20 compiler and recent versions of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start the Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found in ./docs and on the wiki. There is currently no clear definition of what goes where, so when looking for something be sure to check both. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of open-source ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.2%

Python 26.6%

CMake 0.3%

GAP 0.3%

Shell 0.3%