mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 18:50:53 +00:00

Go to file

Tomasz Grabiec 6863a5e43b row_cache: Avoid generating overlapping range tombstones

Row cache reader can produce overlapping range tombstones in the
mutation fragment stream even if there is only a single range
tombstone in sstables, due to #2581. For every range between two rows,
the row cache reader queries for tombstones relevant for that
range. The result of the query is trimmed to the current position of
the reader (=position of the previous row) to satisfy key
monotonicity. The end position of range tombstones is left
unchanged. So cache reader will split a single range tombstone around
rows. Those range tombstones are transient, they will be only
materialized in the reader's stream, they are not persisted anywhere.

That is not a problem in itself, but it interacts badly with mutation
compactor due to #8625. The range_tombstone_accumulator which is used
to compact the mutation fragment stream needs to accumulate all
tombstones which are relevant for the current clustering position in
the stream. Adding a new range tombstone is O(N) in the number of
currently active tombstones. This means that producing N rows will be
O(N^2).

In a unit test, I saw reading 137'248 rows which overlap with a range
tombstone take 245 seconds. Almost all of CPU time is in
drop_unneeded_tombstones().

The solution is to make the cache reader trim range tombstone end to
the currently emited sub-range, so that it emits non-overlapping range
tombstones.

Fixes #8626.

2021-05-12 00:10:24 +02:00

.github

docs: added multiversion_regex_builder

2021-01-13 11:07:29 +02:00

abseil @ 9c6a50fdd8

Update abseil submodule

2021-02-08 15:41:46 +02:00

alternator

migration_manager: allow table updates with timestamp

2021-05-10 10:10:38 +02:00

api

storage_proxy, treewide: introduce names for vectors of inet_address

2021-05-05 18:36:48 +03:00

auth

auth: Add service_level resource for supporting in authorization of cql service_level

2021-04-12 16:01:04 +02:00

cdc

Merge "Untie cdc, storage service and migration notifier knot" from Pavel E

2021-05-11 18:39:10 +03:00

conf

config: relax batch size warning and failure thresholds

2021-04-06 20:56:06 +03:00

cql3

Merge 'Fix index name conflicts with regular tables' from Piotr Sarna

2021-05-11 18:40:15 +03:00

Merge 'Add per-service-level timeouts' from Piotr Sarna

2021-05-11 18:39:10 +03:00

debug

…

dht

Merge 'token_metadata: Fix get_all_endpoints to return nodes in the ring' from Asias He

2021-05-11 18:39:10 +03:00

dist

scylla_io_setup: configure "aio-max-nr" before iotune

2021-05-11 18:39:10 +03:00

docs

docs: add a paragraph describing service level timeouts

2021-05-10 12:39:41 +02:00

exceptions

cql: fix error return from execution of fromJson() and other functions

2021-01-21 15:21:13 +01:00

gms

gossip: Relax failure detector update

2021-04-14 13:16:00 +02:00

idl

Merge 'Switch to use NODE_OPS_CMD for decommission and bootstrap operation' from Asias He

2021-05-06 17:28:19 +03:00

index

secondary index: fix index name in IndexInfo system table

2021-05-11 18:39:10 +03:00

interface

thrift: switch csharp backend to netstd

2020-06-23 19:40:18 +03:00

libdeflate @ e7e54eab42

…

licenses

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

locator

Merge 'token_metadata: Fix get_all_endpoints to return nodes in the ring' from Asias He

2021-05-11 18:39:10 +03:00

message

storage_proxy, treewide: introduce names for vectors of inet_address

2021-05-05 18:36:48 +03:00

mutation_writer

mutation_writer: multishard_writer: close readers when done

2021-04-25 11:35:07 +03:00

raft

raft: document that add entry my throw commit_status_unknown

2021-05-06 11:59:36 +03:00

redis

treewide: propagate service level to client state

2021-05-10 11:48:14 +02:00

reloc

reloc: Remove "build_reloc.sh" script as obsolete

2020-11-20 22:41:26 +02:00

repair

repair: Wire off-strategy compaction for decommission

2021-05-11 18:39:10 +03:00

scripts

scripts: introduce coverage.py

2021-05-07 15:54:49 +03:00

seastar @ 847fccaf5e

Update seastar submodule

2021-05-04 09:12:49 +03:00

service

Merge 'Add per-service-level timeouts' from Piotr Sarna

2021-05-11 18:39:10 +03:00

sstables

sstables: disambiguate boost::find

2021-05-10 11:48:14 +02:00

streaming

Merge "Close flat mutation readers" from Benny

2021-04-25 13:53:11 +03:00

swagger-ui @ 12f1da1082

…

test

Merge 'Fix index name conflicts with regular tables' from Piotr Sarna

2021-05-11 18:40:15 +03:00

thrift

treewide: propagate service level to client state

2021-05-10 11:48:14 +02:00

tools

Update tools/jmx submodule (toppartitions multi-sampler query)

2021-05-11 18:39:10 +03:00

tracing

treewide: remove inclusions of storage_proxy.hh from headers

2021-04-20 21:23:00 +03:00

transport

transport: add updating per-service-level params

2021-05-10 12:39:41 +02:00

types

treewide: make headers self-sufficient

2021-04-20 21:23:00 +03:00

unified

unified: abort install when non-bash shell detected

2021-04-15 11:59:41 +02:00

utils

utils/enum_option.hh: make it easier to compare the value

2021-05-11 18:39:10 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

docs: added theme

2020-12-03 17:37:18 +01:00

.gitmodules

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

.gitorderfile

…

absl-flat_hash_map.cc

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

absl-flat_hash_map.hh

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

atomic_cell_hash.hh

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

atomic_cell_or_collection.hh

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

atomic_cell.cc

atomic_cell: fix operator<< for atomic_cell_or_collection

2021-02-22 14:45:34 +02:00

atomic_cell.hh

atomic_cell: get rid of is_value_fragments

2021-05-09 11:08:53 +03:00

backlog_controller.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

bytes_ostream.hh

bytes_ostream: convert write_placeholder from enable_if to concepts

2021-03-22 12:00:07 +01:00

bytes.cc

mp_row_consumer: Provide hex-formatting wrapper for bytes_view

2020-08-26 20:44:11 +03:00

bytes.hh

bytes: implement std::hash using appending_hash

2021-01-08 13:17:46 +01:00

cache_flat_mutation_reader.hh

row_cache: Avoid generating overlapping range tombstones

2021-05-12 00:10:24 +02:00

cache_temperature.hh

…

caching_options.cc

caching_options.hh: move code to .cc

2021-04-05 13:05:43 +03:00

caching_options.hh

caching_options.hh: move code to .cc

2021-04-05 13:05:43 +03:00

canonical_mutation.cc

canonical_mutation: make the data type non-contiguous

2021-02-15 10:24:47 +01:00

canonical_mutation.hh

canonical_mutation: make the data type non-contiguous

2021-02-15 10:24:47 +01:00

cartesian_product.hh

cartesian_product: Remove std::iterator from iterator

2020-11-17 16:53:20 +01:00

cell_locking.hh

…

checked-file-impl.hh

files: Construct file_impls properly

2021-03-26 00:22:11 +01:00

clocks-impl.cc

clocks-impl: switch to thread-safe time conversion

2020-05-04 14:11:38 +03:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

clustering_bounds_comparator: do not depend on implicit conversion of keys to bytes_view

2020-12-20 15:14:44 +01:00

clustering_interval_set.hh

clustering_interval_set: Remove std::iterator from position_range_iterator

2020-11-17 16:53:20 +01:00

clustering_key_filter.hh

…

clustering_ranges_walker.hh

clustering_range_walker: fix false discontiguity detected after a static row

2021-02-01 19:32:07 +02:00

CMakeLists.txt

caching_options.hh: move code to .cc

2021-04-05 13:05:43 +03:00

collection_mutation.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

collection_mutation.hh

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

column_computation.hh

Reduce dependency on header utils/rjson.hh

2021-04-25 13:20:51 +03:00

combine.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

compaction_garbage_collector.hh

…

compaction_strategy_type.hh

compaction_strategy: add method to reshape SSTables

2020-06-18 09:37:18 -04:00

compaction_strategy.hh

distributed_loader: reshard before the node is made online

2020-06-18 09:37:18 -04:00

compatible_ring_position.hh

dht: ring_position, decorated_key: convert tri_comparators to std::strong_ordering

2021-03-18 12:40:05 +02:00

compound_compat.hh

composite: replace enable_if with constraints

2021-04-04 13:56:51 +03:00

compound.hh

Merge 'sstables: remove large allocations when parsing cells' from Wojciech Mitros

2021-04-22 15:38:10 +02:00

compress.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

compress.hh

…

concrete_types.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

configure.py

configure.py: replace --coverage with a coverage build mode

2021-05-07 15:23:31 +03:00

connection_notifier.cc

treewide: remove inclusions of storage_proxy.hh from headers

2021-04-20 21:23:00 +03:00

connection_notifier.hh

code: Use qctx::evecute_cql methods, not global ones

2020-11-19 18:39:05 +03:00

CONTRIBUTING.md

CONTRIBUTING.md: add the requirement for self-contained headers

2021-05-05 15:10:46 +03:00

converting_mutation_partition_applier.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

converting_mutation_partition_applier.hh

…

counters.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

counters.hh

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

cql_serialization_format.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

database_fwd.hh

…

database.cc

database: check for conflicting table names for indexes

2021-05-11 15:20:59 +02:00

database.hh

database: add get_unlimited_query_max_result_size()

2021-05-05 13:30:42 +03:00

db_clock.hh

…

debug.hh

…

default.nix

build: add nix-shell support

2021-04-14 13:15:59 +02:00

digest_algorithm.hh

digest: add null values to row digest

2020-09-10 13:16:44 +02:00

digester.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

dirty_memory_manager.hh

…

distributed_loader.cc

sys_dist_ks: new keyspace for system tables with Everywhere strategy

2021-04-19 11:22:57 +03:00

distributed_loader.hh

distributed_loader: Add get_sstables_from_upload_dir

2021-01-16 20:03:17 +08:00

Doxyfile

…

duration.cc

duration: adjust for C++20 char8_t type

2020-05-12 20:40:30 +02:00

duration.hh

…

encoding_stats.hh

…

enum_set.hh

…

fix_system_distributed_tables.py

tracing: add username to the session table

2020-10-01 04:46:40 +02:00

flat_mutation_reader.cc

flat_mutation_reader: abort if not closed before destroyed

2021-04-25 11:35:07 +03:00

flat_mutation_reader.hh

flat_mutation_reader: consume_mutation_fragments_until: maybe yield after each popped mutation_fragment

2021-05-03 14:06:26 +03:00

frozen_mutation.cc

flat_mutation_reader: make sure to close flat_mutation_reader_from_mutations

2021-04-25 11:25:47 +03:00

frozen_mutation.hh

Merge "lwt: store column_mapping's for each table schema version upon a DDL change" from Pavel Solodovnikov

2020-10-15 20:48:29 +02:00

frozen_schema.cc

frozen_schema: order idl implementations correctly

2020-10-03 19:56:28 +03:00

frozen_schema.hh

…

gc_clock.hh

…

gen_segmented_compress_params.py

…

generic_server.cc

generic_server: Rename "maybe_idle" to "maybe_stop"

2021-04-13 14:13:24 +03:00

generic_server.hh

generic_server: Rename "maybe_idle" to "maybe_stop"

2021-04-13 14:13:24 +03:00

HACKING.md

Merge "Improve coverage support" from Botond

2021-05-11 18:39:10 +03:00

hashers.cc

hashers: convert illegal contraint to static_assert

2020-09-21 16:32:10 +03:00

hashers.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

hashing_partition_visitor.hh

…

hashing.hh

hashing: appending_hash: convert from enable_if to concepts

2021-03-17 09:59:22 +02:00

idl-compiler.py

idl-compiler: allow fields of type utils::chunked_vector

2021-01-13 04:09:18 +01:00

inet_address_vectors.hh

storage_proxy, treewide: use utils::small_vector inet_address_vector:s

2021-05-05 18:36:54 +03:00

init.cc

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

init.hh

cross-tree: reduce dependency on db/config.hh and database.hh

2021-05-05 13:23:00 +03:00

install-dependencies.sh

build: drop lld from install-dependencies.sh on s390x

2021-04-12 09:46:33 +03:00

install.sh

unified: abort install when non-bash shell detected

2021-04-15 11:59:41 +02:00

interval.hh

interval: support C++20 three-way comparisons

2021-02-28 21:03:25 +02:00

intrusive_set_external_comparator.hh

…

keys.cc

keys: convert trichotomic comparators to return std::strong_ordering

2021-03-21 09:30:43 +02:00

keys.hh

compound: add explode_fragmented

2021-04-08 10:02:54 +02:00

LICENSE.AGPL

…

lister.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

lister.hh

Update seastar submodule

2020-08-19 17:18:57 +03:00

log.hh

…

lua.cc

cross-tree: reduce dependency on db/config.hh and database.hh

2021-05-05 13:23:00 +03:00

lua.hh

cross-tree: reduce dependency on db/config.hh and database.hh

2021-05-05 13:23:00 +03:00

main.cc

storage_service: Remove migration notifier dependency

2021-04-29 22:47:13 +03:00

map_difference.hh

…

marshal_exception.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable-sstable.hh

table: Add write_memtable_to_sstable variant which accepts flat_mutation_reader

2021-01-04 16:23:00 -03:00

memtable.cc

memtable: flush_reader: make sure to close partition reader

2021-04-25 11:35:07 +03:00

memtable.hh

memtable: Track min timestamp

2021-01-04 13:24:43 -03:00

multishard_mutation_query.cc

multishard_mutation_query: read_context::stop: properly close unregistered inactive_reads

2021-04-25 11:35:07 +03:00

multishard_mutation_query.hh

multishard_mutation_query: add query_data_on_all_shards()

2021-03-02 07:53:53 +02:00

mutation_cleaner.hh

…

mutation_compactor.hh

mutation compactor: query compaction: ignore purgeable tombstones

2021-01-22 15:27:48 +02:00

mutation_consumer_concepts.hh

flat_mutation_reader: move mutation consumer concepts to separate header

2021-01-22 15:27:48 +02:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: add token validation level

2021-03-01 07:49:23 +02:00

mutation_fragment.cc

range_tombstone_stream: Remove unused methods

2021-03-16 12:08:18 +03:00

mutation_fragment.hh

clustering_row: Add new .apply() overload

2021-04-09 10:05:47 +03:00

mutation_partition_serializer.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

mutation_partition_serializer.hh

…

mutation_partition_view.cc

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

mutation_partition_view.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

mutation_partition_visitor.hh

…

mutation_partition.cc

mutation_partition: counter_write_query: close reader when done

2021-04-25 11:35:07 +03:00

mutation_partition.hh

code: Relax position_in_partition::tri_compare users

2021-04-09 18:20:39 +03:00

mutation_query.cc

mutation_query: move to_data_query_result() to mutation_partition.cc

2021-01-22 15:27:48 +02:00

mutation_query.hh

mutation_query: remove now unused mutation_query()

2021-04-09 13:40:27 +03:00

mutation_reader.cc

mutation_reader: shard_reader: get rid of stop

2021-04-25 11:35:07 +03:00

mutation_reader.hh

mutation_reader: reader_lifecycle_policy: return future from destroy_reader

2021-04-25 11:35:07 +03:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

…

mutation.cc

mutation: remove now unused query() and query_compacted()

2021-01-22 15:36:37 +02:00

mutation.hh

mutation: consume(): add reverse mode

2021-02-03 11:00:47 +02:00

noexcept_traits.hh

noexcept_traits: convert enable_if to concepts

2021-03-30 09:30:23 +02:00

NOTICE.txt

raft: etcd unit tests: initial boost tests

2021-01-18 12:33:12 -04:00

ORIGIN

…

partition_builder.hh

partition_builder: accept_row(): use append_clustering_row()

2020-12-02 15:08:49 +02:00

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

partition_slice_builder: add with_option()

2020-07-28 18:00:29 +03:00

partition_snapshot_reader.hh

flat_mutation_reader: require close

2021-04-25 11:35:07 +03:00

partition_snapshot_row_cursor.hh

partition_snapshot_row_cursor: Rewrite row() with consume_row()

2021-04-09 12:18:29 +03:00

partition_version_list.hh

…

partition_version.cc

misc: fix indentation

2021-01-08 14:16:08 +01:00

partition_version.hh

row_cache: Zap dummy entries when populating or reading a range

2021-03-01 20:34:35 +02:00

position_in_partition.hh

position_in_partition: Convert tri_compare to strong_ordering

2021-04-09 18:20:39 +03:00

querier.cc

querier_cache: implement stop

2021-04-25 11:35:07 +03:00

querier.hh

querier_cache: implement stop

2021-04-25 11:35:07 +03:00

query_class_config.hh

query_class_config: add operator== for max_result_size

2021-05-05 11:20:22 +03:00

query_result_merger.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-request.hh

query: partition_slice: add range_scan_data_variant option

2021-03-02 07:53:53 +02:00

query-result-reader.hh

query-result-reader: order idl implementations correctly

2020-10-03 19:56:29 +03:00

query-result-set.cc

treewide: use query_mutations() instead of mutation::query()

2021-01-22 15:36:37 +02:00

query-result-set.hh

mutation_partition: Debloat header form others

2020-03-18 11:53:36 +02:00

query-result-writer.hh

mutation_partition: mark query_result_builder constructor noexcept

2021-04-25 11:35:07 +03:00

query-result.hh

result_memory_accounter: abort unpaged queries hitting the global limit

2021-02-26 23:43:16 +02:00

query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

range_tombstone_list.cc

range_tombstone_list: Add new slice() helper

2021-03-16 11:55:28 +03:00

range_tombstone_list.hh

range_tombstone_list: Add new slice() helper

2021-03-16 11:55:28 +03:00

range_tombstone.cc

range_tombstone_accumulator: Avoid update_current_tombstone() when nothing changed

2021-05-12 00:10:24 +02:00

range_tombstone.hh

row_cache: Avoid generating overlapping range tombstones

2021-05-12 00:10:24 +02:00

range.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

read_context.hh

row_cache: read_context: add close method

2021-04-25 11:35:07 +03:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: dump_reader_diagnostics(): print more information in the header

2021-05-10 10:15:47 +03:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: dump_reader_diagnostics(): cap number of printed lines

2021-05-10 10:15:47 +03:00

reader_permit.hh

reader_permit: always forward resources

2021-04-26 15:56:56 +03:00

README.md

docs: fix invalid path in README.mds

2021-02-21 13:49:12 +02:00

real_dirty_memory_accounter.hh

…

release.cc

scylla: Add "--build-mode" command line option

2021-01-20 16:07:29 +02:00

release.hh

scylla: Add "--build-mode" command line option

2021-01-20 16:07:29 +02:00

reversibly_mergeable.hh

…

row_cache.cc

row_cache: hold read_context as unique_ptr

2021-04-25 11:35:07 +03:00

row_cache.hh

row_cache: hold read_context as unique_ptr

2021-04-25 11:35:07 +03:00

schema_builder.hh

schema_tables: put schema tables on shard 0

2021-01-28 13:28:22 +02:00

schema_fwd.hh

…

schema_mutations.cc

uuid: reduce code dependency on UUID_gen.hh

2021-01-27 20:08:29 +02:00

schema_mutations.hh

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_registry.cc

global_schema_ptr: add support for view's base table

2021-03-07 12:50:42 +02:00

schema_registry.hh

global_schema_ptr: add support for view's base table

2021-03-07 12:50:42 +02:00

schema_upgrader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

schema.cc

Reduce dependency on header utils/rjson.hh

2021-04-25 13:20:51 +03:00

schema.hh

column_mapping_entry: extract == and != operators

2020-10-16 14:59:50 +02:00

scylla_post_install.sh

scylla_post_install.sh: generate memory.conf for CentOS7

2020-07-29 14:10:16 +03:00

scylla-gdb.py

scylla-gdb.py: "this" -> "self"

2021-05-11 18:39:10 +03:00

SCYLLA-VERSION-GEN

version: prepare for the 4.6 cycle

2021-04-01 20:40:52 +03:00

seastarx.hh

Everywhere: Explicitly instantiate make_shared

2020-07-21 10:33:49 -07:00

serialization_visitors.hh

…

serializer_impl.hh

serializer: replace enable_if in deserialized_bytes_proxy with constraint

2021-03-30 09:30:06 +02:00

serializer.cc

serializer: add serializer<lw_shared_ptr<T>> specialization

2021-01-29 01:58:46 +03:00

serializer.hh

serializer: implement FragmentedView for buffer_view

2020-11-27 15:26:13 +01:00

service_permit.hh

service_permit: add a getter for the number of units held

2021-03-29 11:34:18 +02:00

setup.py

…

shell.nix

build: add nix-shell support

2021-04-14 13:15:59 +02:00

supervisor.hh

supervisor: drop unused Upstart code, always use libsystemd

2020-06-10 08:17:35 +03:00

table_helper.cc

table_helper: Use query_processor::get_migration_manager()

2021-03-15 19:35:53 +03:00

table_helper.hh

table_helper: Require local query processor in calls

2020-10-06 15:44:20 +03:00

table.cc

cross-tree: reduce dependency on db/config.hh and database.hh

2021-05-05 13:23:00 +03:00

test.py

test.py: refine test mode control

2021-05-11 18:39:10 +03:00

timeout_config.cc

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timeout_config.hh

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timestamp.hh

…

to_string.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

tombstone.hh

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

tox.ini

…

types.cc

cdc: log: rewrite collection merge to use managed_bytes instead of bytes

2021-04-08 10:16:21 +02:00

types.hh

types: add a missing translation for cql_duration

2021-05-10 11:04:39 +02:00

ubsan-suppressions.supp

suppress ubsan error in boost::deque::clear()

2020-11-09 11:25:19 +02:00

unimplemented.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

unimplemented.hh

…

user_types_metadata.hh

…

validation.cc

validation: Remove get_local_storage_proxy call

2020-12-11 18:52:42 +03:00

validation.hh

validation: Remove get_local_storage_proxy call

2020-12-11 18:52:42 +03:00

version.hh

…

view_info.hh

db: view: Refactor view_info::initialize_base_dependent_fields()

2020-08-20 14:53:07 +02:00

vint-serialization.cc

…

vint-serialization.hh

vint-serialization: Reference the correct spec

2021-01-05 18:54:09 +02:00

xx_hasher.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

zstd.cc

build: remove zstd submodule

2020-06-11 17:12:49 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%