mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 06:53:12 +00:00

Go to file

Avi Kivity 5285ccbb12 Merge 'Add prune ghost rows statement' from Piotr Sarna

This series is split from another, bigger RFC series which provides
manual remedies to deal with inconsistencies between the base table
and its views. This part deals with ghost rows by providing a statement
which fetches view rows from a given range, then reads its corresponding
rows from the base table (cl=ALL), and finally removes rows which were
not present in the base table at all, qualifying them as ghost rows.
Motivations for introducing such a statement:
 * in case of detected inconsistencies, it can be used to fix
   materialized views without recreating them from scratch, which can
   take days and generates lots of throughput
 * a tool which periodically scrubs a materialized view can be easily
   created on top of this statement, especially that it's possible
   to remove ghost rows from a user-defined view token range;

This series comes with a unit test.

The reason for digging up this series is because it's still possible to end up with ghost rows in certain rather improbable scenarios, and we lack a way of fixing them without rebuilding the whole view. For instance, in case of a failed synchronous update to a local view, the user will be notified that the query failed, but a ghost row can be created nonetheless. The pruning statement introduced in this series would allow healing the failure locally, without rebuilding the whole view.

Tests: unit(dev)

Closes #10426

* github.com:scylladb/scylla:
  docs: add a paragraph on PRUNE MATERIALIZED VIEW statement
  service,test: add a test case for error during pruning
  tests: add ghost row deletion test case
  cql3: enable ghost row deletion via CQL
  cql3: add a statement for deleting ghost rows
  cql3: convert is_json statement parameter to enum
  pager: add ghost row deleting pager
  db,view: add delete ghost rows visitor

2022-05-19 17:21:35 +03:00

.github

docs: disable link checker

2022-05-09 12:45:28 +02:00

abseil @ f70eadadd7

…

alternator

gossiper, code: Relax get_up/down/all_counters() helpers

2022-05-06 10:34:48 +03:00

api

Update seastar submodule. Unfortunately, also requires two changes

2022-05-11 14:46:30 +02:00

auth

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

cdc

cql3: Remove relation class

2022-05-16 18:17:58 +02:00

compaction

Merge 'Reapply: "disable_auto_compaction: stop ongoing compactions"' from Eliran Sinvani

2022-05-18 18:33:12 +03:00

conf

db: config: add a flag to disable new parallelized aggregation algorithm

2022-02-01 21:26:25 +01:00

cql3

cql3: enable ghost row deletion via CQL

2022-05-19 10:11:50 +02:00

data_dictionary

data_dictionary: Introduce user types storage

2022-05-05 09:44:26 +03:00

Merge 'Add prune ghost rows statement' from Piotr Sarna

2022-05-19 17:21:35 +03:00

debug

…

dht

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

direct_failure_detector

direct_failure_detector: introduce new failure detector service

2022-05-09 13:14:40 +02:00

dist

dist/common/scripts: generate debug log when exception occurred

2022-05-17 13:18:27 +03:00

docs

docs: add a paragraph on PRUNE MATERIALIZED VIEW statement

2022-05-19 10:16:04 +02:00

exceptions

cql3: Remove relation class

2022-05-16 18:17:58 +02:00

gms

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

idl

tracing: Trace slow queries on replicas wrt. parent's clock

2022-02-10 12:03:53 +01:00

index

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

interface

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

lang

wasm: add wasm ABI version 2

2022-03-30 20:49:35 +02:00

libdeflate @ e7e54eab42

…

licenses

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

locator

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

message

messaging_service: abortable version of send_gossip_echo

2022-05-09 13:14:41 +02:00

mutation_writer

treewide: use system-#include (angle brackets) for seastar

2022-04-26 14:46:42 +03:00

raft

raft: actively search for a leader if it is not known for a tick duration

2022-04-25 14:51:22 +02:00

readers

Merge 'compaction: get rid of reader v1' from Benny Halevy

2022-05-01 19:29:10 +03:00

redis

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

reloc

…

repair

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

replica

Reapply: "disable_auto_compaction: stop ongoing compactions"

2022-05-18 14:57:10 +03:00

rust

tests: add rust example

2022-05-11 16:49:31 +02:00

scripts

Adjust scripts/pull_github_pr.sh to check tests status

2022-05-11 14:46:30 +02:00

seastar @ 96bb3a1b80

Update seastar submodule. Unfortunately, also requires two changes

2022-05-11 14:46:30 +02:00

service

Merge 'Add prune ghost rows statement' from Piotr Sarna

2022-05-19 17:21:35 +03:00

sstables

sstables/processing_result_generator.hh: refine check for coroutine standard

2022-05-19 11:31:40 +03:00

streaming

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

swagger-ui @ 12f1da1082

…

test

Merge 'Add prune ghost rows statement' from Piotr Sarna

2022-05-19 17:21:35 +03:00

thrift

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

tools

dist/common/scripts: generate debug log when exception occurred

2022-05-17 13:18:27 +03:00

tracing

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

transport

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

types

cql3: Add null and unset checks in collection validation

2022-05-18 11:05:14 +02:00

unified

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

utils

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

.dockerignore

…

.gitattributes

…

.gitignore

.gitignore: ignore mypy_cache, the python lint cache

2022-04-19 16:48:47 +03:00

.gitmodules

…

.gitorderfile

…

absl-flat_hash_map.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

absl-flat_hash_map.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell_hash.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell_or_collection.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell.cc

atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal

2022-03-07 11:05:30 +02:00

atomic_cell.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

backlog_controller.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes_ostream.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cache_flat_mutation_reader.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

cache_temperature.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

caching_options.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

caching_options.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

canonical_mutation.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

canonical_mutation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cartesian_product.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cell_locking.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

checked-file-impl.hh

treewide: use system-#include (angle brackets) for seastar

2022-04-26 14:46:42 +03:00

client_data.cc

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

client_data.hh

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

clocks-impl.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clocks-impl.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_bounds_comparator.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_interval_set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_key_filter.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_ranges_walker.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

CMakeLists.txt

cql3: Remove relation class

2022-05-16 18:17:58 +02:00

collection_mutation.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

collection_mutation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

column_computation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

combine.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compatible_ring_position.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compound_compat.hh

compound_compat.hh: add missing methods of iterator

2022-03-08 15:37:03 +02:00

compound.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compress.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compress.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

concrete_types.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

configure.py

Merge 'Add prune ghost rows statement' from Piotr Sarna

2022-05-19 17:21:35 +03:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

converting_mutation_partition_applier.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

counters.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

counters.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cql_serialization_format.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

db_clock.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

debug.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

default.nix

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

digest_algorithm.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

digester.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

dirty_memory_manager.hh

table: clear: serialize with ongoing flush

2022-04-25 18:57:07 +03:00

Doxyfile

…

duration.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

duration.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

encoding_stats.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

enum_set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

fix_system_distributed_tables.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

frozen_mutation.cc

frozen_mutation: add unfreeze_gently

2022-05-05 13:32:25 +03:00

frozen_mutation.hh

frozen_mutation: add consume_gently

2022-05-05 13:32:25 +03:00

frozen_schema.cc

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

frozen_schema.hh

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

gc_clock.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

gdbinit

docs: debugging.md: add a sample gdbinit file

2022-05-11 10:23:08 +03:00

gen_segmented_compress_params.py

treewide: clean up stray license blurbs

2022-02-13 14:16:16 +02:00

generic_server.cc

generic_server: Gentle iterator

2022-02-18 14:25:08 +03:00

generic_server.hh

generic_server.hh: add missing include

2022-04-04 17:31:55 +03:00

HACKING.md

docs: update theme 1.2.1

2022-04-03 13:45:07 +03:00

hashers.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashers.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashing_partition_visitor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashing.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

idl-compiler.py

treewide: clean up stray license blurbs

2022-02-13 14:16:16 +02:00

inet_address_vectors.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

init.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

init.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

install-dependencies.sh

dist/common/scripts: generate debug log when exception occurred

2022-05-17 13:18:27 +03:00

install.sh

docker: revert scylla-server.conf service name change

2022-04-03 19:18:18 +03:00

interval.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

intrusive_set_external_comparator.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

keys.cc

replica, partition_snapshot_reader, keys: replace boost::any with std::any

2022-04-28 07:18:53 +03:00

keys.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

LICENSE.AGPL

…

log.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

main.cc

service: raft: move group0 write path into a separate file

2022-05-19 17:21:35 +03:00

map_difference.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

marshal_exception.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

multishard_mutation_query.cc

multishard_mutation_query: do_query: stop ctx if lookup_readers fails

2022-04-26 11:11:52 +03:00

multishard_mutation_query.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_cleaner.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_compactor.hh

mutation_compactor: drop v1 related code-paths

2022-03-11 09:24:05 +02:00

mutation_consumer_concepts.hh

introduce the MutationConsumer concept

2022-02-28 17:11:54 +02:00

mutation_fragment_fwd.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: validate range tombstone changes

2022-03-29 13:19:05 +03:00

mutation_fragment_v2.hh

mutation_fragment_v2: range_tombstone_change: add minimal_memory_usage()

2022-04-28 14:11:51 +03:00

mutation_fragment.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_fragment.hh

treewide: use system-#include (angle brackets) for seastar

2022-04-26 14:46:42 +03:00

mutation_partition_serializer.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition_serializer.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition_view.cc

mutation_partition_view: do_accept_gently: keep clustering_row key on stack

2022-05-12 16:26:07 +03:00

mutation_partition_view.hh

mutation_partition_view: add accept_gently methods

2022-05-05 13:32:25 +03:00

mutation_partition_visitor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition.cc

mutation: add consume_gently

2022-05-05 13:32:25 +03:00

mutation_partition.hh

code: Convert is_same+result_of assertions into invocable concepts

2022-02-24 19:46:10 +03:00

mutation_query.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_query.hh

query: coroutinize to_data_query_result

2022-05-05 13:32:25 +03:00

mutation_rebuilder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_source_metadata.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation.cc

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation.hh

mutation: add consume_gently

2022-05-05 13:32:25 +03:00

noexcept_traits.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_range_compat.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_slice_builder.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_slice_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_snapshot_reader.hh

partition_snapshot_reader: convert implementation to native v2

2022-04-28 14:12:12 +03:00

partition_snapshot_row_cursor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_version_list.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_version.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_version.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

position_in_partition.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

protocol_server.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

querier.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

querier.hh

mutation_reader: move mutation source into readers/

2022-03-30 15:42:51 +03:00

query_class_config.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query_ranges_to_vnodes.cc

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_ranges_to_vnodes.hh

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_result_merger.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-request.hh

messaging_service: add verb for count(*) request forwarding

2022-02-01 21:14:41 +01:00

query-result-reader.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-result-set.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-result-set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-result-writer.hh

query_result_builder: remove v1 support

2022-03-11 09:24:17 +02:00

query-result.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query.cc

query: do not assert in operator<<(ostream&, const forward_result::printer&)

2022-03-09 14:58:11 +01:00

range_tombstone_assembler.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone_change_generator.hh

range_tombstone_change_generator: flush(): add end_of_range

2022-04-21 14:37:10 +03:00

range_tombstone_list.cc

range_tombstone_list: insert_from: correct rev.update range_tombstone in not overlapping case

2022-04-04 22:26:29 +02:00

range_tombstone_list.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone_splitter.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

read_context.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

reader_concurrency_semaphore.cc

treewide: fix compilation issues with fmtlib 8.1.0+

2022-03-16 12:31:50 +03:00

reader_concurrency_semaphore.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

reader_permit.hh

evicatble_reader: avoid preemption pitfall around waiting for readmission

2022-03-15 14:37:22 +02:00

README.md

…

real_dirty_memory_accounter.hh

memtable: move to replica module and namespace

2022-02-23 09:05:16 +02:00

release.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

release.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

reversibly_mergeable.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

row_cache.cc

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

row_cache.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

schema_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_fwd.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_mutations.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_mutations.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_registry.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_registry.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_upgrader.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

schema.cc

secondary index: avoid special characters in default index names

2022-03-20 18:33:48 +02:00

schema.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

scylla_post_install.sh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

scylla-gdb.py

Merge 'scylla-gdb.py: add commands to dump sstables summary and index-cache' from Botond Dénes

2022-05-19 17:21:35 +03:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN:set release-version value length

2022-02-21 13:28:04 +02:00

seastarx.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serialization_visitors.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serializer_impl.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serializer.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serializer.hh

code: Convert is_integral assertions to concepts

2022-02-24 19:44:29 +03:00

service_permit.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

setup.py

…

shell.nix

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

sstables_loader.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

sstables_loader.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

supervisor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

table_helper.cc

treewide: require group0_guard when performing schema changes

2022-01-24 15:20:35 +01:00

table_helper.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test.py

test.py: highlight the failure cause

2022-02-04 17:15:52 +03:00

timeout_config.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

timeout_config.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

timestamp.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

to_string.hh

to_string.hh: include <map>

2022-02-17 08:53:48 +02:00

tombstone_gc_extension.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc_options.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc_options.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc.cc

gms: feature_service: remove variable/helper function duplication

2022-05-04 18:59:56 +03:00

tombstone_gc.hh

Merge "tools: cut schema loader free of replica::database" from Botond

2022-03-27 17:01:05 +03:00

tombstone.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tox.ini

…

types.cc

Merge 'cql: Add proper validation for null and unset inside collections send as bound values' from Jan Ciołek

2022-05-19 11:25:24 +03:00

types.hh

Merge 'cql: Add proper validation for null and unset inside collections send as bound values' from Jan Ciołek

2022-05-19 11:25:24 +03:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

unimplemented.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

validation.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

validation.hh

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

version.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

view_info.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

vint-serialization.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

vint-serialization.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

xx_hasher.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

zstd.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%