mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 06:53:12 +00:00

Go to file

Nadav Har'El 8df2ea3f95 cql: don't crash when creating a view during a truncate

The test dtest materialized_views_test.py::TestMaterializedViews::
test_mv_populating_from_existing_data_during_truncate reproduces an
assertion failure, and crash, while doing a CREATE MATERIALIZED VIEW
during a TRUNCATE operation.

This patch fixes the crash by removing the assert() call for a view
(replacing it by a warning message) - we'll explain below why this is fine.
Also for base tables change we change the assertion to an on_internal_error
(Refs #7871).
This makes the test stop crashing Scylla, but it still fails due to
issue #17635.

Let's explain the crash, and the fix:

The test starts TRUNCATE on table that doesn't yet have a view.
truncate_table_on_all_shards() begins by disabling compaction on
the table and all its views (of which there are none, at this
point). At this point, the test creates a new view is on this table.
The new view has, by default, compaction enabled. Later, TRUNCATE
calls discard_sstables() on this new view, asserts that it has
compaction disabled - and this assertion fails.

The fix in this patch is to not do the assert() for views. In other words,
we acknowledge that in this use case, the view *will* have compactions
enabled while being truncated. I claim that this is "good enough", if we
remember *why* we disable compaction in the first place: It's important
to disable compaction while truncating because truncating during compaction
can lead us to data resurection when the old sstable is deleted during
truncation but the result of the compaction is written back. True,
this can now happen in a new view (a view created *DURING* the
truncation). But I claim that worse things can happen for this
new view: Notably, we may truncate a view and then the ongoing
view building (which happens in a new view) might copy data from
the base to the view and only then truncate the base - ending up
with an empty base and non-empty view. This problem - issue #17635 -
is more likely, and more serious, than the compaction problem, so
will need to be solved in a separate patch.

Fixes #17543.

Signed-off-by: Nadav Har'El <nyh@scylladb.com>

Closes scylladb/scylladb#17634

2024-03-20 08:54:39 +02:00

.github

[action] Sync labels from an Issue to linked PR

2024-03-19 09:17:07 +02:00

alternator

alternator: Use summary for shard-level latencies.

2024-03-11 11:12:08 +02:00

api

Merge 'Fix node replace with tablets for RF=N' from Tomasz Grabiec

2024-03-18 16:16:08 +02:00

auth

test: auth: add tests for lost quorum and command splitting

2024-03-01 16:25:14 +01:00

bin

tools: add cqlsh shortcut

2023-07-12 09:36:59 +03:00

cdc

cdc: s/string_view/std::string_view/

2024-02-22 13:49:19 +02:00

cmake

Merge 'build: cmake: put server deb packages under build/dist/$<CONFIG>/debian' from Kefu Chai

2024-03-18 16:18:35 +02:00

compaction

compaction: Check for key presence in memtable when calculating max purgeable timestamp

2024-03-18 13:37:44 +02:00

conf

Merge 'Add maintenance socket' from Mikołaj Grzebieluch

2023-12-20 19:04:40 +02:00

cql3

cql3: Remove unused cf_name::operator<<

2024-03-08 15:14:52 +02:00

data_dictionary

data_dictionary: do not include unused headers

2024-03-15 21:17:11 +03:00

sstables_manager: decouple from system_keyspace

2024-03-18 20:38:07 +03:00

debug

…

dht

interval: rename nonwrapping_interval to interval

2024-02-21 19:43:17 +02:00

direct_failure_detector

…

dist

build: cmake: add dist-* targets to the default build target

2024-03-18 20:02:43 +08:00

docs

doc: describe upgrade and recovery for raft topology

2024-03-19 14:59:14 +01:00

exceptions

exceptions: do not include unused headers

2024-02-06 13:16:03 +02:00

gms

gossiper: failure detector: don't handle directly removed live endpoints

2024-03-14 13:29:17 +01:00

idl

storage_service: add support for auth-v2 raft snapshots

2024-03-01 16:25:14 +01:00

index

Merge 'scylla-sstable: add support for loading schema of views and indexes' from Botond Dénes

2024-01-24 23:36:54 +02:00

interface

Typos: fix typos in comments

2023-12-02 22:37:22 +02:00

lang

lang: do not include unused headers

2024-02-07 09:27:39 +02:00

licenses

…

locator

tablets, raft topology: Rebuild tablets after replacing node is normal

2024-03-15 13:20:08 +01:00

message

storage_service: add support for auth-v2 raft snapshots

2024-03-01 16:25:14 +01:00

mutation

frozen_mutation: add unfreeze_gently(span<frozen_mutation>)

2024-03-17 17:45:30 +02:00

mutation_writer

mutation_writer: do not include unused headers

2024-01-24 15:20:02 +02:00

node_ops

node_ops: add fmt::formatter for node_ops_cmd and node_ops_cmd_request

2024-03-06 10:24:31 +02:00

raft

raft: add fmt::formatter for raft tracker types

2024-03-08 15:19:37 +02:00

readers

treewide: Use partition_slice::is_reversed()

2024-03-13 08:52:46 +02:00

redis

transport/controller: pass unix_domain_socket_permissions to generic_server::listen

2024-02-05 14:22:03 +01:00

reloc

…

repair

repair: add fmt::formatter for row_level_diff_detect_algorithm

2024-03-16 19:12:49 +02:00

replica

cql: don't crash when creating a view during a truncate

2024-03-20 08:54:39 +02:00

rust

build: cmake: use scylla build mode for rust profile name

2024-03-06 15:53:11 +08:00

schema

schema: add fmt::formatter for schema

2024-03-13 09:29:00 +02:00

scripts

open-coredump.sh: respect http redirects

2024-03-13 08:57:04 +02:00

seastar @ a71bd96d5a

Update seastar submodule

2024-03-12 09:19:28 +02:00

service

raft_group0_client: assert that hold_read_apply_mutex is called on shard 0

2024-03-18 16:20:41 +01:00

sstables

sstables: Fix clone semantics for runs in partitioned_sstable_set

2024-03-20 08:41:32 +02:00

streaming

error_injection: Overload inject() instead of inject_with_handler()

2024-03-11 19:30:19 +03:00

swagger-ui @ 12f1da1082

…

tasks

tasks: do not include unused headers

2024-02-02 15:20:40 +01:00

test

Fix leaking file descriptors in test.py

2024-03-19 14:59:14 +01:00

thrift

interval: rename nonwrapping_interval to interval

2024-02-21 19:43:17 +02:00

tools

compaction: Check for key presence in memtable when calculating max purgeable timestamp

2024-03-18 13:37:44 +02:00

tracing

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

transport

maintenance_socket: change log message to differentiate from regular CQL ports

2024-03-08 10:08:09 +01:00

types

data_value: delete data_value(T*) constructor

2024-02-11 15:42:55 +02:00

unified

Update unified/build_unified.sh

2023-12-05 15:23:38 +02:00

utils

Merge 'Simplify error_injection::inject_with_handler()' from Pavel Emelyanov

2024-03-14 13:37:54 +02:00

.dockerignore

…

.gitattributes

…

.gitignore

docs: download iam csv files

2023-10-02 12:28:56 +03:00

.gitmodules

…

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

amplify.yml

…

backlog_controller.hh

treewide: apply codespell to the comments in source code

2023-12-20 10:25:03 +02:00

build_mode.hh

…

bytes_ostream.hh

utils: managed_bytes: optimize memory usage for small buffers

2024-02-09 20:56:20 +01:00

bytes.cc

…

bytes.hh

bytes.hh: correct spelling of delimiter and delimited

2023-12-18 20:46:21 +02:00

cache_flat_mutation_reader.hh

cache_flat_mutation_reader: fix a broken iterator validity guarantee in ensure_population_lower_bound()

2023-11-16 19:01:18 +01:00

cache_temperature.hh

…

cartesian_product.hh

…

cell_locking.hh

…

checked-file-impl.hh

code: Switch to seastar API level 7

2023-06-06 13:29:16 +03:00

client_data.cc

…

client_data.hh

…

clocks-impl.cc

clocks-impl: format time_point using fmt

2023-11-22 17:44:07 +02:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

clustering_bounds_comparator: add fmt::formtter for bound_{kind,view}

2024-03-11 11:37:48 +02:00

clustering_interval_set.hh

clustering_interval_set: add fmt::formatter for clustering_interval_set

2024-03-08 15:13:14 +02:00

clustering_key_filter.hh

…

clustering_ranges_walker.hh

…

CMakeLists.txt

build: cmake: reword the comment for dev-headers

2024-03-15 09:51:47 +02:00

collection_mutation.cc

collection_mutation: add formatter for collection_mutation_view::printer

2024-02-13 17:42:25 +02:00

collection_mutation.hh

collection_mutation: add formatter for collection_mutation_view::printer

2024-02-13 17:42:25 +02:00

column_computation.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

combine.hh

…

compound_compat.hh

compound_compat: do not format an sstring with {:d}

2023-07-08 15:13:11 +03:00

compound.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

compress.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

compress.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

concrete_types.hh

use fmt::to_string() for seastar::net::inet_address

2024-02-05 16:56:40 +01:00

configure.py

db: add system_auth_v2 keyspace

2024-03-01 10:40:29 +01:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.cc

…

converting_mutation_partition_applier.hh

…

counters.cc

…

counters.hh

…

coverage_excludes.txt

test.py: support code coverage

2024-01-18 11:11:34 +02:00

coverage_sources.list

configure.py support coverage profiles on standrad build modes

2024-01-18 11:11:34 +02:00

cql_serialization_format.hh

…

db_clock.hh

…

debug.cc

…

debug.hh

…

default.nix

…

Doxyfile

…

duration.cc

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

duration.hh

…

encoding_stats.hh

encoding_state: mark helper methods protected

2023-08-29 15:41:13 +03:00

enum_set.hh

…

fix_system_distributed_tables.py

…

flake.lock

…

flake.nix

…

frozen_schema.cc

…

frozen_schema.hh

…

full_position.hh

…

gc_clock.hh

db: add formatter for gc_clock::time_point

2024-02-11 16:39:25 +02:00

gdbinit

…

gen_segmented_compress_params.py

Typos: fix typos in code

2023-12-13 10:45:21 +02:00

generic_server.cc

transport/controller: pass unix_domain_socket_permissions to generic_server::listen

2024-02-05 14:22:03 +01:00

generic_server.hh

transport/controller: pass unix_domain_socket_permissions to generic_server::listen

2024-02-05 14:22:03 +01:00

HACKING.md

…

hashing_partition_visitor.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

idl-compiler.py

Typos: fix typos in code

2023-12-13 10:45:21 +02:00

inet_address_vectors.hh

abstract_replication_strategy: calculate_natural_endpoints: make it work with both versions of token_metadata

2023-12-12 23:19:53 +04:00

init.cc

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

init.hh

Merge 'Typos: fix typos in code' from Yaniv Kaul

2023-12-06 07:36:41 +02:00

install-dependencies.sh

install-dependencies.sh: remove duplicate python3-pyudev package

2024-02-02 15:20:40 +01:00

install.sh

install.sh: use a temporary file when packaging scylla.yaml

2024-01-01 21:50:29 +02:00

interval.hh

interval: add fmt::formatters for managed_bytes and friends

2024-02-23 10:26:30 +02:00

keys.cc

clustering_bounds_comparator: add fmt::formtter for bound_{kind,view}

2024-03-11 11:37:48 +02:00

keys.hh

keys: do not use zip_iterator for printing key components

2023-07-01 23:49:02 +03:00

LICENSE.AGPL

…

log.hh

…

main.cc

Merge 'Migrate system_auth to raft group0' from Marcin Maliszkiewicz

2024-03-06 10:11:33 +01:00

map_difference.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

marshal_exception.hh

…

multishard_mutation_query.cc

replica/database: use include page-size in max-result-size

2024-02-27 02:27:55 -05:00

multishard_mutation_query.hh

treewide: apply codespell to the comments in source code

2023-12-20 10:25:03 +02:00

mutation_query.cc

mutation_query: reconcilable_result: add merge_disjoint()

2024-02-21 02:08:48 -05:00

mutation_query.hh

treewide: Use partition_slice::is_reversed()

2024-03-13 08:52:46 +02:00

noexcept_traits.hh

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

interval: rename nonwrapping_interval to interval

2024-02-21 19:43:17 +02:00

partition_slice_builder.cc

…

partition_slice_builder.hh

…

partition_snapshot_reader.hh

…

partition_snapshot_row_cursor.hh

partition_snapshot_row_cursor: add fmt::format to this class

2024-03-08 15:15:43 +02:00

protocol_server.hh

…

querier.cc

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

querier.hh

treewide: Use partition_slice::is_reversed()

2024-03-13 08:52:46 +02:00

query_id.hh

…

query_ranges_to_vnodes.cc

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

query_ranges_to_vnodes.hh

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

query_result_merger.hh

…

query-request.hh

query-request: use default-generated operator==

2024-03-07 09:02:42 +03:00

query-result-reader.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

query-result-set.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

query-result-set.hh

query-result-set: add formatter for query-result-set.hh types

2024-02-21 17:54:48 +08:00

query-result-writer.hh

query: do not kill unpaged queries when they reach the tombstone-limit

2024-02-12 12:34:04 +02:00

query-result.hh

query-result.hh: add formatter for query::result::printer

2024-02-21 17:57:18 +08:00

query.cc

treewide: use #include <seastar/...> for seastar headers

2023-06-06 08:36:09 +03:00

read_context.hh

treewide: Use partition_slice::is_reversed()

2024-03-13 08:52:46 +02:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: use variable reference for metrics

2024-03-11 20:47:04 +02:00

reader_concurrency_semaphore.hh

reader_permit: store schema_ptr instead of raw schema pointer

2024-01-11 08:37:56 +02:00

reader_permit.hh

add fmt::formatter for reader_permit::state and reader_resources

2024-03-11 09:55:51 +02:00

README.md

…

real_dirty_memory_accounter.hh

…

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

treewide: Use partition_slice::is_reversed()

2024-03-13 08:52:46 +02:00

row_cache.hh

row_cache: add fmt::formatter for cache_entry

2024-03-06 10:08:11 +02:00

schema_mutations.cc

schema_mutations: add fmt::formatter for schema_mutations

2024-03-15 09:49:56 +02:00

schema_mutations.hh

schema_mutations: add fmt::formatter for schema_mutations

2024-03-15 09:49:56 +02:00

schema_upgrader.hh

…

scylla_post_install.sh

dist: drop legacy control group parameters

2023-12-11 19:38:28 +09:00

scylla-gdb.py

scylla-gdb: use current_scheduling_group_ptr instead of task_queue._current

2024-03-11 13:13:59 +02:00

SCYLLA-VERSION-GEN

Typos: fix typos in code

2023-12-13 10:45:21 +02:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

serializer.cc

…

serializer.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

service_permit.hh

…

setup.py

…

shell.nix

…

sstables_loader.cc

sstables_loader: Stream to pending tablet replica if needed

2024-02-27 15:17:05 -03:00

sstables_loader.hh

…

supervisor.hh

…

table_helper.cc

keyspace_metadata: Add default value for new_keyspace's durable_writes

2023-12-26 11:47:37 +03:00

table_helper.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

test.py

Fix leaking file descriptors in test.py

2024-03-19 14:59:14 +01:00

timeout_config.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

timeout_config.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

timestamp.hh

…

tombstone_gc_extension.hh

…

tombstone_gc_options.cc

tombstone_gc_options: add fmt::formatter for tombstone_gc_mode

2024-03-08 15:12:00 +02:00

tombstone_gc_options.hh

tombstone_gc_options: add fmt::formatter for tombstone_gc_mode

2024-03-08 15:12:00 +02:00

tombstone_gc.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

tombstone_gc.hh

interval: rename nonwrapping_interval to interval

2024-02-21 19:43:17 +02:00

tox.ini

…

ubsan-suppressions.supp

…

unimplemented.cc

unimplemented: add format_as() for unimplemented::cause

2024-01-19 08:38:30 +02:00

unimplemented.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

validation.cc

…

validation.hh

…

version.hh

…

view_info.hh

view_info: add fmt::formatter for view_info

2024-03-12 13:28:27 +02:00

vint-serialization.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

vint-serialization.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

zstd.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%