mirror of https://github.com/scylladb/scylladb.git synced 2026-05-23 16:22:15 +00:00

Go to file

Avi Kivity 8624718983 Merge "row_cache: update reader implementations to v2" from Botond

"
cache_flat_mutation_reader gets a native v2 implementation. The
underlying mutation representation is not changed: range deletions are
still stored as v1 range_tombstones in mutation_partition. These are
converted to range tombstone changes during reading.
This allows for separating the change of a native v2 reader
implementation and a native v2 in-memory storage format, enabling the
two to be done at separate times and incrementally.
This means there is still conversion ingoing when reading from cache and
when populating, but when reading from underlying, the stream can now be
passed through as-is without conversions.
Also, any future v2 related changes to the in-memory storage will now be
limited to the cache reader implementation itself.

In the process, the non-forwarding reader, whose only user is the cache,
is also converted to v2.
"

Performance results reported by Botond:

"
build/release/test/perf/perf_simple_query -c1 -m2G --flush --
duration=20

BEFORE
median 130421.76 tps ( 71.1 allocs/op,  12.1 tasks/op,   47462
insns/op)
median absolute deviation: 319.64
maximum: 131028.33
minimum: 127502.55

AFTER
median 133297.41 tps ( 64.1 allocs/op,  12.2 tasks/op,   45406
insns/op)
median absolute deviation: 2964.24
maximum: 137581.56
minimum: 123739.4

Getting rid of those upgrade/downgrade was good for allocs and ops.
Curiously there is a 0.1 rise in number of tasks though.
"

* 'row-cache-readers-v2/v1' of https://github.com/denesb/scylla:
  row_cache: update reader implementations to v2
  range_tombstone_change_generator: flush(): add end_of_range
  readers/nonforwardable: convert to v2
  read_context: fix indentation
  read_context: coroutinize move_to_next_partition()
  row_cache: cache_entry::read(): return v2 reader
  row_cache: return v2 readers from make_reader*()
  readers/delegating_v2: s/make_delegating_reader_v2/make_delegating_reader/

2022-04-23 19:10:43 +03:00

.github

docs: update theme 1.2.1

2022-04-03 13:45:07 +03:00

abseil @ f70eadadd7

…

alternator

alternator: ttl: avoid specializing class templates in non-namespace scope

2022-04-18 12:27:18 +03:00

api

api: avoid function specialization in req_param

2022-04-18 12:27:18 +03:00

auth

treewide: require group0_guard when performing schema changes

2022-01-24 15:20:35 +01:00

cdc

system_keyspace,cdc,storage_service: Make bootstrap manipulations non-static

2022-03-25 15:08:13 +03:00

compaction

compaction: leveled_compaction_strategy: avoid compares between signed and unsigned

2022-04-18 12:27:18 +03:00

conf

db: config: add a flag to disable new parallelized aggregation algorithm

2022-02-01 21:26:25 +01:00

cql3

cql3: expr: possible_lhs_values: Handle subscript

2022-04-11 19:05:09 +03:00

data_dictionary

database,cql3: add STORAGE option to keyspaces

2022-04-08 09:17:01 +02:00

Merge 'Fix some errors and issues found by gcc 12' from Avi Kivity

2022-04-19 10:25:38 +03:00

debug

…

dht

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

dist

scripts: print perftune.py error message when capture_output=True

2022-04-18 14:06:51 +03:00

docs

alternator: implement Select option of Query and Scan

2022-04-11 10:04:32 +02:00

exceptions

cross-tree: split coordinator_result from exceptions.hh

2022-03-02 10:12:57 +02:00

gms

gms: gossiper: coroutinize apply_state_locally

2022-04-17 11:51:18 +03:00

idl

tracing: Trace slow queries on replicas wrt. parent's clock

2022-02-10 12:03:53 +01:00

index

tree: remove mutation_reader.hh include

2022-03-30 15:42:51 +03:00

interface

…

lang

wasm: add wasm ABI version 2

2022-03-30 20:49:35 +02:00

libdeflate @ e7e54eab42

…

licenses

…

locator

treewide: abort() after switch in formatters

2022-04-18 12:27:18 +03:00

message

treewide: fix compilation issues with fmtlib 8.1.0+

2022-03-16 12:31:50 +03:00

mutation_writer

tree: remove mutation_reader.hh include

2022-03-30 15:42:51 +03:00

raft

raft: server: translate abort_requested_exception to raft::request_aborted

2022-04-05 19:18:53 +02:00

readers

Merge "row_cache: update reader implementations to v2" from Botond

2022-04-23 19:10:43 +03:00

redis

treewide: require group0_guard when performing schema changes

2022-01-24 15:20:35 +01:00

reloc

…

repair

Merge 'Remove queue reader v1' from Mikołaj Sielużycki

2022-04-21 12:34:48 +03:00

replica

row_cache: return v2 readers from make_reader*()

2022-04-20 10:59:09 +03:00

scripts

scripts: Allow specifying submodule branch to refresh from

2022-03-22 15:18:25 +02:00

seastar @ 5e86362704

Update seastar submodule

2022-04-17 17:11:31 +03:00

service

storage_proxy: coroutinize mutate_locally (vector overload)

2022-04-19 10:59:16 +03:00

sstables

sstables: : remove unnecessary throws

2022-04-12 13:09:54 +02:00

streaming

tree: remove mutation_reader.hh include

2022-03-30 15:42:51 +03:00

swagger-ui @ 12f1da1082

…

test

Merge "row_cache: update reader implementations to v2" from Botond

2022-04-23 19:10:43 +03:00

thrift

result_message: add result_message::exception

2022-02-08 11:08:42 +01:00

tools

test.py: add a dependency on python3-aiohttp and tabulate

2022-04-19 18:22:50 +03:00

tracing

tracing: Trace slow queries on replicas wrt. parent's clock

2022-02-10 12:03:53 +01:00

transport

transport: return correct error codes when downgrading v4 {WRITE,READ}_FAILURE to {WRITE,READ}_TIMEOUT

2022-04-12 19:19:52 +03:00

types

types/map.hh: add missing const qualifiers

2022-03-03 14:24:05 +02:00

unified

…

utils

utils: result_loop: remove invalid and incorrect constraint

2022-04-18 12:27:18 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

.gitignore: ignore mypy_cache, the python lint cache

2022-04-19 16:48:47 +03:00

.gitmodules

…

.gitorderfile

…

.lycheeignore

docs: update theme 1.2.1

2022-04-03 13:45:07 +03:00

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

atomic_cell_hash.hh

…

atomic_cell_or_collection.hh

…

atomic_cell.cc

atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal

2022-03-07 11:05:30 +02:00

atomic_cell.hh

…

backlog_controller.hh

…

bytes_ostream.hh

…

bytes.cc

…

bytes.hh

…

cache_flat_mutation_reader.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

cache_temperature.hh

…

caching_options.cc

…

caching_options.hh

…

canonical_mutation.cc

…

canonical_mutation.hh

…

cartesian_product.hh

…

cell_locking.hh

…

checked-file-impl.hh

…

client_data.cc

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

client_data.hh

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

clocks-impl.cc

…

clocks-impl.hh

…

clustering_bounds_comparator.hh

…

clustering_interval_set.hh

…

clustering_key_filter.hh

…

clustering_ranges_walker.hh

…

CMakeLists.txt

cmake: CMakeLists.txt: rename flat_mutation_reader.cc to readers/mutation_readers.cc

2022-04-06 14:10:34 +03:00

collection_mutation.cc

…

collection_mutation.hh

…

column_computation.hh

…

combine.hh

…

compatible_ring_position.hh

…

compound_compat.hh

compound_compat.hh: add missing methods of iterator

2022-03-08 15:37:03 +02:00

compound.hh

…

compress.cc

…

compress.hh

…

concrete_types.hh

…

configure.py

build: disable warnings that cause false-positive errors with gcc 12

2022-04-18 12:27:18 +03:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.cc

…

converting_mutation_partition_applier.hh

…

counters.cc

…

counters.hh

…

cql_serialization_format.hh

…

db_clock.hh

…

debug.hh

…

default.nix

…

digest_algorithm.hh

…

digester.hh

…

dirty_memory_manager.hh

memtable: move to replica module and namespace

2022-02-23 09:05:16 +02:00

Doxyfile

…

duration.cc

…

duration.hh

…

encoding_stats.hh

…

enum_set.hh

…

fix_system_distributed_tables.py

…

frozen_mutation.cc

frozen_mutation: fragment_and_freeze(): convert to v2

2022-03-31 09:57:48 +03:00

frozen_mutation.hh

frozen_mutation: introduce consume method

2022-04-05 10:51:21 +03:00

frozen_schema.cc

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

frozen_schema.hh

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

gc_clock.hh

…

gen_segmented_compress_params.py

treewide: clean up stray license blurbs

2022-02-13 14:16:16 +02:00

generic_server.cc

generic_server: Gentle iterator

2022-02-18 14:25:08 +03:00

generic_server.hh

generic_server.hh: add missing include

2022-04-04 17:31:55 +03:00

HACKING.md

docs: update theme 1.2.1

2022-04-03 13:45:07 +03:00

hashers.cc

…

hashers.hh

…

hashing_partition_visitor.hh

…

hashing.hh

…

idl-compiler.py

treewide: clean up stray license blurbs

2022-02-13 14:16:16 +02:00

inet_address_vectors.hh

…

init.cc

…

init.hh

…

install-dependencies.sh

test.py: add a dependency on python3-aiohttp and tabulate

2022-04-19 18:22:50 +03:00

install.sh

docker: revert scylla-server.conf service name change

2022-04-03 19:18:18 +03:00

interval.hh

…

intrusive_set_external_comparator.hh

…

keys.cc

…

keys.hh

…

LICENSE.AGPL

…

log.hh

…

main.cc

main: fix discarded future during prometheus start sequence

2022-04-15 16:40:31 +03:00

map_difference.hh

…

marshal_exception.hh

…

multishard_mutation_query.cc

readers: move multishard reader & friends to reader/multishard.cc

2022-03-30 15:42:51 +03:00

multishard_mutation_query.hh

…

mutation_cleaner.hh

…

mutation_compactor.hh

mutation_compactor: drop v1 related code-paths

2022-03-11 09:24:05 +02:00

mutation_consumer_concepts.hh

introduce the MutationConsumer concept

2022-02-28 17:11:54 +02:00

mutation_fragment_fwd.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: validate range tombstone changes

2022-03-29 13:19:05 +03:00

mutation_fragment_v2.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation_fragment.cc

…

mutation_fragment.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation_partition_serializer.cc

…

mutation_partition_serializer.hh

…

mutation_partition_view.cc

…

mutation_partition_view.hh

…

mutation_partition_visitor.hh

…

mutation_partition.cc

frozen_mutation: introduce consume method

2022-04-05 10:51:21 +03:00

mutation_partition.hh

code: Convert is_same+result_of assertions into invocable concepts

2022-02-24 19:46:10 +03:00

mutation_query.cc

…

mutation_query.hh

reconcilable_result_builder: remove v1 support

2022-03-11 09:24:46 +02:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

…

mutation.cc

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation.hh

mutation: migrate consume() to v2

2022-02-21 12:27:55 +02:00

noexcept_traits.hh

…

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

…

partition_snapshot_reader.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

partition_snapshot_row_cursor.hh

…

partition_version_list.hh

…

partition_version.cc

…

partition_version.hh

…

position_in_partition.hh

…

protocol_server.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

querier.cc

…

querier.hh

mutation_reader: move mutation source into readers/

2022-03-30 15:42:51 +03:00

query_class_config.hh

…

query_ranges_to_vnodes.cc

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_ranges_to_vnodes.hh

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_result_merger.hh

…

query-request.hh

messaging_service: add verb for count(*) request forwarding

2022-02-01 21:14:41 +01:00

query-result-reader.hh

…

query-result-set.cc

…

query-result-set.hh

…

query-result-writer.hh

query_result_builder: remove v1 support

2022-03-11 09:24:17 +02:00

query-result.hh

…

query.cc

query: do not assert in operator<<(ostream&, const forward_result::printer&)

2022-03-09 14:58:11 +01:00

range_tombstone_assembler.hh

…

range_tombstone_change_generator.hh

range_tombstone_change_generator: flush(): add end_of_range

2022-04-21 14:37:10 +03:00

range_tombstone_list.cc

range_tombstone_list: insert_from: correct rev.update range_tombstone in not overlapping case

2022-04-04 22:26:29 +02:00

range_tombstone_list.hh

…

range_tombstone_splitter.hh

…

range_tombstone.cc

…

range_tombstone.hh

…

range.hh

…

read_context.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

reader_concurrency_semaphore.cc

treewide: fix compilation issues with fmtlib 8.1.0+

2022-03-16 12:31:50 +03:00

reader_concurrency_semaphore.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

reader_permit.hh

evicatble_reader: avoid preemption pitfall around waiting for readmission

2022-03-15 14:37:22 +02:00

README.md

…

real_dirty_memory_accounter.hh

memtable: move to replica module and namespace

2022-02-23 09:05:16 +02:00

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

row_cache.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

schema_builder.hh

…

schema_fwd.hh

…

schema_mutations.cc

…

schema_mutations.hh

…

schema_registry.cc

…

schema_registry.hh

…

schema_upgrader.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

schema.cc

secondary index: avoid special characters in default index names

2022-03-20 18:33:48 +02:00

schema.hh

…

scylla_post_install.sh

…

scylla-gdb.py

scylla-gdb: Support lw_shared_ptr_no_esft

2022-03-22 17:19:39 +02:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN:set release-version value length

2022-02-21 13:28:04 +02:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

…

serializer.cc

…

serializer.hh

code: Convert is_integral assertions to concepts

2022-02-24 19:44:29 +03:00

service_permit.hh

…

setup.py

…

shell.nix

…

sstables_loader.cc

…

sstables_loader.hh

…

supervisor.hh

…

table_helper.cc

treewide: require group0_guard when performing schema changes

2022-01-24 15:20:35 +01:00

table_helper.hh

…

test.py

test.py: highlight the failure cause

2022-02-04 17:15:52 +03:00

timeout_config.cc

…

timeout_config.hh

…

timestamp.hh

…

to_string.hh

to_string.hh: include <map>

2022-02-17 08:53:48 +02:00

tombstone_gc_extension.hh

…

tombstone_gc_options.cc

…

tombstone_gc_options.hh

…

tombstone_gc.cc

treewide: abort() after switch in formatters

2022-04-18 12:27:18 +03:00

tombstone_gc.hh

Merge "tools: cut schema loader free of replica::database" from Botond

2022-03-27 17:01:05 +03:00

tombstone.hh

…

tox.ini

…

types.cc

types: fix is_string for reversed types

2022-03-09 08:18:33 +01:00

types.hh

Merge "Conceptualize some static assertions" From Pavel Emelyanov

2022-02-28 13:58:01 +02:00

ubsan-suppressions.supp

…

unimplemented.cc

…

unimplemented.hh

…

validation.cc

validation: complete transition to data_dictionary module

2022-01-25 09:52:30 +02:00

validation.hh

validation: complete transition to data_dictionary module

2022-01-25 09:52:30 +02:00

version.hh

…

view_info.hh

…

vint-serialization.cc

…

vint-serialization.hh

…

xx_hasher.hh

…

zstd.cc

…

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%