mirror of https://github.com/scylladb/scylladb.git synced 2026-06-07 15:33:15 +00:00

Go to file

Piotr Sarna 2015988373 Merge 'types: get rid of linearization in deserialize()' from Michał Chojnowski

Citing #6138: > In the past few years we have converted most of our codebase to
work in terms of fragmented buffers, instead of linearised ones, to help avoid
large allocations that put large pressure on the memory allocator.  > One
prominent component that still works exclusively in terms of linearised buffers
is the types hierarchy, more specifically the de/serialization code to/from CQL
format. Note that for most types, this is the same as our internal format,
notable exceptions are non-frozen collections and user types.  > > Most types
are expected to contain reasonably small values, but texts, blobs and especially
collections can get very large. Since the entire hierarchy shares a common
interface we can either transition all or none to work with fragmented buffers.

This series gets rid of intermediate linearizations in deserialization. The next
steps are removing linearizations from serialization, validation and comparison
code.

Series summary:
- Fix a bug in `fragmented_temporary_buffer::view::remove_prefix`. (Discovered
  while testing. Since it wasn't discovered earlier, I guess it doesn't occur in
  any code path in master.)
- Add a `FragmentedView` concept to allow uniform handling of various types of
  fragmented buffers (`bytes_view`, `temporary_fragmented_buffer::view`,
  `ser::buffer_view` and likely `managed_bytes_view` in the future).
- Implement `FragmentedView` for relevant fragmented buffer types.
- Add helper functions for reading from `FragmentedView`.
- Switch `deserialize()` and all its helpers from `bytes_view` to
  `FragmentedView`.
- Remove `with_linearized()` calls which just became unnecessary.
- Add an optimization for single-fragment cases.

The addition of `FragmentedView` might be controversial, because another concept
meant for the same purpose - `FragmentRange` - is already used. Unfortunately,
it lacks the functionality we need. The main (only?) thing we want to do with a
fragmented buffer is to extract a prefix from it and `FragmentRange` gives us no
way to do that, because it's immutable by design. We can work around that by
wrapping it into a mutable view which will track the offset into the immutable
`FragmentRange`, and that's exactly what `linearizing_input_stream` is. But it's
wasteful. `linearizing_input_stream` is a heavy type, unsuitable for passing
around as a view - it stores a pair of fragment iterators, a fragment view and a
size (11 words) to conform to the iterator-based design of `FragmentRange`, when
one fragment iterator (4 words) already contains all needed state, just hidden.
I suggest we replace `FragmentRange` with `FragmentedView` (or something
similar) altogether.

Refs: #6138

Closes #7692

* github.com:scylladb/scylla:
  types: collection: add an optimization for single-fragment buffers in deserialize
  types: add an optimization for single-fragment buffers in deserialize
  cql3: tuples: don't linearize in in_value::from_serialized
  cql3: expr: expression: replace with_linearize with linearized
  cql3: constants: remove unneeded uses of with_linearized
  cql3: update_parameters: don't linearize in prefetch_data_builder::add_cell
  cql3: lists: remove unneeded use of with_linearized
  query-result-set: don't linearize in result_set_builder::deserialize
  types: remove unneeded collection deserialization overloads
  types: switch collection_type_impl::deserialize from bytes_view to FragmentedView
  cql3: sets: don't linearize in value::from_serialized
  cql3: lists: don't linearize in value::from_serialized
  cql3: maps: don't linearize in value::from_serialized
  types: remove unused deserialize_aux
  types: deserialize: don't linearize tuple elements
  types: deserialize: don't linearize collection elements
  types: switch deserialize from bytes_view to FragmentedView
  types: deserialize tuple types from FragmentedView
  types: deserialize set type from FragmentedView
  types: deserialize map type from FragmentedView
  types: deserialize list type from FragmentedView
  types: add FragmentedView versions of read_collection_size and read_collection_value
  types: deserialize varint type from FragmentedView
  types: deserialize floating point types from FragmentedView
  types: deserialize decimal type from FragmentedView
  types: deserialize duration type from FragmentedView
  types: deserialize IP address types from FragmentedView
  types: deserialize uuid types from FragmentedView
  types: deserialize timestamp type from FragmentedView
  types: deserialize simple date type from FragmentedView
  types: deserialize time type from FragmentedView
  types: deserialize boolean type from FragmentedView
  types: deserialize integer types from FragmentedView
  types: deserialize string types from FragmentedView
  types: remove unused read_simple_opt
  types: implement read_simple* versions for FragmentedView
  utils: fragmented_temporary_buffer: implement FragmentedView for view
  utils: fragment_range: add single_fragmented_view
  serializer: implement FragmentedView for buffer_view
  utils: fragment_range: add linearized and with_linearized for FragmentedView
  utils: fragment_range: add FragmentedView
  utils: fragmented_temporary_buffer: fix view::remove_prefix

2020-12-04 09:46:20 +01:00

.github

codeowners: add a couple of Botonds

2020-11-10 18:22:52 +02:00

abseil @ 1e3d25b265

Update abseil submodule from upstream

2020-10-25 12:51:40 +02:00

alternator

alternator: guard streams with an experimental flag

2020-11-12 12:36:16 +01:00

api

api: Add force_remove_endpoint for gossip

2020-11-29 13:58:46 +02:00

auth

auth: Fix class name vs field name compilation by gcc

2020-11-18 18:40:55 +02:00

cdc

cdc: produce postimage when inserting with no regular columns

2020-12-01 18:01:23 +02:00

conf

db: add TransitionalAuthorizer and TransitionalAuthenticator...

2020-11-09 10:51:54 +01:00

cql3

Merge 'types: get rid of linearization in deserialize()' from Michał Chojnowski

2020-12-04 09:46:20 +01:00

data

data/cell: fix value_writer use before definition

2020-10-12 13:41:09 +03:00

large_data_handler: maybe_delete_large_data_entries: use sstable large data stats

2020-12-01 15:19:42 +02:00

debug

…

dht

dht/i_partitioner: to_partition_ranges: support yielding

2020-11-24 12:23:56 +02:00

dist

Merge 'dist/common/scripts/scylla_setup: Optionally config rsyslog destination' from Amnon Heiman

2020-12-01 13:12:32 +02:00

docs

docs: sstable-scylla-format: document large_data_type in more details

2020-12-02 13:25:49 +02:00

exceptions

cql_metrics: Add metrics for CQL errors

2020-11-30 12:18:37 +02:00

gms

api: Add force_remove_endpoint for gossip

2020-11-29 13:58:46 +02:00

idl

Merge "Get rid of seed concept in gossip" from Asias

2020-08-17 09:50:51 +03:00

imr

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

index

secondary_index: use new token_column_computation

2020-11-04 12:02:42 +01:00

interface

thrift: switch csharp backend to netstd

2020-06-23 19:40:18 +03:00

libdeflate @ e7e54eab42

…

licenses

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

locator

locator: extract can_yield to utils/maybe_yield.hh

2020-11-24 12:23:56 +02:00

message

messaging: msg_addr: mark methods noexcept

2020-11-01 16:46:18 +02:00

mutation_writer

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

raft

raft: ignore append_reply from a peer in SNAPSHOT state

2020-11-25 12:36:41 +02:00

redis

redis::service: Shut down sharded<> subobject on startup exception

2020-11-25 15:52:47 +00:00

reloc

reloc: Remove "build_reloc.sh" script as obsolete

2020-11-20 22:41:26 +02:00

repair

Merge "Unstall cleanup_compaction::get_ranges_for_invalidation" from Benny

2020-11-29 14:10:39 +02:00

scripts

reloc: add ubsan-suppressions.supp to relocatable package

2020-11-10 19:14:27 +02:00

seastar @ 8b400c7b45

Update seastar submodule

2020-12-01 15:12:25 +02:00

service

locator: extract can_yield to utils/maybe_yield.hh

2020-11-24 12:23:56 +02:00

sstables

large_data_handler: maybe_delete_large_data_entries: use sstable large data stats

2020-12-01 15:19:42 +02:00

streaming

database, streaming: remove remnants of memtable-base streaming

2020-11-16 14:32:19 +01:00

swagger-ui @ 12f1da1082

…

test

Merge 'mutation_reader: introduce clustering_order_reader_merger' from Kamil Braun

2020-12-02 12:15:35 +02:00

thrift

thrift: Validate cell names when constructing clustering keys

2020-12-01 15:12:08 +02:00

tools

Update tools/java submodule

2020-11-27 15:19:48 +02:00

tracing

tracing: Keep qp anchor on backend

2020-10-06 15:45:19 +03:00

transport

cql_metrics: Add metrics for CQL errors

2020-11-30 12:18:37 +02:00

types

types: collection: add an optimization for single-fragment buffers in deserialize

2020-12-04 09:21:05 +01:00

unified

build: compress unified package faster

2020-11-23 00:31:04 +02:00

utils

Merge 'types: get rid of linearization in deserialize()' from Michał Chojnowski

2020-12-04 09:46:20 +01:00

.dockerignore

.dockerignore: add testlog

2020-02-07 08:59:39 +01:00

.gitattributes

…

.gitignore

.gitignore: add .vscode to the list

2020-07-30 16:35:06 +03:00

.gitmodules

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

.gitorderfile

…

absl-flat_hash_map.cc

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

absl-flat_hash_map.hh

Add absl wrapper headers

2020-06-14 08:18:39 -07:00

atomic_cell_hash.hh

collection_mutation: easier (de)serialization of collection_mutation(s).

2019-10-25 10:42:58 +02:00

atomic_cell_or_collection.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

atomic_cell.cc

data/cell: don't overshoot target allocation sizes

2020-09-14 14:21:46 +03:00

atomic_cell.hh

atomic_cell.hh: forward-declare atomic_cell_or_collection

2020-09-21 16:32:53 +03:00

backlog_controller.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

bytes_ostream.hh

bytes_ostream: Remove std::iterator from fragment_iterator

2020-11-17 16:53:20 +01:00

bytes.cc

mp_row_consumer: Provide hex-formatting wrapper for bytes_view

2020-08-26 20:44:11 +03:00

bytes.hh

bytes: define contructor for fmt_hex

2020-09-21 16:32:53 +03:00

cache_flat_mutation_reader.hh

range_tombstone: Remove unused schema arg from .set_start

2020-11-06 15:13:05 +03:00

cache_temperature.hh

…

caching_options.hh

treewide: replace libjsoncpp usage with rjson

2020-07-03 10:27:23 +02:00

canonical_mutation.cc

everywhere: Use uninitialized_string instead of sstring::initialized_later

2020-03-10 13:17:49 -07:00

canonical_mutation.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

cartesian_product.hh

cartesian_product: Remove std::iterator from iterator

2020-11-17 16:53:20 +01:00

cell_locking.hh

mutation_partition: make static_row optional to reduce memory footprint

2019-10-15 15:42:05 +03:00

checked-file-impl.hh

treewide: replace calls to engine().some_api() with some_api()

2020-04-05 12:46:04 +03:00

clocks-impl.cc

clocks-impl: switch to thread-safe time conversion

2020-05-04 14:11:38 +03:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

clustering_interval_set.hh

clustering_interval_set: Remove std::iterator from position_range_iterator

2020-11-17 16:53:20 +01:00

clustering_key_filter.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

clustering_ranges_walker.hh

…

CMakeLists.txt

cmake: redesign scylla's CMakeLists.txt to finally allow full-fledged building

2020-11-10 10:34:27 +02:00

collection_mutation.cc

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

collection_mutation.hh

collection_mutation_view: add type-aware pretty printer

2020-01-07 12:06:29 +02:00

column_computation.hh

column_computation: add token_column_computation

2020-11-04 12:02:42 +01:00

combine.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

compaction_garbage_collector.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

compaction_strategy_type.hh

compaction_strategy: add method to reshape SSTables

2020-06-18 09:37:18 -04:00

compaction_strategy.hh

distributed_loader: reshard before the node is made online

2020-06-18 09:37:18 -04:00

compatible_ring_position.hh

…

compound_compat.hh

compound_compat: Remove std::iterator from iterators

2020-11-17 16:53:20 +01:00

compound.hh

compound: Remove std::iterator from iterator

2020-11-17 16:53:20 +01:00

compress.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

compress.hh

…

concrete_types.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

configure.py

Merge 'sstables: a bunch of refactors' from Kamil Braun

2020-11-24 09:23:57 +02:00

connection_notifier.cc

code: Use qctx::evecute_cql methods, not global ones

2020-11-19 18:39:05 +03:00

connection_notifier.hh

code: Use qctx::evecute_cql methods, not global ones

2020-11-19 18:39:05 +03:00

CONTRIBUTING.md

Fix a link to contributor-agreement in the CONTRIBUTING page

2020-05-17 14:15:49 +03:00

converting_mutation_partition_applier.cc

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

converting_mutation_partition_applier.hh

converting_mutation_partition_applier: move to .cc file

2020-03-04 12:42:57 +02:00

counters.cc

counters: remove unused 1.7.4 counter order code

2020-09-29 12:16:58 +03:00

counters.hh

counters: Remove std::iterator from iterators

2020-11-17 16:53:20 +01:00

cql_serialization_format.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

database_fwd.hh

…

database.cc

system-keyspace: Rewrite force_blocking_flush

2020-11-19 18:39:05 +03:00

database.hh

Merge 'sstables: a bunch of refactors' from Kamil Braun

2020-11-24 09:23:57 +02:00

db_clock.hh

clocks: add printing functions

2020-01-30 11:10:08 +01:00

debug.hh

…

digest_algorithm.hh

digest: add null values to row digest

2020-09-10 13:16:44 +02:00

digester.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

dirty_memory_manager.hh

commitlog+region_group: timeout exceptions with names

2019-12-03 19:07:19 +01:00

distributed_loader.cc

sstable_directory: use a external load_semaphore

2020-10-08 11:57:06 +03:00

distributed_loader.hh

distributed_loader: remove declaration of inexistent do_populate_column_family()

2020-06-29 14:23:42 -03:00

Doxyfile

…

duration.cc

duration: adjust for C++20 char8_t type

2020-05-12 20:40:30 +02:00

duration.hh

…

encoding_stats.hh

…

enum_set.hh

lwt: ensure enum_set::of is constexpr.

2019-10-01 19:45:56 +02:00

fix_system_distributed_tables.py

tracing: add username to the session table

2020-10-01 04:46:40 +02:00

flat_mutation_reader.cc

flat_mutation_reader: de-virtualize buffer_size()

2020-10-06 08:22:56 +03:00

flat_mutation_reader.hh

Merge 'sstables: a bunch of refactors' from Kamil Braun

2020-11-24 09:23:57 +02:00

frozen_mutation.cc

frozen_mutation: introduce unfreeze_upgrading method

2020-09-15 05:26:44 +03:00

frozen_mutation.hh

Merge "lwt: store column_mapping's for each table schema version upon a DDL change" from Pavel Solodovnikov

2020-10-15 20:48:29 +02:00

frozen_schema.cc

frozen_schema: order idl implementations correctly

2020-10-03 19:56:28 +03:00

frozen_schema.hh

header: De-bloat schema.hh

2020-03-03 11:34:00 +01:00

gc_clock.hh

gc_clock, serialization: define new serialization for gc_clock::duration (aka TTLs)

2019-10-23 18:36:33 +03:00

gen_segmented_compress_params.py

…

HACKING.md

README: better explanation of dependencies and build

2020-06-16 13:26:04 +02:00

hashers.cc

hashers: convert illegal contraint to static_assert

2020-09-21 16:32:10 +03:00

hashers.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

hashing_partition_visitor.hh

…

hashing.hh

hashers: Mark hash updates noexcept

2020-09-07 23:17:41 +03:00

idl-compiler.py

idl-compiler: generate views after serializers

2020-10-03 19:56:25 +03:00

init.cc

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

init.hh

messaging_service: Move initialization to messaging/

2020-08-19 13:08:12 +03:00

install-dependencies.sh

tools: toolchain: update to Fedora 33 with clang 11

2020-10-28 20:21:44 +02:00

install.sh

install.sh: apply sysctl.d files on non-packaging installation

2020-11-26 09:52:14 +02:00

interval.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

intrusive_set_external_comparator.hh

…

keys.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

keys.hh

partition_key_view: add validate method

2020-05-12 12:07:00 +03:00

LICENSE.AGPL

…

lister.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

lister.hh

Update seastar submodule

2020-08-19 17:18:57 +03:00

log.hh

…

lua.cc

lua: expect overflow when selecting lua types

2020-10-11 15:38:07 +03:00

lua.hh

lua: Handle nil returns correctly

2020-01-29 14:05:01 -08:00

main.cc

Merge "Remove reference on database from global qctx" from Pavel E

2020-11-19 18:31:51 +02:00

map_difference.hh

…

marshal_exception.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable-sstable.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

memtable.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

memtable.hh

headers:: Remove flat_mutation_reader.hh from several other headers

2020-07-17 17:54:47 +03:00

multishard_mutation_query.cc

multishard_mutation_query: Propagate mutation_reader::forwarding flag

2020-11-02 15:24:36 +02:00

multishard_mutation_query.hh

storage_proxy: use read_command::max_result_size to pass max result size around

2020-07-28 18:00:29 +03:00

mutation_cleaner.hh

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

mutation_compactor.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_fragment.cc

range_tombstone: Remove unused trim-front arg from .apply()

2020-11-06 15:13:05 +03:00

mutation_fragment.hh

range_tombstone: Remove unused trim-front arg from .apply()

2020-11-06 15:13:05 +03:00

mutation_partition_serializer.cc

sstables: drop checks for correct counter order support

2020-09-14 12:05:11 +02:00

mutation_partition_serializer.hh

…

mutation_partition_view.cc

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

mutation_partition_view.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

mutation_partition_visitor.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

mutation_partition.cc

reader_concurrency_semaphore: name permits

2020-10-13 12:32:13 +03:00

mutation_partition.hh

Merge "Unfriend rows_entry, cache_tracker and mutation_partition" from Pavel Emelyanov

2020-09-22 21:18:14 +02:00

mutation_query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

mutation_query.hh

mutation_query: mutation_query_stage: add get_stats()

2020-11-17 15:13:21 +02:00

mutation_reader.cc

mutation_reader: introduce clustering_order_reader_merger

2020-11-30 11:55:44 +01:00

mutation_reader.hh

mutation_reader: introduce clustering_order_reader_merger

2020-11-30 11:55:44 +01:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

…

mutation.cc

mutation: Improve log print of mutations

2020-09-04 16:33:25 +02:00

mutation.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

noexcept_traits.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

NOTICE.txt

tests: port Cassandra CQL tests to cql repl

2020-03-26 15:19:38 +02:00

ORIGIN

…

partition_builder.hh

collection_mutation: generalize constructor of collection_mutation to abstract_type.

2019-10-25 10:42:58 +02:00

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

partition_slice_builder: add with_option()

2020-07-28 18:00:29 +03:00

partition_snapshot_reader.hh

range_tombstone: Remove unused trim-front arg from .apply()

2020-11-06 15:13:05 +03:00

partition_snapshot_row_cursor.hh

partition_snapshot_row_cursor: row(): return clustering_row instead of mutation_fragment

2020-09-28 10:53:56 +03:00

partition_version_list.hh

…

partition_version.cc

partition_version: Change range_tombstones() to return chunked_vector

2020-10-26 11:54:42 +02:00

partition_version.hh

partition_version: Change range_tombstones() to return chunked_vector

2020-10-26 11:54:42 +02:00

position_in_partition.hh

sstables: add may_have_partition_tombstones method

2020-11-23 23:30:19 +02:00

querier.cc

querier: move common stuff into querier_base

2020-06-03 18:45:33 +03:00

querier.hh

querier_cache: use the reader permit for memory accounting

2020-10-06 08:22:56 +03:00

query_class_config.hh

query: query_class_config: use max_result_size for the max_memory_for_unlimited_query field

2020-07-28 18:00:29 +03:00

query_result_merger.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-request.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query-result-reader.hh

query-result-reader: order idl implementations correctly

2020-10-03 19:56:29 +03:00

query-result-set.cc

query-result-set: don't linearize in result_set_builder::deserialize

2020-12-04 09:19:39 +01:00

query-result-set.hh

mutation_partition: Debloat header form others

2020-03-18 11:53:36 +02:00

query-result-writer.hh

query-result-writer: fix idl definition order related failures with clang

2020-10-11 17:57:12 +03:00

query-result.hh

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

query.cc

increase the maximum size of query results to 2^64

2020-08-03 17:32:49 +02:00

range_tombstone_list.cc

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

range_tombstone_list.hh

range_tombstone_list: Do not expose internal collection

2020-09-07 23:17:41 +03:00

range_tombstone.cc

…

range_tombstone.hh

range_tombstone: Remove unused schema arg from .set_start

2020-11-06 15:13:05 +03:00

range.hh

range: rename range template family to interval

2020-06-16 13:36:20 +03:00

read_context.hh

row_cache: pass a valid permit to underlying read

2020-05-28 11:34:35 +03:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: rate-limit diagnostics messages

2020-11-17 11:57:51 +02:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: add is_unlimited()

2020-11-17 15:13:21 +02:00

reader_permit.hh

reader_concurrency_semaphore: dump permit diagnostics on timeout or queue overflow

2020-10-13 12:32:14 +03:00

README.md

Improve build documentation

2020-09-07 10:51:31 +03:00

real_dirty_memory_accounter.hh

…

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

flat_mutation_reader: de-virtualize buffer_size()

2020-10-06 08:22:56 +03:00

row_cache.hh

Merge "Unfriend rows_entry, cache_tracker and mutation_partition" from Pavel Emelyanov

2020-09-22 21:18:14 +02:00

schema_builder.hh

schema: Pass an rvalue to set_compaction_strategy_options

2020-08-19 14:02:35 -07:00

schema_fwd.hh

…

schema_mutations.cc

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_mutations.hh

schema: include partitioner name in scylla tables mutation

2020-03-15 10:25:20 +01:00

schema_registry.cc

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_registry.hh

schema_registry: make grace period configurable

2020-09-15 17:53:27 +02:00

schema_upgrader.hh

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

schema.cc

column_computation: add token_column_computation

2020-11-04 12:02:42 +01:00

schema.hh

column_mapping_entry: extract == and != operators

2020-10-16 14:59:50 +02:00

scylla_post_install.sh

scylla_post_install.sh: generate memory.conf for CentOS7

2020-07-29 14:10:16 +03:00

scylla-gdb.py

database, streaming: remove remnants of memtable-base streaming

2020-11-16 14:32:19 +01:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN: change master version to 4.4.dev

2020-11-03 13:42:54 +02:00

seastarx.hh

Everywhere: Explicitly instantiate make_shared

2020-07-21 10:33:49 -07:00

serialization_visitors.hh

…

serializer_impl.hh

repair: Switch to btree_set for repair_hash.

2020-07-09 11:35:18 +03:00

serializer.hh

serializer: implement FragmentedView for buffer_view

2020-11-27 15:26:13 +01:00

service_permit.hh

Everywhere: Explicitly instantiate make_lw_shared

2020-07-21 10:33:49 -07:00

setup.py

…

supervisor.hh

supervisor: drop unused Upstart code, always use libsystemd

2020-06-10 08:17:35 +03:00

table_helper.cc

table_helper: Require local query processor in calls

2020-10-06 15:44:20 +03:00

table_helper.hh

table_helper: Require local query processor in calls

2020-10-06 15:44:20 +03:00

table.cc

sstables: pass ring_position to create_single_key_sstable_reader

2020-11-23 12:33:24 +01:00

test.py

test.py: enable back CQL based tests

2020-11-20 11:45:15 +02:00

timeout_config.cc

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timeout_config.hh

config: Place timeout_config() into own .cc file

2020-03-08 17:57:58 +02:00

timestamp.hh

add missing include to timestamp.hh

2020-02-05 19:42:18 +02:00

to_string.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

tombstone.hh

tombstone: use comparison operator instead of ad-hoc compare() function and with_relational_operators

2020-06-02 09:28:52 +03:00

tox.ini

…

types.cc

types: collection: add an optimization for single-fragment buffers in deserialize

2020-12-04 09:21:05 +01:00

types.hh

types: add an optimization for single-fragment buffers in deserialize

2020-12-04 09:19:39 +01:00

ubsan-suppressions.supp

suppress ubsan error in boost::deque::clear()

2020-11-09 11:25:19 +02:00

unimplemented.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

unimplemented.hh

…

user_types_metadata.hh

user_types_metadata: don't implement enable_lw_shared_from_this

2019-12-11 10:44:40 -08:00

validation.cc

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

validation.hh

validation: add is_cql_key_invalid()

2020-05-12 12:07:00 +03:00

version.hh

…

view_info.hh

db: view: Refactor view_info::initialize_base_dependent_fields()

2020-08-20 14:53:07 +02:00

vint-serialization.cc

…

vint-serialization.hh

…

xx_hasher.hh

Merge "Don't expose exact collection from range_tombstone_list" from Pavel E

2020-09-15 10:09:15 +02:00

zstd.cc

build: remove zstd submodule

2020-06-11 17:12:49 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found in ./docs and on the wiki. There is currently no clear definition of what goes where, so when looking for something be sure to check both. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%