mirror of https://github.com/scylladb/scylladb.git synced 2026-06-08 16:03:20 +00:00

Go to file

Avi Kivity c290976185 Merge 'cql3: Remove some restrictions classes' from Jan Ciołek

This PR removes some restrictions classes and replaces them with expression.

* `single_column_restriction` has been removed altogether.
* `partition_key_restrictions` field inside `statement_restrictions` has been replaced with `expression`

`clustering_key_restrictions` are not replaced yet, but this PR already has 30 commits so it's probably better to merge this before adding any more changes.
Luckily most of these commits are implementations of small helper functions.

`single_column_restriction` was pretty easy to remove. This class holds the `expression` that describes the restriction and `column_definition` of the restricted column.
It inherits from `restriction` - the base class of all restrictions.

I wasn't able to replace it with plain `expression` just yet, because a lot of times a `shared_ptr<single_column_restriction>` is being cast to `shared_ptr<restriction>`.
Instead I replaced all instances of `single_column_restriction` with `restriction`.
To decide if a `restriction` is a `single_column_restriction` we can use a helper method that works on expressions.
Same with acquiring the restricted `column_definition`.

This change has two advantages:
* One less restriction class -> moving towards 0
* Preparing towards one generic `restriction/expression` type and using functions to distinguish the type of expression that we're dealing with.

`partition_key_restrictions` is a class used to keep restrictions on the partition key inside `statement_restrictions`.
Removing it required two major steps.

First I had to implement taking all the binary operators and making sure that they are valid together.
Before the change this was the `merge_to` method. It ensures that for example there are no token and regular restrictions occurring at the same time.
This has been implemented as `statement_restrictions::add_restriction`.
It detects which case it's dealing with and mimics `merge_to` from the right restrictions class.

Then I implemented all methods of `partition_key_restrictions` but operating on plain `expressions`.
While doing that I was able to gradually shift the responsibility to the brand new functions.

Finally `partition_key_restrictions` wasn't used anywhere at all and I was able to remove it.

Here's the inheritance tree of all restriction classes for context:
![image](https://user-images.githubusercontent.com/36861778/176141470-f96f6189-e650-44c2-9648-2a840b4c89c0.png)

For now this is marked as a draft.
I just put all this together in a readable way and wanted to put it out for you to see.
I will have another look at the code and maybe do some improvements.

Closes #10910

* github.com:scylladb/scylla:
  cql3: Remove _new from  _new_partition_key_restrictions
  cql3: Remove _partition_key_restrictions from statement_restrictions
  cql3: Use expression for index restrictions
  cql3: expr: Add contains_multi_column_restriction
  cql3: Add expr::value_for
  cql3: Use the new restrictions map in another place
  cql3: use the new map in get_single_column_partition_key_restrictions
  cql3: Keep single column restrictions map inside statement restrictions
  cql3: Use expression instead of _partition_key_restrictions in the remaining code
  cql3: Replace partition_key_restrictions->has_supporting_index()
  cql3: Replace statement_restrictions->get_column_defs()
  cql3: Replace partition_key_restrictions->needs_filtering()
  cql3: Replace partition_key_restrictions->size()
  cql3: Replace partition_key_restrictions->is_all_eq()
  cql3: Replace parition_key_restriction->has_unrestricted_components()
  cql3: Replace parition_key_restrictions->empty()
  cql3: Keep restrictions as expressions inside statement_restrictions
  cql3: Handle single value INs inside prepare_binary_operator
  cql3: Add get_columns_in_commons
  cql3: expr: Add is_empty_restriction
  cql3: Replicate column sorting functionality using expressions
  cql3: Remove single_column_restriction class
  cql3: Replace uses of single_column_restriction with restriction
  cql3: expr: Add get_the_only_column
  cql3: expr: Add is_single_column_restriction
  cql3: expr: Add for_each_expression
  cql3: Remove some unsued methods

2022-07-03 16:11:25 +03:00

.github

docs: disable link checker

2022-05-09 12:45:28 +02:00

abseil @ 9e408e050f

Update abseil submodule

2022-05-22 23:46:33 +03:00

alternator

alternator: use position-in-partition in paging cookie only when reading CQL tables

2022-06-30 15:10:30 +03:00

api

Merge "Make permissions cache live updateable and add an API for resetting authorization cache" from Igor Ribeiro Barbosa Duarte

2022-06-29 11:14:13 +03:00

auth

api: Add API for resetting authorization cache

2022-06-28 19:58:06 -03:00

cdc

cdc/log.hh: expose is_log_name()

2022-06-10 10:57:12 +03:00

compaction

compaction_manager: task: acquire_semaphore: handle abort_requested_exception

2022-06-27 09:47:48 +03:00

conf

conf: update the description of the seeds parameter in scylla.yaml

2022-06-02 18:45:11 +03:00

cql3

Merge 'cql3: Remove some restrictions classes' from Jan Ciołek

2022-07-03 16:11:25 +03:00

data_dictionary

data_dictionary: Introduce user types storage

2022-05-05 09:44:26 +03:00

Merge 'cql3: Remove some restrictions classes' from Jan Ciołek

2022-07-03 16:11:25 +03:00

debug

…

dht

dht: boot_strapper: check if keyspace still exists in bootstrap

2022-06-27 19:13:46 +02:00

direct_failure_detector

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

dist

scylla_cpuset_setup: stop deleting perftune.yaml and skip update cpuset.conf when same parameter specified

2022-06-23 10:28:36 +03:00

docs

doc, alternator: split "experimental" features from "unimplemented" ones

2022-06-28 08:08:50 +03:00

exceptions

exceptions: Define operator<< for exception_code

2022-06-27 14:49:58 +03:00

gms

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

idl

Merge 'A bunch of refactors related to Raft group 0' from Kamil Braun

2022-06-29 16:51:54 +03:00

index

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

interface

…

lang

wasm: fix freeing in wasm UDFs using WASI

2022-07-01 07:57:45 +02:00

libdeflate @ e7e54eab42

…

licenses

…

locator

topology: Add get_rack/_datacenter methods

2022-06-22 11:47:26 +03:00

message

Merge 'A bunch of refactors related to Raft group 0' from Kamil Braun

2022-06-29 16:51:54 +03:00

mutation_writer

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

raft

raft: server: if add_entry with wait_type::applied successfully returns, ensure state_machine::apply is called for this entry

2022-05-27 12:06:18 +02:00

readers

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

redis

query-request: add allow_limit flag

2022-06-22 20:16:49 +02:00

reloc

…

repair

repair: Allow abort repair jobs in early stage

2022-06-27 16:39:36 +03:00

replica

dirty_memory_manager: move db ctor out-of-line

2022-06-30 17:26:18 +03:00

rust

tests: add rust example

2022-05-11 16:49:31 +02:00

scripts

configure.py: speed up and simplify compdb generation

2022-06-15 16:40:52 +03:00

seastar @ 9c016aeebf

Update seastar submodule

2022-06-27 23:11:56 +03:00

service

Merge 'A bunch of refactors related to Raft group 0' from Kamil Braun

2022-06-29 16:51:54 +03:00

sstables

Merge 'sstables: generation_type tidy-up' from Michael Livshin

2022-06-28 08:50:12 +03:00

streaming

streaming: Enable auto off strategy compaction trigger for all rbno ops

2022-06-09 17:10:14 +03:00

swagger-ui @ 12f1da1082

…

test

wasm: fix freeing in wasm UDFs using WASI

2022-07-01 07:57:45 +02:00

thrift

query-request: add allow_limit flag

2022-06-22 20:16:49 +02:00

tools

install-dependencies.sh: uprgade node_exporter to 1.3.1

2022-06-23 11:47:13 +03:00

tracing

trace-state: Remove unused fields

2022-06-17 15:02:51 +03:00

transport

exceptions: Define operator<< for exception_code

2022-06-27 14:49:58 +03:00

types

fix "ninja dev-headers"

2022-05-31 23:42:34 +03:00

unified

…

utils

Merge "Make permissions cache live updateable and add an API for resetting authorization cache" from Igor Ribeiro Barbosa Duarte

2022-06-29 11:14:13 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

.gitignore: ignore mypy_cache, the python lint cache

2022-04-19 16:48:47 +03:00

.gitmodules

…

.gitorderfile

…

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

atomic_cell_hash.hh

…

atomic_cell_or_collection.hh

…

atomic_cell.cc

atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal

2022-03-07 11:05:30 +02:00

atomic_cell.hh

…

backlog_controller.hh

backlog_controller: Generalize scheduling groups

2022-06-16 17:40:19 +03:00

bytes_ostream.hh

…

bytes.cc

…

bytes.hh

…

cache_flat_mutation_reader.hh

Reads from cache lack preemption check when scanning over range tombstones

2022-06-28 06:58:48 +03:00

cache_temperature.hh

…

caching_options.cc

…

caching_options.hh

…

canonical_mutation.cc

…

canonical_mutation.hh

…

cartesian_product.hh

…

cell_locking.hh

…

checked-file-impl.hh

treewide: use system-#include (angle brackets) for seastar

2022-04-26 14:46:42 +03:00

client_data.cc

…

client_data.hh

…

clocks-impl.cc

…

clocks-impl.hh

…

clustering_bounds_comparator.hh

…

clustering_interval_set.hh

…

clustering_key_filter.hh

…

clustering_ranges_walker.hh

…

CMakeLists.txt

db: add rate_limiter

2022-06-22 20:16:48 +02:00

collection_mutation.cc

…

collection_mutation.hh

…

column_computation.hh

…

combine.hh

…

compatible_ring_position.hh

…

compound_compat.hh

compound_compat.hh: add missing methods of iterator

2022-03-08 15:37:03 +02:00

compound.hh

…

compress.cc

…

compress.hh

…

concrete_types.hh

…

configure.py

Merge "Make permissions cache live updateable and add an API for resetting authorization cache" from Igor Ribeiro Barbosa Duarte

2022-06-29 11:14:13 +03:00

CONTRIBUTING.md

docs/contribute/CONTRIBUTING.md: add reference to review checklist:

2022-06-16 10:29:26 +03:00

converting_mutation_partition_applier.cc

…

converting_mutation_partition_applier.hh

…

counters.cc

…

counters.hh

…

cql_serialization_format.hh

…

db_clock.hh

…

debug.hh

…

default.nix

…

digest_algorithm.hh

…

digester.hh

…

dirty_memory_manager.hh

dirty_memory_manager: move db ctor out-of-line

2022-06-30 17:26:18 +03:00

Doxyfile

…

duration.cc

…

duration.hh

…

encoding_stats.hh

…

enum_set.hh

…

fix_system_distributed_tables.py

…

frozen_mutation.cc

frozen_mutation: add unfreeze_gently

2022-05-05 13:32:25 +03:00

frozen_mutation.hh

messaging: forward-declare types in messaging_service.hh

2022-06-09 15:52:12 +03:00

frozen_schema.cc

…

frozen_schema.hh

…

full_position.hh

introduce full_position

2022-06-23 13:36:24 +03:00

gc_clock.hh

…

gdbinit

docs: debugging.md: add a sample gdbinit file

2022-05-11 10:23:08 +03:00

gen_segmented_compress_params.py

…

generic_server.cc

…

generic_server.hh

generic_server.hh: add missing include

2022-04-04 17:31:55 +03:00

HACKING.md

docs: update theme 1.2.1

2022-04-03 13:45:07 +03:00

hashers.cc

…

hashers.hh

…

hashing_partition_visitor.hh

…

hashing.hh

…

idl-compiler.py

message: change parameter order in send_message_oneway_timeout

2022-06-23 16:14:41 +02:00

inet_address_vectors.hh

…

init.cc

…

init.hh

…

install-dependencies.sh

install-dependencies.sh: uprgade node_exporter to 1.3.1

2022-06-23 11:47:13 +03:00

install.sh

install.sh: install files with correct permission in strict umask setting

2022-06-20 17:52:03 +03:00

interval.hh

…

intrusive_set_external_comparator.hh

…

keys.cc

replica, partition_snapshot_reader, keys: replace boost::any with std::any

2022-04-28 07:18:53 +03:00

keys.hh

…

LICENSE.AGPL

…

log.hh

…

main.cc

Merge 'A bunch of refactors related to Raft group 0' from Kamil Braun

2022-06-29 16:51:54 +03:00

map_difference.hh

…

marshal_exception.hh

…

multishard_mutation_query.cc

query: have replica provide the last position

2022-06-23 13:36:24 +03:00

multishard_mutation_query.hh

…

mutation_cleaner.hh

db: mutation_cleaner: Enqueue new snapshots at the back

2022-06-28 18:29:29 +03:00

mutation_compactor.hh

mutation_compactor: add current_full_position() convenience accessor

2022-06-23 13:36:24 +03:00

mutation_consumer_concepts.hh

introduce the MutationConsumer concept

2022-02-28 17:11:54 +02:00

mutation_fragment_fwd.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: validate range tombstone changes

2022-03-29 13:19:05 +03:00

mutation_fragment_v2.hh

mutation_fragment_v2: range_tombstone_change: add minimal_memory_usage()

2022-04-28 14:11:51 +03:00

mutation_fragment.cc

position_in_partition: add to_string(partition_region) and parse_partition_region()

2022-06-23 11:19:55 +03:00

mutation_fragment.hh

mutation_fragment.hh: move operator<<(partition_region) to position_in_partition.hh

2022-06-23 11:19:55 +03:00

mutation_partition_serializer.cc

…

mutation_partition_serializer.hh

…

mutation_partition_view.cc

ser: use vector_deserializer by default for all idl vectors

2022-05-18 19:24:18 +03:00

mutation_partition_view.hh

mutation_partition_view: add accept_gently methods

2022-05-05 13:32:25 +03:00

mutation_partition_visitor.hh

…

mutation_partition.cc

query: have replica provide the last position

2022-06-23 13:36:24 +03:00

mutation_partition.hh

mutation_fragment: pass the applied row by reference in clustering_row::apply()

2022-06-20 15:22:17 +02:00

mutation_query.cc

…

mutation_query.hh

query: coroutinize to_data_query_result

2022-05-05 13:32:25 +03:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

…

mutation.cc

test: mutation: Compare against compacted mutations

2022-06-15 11:30:01 +02:00

mutation.hh

test: mutation: Compare against compacted mutations

2022-06-15 11:30:01 +02:00

noexcept_traits.hh

…

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

…

partition_snapshot_reader.hh

memtable: Fix missing range tombstones during reads under ceratin rare conditions

2022-06-29 19:02:23 +03:00

partition_snapshot_row_cursor.hh

partition_snapshot_row_cursor: construct the clustering_row directly in row()

2022-06-20 15:45:19 +02:00

partition_version_list.hh

…

partition_version.cc

mvcc: Introduce apply_resume to hold state for partition version merging

2022-06-15 11:30:01 +02:00

partition_version.hh

mvcc: Introduce apply_resume to hold state for partition version merging

2022-06-15 11:30:01 +02:00

position_in_partition.hh

position_in_partition: add to_string(partition_region) and parse_partition_region()

2022-06-23 11:19:55 +03:00

protocol_server.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

querier.cc

querier: use full_position instead of ad-hoc struct

2022-06-23 13:36:24 +03:00

querier.hh

querier: use full_position instead of ad-hoc struct

2022-06-23 13:36:24 +03:00

query_class_config.hh

…

query_ranges_to_vnodes.cc

…

query_ranges_to_vnodes.hh

…

query_result_merger.hh

…

query-request.hh

query-request: add allow_limit flag

2022-06-22 20:16:49 +02:00

query-result-reader.hh

query: have replica provide the last position

2022-06-23 13:36:24 +03:00

query-result-set.cc

…

query-result-set.hh

…

query-result-writer.hh

query: have replica provide the last position

2022-06-23 13:36:24 +03:00

query-result.hh

…

query.cc

query: have replica provide the last position

2022-06-23 13:36:24 +03:00

range_tombstone_assembler.hh

…

range_tombstone_change_generator.hh

range_tombstone_change_generator: flush(): add end_of_range

2022-04-21 14:37:10 +03:00

range_tombstone_list.cc

range_tombstone_list: insert_from: correct rev.update range_tombstone in not overlapping case

2022-04-04 22:26:29 +02:00

range_tombstone_list.hh

…

range_tombstone_splitter.hh

…

range_tombstone.cc

…

range_tombstone.hh

…

range.hh

…

read_context.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

reader_concurrency_semaphore.cc

treewide: fix compilation issues with fmtlib 8.1.0+

2022-03-16 12:31:50 +03:00

reader_concurrency_semaphore.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

reader_permit.hh

evicatble_reader: avoid preemption pitfall around waiting for readmission

2022-03-15 14:37:22 +02:00

README.md

…

real_dirty_memory_accounter.hh

memtable: move to replica module and namespace

2022-02-23 09:05:16 +02:00

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

memtable: Add counters for tombstone compaction

2022-06-15 11:30:25 +02:00

row_cache.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

schema_builder.hh

schema: add per_partition_rate_limit schema extension

2022-06-22 20:16:48 +02:00

schema_fwd.hh

…

schema_mutations.cc

…

schema_mutations.hh

…

schema_registry.cc

…

schema_registry.hh

…

schema_upgrader.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

schema.cc

schema: add per_partition_rate_limit schema extension

2022-06-22 20:16:48 +02:00

schema.hh

schema: add per_partition_rate_limit schema extension

2022-06-22 20:16:48 +02:00

scylla_post_install.sh

…

scylla-gdb.py

gdb: Make robust in case there is no global storage_proxy or database instance

2022-06-30 08:41:57 +03:00

SCYLLA-VERSION-GEN

…

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

serializer_impl: add vector_deserializer

2022-05-18 19:10:13 +03:00

serializer.cc

…

serializer.hh

code: Convert is_integral assertions to concepts

2022-02-24 19:44:29 +03:00

service_permit.hh

…

setup.py

…

shell.nix

…

sstables_loader.cc

sstable_set: Fix partitioned_sstable_set constructor

2022-06-21 11:58:13 +03:00

sstables_loader.hh

…

supervisor.hh

…

table_helper.cc

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

table_helper.hh

…

test.py

test.py: extend xml output with logs

2022-06-28 18:22:01 +03:00

timeout_config.cc

…

timeout_config.hh

…

timestamp.hh

…

to_string.hh

…

tombstone_gc_extension.hh

…

tombstone_gc_options.cc

…

tombstone_gc_options.hh

…

tombstone_gc.cc

gms: feature_service: remove variable/helper function duplication

2022-05-04 18:59:56 +03:00

tombstone_gc.hh

Merge "tools: cut schema loader free of replica::database" from Botond

2022-03-27 17:01:05 +03:00

tombstone.hh

…

tox.ini

…

types.cc

types: time_point_to_string: use numeric formatting rather than chrono-format specifiers

2022-06-27 08:28:56 +03:00

types.hh

Merge 'cql: Add proper validation for null and unset inside collections send as bound values' from Jan Ciołek

2022-05-19 11:25:24 +03:00

ubsan-suppressions.supp

…

unimplemented.cc

…

unimplemented.hh

…

validation.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

validation.hh

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

version.hh

…

view_info.hh

…

vint-serialization.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

vint-serialization.hh

…

xx_hasher.hh

…

zstd.cc

…

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%