scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 12:36:56 +00:00

Go to file

Avi Kivity f917f73616 Merge "Handling of schema changes" from Tomasz

"Our domain objects have schema version dependent format, for efficiency
reasons. The data structures which map between columns and values rely on
column ids, which are consecutive integers. For example, we store cells in a
vector where index into the vector is an implicit column id identifying table
column of the cell. When columns are added or removed the column ids may
shift. So, to access mutations or query results one needs to know the version
of the schema corresponding to it.

In case of query results, the schema version to which it conforms will always
be the version which was used to construct the query request. So there's no
change in the way query result consumers operate to handle schema changes. The
interfaces for querying needed to be extended to accept schema version and do
the conversions if necessary.

Shard-local interfaces work with a full definition of schema version,
represented by the schema type (usually passed as schema_ptr). Schema versions
are identified across shards and nodes with a UUID (table_schema_version
type). We maintain schema version registry (schema_registry) to avoid fetching
definitions we already know about. When we get a request using unknown schema,
we need to fetch the definition from the source, which must know it, to obtain
a shard-local schema_ptr for it.

Because mutation representation is schema version dependent, mutations of
different versions don't necessarily commute. When a column is dropped from
schema, the dropped column is no longer representable in the new schema. It is
generally fine to not hold data for dropped columns, the intent behind
dropping a column is to lose the data in that column. However, when merging an
incoming mutation with an existing mutation both of which have different
schema versions, we'd have to choose which schema should be considered
"latest" in order not to loose data. Schema changes can be made concurrently
in the cluster and initiated on different nodes so there is not always a
single notion of latest schema. However, schema changes are commutative and by
merging changes nodes eventually agree on the version.  For example adding
column A (version X) on one node and adding column B (version Y) on another
eventually results in a schema version with both A and B (version Z). We
cannot tell which version among X and Y is newer, but we can tell that version
Z is newer than both X and Y. So the solution to the problem of merging
conflicting mutations could be to ensure that such merge is performed using
the schema which is superior to schemas of both mutations.

The approach taken in the series for ensuring this is as follows. When a node
receives a mutation of an unknown schema version it first performs a schema
merge with the source of that mutation. Schema merge makes sure that current
node's version is superior to the schema of incoming mutation. Once the
version is synced with, it is remembered as such and won't be synced with on
later mutations. Because of this bookkeeping, schema versions must be
monotonic; we don't want table altering to result in any earlier version
because that would cause nodes to avoid syncing with them. The version is a
cryptographically-secure hash of schema mutations, which should fulfill this
purpose in practice.

TODO: It's possible that the node is already performing a sync triggered by
broadcasted schema mutations. To avoid triggering a second sync needlessly, the
schema merging should mark incoming versions as being synced with.

Each table shard keeps track of its current schema version, which is
considered to be superior to all versions which are going to be applied to it.
All data sources for given column family within a shard have the same notion
of current schema version. Individual entries in cache and memtables may be at
earlier versions but this is hidden behind the interface. The entries are
upgraded to current version lazily on access. Sstables are immutable, so they
don't need to track current version. Like any other data source, they can be
queried with any schema version.

Note, the series triggered a bug in demangler:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68700"

2016-01-11 17:59:14 +02:00

api

messaging_service: Rename shard_id to msg_addr

2016-01-07 10:36:35 +02:00

conf

init: bail out if running not on an XFS filesystem

2015-12-30 10:56:21 +02:00

cql3

query_processor: Invalidate statements synchronously

2016-01-11 10:34:55 +01:00

schema_tables: Wait for make_directory_for_column_family() to finish in merge_tables()

2016-01-11 10:34:55 +01:00

dht

streaming: Get rid of the _connecting_ parameter

2015-12-31 11:25:08 +01:00

dist

Update scylla-ami submodule

2016-01-11 17:58:47 +02:00

exceptions

cql3: Implement truncate_statement::execute()

2015-09-30 09:09:43 +02:00

gms

gms/gossiper: Fix compilation error

2016-01-07 16:42:55 +02:00

interface

…

Add license notices

2015-09-20 10:43:39 +03:00

licenses

scripts: Add git-archive-all script

2015-09-01 14:38:24 +03:00

locator

snitch: intentionally leak snitch singleton

2016-01-07 16:43:37 +02:00

message

service: Fetch and sync schema

2016-01-11 10:34:53 +01:00

repair

db: Make read interface schema version aware

2016-01-11 10:34:52 +01:00

scripts

Fixing missing items in move from scylla-ami.sh to scylla_install

2016-01-04 15:23:14 +02:00

seastar @ ad3577b190

Merge seastar upstream

2016-01-11 17:41:39 +02:00

service

Merge "Handling of schema changes" from Tomasz

2016-01-11 17:59:14 +02:00

sstables

db: Make read interface schema version aware

2016-01-11 10:34:52 +01:00

streaming

service: Fetch and sync schema

2016-01-11 10:34:53 +01:00

swagger-ui @ 1b212bbe71

Update swagger-ui for local fix (change URL to not to point to pet store)

2015-06-25 14:04:07 +03:00

tests

Merge "Handling of schema changes" from Tomasz

2016-01-11 17:59:14 +02:00

thrift

db: Make read interface schema version aware

2016-01-11 10:34:52 +01:00

transport

query_processor: Invalidate prepared statements when columns change

2016-01-11 10:34:55 +01:00

utils

Introduce hashing helpers

2016-01-08 21:10:25 +01:00

.gitattributes

Add .gitattributes file to classify C++ source

2015-10-05 08:51:51 +02:00

.gitignore

dist: make ubuntu package as 'debian non-native package'

2015-11-15 15:03:02 +09:00

.gitmodules

dist: move ComboAMI related code to scylla-ami

2015-09-22 00:17:42 +03:00

.gitorderfile

…

atomic_cell_hash.hh

mutation: Make hashable

2016-01-08 21:10:26 +01:00

atomic_cell_or_collection.hh

mutation: Make hashable

2016-01-08 21:10:26 +01:00

atomic_cell.hh

db: Move atomic_cell_or_collection to separate header

2016-01-08 21:10:25 +01:00

bytes_ostream.hh

Introduce hashing helpers

2016-01-08 21:10:25 +01:00

bytes.cc

Add license notices

2015-09-20 10:43:39 +03:00

bytes.hh

Introduce hashing helpers

2016-01-08 21:10:25 +01:00

caching_options.hh

schema: Add equality operators

2015-12-16 18:06:55 +01:00

canonical_mutation.cc

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

canonical_mutation.hh

Introduce canonical_mutation

2016-01-11 10:34:50 +01:00

cartesian_product.hh

Add license notices

2015-09-20 10:43:39 +03:00

combine.hh

Add license notices

2015-09-20 10:43:39 +03:00

compaction_strategy.hh

compaction_strategy should accept both class name and full class name

2015-11-11 15:31:39 +02:00

compound_compat.hh

Add license notices

2015-09-20 10:43:39 +03:00

compound.hh

Improve not implemented errors

2015-12-18 10:51:37 +01:00

compress.hh

compress: Add equality operators

2015-12-16 18:06:55 +01:00

configure.py

tests: Add schema_change_test

2016-01-11 10:34:53 +01:00

converting_mutation_partition_applier.hh

mutation_partition: drop cells from dropped_columns at upgrade

2016-01-11 10:34:53 +01:00

database_fwd.hh

schema: Introduce column_mapping

2016-01-08 21:10:26 +01:00

database.cc

query_processor: Invalidate prepared statements when columns change

2016-01-11 10:34:55 +01:00

database.hh

query_processor: Invalidate prepared statements when columns change

2016-01-11 10:34:55 +01:00

db_clock.hh

Add license notices

2015-09-20 10:43:39 +03:00

debug.hh

Add license notices

2015-09-20 10:43:39 +03:00

dns.cc

dns: Move gethostbyname to source file

2015-10-26 15:59:58 +02:00

dns.hh

dns: Move gethostbyname to source file

2015-10-26 15:59:58 +02:00

Doxyfile

docs: exclude dpdk

2015-06-24 13:09:51 +03:00

enum_set.hh

Add license notices

2015-09-20 10:43:39 +03:00

frozen_mutation.cc

frozen_mutation: Add schema_version field

2016-01-11 10:34:51 +01:00

frozen_mutation.hh

frozen_mutation: Add schema_version field

2016-01-11 10:34:51 +01:00

frozen_schema.cc

schema: Introduce frozen_schema

2016-01-11 10:34:51 +01:00

frozen_schema.hh

schema: Introduce frozen_schema

2016-01-11 10:34:51 +01:00

gc_clock.cc

Add license notices

2015-09-20 10:43:39 +03:00

gc_clock.hh

Add license notices

2015-09-20 10:43:39 +03:00

hashing_partition_visitor.hh

mutation: Make hashable

2016-01-08 21:10:26 +01:00

hashing.hh

Introduce hashing helpers

2016-01-08 21:10:25 +01:00

init.cc

main/init: Use server_encryption_options

2015-12-28 10:10:35 +00:00

init.hh

main/init: Use server_encryption_options

2015-12-28 10:10:35 +00:00

json.hh

Add license notices

2015-09-20 10:43:39 +03:00

key_reader.cc

key_reader: add key_from_mutation_reader

2015-10-20 20:27:47 +02:00

key_reader.hh

key_reader: add filtering key reader

2015-10-20 20:27:47 +02:00

keys.cc

db/serializer: Spread serializers to relax header dependencies

2016-01-08 21:10:26 +01:00

keys.hh

db/serializer: Spread serializers to relax header dependencies

2016-01-08 21:10:26 +01:00

LICENSE.AGPL

Add the AGPL license

2015-09-20 10:45:35 +03:00

log.cc

logger: be robust when exceptions are thrown while stringifying args

2015-12-21 19:58:08 +01:00

log.hh

log: Change default level from warn to info

2016-01-09 09:24:22 +02:00

main.cc

main: wait for API http server to start

2016-01-07 16:44:07 +02:00

map_difference.hh

map_difference: accept std::unordered_map

2016-01-05 09:49:04 +01:00

md5_hasher.hh

Introduce md5_hasher

2016-01-08 21:10:25 +01:00

memtable.cc

column_family: Add schema setters

2016-01-11 10:34:52 +01:00

memtable.hh

column_family: Add schema setters

2016-01-11 10:34:52 +01:00

mutation_partition_applier.hh

db: change collection_mutation::{one,view} not to use nested classes

2015-11-13 17:13:07 +02:00

mutation_partition_serializer.cc

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

mutation_partition_serializer.hh

Add license notices

2015-09-20 10:43:39 +03:00

mutation_partition_view.cc

mutation_partition_view: Make visitable also with column_mapping

2016-01-08 21:10:26 +01:00

mutation_partition_view.hh

mutation_partition_view: Make visitable also with column_mapping

2016-01-08 21:10:26 +01:00

mutation_partition_visitor.hh

db: change collection_mutation::{one,view} not to use nested classes

2015-11-13 17:13:07 +02:00

mutation_partition.cc

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

mutation_partition.hh

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

mutation_query.cc

db: Make read interface schema version aware

2016-01-11 10:34:52 +01:00

mutation_query.hh

db: Make read interface schema version aware

2016-01-11 10:34:52 +01:00

mutation_reader.cc

mutation_reader: move move_and_disengage to a separate header

2015-10-20 20:24:11 +02:00

mutation_reader.hh

db: Make read interface schema version aware

2016-01-11 10:34:52 +01:00

mutation.cc

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

mutation.hh

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

noexcept_traits.hh

Introduce noexcept_traits

2015-12-07 09:50:27 +01:00

NOTICE.txt

…

nway_merger.hh

Add license notices

2015-09-20 10:43:39 +03:00

ORIGIN

Update ORIGIN for gossip and storage_service

2015-12-01 19:45:04 +08:00

partition_builder.hh

db: change collection_mutation::{one,view} not to use nested classes

2015-11-13 17:13:07 +02:00

partition_slice_builder.cc

partition_slice_builder: Introduce reversed()

2015-10-22 10:32:08 +02:00

partition_slice_builder.hh

partition_slice_builder: Introduce reversed()

2015-10-22 10:32:08 +02:00

query_result_merger.hh

Add license notices

2015-09-20 10:43:39 +03:00

query-request.hh

db: Make read interface schema version aware

2016-01-11 10:34:52 +01:00

query-result-reader.hh

db: change collection_mutation::{one,view} not to use nested classes

2015-11-13 17:13:07 +02:00

query-result-set.cc

query::result_set: Add constructor from mutation

2016-01-08 21:10:26 +01:00

query-result-set.hh

query::result_set: Add constructor from mutation

2016-01-08 21:10:26 +01:00

query-result-writer.hh

db: change collection_mutation::{one,view} not to use nested classes

2015-11-13 17:13:07 +02:00

query-result.hh

query: Make query::result movable

2015-12-16 18:06:54 +01:00

query.cc

query: Add schema_version field to read_command

2016-01-11 10:34:51 +01:00

range.hh

range: Introduce equal()

2015-12-16 13:09:01 +01:00

README-DPDK.md

README: fix typos and paramter syntax

2015-06-28 10:24:48 +03:00

README.md

README: Add missing build dependencies

2015-12-31 13:34:48 +02:00

release.cc

release: copy version string into heap

2015-12-10 13:12:40 +02:00

release.hh

Add license notices

2015-09-20 10:43:39 +03:00

row_cache.cc

column_family: Add schema setters

2016-01-11 10:34:52 +01:00

row_cache.hh

column_family: Add schema setters

2016-01-11 10:34:52 +01:00

schema_builder.hh

schema_builder: add with_altered_column_type()

2016-01-11 10:34:54 +01:00

schema_mutations.cc

schema_tables: Calculate digest from mutations

2016-01-11 10:34:53 +01:00

schema_mutations.hh

Make schema_mutations serializable

2016-01-11 10:34:50 +01:00

schema_registry.cc

schema_registry: Track synced state of schema

2016-01-11 10:34:52 +01:00

schema_registry.hh

schema_registry: Track synced state of schema

2016-01-11 10:34:52 +01:00

schema.cc

schema: Introduce equal_columns()

2016-01-11 10:34:55 +01:00

schema.hh

schema: Introduce equal_columns()

2016-01-11 10:34:55 +01:00

scylla-gdb.py

schema: Enable shared_from_this()

2016-01-11 10:34:51 +01:00

SCYLLA-VERSION-GEN

version: mark master branch as development version

2015-10-08 15:31:50 +03:00

serialization_format.hh

Add license notices

2015-09-20 10:43:39 +03:00

sstable_mutation_readers.hh

Add license notices

2015-09-20 10:43:39 +03:00

test.py

tests: Add schema_change_test

2016-01-11 10:34:53 +01:00

timestamp.hh

Abstract timestamp creation behind new_timestamp()

2015-12-15 15:16:04 +02:00

to_string.hh

to_string: Support std::set and std::unordered_set for to_string

2015-11-16 13:11:43 +02:00

tombstone.hh

mutation: Make hashable

2016-01-08 21:10:26 +01:00

types.cc

types: implement collection compatibility checks

2016-01-04 11:02:21 +01:00

types.hh

types: Introduce is_atomic()

2016-01-08 21:10:26 +01:00

unimplemented.cc

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

unimplemented.hh

Make mutation interfaces support multiple versions

2016-01-11 10:34:51 +01:00

validation.cc

Add license notices

2015-09-20 10:43:39 +03:00

validation.hh

Add license notices

2015-09-20 10:43:39 +03:00

version.hh

Add license notices

2015-09-20 10:43:39 +03:00

README.md

#Scylla

##Building Scylla

In addition to required packages by Seastar, the following packages are required by Scylla.

Submodules

Scylla uses submodules, so make sure you pull the submodules first by doing:

git submodule init
git submodule update --recursive

Building and Running Scylla on Fedora

Installing required packages:

sudo yum install yaml-cpp-devel lz4-devel zlib-devel snappy-devel jsoncpp-devel thrift-devel antlr3-tool antlr3-C++-devel libasan libubsan gcc-c++ gnutls-devel ninja-build ragel libaio-devel cryptopp-devel xfsprogs-devel

Build Scylla

./configure.py --mode=release --with=scylla --disable-xen
ninja-build build/release/scylla -j2 # you can use more cpus if you have tons of RAM

Run Scylla

./build/release/scylla

run Scylla with one CPU and ./tmp as data directory

./build/release/scylla --datadir tmp --commitlog-directory tmp --smp 1

For more run options:

./build/release/scylla --help

Building Fedora RPM

As a pre-requisite, you need to install Mock on your machine:

# Install mock:
sudo yum install mock

# Add user to the "mock" group:
usermod -a -G mock $USER && newgrp mock

Then, to build an RPM, run:

./dist/redhat/build_rpm.sh

The built RPM is stored in /var/lib/mock/<configuration>/result directory. For example, on Fedora 21 mock reports the following:

INFO: Done(scylla-server-0.00-1.fc21.src.rpm) Config(default) 20 minutes 7 seconds
INFO: Results and/or logs in: /var/lib/mock/fedora-21-x86_64/result

Building Fedora-based Docker image

Build a Docker image with:

cd dist/docker
docker build -t <image-name> .

Run the image with:

docker run -p $(hostname -i):9042:9042 -i -t <image name>

Contributing to Scylla

Do not send pull requests.

Send patches to the mailing list address scylladb-dev@googlegroups.com. Be sure to subscribe.

In order for your patches to be merged, you must sign the Contributor's License Agreement, protecting your rights and ours. See http://www.scylladb.com/opensource/cla/.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%