mirror of https://github.com/scylladb/scylladb.git synced 2026-05-12 19:02:12 +00:00

Asias He 5fe2ce3bbe gossiper: Always use the new generation number

A user reported that after a node restart, the restarted node is
marked as DOWN by other nodes in the cluster while the node is up
and running normally.

Consider the following:

- n1, n2, n3 in the cluster
- n3 shuts itself down
- n3 sends the shutdown verb to n1 and n2
- n1 and n2 set n3 to SHUTDOWN status and force the heartbeat version to
  INT_MAX
- n3 restarts
- n3 sends gossip shadow rounds to n1 and n2, in
  storage_service::prepare_to_join,
- n3 receives response from n1, in gossiper::handle_ack_msg, since
  _enabled = false and _in_shadow_round == false, n3 will apply the
  application state in fiber 1, fiber 1 finishes faster than fiber 2,
  it sets _in_shadow_round = false
- n3 receives response from n2, in gossiper::handle_ack_msg, since
  _enabled = false and _in_shadow_round == false, n3 will apply the
  application state in fiber 2, fiber 2 yields
- n3 finishes the shadow round and continues
- n3 resets gossip endpoint_state_map with
  gossiper.reset_endpoint_state_map()
- n3 resumes fiber 2 and applies the application state about n3 into
  endpoint_state_map; at this point endpoint_state_map contains
  information, received from n2, that includes n3 itself.
- n3 calls gossiper.start_gossiping(generation_number, app_states, ...)
  with new generation number generated correctly in
  storage_service::prepare_to_join, but in
  maybe_initialize_local_state(generation_nbr), it will not set new
  generation and heartbeat if the endpoint_state_map contains itself
- n3 continues with the old generation and heartbeat learned in fiber 2
- n3 continues the gossip loop, in gossiper::run, where
  hbs.update_heart_beat() sets the heartbeat version to a number
  counting up from 0.
- n1 and n2 will not get updates from n3 because they use the same
  generation number but n1 and n2 have a larger heartbeat version
- n1 and n2 will mark n3 as down even though n3 is alive.

To fix, always use the new generation number.

Fixes: #5800
Backports: 3.0 3.1 3.2
(cherry picked from commit 62774ff882)
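
The scenario above can be sketched in a few lines of C++. This is an illustrative model only, not Scylla's code: the heart_beat struct, endpoint_map, is_newer, the two init_local_state_* helpers and the generation values 100 and 200 are all hypothetical names and numbers chosen for the example. It assumes peers accept an update only when the generation is higher, or the generation is equal and the heartbeat version is higher, which is the behaviour the commit message describes.

```cpp
#include <cassert>
#include <climits>
#include <cstdint>
#include <map>
#include <string>

// Hypothetical, simplified stand-in for a gossip heartbeat state.
struct heart_beat {
    int32_t generation;
    int32_t version;
};

using endpoint_map = std::map<std::string, heart_beat>;

// Assumed acceptance rule: a higher generation always wins; within the
// same generation the heartbeat version must be strictly higher.
bool is_newer(const heart_beat& local, const heart_beat& remote) {
    if (remote.generation != local.generation) {
        return remote.generation > local.generation;
    }
    return remote.version > local.version;
}

// Model of the behaviour before the fix: an existing entry for the
// local node is kept, so the new generation is silently dropped.
void init_local_state_old(endpoint_map& m, const std::string& self, int32_t new_gen) {
    m.emplace(self, heart_beat{new_gen, 0});  // no-op if self is already present
}

// Model of the fix: the new generation always overwrites the entry.
void init_local_state_fixed(endpoint_map& m, const std::string& self, int32_t new_gen) {
    m[self] = heart_beat{new_gen, 0};
}

// Returns true if n1 would accept n3's first heartbeat after the restart.
bool peer_accepts_after_restart(bool fixed) {
    const int32_t old_gen = 100;  // hypothetical generation before restart
    const int32_t new_gen = 200;  // hypothetical generation after restart

    // n1's view of n3 after the shutdown verb: version forced to INT_MAX.
    const heart_beat n1_view_of_n3{old_gen, INT_MAX};

    // n3's own map, polluted by the late shadow-round reply from n2.
    endpoint_map n3_map{{"n3", heart_beat{old_gen, INT_MAX}}};

    if (fixed) {
        init_local_state_fixed(n3_map, "n3", new_gen);
    } else {
        init_local_state_old(n3_map, "n3", new_gen);
    }
    assert(n3_map.count("n3") == 1);

    // The gossip loop bumps the heartbeat from a counter that restarted
    // with the process, so the version is small again.
    n3_map["n3"].version = 1;

    return is_newer(n1_view_of_n3, n3_map.at("n3"));
}
```

Under these assumptions, peer_accepts_after_restart(false) returns false (same generation, version 1 is not above INT_MAX, so n1 ignores n3), while peer_accepts_after_restart(true) returns true because the higher generation wins regardless of the heartbeat version.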

2020-03-27 12:49:20 +01:00

.github

github: remove github pull request template (#4833)

2019-08-14 09:28:39 +03:00

alternator

alternator: pass tracing state explicitly instead of relying on it been in the client_state

2020-02-13 13:45:56 +02:00

alternator-test

merge: Handle multiple regular base columns in view pk

2020-01-14 10:01:00 +02:00

api

misc_services: Introduce load_meter

2020-01-13 13:53:08 +03:00

auth

service: Add a lock around migration_notifier::_listeners

2020-02-16 20:13:42 +02:00

cdc

cdc: set TTLs on CDC log cells

2020-02-26 18:12:55 +02:00

conf

Merge "Add experimental_features option" from Dejan

2019-12-11 14:23:08 +02:00

cql3

cql: fix qualifying indexed columns for filtering

2020-03-22 09:00:51 +01:00

data

data: make cell::make_collection(): more consistent and safer

2020-01-16 12:05:50 +02:00

commitlog: use commitlog IO scheduling class for segment zeroing

2020-02-26 12:51:10 +02:00

debug

…

dht

Revert "storage_service: remove storage_service::_is_bootstrap_mode."

2019-10-23 19:20:36 +08:00

dist

dist/redhat: scylla.spec.mustache: set _no_recompute_build_ids

2020-03-09 15:21:50 +02:00

docs

building-packages doc: Update no specific el7 on path

2020-01-16 12:49:08 +02:00

exceptions

cql3: functions: detect and handle int overflow in sum

2020-01-08 09:48:33 +02:00

gms

gossiper: Always use the new generation number

2020-03-27 12:49:20 +01:00

idl

repair: Do not return working_row_buf_nr in get combined row hash verb

2019-12-21 20:13:15 +02:00

imr

imr: move documentation to docs/

2019-11-28 16:47:52 +02:00

index

secondary_index_manager: add the index_name_from_table_name function

2020-01-15 15:06:00 +02:00

interface

…

libdeflate @ e7e54eab42

…

licenses

…

locator

locator: correctly select endpoints if RF=0

2020-03-12 12:09:46 +02:00

message

lwt: drop invoke_on in paxos_state prepare and accept

2020-01-13 10:26:02 +02:00

mutation_writer

tests: generalize timestamp_based_spliiting_writer and bucket_writer to UDTs.

2019-10-25 12:04:44 +02:00

python3

reloc/python3: add install.sh on python relocatable package

2019-09-03 20:06:30 +09:00

redis

Redis: Combine all the source files redis/commands/* into redis/commands.{hh,cc}

2019-12-08 13:54:33 +02:00

redis-test

Merge "Redis: fix the options related to Redis API, fix the DEL and GET command"

2019-12-05 11:58:34 +02:00

reloc

reloc: Turn the default flags into common flags

2020-01-03 15:48:20 +02:00

repair

repair: Avoid duplicated partition_end write

2020-01-06 14:06:02 +02:00

scripts

scripts: Add cpp-name-format: pretty printer

2020-01-01 12:08:12 +02:00

seastar @ a0bdc6cd85

Update seastar submodule

2020-03-12 19:41:50 +02:00

service

storage_service: drain_on_shutdown: unregister storage_proxy subscribers from local_storage_service

2020-02-25 16:39:49 +02:00

sstables

sstables: Move STCS implementation to source file

2020-01-08 09:55:35 +02:00

streaming

streaming: Fix map access in stream_manager::get_progress

2020-01-06 10:31:15 +02:00

swagger-ui @ 12f1da1082

…

test

cql: fix qualifying indexed columns for filtering

2020-03-22 09:00:51 +01:00

thrift

lwt: Process lwt request on a owning shard

2020-01-13 10:26:02 +02:00

tools

tools: toolchain: dbuild: relax process limit in container

2020-01-28 18:14:01 +02:00

tracing

tracing: split adding prepared query parameters from stopping of a trace

2019-12-05 17:00:47 +02:00

transport

service: Add a lock around migration_notifier::_listeners

2020-02-16 20:13:42 +02:00

types

user defined types: fix support for case-sensitive type names

2020-01-03 15:48:20 +02:00

utils

logalloc: increase capacity of _regions vector outside reclaim lock

2020-03-12 11:25:20 +02:00

xxHash @ 744892b802

…

zstd @ ff304e9e65

…

.dockerignore

…

.gitattributes

…

.gitignore

test.py: add CQL .reject files to gitignore

2020-01-15 11:41:19 +03:00

.gitmodules

Point seastar submodule at scylla-seastar.git branch-3.3

2020-02-16 15:51:46 +02:00

.gitorderfile

…

atomic_cell_hash.hh

collection_mutation: easier (de)serialization of collection_mutation(s).

2019-10-25 10:42:58 +02:00

atomic_cell_or_collection.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

atomic_cell.cc

atomic_cell: consistently use comma as separator in pretty-printers

2020-01-16 17:26:33 +01:00

atomic_cell.hh

atomic_cell: add type-aware pretty printing

2019-12-30 18:27:04 +02:00

backlog_controller.hh

…

build_id.cc

build-id: Handle the binary having multiple PT_NOTE headers

2020-01-03 15:48:20 +02:00

build_id.hh

Print build-id on startup

2019-12-19 15:43:04 +02:00

bytes_ostream.hh

bytes_ostream: make it a FragmentRange

2019-12-02 10:10:31 +02:00

bytes.cc

…

bytes.hh

…

cache_flat_mutation_reader.hh

mvcc: Introduce partition_snapshot::touch()

2019-10-03 22:03:28 +02:00

cache_temperature.hh

…

caching_options.hh

…

canonical_mutation.cc

canonical_mutation: add pretty printing

2020-01-07 12:06:31 +02:00

canonical_mutation.hh

canonical_mutation: add pretty printing

2020-01-07 12:06:31 +02:00

cartesian_product.hh

…

cell_locking.hh

mutation_partition: make static_row optional to reduce memory footprint

2019-10-15 15:42:05 +03:00

checked-file-impl.hh

…

clocks-impl.cc

…

clocks-impl.hh

…

clustering_bounds_comparator.hh

…

clustering_key_filter.hh

…

clustering_ranges_walker.hh

…

CMakeLists.txt

cmake: update CMakeLists.txt to scan test/ rather than tests/

2019-12-16 17:47:42 +03:00

collection_mutation.cc

data: make cell::make_collection(): more consistent and safer

2020-01-16 12:05:50 +02:00

collection_mutation.hh

collection_mutation_view: add type-aware pretty printer

2020-01-07 12:06:29 +02:00

column_computation.hh

…

combine.hh

…

compaction_garbage_collector.hh

types: move collection_type_impl::mutation(_view) out of collection_type_impl.

2019-10-25 10:19:45 +02:00

compaction_strategy.hh

…

compatible_ring_position.hh

…

compound_compat.hh

serialization: accept any CharOutputIterator

2019-12-02 10:10:31 +02:00

compound.hh

serialization: accept any CharOutputIterator

2019-12-02 10:10:31 +02:00

compress.cc

…

compress.hh

…

concrete_types.hh

Lua: Implement support for returning inet

2019-11-07 08:41:08 -08:00

configure.py

configure: Add -O1 when compiling generated parsers

2020-01-16 12:05:50 +02:00

connection_notifier.cc

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

connection_notifier.hh

system_keyspace: Added infrastructure for table `system.clients'

2019-12-17 11:31:28 +01:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.hh

converting_mutation_partition_applier: generalize accept_cell to UDTs.

2019-10-25 12:04:42 +02:00

counters.cc

…

counters.hh

…

cql_serialization_format.hh

…

database_fwd.hh

…

database.cc

Merge "Relax migration manager dependencies" from Pavel Emalyanov

2020-01-16 12:12:25 +01:00

database.hh

Merge "Relax migration manager dependencies" from Pavel Emalyanov

2020-01-16 12:12:25 +01:00

db_clock.hh

…

debug.hh

…

digest_algorithm.hh

…

digester.hh

…

dirty_memory_manager.hh

commitlog+region_group: timeout exceptions with names

2019-12-03 19:07:19 +01:00

disk-error-handler.cc

…

disk-error-handler.hh

…

distributed_loader.cc

database: Explicitly pass migration_manager through init_non_system_keyspace

2020-01-15 14:29:21 +03:00

distributed_loader.hh

database: Explicitly pass migration_manager through init_non_system_keyspace

2020-01-15 14:29:21 +03:00

Doxyfile

…

duration.cc

…

duration.hh

…

encoding_stats.hh

…

enum_set.hh

lwt: ensure enum_set::of is constexpr.

2019-10-01 19:45:56 +02:00

fix_system_distributed_tables.py

…

flat_mutation_reader.cc

mutation_fragment_stream_validator: wrap exceptions into own exception type

2019-12-20 12:05:00 +01:00

flat_mutation_reader.hh

mutation_fragment_stream_validator: wrap exceptions into own exception type

2019-12-20 12:05:00 +01:00

frozen_mutation.cc

…

frozen_mutation.hh

…

frozen_schema.cc

…

frozen_schema.hh

…

gc_clock.hh

gc_clock, serialization: define new serialization for gc_clock::duration (aka TTLs)

2019-10-23 18:36:33 +03:00

gen_segmented_compress_params.py

…

HACKING.md

docs: The scylla's dpdk config is boolean

2019-10-31 10:12:17 +02:00

hashers.cc

…

hashers.hh

…

hashing_partition_visitor.hh

…

hashing.hh

…

idl-compiler.py

…

init.cc

storage_service: Kill initialization helper from init.cc

2020-01-15 14:27:27 +03:00

init.hh

storage_service: Kill initialization helper from init.cc

2020-01-15 14:27:27 +03:00

install-dependencies.sh

test.py: prepare to remove custom colors

2019-12-23 15:13:22 +02:00

install.sh

dist: stop replacing /usr/lib/scylla with symlink (#5530)

2019-12-30 13:52:24 +02:00

intrusive_set_external_comparator.hh

…

json.cc

…

json.hh

…

keys.cc

…

keys.hh

…

LICENSE.AGPL

…

lister.cc

…

lister.hh

…

log.hh

…

lua.cc

lua: Handle nil returns correctly

2020-02-09 18:55:42 +02:00

lua.hh

lua: Handle nil returns correctly

2020-02-09 18:55:42 +02:00

main.cc

storage_service: Unregister from gossiper notifications ... at all

2020-02-24 14:18:15 +03:00

MAINTAINERS

scripts/find-maintainer: refresh maintainers list

2019-11-20 16:56:31 +02:00

map_difference.hh

…

marshal_exception.hh

…

memtable-sstable.hh

…

memtable.cc

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

memtable.hh

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

multishard_mutation_query.cc

messaging_service: use rpc::tuple instead of variadic futures for rpc

2019-09-26 12:09:31 +02:00

multishard_mutation_query.hh

messaging_service: use rpc::tuple instead of variadic futures for rpc

2019-09-26 12:09:31 +02:00

mutation_cleaner.hh

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

mutation_compactor.hh

query: add flag to return static row on partition with no rows

2019-10-28 21:50:44 +03:00

mutation_fragment.cc

position_in_partition_view: add type-aware printer

2020-01-07 12:15:09 +01:00

mutation_fragment.hh

mutation_fragment: declare partition_region operator<< in header file

2019-09-09 15:30:59 +03:00

mutation_partition_serializer.cc

collection_mutation: easier (de)serialization of collection_mutation(s).

2019-10-25 10:42:58 +02:00

mutation_partition_serializer.hh

…

mutation_partition_view.cc

mutation_partition_view: add virtual visitor

2020-01-07 12:06:31 +02:00

mutation_partition_view.hh

mutation_partition_view: add virtual visitor

2020-01-07 12:06:31 +02:00

mutation_partition_visitor.hh

atomic_cell: move collection_mutation(_view) to a new file.

2019-10-25 10:19:45 +02:00

mutation_partition.cc

row: append(): downgrade assert to on_internal_error()

2020-02-16 15:12:46 +02:00

mutation_partition.hh

row_marker: correct expiration condition

2019-11-19 11:46:59 +01:00

mutation_query.cc

…

mutation_query.hh

query: add flag to return static row on partition with no rows

2019-10-28 21:50:44 +03:00

mutation_reader.cc

mutation_reader: gallop mode for combined reader

2019-10-30 09:51:18 +01:00

mutation_reader.hh

…

mutation_rebuilder.hh

…

mutation_source_metadata.hh

…

mutation.cc

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

mutation.hh

lwt: move mutation hashers to mutation.hh

2019-10-01 19:49:31 +02:00

noexcept_traits.hh

…

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

collection_mutation: generalize constructor of collection_mutation to abstract_type.

2019-10-25 10:42:58 +02:00

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

…

partition_snapshot_reader.hh

…

partition_snapshot_row_cursor.hh

…

partition_version_list.hh

…

partition_version.cc

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

partition_version.hh

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

position_in_partition.hh

position_in_partition_view: add type-aware printer

2020-01-07 12:15:09 +01:00

querier.cc

querier_cache: correctly account entries evicted on insertion in the population

2019-10-03 11:49:44 +03:00

querier.hh

querier_cache: add inserted stat

2019-09-24 10:52:49 +02:00

query_result_merger.hh

…

query-request.hh

query: initialize read_command timestamp to now

2020-01-08 10:19:07 +02:00

query-result-reader.hh

…

query-result-set.cc

query-result-set: generalize result_set_builder to UDTs.

2019-10-25 12:04:44 +02:00

query-result-set.hh

…

query-result-writer.hh

…

query-result.hh

…

query.cc

schema: rename column_mask to column_set

2019-11-13 11:41:30 +03:00

range_tombstone_list.cc

…

range_tombstone_list.hh

…

range_tombstone.cc

…

range_tombstone.hh

…

range.hh

…

read_context.hh

row_cache: Use the correct schema version to populate the partition entry

2019-10-03 22:03:28 +02:00

reader_concurrency_semaphore.cc

db+semaphores+tests: mandatory `name' param in reader_concurrency_semaphore

2019-12-03 15:41:34 +01:00

reader_concurrency_semaphore.hh

db+semaphores+tests: mandatory `name' param in reader_concurrency_semaphore

2019-12-03 15:41:34 +01:00

README.md

cli: Add the --workdir|-W option

2019-11-21 15:07:39 +02:00

real_dirty_memory_accounter.hh

…

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

row_cache: Fix abort on bad_alloc during cache update

2019-11-24 12:06:51 +02:00

row_cache.hh

memtables: add partition/row hit/miss counters

2019-11-12 13:35:41 +01:00

schema_builder.hh

schema: allow schema to be marked as 'always sync to commitlog'

2020-01-15 12:15:42 +02:00

schema_fwd.hh

…

schema_mutations.cc

scylla_tables: treat empty cdc props as disabled

2020-01-05 14:39:23 +02:00

schema_mutations.hh

schema_tables: handle 'cdc' options

2019-10-17 10:55:31 +02:00

schema_registry.cc

schema_registry: mark global_schema_ptr move constructor noexcept

2019-09-26 16:56:59 +03:00

schema_registry.hh

schema_registry: mark global_schema_ptr move constructor noexcept

2019-09-26 16:56:59 +03:00

schema_upgrader.hh

…

schema.cc

schema: Add a describe method

2020-01-15 15:06:00 +02:00

schema.hh

merge "Adding a schema file when creating a snapshot"

2020-01-16 12:05:50 +02:00

scylla_post_install.sh

scylla_post_install.sh: fix 'integer expression expected' error

2020-02-04 14:30:04 +02:00

scylla-gdb.py

scylla-gdb.py: static_vector: update for changed storage

2019-12-18 17:39:56 +02:00

SCYLLA-VERSION-GEN

release: prepare for 3.3.0

2020-03-19 21:46:44 +02:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

serializer: Add std::variant support

2019-09-26 11:44:00 +03:00

serializer.hh

serializer: add reference_wrapper handling

2019-11-24 11:35:29 +02:00

service_permit.hh

…

setup.py

…

supervisor.cc

…

supervisor.hh

…

table_helper.cc

treewide: silence discarded future warnings for questionable discards

2019-08-26 19:28:43 +03:00

table_helper.hh

…

table.cc

Revert "streaming: Do not invalidate cache if no sstable is added in flush_streaming_mutations"

2020-02-24 10:02:58 +02:00

test.py

test.py: introduce BoostTest and virtualize custom boost arguments

2020-01-15 13:37:25 +03:00

timeout_config.hh

…

timestamp.hh

…

to_string.hh

to_string: Add operator<< overload for std::tuple.

2019-08-29 13:35:02 +03:00

tombstone.hh

…

tox.ini

…

types.cc

types: Fix encoding of negative varint

2020-02-02 16:00:58 +02:00

types.hh

types: Refactor duplicated value_cast implementation

2019-11-24 11:35:29 +02:00

unimplemented.cc

…

unimplemented.hh

…

user_types_metadata.hh

user_types_metadata: don't implement enable_lw_shared_from_this

2019-12-11 10:44:40 -08:00

validation.cc

…

validation.hh

…

version.hh

…

view_info.hh

view: handle multiple regular base columns in view pk

2020-01-07 12:18:39 +01:00

vint-serialization.cc

…

vint-serialization.hh

…

xx_hasher.hh

…

zstd.cc

…

README.md

Scylla

Quick-start

To get the build going quickly, Scylla offers a frozen toolchain that builds and runs Scylla using a pre-configured Docker image. Using the frozen toolchain also isolates all of the installed dependencies in a Docker container. Assuming you have met the toolchain prerequisite, which is running Docker in user mode, building and running is as easy as:

$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla
$ ./tools/toolchain/dbuild ./build/release/scylla --developer-mode 1

Please see HACKING.md for detailed information on building and developing Scylla.

Note: GCC >= 8.1.1 is required to compile Scylla.

Running Scylla

Run Scylla

./build/release/scylla

Run Scylla with one CPU and ./tmp as the work directory

./build/release/scylla --workdir tmp --smp 1

For more run options:

./build/release/scylla --help

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also experimental support for the API of Amazon DynamoDB, but being experimental it needs to be explicitly enabled to be used. For more information on how to enable the experimental DynamoDB compatibility in Scylla, and the current limitations of this feature, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found in ./docs and on the wiki. There is currently no clear definition of what goes where, so when looking for something be sure to check both. Seastar documentation can be found here. User documentation can be found here.

Building Fedora RPM

As a pre-requisite, you need to install Mock on your machine:

# Install mock:
sudo yum install mock

# Add user to the "mock" group:
sudo usermod -a -G mock $USER && newgrp mock

Then, to build an RPM, run:

./dist/redhat/build_rpm.sh

The built RPM is stored in the /var/lib/mock/<configuration>/result directory. For example, on Fedora 21 mock reports the following:

INFO: Done(scylla-server-0.00-1.fc21.src.rpm) Config(default) 20 minutes 7 seconds
INFO: Results and/or logs in: /var/lib/mock/fedora-21-x86_64/result

Building Fedora-based Docker image

Build a Docker image with:

cd dist/docker
docker build -t <image-name> .

Run the image with:

docker run -p $(hostname -i):9042:9042 -i -t <image-name>

Contributing to Scylla

Hacking howto

Guidelines for contributing

Languages

C++ 72.5%

Python 26.2%

CMake 0.4%

GAP 0.3%

Shell 0.3%