scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-09 16:33:35 +00:00

Author	SHA1	Message	Date
Asias He	1b9e350614	gossip: Do not print node is now part of the cluster during gossip shadow round With Node 1 (Seed node, Port 7000 is opened, 10.184.9.144) Node 2 (Port 7000 is opened, 10.184.9.145) Node 3 (Port 7000 is blocked by firewall) On Node 3, we saw the following error which was very confusing: Node 3 saw Node 1 and Node 3 but it complained it can not contact any seeds. The message "Node 10.184.9.144 is now part of the cluster" and friends are actually messages printed during the gossip shadow round where Node 3 connects to Node 1's port 7000 and Node 1 returns all info it knows to Node 3, so that Node 3 knows Node 1 and Node 2 and we see the "Node 10.184.9.144/145 is now part of the cluster" message. However, during the normal gossip round, Node 3 will not mark Node 1 and Node 2 UP until the Seed node initiates a gossip round to Node 3, (note port 7000 on node 3 is blocked in this case). So Node 3 will not mark Node 1 and Node 2 UP and we see the "Unable to contact any seeds" error. [shard 0] storage_service - Loading persisted ring state [shard 0] gossip - Node 10.184.9.144 is now part of the cluster [shard 0] gossip - inet_address 10.184.9.144 is now UP [shard 0] gossip - Node 10.184.9.145 is now part of the cluster [shard 0] gossip - inet_address 10.184.9.145 is now UP [shard 0] storage_service - Starting up server gossip scylla_run[12479]: Start gossiper service ... [shard 0] storage_service - JOINING: waiting for ring information [shard 0] storage_service - JOINING: schema complete, ready to bootstrap [shard 0] storage_service - JOINING: waiting for pending range calculation [shard 0] storage_service - JOINING: calculation complete, ready to bootstrap [shard 0] storage_service - JOINING: getting bootstrap token [shard 0] storage_service - JOINING: sleeping 5000 ms for pending range setup scylla_run[12479]: Exiting on unhandled exception of type 'std::runtime_error': Unable to contact any seeds!	2015-12-01 17:29:25 +08:00
Asias He	f62a6f234b	gossip: Add shutdown gossip state Backported: CASSANDRA-8336 and CASSANDRA-9871 84b2846 remove redundant state b2c62bb Add shutdown gossip state to prevent timeouts during rolling restarts 8f9ca07 Cannot replace token does not exist - DN node removed as Fat Client Fixes: When X is shutdown, X sends SHUTDOWN message to both Y and Z, but for some reason, only Y receives the message and Z does not receive the message. If Z has a higher gossip version for X than Y has for X, Z will initiate a gossip with Y and Y will mark X alive again. X ------> Y \ / \ / Z	2015-12-01 17:29:25 +08:00
Gleb Natapov	8c02ad0e9e	messaging: log connection dropping event	2015-11-30 19:42:04 +02:00
Avi Kivity	b85f3ad130	Merge "Commit log replay - handle corrupted data silently, as non-fatal" Fixes: #593 "Changes the parser/replayer to treat data corruption as non-fatal, skipping as little as possible to get the most data out of a segment, but keeping track of, and reporting, the amount corrupted. Replayer handles this and reports any non-fatal errors on replay finish. Also added tests for corruption cases. This patch series contains a cleanup-patch for commitlog_tests that was previously submitted, but got lost."	2015-11-30 19:13:31 +02:00
Gleb Natapov	5b9f3bff7d	storage_proxy: simplify error handling by using this/handle_exception It is cleaner to use this/handle_exception instead of then_wrapped if normal and error flow do not share any state.	2015-11-30 17:41:32 +02:00
Gleb Natapov	5484f25091	storage_proxy: remove unneeded continuation make_ready_future() around when_all() is not longer needed. It was added to catch mutate_locally() exceptions, but now it is handled in lower level.	2015-11-30 17:41:28 +02:00
Gleb Natapov	cf95c3f681	storage_proxy: introduce unique_response_handler object to prevent write request leaks If something bad happens between write request handler creation and request execution the request handler have to be destroyed. Currently code tries to do that explicitly in all places where request may be abandoned, but it misses some (at least one). This patch replaces this by introducing unique_response_handler object that will remove the handler automatically if request is not executed for some reason.	2015-11-30 17:41:27 +02:00
Gleb Natapov	d8afc6014e	storage_proxy: catch exception thrown by mutate_locally in mutate verb handler Also simplify error logging.	2015-11-30 17:41:25 +02:00
Avi Kivity	3c9ded27cc	Update scylla-ami submodule * ami/files/scylla-ami 3f37184...07b7118 (1): > Use /etc/scylla as SCYLLA_CONF directory	2015-11-30 16:39:49 +02:00
Takuya ASADA	616903de12	dist: use distribution version of antlr3, on Ubuntu 15.10 Rename antlr3-tool to antlr3 (same as distribution package), and use distribution version if it's available Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-11-30 16:37:36 +02:00
Pekka Enberg	0e8f80b5ee	Merge "Relax bootstrapping/leaving/moving nodes check" from Asias	2015-11-30 11:53:07 +02:00
Asias He	2022117234	failure_detector: Enable phi_convict_threshold option Adjusts the sensitivity of the failure detector on an exponential scale. Use as: $ scylla --phi-convict-threshold 9 Default to 8.	2015-11-30 11:09:36 +02:00
Asias He	db70643fe3	failure_detector: Print application_state properly	2015-11-30 11:08:40 +02:00
Asias He	aaca88a1e7	token_metadata: Add print_pending_ranges for debug print Signed-off-by: Pekka Enberg <penberg@scylladb.com>	2015-11-30 11:07:42 +02:00
Avi Kivity	2c59e2f81f	Merge "Fix race between population and update in row cache" from Tomasz "Before this change, populations could race with update from flushed memtable, which might result in cache being populated with older data. Populations started before the flush are not considering the memtable nor its sstable. The fix employed here is to make update wait for populations which were started before the flushed memtable's sstable was added to the undrelying data source. All populatinos started after that are guaranteed to see the new data. The update() call will wait only for current populating reads to complete, it will not wait for readers to get advanced by the consumer for instance."	2015-11-30 11:06:23 +02:00
Tomasz Grabiec	8d88ece896	schema_tables: Fix "comment" property not being loaded from storage	2015-11-30 10:57:36 +02:00
Pekka Enberg	a64fa3db03	Merge "range_streamer fix and cleanup" from Asias Do not use hard-coded value for is_replacing and rangemovement.	2015-11-30 10:47:06 +02:00
Asias He	879a4ad4d3	storage_service: Update pending ranges immediately after update of normal tokens To avoid a race where natural endpoint was updated to contain node A, but A was not yet removed from pending endpoints. This fixes the root cause of commit `d9d8f87c1` (storage_proxy: filter out natural endpoints from pending endpoint). This patch alone fixes #539, but we still want commit `d9d8f87c1` to be safe.	2015-11-30 10:20:59 +02:00
Asias He	0af7fb5509	range_streamer: Kill FIXME in use_strict_consistency for consistent_rangemovement	2015-11-30 09:15:42 +08:00
Asias He	f80e3d7859	range_streamer: Simplify multiple_map to map conversion in add_ranges	2015-11-30 09:15:42 +08:00
Asias He	21882f5122	range_streamer: Kill one leftover comment	2015-11-30 09:15:42 +08:00
Asias He	6b258f1247	range_streamer: Kill FIXME for is_replacing	2015-11-30 09:15:42 +08:00
Asias He	aa2b11f21b	database: Move is_replacing and get_replace_address to database class So they can be used outside storage_service.	2015-11-30 09:15:42 +08:00
Asias He	80d1d4d161	storage_service: Relax bootstrapping/leaving/moving nodes check in check_for_endpoint_collision When other bootstrapping/leaving/moving nodes are found during bootstrap, instead of throwing immediately, sleep and try again for one minute, hoping other nodes will finish the operation soon. Since we are retrying using shadow gossip round more than once, we need to put the gossip state back to shadow round after each shadow round, to make shadow round works correctly. This is useful when starting an empty cluster for testing. E.g, $ scylla --listen-address 127.0.0.1 $ sleep 3 $ scylla --listen-address 127.0.0.2 $ sleep 3 $ scylla --listen-address 127.0.0.3 Without this patch, node 3 will hit the check. TIME STATUS ----------------------- Node 1: 32:00 Starts 32:00 In NORMAL status Node 2: 32:03 Starts 32:04 In BOOT status 32:10 In NORMAL status Node 3: 32:06 Starts 32:06 Found node 2 in BOOT status, hit the check, sleep and try again 32:11 Found node 2 in NORMAL status, can keep going now 32:12 In BOOT status 32:18 In NORMAL status	2015-11-30 09:07:57 +08:00
Asias He	8b19373536	storage_service: Relax bootstrapping/leaving/moving nodes check in join_token_ring When other bootstrapping/leaving/moving nodes are found during bootstrap, instead of throwing immediately, sleep and try again for one minute, hoping other nodes will finish the operation soon. This is useful when starting an empty cluster for testing. E.g, $ scylla --listen-address 127.0.0.1 $ scylla --listen-address 127.0.0.2 $ scylla --listen-address 127.0.0.3 Without this patch, node 3 will hit the check. TIME STATUS ----------------------- Node 1: 25:19 Starts 25:20 In NORMAL status Node 2: 25:19 Starts 25:23 In BOOT status 25:28 In NORMAL status Node 3: 25:19 Starts 25:24 Found node 2 in BOOT status, hit the check, sleep and try again 25:29 Found node 2 in NORMAL status, can keep going now 25:29 In BOOT status 25:34 In NORMAL status	2015-11-30 09:07:57 +08:00
Tomasz Grabiec	df46542832	tests: Add test for populate and update race	2015-11-29 16:25:22 +01:00
Tomasz Grabiec	6f69d4b700	tests: Avoid potential use after free on partition range	2015-11-29 16:25:21 +01:00
Tomasz Grabiec	de75f3fa69	row_cache: Add default value for partition range in make_reader()	2015-11-29 16:25:21 +01:00
Tomasz Grabiec	ab328ead3d	mutation: Introduce ring_position()	2015-11-29 16:25:21 +01:00
Tomasz Grabiec	32ac2ccc4a	memtable: Introduce apply(memtable&)	2015-11-29 16:25:21 +01:00
Tomasz Grabiec	7c3e6c306b	row_cache: Wait for in-flight populations on update Before this change, populations could race with update from flushed memtable, which might result in cache being populated with older data. Populations started before the flush are not considering the memtable nor its sstable. The fix employed here is to make update wait for populations which were started before the flushed memtable's sstable was added to the undrelying data source. All populatinos started after that are guaranteed to see the new data.	2015-11-29 16:25:21 +01:00
Tomasz Grabiec	a3e3add28a	utils: Introduce phased_barrier Utility for waiting on a group of async actions started before certain point in time.	2015-11-29 16:25:21 +01:00
Pekka Enberg	a26ffefd53	transport/server: Remove CQL text type from encoding The text data type is no longer present in CQL binary protocol v3 and later. We don't need it for encoding earlier versions either because it's an alias for varchar which is present in all CQL binary protocol versions. Fixes #526. Signed-off-by: Pekka Enberg <penberg@scylladb.com>	2015-11-27 09:13:56 +01:00
Pekka Enberg	2599b78583	Merge "CQL notification + storage_service fix" from Asias "pushed_notifications_test.py dtest is now passing"	2015-11-27 10:09:56 +02:00
Asias He	da0e80a286	storage_service: Fix failed bootstrap/replace attempts being persisted in system.peers Backported from: ac46747 Fix failed bootstrap/replace attempts being persisted in system.peers (CASSANDRA-9180)	2015-11-27 15:31:56 +08:00
Asias He	36b2de10ed	failure_detector: Improve FD logging when the arrival time is ignored Backport from: eb9c5bb Improve FD logging when the arrival time is ignored.	2015-11-27 15:31:56 +08:00
Asias He	ed9cd23a2d	transport: Fix duplicate up/down messages sent to native clients This patch plus pekka's previous commit `3c72ea9f96` "gms: Fix gossiper::handle_major_state_change() restart logic" fix CASSANDRA-7816. Backported from: def4835 Add missing follow on fix for 7816 only applied to cassandra-2.1 branch in 763130bdbde2f4cec2e8973bcd5203caf51cc89f 763130b Followup commit for 7816 2199a87 Fix duplicate up/down messages sent to native clients Tested by: pushed_notifications_test.py:TestPushedNotifications.restart_node_test	2015-11-27 15:31:56 +08:00
Asias He	25bb889c2a	transport: Fix wrong message for UP and DOWN event	2015-11-27 15:31:56 +08:00
Asias He	ca8c4f3e77	storage_service: Fix MOVED_NODE client event (CASSANDRA-8516) Backport from: b296c55f956c6ef07c8330dc28ef8c351e5bcfe2 (Fix MOVED_NODE client event) Fixes: DISABLE_VNODES=true nosetests pushed_notifications_test.py:TestPushedNotifications.move_single_node_test	2015-11-27 15:31:56 +08:00
Gleb Natapov	ad358300a9	cql server: remove connection from notifiers earlier Remove connection from notifiers lists just before closing it to prevent attempts to send notification on already closed connection.	2015-11-26 18:50:08 +02:00
Pekka Enberg	569d288891	cql3: Add TRUNCATE TABLE alias for TRUNCATE CQL 3.2.1 introduces a "TRUNCATE TABLE X" alias for "TRUNCATE X": `4e3555c1d9` Fix our CQL grammar to also support that. Please note that we don't bump up advertised CQL version yet because our cqlsh clients won't be able to connect by default until we upgrade them to C* 2.1.10 or later. Fixes #576 Signed-off-by: Pekka Enberg <penberg@scylladb.com>	2015-11-26 18:45:50 +02:00
Gleb Natapov	96f40d535e	cql server: add missing gate during connection access cql connection access is protected by a gate, but event notifiers have omitted taking it. Fix it.	2015-11-26 13:05:59 +02:00
Tomasz Grabiec	a7c11d1e30	db: Fix handling of missing column family The FIXMEs are no longer valid, we load schema on bootstrap and don't support hot-plugging of column families via file system (nor does Cassandra). Handling of missing tables matches Cassandra 2.1, applies log it and continue, queries propagate the error.	2015-11-25 16:59:15 +02:00
Tomasz Grabiec	3a402db1be	storage_proxy: Remove dead signature	2015-11-25 16:57:03 +02:00
Asias He	d03b452322	storage_service: Remove RPC client in on_dead When gossip mark a node down, we should close all the RPC connections to that node.	2015-11-25 16:30:14 +02:00
Gleb Natapov	d9d8f87c1b	storage_proxy: filter out natural endpoints from pending endpoint If request comes after natural endpoint was updated to contain node A, but A was not yet removed from pending endpoints it will be in both and write request logic cannot handle this properly. Filter nodes which are already in natural endpoint from pending endpoint to fix this. Fixes #539.	2015-11-25 16:28:55 +02:00
Pekka Enberg	cf7541020f	Merge "Enable more config options" from Asias	2015-11-25 16:09:22 +02:00
Tomasz Grabiec	c3f03d5c96	Merge branch 'pdziepak/random-lsa-patches/v3' from seastar-dev.git LSA fixes from Paweł.	2015-11-25 10:26:23 +01:00
Paweł Dziepak	89f7f746cb	lsa: fix printing object_descriptor::_alignment object_descriptor::_alignment is of type uint8_t which is actually an unsigned char. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-11-24 20:13:29 +01:00
Paweł Dziepak	65875124b7	lsa: guarantee that segment_heap doesn't throw boost::heap::binomial_heap allocates helper object in push() and, therefore, may throw an exception. This shouldn't happen during compaction. The solution is to reserve space for this helper object in segment_descriptor and use a custom allocator with boost::heap::binomial_heap. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-11-24 19:51:22 +01:00

... 928 929 930 931 932 ...

53948 Commits