scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-27 03:45:11 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	85c092c56c	storage_service: add LARGE_PARTITIONS_FEATURE Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-13 09:51:23 +01:00
Asias He	e0949a8f4f	storage_service: Exit shadow round state if it fails If a node fails to talk to any seed node, shadow round will fail. We should exit shadow round state before we continue. This issue is spotted by consistency_test.TestConsistency.data_query_digest_test dtest. Message-Id: <ba0613532a69bac369ca316ab61d907b320c8e68.1467963674.git.asias@scylladb.com>	2016-07-08 10:05:07 +01:00
Asias He	5236e7a379	storage_service: Implement feature check for seed node Checking features for seed node is a bit more complicated than non-seed node, because non-seed node can always talk to at least one seed node, seed node may not. In this patch, we distingush new cluster and existing cluster by checking if the system table is empty. We relax the feature check for new cluster because the feature check is mostly useful when upgrading an existing cluster to prevent old node to join new cluster. When talking to a seed node failed during the check, we fallback to the check using features stored in the system table. This makes restarting a seed node when no other seed node is up possible (no other seed node at all, or other seed node is not up yet). I tested the following scenarios. 1) start a completely new seed node in a new cluster * system table is empty, skip the check. 2) start a cluster, restart one seed node, at least one other seed node is up * system table is not empty, check with shadow round, shadow round will * succeed 3) start a cluster, restart one seed node, no other seed node is up * system table is not empty, check with shadow round, shadow round will * fail, fallback to system table check. 4) start a cluster, shutdown all the nodes, start one seed node with new ip address, seed list in yaml is updated with new ip address * system table is not empty, check with shadow round, shadow round will * fail, fallback to system table check	2016-07-05 10:09:54 +08:00
Avi Kivity	e22517bafc	Merge "Optimize reads from leveled sstables" In a leveled column family, there can be many thousands of sstables, since each sstable is limited to a relatively small size (160M by default). With the current approach of reading from all sstables in parallel, cpu quickly becomes a bottleneck as we need to check the bloom filter for each of these sstables. This patch addresses the problem by introducing a compaction-strategy-specific data structure for holding sstables. This data structure has a method to obtain the sstables used for a read. For leveled compaction strategy, this data structure is an interval map, which can be efficiently used to select the right sstables.	2016-07-04 16:00:35 +03:00
Asias He	610a0f7ef0	storage_service: Skip feature check for seed node for now When a seed node boots up with more than one node in the seed list, it will fail to talk to the other seed node which is not up yet. This fails the feature check, so the seed node will not boot. Skip the feature check for seed node for now, util we have a proper solution. Fixes recent dtest failure due to fail to boot the seed node. Message-Id: <e1d4110f96817e45f81dc0bc948dd14600fc5333.1467251799.git.asias@scylladb.com>	2016-07-04 15:09:57 +03:00
Asias He	f6a2672be0	storage_service: Modify log to match config option of scylla We currently log as follow: May 9 00:09:13 node3.nl scylla[2546]: [shard 0] storage_service - This node was decommissioned and will not rejoin the ring unless cassandra.override_decommission=true has been set,or all existing data is removed and the node is bootstrapped again Howerver, user should use override_decommission:true instead of cassandra.override_decommission:true in scylla.yaml where the cassandra prefix is stripped. Fixes #1240 Message-Id: <b0c9424c6922431ad049ab49391771e07ca6fbde.1467079190.git.asias@scylladb.com>	2016-07-04 10:47:49 +02:00
Avi Kivity	2a46410f4a	Change sstable_list from a map to a set sstable_list is now a map<generation, sstable>; change it to a set in preparation for replacing it with sstable_set. The change simplifies a lot of code; the only casualty is the code that computes the highest generation number.	2016-07-03 10:26:57 +03:00
Paweł Dziepak	002d2bc353	thrift: pass query_processor to the thrift handler Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com> Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-27 15:24:27 +02:00
Pekka Enberg	bcba45f546	Merge "Prevent old node to join new cluster" from Asias Fixes #1253	2016-06-23 10:25:38 +03:00
Asias He	4f3ce42163	storage_service: Prevent old version node to join a new version cluster We want to prevent older version of scylla which has fewer features to join a cluster with newer version of scylla which has more features, because when scylla sees a feature is enabled on all other nodes, it will start to use the feature and assume existing nodes and future nodes will always have this feature. In order to support downgrade during rolling upgrade, we need to support mixed old and new nodes case. 1) All old nodes O O O O O <- N OK O O O O O <- O OK 2) All new nodes N N N N N <- N OK N N N N N <- O FAIL 3) Mixed old and new nodes O N O N O <- N OK O N O N O <- O OK (O == old node, N == new node, <- == joining the cluster) With this patch, I tested: 1.1) Add new node to new node cluster gossip - Feature check passed. Local node 127.0.0.4 features = {RANGE_TOMBSTONES}, Remote common_features = {RANGE_TOMBSTONES} 1.2) Add old node to old node cluster gossip - Feature check passed. Local node 127.0.0.4 features = {}, Remote common_features = {} 2.1) Add new node to new node cluster gossip - Feature check passed. Local node 127.0.0.4 features = {RANGE_TOMBSTONES}, Remote common_features = {RANGE_TOMBSTONES} 2.2) Add old node to new node cluster seastar - Exiting on unhandled exception: std::runtime_error (Feature check failed. This node can not join the cluster because it does not understand the feature. Local node 127.0.0.4 features = {}, Remote common_features = {RANGE_TOMBSTONES}) 3.1) Add new node to mixed cluster gossip - Feature check passed. Local node 127.0.0.4 features = {RANGE_TOMBSTONES}, Remote common_features = {} 3.2) Add old node to mixed cluster gossip - Feature check passed. Local node 127.0.0.4 features = {}, Remote common_features = {} Fixes #1253	2016-06-17 10:49:45 +08:00
Pekka Enberg	d72c608868	service/storage_service: Make do_isolate_on_error() more robust Currently, we only stop the CQL transport server. Extract a stop_transport() function from drain_on_shutdown() and call it from do_isolate_on_error() to also shut down the inter-node RPC transport, Thrift, and other communications services. Fixes #1353	2016-06-16 13:34:09 +03:00
Pekka Enberg	155ad2eeb5	storage_service: Fix start_rpc_server() to use logger Message-Id: <1465882880-7392-1-git-send-email-penberg@scylladb.com>	2016-06-14 09:52:04 +02:00
Vlad Zolotarov	d3960f0bbb	tracing: rearrange shut down tracing::tracing local instance is dereferenced from a cql_server::connection::process_request(), therefore tracing::tracing service may be stop()ed only after a CQL server service is down. On the other hand it may not be stopped before RPC service is down because a remote side may request a tracing for a specific command too. This patch splits the tracing::tracing stop() into two phases: 1) Flush all pending tracing records and stop the backend. 2) Stop the service. The first phase is called after CQL server is down and before RPC is down. The second phase is called after RPC is down. Fixes #1339 Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1465840496-19990-1-git-send-email-vladz@cloudius-systems.com>	2016-06-14 07:58:04 +03:00
Duarte Nunes	c970d682d1	storage_service: Announce range tombstones feature This patch enables the RANGE_TOMBSTONES supported feature, meaning that the node is capable of accepting row entry tombstones as range tombstones. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:59 +02:00
Duarte Nunes	e46537b7d3	storage_service: Include range tombstones feature This patch adds the range tombstones feature, which is not enabled yet, to the storage_service, so that consumers can query for it. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:58 +02:00
Asias He	f1b3cb4a08	storage_service: Catch and fail an invalid configuration with --replace-address Vlad reported a strange user configuration: SCYLLA_ARGS="--log-to-syslog 1 --log-to-stdout 0 --default-log-level info --collectd-address=127.0.0.1:25826 --collectd=1 --collectd-poll-period 60000 --network-stack posix --num-io-queues 32 --max-io-requests 128 --replace-address 10.0.4.131" seed_provider: - class_name: org.apache.cassandra.locator.SimpleSeedProvider parameters: - seeds: "10.0.4.131" In the mean while, 10.0.4.131 is the IP address of the node itself. When the node was started, the following message were reported. Apr 13 06:31:12 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (20 seconds passed) Apr 13 06:31:13 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (21 seconds passed) Apr 13 06:31:14 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (22 seconds passed) Apr 13 06:31:15 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (23 seconds passed) The configruation is invalid, becasue for --replace-address to work, at least one working seed node should be alive. Catch the configuration error and fail it with an appropriate error message. Fixes #1183 Message-Id: <a94a082d896313e7a668915ae21fe2c03719da3a.1464164058.git.asias@scylladb.com>	2016-05-25 14:42:19 +03:00
Raphael S. Carvalho	e5f0314afd	db: introduce upload directory for sstable migration This change is intended to make migration process safer and easier. All column families will now have a directory called upload. With this feature, users may choose to copy migrated sstables to upload directory of respective column families, and call 'nodetool refresh'. That's supposed to be the preferred option from now on. For each sstable in upload directory, refresh will do the following: 1) Mutate sstable level to 0. 2) Create hard links to its components in column family dir, using a new generation. We make it safe by creating a hard link to temporary TOC first. 3) Remove all of its components in upload directory. This new code runs after refresh checked for new sstables in the column family directory. Otherwise, we could have a generation conflict. Unlike the first step, this new step runs with sstable write enabled. It's easier here because we know exactly which sstables are new. After that, refresh will load new sstables found in column family and upload directories. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-20 17:26:21 -03:00
Duarte Nunes	7dbeef3c39	storage_service: Fix ignored future in on_alive This patch ensures the future created by invoke_on_all is not ignored by waiting on it, which is safe to do since we are within a seastar::async context. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1462989837-7326-1-git-send-email-duarte@scylladb.com>	2016-05-12 09:03:46 +03:00
Calle Wilund	437ebe7128	cql_server: Use credentials_builder to init tls Slightly cleaner, and shard-safe tls init. Message-Id: <1462283265-27051-3-git-send-email-calle@scylladb.com>	2016-05-09 14:12:59 +03:00
Calle Wilund	709dd82d59	storage_service: Add logging to match origin Pointing out if CQL server is listing in SSL mode. Message-Id: <1462368016-32394-2-git-send-email-calle@scylladb.com>	2016-05-06 13:27:55 +03:00
Asias He	891e947314	storage_service: Rename remove_node to removenode nodetool uses removenode command to remove a node. Rename the implementation in storage_service to match the command.	2016-04-13 14:53:28 +08:00
Asias He	9ffb95216d	storage_service: Add force_remove_completion It is needed by the $ nodetool removenode force command.	2016-04-13 14:53:28 +08:00
Asias He	7c7e5967f6	storage_service: Add get_removal_status It is needed by the $ nodetool removenode status command.	2016-04-13 14:53:28 +08:00
Asias He	8d7cd07d6c	storage_service: Add print info in confirm_replication The message is rare but it is very useful to debug removenode operation.	2016-04-13 14:53:28 +08:00
Pekka Enberg	47a904c0f6	Merge "gossip: Introduce SUPPORTED_FEATURES" from Asias "There is a need to have an ability to detect whether a feature is supported by entire cluster. The way to do it is to advertise feature availability over gossip and then each node will be able to check if all other nodes have a feature in question. The idea is to have new application state SUPPORTED_FEATURES that will contain set of strings, each string holding feature name. This series adds API to do so. The following patch on top of this series demostreates how to wait for features during boot up. FEATURE1 and FEATURE2 are introduced. We use wait_for_feature_on_all_node to wait for FEATURE1 and FEATURE2 successfully. Since FEATURE3 is not supported, the wait will not succeed, the wait will timeout. --- a/service/storage_service.cc +++ b/service/storage_service.cc @@ -95,7 +95,7 @@ sstring storage_service::get_config_supported_features() { // Add features supported by this local node. When a new feature is // introduced in scylla, update it here, e.g., // return sstring("FEATURE1,FEATURE2") - return sstring(""); + return sstring("FEATURE1,FEATURE2"); } std::set<inet_address> get_seeds() { @@ -212,6 +212,11 @@ void storage_service::prepare_to_join() { // gossip snitch infos (local DC and rack) gossip_snitch_info().get(); + gossiper.wait_for_feature_on_all_node(std::set<sstring>{sstring("FEATURE1"), sstring("FEATURE2")}, std::chrono::seconds(30)).get(); + logger.info("Wait for FEATURE1 and FEATURE2 done"); + gossiper.wait_for_feature_on_all_node(std::set<sstring>{sstring("FEATURE3")}).get(); + logger.info("Wait for FEATURE3 done"); + We can query the supported_features: cqlsh> SELECT supported_features from system.peers; supported_features -------------------- FEATURE1,FEATURE2 FEATURE1,FEATURE2 (2 rows) cqlsh> SELECT supported_features from system.local; supported_features -------------------- FEATURE1,FEATURE2 (1 rows)"	2016-04-08 09:22:50 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Asias He	d5dce8016b	storage_service: Advertise supported_features into cluster Advertise features supported by this node, so that other nodes can know this info. For example, on a 3 node cluster with supported_features == FEATURE1 and FEATURE2, it looks like: cqlsh> SELECT supported_features from system.peers; supported_features -------------------- FEATURE1,FEATURE2 FEATURE1,FEATURE2 (2 rows) cqlsh> SELECT supported_features from system.local; supported_features -------------------- FEATURE1,FEATURE2 (1 rows)	2016-04-06 07:12:34 +08:00
Asias He	0e1738943d	storage_service: Add supported_features into system.peers table	2016-04-06 07:12:34 +08:00
Asias He	b710a5f9ee	storage_service: Introduce get_config_supported_features It tells features supported by this local node. When new feature is introduced in scylla, update features returned by get_config_supported_features, e.g., return sstring("FEATURE1,FEATURE2")	2016-04-06 07:12:34 +08:00
Asias He	bc1889b7ab	storage_service: Shutdown batchlog_manager after decommission On the node which was decommissioned, I saw 2016-03-29 09:35:52,097 [shard 0] storage_service - DECOMMISSIONED: 2016-03-29 09:35:52,097 [shard 0] storage_service - DECOMMISSIONING: done 2016-03-29 09:36:28,814 [shard 0] batchlog_manager - Batchlog replay on shard 0: starts 2016-03-29 09:36:28,814 [shard 0] batchlog_manager - Batchlog replay on shard 0: done 2016-03-29 09:37:28,819 [shard 0] batchlog_manager - Batchlog replay on shard 1: starts 2016-03-29 09:37:28,820 [shard 0] batchlog_manager - Batchlog replay on shard 1: done 2016-03-29 09:38:28,830 [shard 0] batchlog_manager - Batchlog replay on shard 0: starts 2016-03-29 09:38:28,830 [shard 0] batchlog_manager - Batchlog replay on shard 0: done 2016-03-29 09:39:28,844 [shard 0] batchlog_manager - Batchlog replay on shard 1: starts 2016-03-29 09:39:28,844 [shard 0] batchlog_manager - Batchlog replay on shard 1: done We should stop the batchlog_manager to avoid initiating only future batchlog replay operation.	2016-03-30 20:54:30 +08:00
Asias He	5d1140b1eb	storage_service: Do batch log replay in decommission Replay the batch log during decommission. Kill one FIXME. Refs #1085	2016-03-30 20:54:30 +08:00
Tomasz Grabiec	d1db23e353	storage_service: Fix typos Message-Id: <1458837390-26634-1-git-send-email-tgrabiec@scylladb.com>	2016-03-29 10:29:04 +03:00
Raphael Carvalho	e6e5999282	Fix corner-case in refresh Problem found by dtest which loads sstables with generation 1 and 2 into an empty column family. The root of the problem is that reshuffle procedure changes new sstables to start from generation 2 at least. So reshuffle could try to set generation 1 to 2 when generation 2 exists. This problem can be fixed by starting from generation 1 instead, so reshuffle would handle this case properly. Fixes #1099. Signed-off-by: Raphael Carvalho <raphaelsc@scylladb.com> Message-Id: <88c51fbda9557a506ad99395aeb0a91cd550ede4.1458917237.git.raphaelsc@scylladb.com>	2016-03-27 10:03:32 +03:00
Raphael Carvalho	370b1336fe	service: fix refresh Vlad and I were working on finding the root of the problems with refresh. We found that refresh was deleting existing sstable files because of a bug in a function that was supposed to return the maximum generation of a column family. The intention of this function is to get generation from last element of column_family::_sstables, which is of type std::map. However, we were incorrectly using std::map::end() to get last element, so garbage was being read instead of maximum generation. If the garbage value is lower than the minimum generation of a column family, then reshuffle_sstables() would set generation of all existing sstables to a lower value. That would confuse our mechanism used to delete sstables because sstables loaded at boot stage were touched. Solution to this problem is about using rbegin() instead of end() to get last element from column_family::_sstables. The other problem is that refresh will only load generations that are larger than or equal to X, so new sstables with lower generation will not be loaded. Solution is about creating a set with generation of live SSTables from all shards, and using this set to determine whether a generation is new or not. The last change was about providing an unused generation to reshuffle procedure by adding one to the maximum generation. That's important to prevent reshuffle from touching an existing SSTable. Tested 'refresh' under the following scenarios: 1) Existing generations: 1, 2, 3, 4. New ones: 5, 6. 2) Existing generations: 3, 4, 5, 6. New ones: 1, 2. 3) Existing generations: 1, 2, 3, 4. New ones: 7, 8. 4) No existing generation. No new generation. 5) No existing generation. New ones: 1, 2. I also had to adapt existing testcase for reshuffle procedure. Fixes #1073. Signed-off-by: Raphael Carvalho <raphaelsc@scylladb.com> Message-Id: <1c7b8b7f94163d5cd00d90247598dd7d26442e70.1458694985.git.raphaelsc@scylladb.com>	2016-03-23 10:21:58 +02:00
Benoît Canet	1fb9a48ac5	exception: Optionally shutdown communication on I/O errors. I/O errors cannot be fixed by Scylla the only solution is to shutdown the database communications. Signed-off-by: Benoît Canet <benoit@scylladb.com> Message-Id: <1458154098-9977-1-git-send-email-benoit@scylladb.com>	2016-03-17 15:02:52 +02:00
Asias He	83ffae1568	storage_service: Drop block_until_update_pending_ranges_finished It is a legacy API from c*. Since we can wait for the update_pending_ranges to complete, we can wait for it directly instead of calling block_until_update_pending_ranges_finished to do so. Also, change do_update_pending_ranges to be private. Message-Id: <ac79b2879ec08fdcd3b2278ff68962cc71492f12.1458040608.git.asias@scylladb.com>	2016-03-15 15:18:45 +02:00
Gleb Natapov	c6157dd99e	enable rpc_keepalive parameter Fixes #1044 Message-Id: <20160315104609.GV6117@scylladb.com>	2016-03-15 12:51:12 +02:00
Pekka Enberg	917ed4adbe	Merge "verb init/handler for gosisp and storage_service" from Asias "- ignore ack2 msg if gossip is not enabled - move REPLICATION_FINISHED to where it belongs to - add comments for gossip runtime dependency"	2016-03-15 11:12:10 +02:00
Asias He	883d8cb8fd	storage_service: Move REPLICATION_FINISHED verb to storage_service It belongs to storage_service not storage_proxy.	2016-03-15 16:13:22 +08:00
Asias He	fb4d292d5c	storage_service: Drop unused debug code	2016-03-15 16:13:21 +08:00
Asias He	d63281b256	storage_service: Update pending ranges when keyspace is changed If a keyspace is created after we calcuate the pending ranges during bootstrap. We will ignore the keyspace in pending ranges when handling write request for that keyspace which will casue data lose if rf = 1. Fixes #1000	2016-03-15 15:41:23 +08:00
Asias He	9f64c36a08	storage_service: Fix pending_range_calculator_service Since calculate_pending_ranges will modify token_metadata, we need to replicate to other shards. With this patch, when we call calculate_pending_ranges, token_metadata will be replciated to other non-zero shards. In addition, it is not useful as a standalone class. We can merge it into the storage_service. Kill one singleton class. Fixes #1033 Refs #962 Message-Id: <fb5b26311cafa4d315eb9e72d823c5ade2ab4bda.1457943074.git.asias@scylladb.com>	2016-03-14 10:14:22 +02:00
Asias He	cb90ff2709	storage_service: Make decommission log info instead of debug level The log is just a few lines. It is very useful to tell which step fails in case of error when we do decommission.	2016-03-10 10:56:48 +08:00
Asias He	138c5f5834	storage_service: Do not stop messaging_service more than once If we do - Decommission a node - Stop a node we will shutdown messaging_service more than once in: - storage_service::decommission - storage_service::drain_on_shutdown Fixes #1005 Refs #1013 This fix a dtest failure in debug build. update_cluster_layout_tests.TestUpdateClusterLayout.simple_decommission_node_1_test/ /data/jenkins/workspace/urchin-dtest/label/monster/mode/debug/scylla/seastar/core/future.hh:802:35: runtime error: member call on null pointer of type 'struct future_state' core/future.hh:334:49: runtime error: member access within null pointer of type 'const struct future_state' ASAN:SIGSEGV ================================================================= ==4557==ERROR: AddressSanitizer: SEGV on unknown address 0x000000000000 (pc 0x00000065923e bp 0x7fbf6ffac430 sp 0x7fbf6ffac420 T0) #0 0x65923d in future_state<>::available() const /data/jenkins/workspace/urchin-dtest/label/monster/mode/debug/scylla/seastar/core/future.hh:334 #1 0x41458f1 in future<>::available() /data/jenkins/workspace/urchin-dtest/label/monster/mode/debug/scylla/seastar/core/future.hh:802 #2 0x41458f1 in then_wrapped<parallel_for_each(Iterator, Iterator, Func&&)::<lambda(parallel_for_each_state&)> [with Iterator = std::__detail::_Node_iterator<std::pair<const net::msg_addr, net::messaging_service::shard_info>, false, true>; Func = net::messaging_service::stop()::<lambda(auto:39&)> [with auto:39 = std::unordered_map<net::msg_addr, net::messaging_service::shard_info, net::msg_addr::hash>]::<lambda(std::pair<const net::msg_addr, net::messaging_service::shard_info>&)>]::<lambda(future<>)>, future<> > /data/jenkins/workspace/urchin-dtest/label/monster/mode/debug/scylla/seastar/core/future.hh:878	2016-03-10 10:56:48 +08:00
Asias He	3a4ea227d8	storage_service: Fix effective_ownership Now, get_ranges_for_endpoint will unwrap the first range. With t0 t1 t2 t3, the first range (t3,t0] will be splitted as (min,t0] and (t3,max]. Skippping the range (t3,max] we will get the correct ownership number as if the first range were not splitted. Fixes #928 Message-Id: <2e30ebd53f3dba3cc5e0cf36d5541c354b0e30ca.1457506704.git.asias@scylladb.com>	2016-03-09 13:26:01 +01:00
Raphael S. Carvalho	0f4239d63a	service: improve logging of storage_service::load_new_sstables Closes #952. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <2402f387c32d2d1221e740edb67e56c1593c1936.1457366098.git.raphaelsc@scylladb.com>	2016-03-07 18:01:52 +01:00
Raphael S. Carvalho	d65642cee8	fix storage_service::load_new_sstables() to not disable write permanently Avi says: "If an exception happens, then enable_sstable_writes won't be called." The problem is fixed by catching a possible exception and enabling sstable write for the relevant column family if it wasn't enabled already. Closes #953. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <32c1bcb2c60c7b9e5514eb0a95062f40ca92093a.1457119308.git.raphaelsc@scylladb.com>	2016-03-07 13:56:02 +01:00
Vlad Zolotarov	87e6efcdab	storage_service: distribute gossiper::endpoint_state_map together with token_metadata If storage_service::token_metadata is not distributed together with gossiper::endpoint_state_map there may be a situation when a non-zero shard sees a new value in token_metadata (e.g. newly added node's token ranges) while still seeing an old gossiper::endpoint_state_map contents (e.g. a mentioned above newly added node may not be present, thus causing gossiper::is_alive() to return FALSE for that node, while the node is actually alive and kicking). To avoid this discrepancy we will always update a token_metadata together with an endpoint_state_map when we distribute new token_metadata data among shards. Fixes #909 Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-03-06 13:15:19 +02:00
Pekka Enberg	6d7e14a53a	Merge "Implement describe_schema_versions" from Paweł "This series implements describe_schema_versions so that we nodetool describecluster can return proper schema information for the whole cluster. It involves adding new verb SCHEMA_CHECK which is used to get schema version for a given node and a simple map-reduce that using that verb gets info from the whole cluster. This fixes #677, fixes #684, and fixes #472."	2016-03-02 16:02:53 +02:00
Paweł Dziepak	723b3ae7ed	storage_service: implement describe_schema_versions Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-02 12:49:55 +00:00

1 2 3 4 5 ...

336 Commits