scylladb

Author	SHA1	Message	Date
Pavel Emelyanov	6a154305d7	gossiper: Remove db::config reference from gossiper Also const-ify the db::config reference argument and std::move the gossip_config argument while at it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-03-02 18:34:55 +03:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	ae3a360725	database: Move database, keyspace, table classes to replica/ directory The database, keyspace, and table classes represent the replica-only part of the objects after which they are named. Reading from a table doesn't give you the full data, just the replica's view, and it is not consistent since reconciliation is applied on the coordinator. As a first step in acknowledging this, move the related files to a replica/ subdirectory.	2022-01-06 17:07:30 +02:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Piotr Sarna	8f98c0585f	failure_detector: add a missing const qualifier The mean() method is effectively const, so it should be marked as such. Message-Id: <14dd39e8419136909fcf10508c34de3752faa7fe.1612953601.git.sarna@scylladb.com>	2021-02-10 13:04:37 +02:00
Pavel Emelyanov	eb827c9f5d	gossiper: Keep needed for failure_detection values on board And drop the gossiper -> storage_service link Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Asias He	b2c110699e	gms: Remove i_failure_detector.hh It is not used any more.	2019-03-22 09:08:51 +08:00
Asias He	af579a055b	gossip: Get rid of the gms::get_local_failure_detector static object Store the failure_detector object inside gossiper object. - No more the global object sharded<failure_detector> - No need to initialize sharded<failure_detector> manually which simplifies the code in tests/cql_test_env.cc and init.cc.	2019-03-22 09:08:51 +08:00
Avi Kivity	f02c64cadf	streaming: stream_session: remove include of db/view/view_update_from_staging_generator.hh This header, which is easily replaced with a forward declaration, introduces a dependency on database.hh everywhere. Remove it and scatter includes of database.hh in source files that really need it.	2019-01-05 17:33:25 +02:00
Avi Kivity	864f55e745	config: remove inclusions of db/config.hh from header files Instead, distribute those inclusions to .cc files that require them. This reduces rebuilds when config.hh changes, and makes it easier to locate files that need config disaggregation.	2018-12-09 20:11:38 +02:00
Tomasz Grabiec	a71624d58d	gms/failure_detector: Ignore short update intervals Failure detector decides that a node is down if it hasn't received a change of its heartbeat for longer than ~11 times the average of past intervals between updates. If there are multiple incoming ACKs containing information about the same node, we may detect and report a change for each of them. This will cause failure_detector to establish that the average report period is in milliseconds. After the update storm is over, it will claim the node failure very soon, because report period will now be a large multiple of the average. Fix by not counting short updates into the calculation of average arrival time. Fixes #2861.	2017-10-18 08:49:52 +02:00
Duarte Nunes	d0fba1a113	gms/failure_detector: Simplify alive/dead endpoint count Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-10-11 10:02:31 +01:00
Duarte Nunes	dc65cda1a3	gms/failure_detector: Fix if/else style to include braces Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-10-11 10:02:31 +01:00
Duarte Nunes	ceebbe14cc	gossiper: Avoid endpoint_state copies gossiper::get_endpoint_state_for_endpoint() returns a copy of endpoint_state, which we've seen can be very expensive. This patch adds a similar function which returns a pointer instead, and changes the call sites where using the pointer-returning variant is deemed safe (the pointer neither escapes the function, nor crosses any defer point). Fixes #764 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-10-10 13:48:02 +01:00
Asias He	adc5f0bd21	gossip: Implement the missing fd_max_interval_ms and fd_initial_value_ms option It is useful for larger cluster with larger gossip message latency. By default the fd_max_interval_ms is 2 seconds which means the failure_detector will ignore any gossip message update interval larger than 2 seconds. However, in larger cluster, the gossip message udpate interval can be larger than 2 seconds. Fixes #2603. Message-Id: <49b387955fbf439e49f22e109723d3a19d11a1b9.1500278434.git.asias@scylladb.com>	2017-07-17 13:29:16 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Asias He	022c7e50a1	failure_detector: Fix false alarm of "Not marking nodes down due to local pause of" The problem is we initialize _last_interpret when failure_detector object is constructed. When interpret() runs for the first time, the _last_interpret value is not the last time we run interpret() but the time we initialize failure_detector object. Fix by initializing _last_interpret inside interpret(). [Thu Feb 18 02:40:04 2016] INFO [shard 0] storage_service - Node 127.0.0.1 state jump to normal [Thu Feb 18 02:40:04 2016] INFO [shard 0] storage_service - NORMAL: node is now in normal status [Thu Feb 18 02:40:04 2016] INFO [shard 0] gossip - Waiting for gossip to settle before accepting client requests... [Thu Feb 18 02:40:12 2016] INFO [shard 0] gossip - No gossip backlog; proceeding Starting listening for CQL clients on 127.0.0.1:9042... [Thu Feb 18 02:40:12 2016] INFO [shard 0] gossip - Node 127.0.0.2 is now part of the cluster [Thu Feb 18 02:40:12 2016] INFO [shard 0] gossip - InetAddress 127.0.0.2 is now UP [Thu Feb 18 02:40:13 2016] INFO [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 [Thu Feb 18 02:40:13 2016] WARN [shard 0] failure_detector - Not marking nodes down due to local pause of 9091 > 5000 (milliseconds)	2016-02-24 19:31:14 +08:00
Asias He	ad30cf0faf	failure_detector: Use a standalone logger name Do not share logger with gossip. Sometimes, it is useful to only see one of them.	2015-12-02 14:21:26 +08:00
Asias He	59694a8e43	failure_detector: Print versions for gossip states in gossipinfo Backport: CASSANDRA-10330 ae4cd69 Print versions for gossip states in gossipinfo For instance, the version for each state, which can be useful for diagnosing the reason for any missing states. Also instead of just omitting the TOKENS state, let's indicate whether the state was actually present or not.	2015-12-01 17:29:25 +08:00
Asias He	224db2ba37	failure_detector: Don't mark nodes down before the max local pause interval once paused Backport: CASSANDRA-9446 7fba3d2 Don't mark nodes down before the max local pause interval once paused	2015-12-01 17:29:25 +08:00
Asias He	51fcc48700	failure_detector: Failure detector detects and ignores local pauses Backport: CASSANDRA-9183 4012134 Failure detector detects and ignores local pauses	2015-12-01 17:29:25 +08:00
Asias He	2022117234	failure_detector: Enable phi_convict_threshold option Adjusts the sensitivity of the failure detector on an exponential scale. Use as: $ scylla --phi-convict-threshold 9 Default to 8.	2015-11-30 11:09:36 +02:00
Asias He	db70643fe3	failure_detector: Print application_state properly	2015-11-30 11:08:40 +02:00
Asias He	36b2de10ed	failure_detector: Improve FD logging when the arrival time is ignored Backport from: eb9c5bb Improve FD logging when the arrival time is ignored.	2015-11-27 15:31:56 +08:00
Asias He	01ee5d002a	failure_detector: Remove debug print in operator<<	2015-10-28 16:13:57 +08:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Asias He	5bec8cba82	gossip: Kill one async::thread for mark_dead We have this call chain, gossiper::run -> do_status_check -> interpret -> convict -> mark_dead since gossip::run is executed inside a seastar thread, we can assure all functions above run inside a seastar thread.	2015-09-06 11:04:41 +08:00
Asias He	4390b448a2	gossip: Move _the_failure_detector to failure_detector.cc We will kill gms/gms.cc soon.	2015-07-31 10:43:39 +08:00
Asias He	1547fa05a5	failure_detector: Simplify get_initial_value and get_max_interval	2015-07-24 19:01:49 +08:00
Asias He	64f8c6e498	failure_detector: Switch to use std::chrono::steady_clock Instead of naked integer based time point value.	2015-07-24 18:55:21 +08:00
Asias He	73bb690b40	failure_detector: Fix now unit in report	2015-07-24 15:56:05 +08:00
Asias He	9f1dc2877e	failure_detector: Fix INITIAL_VALUE_NANOS	2015-07-24 15:56:05 +08:00
Asias He	1c2f5d5997	failure_detector: Add more log printout	2015-07-24 15:56:05 +08:00
Asias He	c3b77f499b	failure_detector: Enable logger	2015-07-24 15:56:04 +08:00
Asias He	26cd039005	gossip: Add is_alive helper failure_detector::is_alive asks gossiper if a node is up or down.	2015-06-04 17:16:58 +08:00
Shlomi Livne	0ad0a02d93	Change failure_detector registration of listeners to accept a ptr Signed-off-by: Shlomi Livne <shlomi@cloudius-systems.com>	2015-05-14 17:01:18 +08:00
Asias He	a800fbfe64	gossip: Set get_phi_convict_threshold to 8 It is the default value.	2015-04-23 14:55:26 +08:00
Asias He	b38dae4a2b	gossip: Dump failure detector info	2015-04-20 15:49:27 +08:00
Asias He	650e69da9e	gossip: Reduce header inclusion for gms/failure_detector.hh	2015-04-15 15:03:29 +08:00
Asias He	fc72506f68	gossip: Add gms/failure_detector.cc Move code from failure_detector.hh to failure_detector.cc	2015-04-15 15:03:29 +08:00

40 Commits