scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 06:53:12 +00:00

Author	SHA1	Message	Date
Duarte Nunes	4684b8ecbb	gossip: Refactor waiting for features This patch changes the sleep-based mechanism of detecting new features by instead registering waiters with a condition variable that is signaled whenever a new endpoint information is received. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-05-27 17:20:51 +00:00
Duarte Nunes	422f244172	gossip: Don't timeout when waiting for features This patch removes the timeout when waiting for features, since future patches will make this argument unnecessary. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-05-27 17:20:51 +00:00
Avi Kivity	fab4cc8d6d	Merge seastar upstream * seastar 8bfbb1a...0bcdd28 (1): > Merge "introduce sleep_abortable() that throws exception on application exit" from Gleb	2016-05-27 20:14:49 +03:00
Duarte Nunes	b3011c9039	gossip: Rename set_heart_beat_state ...to set_heart_beat_state_and_update_timestamp in order to make it explicit for callers the update_timestamp is also changed. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1464309023-3254-3-git-send-email-duarte@scylladb.com>	2016-05-27 09:11:39 +03:00
Duarte Nunes	8c0e2e05b7	gossip: Fix modification to shadow endpoint state This patch fixes an inadvertent change to the shadow endpoint state map in gossiper::run, done by calling get_heart_beat_state() which also updates the endpoint state's timestamp. This did not happen for the normal map, but did happen for the shadow map. As a result, every time gossiper::run() was scheduled, endpoint_map_changed would always be true and all the shards would make superfluous copies of the endpoint state maps. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1464309023-3254-2-git-send-email-duarte@scylladb.com>	2016-05-27 09:10:38 +03:00
Pekka Enberg	b7e79b72d5	Merge "Introduce SET_NIC for non-AMI environment" from Takuya "This patchset provides a way to enable SET_NIC(posix_net_conf.sh) on non-AMI environment. Also support -mq option of the script. This also contains number of bug fixes of scripts. Fixes #1192"	2016-05-26 13:37:06 +03:00
Yoav Kleinberger	26c0d86401	tools/scyllatop: improved user interface: scrollable views NOTE: scyllatop now requires the urwid library previously, if there were more metrics that lines in the terminal window, the user could not see some of the metrics. Now the user can scroll. As an added bonus, the program will not crash when the window size changes. Signed-off-by: Yoav Kleinberger <yoav@scylladb.com> Message-Id: <1464098832-5755-1-git-send-email-yoav@scylladb.com>	2016-05-26 13:36:28 +03:00
Piotr Jastrzebski	136b8148d2	Use idle CPU to compact LSA memory Register an idle CPU handler that compacts a single segment every time there's nothing better to execute on CPU. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <c26aa608a1e0752fb9e6db1833ef3ba1de95f161.1464169748.git.piotr@scylladb.com>	2016-05-26 12:43:53 +03:00
Avi Kivity	d7f36a093f	Merge seastar upstream * seastar e5faea8...8bfbb1a (1): > reactor: advertise the logging_failures metric as a DERIVE counter Fixes #1292.	2016-05-26 11:46:08 +03:00
Tomasz Grabiec	f0c2b1d161	config: Fix typos Message-Id: <1464201938-4778-1-git-send-email-tgrabiec@scylladb.com>	2016-05-26 08:19:57 +03:00
Asias He	f1b3cb4a08	storage_service: Catch and fail an invalid configuration with --replace-address Vlad reported a strange user configuration: SCYLLA_ARGS="--log-to-syslog 1 --log-to-stdout 0 --default-log-level info --collectd-address=127.0.0.1:25826 --collectd=1 --collectd-poll-period 60000 --network-stack posix --num-io-queues 32 --max-io-requests 128 --replace-address 10.0.4.131" seed_provider: - class_name: org.apache.cassandra.locator.SimpleSeedProvider parameters: - seeds: "10.0.4.131" In the mean while, 10.0.4.131 is the IP address of the node itself. When the node was started, the following message were reported. Apr 13 06:31:12 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (20 seconds passed) Apr 13 06:31:13 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (21 seconds passed) Apr 13 06:31:14 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (22 seconds passed) Apr 13 06:31:15 n0 scylla[19681]: [shard 0] gossip - Connect seeds again ... (23 seconds passed) The configruation is invalid, becasue for --replace-address to work, at least one working seed node should be alive. Catch the configuration error and fail it with an appropriate error message. Fixes #1183 Message-Id: <a94a082d896313e7a668915ae21fe2c03719da3a.1464164058.git.asias@scylladb.com>	2016-05-25 14:42:19 +03:00
Asias He	fed1e65e1e	gossip: Do not insert the same node into _live_endpoints_just_added _live_endpoints_just_added tracks the peer node which just becomes live. When a down node gets back, the peer nodes can receive multiple messages which would mark the node up, e.g., the message piled up in the sender's tcp stack, after a node was blocked with gdb and released. Each such message will trigger a echo message and when the reply of the echo message is received (real_mark_alive), the same node will be added to _live_endpoints_just_added.push_back more than once. Thus, we see the same node be favored more than once: INFO 2016-04-12 12:09:57,399 [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 INFO 2016-04-12 12:09:58,412 [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 INFO 2016-04-12 12:09:59,429 [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 INFO 2016-04-12 12:10:00,429 [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 INFO 2016-04-12 12:10:01,430 [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 INFO 2016-04-12 12:10:02,442 [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 INFO 2016-04-12 12:10:03,454 [shard 0] gossip - do_gossip_to_live_member: Favor newly added node 127.0.0.2 To fix, do not insert the node if it is already in _live_endpoints_just_added. Fixes #1178 Message-Id: <6bcfad4430fbc63b4a8c40ec86a2744bdfafb40f.1464161975.git.asias@scylladb.com>	2016-05-25 14:19:40 +03:00
Glauber Costa	46f60f52d9	database: do not use implicitly stated seal function when closing the CF In commit `4981362f57`, I have introduced a regression that was thankfully caught by our dtest infrastructure. That patch is a preparation patch for the active reclaim patchset that is to come, and it consolidated all the flushes using the memtable_list's seal_fn function instead of calling the seal function explicitly. The problem here is that the streaming memtables have the delayed mechanism, about which the memtable_list is unaware. Calling memtable_list's seal_active_memtable() for the streaming memtables calls the delayed version, that does not guarantee flush. If we're lucky, we will indeed flush after the timer expires, but if we're not we'll just stop the CF with data not flushed. There are two options to fix this: the first is to teach the memtable_list about the delayed/forced mechanism, and the second is to just call the correct function explicitly during shutdown, and then when the time comes to add continuations to the result of the seal, add them here as well. Although the second option involves a bit more work and duplication, I think it is better in the sense that the delayed / forced mechanism really is something that belong to the streaming only. Being this the only user, I don't think it justifies complicating the memtable_list with this concept. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <b26017c825ccf585f39f58c4ab3787d78e551f5f.1464126884.git.glauber@scylladb.com>	2016-05-25 08:21:24 +03:00
Avi Kivity	2d4d6c9c92	Merge seastar upstream * seastar aed893e...e5faea8 (5): > Catch exceptions thrown by idle cpu handler > core::gate: add a get_count() method > reactor: Introduce idle CPU handler > core: add missing header for g++-4.9 > Add lksctp-tools-devel do required packages	2016-05-24 20:42:41 +03:00
Pekka Enberg	ceb29f9d32	Merge "Introduce upload dir for sstable migration" from Raphael "This change is intended to make migration process safer and easier. All column families will now have a directory called upload. With this feature, users may choose to copy migrated sstables to upload directory of respective column families, and run 'nodetool refresh'. That's supposed to be the preferred option from now on."	2016-05-24 16:36:47 +03:00
Pekka Enberg	d7d8c76fe5	transport/server: Add CQL frame LZ4 compression support The default CQL frame compression algorithm in Cassandra is LZ4. Add support for decompressing incoming frames and compressing outgoing frames with LZ4 if the CQL driver asks for that. Fixes #416 Message-Id: <1464086807-11325-1-git-send-email-penberg@scylladb.com>	2016-05-24 15:03:33 +03:00
Takuya ASADA	53cebb4a5e	dist/ubuntu: don't rebuild dependency packages by default Same as CentOS, do not build dependencies by default, install binary packages from our repository. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1464023451-21436-1-git-send-email-syuu@scylladb.com>	2016-05-24 14:10:59 +03:00
Avi Kivity	9637c2232c	Merge "Move the JMX timer polling logic to Scylla" from Amnon	2016-05-24 13:07:52 +03:00
Raphael S. Carvalho	c2fa3b796d	db: fix read consistency after refresh If sstable loaded by refresh covers a row that is cached by the column family, read query may fail to return consistent data. What we should do is to clear cache for the column family being loaded with new sstables. Fixes #1212. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <a08c9885a5ceb0b2991e40337acf5b7679580a66.1464072720.git.raphaelsc@scylladb.com>	2016-05-24 12:11:41 +03:00
Takuya ASADA	5d5d525a14	dist/ubuntu: fix incorrect dependency package name PyYAML is CentOS/RHEL/Fedora package name, python-yaml is correct one. Fixes #1279 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1463987823-22837-1-git-send-email-syuu@scylladb.com>	2016-05-23 10:21:29 +03:00
Pekka Enberg	8a7197e390	dist/docker: Fetch RPM repository from Scylla web site Fix the hard-coded Scylla RPM repository by downloading it from Scylla web site. This makes it easier to switch between different versions. Message-Id: <1463981271-25231-1-git-send-email-penberg@scylladb.com>	2016-05-23 09:45:41 +03:00
Piotr Jastrzebski	2be4ec4e06	Add lksctp-tools-devel to required packages in fedora build instructions. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <15f3db34f12f01cb9da32fd14c16ba87e64ad5f4.1463947999.git.piotr@scylladb.com>	2016-05-23 08:26:02 +03:00
Avi Kivity	5e5317b228	dist: add build dependencies for sctp Required by new seastar	2016-05-22 19:10:25 +03:00
Avi Kivity	5bb1255da1	Merge seastar upstream * seastar 6a849ac...aed893e (3): > net: move 'transport' enum to seastar namespace > net: sctp protocol support for posix stack > future: Support get() when state is at a promise	2016-05-22 16:32:33 +03:00
Amnon Heiman	e26002d581	idl-compiler: default constructor of complex types This patch solve a problem where a complex type is define as version depended (with the version attribute) but doesn't have a default value. In those cases the default constructor is used, but in the case of complex types (template) param_type should be use to get the C++ type. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1463916723-15322-1-git-send-email-amnon@scylladb.com>	2016-05-22 15:32:29 +03:00
Raphael S. Carvalho	e5f0314afd	db: introduce upload directory for sstable migration This change is intended to make migration process safer and easier. All column families will now have a directory called upload. With this feature, users may choose to copy migrated sstables to upload directory of respective column families, and call 'nodetool refresh'. That's supposed to be the preferred option from now on. For each sstable in upload directory, refresh will do the following: 1) Mutate sstable level to 0. 2) Create hard links to its components in column family dir, using a new generation. We make it safe by creating a hard link to temporary TOC first. 3) Remove all of its components in upload directory. This new code runs after refresh checked for new sstables in the column family directory. Otherwise, we could have a generation conflict. Unlike the first step, this new step runs with sstable write enabled. It's easier here because we know exactly which sstables are new. After that, refresh will load new sstables found in column family and upload directories. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-20 17:26:21 -03:00
Raphael S. Carvalho	70b793e4d3	tests: add test for statistics rewrite Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-20 17:26:12 -03:00
Raphael S. Carvalho	74c8a87777	sstables: fix statistics rewrite It's not working because it tries to overwrite existing statistics file with exclusive flag. It's fixed by writing new statistics into temporary file and renaming it into place. If Scylla failed in middle of rewrite, a temporary file is left over. So boot code was adjusted to delete a temporary file created by this rewrite procedure. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-20 17:24:15 -03:00
Pekka Enberg	94e7e61cd0	api: Register snitch API earlier Currently, we register snitch API in set_server_gossip_settle() which waits until a node has joined the cluster. This makes 'nodetool status' not properly show the status of a joining node. Fix the issue by registering snitch API earlier. Fixes #1269. Message-Id: <1463576381-15484-1-git-send-email-penberg@scylladb.com>	2016-05-20 14:24:14 +03:00
Gleb Natapov	7a54b5ebbb	gossiper: cleanup mark_alive() even more Message-Id: <20160519100513.GE984@scylladb.com>	2016-05-19 12:47:19 +02:00
Takuya ASADA	03a762bb0b	dist/common/scripts: Ask to set SET_NIC=yes on scylla_setup interactive prompt We supported SET_NIC on non-AMI environment, so ask user to use it on scylla_setup interactive prompt.	2016-05-19 06:26:23 +09:00
Takuya ASADA	88fde0a91e	dist/ami: fix dependency unresolved error on AMI build script with local package, by adding scylla-conf package Since we added scylla-conf package, we cannot install scylla-server/-tools without the package, because of this --localrpm is failing. So copy scylla-conf package to AMI, and install it to fix the problem.	2016-05-19 06:26:23 +09:00
Takuya ASADA	898243929f	dist/common/scripts: specify queue settings for posix_net_conf.sh on scylla_prepare posix_net_conf.sh wants -sq/-mq options, so detect number of queues and specify the option in scylla_prepare.	2016-05-19 06:26:23 +09:00
Takuya ASADA	f84b7b094f	dist/common/scripts: drop special condition to enable SET_NIC on AMI, do this on AMI installation script Remove special case of SET_NIC in AMI, do this in scylla-ami-setup.service.	2016-05-19 06:25:41 +09:00
Takuya ASADA	49cdd0b786	dist: move '--cpuset' and '--smp' configuration to scylla_cpuset_setup / cpuset.conf These parameters are only required for AMI, not for non-AMI environment which want to enable SET_NIC, so split them to indivisual script / conf file, call it from AMI install script.	2016-05-19 06:25:28 +09:00
Takuya ASADA	46fa80a5a6	dist/common/scripts: replace IFNAME variable when --nic specified to scylla_sysconfig_setup scylla_sysconfig_setup has bug that it not replaces IFNAME variable, so fixed.	2016-05-19 06:25:15 +09:00
Glauber Costa	4eff07d773	database: reorder initialization In a preparation move for the LSA throttler, we have reordered the initialization fields in database.hh so that the sizes of the regions are computed before the initialization of the region. However, that seemingly innocent move broke one of our tests. The reason behind that, is that if we don't destroy the column families before destroying the region, we may end up with a use after free in the memtable destructor - that itself expects to call into the region. This patch reorders the initialization so that the CF list still comes after the dirty regions (therefore being destroyed first), while maintaining the relative ordering between size / region that we needed in the first place. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <0669984b5bccdb2c950f2444bdee4427abad56ba.1463508884.git.glauber@scylladb.com>	2016-05-18 11:02:40 +03:00
Asias He	eb9ac9ab91	gms: Optimize gossiper::is_alive In perf-flame, I saw in service::storage_proxy::create_write_response_handler (2.66% cpu) gossiper::is_alive takes 0.72% cpu locator::token_metadata::pending_endpoints_for takes 1.2% cpu After this patch: service::storage_proxy::create_write_response_handler (2.17% cpu) gossiper::is_alive does not show up at all locator::token_metadata::pending_endpoints_for takes 1.3% cpu There is no need to copy the endpoint_state from the endpoint_state_map to check if a node is alive. Optimize it since gossiper::is_alive is called in the fast path. Message-Id: <2144310aef8d170cab34a2c96cb67cabca761ca8.1463540290.git.asias@scylladb.com>	2016-05-18 10:12:38 +03:00
Avi Kivity	6ec0000df8	Merge "fix migration of tables with level > 0" from Rapahel	2016-05-17 19:14:01 +03:00
Raphael S. Carvalho	cbc2e96a58	tests: check that overlapping sstable has its level changed to 0 Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-17 11:11:05 -03:00
Raphael S. Carvalho	ee0f66eef6	db: fix migration of sstables with level greater than 0 Refresh will rewrite statistics of any migrated sstable with level > 0. However, this operation is currently not working because O_EXCL flag is used, meaning that create will fail. It turns out that we don't actually need to change on-disk level of a sstable by overwriting statistics file. We can only set in-memory level of a sstable to 0. If Scylla reboots before all migrated sstables are compacted, leveled strategy is smart enough to detect sstables that overlap, and set their in-memory level to 0. Fixes #1124. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-17 11:08:08 -03:00
Gleb Natapov	76e0eb426e	gossiper: simplify mark_alive() The code runs in a thread so there is no need to use heap to communicate between statements. Message-Id: <20160517120245.GK984@scylladb.com>	2016-05-17 15:37:21 +03:00
Avi Kivity	4413176051	Merge "reduce performance degradation when adding node" from Asias "With this series, the operations per second drop during adding node period gets much better. Before: 45K to 10K After: 45k to 38K Refs: #1223 "	2016-05-17 14:31:31 +03:00
Asias He	089734474b	token_metadata: Speed up pending_endpoints_for pending_endpoints_for is called frequently by storage_proxy::create_write_response_handler when doing cql query. Before this patch, each call to pending_endpoints_for involves converting a multimap (std::unordered_multimap<range<token>, inet_address>>) to map (std::unordered_map<range<token>, std::unordered_set<inet_address>>). To speed up the token to pending endpoint mapping search, a interval map is introduced. It is faster than searching the map linearly and can avoid caching the token/pending endpoint mapping. With this patch, the operations per second drop during adding node period gets much better. Before: 45K to 10K After: 45k to 38K (The number is measured with the streaming code skipping to send data to rule out the streaming factor.) Refs: #1223	2016-05-17 17:32:15 +08:00
Asias He	ee0585cee9	dht: Add default constructor for token It is needed to put token in to a boost interval_map in the following patch.	2016-05-17 17:32:15 +08:00
Amnon Heiman	ad34f80e6f	API: change cache_service, column_family and storage_proxy to rate object The API would expose now the rate_moving_average and rate_moving_average_and_histogram. The old end points remains for the transition period, but marked as depricated. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:56:52 +03:00
Amnon Heiman	b33ed48527	API Definition: change cache_service, column_family and storage_proxy to use rate objects This patch replaces the latency histogram to rate_moving_avrage_and_histogram and the counters to rate_moving_average. The old endpoints where left unchagned but marked as depricated when needed.	2016-05-17 11:55:06 +03:00
Amnon Heiman	20a48b0f20	API: column family stats break the map_reduce functionality This patch replaces the helper function for column family with two function, one that collect the relevant column family from all shareds and another one that do the translation to json object. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:53:15 +03:00
Amnon Heiman	750f30cf07	column_family: Change histogram to timed_rate_moving_average_and_histogram As part of moving the derived statistic in to scylla, this replaces the histogram object in the column_family to timed_rate_moving_average_and_histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:53:15 +03:00
Amnon Heiman	468bcfbf1f	row_cache: Change counter to timed_rate_moving_average_and_histogram As part of moving the derived statistic in to scylla, this replaces the counter in the row_cache stats to timed_rate_moving_average_and_histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:53:15 +03:00

1 2 3 4 5 ...

9398 Commits