scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 06:05:53 +00:00

Author	SHA1	Message	Date
Amnon Heiman	2086c651ba	column_family: get_snapshot_details should return empty map for no snapshots If there is no snapshot directory for the specific column family, get_snapshot_details should return an empty map. This patch check that a directory exists before trying to iterate over it. Fixes #619 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-07 12:51:04 +01:00
Tomasz Grabiec	b43b5af894	Merge tag 'tgrabiec/make-future-values-nothrow-move-constructible-v3' from seastar-dev.git Seastar's future<> now requires types to be nothrow move constructible. This series makes Scylla code comply.	2015-12-07 10:43:18 +01:00
Tomasz Grabiec	95f515a6bd	Move seastar submodule head Scylla changes: sstable.cc: Remove file_exists() function which conflicts with seastar's Amnon Heiman (2): reactor: Add file_exists method Add a wrapper for file_exists Avi Kivity (2): Merge "Introduce shared_future" from Tomasz Merge ""scripts: a few fixes in posix_net_conf.sh" from Vlad Gleb Natapov (3): rpc: not stop client in error state avoid allocation in parallel_for_each is there is nothing to do memory: fix size_to_idx calculation Nadav Har'El (1): test: fix use-after-free in timertest Pawe�� Dziepak (1): memory: use size instead of old_size to shrink memory block Tomasz Grabiec (7): file: Mark move constructor as noexcept core: future: Add static asserts about type's noexcept guarantees core: future: Drop now redundant move_noexcept flag core: future_state: Make state getters non-destructive for non-rvalue-refs core: future: Make get_available_state() noexcept core: Introduce shared_future Make json_return_type movable Vlad Zolotarov (8): scripts: posix_net_conf.sh: ban NIC IRQs from being moved by irqbalance scripts: posix_net_conf.sh: exclude CPU0 siblings from RPS scripts: posix_net_conf.sh: Configure XPS scripts: posix_net_conf.sh: Add a new mode for MQ NICs scripts: posix_net_conf.sh: increase some backlog sizes core: to_sstring(): cleanup core: to_sstring_strintf(): always use %g(or %lg) format for floating point values core: prevent explicit calls for to_sstring_sprintf()	2015-12-07 10:41:39 +01:00
Glauber Costa	79e70568d7	scylla-setup: do not add discard to the command line In a recent discussion with the XFS developers, Dave Chinner recommended us not to use discard, but rather issue fstrims explicitly. In machines like Amazon's c3-class, the situation is made worse by the fact that discard is not supported by the disk. Contrary to my intuition, adding the discard mount option in such situation is not a nop and will just create load for no reason. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-12-07 11:22:27 +02:00
Tomasz Grabiec	934d3f06d1	api: Make histogram reduction work on domain value instead of json objects Objects extending json_base are not movable, so we won't be able to pass them via future<>, which will assert that types are nothrow move constructible. This problem only affects httpd::utils_json::histogram, which is used in map-reduce. This patch changes the aggregation to work on domain value (utils::ihistrogram) instead of json objects.	2015-12-07 09:50:28 +01:00
Tomasz Grabiec	c0ac7b3a73	commitlog: Wrap subscription in a unique_ptr<> to make it nothrow movable future<> will require nothrow move constructible types.	2015-12-07 09:50:28 +01:00
Tomasz Grabiec	657841922a	Mark move constructors noexcept when possible	2015-12-07 09:50:27 +01:00
Tomasz Grabiec	fdc28a73f8	thrift: Make with_cob() handle not nothrow move constructible types	2015-12-07 09:50:27 +01:00
Tomasz Grabiec	538de7222a	Introduce noexcept_traits	2015-12-07 09:50:27 +01:00
Tomasz Grabiec	bc23ebcbc3	schema_tables: Replace schema_result::value_type with equivalent movable type future<> requires and will assert nothrow move constructible types.	2015-12-07 09:50:27 +01:00
Avi Kivity	91c2af2803	Merge "nodetool removenode fix + cleanup" from Asias	2015-12-07 10:41:51 +02:00
Takuya ASADA	2891291ad1	dist: add swagger-ui and api-doc on ubuntu package Fixes .deb part of #520 Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-07 10:39:59 +02:00
Takuya ASADA	3f0ca277e5	dist: add swagger-ui and api-doc on rpm package Fixes .rpm part of #520 Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-07 10:39:59 +02:00
Vlad Zolotarov	564cb2bcd1	gms::versioned_value: don't use to_sstring_sprintf() directly Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2015-12-06 12:24:54 +02:00
Raphael S. Carvalho	d435ca7da6	enable more logging for leveled compaction strategy Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2015-12-06 11:36:50 +02:00
Pekka Enberg	a95a7294ef	types: Fix 'varint' type value compatibility check Fixes #575. Signed-off-by: Pekka Enberg <penberg@scylladb.com>	2015-12-04 13:25:34 +01:00
Glauber Costa	5e8249f062	commitlog: fix but preventing flushing with default max_size value The config file expresses this number in MB, while total_memory() gives us a quantity in bytes. This causes the commitlog not to flush until we reach really skyhigh numbers. While we need this fix for the short term before we cook another release, I will note that for the mid/long term, it would be really helpful to stop representing memory amounts as integers, and use an explicit C++ type for those. That would have prevented this bug. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-12-04 09:29:19 +02:00
Vlad Zolotarov	cd215fc552	types: map::to_string() - non-empty implementation Print a map in the form of [(]{ key0 : value0 }[, { keyN : valueN }]*[)] The map is printed inside () brackets if it's frozen. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2015-12-03 18:46:12 +01:00
Amnon Heiman	54b4f26cb0	API: Change the compaction summary to use an object In origin, there are two APIs to get the information about the current running compactions. Both APIs do the string formatting. This patch changes the API to have a single API get_compaction that would return a list of summary object. The jmx would do the string formatting for the two APIs. This change gives a better API experience is it's better documented and would make it easier to support future format changes in origin. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-03 11:57:37 +02:00
Asias He	dcb9b441ab	storage_service: Fix debug build Start non-seed with debug build I saw: ==9844==WARNING: ASan is ignoring requested __asan_handle_no_return: stack top: 0x7ffdabd73000; bottom 0x7fe309218000; size: 0x001aa2b5b000 (114398965760) False positive error reports may follow For details see http://code.google.com/p/address-sanitizer/issues/detail?id=189 DEBUG [shard 0] storage_service - Starting shadow gossip round to check for endpoint collision ================================================================= ==9844==ERROR: AddressSanitizer: stack-buffer-overflow on address 0x7fe309219ad0 at pc 0x00000495a88e bp 0x7fe309219960 sp 0x7fe309219950 WRITE of size 8 at 0x7fe309219ad0 thread T0 #0 0x495a88d in _Head_base<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work> /usr/include/c++/5.1.1/tuple:115 #1 0x495a993 in _Tuple_impl<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work, std::default_delete<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work>, void> /usr/include/c++/5.1.1/tuple:213 #2 0x495aa73 in tuple<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work*, std::default_delete<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work>, void> /usr/include/c++/5.1.1/tuple:613 #3 0x495ab82 in unique_ptr /usr/include/c++/5.1.1/bits/unique_ptr.h:206 ... #16 0x4d44c8e in _M_invoke /usr/include/c++/5.1.1/functional:1871 #17 0x5d2fb7 in std::function<void ()>::operator()() const /usr/include/c++/5.1.1/functional:2271 #18 0x8a1e70 in seastar::thread_context::main() core/thread.cc:139 #19 0x8a1d89 in seastar::thread_context::s_main(unsigned int, unsigned int) core/thread.cc:130 #20 0x7fe311b6cf0f (/lib64/libc.so.6+0x48f0f) I'm not sure why this patch helps. Perhaps the exception makes ASAN unhappy. Anyway, this patch makes the debug build work again. Fixes #613.	2015-12-03 10:42:11 +02:00
Tomasz Grabiec	96d215168e	Merge tag 'asias/gossip_start_stop/fix/v1' from seastar-dev.git Fixes for issues in tests from Asias.	2015-12-03 09:10:55 +01:00
Tomasz Grabiec	9e0c498425	Merge branch 'dev/amnon/latency_clock_v2' From Amnon: After this series an example run of cfhistograms report a maximal 0.5s latency as it should	2015-12-02 19:58:43 +01:00
Amnon Heiman	1812fe9e70	API: Add the get_version to messaging_service swagger definition file Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-02 14:45:44 +02:00
Amnon Heiman	ae53604ed7	API: Add the get_version implementation to messaging service This patch adds the implementation to the get_version. After this patch the following url will be available: messaging_service/version?addr=127.0.0.1 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-02 13:29:40 +02:00
Avi Kivity	53e3e79349	Merge "API: Stubing the compaction manager" from Amnon "This series allows the compaction manager to be used by the nodetool as a stub implementation. It has two changes: * Add to the compaction manager API a method that returns a compaction info object * Stub all the compaction method so that it will create an unimplemented warning but will not fail, the API implementation will be reverted when the work on compaction will be completed."	2015-12-02 13:28:34 +02:00
Takuya ASADA	871bfb1c94	dist: generate correct distribution codename on debian/changelog Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-02 12:38:52 +02:00
Takuya ASADA	b61ea247d2	dist: check supported Ubuntu release Warn if unsupported release. Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-02 12:38:52 +02:00
Takuya ASADA	0c66c25250	dist: fix typo on scylla_prepare Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-02 11:30:15 +02:00
Asias He	3004866f59	gossip: Rename start to start_gossiping So that we have a more consistent name start_gossiping() and stop_gossiping() and it will not confuse with get_gossiper.start().	2015-12-02 16:50:34 +08:00
Asias He	5c3951b28a	gossip: Get rid of the handler helper	2015-12-02 16:50:34 +08:00
Asias He	7a6ad7aec2	gossip: Fix Assertion `local_is_initialized()' failed This patch fixes the following cql_query_test failure. cql_query_test: scylla/seastar/core/sharded.hh:439: Service& seastar::sharded<Service>::local() [with Service = gms::gossiper]: Assertion `local_is_initialized()' failed. The problem is in gossiper::stop() we call gossip::add_local_application_state() which will in turn call gms::get_local_gossiper(). In seastar::sharded::stop _instances[engine().cpu_id()].service = nullptr; return inst->stop().then([this, inst] { return _instances[engine().cpu_id()].freed.get_future(); }); We set the _instances to nullptr before we call the stop method, so local_is_initialized asserts when we try to access get_local_gossiper again. To fix, we make the stopping of gossiper explicit. In the shutdown procedure, we call stop_gossiping() explicitly. This has two more advantages: 1) The api to stop gossip is now calling the stop_gossiping() instead of sharing the seastar::sharded's stop method. 2) We can now get rid of the _handler seastar::sharded helper.	2015-12-02 16:50:34 +08:00
Asias He	e22972009b	gossip: Make log message in mark_dead stick to cassandra	2015-12-02 14:21:26 +08:00
Asias He	ad30cf0faf	failure_detector: Use a standalone logger name Do not share logger with gossip. Sometimes, it is useful to only see one of them.	2015-12-02 14:21:26 +08:00
Asias He	eb05dc680d	storage_service: Warn lost of data when remove a node If RF = 1 and one node is down, it is possible that data is lost. Warn this in the logger.	2015-12-02 14:21:26 +08:00
Asias He	0fe14e2b4b	storage_service: Do not ignore future in remove_node	2015-12-02 14:21:26 +08:00
Asias He	5a7f15ba49	storage_service: Run confirm_replication on cpu zero storage_service::_replicating_nodes is valid on cpu zero only.	2015-12-02 14:21:26 +08:00
Amnon Heiman	7e79d35f85	Estimated histogram: Clean the add interface The add interface of the estimated histogram is confusing as it is not clear what units are used. This patch removes the general add method and replace it with a add_nano that adds nanoseconds or add that gets duration. To be compatible with origin, nanoseconds vales are translated to microseconds.	2015-12-01 15:28:06 +02:00
Amnon Heiman	61abc85eb3	histogram: Add started counter This patch adds a started counter, that is used to mark the number of operation that were started. This counter serves two purposes, it is a better indication for when to sample the data and it is used to indicate how many pending operations are. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-01 15:28:06 +02:00
Amnon Heiman	88dcf2e935	latency: Switch to steady_clock The system clock is less suitable for for time difference than steady_clock. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-01 15:28:06 +02:00
Avi Kivity	f667e05e08	Merge "backport gosisp and storage_service fix" from Asias "This contains most bug fixes from imported version commit 38847a6bd967e4f41bc7b1fc83629161a2c214dc to c* 2.1.11 for gossip and storage_service."	2015-12-01 14:42:41 +02:00
Asias He	dc6f2157e7	Update ORIGIN for gossip and storage_service	2015-12-01 19:45:04 +08:00
Amnon Heiman	3674ee2fc1	API: get snapshot size This patch adds the column family API that return the snapshot size. The changes in the swagger definition file follo origin so the same API will be used for the metric and the column_family. The implementation is based on the get_snapshot_details in the column_family. This fix: 425 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-01 11:41:52 +02:00
Asias He	56df32ba56	gossip: Mark node as dead even if already left Backport: CASSANDRA-10205 484e645 Mark node as dead even if already left	2015-12-01 17:29:25 +08:00
Asias He	59694a8e43	failure_detector: Print versions for gossip states in gossipinfo Backport: CASSANDRA-10330 ae4cd69 Print versions for gossip states in gossipinfo For instance, the version for each state, which can be useful for diagnosing the reason for any missing states. Also instead of just omitting the TOKENS state, let's indicate whether the state was actually present or not.	2015-12-01 17:29:25 +08:00
Asias He	af91a8f31b	storage_service: Fix transition from write survey to normal mode Backport: CASSANDRA-9740 52dbc3f Can't transition from write survey to normal mode	2015-12-01 17:29:25 +08:00
Asias He	2f071d9648	storage_service: Refuse to decommission if not in state NORMAL Backport: CASSANDRA-8741 5bc56c3 refuse to decomission if not in state NORMAL	2015-12-01 17:29:25 +08:00
Asias He	224db2ba37	failure_detector: Don't mark nodes down before the max local pause interval once paused Backport: CASSANDRA-9446 7fba3d2 Don't mark nodes down before the max local pause interval once paused	2015-12-01 17:29:25 +08:00
Asias He	51fcc48700	failure_detector: Failure detector detects and ignores local pauses Backport: CASSANDRA-9183 4012134 Failure detector detects and ignores local pauses	2015-12-01 17:29:25 +08:00
Asias He	1b9e350614	gossip: Do not print node is now part of the cluster during gossip shadow round With Node 1 (Seed node, Port 7000 is opened, 10.184.9.144) Node 2 (Port 7000 is opened, 10.184.9.145) Node 3 (Port 7000 is blocked by firewall) On Node 3, we saw the following error which was very confusing: Node 3 saw Node 1 and Node 3 but it complained it can not contact any seeds. The message "Node 10.184.9.144 is now part of the cluster" and friends are actually messages printed during the gossip shadow round where Node 3 connects to Node 1's port 7000 and Node 1 returns all info it knows to Node 3, so that Node 3 knows Node 1 and Node 2 and we see the "Node 10.184.9.144/145 is now part of the cluster" message. However, during the normal gossip round, Node 3 will not mark Node 1 and Node 2 UP until the Seed node initiates a gossip round to Node 3, (note port 7000 on node 3 is blocked in this case). So Node 3 will not mark Node 1 and Node 2 UP and we see the "Unable to contact any seeds" error. [shard 0] storage_service - Loading persisted ring state [shard 0] gossip - Node 10.184.9.144 is now part of the cluster [shard 0] gossip - inet_address 10.184.9.144 is now UP [shard 0] gossip - Node 10.184.9.145 is now part of the cluster [shard 0] gossip - inet_address 10.184.9.145 is now UP [shard 0] storage_service - Starting up server gossip scylla_run[12479]: Start gossiper service ... [shard 0] storage_service - JOINING: waiting for ring information [shard 0] storage_service - JOINING: schema complete, ready to bootstrap [shard 0] storage_service - JOINING: waiting for pending range calculation [shard 0] storage_service - JOINING: calculation complete, ready to bootstrap [shard 0] storage_service - JOINING: getting bootstrap token [shard 0] storage_service - JOINING: sleeping 5000 ms for pending range setup scylla_run[12479]: Exiting on unhandled exception of type 'std::runtime_error': Unable to contact any seeds!	2015-12-01 17:29:25 +08:00
Asias He	f62a6f234b	gossip: Add shutdown gossip state Backported: CASSANDRA-8336 and CASSANDRA-9871 84b2846 remove redundant state b2c62bb Add shutdown gossip state to prevent timeouts during rolling restarts 8f9ca07 Cannot replace token does not exist - DN node removed as Fat Client Fixes: When X is shutdown, X sends SHUTDOWN message to both Y and Z, but for some reason, only Y receives the message and Z does not receive the message. If Z has a higher gossip version for X than Y has for X, Z will initiate a gossip with Y and Y will mark X alive again. X ------> Y \ / \ / Z	2015-12-01 17:29:25 +08:00

1 2 3 4 5 ...

7546 Commits