scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Author	SHA1	Message	Date
Asias He	ccbd801f40	storage_service: Fix decommissioned nodes are willing to rejoin the cluster if restarted Backport: CASSANDRA-8801 a53a6ce Decommissioned nodes will not rejoin the cluster. Tested with: topology_test.py:TestTopology.decommissioned_node_cant_rejoin_test	2015-12-09 10:43:51 +08:00
Asias He	b3dd2d976a	storage_service: Simplify prepare_to_join with seastar thread	2015-12-09 10:43:51 +08:00
Asias He	e9a4d93d1b	storage_service: Fix added node not showing up in nodetool in status joining The get_token_endpoint API should return a map of tokens to endpoints, including the bootstrapping ones. Use get_local_storage_service().get_token_to_endpoint_map() for it. $ nodetool -p 7100 status Status=Up/Down \|/ State=Normal/Leaving/Joining/Moving -- Address Load Tokens Owns Host ID Rack UN 127.0.0.1 12645 256 ? eac5b6cf-5fda-4447-8104-a7bf3b773aba rack1 UN 127.0.0.2 12635 256 ? 2ad1b7df-c8ad-4cbc-b1f1-059121d2f0c7 rack1 UN 127.0.0.3 12624 256 ? 61f82ea7-637d-4083-acc9-567e0c01b490 rack1 UJ 127.0.0.4 ? 256 ? ced2725e-a5a4-4ac3-86de-e1c66cecfb8d rack1 Fixes #617	2015-12-09 10:43:51 +08:00
Tomasz Grabiec	d68a8b5349	Merge branch 'dev/amnon/index_summary_size_v2' from seastar-dev.git API for getting sstable index summary memory footprint from Amnon	2015-12-08 20:03:39 +01:00
Tomasz Grabiec	6ead7a0ec5	Merge tag 'large-blobs/v3' from git@github.com:avikivity/scylla.git Scattering of blobs from Avi: This patchset converts the stack to scatter managed_bytes in lsa memory, allowing large blobs (and collections) to be stored in memtable and cache. Outside memtable/cache, they are still stored sequentially, but it is assumed that the number of transient objects is bounded. The approach taken here is to scatter managed_bytes data in multiple blob_storage objects, but to linearize them back when accessing (for example, to merge cells). This allows simple access through the normal bytes_view. It causes an extra two copies, but copying a megabyte twice is cheap compared to accessing a megabyte's worth of small cells, so per-byte throughput is increased. Testing show that lsa large object space is kept at zero, but throughput is bad because Scylla easily overwhelms the disk with large blobs; we'll need Glauber's throttling patches or a really fast disk to see good throughput with this.	2015-12-08 19:15:13 +01:00
Avi Kivity	5c5331d910	tests: test large blobs in memtables	2015-12-08 15:17:09 +02:00
Avi Kivity	0c2fba7e0b	lsa: advertize our preferred maximum allocation size Let managed_bytes know that allocating below a tenth of the segment size is the right thing to do.	2015-12-08 15:17:09 +02:00
Avi Kivity	f9e2a9a086	mutation_partition: work on linearized atomic_cell_or_mutation objects Ensure that when we examine atomic_cell_or_mutation objects for merging, that they are contiguous in memory. When we are done we scatter them again.	2015-12-08 15:17:09 +02:00
Avi Kivity	ad975ad629	atomic_cell_or_collection: linearize(), unlinearize() Add linearize() and unlinearize() methods that allow making an atomic_cell_or_collection object temporarily contiguous, so we can examine it as a bytes_view.	2015-12-08 15:17:09 +02:00
Avi Kivity	13324607e6	managed_bytes: conform to allocation_strategy's max_preferred_allocation_size Instead of allocating a single blob_storage, chain multiple blob_storage objects in a list, each limited not to exceed the allocation_strategy's max_preferred_allocation_size. This allows lsa to allocate each blob_storage object as an lsa managed object that can be migrated in memory. Also provide linearize()/scatter() methods that can be used to temporarily consolidate the storage into a single blob_storage. This makes the data contiguous, so we can use a regular bytes_view to examine it.	2015-12-08 15:17:08 +02:00
Takuya ASADA	8c98e239d0	dist: use /etc/scylla as SCYLLA_CONF directory on AMI We don't need copy /var/lib/scylla/conf on RAID anymore, it moved to /etc/scylla. Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-08 11:09:12 +02:00
Avi Kivity	098136f4ab	Merge "Convert serialization of query::result to use db::serializer<>" from Tomasz Reviewed-by: Nadav Har'El <nyh@scylladb.com>	2015-12-07 16:53:34 +02:00
Amnon Heiman	3ce7fa181c	API: Add the implementation for index_summary_off_heap_memory This adds the implementation for the index_summary_off_heap_memory for a single column family and for all of them. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-07 15:15:39 +02:00
Amnon Heiman	e786f1d02f	sstable: Add get_summary function The get_summary method returns a const reference to the summary object. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-07 14:52:18 +02:00
Amnon Heiman	bae286a5b4	Add memory_footprint method to summary_ka Similiar to origin, off heap memory, memory_footprint is the size of queus multiply by the structure size. memory_footprint is used by the API to report the memory that is taken by the summary. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-07 14:52:18 +02:00
Amnon Heiman	2086c651ba	column_family: get_snapshot_details should return empty map for no snapshots If there is no snapshot directory for the specific column family, get_snapshot_details should return an empty map. This patch check that a directory exists before trying to iterate over it. Fixes #619 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-07 12:51:04 +01:00
Tomasz Grabiec	b43b5af894	Merge tag 'tgrabiec/make-future-values-nothrow-move-constructible-v3' from seastar-dev.git Seastar's future<> now requires types to be nothrow move constructible. This series makes Scylla code comply.	2015-12-07 10:43:18 +01:00
Tomasz Grabiec	95f515a6bd	Move seastar submodule head Scylla changes: sstable.cc: Remove file_exists() function which conflicts with seastar's Amnon Heiman (2): reactor: Add file_exists method Add a wrapper for file_exists Avi Kivity (2): Merge "Introduce shared_future" from Tomasz Merge ""scripts: a few fixes in posix_net_conf.sh" from Vlad Gleb Natapov (3): rpc: not stop client in error state avoid allocation in parallel_for_each is there is nothing to do memory: fix size_to_idx calculation Nadav Har'El (1): test: fix use-after-free in timertest Pawe�� Dziepak (1): memory: use size instead of old_size to shrink memory block Tomasz Grabiec (7): file: Mark move constructor as noexcept core: future: Add static asserts about type's noexcept guarantees core: future: Drop now redundant move_noexcept flag core: future_state: Make state getters non-destructive for non-rvalue-refs core: future: Make get_available_state() noexcept core: Introduce shared_future Make json_return_type movable Vlad Zolotarov (8): scripts: posix_net_conf.sh: ban NIC IRQs from being moved by irqbalance scripts: posix_net_conf.sh: exclude CPU0 siblings from RPS scripts: posix_net_conf.sh: Configure XPS scripts: posix_net_conf.sh: Add a new mode for MQ NICs scripts: posix_net_conf.sh: increase some backlog sizes core: to_sstring(): cleanup core: to_sstring_strintf(): always use %g(or %lg) format for floating point values core: prevent explicit calls for to_sstring_sprintf()	2015-12-07 10:41:39 +01:00
Glauber Costa	79e70568d7	scylla-setup: do not add discard to the command line In a recent discussion with the XFS developers, Dave Chinner recommended us not to use discard, but rather issue fstrims explicitly. In machines like Amazon's c3-class, the situation is made worse by the fact that discard is not supported by the disk. Contrary to my intuition, adding the discard mount option in such situation is not a nop and will just create load for no reason. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-12-07 11:22:27 +02:00
Tomasz Grabiec	934d3f06d1	api: Make histogram reduction work on domain value instead of json objects Objects extending json_base are not movable, so we won't be able to pass them via future<>, which will assert that types are nothrow move constructible. This problem only affects httpd::utils_json::histogram, which is used in map-reduce. This patch changes the aggregation to work on domain value (utils::ihistrogram) instead of json objects.	2015-12-07 09:50:28 +01:00
Tomasz Grabiec	c0ac7b3a73	commitlog: Wrap subscription in a unique_ptr<> to make it nothrow movable future<> will require nothrow move constructible types.	2015-12-07 09:50:28 +01:00
Tomasz Grabiec	657841922a	Mark move constructors noexcept when possible	2015-12-07 09:50:27 +01:00
Tomasz Grabiec	fdc28a73f8	thrift: Make with_cob() handle not nothrow move constructible types	2015-12-07 09:50:27 +01:00
Tomasz Grabiec	538de7222a	Introduce noexcept_traits	2015-12-07 09:50:27 +01:00
Tomasz Grabiec	bc23ebcbc3	schema_tables: Replace schema_result::value_type with equivalent movable type future<> requires and will assert nothrow move constructible types.	2015-12-07 09:50:27 +01:00
Avi Kivity	91c2af2803	Merge "nodetool removenode fix + cleanup" from Asias	2015-12-07 10:41:51 +02:00
Takuya ASADA	2891291ad1	dist: add swagger-ui and api-doc on ubuntu package Fixes .deb part of #520 Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-07 10:39:59 +02:00
Takuya ASADA	3f0ca277e5	dist: add swagger-ui and api-doc on rpm package Fixes .rpm part of #520 Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-07 10:39:59 +02:00
Avi Kivity	2437fc956c	allocation_strategy: expose preferred allocation size limit Our premier allocation_strategy, lsa, prefers to limit allocations below a tenth of the segment size so they can be moved around; larger allocations are pinned and can cause memory fragmentation. Provide an API so that objects can query for this preferred size limit. For now, lsa is not updated to expose its own limit; this will be done after the full stack is updated to make use of the limit, or intermediate steps will not work correctly.	2015-12-06 16:23:42 +02:00
Vlad Zolotarov	564cb2bcd1	gms::versioned_value: don't use to_sstring_sprintf() directly Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2015-12-06 12:24:54 +02:00
Raphael S. Carvalho	d435ca7da6	enable more logging for leveled compaction strategy Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2015-12-06 11:36:50 +02:00
Pekka Enberg	a95a7294ef	types: Fix 'varint' type value compatibility check Fixes #575. Signed-off-by: Pekka Enberg <penberg@scylladb.com>	2015-12-04 13:25:34 +01:00
Glauber Costa	5e8249f062	commitlog: fix but preventing flushing with default max_size value The config file expresses this number in MB, while total_memory() gives us a quantity in bytes. This causes the commitlog not to flush until we reach really skyhigh numbers. While we need this fix for the short term before we cook another release, I will note that for the mid/long term, it would be really helpful to stop representing memory amounts as integers, and use an explicit C++ type for those. That would have prevented this bug. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-12-04 09:29:19 +02:00
Vlad Zolotarov	cd215fc552	types: map::to_string() - non-empty implementation Print a map in the form of [(]{ key0 : value0 }[, { keyN : valueN }]*[)] The map is printed inside () brackets if it's frozen. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2015-12-03 18:46:12 +01:00
Amnon Heiman	54b4f26cb0	API: Change the compaction summary to use an object In origin, there are two APIs to get the information about the current running compactions. Both APIs do the string formatting. This patch changes the API to have a single API get_compaction that would return a list of summary object. The jmx would do the string formatting for the two APIs. This change gives a better API experience is it's better documented and would make it easier to support future format changes in origin. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-03 11:57:37 +02:00
Asias He	dcb9b441ab	storage_service: Fix debug build Start non-seed with debug build I saw: ==9844==WARNING: ASan is ignoring requested __asan_handle_no_return: stack top: 0x7ffdabd73000; bottom 0x7fe309218000; size: 0x001aa2b5b000 (114398965760) False positive error reports may follow For details see http://code.google.com/p/address-sanitizer/issues/detail?id=189 DEBUG [shard 0] storage_service - Starting shadow gossip round to check for endpoint collision ================================================================= ==9844==ERROR: AddressSanitizer: stack-buffer-overflow on address 0x7fe309219ad0 at pc 0x00000495a88e bp 0x7fe309219960 sp 0x7fe309219950 WRITE of size 8 at 0x7fe309219ad0 thread T0 #0 0x495a88d in _Head_base<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work> /usr/include/c++/5.1.1/tuple:115 #1 0x495a993 in _Tuple_impl<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work, std::default_delete<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work>, void> /usr/include/c++/5.1.1/tuple:213 #2 0x495aa73 in tuple<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work*, std::default_delete<seastar::async(Func&&, Args&& ...) [with Func = service::storage_service::check_for_endpoint_collision()::<lambda()>; Args = {}; futurize_t<typename std::result_of<typename std::decay<_Tp>::type(std::decay_t<Args>...)>::type> = future<>]::work>, void> /usr/include/c++/5.1.1/tuple:613 #3 0x495ab82 in unique_ptr /usr/include/c++/5.1.1/bits/unique_ptr.h:206 ... #16 0x4d44c8e in _M_invoke /usr/include/c++/5.1.1/functional:1871 #17 0x5d2fb7 in std::function<void ()>::operator()() const /usr/include/c++/5.1.1/functional:2271 #18 0x8a1e70 in seastar::thread_context::main() core/thread.cc:139 #19 0x8a1d89 in seastar::thread_context::s_main(unsigned int, unsigned int) core/thread.cc:130 #20 0x7fe311b6cf0f (/lib64/libc.so.6+0x48f0f) I'm not sure why this patch helps. Perhaps the exception makes ASAN unhappy. Anyway, this patch makes the debug build work again. Fixes #613.	2015-12-03 10:42:11 +02:00
Tomasz Grabiec	d64db98943	query: Convert serialization of query::result to use db::serializer<> That's what we're trying to standardize on. This patch also fixes an issue with current query::result::serialize() not being const-qualified, because it modifies the buffer. messaging_service did a const cast to work this around, which is not safe.	2015-12-03 09:19:11 +01:00
Tomasz Grabiec	d4d3a5b620	bytes_ostream: Make size_type and value_type public	2015-12-03 09:19:11 +01:00
Tomasz Grabiec	96d215168e	Merge tag 'asias/gossip_start_stop/fix/v1' from seastar-dev.git Fixes for issues in tests from Asias.	2015-12-03 09:10:55 +01:00
Tomasz Grabiec	f0cfa61968	Relax header dependencies	2015-12-03 09:10:02 +01:00
Tomasz Grabiec	9e0c498425	Merge branch 'dev/amnon/latency_clock_v2' From Amnon: After this series an example run of cfhistograms report a maximal 0.5s latency as it should	2015-12-02 19:58:43 +01:00
Amnon Heiman	1812fe9e70	API: Add the get_version to messaging_service swagger definition file Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-02 14:45:44 +02:00
Amnon Heiman	ae53604ed7	API: Add the get_version implementation to messaging service This patch adds the implementation to the get_version. After this patch the following url will be available: messaging_service/version?addr=127.0.0.1 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-02 13:29:40 +02:00
Avi Kivity	53e3e79349	Merge "API: Stubing the compaction manager" from Amnon "This series allows the compaction manager to be used by the nodetool as a stub implementation. It has two changes: * Add to the compaction manager API a method that returns a compaction info object * Stub all the compaction method so that it will create an unimplemented warning but will not fail, the API implementation will be reverted when the work on compaction will be completed."	2015-12-02 13:28:34 +02:00
Takuya ASADA	871bfb1c94	dist: generate correct distribution codename on debian/changelog Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-02 12:38:52 +02:00
Takuya ASADA	b61ea247d2	dist: check supported Ubuntu release Warn if unsupported release. Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-02 12:38:52 +02:00
Takuya ASADA	0c66c25250	dist: fix typo on scylla_prepare Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2015-12-02 11:30:15 +02:00
Asias He	3004866f59	gossip: Rename start to start_gossiping So that we have a more consistent name start_gossiping() and stop_gossiping() and it will not confuse with get_gossiper.start().	2015-12-02 16:50:34 +08:00
Asias He	5c3951b28a	gossip: Get rid of the handler helper	2015-12-02 16:50:34 +08:00
Asias He	7a6ad7aec2	gossip: Fix Assertion `local_is_initialized()' failed This patch fixes the following cql_query_test failure. cql_query_test: scylla/seastar/core/sharded.hh:439: Service& seastar::sharded<Service>::local() [with Service = gms::gossiper]: Assertion `local_is_initialized()' failed. The problem is in gossiper::stop() we call gossip::add_local_application_state() which will in turn call gms::get_local_gossiper(). In seastar::sharded::stop _instances[engine().cpu_id()].service = nullptr; return inst->stop().then([this, inst] { return _instances[engine().cpu_id()].freed.get_future(); }); We set the _instances to nullptr before we call the stop method, so local_is_initialized asserts when we try to access get_local_gossiper again. To fix, we make the stopping of gossiper explicit. In the shutdown procedure, we call stop_gossiping() explicitly. This has two more advantages: 1) The api to stop gossip is now calling the stop_gossiping() instead of sharing the seastar::sharded's stop method. 2) We can now get rid of the _handler seastar::sharded helper.	2015-12-02 16:50:34 +08:00

1 2 3 4 5 ...

7565 Commits