scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 23:13:15 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	2dc78a6ca2	data::cell: expose size overhead of external chunks	2018-06-28 18:01:17 +01:00
Paweł Dziepak	6adc78d690	imr::utils::object: expose size overhead	2018-06-28 18:01:17 +01:00
Paweł Dziepak	e69f2c361c	tests/mutation: properly mark atomic_cells that are collection members	2018-06-28 18:00:39 +01:00
Vladimir Krivopalov	82f76b0947	Use std::reference_wrapper instead of a plain reference in bound_view. The presence of a plain reference prohibits the bound_view class from being copyable. The trick employed to work around that was to use 'placement new' for copy-assigning bound_view objects, but this approach is ill-formed and causes undefined behaviour for classes that have const and/or reference members. The solution is to use a std::reference_wrapper instead. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <a0c951649c7aef2f66612fc006c44f8a33713931.1530113273.git.vladimir@scylladb.com>	2018-06-28 11:24:06 +01:00
Avi Kivity	c87a961667	Merge "Add multishard_writer support" from Asias " We need a multishard_writer which gets mutation fragments from a producer (e.g., from the network using the rpc streaming) and consumes the mutation fragments with a consumer (e.g., write to sstable). The multishard_writer will take care of the mutation fragments that do not belong to the current shard. This multishard_writer will be used in the new scylla streaming. " * 'asias/multishard_writer_v10.1' of github.com:scylladb/seastar-dev: tests: Add multishard_writer_test to test.py tests: Add test for multishard_writer multishard_writer: Introduce multishard_writer tests: Allow random_mutation_generator to generate mutations belong to remote shard	2018-06-28 12:36:55 +03:00
Asias He	fd8b7efb99	tests: Add multishard_writer_test to test.py For multishard_writer class testing.	2018-06-28 17:20:29 +08:00
Asias He	4050a4b24e	tests: Add test for multishard_writer	2018-06-28 17:20:29 +08:00
Asias He	f4b406cce1	multishard_writer: Introduce multishard_writer The multishard_writer class gets mutation_fragments generated from flat_mutation_reader and consumes the mutation_fragments with multishard_writer::_consumer. If the mutation_fragment does not belong to the shard multishard_writer is on, it will forward the mutation_fragment to the correct shard. Future returned by multishard_writer() becomes ready when all the mutation_fragments are consumed. Tests: tests/multishard_writer_test.cc Tests: dtest update_cluster_layout_tests.py Fixes #3497	2018-06-28 17:20:28 +08:00
Asias He	8eccff1723	tests: Allow random_mutation_generator to generate mutations belong to remote shard - make_local_keys returns keys of current shard - make_keys returns keys of current or remote shard	2018-06-28 17:20:28 +08:00
Asias He	27cb41ddeb	range_streamer: Use float for time took for stream It is useful when the total time to stream is small, e.g., 2.0 seconds and 2.9 seconds. Showing the time as an integer number of seconds is not accurate in such cases. Message-Id: <d801b57279981c72acb907ad4b0190ba4d938a3d.1530175052.git.asias@scylladb.com>	2018-06-28 11:39:14 +03:00
Avi Kivity	e1efda8b0c	Merge "Disable sstable filtering based on min/max clustering key components" from Tomasz " With DateTiered and TimeWindow, there is a read optimization enabled which excludes sstables based on overlap with recorded min/max values of clustering key components. The problem is that it doesn't take into account partition tombstones and static rows, which should still be returned by the reader even if there is no overlap in the query's clustering range. A read which returns no clustering rows can mispopulate cache, which will appear as partition deletion or writes to the static row being lost, until node restart or eviction of the partition entry. There is also a bad interaction between cache population on read and that optimization. When the clustering range of the query doesn't overlap with any sstable, the reader will return no partition markers for the read, which leads cache populator to assume there is no partition in sstables and it will cache an empty partition. This will cause later reads of that partition to miss prior writes to that partition until it is evicted from cache or node is restarted. Disable until a more elaborate fix is implemented. Fixes #3552 Fixes #3553 " * tag 'tgrabiec/disable-min-max-sstable-filtering-v1' of github.com:tgrabiec/scylla: tests: Add test for slicing a mutation source with date tiered compaction strategy tests: Check that database conforms to mutation source database: Disable sstable filtering based on min/max clustering key components	2018-06-27 14:28:27 +03:00
Calle Wilund	054514a47a	sstables::compress: Ensure unqualified compressor name if possible Fixes #3546 Both older origin and scylla writes "known" compressor names (i.e. those in origin namespace) unqualified (i.e. LZ4Compressor). This behaviour was not preserved in the virtualization change. But probably should be. Message-Id: <20180627110930.1619-1-calle@scylladb.com>	2018-06-27 14:16:50 +03:00
Piotr Sarna	03753cc431	database: make drop_column_family wait on reads in progress drop_column_family now waits for both writes and reads in progress. It solves possible liveness issues with row cache, when column_family could be dropped prematurely, before the read request was finished. Phaser operation is passed inside database::query() call. There are other places where reading logic is applied (e.g. view replicas), but these are guarded with different synchronization mechanisms, while _pending_reads_phaser applies to regular reads only. Fixes #3357 Reported-by: Duarte Nunes <duarte@scylladb.com> Signed-off-by: Piotr Sarna <sarna@scylladb.com> Message-Id: <d58a5ee10596d0d62c765ee2114ac171b6f087d2.1529928323.git.sarna@scylladb.com>	2018-06-27 10:02:56 +01:00
Piotr Sarna	e1a867cbe3	database: add phaser for reads Currently drop_column_family waits on write_in_progress phaser, but there's no such mechanism for reads. This commit adds a corresponding reads phaser. Refs #3357 Reported-by: Duarte Nunes <duarte@scylladb.com> Signed-off-by: Piotr Sarna <sarna@scylladb.com> Message-Id: <70b5fdd44efbc24df61585baef024b809cabe527.1529928323.git.sarna@scylladb.com>	2018-06-27 10:02:56 +01:00
Tomasz Grabiec	b4879206fb	tests: Add test for slicing a mutation source with date tiered compaction strategy Reproducer for https://github.com/scylladb/scylla/issues/3552	2018-06-26 18:54:44 +02:00
Tomasz Grabiec	826a237c2e	tests: Check that database conforms to mutation source	2018-06-26 18:54:44 +02:00
Tomasz Grabiec	19b76bf75b	database: Disable sstable filtering based on min/max clustering key components With DateTiered and TimeWindow, there is a read optimization enabled which excludes sstables based on overlap with recorded min/max values of clustering key components. The problem is that it doesn't take into account partition tombstones and static rows, which should still be returned by the reader even if there is no overlap in the query's clustering range. A read which returns no clustering rows can mispopulate cache, which will appear as partition deletion or writes to the static row being lost, until node restart or eviction of the partition entry. There is also a bad interaction between cache population on read and that optimization. When the clustering range of the query doesn't overlap with any sstable, the reader will return no partition markers for the read, which leads cache populator to assume there is no partition in sstables and it will cache an empty partition. This will cause later reads of that partition to miss prior writes to that partition until it is evicted from cache or node is restarted. Disable until a more elaborate fix is implemented. Fixes #3552 Fixes #3553	2018-06-26 18:54:44 +02:00
Avi Kivity	9a7ecdb3b9	Merge "Deglobalise cache_tracker" from Paweł " Cache tracker is a thread-local global object that indirectly depends on the lifetimes of other objects. In particular, a member of cache_tracker: mutation_cleaner may extend the lifetime of a mutation_partition until the cleaner is destroyed. The mutation_partition itself depends on LSA migrators which are thread-local objects. Since there is no direct dependency between LSA-migrators and cache_tracker, it is not guaranteed that the former won't be destroyed before the latter. The easiest (barring some unit tests that repeat the same code several billion times) solution is to stop using globals. This series also improves the part of LSA sanitiser that deals with migrators. Fixes #3526. Tests: unit(release) " * tag 'deglobalise-cache-tracker/v1-rebased' of https://github.com/pdziepak/scylla: mutation_cleaner: add disclaimer about mutation_partition lifetime lsa: enhance sanitizer for migrators lsa: formalise migrator id requirements row_cache: deglobalise row cache tracker	2018-06-26 16:38:12 +01:00
Asias He	c3b5a2ecd5	gossip: Fix tokens assignment in assassinate_endpoint The tokens vector is defined a few lines above and is needed outside the if block. Do not redefine it again in the if block, otherwise the tokens will be empty. Found by code inspection. Fixes #3551. Message-Id: <c7a06375c65c950e94236571127f533e5a60cbfd.1530002177.git.asias@scylladb.com>	2018-06-26 16:38:12 +01:00
Tomasz Grabiec	6d6b93d1e7	flat_mutation_reader: Move field initialization to initializer list This works around a problem of std::terminate() being called in debug mode build if initialization of _current throws. Backtrace: Thread 2 "row_cache_test_" received signal SIGABRT, Aborted. 0x00007ffff17ce9fb in raise () from /lib64/libc.so.6 (gdb) bt #0 0x00007ffff17ce9fb in raise () from /lib64/libc.so.6 #1 0x00007ffff17d077d in abort () from /lib64/libc.so.6 #2 0x00007ffff5773025 in __gnu_cxx::__verbose_terminate_handler() () from /lib64/libstdc++.so.6 #3 0x00007ffff5770c16 in ?? () from /lib64/libstdc++.so.6 #4 0x00007ffff576fb19 in ?? () from /lib64/libstdc++.so.6 #5 0x00007ffff5770508 in __gxx_personality_v0 () from /lib64/libstdc++.so.6 #6 0x00007ffff3ce4ee3 in ?? () from /lib64/libgcc_s.so.1 #7 0x00007ffff3ce570e in _Unwind_Resume () from /lib64/libgcc_s.so.1 #8 0x0000000003633602 in reader::reader (this=0x60e0001160c0, r=...) at flat_mutation_reader.cc:214 #9 0x0000000003655864 in std::make_unique<make_forwardable(flat_mutation_reader)::reader, flat_mutation_reader>(flat_mutation_reader &&) (__args#0=...) at /usr/include/c++/7/bits/unique_ptr.h:825 #10 0x0000000003649a63 in make_flat_mutation_reader<make_forwardable(flat_mutation_reader)::reader, flat_mutation_reader>(flat_mutation_reader &&) (args#0=...) at flat_mutation_reader.hh:440 #11 0x000000000363565d in make_forwardable (m=...) at flat_mutation_reader.cc:270 #12 0x000000000303f962 in memtable::make_flat_reader (this=0x61300001d540, s=..., range=..., slice=..., pc=..., trace_state_ptr=..., fwd=..., fwd_mr=...) at memtable.cc:592 Message-Id: <1528792447-13336-1-git-send-email-tgrabiec@scylladb.com>	2018-06-25 20:03:23 +03:00
Avi Kivity	31eeae0126	Merge "Avoid buffer linearisation in read path" from Paweł " The read path on coordinator involves a lot of passing around buffers and some occasional processing. We start with query::result obtained from the storage_proxy which is then transformed into a cql3::result_set, which is then used to write a response. Buffers are copied and linearised quite excessively. This series attempts to remedy that by using view of fragmented buffers as much as possible. The first part deals with reading from query::result. ser::buffer_view is introduced which enables the IDL infrastructure to read a buffer without copying or linearising it. The second part is switching native protocol layer to use bytes_ostream instead of std::vector<char> to hold the generated response to the client. The last part introduces cql3::result_generator which is an alternative to cql3::result_set that passes buffer views without copying or linearising anything from query::result to the native protocol layer (or Thrift). It is only used in simple cases, when no processing at the CQL layer is required, except for paged queries which require some simple interpretation of the results and are supported by the result generator. 
Tests: unit(release), dtests(paging_test.py paging_additional_test.py cql_additional_tests.py cql_tracing_test.py cql_prepared_test.py cql_cast_test.py cql_tests.py) " * tag 'buffer-views-query-result/v2' of https://github.com/pdziepak/scylla: (34 commits) cql3: select_statement: use fetch_page_generator() if possible pager: add fetch_page_generator() pager: make the visitor handle_result() accepts a template parameter pager: make query_result_visitor base class a template parameter pager: make myvistor a member class of query_pager pager: make shared pointers to selection constant pager: merge query_pager and query_pagers::impl cql3: select_statement: use result_generator if possible cql3: selection: add is_trivial() cql3: result: support result_generator cql3: add lazy result_generator cql3: add result class cql3::result_set: fix encapsulation thrift: use cql3::result_set visiting interface transport: use cql3::result_set visiting interface cql3::result_set: add visit() transport: response: add write_int_placeholder() transport: steal response buffers and make send zero-copy transport: use reusable_buffer for compression transport: response: use bytes_ostream ...	2018-06-25 17:37:50 +03:00
Paweł Dziepak	bdc299cc38	mutation_cleaner: add disclaimer about mutation_partition lifetime mutation_cleaner has already caused problems by extending lifetime of mutation_partition past the lifetime of LSA migrators that it uses (due to the fact that both the cleaner and migrators were thread-local globals). Since the long term goal is to make mutation_partition internal representation depend more and more on schema, that lifetime extension may again cause problems in the future, so let's add a disclaimer that, hopefully, will help avoid them.	2018-06-25 09:37:43 +01:00
Paweł Dziepak	55bf9d78a6	lsa: enhance sanitizer for migrators Current LSA sanitizer performs only basic checks on the migrators use, without doing any additional reporting in case an error is detected. This patch enhances it so that when a problem is detected relevant stack traces get printed.	2018-06-25 09:37:43 +01:00
Paweł Dziepak	fcd9b1f821	lsa: formalise migrator id requirements object_descriptor uses special encoding for migrator ids which assumes that the valid ones are in a range smaller than uint32_t. Let's add some static asserts that make this fact more visible.	2018-06-25 09:37:43 +01:00
Paweł Dziepak	96b0577343	row_cache: deglobalise row cache tracker Row cache tracker has numerous implicit dependencies on other objects (e.g. LSA migrators for data held by mutation_cleaner). The fact that both cache tracker and some of those dependencies are thread local objects makes it hard to guarantee correct destruction order. Let's deglobalise cache tracker and put it in the database class.	2018-06-25 09:37:43 +01:00
Paweł Dziepak	2b1fcfe019	cql3: select_statement: use fetch_page_generator() if possible	2018-06-25 09:21:47 +01:00
Paweł Dziepak	1cf3cb285f	pager: add fetch_page_generator() fetch_page_generator() is an equivalent of fetch_page(), but instead of building a cql3::result_set it returns a cql3::result_generator().	2018-06-25 09:21:47 +01:00
Paweł Dziepak	f6fe831d49	pager: make the visitor handle_result() accepts a template parameter	2018-06-25 09:21:47 +01:00
Paweł Dziepak	fc87ca5926	pager: make query_result_visitor base class a template parameter So far query_result_visitor was tied to result_set_builder. The goal is to enable result_generator to work with paged queries as well so we need to decouple them.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	dc9a65ea76	pager: make myvistor a member class of query_pager It is going to become a class template.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	319b2cde7e	pager: make shared pointers to selection constant Shared pointers make code harder to reason about, it is not easy to get rid of them in this piece of the code, but we can restore at least a bit of sanity by adding consts.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	327d3de51e	pager: merge query_pager and query_pagers::impl There is just a single implementation of query_pager and there is no reason to make anything virtual. Devirtualising this code will allow higher layers to pass visitors via templates.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	fa5dea91e7	cql3: select_statement: use result_generator if possible	2018-06-25 09:21:47 +01:00
Paweł Dziepak	3f1184d16d	cql3: selection: add is_trivial() cql3::result_generator supports only trivial selections.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	adad31ba6b	cql3: result: support result_generator cql3::result can now hold either a result_set or a result_generator. Some code that is not performance critical expects to get result_set so a way of converting the result_generator to a result_set is added.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	02443d10db	cql3: add lazy result_generator result_generator is a restricted alternative to result_set. It supports only the simplest cases, but is much cheaper as it passes data almost directly from query::result to its visitor bypassing much of the CQL layer.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	dca68afce6	cql3: add result class So far the only way of returning a result of a CQL query was to build a result_set. An alternative lazy result generator is going to be introduced for the simple cases when no transformations at CQL layer are needed. To do that we need to hide from the users the fact that there are going to be multiple representations of CQL results.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	29cc4a4c0b	cql3::result_set: fix encapsulation	2018-06-25 09:21:47 +01:00
Paweł Dziepak	8f26d9c03f	thrift: use cql3::result_set visiting interface	2018-06-25 09:21:47 +01:00
Paweł Dziepak	54d5dc414d	transport: use cql3::result_set visiting interface	2018-06-25 09:21:47 +01:00
Paweł Dziepak	2e4234ab63	cql3::result_set: add visit() This visiting interface for result_set satisfies most of its users (at least all of those which are in the hot path). It will allow having an alternative to result_set (i.e. lazy result generator) which would provide exactly the same interface.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	c0e7160625	transport: response: add write_int_placeholder() This allows the response writer to defer writing integers until later time. It will be used by lazy response generator which will know the number of rows in the response only after they are all written.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	88aff8eda8	transport: steal response buffers and make send zero-copy Each response is sent only once, so we can safely steal its buffers and pass them to the output_stream using the zero-copy interface.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	821e6683e3	transport: use reusable_buffer for compression Compression algorithms require us to linearise bytes_ostream. This may cause an excessive number of large allocations. Using reusable_buffers can avoid that.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	a7c4d407ce	transport: response: use bytes_ostream std::vector<char> is not a very good container for incrementally building a response. It may cause excessive copies and allocations. If the response is large it will put more pressure on the memory allocator by requiring the buffer to be contiguous. We already have bytes_ostream which avoids all of these problems, so let's use it.	2018-06-25 09:22:43 +01:00
Paweł Dziepak	c04d38b76b	transport: drop response::make_message()	2018-06-25 09:22:35 +01:00
Paweł Dziepak	444acf49af	transport: use std::unique_ptr for the response So far cql_server::response was passed around using shared pointers. They have the very big cost of making it hard to reason about the code. None of that is necessary and we can easily switch to using the much more sensible std::unique_ptr.	2018-06-25 09:22:24 +01:00
Paweł Dziepak	12f89299b2	transport: move response to a separate header There are some other translation units which right now are satisfied with the response being an incomplete type. This means that std::unique_ptr can't be used for it. Let's move the class declaration to a header that can be included where needed.	2018-06-25 09:21:47 +01:00
Paweł Dziepak	3b9ba30497	tests: add test for reusable buffers	2018-06-25 09:21:47 +01:00
Paweł Dziepak	b4c5e1a6d4	utils: add reusable_buffer This commit adds a helper class reusable_buffer which can be used to avoid excessive memory allocations of large buffers when bytes_ostream needs to be linearised. The idea is that reusable_buffer in most cases is going to be thread local so that multiple continuation chains can reuse the same large buffer.	2018-06-25 09:21:47 +01:00
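
Note on 82f76b0947 (bound_view): the commit message explains that a plain reference member prevents copy assignment, and that std::reference_wrapper avoids the need for the 'placement new' workaround. A minimal sketch of that idea follows. The types prefix, view_with_ref and view_with_wrapper and the helper rebind_and_read are hypothetical stand-ins invented for illustration; they are not the actual Scylla definitions.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <type_traits>

// Hypothetical stand-in for the object a view refers to.
struct prefix {
    std::string value;
};

// A plain reference member makes the implicit copy-assignment operator deleted.
struct view_with_ref {
    const prefix& p;
    int kind;
};

// std::reference_wrapper is copy-assignable, so the enclosing class is too.
struct view_with_wrapper {
    std::reference_wrapper<const prefix> p;
    int kind;

    const prefix& get() const { return p.get(); }
};

static_assert(!std::is_copy_assignable<view_with_ref>::value,
              "a reference member disables copy assignment");
static_assert(std::is_copy_assignable<view_with_wrapper>::value,
              "reference_wrapper keeps copy assignment available");

// Copy-assigns b into a local copy of a, rebinding it to b's target,
// and returns the value now visible through it.
inline std::string rebind_and_read(view_with_wrapper a, const view_with_wrapper& b) {
    a = b;  // ordinary assignment, no placement new required
    return a.get().value;
}
```

Assignment rebinds the wrapper to another object instead of assigning through the reference, which is the behaviour a view type usually wants.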

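Note on c0e7160625 (write_int_placeholder): the commit message describes deferring the write of an integer, such as a row count, until after the rows themselves have been written. The sketch below illustrates that pattern under stated assumptions: response_sketch, fill, write_bytes and encode_rows are hypothetical names, a std::string stands in for bytes_ostream, and integers are assumed to be 4 bytes in big-endian order. It is not the interface of the real cql_server::response.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a response writer backed by a contiguous string.
class response_sketch {
    std::string _body;
public:
    // Remembers where a 4-byte integer has to be written later.
    struct int_placeholder {
        std::size_t offset;
    };

    // Reserve four zero bytes and return a handle to them.
    int_placeholder write_int_placeholder() {
        int_placeholder ph{_body.size()};
        _body.append(4, '\0');
        return ph;
    }

    // Overwrite the reserved bytes with v in big-endian order.
    void fill(int_placeholder ph, std::int32_t v) {
        assert(ph.offset + 4 <= _body.size());
        const auto u = static_cast<std::uint32_t>(v);
        for (std::size_t i = 0; i < 4; ++i) {
            _body[ph.offset + i] = static_cast<char>((u >> (24 - 8 * i)) & 0xff);
        }
    }

    void write_int(std::int32_t v) { fill(write_int_placeholder(), v); }

    // Length-prefixed byte string.
    void write_bytes(const std::string& s) {
        write_int(static_cast<std::int32_t>(s.size()));
        _body += s;
    }

    const std::string& body() const { return _body; }
};

// Writes the row count only after every row has been emitted.
inline std::string encode_rows(const std::vector<std::string>& rows) {
    response_sketch r;
    auto count = r.write_int_placeholder();
    std::int32_t n = 0;
    for (const auto& row : rows) {
        r.write_bytes(row);
        ++n;
    }
    r.fill(count, n);
    return r.body();
}
```

The placeholder lets a generator stream rows straight into the buffer without knowing their number up front, at the cost of one small patch-up write at the end.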

15901 Commits