scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-23 10:00:35 +00:00

Author	SHA1	Message	Date
Benny Halevy	4476800493	flat_mutation_reader: get rid of timeout parameter Now that the timeout is taken from the reader_permit. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-24 16:30:51 +03:00
Benny Halevy	fe479aca1d	reader_permit: add timeout member To replace the timeout parameter passed to flat_mutation_reader methods. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-24 14:29:44 +03:00
Botond Dénes	7bfa40a2f1	treewide: use make_tracking_only_permit() For all those reads that don't (won't or can't) pass through admission currently.	2021-07-14 17:19:02 +03:00
Botond Dénes	7a4381b491	mutation_writer: multishard_writer: migrate off the global reader concurrency semaphore Use a local one instead, and stop it when the writer is destroyed.	2021-07-08 12:31:36 +03:00
Benny Halevy	693d5d9e6b	mutation_writer: bucket_writer: consume: propagate _consume_fut if queue_reader_handle is_terminated When the queue_reader_handle is terminated it was either explicitly aborted or the reader was closed prematurely. In this case _consume_fut should hold the root-cause error (e.g. when compaction is stopped). Return it instead of trying to push the mutation fragment. If no error is returned from _consume_fut, make to sure to return either the queue_reader_handle error, if available, or a generic error since the writer. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-06-16 17:25:16 +03:00
Benny Halevy	5f31beaf97	flat_mutation_reader: unify reader_consumer declarations Put the reader_consumer declaration in flat_mutation_reader.hh and include it instead of declaring the same `using reader_consumer` declaration in several places. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210607075020.31671-1-bhalevy@scylladb.com>	2021-06-07 16:11:18 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Botond Dénes	a53e6bc6e8	mutation_writer: add segregate_by_partition Add a new segregator which segregates a stream, potentially containing duplicate or even out-of-order partitions, into multiple output streams, such that each output stream has strictly monotonic partitions. This segregator will be used by a new scrub compaction mode which is meant to fix sstables containing duplicate or out-of-order data.	2021-05-05 12:03:42 +03:00
Benny Halevy	e453f890f2	mutation_writer: multishard_writer: close readers when done Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	64c5b7fda6	mutation_writer: feed_writer: close reader when done Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	f29732573a	mutation_writer: bucket_writer: add close bucket_writer::close waits for the _consumer_fut. It is called both after consume_end_of_stream() and after abort(). _consumer_fut is expected to return an exception on the abort path. Wait for it and drop any exception so it won't be abandoned as seen in #7904. With that moved to close() time, consume_end_of_stream doesn't need to return a future and is made void all the way in the stack. This is ok since queue_reader_handle::push_end_of_stream is synchronous too. Added a unit test that aborts the reader consumer during `segregate_by_timestamp`, reproducing the Exceptional future ignored issue without the fix. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-01-19 19:03:58 +02:00
Benny Halevy	fc3f9a57ff	mutation_writer/feed_writers: refactor bucket/shard writers Consolidate shard_based_splitting_writer::shard_writer and timestamp_based_splitting_writer::bucket_writer common code into mutation_writer::bucket_writer. This provides a common place to handle consume_end_of_stream() and abort(), and in particular the handling of the underlying _conmsumer_fut. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-01-19 18:48:01 +02:00
Benny Halevy	a9d91a2d09	mutation_writer: update bucket/shard writers consume_end_of_stream After `61520a33d6` feed_writers doesn't call consume_end_of_stream after abort() so no need to test if (!_handle.is_terminated()) { and consume_end_of_stream is now called in then_wrapped rather than `finally` so it's ok if it throws. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-01-19 18:44:40 +02:00
Gleb Natapov	61520a33d6	mutation_writer: pass exceptions through feed_writer feed_writer() eats exception and transforms it into an end of stream instead. Downstream validators hate when this happens. Fixes #7482 Message-Id: <20201216090038.GB3244976@scylladb.com>	2020-12-16 13:18:19 +02:00
Botond Dénes	ff623e70b3	reader_concurrency_semaphore: name permits Require a schema and an operation name to be given to each permit when created. The schema is of the table the read is executed against, and the operation name, which is some name identifying the operation the permit is part of. Ideally this should be different for each site the permit is created at, to be able to discern not only different kind of reads, but different code paths the read took. As not all read can be associated with one schema, the schema is allowed to be null. The name will be used for debugging purposes, both for coredump debugging and runtime logging of permit-related diagnostics.	2020-10-13 12:32:13 +03:00
Botond Dénes	6ca0464af5	mutation_fragment: add schema and permit We want to start tracking the memory consumption of mutation fragments. For this we need schema and permit during construction, and on each modification, so the memory consumption can be recalculated and pass to the permit. In this patch we just add the new parameters and go through the insane churn of updating all call sites. They will be used in the next patch.	2020-09-28 11:27:23 +03:00
Botond Dénes	3fab83b3a1	flat_mutation_reader: impl: add reader_permit parameter Not used yet, this patch does all the churn of propagating a permit to each impl. In the next patch we will use it to track to track the memory consumption of `_buffer`.	2020-09-28 10:53:48 +03:00
Avi Kivity	422a7e07a3	timestamp_based_splitting_writer: supply a parameter to std::out_of_range contructor std::out-of-range does not have a default constructor, yet gcc somehow accepts a no-argument construction. Clang (correctly) doesn't, so add a parameter.	2020-09-21 16:32:53 +03:00
Piotr Jastrzebski	01ea159fde	codebase wide: use try_emplace when appropriate C++17 introduced try_emplace for maps to replace a pattern: if(element not in a map) { map.emplace(...) } try_emplace is more efficient and results in a more concise code. This commit introduces usage of try_emplace when it's appropriate. Tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <4970091ed770e233884633bf6d46111369e7d2dd.1597327358.git.piotr@scylladb.com>	2020-08-16 14:41:09 +03:00
Rafael Ávila de Espíndola	f6e407ecd2	everywhere: Prepare for seastar api v4 (when_all_succeed return value) The seastar api v4 changes the return type of when_all_succeed. This patch adds discard_result when that is best solution to handle the change. This doesn't do the actual update to v4 since there are still a few issues left to fix in seastar. A patch doing just the update will follow. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200617233150.918110-1-espindola@scylladb.com>	2020-06-18 15:13:56 +03:00
Avi Kivity	a4c44cab88	treewide: update concepts language from the Concepts TS to C++20 Seastar recently lost support for the experimental Concepts Technical Specification (TS) and gained support for C++20 concepts. Re-enable concepts in Scylla by updating our use of concepts to the C++20 standard. This change: - peels off uses of the GCC6_CONCEPT macro - removes inclusions of <seastar/gcc6-concepts.hh> - replaces function-style concepts (no longer supported) with equation-style concepts - semicolons added and removed as needed - deprecated std::is_pod replaced by recommended replacement - updates return type constraints to use concepts instead of type names (either std::same_as or std::convertible_to, with std::same_as chosen when possible) No attempt is made to improve the concepts; this is a specification update only. Message-Id: <20200531110254.2555854-1-avi@scylladb.com>	2020-06-02 09:12:21 +03:00
Raphael S. Carvalho	9ebf7b442e	timestamp_based_splitting_writer: fix use-after-move look-alike rt is moved before rt.tomb.timestamp is retrieved, so there's a something that looks like use-after-move here (but really isn't). found it while auditting the code. [avi: adjusted changelog to note that it's not really a use-after-move] Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200525141047.168968-1-raphaelsc@scylladb.com>	2020-05-27 08:40:05 +03:00
Glauber Costa	e44b2826ab	compaction: avoid abandoned futures when using interposers When using interposers, cancelling compactions can leave futures that are not waited for (resharding, twcs) The reason is when consume_end_of_stream gets called, it tries to push end_of_stream into the queue_reader_handle. Because cancelling a compaction is done through an exception, the queue_reader_handle is terminated already at this time. Trying to push to it generates another exception and prevents us from returning the future right below it. This patch adds a new method is_terminated() and if we detect that the queue_reader_handle is already terminated by this point, we don't try to push. We call it is_terminated() because the check is to see if the queue_reader_handle has a _reader. The reader is also set to null on successful destruction. Signed-off-by: Glauber Costa <glauber@scylladb.com> Reviewed-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200430175839.8292-1-glauber@scylladb.com>	2020-05-01 16:30:23 +03:00
Asias He	d86958d3b2	multishard_writer: Abort the queue attached to consumers when producer fails We have this in multishard_writer: future<uint64_t> multishard_writer::operator()() { return distribute_mutation_fragments().finally([this] { return wait_pending_consumers(); }).then([this] { return _consumed_partitions; }); } The wait_pending_consumers which waits for the consumers to finish is called even when distribute_mutation_fragments fails. When distribute_mutation_fragments fails and the failure is due to the producer fails, consumers can wait for data which will never come because the producer has failed already. This can cause a deadlock. To fix, when distribute_mutation_fragments fails, we should abort the queues that are attached to the readers used by the consumers. Fixes #6241	2020-04-20 14:53:24 +08:00
Glauber Costa	8fe10863f4	mutation_writer: introduce shard_based splitting writer This is similar to the timestamp based splitting writer, except that it splits data based on the shard where the partition key is supposed to be placed. It is similar to the multishard_writer, in the sense that it creates n streams for n shards, but it does not want to process the streams in the owner shards. We want to use that in processes like resharding where it is fine for a foreign shard to deal with a mutation. One option would be to augment the multishard_writer to optionally achieve these properties, but having a separate splitter is both simpler and faster. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2020-04-02 08:55:16 -04:00
Glauber Costa	a258f111c7	mutation_writer: factor out part of the code for the timestamp splitter I am about to introduce a new splitter. Therefore, move parts of it that are common to its own file. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2020-04-02 08:55:16 -04:00
Piotr Jastrzebski	e72696a8e6	sharding_info: rename the class to sharder Also rename all variables that were named si or sinfo to sharder. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-30 18:42:33 +02:00
Piotr Jastrzebski	db3d7df893	multishard_writer: stop calling i_partitioner::shard_count Replace it with sharding_info::shard_count. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-30 18:42:33 +02:00
Piotr Jastrzebski	8aabba6041	distribute_reader_and_consume_on_shards: use sharding_info::shard_of Switches all uses of i_partitioner::shard_of to sharding_info::shard_of. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-30 18:42:33 +02:00
Pavel Emelyanov	4fa12f2fb8	header: De-bloat schema.hh The header sits in many other headers, but there's a handy schema_fwd.hh that's tiny and contains needed declarations for other headers. So replace shema.hh with schema_fwd.hh in most of the headers (and remove completely from some). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200303102050.18462-1-xemul@scylladb.com>	2020-03-03 11:34:00 +01:00
Piotr Jastrzebski	9494da2102	distribute_reader_and_consume_on_shards: don't take partitioner This function already takes schema so it can get partitioner using schema::get_partitioner. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-17 10:59:15 +01:00
Kamil Braun	574e1cd514	tests: generalize timestamp_based_spliiting_writer and bucket_writer to UDTs.	2019-10-25 12:04:44 +02:00
Kamil Braun	bbdb438d89	collection_mutation: easier (de)serialization of collection_mutation(s). `collection_type_impl::serialize_mutation_form` became `collection_mutation(_view)_description::serialize`. Previously callers had to cast their data_type down to collection_type to use serialize_mutation_form. Now it's done inside `serialize`. In the future `serialize` will be generalized to handle UDTs. `collection_type_impl::deserialize_mutation_form` became a free standing function `deserialize_collection_mutation` with similiar benefits. Actually, noone needs to call this function manually because of the next paragraph. A common pattern consisting of linearizing data inside a `collection_mutation_view` followed by calling `deserialize_mutation_form` has been abstracted out as a `with_deserialized` method inside collection_mutation_view. serialize_mutation_form_only_live was removed, because it hadn't been used anywhere.	2019-10-25 10:42:58 +02:00
Kamil Braun	b1d16c1601	types: move collection_type_impl::mutation(_view) out of collection_type_impl. collection_type_impl::mutation became collection_mutation_description. collection_type_impl::mutation_view became collection_mutation_view_description. These classes now reside inside collection_mutation.hh. Additional documentation has been written for these classes. Related function implementations were moved to collection_mutation.cc. This makes it easier to generalize these classes to non-frozen UDTs in future commits. The new names (together with documentation) better describe their purpose.	2019-10-25 10:19:45 +02:00
Botond Dénes	ce647fac9f	timestamp_based_splitting_writer: fix the handling of partition tombstone Currently the handling of partition tombstones is broken in multiple ways: * The partition-tombstone is lost when the bucket is calculated for its timestamp (due to a misplaced `std::exchange()`). * When the `partition_start` fragment (containing the partition tombstone) is actually written to the bucket we emit another `partition_start` fragment before it because the bucket has not seen that partition before and we fail to notice that we are actually writing the partition header. This bug was allowed to fly under the radar because the unit test was accidentally not creating partition tombstones in the generated data (due to a mistake). It was discovered while working on unit tests for another test and fixing the data generation function to actually generate partition tombstones. This patch fixes both problems in the handling of partition tombstones but it doesn't yet fixes the test. That is deferred until the patch series which uncovered this bug is merged to avoid merge conflicts. The other series mentioned here is: [PATCH v6 00/15] compaction: allow collecting purged data Fixes: #4683 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190710092427.122623-1-bdenes@scylladb.com>	2019-07-10 12:36:57 +03:00
Botond Dénes	df29600eec	Add timestamp_based_splitting_writer This writer implements the core logic of time-window based data segregation. It splits the fragment stream provided by a reader, such that each atom (cell) in the stream will be written into a consumer based on the time-window its timestamp belongs to. The end result is that each consumer will only see fragments, whoose atoms all have timestamps belonging to the same time-window. When a mutation fragment has atoms belonging to different time-windows, it is split into as many fragments as needed so each has only atoms that belong to the same time-window.	2019-06-26 15:45:59 +03:00
Botond Dénes	2693f1838a	Introduce mutation_writer namespace Currently there is a single mutation_writer: `multishard_writer`, however in the next path we are going to add another one. This is the right moment to move these into a common namespace (and folder), we have way too much stuff scattered already in the top-level namespace (and folder). Also rename `tests/multishard_writer_test.cc` to `tests/mutation_writer_test.cc`, this test-suite will be the home of all the different mutation writer's unit test cases.	2019-06-26 15:45:59 +03:00

37 Commits