This will make it easier to introduce new types, and also to map a type to
its string name and vice versa, using reverse lookup.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
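A bidirectional type/string mapping of the kind described can be sketched as follows; the enum values and names here are illustrative, not Scylla's actual identifiers.

```cpp
#include <map>
#include <string>

// Illustrative compaction type enum; the real set of types differs.
enum class compaction_type { regular, cleanup, reshape };

// Forward map: type -> string name.
static const std::map<compaction_type, std::string> type_names = {
    {compaction_type::regular, "COMPACTION"},
    {compaction_type::cleanup, "CLEANUP"},
    {compaction_type::reshape, "RESHAPE"},
};

// Reverse lookup built once from the forward map, so introducing a
// new type only requires touching a single table.
static const std::map<std::string, compaction_type> types_by_name = [] {
    std::map<std::string, compaction_type> m;
    for (const auto& [type, name] : type_names) {
        m.emplace(name, type);
    }
    return m;
}();
```

Deriving the reverse map from the forward one keeps the two directions from drifting apart as new types are added.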
Expired sstables are skipped in the compaction setup phase, because they don't
need to be actually compacted, but rather only deleted at the end.
That is causing such sstables to not be removed from the backlog tracker,
meaning that the backlog caused by expired sstables will not be removed even
after their deletion, so shares will be higher than needed, making compaction
potentially more aggressive than it has to be.
To fix this bug, let's manually register these sstables with the monitor,
so that they'll be removed from the tracker once compaction completes.
Fixes #6054.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20210216203700.189362-1-raphaelsc@scylladb.com>
Add a string describing where the sstables originated
from (e.g. memtable, repair, streaming, compaction, etc.)
If configure_writer is called with a nullptr, the origin
will be equal to an empty string.
Introduce test_env_sstables_manager that provides an overload
of configure_writer with no parameters that calls the base class's
configure_writer with a "test" origin. This is to reduce the
code churn in this patch and to keep the tests simple.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
After 13fa2bec4c, every compaction is performed through a filtering
reader, because consumers cannot do the filtering when an interposer
consumer is enabled.
It turns out that filtering_reader adds significant overhead when regular
compactions are running. As no other compaction type needs to actually do
any filtering, let's limit filtering_reader to cleanup compaction.
Alternatively, we could disable the interposer consumer for cleanup,
or add support for the consumers to do the filtering themselves, but that
would add lots of complexity.
Fixes #7748.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20201230194516.848347-2-raphaelsc@scylladb.com>
This filter is used to discard data that doesn't belong to the current
shard, but scylla will only make an sstable available to regular
compaction after it has been resharded on either boot or refresh.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20201230194516.848347-1-raphaelsc@scylladb.com>
sstable_writer may depend on the sstable throughout its whole lifecycle.
If the sstable is freed before the sstable_writer, we might hit a use-after-free
as in the following case:
```
std::_Deque_iterator<sstables::compression::segmented_offsets::bucket, sstables::compression::segmented_offsets::bucket&, sstables::compression::segmented_offsets::bucket*>::operator+=(long) at /usr/include/c++/10/bits/stl_deque.h:240
(inlined by) std::operator+(std::_Deque_iterator<sstables::compression::segmented_offsets::bucket, sstables::compression::segmented_offsets::bucket&, sstables::compression::segmented_offsets::bucket*> const&, long) at /usr/include/c++/10/bits/stl_deque.h:378
(inlined by) std::_Deque_iterator<sstables::compression::segmented_offsets::bucket, sstables::compression::segmented_offsets::bucket&, sstables::compression::segmented_offsets::bucket*>::operator[](long) const at /usr/include/c++/10/bits/stl_deque.h:252
(inlined by) std::deque<sstables::compression::segmented_offsets::bucket, std::allocator<sstables::compression::segmented_offsets::bucket> >::operator[](unsigned long) at /usr/include/c++/10/bits/stl_deque.h:1327
(inlined by) sstables::compression::segmented_offsets::push_back(unsigned long, sstables::compression::segmented_offsets::state&) at ./sstables/compress.cc:214
sstables::compression::segmented_offsets::writer::push_back(unsigned long) at ./sstables/compress.hh:123
(inlined by) compressed_file_data_sink_impl<crc32_utils, (compressed_checksum_mode)1>::put(seastar::temporary_buffer<char>) at ./sstables/compress.cc:519
seastar::output_stream<char>::put(seastar::temporary_buffer<char>) at table.cc:?
(inlined by) seastar::output_stream<char>::put(seastar::temporary_buffer<char>) at ././seastar/include/seastar/core/iostream-impl.hh:432
seastar::output_stream<char>::flush() at table.cc:?
seastar::output_stream<char>::close() at table.cc:?
sstables::file_writer::close() at sstables.cc:?
sstables::mc::writer::~writer() at writer.cc:?
(inlined by) sstables::mc::writer::~writer() at ./sstables/mx/writer.cc:790
sstables::mc::writer::~writer() at writer.cc:?
flat_mutation_reader::impl::consumer_adapter<stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> > >::~consumer_adapter() at compaction.cc:?
(inlined by) std::_Optional_payload_base<sstables::compaction_writer>::_M_destroy() at /usr/include/c++/10/optional:260
(inlined by) std::_Optional_payload_base<sstables::compaction_writer>::_M_reset() at /usr/include/c++/10/optional:280
(inlined by) std::_Optional_payload<sstables::compaction_writer, false, false, false>::~_Optional_payload() at /usr/include/c++/10/optional:401
(inlined by) std::_Optional_base<sstables::compaction_writer, false, false>::~_Optional_base() at /usr/include/c++/10/optional:474
(inlined by) std::optional<sstables::compaction_writer>::~optional() at /usr/include/c++/10/optional:659
(inlined by) sstables::compacting_sstable_writer::~compacting_sstable_writer() at ./sstables/compaction.cc:229
(inlined by) compact_mutation<(emit_only_live_rows)0, (compact_for_sstables)1, sstables::compacting_sstable_writer, noop_compacted_fragments_consumer>::~compact_mutation() at ././mutation_compactor.hh:468
(inlined by) compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer>::~compact_for_compaction() at ././mutation_compactor.hh:538
(inlined by) std::default_delete<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> >::operator()(compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer>*) const at /usr/include/c++/10/bits/unique_ptr.h:85
(inlined by) std::unique_ptr<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer>, std::default_delete<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> > >::~unique_ptr() at /usr/include/c++/10/bits/unique_ptr.h:361
(inlined by) stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> >::~stable_flattened_mutations_consumer() at ././mutation_reader.hh:342
(inlined by) flat_mutation_reader::impl::consumer_adapter<stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> > >::~consumer_adapter() at ././flat_mutation_reader.hh:201
auto flat_mutation_reader::impl::consume_in_thread<stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> >, flat_mutation_reader::no_filter>(stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> >, flat_mutation_reader::no_filter, std::chrono::time_point<seastar::lowres_clock, std::chrono::duration<long, std::ratio<1l, 1000l> > >) at ././flat_mutation_reader.hh:272
(inlined by) auto flat_mutation_reader::consume_in_thread<stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> >, flat_mutation_reader::no_filter>(stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> >, flat_mutation_reader::no_filter, std::chrono::time_point<seastar::lowres_clock, std::chrono::duration<long, std::ratio<1l, 1000l> > >) at ././flat_mutation_reader.hh:383
(inlined by) auto flat_mutation_reader::consume_in_thread<stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> > >(stable_flattened_mutations_consumer<compact_for_compaction<sstables::compacting_sstable_writer, noop_compacted_fragments_consumer> >, std::chrono::time_point<seastar::lowres_clock, std::chrono::duration<long, std::ratio<1l, 1000l> > >) at ././flat_mutation_reader.hh:389
(inlined by) seastar::future<void> sstables::compaction::setup<noop_compacted_fragments_consumer>(noop_compacted_fragments_consumer)::{lambda(flat_mutation_reader)#1}::operator()(flat_mutation_reader)::{lambda()#1}::operator()() at ./sstables/compaction.cc:612
```
What happens here is that:
```
compressed_file_data_sink_impl(output_stream<char> out, sstables::compression* cm, sstables::local_compression lc)
    : _out(std::move(out))
    , _compression_metadata(cm)
    , _offsets(_compression_metadata->offsets.get_writer())
    , _compression(lc)
    , _full_checksum(ChecksumType::init_checksum())
```
_compression_metadata points to a buffer held by the sstable object,
and _compression_metadata->offsets.get_writer() returns a writer that keeps
a reference to the segmented_offsets in the sstables::compression
that is used in the ~writer -> close path.
Fixes #7821
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Message-Id: <20201227145726.33319-1-bhalevy@scylladb.com>
This reverts commit dc77d128e9. It was reverted
due to a strange and unexplained diff, which is now explained. The
HEAD on the working directory being pulled from was set back, so git
thought it was merging the intended commits, plus all the work that was
committed from HEAD to master. So it is safe to restore it.
This reverts commit 0aa1f7c70a, reversing
changes made to 72c59e8000. The diff is
strange, including unrelated commits. The cause is not understood,
so to be safe, revert and try again.
"
This series adds maybe_yield called from
cleanup_compaction::get_ranges_for_invalidation
to avoid reactor stalls.
To achieve that, we first extract bool_class can_yield
to utils/maybe_yield.hh, and add a convenience helper:
utils::maybe_yield(can_yield) that conditionally calls
seastar::thread::maybe_yield if it can (when called in a
seastar thread).
With that, we add a can_yield parameter to dht::to_partition_ranges
and dht::partition_range::deoverlap (defaults to false), and
use it from cleanup_compaction::get_ranges_for_invalidation,
as the latter is always called from `consume_in_thread`.
Fixes #7674
Test: unit(dev)
"
* tag 'unstall-get_ranges_for_invalidation-v2' of github.com:bhalevy/scylla:
compaction: cleanup_compaction: get_ranges_for_invalidation: add yield points
dht/i_partitioner: to_partition_ranges: support yielding
locator: extract can_yield to utils/maybe_yield.hh
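The shape of the helper described in the series can be sketched self-contained as below. The real implementation lives in utils/maybe_yield.hh and calls seastar::thread::maybe_yield; a counting stub stands in for it here so the control flow is visible.

```cpp
// Self-contained sketch of the utils::maybe_yield helper described
// above; the strong-bool enum stands in for seastar's bool_class.
enum class can_yield : bool { no = false, yes = true };

static int yield_count = 0;

// Stand-in for seastar::thread::maybe_yield().
static void thread_maybe_yield() { ++yield_count; }

namespace utils {
// Conditionally yields, so functions like to_partition_ranges can take
// a can_yield parameter and insert preemption points only when they are
// known to run inside a seastar thread.
inline void maybe_yield(can_yield yield) {
    if (yield == can_yield::yes) {
        thread_maybe_yield();
    }
}
} // namespace utils
```

Passing the flag through rather than yielding unconditionally keeps the functions usable from non-thread contexts where yielding is not allowed.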
Lower level functions such as `create_single_key_sstable_reader`
were made methods of `sstable_set`.
The motivation is that each concrete sstable_set
may decide to use a better sstable reading algorithm specific to the
data structures used by this sstable_set. For this it needs to access
the set's internals.
A nice side effect is that we moved some code out of table.cc
and database.hh which are huge files.
If an interposer consumer is enabled, partition filtering will be done by the
consumer instead. But that's not possible, because only the producer is able
to skip to the next partition when the current one is filtered out, so scylla
crashes with a bad function call in queue_reader when that happens.
This is a regression which started here: 55a8b6e3c9
To fix this problem, let's make sure that partition filtering will only
happen on the producer side.
Fixes #7590.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20201111221513.312283-1-raphaelsc@scylladb.com>
Compaction needs access to the sstable's ancestors so we need to
keep the ancestors for the sstable separately from the metadata collector
as the latter is about to be moved to the sstable writer.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
And use compaction::make_sstable_writer_config to pass
the compaction's `_sstable_level` to the writer
via sstable_writer_config, instead of via the sstable
metadata_collector, which is going to move from the sstable
to the write_impl.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
And use compaction::make_sstable_writer_config to pass
the compaction's replay_position (`_rp`) to the writer
via sstable_writer_config, instead of via the sstable
metadata_collector, which is going to move from the sstable
to the write_impl.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Consolidate the code to make the sstable_writer_config
for sstable writers into a helper method.
Following patches will add the ability to set the
replay position and sstable level via that config
structure.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Require a schema and an operation name to be given to each permit when
created. The schema is of the table the read is executed against, and
the operation name, which is some name identifying the operation the
permit is part of. Ideally this should be different for each site the
permit is created at, to be able to discern not only different kinds of
reads, but also different code paths the read took.
As not all reads can be associated with one schema, the schema is allowed
to be null.
The name will be used for debugging purposes, both for coredump
debugging and runtime logging of permit-related diagnostics.
Some tests (run_based_compaction_test at least) use _max_sstable_size = 0
in order to force one partition per sstable. That triggers an overflow
when calculating the expected bloom filter size. The overflow doesn't
matter for normal operation, because the result later appears as a
divisor, but it does trigger a ubsan error.
Squelch the error by not dividing by zero here.
I tried using _max_sstable_size = 1, but the test failed for other
reasons.
Closes #7375
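A minimal sketch of such a guard, assuming the estimate feeds a division as the message describes; the function name and shape are hypothetical, not Scylla's actual code.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical sketch: clamp the divisor so a zero estimate (legal when
// _max_sstable_size forces one partition per sstable) cannot trigger a
// division by zero and the resulting ubsan error.
uint64_t estimated_partitions_per_sstable(uint64_t total_partitions,
                                          uint64_t estimated_sstables) {
    return total_partitions / std::max<uint64_t>(estimated_sstables, 1);
}
```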
The main user of this method, the one which required this method to
return the collective buffer size of the entire reader tree, is now
gone. The remaining two users just use it to check the size of the
reader instance they are working with.
So de-virtualize this method and reduce its responsibility to just
returning the buffer size of the current reader instance.
We want to start tracking the memory consumption of mutation fragments.
For this we need a schema and a permit during construction, and on each
modification, so the memory consumption can be recalculated and passed to
the permit.
In this patch we just add the new parameters and go through the insane
churn of updating all call sites. They will be used in the next patch.
Not used yet, this patch does all the churn of propagating a permit
to each impl.
In the next patch we will use it to track the memory
consumption of `_buffer`.
The cleanup compaction wants to keep local tokens on board and gets
them from storage_service.get_local_ranges().
That method is a wrapper around database.get_keyspace_local_ranges(),
created in the previous patch. The live database reference is already
available in the descriptor's options, so we can short-cut the call.
This allows removing the last explicit call for global storage_service
instance from compaction code.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
C++20 introduced `contains` member functions for maps and sets for
checking whether an element is present in the collection. Previously
the `count` function was often used in various ways.
`contains` not only expresses the intent of the code better but also
does it in a more unified way.
This commit replaces all the occurrences of `count` with
`contains`.
Tests: unit(dev)
Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>
Message-Id: <b4ef3b4bc24f49abe04a2aba0ddd946009c9fcb2.1597314640.git.piotr@scylladb.com>
With this the monitor is destroyed first. It makes intuitive sense to
me to destroy a monitor_X before X. This is also the order we had
before 55a8b6e3c9.
Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>
In some cases the estimated number of partitions can be 0, which, albeit a
legit estimation result, breaks a lot of low-level sstable writer code, so
some of it has assertions to ensure the estimated partitions are > 0.
To avoid hitting these asserts, all users of the sstable writers do the
clamping, to ensure estimated partitions is at least 1. However leaving
this to the callers is error prone, as #6913 has shown. As this
clamping is standard practice, it is better to do it in the writers
themselves, avoiding this problem altogether. This is exactly what this
patch does. It also adds two unit tests: one that reproduces the crash
in #6913, and another that ensures all sstable writers are now fine with
estimated partitions being 0. Call sites previously doing the
clamping are changed not to do it, as it is unnecessary now that the
writer does it itself.
Fixes #6913
Tests: unit(dev)
Signed-off-by: Botond Dénes <bdenes@scylladb.com>
Message-Id: <20200724120227.267184-1-bdenes@scylladb.com>
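The clamping moved into the writer can be sketched as follows; the struct and function names are illustrative stand-ins, not Scylla's actual code.

```cpp
#include <algorithm>
#include <cstdint>

// Illustrative stand-in for sstable_writer_config.
struct sstable_writer_config {
    uint64_t estimated_partitions = 0;
};

// The writer clamps the estimate to at least 1 itself, so callers no
// longer need to remember to do it before constructing a writer.
uint64_t effective_estimated_partitions(const sstable_writer_config& cfg) {
    return std::max<uint64_t>(cfg.estimated_partitions, 1);
}
```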
There is one monitor per writer, so we now keep them together in the
compaction_writer struct.
This trivially guarantees that the monitor is always destroyed before
the writer.
Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>
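The guarantee follows from C++ destroying members in reverse declaration order; a self-contained sketch with logging stand-ins for the real types:

```cpp
#include <string>

static std::string destruction_order;

// Logging stand-ins for the real writer and monitor types.
struct sstable_writer_stub {
    ~sstable_writer_stub() { destruction_order += "writer"; }
};
struct write_monitor_stub {
    ~write_monitor_stub() { destruction_order += "monitor,"; }
};

// Members are destroyed in reverse declaration order, so declaring the
// monitor after the writer guarantees ~monitor runs first.
struct compaction_writer {
    sstable_writer_stub writer;
    write_monitor_stub monitor;
};
```

Because the ordering is enforced by the language, no code path can accidentally destroy the writer before its monitor.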
By convention, print the following information
in all compaction log messages:
[{compaction.type} {keyspace}.{table} {compaction.uuid}]
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Rather than logging the message in the virtual callee method
just return a string description and make the logger call in
the common caller.
1. There is no need to do the logger call in the callee;
it is simpler to format the log message in the caller
and just retrieve the per-compaction-type description.
2. Prepare to centrally print the compaction uuid.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
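The refactoring can be sketched like this, with illustrative names; the real class and method names in Scylla differ.

```cpp
#include <string>

// The virtual hook now only returns a description...
struct compaction_base {
    virtual ~compaction_base() = default;
    virtual std::string report_start_desc() const = 0;
};

struct cleanup_compaction_stub : compaction_base {
    std::string report_start_desc() const override { return "Cleaning"; }
};

// ...and the single common caller formats the log line, making it easy
// to later prepend type/keyspace/uuid in exactly one place.
std::string format_start_message(const compaction_base& c) {
    return "[compaction] " + c.report_start_desc();
}
```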
We'd like to use the same uuid both for printing compaction log
messages and to update compaction_history.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
This patch moves the resharding process to use the new
directory_with_sstables_handler infrastructure. There is no longer
a clear reshard step, and that just becomes a natural part of
populate_column_family.
In main.cc, a couple of changes are necessary to make that happen.
The first one obviously is to stop calling reshard. We also need to
make sure that:
- The compaction manager is started much earlier, so we can register
resharding jobs with it.
- auto compactions are disabled in the populate method, so resharding
doesn't have to fight for bandwidth with auto compactions.
Now that we are resharding through the sstable_directory, the old
resharding code can be deleted. There is no need to deal with
the resharding backlog either, because the SSTables are not yet
added to the sstable set at this point.
Signed-off-by: Glauber Costa <glauber@scylladb.com>
From the point of view of SSTable selection and expected output,
Reshaping really is just a normal compaction. However, there are some
key differences that we would like to uphold:
- Reshaping is done separately from the main SSTable set. It can be
done with the node offline, or it can be done in a separate priority
class. Either way, we don't want those SSTables to count towards
backlog. For reads, because the SSTables are not yet registered in
the backlog tracker (if offline or coming from upload), if we were
to deduct compaction charges from it we would go negative. For writes,
we don't want to deal with backlog management here because we will add
the SSTable at once when reshaping is finished.
- We don't need to do early replacements.
- We would like to clearly mark the Reshaping compactions as such in the
  logs.
For the reasons above, it is nicer to add a new Reshape compaction type,
a subclass of compaction, which upholds these properties.
Signed-off-by: Glauber Costa <glauber@scylladb.com>
This is so we don't always use MB. Sometimes it is best
to report GB, TB, and their equivalent throughput metrics.
Signed-off-by: Glauber Costa <glauber@scylladb.com>
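A sketch of such a unit-scaling printer, assuming decimal units; the function name and exact formatting in Scylla may differ.

```cpp
#include <array>
#include <cstdio>
#include <string>

// Scales a byte count into the largest unit that keeps the value below
// 1000, so large sizes are reported in GB/TB rather than always in MB.
std::string pretty_printed_size(double bytes) {
    static constexpr std::array<const char*, 5> suffixes{
        "bytes", "kB", "MB", "GB", "TB"};
    size_t i = 0;
    while (bytes >= 1000 && i + 1 < suffixes.size()) {
        bytes /= 1000;
        ++i;
    }
    char buf[32];
    std::snprintf(buf, sizeof(buf), "%.2f%s", bytes, suffixes[i]);
    return buf;
}
```

The same scaling applies directly to throughput (bytes per second), which is what the commit means by equivalent throughput metrics.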
There are some functions that are pure virtual today but have an obvious
implementation (for example on_new_partition: do nothing). We'll add
default implementations to the compaction class, which reduces the
boilerplate needed to add a new compaction type.
Signed-off-by: Glauber Costa <glauber@scylladb.com>
The sstable_set is now an optional, and if we don't want to expire data
it will be empty. We need to check that it is not empty before
dereferencing it.
Signed-off-by: Glauber Costa <glauber@scylladb.com>
Message-Id: <20200610170647.142817-1-glauber@scylladb.com>
When compaction A completes, a request is issued so that all parallel compactions
will replace compaction A's input sstables with the respective output sstables in
the SSTable set snapshot used for expiration purposes.
That's done to allow the space of input SSTables to be released as soon as possible,
which helps incremental compaction a lot, but also the non-incremental approach.
Recently I came to the realization that we're copying the SSTable set when doing the
replacement, to make the code exception safe. But it turns out that if an exception
is triggered, the compaction will fail anyway. So this copy is useless and a
potential source of reactor stalls if strategies like LCS are used.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20200608192614.9354-1-raphaelsc@scylladb.com>
compaction.hh is one of our heavy headers, but some users just want to
use information on it about how to describe a compaction, not how to
perform one.
For that reason this patch splits the compaction_descriptor into a new
header.
The compaction_descriptor has, as a member type, compaction_options.
That is moved too, and brings with it the compaction_type. Both of those
structures would make sense in a separate header anyway.
The compaction_descriptor also wants the creator_fn and replacer_fn
functions. We also take this opportunity to rename them to something
more descriptive.
Signed-off-by: Glauber Costa <glauber@scylladb.com>