scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	a56e2c83f3	sstables: Keep priority class on sstable_directory Current code accepts priotity class as an argument to various functions that need it and all its callers use streaming class. Next patches will needs to sometimes use default class, but it will require heavy patching of the distributed loader. Things get simpler if the priority class is kept on sstable_directory on start. This change also simplifies the ongoing effort on unification of sched and IO classes. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-07-19 12:14:41 +03:00
Botond Dénes	9afd2dc428	Merge 'Make compaction manager switch to table abstraction ' from Raphael "Raph" Carvalho This work gets us a step closer to compaction groups. Everything in compaction layer but compaction_manager was converted to table_state. After this work, we can start implementing compaction groups, as each group will be represented by its own table_state. User-triggered operations that span the entire table, not only a group, can be done by calling the manager operation on behalf of each group and then merging the results, if any. Closes #11028 * github.com:scylladb/scylla: compaction: remove forward declaration of replica::table compaction_manager: make add() and remove() switch to table_state compaction_manager: make run_custom_job() switch to table_state compaction_manager: major: switch to table_state compaction_manager: scrub: switch to table_state compaction_manager: upgrade: switch to table_state compaction: table_state: add get_sstables_manager() compaction_manager: cleanup: switch to table_state compaction_manager: offstrategy: switch to table_state() compaction_manager: rewrite_sstables(): switch to table_state compaction_manager: make run_with_compaction_disabled() switch to table_state compaction_manager: compaction_reenabler: switch to table_state compaction_manager: make submit(T) switch to table_state compaction_manager: task: switch to table_state compaction: table_state: Add is_auto_compaction_disabled_by_user() compaction: table_state: Add on_compaction_completion() compaction: table_state: Add make_sstable() compaction_manager: make can_proceed switch to table_state compaction_manager: make stop compaction procedures switch to table_state compaction_manager: make get_compactions() switch to table_state compaction_manager: change task::update_history() to use table_state instead compaction_manager: make can_register_compaction() switch to table_state compaction_manager: make get_candidates() switch to table_state compaction_manager: make propagate_replacement() switch to table_state compaction: Move table::in_strategy_sstables() and switch to table_state compaction: table_state: Add maintenance sstable set compaction_manager: make has_table_ongoing_compaction() switch to table_state compaction_manager: make compaction_disabled() switch to table_state compaction_manager: switch to table_state for mapping of compaction_state compaction_manager: move task ctor into source	2022-07-18 15:18:29 +03:00
Benny Halevy	2e37dcf62a	database: drop_table_on_all_shards: remove table directory having no snapshots If the table to remove has no snapshots then completely remove its directory on storage as the left-over directory slows down operations on the keyspace and makes searching for live tables harder. Fixes #10896 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-17 14:33:34 +03:00
Benny Halevy	dd481e9f58	sstables: define table_subdirectories Define a constexpr array of all official table sub-dorectories. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-17 14:33:34 +03:00
Benny Halevy	c4a42c3a3f	sstables: officially define pending_delete_dir Rather than using the "pending_delete" string in `pending_delete_dir_basename()`, so it can be orderly removed in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-07-17 14:33:34 +03:00
Raphael S. Carvalho	31655acb5e	compaction_manager: make run_custom_job() switch to table_state Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-07-16 21:35:06 -03:00
Nadav Har'El	a7fa29bceb	cross-tree: fix header file self-sufficiency Scylla's coding standard requires that each header is self-sufficient, i.e., it includes whatever other headers it needs - so it can be included without having to include any other header before it. We have a test for this, "ninja dev-headers", but it isn't run very frequently, and it turns out our code deviated from this requirement in a few places. This patch fixes those places, and after it "ninja dev-headers" succeeds again. Fixes #10995 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #10997	2022-07-08 12:59:14 +03:00
Avi Kivity	973d2a58d0	Merge 'docs: move docs to docs/dev folder' from David Garcia In order to allow our Scylla OSS customers the ability to select a version for their documentation, we are migrating the Scylla docs content to the Scylla OSS repository. This PR covers the following points of the [Migration Plan](https://docs.google.com/document/d/15yBf39j15hgUVvjeuGR4MCbYeArqZrO1ir-z_1Urc6A/edit#): 1. Creates a subdirectory for dev docs: /docs/dev 2. Moves the existing dev doc content in the scylla repo to /docs/dev, but keep Alternator docs in /docs. 3. Flattens the structure in /docs/dev (remove the subfolders). 4. Adds redirects from `scylla.docs.scylladb.com/<version>/<document>` to `https://github.com/scylladb/scylla/blob/master/docs/dev/<document>.md` 5. Excludes publishing docs for /docs/devs. 1. Enter the docs folder with `cd docs`. 2. Run `make redirects`. 3. Enter the docs folder and run `make preview`. The docs should build without warnings. 4. Open http://127.0.0.1:5500 in your browser. You shoul donly see the alternator docs. 5. Open http://127.0.0.1:5500/stable/design-notes/IDL.html in your browser. It should redirect you to https://github.com/scylladb/scylla/blob/master/docs/dev/IDL.md and raise a 404 error since this PR is not merged yet. 6. Surf the `docs/dev` folder. It should have all the scylla project internal docs without subdirectories. Closes #10873 * github.com:scylladb/scylla: Update docs/conf.py Update docs/dev/protocols.md Update docs/dev/README.md Update docs/dev/README.md Update docs/conf.py Fix broken links Remove source folder Add redirections Move dev docs to docs/dev	2022-07-03 20:37:11 +03:00
David Garcia	8e7ebea335	Merge remote-tracking branch 'upstream/master' into move-dev-docs	2022-06-28 11:02:38 +01:00
Botond Dénes	6c818f8625	Merge 'sstables: generation_type tidy-up' from Michael Livshin - Use `sstables::generation_type` in more places - Enforce conceptual separation of `sstables::generation_type` and `int64_t` - Fix `extremum_tracker` so that `sstables::generation_type` can be non-default-constructible Fixes #10796. Closes #10844 * github.com:scylladb/scylla: sstables: make generation_type an actual separate type sstables: use generation_type more soundly extremum_tracker: do not require default-constructible value types	2022-06-28 08:50:12 +03:00
Benny Halevy	81fa1ce9a1	Revert 'Compact staging sstables' This patch reverts the following patches merged in `78750c2e1a` "Merge 'Compact staging sstables' from Benny Halevy" > `597e415c38` "table: clone staging sstables into table dir" > `ce5bd505dc` "view_update_generator: discover_staging_sstables: reindent" > `59874b2837` "table: add get_staging_sstables" > `7536dd7f00` "distributed_loader: populate table directory first" The feature causes regressions seen with e.g. https://jenkins.scylladb.com/view/master/job/scylla-master/job/dtest-daily-release/41/testReport/materialized_views_test/TestMaterializedViews/Run_Dtest_Parallel_Cloud_Machines___FullDtest___full_split011___test_base_replica_repair/ ``` AssertionError: Expected [[0, 0, 'a', 3.0]] from SELECT * FROM t_by_v WHERE v = 0, but got [] ``` Where views aren't updated properly. Apparently since `table::stream_view_replica_updates` doesn't exclude the staging sstables anymore and since they are cloned to the base table as new sstables it seems to the view builder that no view updates are required since there's no changes comparing to the base table. Reopens #9559 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #10890	2022-06-27 12:18:48 +03:00
David Garcia	bb21c3c869	Move dev docs to docs/dev	2022-06-24 18:07:08 +01:00
Benny Halevy	597e415c38	table: clone staging sstables into table dir clone staging sstables so their content may be compacted while views are built. When done, the hard-linked copy in the staging subdirectory will be simply unlinked. Fixes #9559 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-06-23 16:55:27 +03:00
Benny Halevy	cd68b04fbf	sstables: time_series_sstable_set: insert: make exception safe Need to erase the shared sstable from _sstables if insertion to _sstables_reversed fails. Fixes #10787 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-06-23 16:55:27 +03:00
Benny Halevy	9d41676116	sstables: move_to_new_dir: fix debug log message Remove extraneous `old_dir` arg. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-06-23 16:55:27 +03:00
Michael Livshin	d7c90b5239	sstables: make generation_type an actual separate type Now that `generation_type` is used properly (at least in some places), we turn to the compiler to help keep the generation/value separation intact. Fixes #10796. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-06-21 20:08:01 +03:00
Raphael S. Carvalho	aa667e590e	sstable_set: Fix partitioned_sstable_set constructor The sstable set param isn't being used anywhere, and it's also buggy as sstable run list isn't being updated accordingly. so it could happen that set contains sstables but run list is empty, introducing inconsistency. we're fortunate that the bug wasn't activated as it would've been a hard one to catch. found this while auditting the code. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220617203438.74336-1-raphaelsc@scylladb.com>	2022-06-21 11:58:13 +03:00
Michael Livshin	ab13127761	sstables: use generation_type more soundly `generation_type` is (supposed to be) conceptually different from `int64_t` (even if physically they are the same), but at present Scylla code still largely treats them interchangeably. In addition to using `generation_type` in more places, we provide (no-op) `generation_value()` and `generation_from_value()` operations to make the smoke-and-mirrors more believable. The churn is considerable, but all mechanical. To avoid even more (way, way more) churn, unit test code is left untreated for now, except where it uses the affected core APIs directly. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-06-20 19:37:31 +03:00
Benny Halevy	89c5e8413f	sstable_directory: process_sstable_dir: fixup indentation Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-06-15 13:56:11 +03:00
Benny Halevy	6cafd83e1c	sstable_directory: process_sstable_dir: close directory_lister on error Otherwise, if we don't consume all lister's entries, ~directory_lister terminates since the directory_lister is destroyed without being closed. Add unit test to reproduce this issue. Fixes #10697 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-06-15 13:56:10 +03:00
Avi Kivity	06a62b150d	sstables: processing_result_generator: prefer standard coroutines over the technical specification with clang 14 Clang up to version 13 supports the coroutines technical specification (in std::experimental). 15 and above support standard coroutines (in namespace std). Clang 14 supports both, but with a warning for the technical specification coroutines. To avoid the warning, change the threshold for selecting standard coroutines from clang 15 to clang 14. This follow seastar commit 070ab101e2. Closes #10647	2022-06-12 20:05:28 +03:00
Botond Dénes	605ee74c39	Merge 'sstables: save Scylla version & build id in metadata' from Michael Livshin To provide a reasonably-definitive answer to "what exact version of Scylla wrote this?". Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Closes #10712 * github.com:scylladb/scylla: docs: document recently-added Scylla sstable metadata sections sstables: save Scylla version & build id in metadata scylla_sstable: generalize metadata visitor for disk_string build_id: cache the value	2022-06-03 07:49:51 +03:00
Botond Dénes	49215fcff7	Merge 'Remove `flat_mutation_reader` (v1)' from Michael Livshin - Introduce a simpler substitute for `flat_mutation_reader`-resulting-from-a-downgrade that is adequate for the remaining uses but is _not_ a full-fledged reader (does not redirect all logic to an `::impl`, does not buffer, does not really have `::peek()`), so hopefully carries a smaller performance overhead. The name `mutation_fragment_v1_stream` is kind of a mouthful but it's the best I have - (not tests) Use the above instead of `downgrade_to_v1()` - Plug it in as another option in `mutation_source`, in and out - (tests) Substitute deliberate uses of `downgrade_to_v1()` with `mutation_fragment_v1_stream()` - (tests) Replace all the previously-overlooked occurrences of `mutation_source::make_reader()` with `mutation_source::make_reader_v2()`, or with `mutation_source::make_fragment_v1_stream()` where deliberate or still required (see below) - (tests) This series still leaves some tests with `mutation_fragment_v1_stream` (i.e. at v1) where not called for by the test logic per se, because another missing piece of work is figuring out how to properly feed `mutation_fragment_v2` (i.e. range tombstone changes) to `mutation_partition`. While that is not done (and I think it's better to punt on it in this PR), we have to produce `mutation_fragment` instances in tests that `apply()` them to `mutation_partition`, thus we still use downgraded readers in those tests - Remove the `flat_mutation_reader` class and things downstream of it Fixes #10586 Closes #10654 * github.com:scylladb/scylla: fix "ninja dev-headers" flat_mutation_reader ist tot tests: downgrade_to_v1() -> mutation_fragment_v1_stream() tests: flat_reader_assertions: refactor out match_compacted_mutation() tests: ms.make_reader() -> ms.make_fragment_v1_stream() repair/row_level: mutation_fragment_v1_stream() instead of downgrade_to_v1() stream_transfer_task: mutation_fragment_v1_stream() instead of downgrade_to_v1() sstables_loader: mutation_fragment_v1_stream() instead of downgrade_to_v1() mutation_source: add ::make_fragment_v1_stream() introduce mutation_fragment_v1_stream tests: ms.make_reader() -> ms.make_reader_v2() tests: remove test_downgrade_to_v1_clear_buffer() mutation_source_test: fix indentation tests: remove some redundant calls to downgrade_to_v1() tests: remove some to-become-pointless ms.make_reader()-using tests tests: remove some to-become-pointless reader downgrade tests	2022-06-03 07:26:29 +03:00
Botond Dénes	0a25a2bff3	sstables: validate_checksums(): more readable checksum mismatch messages Replace: Compressed chunk checksum mismatch at chunk {}, offset {}, for chunk of size {}: expected={}, actual={} With: Compressed chunk checksum mismatch at offset {}, for chunk #{} of size {}: expected={}, actual={} This is a follow-up for #10693. Also bring the uncompressed chunk checksum check messages up to date with the compressed one (which #10693 forgot to do). Another change included is merging the advancement of the chunk index with the iteration over the chunks, so we don't maintain two counters (one in the iterator and an explicit one). Closes #10715	2022-06-02 19:38:39 +03:00
Michael Livshin	fc1b957367	sstables: save Scylla version & build id in metadata To provide a reasonably-definitive answer to "what exact version of Scylla wrote this?". Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-06-02 11:21:05 +03:00
Avi Kivity	f5062f4b5a	Merge 'Use generation_type for SSTable ancestors' from Raphael "Raph" Carvalho To avoid a discrepancy about underlying generation type once something other than integer is allowed for the sstable generation. Also simplifies one generic writer interface for sealing sstable statistics. Closes #10703 * github.com:scylladb/scylla: sstables: Use generation_type for compaction ancestors sstables: Make compaction ancestors optional when sealing statistics	2022-06-01 19:55:08 +03:00
Michael Livshin	029508b77c	flat_mutation_reader ist tot Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-05-31 23:42:34 +03:00
Raphael S. Carvalho	2a7eb16c02	sstables: Use generation_type for compaction ancestors Let's also use generation_type for compaction ancestors, so once we support something other than integer for SSTable generation, we won't have discrepancy about what the generation type is. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-05-31 15:28:02 -03:00
Raphael S. Carvalho	d36604703f	sstables: Make compaction ancestors optional when sealing statistics Compaction ancestors is only available in versions older than mx, therefore we can make it optional in seal_statistics(). The motivation is that mx writer will no longer call sstable::compaction_ancestors() which return type will be soon changed to type generation_type, so the returned value can be something other than an integer, e.g. uuid. We could kill compaction_ancestors in seal_statistics interface, but given that most generic write functions still work for older versions, if there were still a writer for them, I decided to not do it now. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-05-31 15:26:03 -03:00
Mikołaj Sielużycki	bc18e97473	sstable_writer: Fix mutation order violation The change - adds a test which exposes a problem of a peculiar setup of tombstones that trigger a mutation fragment stream validation exception - fixes the problem Applying tombstones in the order: range_tombstone_change pos(ck1), after_all_prefixed, tombstone_timestamp=1 range_tombstone_change pos(ck2), before_all_prefixed, tombstone=NONE range_tombstone_change pos(NONE), after_all_prefixed, tombstone=NONE Leads to swapping the order of mutations when written and read from disk via sstable writer. This is caused by conversion of range_tombstone_change (in memory representation) to range tombstone marker (on disk representation) and back. When this mutation stream is written to disk, the range tombstone markers type is calculated based on the relationship between range_tombstone_changes. The RTC series as above produces markers (start, end, start). When the last marker is loaded from disk, it's kind gets incorrectly loaded as before_all_prefixed instead of after_all_prefixed. This leads to incorrect order of mutations. The solution is to skip writing a new range_tombstone_change with empty tombstone if the last range_tombstone_change already has empty tombstone. This is redundant information and can be safely removed, while the logic of encoding RTCs as markers doesn't handle such redundancy well. Closes #10643	2022-05-31 13:39:48 +03:00
Avi Kivity	4b53af0bd5	treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines coroutine::parallel_for_each avoids an allocation and is therefore preferred. The lifetime of the function object is less ambiguous, and so it is safer. Replace all eligible occurences (i.e. caller is a coroutine). One case (storage_service::node_ops_cmd_heartbeat_updater()) needed a little extra attention since there was a handle_exception() continuation attached. It is converted to a try/catch. Closes #10699	2022-05-31 09:06:24 +03:00
Botond Dénes	3a943b23fb	sstables: validate_checksums(): add chunk index to error message When logging a failed checksum on a compressed chunk. Currently, only the offset is logged, but the index of the chunk whose checksum failed to validate is also interesting. Closes #10693	2022-05-30 17:11:28 +03:00
Avi Kivity	19316dee94	Merge 'auto-scale promoted index' from Benny Halevy Add column_index_auto_scale_threshold_in_kb to the configuration (defaults to 10MB). When the promoted index (serialized) size gets to this threshold, it's halved by merging each two adjacent blocks into one and doubling the desired_block_size. Fixes #4217 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #10646 * github.com:scylladb/scylla: sstables: mx: add pi_auto_scale_events metric sstables: mx/writer: auto-scale promoted index	2022-05-25 10:27:19 +03:00
Tomasz Grabiec	f87274f66a	sstable: partition_index_cache: Fix abort on bad_alloc during page loading When entry loading fails and there is another request blocked on the same page, attempt to erase the failed entry will abort because that would violate entry_ptr guarantees, which is supposed to keep the entry alive. The fix in `92727ac36c` was incomplete. It only helped for the case of a single loader. This patch makes a more general approach by relaxing the assert. The assert manifested like this: scylla: ./sstables/partition_index_cache.hh:71: sstables::partition_index_cache::entry::~entry(): Assertion `!is_referenced()' failed. Fixes #10617 Closes #10653	2022-05-25 09:27:04 +03:00
Benny Halevy	33bad72fd2	sstables: mx: add pi_auto_scale_events metric Counts the number of promoted index auto-scale events. A large number of those, relative to `partition_writes`, indicates that `column_index_size_in_kb` should be increased. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-05-24 13:32:39 +03:00
Benny Halevy	6677028212	sstables: mx/writer: auto-scale promoted index Add column_index_auto_scale_threshold_in_kb to the configuration (defaults to 10MB). When the promoted index (serialized) size gets to this threshold, it's halved by merging each two adjacent blocks into one and doubling the desired_block_size. Fixes #4217 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-05-24 13:32:35 +03:00
Benny Halevy	868cea21e0	sstable_directory: parallel_for_each_restricted: keep func alive across calls Without that there's use-after-free when called from distributed_loader::make_sstables_available where func is turned into a coroutine and the shared_sstable parameter is not explicitly copied and captured for the continuation of sst->move_to_new_dir. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-05-20 17:07:53 +03:00
Avi Kivity	5c481973a3	sstables/processing_result_generator.hh: refine check for coroutine standard We have a check for whether we can use standard coroutines (in namespace std) or the technical specification (in std::experimental), but it doesn't work since Clang doesn't report the correct standard version. Use a compiler versionspecific check, inspired by Seastar's check. This allows building with clang 14. Closes #10603	2022-05-19 11:31:40 +03:00
Benny Halevy	8a6f8c622d	sstables: writer: pass bytes_ostream by reference The bytes_stream param is passed by value from `write_promoted_index` (since `0d8463aba5`) causing an uneeded copy. This can lead to OOM if the promoted index is extremely large. Pass the bytes_ostream by reference instead to prevent this copy. Fixes #10569 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #10570	2022-05-15 17:53:38 +03:00
Avi Kivity	528ab5a502	treewide: change metric calls from make_derive to make_counter make_derive was recently deprecated in favor of make_counter, so make the change throughput the codebase. Closes #10564	2022-05-14 12:53:55 +02:00
Avi Kivity	5937b1fa23	treewide: remove empty comments in top-of-files After `fcb8d040` ("treewide: use Software Package Data Exchange (SPDX) license identifiers"), many dual-licensed files were left with empty comments on top. Remove them to avoid visual noise. Closes #10562	2022-05-13 07:11:58 +02:00
Piotr Sarna	209c2f5d99	sstables: define generation_type for sstables No functional changes intended - this series is quite verbose, but after it's in, it should be considerably easier to change the type of SSTable generations to something else - e.g. a string or timeUUID. Closes #10533	2022-05-11 14:46:30 +02:00
Botond Dénes	7501a075bd	sstables/index_reader: push down eof() check to advance_to(index_bound&, dht::ring_position_view) Commit `e8f3d7dd13` added eof() checks to public partition-level advance_to() methods, to ensure we do not attempt to re-read the last page of the index when at eof(). It was noted however that this check would be safer in advance_to(index_bound&, dht::ring_position_view) because that is the method that all these higher-level methods end up calling. Placing the check there would guarantee safety for all such operations. This path does exactly that: it pushes down the check to said method. One change needed for this to work is to check eof on the bound that is currently advanced, instead of unconditionally checking the lower bound. Closes #10531	2022-05-11 14:46:30 +02:00
Avi Kivity	94f677b790	Merge 'sstables/index_reader: short-circuit fast-forward-to when at EOF' from Botond Dénes Attempting to call advance_to() on the index, after it is positioned at EOF, can result in an assert failure, because the operation results in an attempt to move backwards in the index-file (to read the last index page, which was already read). This only happens if the index cache entry belonging to the last index page is evicted, otherwise the advance operation just looks-up said entry and returns it. To prevent this, we add an early return conditioned on eof() to all the partition-level advance-to methods. A regression unit test reproducing the above described crash is also added. Fixes: #10403 Closes #10491 * github.com:scylladb/scylla: sstables/index_reader: short-circuit fast-forward-to when at EOF test/lib/random_schema: add a simpler overload for fixed partition count	2022-05-08 14:17:40 +03:00
Avi Kivity	287c01ab4d	Merge ' sstables: consumer: reuse the fragmented_temporary_buffer in read_bytes()' from Michał Chojnowski primitive_consumer::read_bytes() destroys and creates a vector for every value it reads. This happens for every cell. We can save a bit of work by reusing the vector. Closes #10512 * github.com:scylladb/scylla: sstables: consumer: reuse the fragmented_temporary_buffer in read_bytes() utils: fragmented_temporary_buffer: add release()	2022-05-08 11:26:31 +03:00
Michał Chojnowski	ddc535a4a2	sstables: consumer: reuse the fragmented_temporary_buffer in read_bytes() read_bytes destroys and creates a vector for every value it reads. This happens for every cell. We can save a bit of work by reusing the vector.	2022-05-07 13:04:16 +02:00
Botond Dénes	e8f3d7dd13	sstables/index_reader: short-circuit fast-forward-to when at EOF Attempting to call advance_to() on the index, after it is positioned at EOF, can result in an assert failure, because the operation results in an attempt to move backwards in the index-file (to read the last index page, which was already read). This only happens if the index cache entry belonging to the last index page is evicted, otherwise the advance operation just looks-up said entry and returns it. To prevent this, we add an early return conditioned on eof() to all the partition-level advance-to methods. A regression unit test reproducing the above described crash is also added.	2022-05-05 14:42:37 +03:00
Avi Kivity	aee94b7176	Merge "Convert remaining mutation sources to v2" from Botond " After the recent conversion of the row-cache, two v1 mutation sources remained: the memtable and the kl sstable reader. This series converts both to a native v2 implementation. The conversion is shallow: both continue to read and process the underlying (v1) data in v1, the fragments are converted to v2 right before being pushed to the reader's buffer. This conversion is simple, surgical and low-risk. It is also better than the upgrade_to_v2() used previously. Following this, the remaining v1 reader implementations are removed, with the exception of the downgrade_to_v1(), which is the only one left at this point. Removing this requires converting all mutation sinks to accept a v2 stream. upgrade_to_v2() is now not used in any production code. It is still needed to properly test downgrade_to_v1() (which is till used), so we can't remove it yet. Instead it hidden as a private method of mutation_source. This still allows for the above mentioned testing to continue, while preventing anyone from being tempted to introduce new usage. tests: https://jenkins.scylladb.com/job/releng/job/Scylla-CI/191 " * 'convert-remaining-v1-mutation-sources/v2' of https://github.com/denesb/scylla: readers: make upgrade_to_v2() private test/lib/mutation_source_test: remove upgrade_to_v2 tests readers: remove v1 forwardable reader readers: remove v1 empty_reader readers: remove v1 delegating_reader sstables/kl: make reader impl v2 native sstables/kl: return v2 reader from factory methods sstables: move mp_row_consumer_reader_k_l to kl/reader.cc partition_snapshot_reader: convert implementation to native v2 mutation_fragment_v2: range_tombstone_change: add minimal_memory_usage()	2022-04-28 20:31:23 +03:00
Botond Dénes	70d019116f	sstables/kl: make reader impl v2 native The conversion is shallow: the meat of the logic remains v1, fragments are converted to v2 right before being pushed into the buffer. This approach is simple, surgical and is still better then a full upgrade_to_v2().	2022-04-28 14:12:24 +03:00
Botond Dénes	a22b02c801	sstables/kl: return v2 reader from factory methods This just moves the upgrade_to_v2() calls to the other side of said factory methods, preparing the ground for converting the kl reader impl to a native v2 one.	2022-04-28 14:12:24 +03:00

1 2 3 4 5 ...

2808 Commits