Commit Graph

11 Commits

Author SHA1 Message Date
Ernest Zaslavsky
8d49bb8af2 sstables: Start using make_data_or_index_source in sstable
Convert all the necessary methods to be awaitable, and start using `make_data_or_index_source`
when creating the data_source for the data and index components.

For the compressed/checksummed input streams to work correctly, start passing
stream-creator functors to `make_(checksummed/compressed)_file_(k_l/m)_format_input_stream`.
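
As a rough illustration of the pattern (the names below are hypothetical, not the ScyllaDB API): a stream-creator functor lets the stream factory ask the sstable for the raw input stream instead of opening it itself.

```
#include <seastar/core/iostream.hh>
#include <cstdint>
#include <functional>

// Hypothetical sketch: the sstable supplies a functor that knows how to
// open the raw stream for a given range; the checksummed/compressed
// factory calls it instead of opening the file itself.
using stream_creator_fn =
    std::function<seastar::input_stream<char>(uint64_t offset, uint64_t len)>;

seastar::input_stream<char> make_wrapped_input_stream(stream_creator_fn create,
                                                      uint64_t offset,
                                                      uint64_t len) {
    auto raw = create(offset, len);
    // ... wrap `raw` with checksum validation / decompression here ...
    return raw;
}
```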
2025-07-15 10:10:23 +03:00
Ernest Zaslavsky
8ac2978239 sstables: coroutinize futurized readers
Coroutinize the futurized readers and sources to prepare for using `make_data_or_index_source` in `sstable`.
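
For illustration, a minimal sketch (not the actual ScyllaDB code) of what coroutinizing a futurized `get()` looks like:

```
#include <seastar/core/coroutine.hh>
#include <seastar/core/iostream.hh>
#include <utility>

using namespace seastar;

class wrapping_source : public data_source_impl {
    data_source _underlying;
public:
    explicit wrapping_source(data_source ds) : _underlying(std::move(ds)) {}

    // Continuation style, before:
    //   return _underlying.get().then([] (temporary_buffer<char> buf) {
    //       /* per-chunk work */
    //       return make_ready_future<temporary_buffer<char>>(std::move(buf));
    //   });
    future<temporary_buffer<char>> get() override {
        auto buf = co_await _underlying.get();
        // per-chunk work happens here, with plain control flow
        co_return std::move(buf);
    }
};
```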
2025-07-06 09:18:39 +03:00
Avi Kivity
f3eade2f62 treewide: relicense to ScyllaDB-Source-Available-1.0
Drop the AGPL license in favor of a source-available license.
See the blog post [1] for details.

[1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/
2024-12-18 17:45:13 +02:00
Botond Dénes
f55dc71c3f Merge 'Use checksummed input streams in validate_checksums()' from Nikos Dragazis
With commits ed7d352e7d and bb1867c7c7, we now have input streams for both compressed and uncompressed SSTables that provide seamless checksum and digest checking. The code for these was based on `validate_checksums()`, which implements its own validation logic over raw streams. This has led to some duplicate code.

This PR deduplicates the uncompressed case by modifying `validate_checksums()` to use a checksummed input stream instead of a raw stream. The same cannot be done for compressed SSTables though. The reason is that `validate_checksums()` needs to examine the whole data file, even if an invalid chunk is encountered. In the checksummed case we support that by offloading the error handling logic from the data source via a function parameter. In the compressed data source we cannot do that because it needs to return decompressed data and decompression may fail if the data are invalid.

This PR also enables `validate_checksums()` to partially verify SSTables with just the per-chunk checksums if the digest is missing.

In more detail, this PR consists of:
* Port of some integrity checks from `do_validate_uncompressed()` to the checksummed data source. It should now be able to detect corruption due to truncated or appended chunks (the expected number of chunks is retrieved from the CRC component).
* Introduction of an `error_handler` parameter in the checksummed data source and in `data_stream()` (see the sketch after this list).
* Refactoring of `validate_checksums()`. The JSON response of `sstable validate-checksums` was also modified to report a missing digest.
* Tests for `validate_checksums()` against SSTables with truncated data, appended data, invalid digests, or no digest.
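
A minimal sketch of the error-handler idea on the consumer side, with hypothetical names (not the ScyllaDB API):

```
#include <exception>
#include <functional>

// The checksummed data source reports a bad chunk through a
// caller-supplied handler instead of throwing, so a validation scan can
// record the failure and keep consuming the rest of the data file.
using error_handler = std::function<void(std::exception_ptr)>;

struct validation_state {
    bool valid = true;
};

inline error_handler make_validating_handler(validation_state& state) {
    return [&state] (std::exception_ptr) {
        state.valid = false; // remember the corruption, do not abort the scan
    };
}
```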

Refs #19058.

This PR is a hybrid of cleanup and feature. No backport is needed.

Closes scylladb/scylladb#20933

* github.com:scylladb/scylladb:
  tools/scylla-sstable: Rename valid_checksums -> valid
  test: Check validate_checksums() with missing digest
  sstables: Allow validate_checksums() to report missing digests
  sstables: Refactor validate_checksums() to use checksummed data stream
  sstables: Add error_handler parameter to data_stream()
  sstables: Add error handler in checksummed data source
  sstables: Check for excessive chunks in checksummed data source
  sstables: Check for premature EOF in checksummed data source
  test: test_validate_checksums: Check SSTable with invalid digest
  test: test_validate_checksums: Check SSTable with appended data
  test: test_validate_checksums: Complement test for truncated SSTable
2024-12-04 10:46:18 +02:00
Nikos Dragazis
6091d5d789 sstables: Fix range of input stream in checksummed file data source
The checksummed file data source uses the chunk size to enforce that
reads from the underlying file input stream are aligned to chunk
boundaries. This is necessary so that we can validate the checksum of
each chunk.

However, a mismatch in the numeric types caused a bug where the
underlying file input stream would read a smaller portion of the data
file than expected.

The bug is located in the following lines:

```
auto start = _beg_pos & ~(chunk_size - 1);
auto end = (_end_pos & ~(chunk_size - 1)) + chunk_size;
```

`_beg_pos` and `_end_pos` are `uint64_t`, whereas `chunk_size` is
`uint32_t`. The mask `~(chunk_size - 1)` is therefore computed in 32 bits
and only then zero-extended to `uint64_t` for the AND operation. Since the
four most-significant bytes of the widened mask are all zeros, the AND
erroneously clears the corresponding bytes of the position.

Fix the bug by explicitly converting the chunk size to `uint64_t` before
any arithmetic operations. Also, replace the handwritten alignment
implementations with the `align_up()` and `align_down()` helpers.

Finally, restrict the file end position to not exceed the file length.
Since the last chunk can be smaller than the chunk size, it could happen
that the end position exceeds the file length after the round-up. This
is not a bug on its own, since `make_file_input_stream()` can accept
lengths that go beyond end-of-file, but it still makes the code more
error-prone, so it should be avoided.
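
A self-contained demonstration of the truncation, with the fix expressed as widening before masking (the actual patch uses the `align_up()`/`align_down()` helpers):

```
#include <cinttypes>
#include <cstdint>
#include <cstdio>

int main() {
    uint64_t beg_pos = UINT64_C(0x100001234); // a position above 4 GiB
    uint32_t chunk_size = 4096;

    // Buggy: ~(chunk_size - 1) is computed in 32 bits and then
    // zero-extended, so the upper 32 bits of the mask are zero and the
    // AND clears the upper bytes of the position.
    uint64_t buggy = beg_pos & ~(chunk_size - 1);

    // Fixed: widen the chunk size first, so the mask keeps its upper bits.
    uint64_t fixed = beg_pos & ~(uint64_t(chunk_size) - 1);

    printf("buggy: 0x%" PRIx64 "\n", buggy); // prints 0x1000
    printf("fixed: 0x%" PRIx64 "\n", fixed); // prints 0x100001000
}
```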

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>

Closes scylladb/scylladb#21665
2024-11-28 12:53:05 +02:00
Nikos Dragazis
77285f24c8 sstables: Add error handler in checksummed data source
Currently, the checksummed data source treats an invalid checksum or
digest as an unrecoverable error by throwing a
`malformed_sstable_exception`. This makes it impossible to use the data
source in places where reading must resume after a failed checksum check
(e.g., in `validate_checksums()`).

Make the error handling logic customizable via a callback function.
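
A minimal sketch of the producer side, with hypothetical names (not the ScyllaDB API):

```
#include <exception>
#include <functional>
#include <stdexcept>

using error_handler = std::function<void(std::exception_ptr)>;

// The default handler preserves the old behavior: the first bad chunk is
// fatal. validate_checksums() can pass a handler that records the error
// and lets the stream keep going.
inline const error_handler default_error_handler = [] (std::exception_ptr e) {
    std::rethrow_exception(e); // unrecoverable, as before
};

// Called by the data source when a chunk fails its checksum check.
inline void report_bad_chunk(const error_handler& handler) {
    handler(std::make_exception_ptr(
        // stands in for malformed_sstable_exception
        std::runtime_error("checksum mismatch")));
}
```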

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>
2024-11-25 15:55:49 +02:00
Nikos Dragazis
6e0be02165 sstables: Check for excessive chunks in checksummed data source
For uncompressed SSTables, the expected number of chunks is the number
of checksums in the CRC component. The data file must contain the same
number of chunks; otherwise, the SSTable should be considered corrupted.

Add a check in the checksummed data source to ensure that the data file
does not contain more chunks than expected. Run this check every time
the caller reads more data from the stream. The check is triggered only
when the caller attempts to read past the expected number of chunks and
more chunks are indeed available. This behavior is consistent with the
compressed data source and allows partial reads to succeed.

This check will not be triggered if an SSTable has been corrupted by
appending new data, but the new data do not overflow the last chunk.
Since the SSTable metadata only records the expected number of chunks, we
cannot know the exact expected file size at the byte level. However, this
kind of corruption will be detected by the checksum check, and by the
digest check if enabled.

In fact, the checksum check would suffice for all kinds of corruption
due to appended data except for one case: when the pre-corruption data
file was aligned at the chunk boundary, i.e., the last chunk was full.
This patch closes this gap.

Finally, note that, as a side-effect, this patch fixes a bug where we
would do an out-of-bounds read on the checksum array.

This patch is part of incorporating the functionality of
`do_validate_uncompressed()` into the checksummed data source.
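
A minimal sketch of the condition, with hypothetical names:

```
#include <cstdint>
#include <stdexcept>

// With N checksums in the CRC component the data file must contain
// exactly N chunks, so a read that starts at or past chunk N and still
// finds data means the file has grown since it was sealed.
void check_chunk_count(uint64_t next_chunk_index, uint64_t expected_chunks,
                       bool data_available) {
    // Fires only when the caller actually reads past the last expected
    // chunk, so partial reads of a valid prefix still succeed.
    if (data_available && next_chunk_index >= expected_chunks) {
        throw std::runtime_error(
            "data file has more chunks than checksums in the CRC component");
    }
}
```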

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>
2024-11-25 15:55:49 +02:00
Nikos Dragazis
5c2b1fa125 sstables: Check for premature EOF in checksummed data source
Uncompressed SSTables may have a last chunk that is smaller than the
chunk size. A premature EOF occurs when such a short chunk is encountered
while more chunks are still expected. The expected number of chunks is
the number of checksums in the CRC component.

A premature EOF can happen if the data file has been truncated.
An edge case is when the truncation happened exactly at a chunk
boundary, before the SSTable was loaded. In this case, this check
will not be triggered, because the early-return condition in `get()`
will evaluate to true (`_pos` will match `_end_pos`, which is the
actual file size). But the corruption will still be caught by the
digest check.

This patch is part of incorporating the functionality of
`do_validate_uncompressed()` into the checksummed data source.
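
A minimal sketch of the condition, with hypothetical names:

```
#include <cstdint>
#include <stdexcept>

// A chunk shorter than chunk_size is only legal as the last expected
// chunk; seeing one earlier means the data file was truncated.
void check_premature_eof(uint64_t chunk_index, uint64_t expected_chunks,
                         uint64_t chunk_len, uint64_t chunk_size) {
    if (chunk_len < chunk_size && chunk_index + 1 < expected_chunks) {
        throw std::runtime_error(
            "data file truncated: short chunk before the last expected one");
    }
}
```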

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>
2024-11-25 15:55:45 +02:00
Nikos Dragazis
2cc82f64e8 sstables: Allow data sources to disable digest check
The compressed and checksummed data sources offer digest checking as an
optional feature. It can be enabled via the boolean template parameter
`check_digest`. If enabled, the data sources calculate the actual digest
chunk-by-chunk whenever `get()` is called, and compare with the expected
digest when all data have been read. If the actual digest cannot be
calculated due to a partial read or skip, the data sources treat this
condition as an internal error.

Relax this constraint by allowing the data sources to handle digest
checks as best effort, i.e., continue to operate with digest checking
disabled if the actual digest cannot be calculated.

We will use this in later patches to enable digest checking for
compaction. Compaction can cause both partial reads and skips (e.g., in
case of cleanup compaction) and we cannot predict skips beforehand.
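
A minimal sketch of the best-effort behavior, with illustrative names (not the ScyllaDB code):

```
#include <seastar/core/iostream.hh>
#include <utility>

using namespace seastar;

class best_effort_digest_source : public data_source_impl {
    data_source _underlying;
    bool _digest_enabled = true;
public:
    explicit best_effort_digest_source(data_source ds)
        : _underlying(std::move(ds)) {}

    future<temporary_buffer<char>> get() override {
        // if (_digest_enabled) { fold the returned chunk into the digest }
        return _underlying.get();
    }

    future<temporary_buffer<char>> skip(uint64_t n) override {
        // A skip breaks the contiguous read, so the full digest can never
        // be completed; stop tracking it instead of failing.
        _digest_enabled = false;
        return _underlying.skip(n);
    }
};
```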

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>
2024-11-11 20:26:27 +02:00
Nikos Dragazis
0df1c01759 sstables: Add digest check in checksummed data source
The checksummed data source verifies the checksum of each chunk in the
data files of uncompressed SSTables. This is being leveraged by scrub
in validation mode.

Extend the data source to check the digest (full checksum) as well.
Unlike checksums, this is added as an optional feature so that SSTables
without a digest can still be validated on a per-chunk basis. To enable
this, the caller needs to set the template parameter `check_digest` to
true, and provide the expected digest.

The data source calculates the digest incrementally through multiple
get() calls and compares against the expected digest after reading the
whole file range. If there is a mismatch, it throws an exception.

Checking the digest requires reading the whole data file. If this cannot
be satisfied (e.g., due to a partial read or a skip()), the data source
fails immediately. If the user has successfully read the whole file
range, it can be safely assumed that the digest is valid.
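
A rough sketch of the incremental scheme, with illustrative names and zlib's `crc32` standing in for the actual digest function:

```
#include <zlib.h>
#include <cstdint>
#include <stdexcept>

// Each get() folds the chunk it is about to return into the running
// digest; after the whole file range has been read, the result is
// compared against the expected digest from the Digest component.
struct incremental_digest {
    uint32_t actual = static_cast<uint32_t>(crc32(0L, nullptr, 0));
    uint64_t bytes_seen = 0;

    void on_chunk(const unsigned char* data, uint32_t len) {
        actual = static_cast<uint32_t>(crc32(actual, data, len));
        bytes_seen += len;
    }

    void on_eof(uint64_t file_len, uint32_t expected) const {
        if (bytes_seen != file_len) {
            // partial read or skip(): the digest cannot be computed
            throw std::runtime_error("incomplete read, digest unchecked");
        }
        if (actual != expected) {
            throw std::runtime_error("digest mismatch");
        }
    }
};
```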

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>
2024-10-03 18:08:56 +03:00
Nikos Dragazis
b7dfba4c18 sstables: Introduce checksummed file data source implementation
Introduce a new data source implementation for uncompressed SSTables.

This is just a thin wrapper around a raw data source that also validates
the checksum of each chunk. This way we get consistent behavior for
compressed and uncompressed SSTables.
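
A minimal sketch of the idea, with illustrative names and zlib's `crc32` standing in for the actual per-chunk checksum function:

```
#include <zlib.h>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Each chunk returned by the raw source is verified against the per-chunk
// checksums loaded from the CRC component before it is handed to the
// caller.
struct chunk_verifier {
    std::vector<uint32_t> checksums; // one entry per chunk, from CRC

    void verify(std::size_t chunk_index, const unsigned char* data,
                uint32_t len) const {
        auto actual = static_cast<uint32_t>(
            crc32(crc32(0L, nullptr, 0), data, len));
        if (actual != checksums.at(chunk_index)) {
            throw std::runtime_error("checksum mismatch in chunk");
        }
    }
};
```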

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>
2024-09-11 12:26:18 +03:00