scylladb/sstables at d2ca1ebfa0da8217eaa3c2642bfa44fe2b301f06 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-01 21:55:50 +00:00

Files

History

Nikos Dragazis 6091d5d789 sstables: Fix range of input stream in checksummed file data source

The checksummed file data source uses the chunk size to enforce that the
reads from the underlying file input stream will be aligned at the chunk
boundary. This is necessary so that we can validate the checksum of each
chunk.

However, a mismatch in the numeric types caused a bug where the
underlying file input stream would read a smaller portion of the data
file than expected.

The bug is located in the following lines:

```
auto start = _beg_pos & ~(chunk_size - 1);
auto end = (_end_pos & ~(chunk_size - 1)) + chunk_size;
```

`_beg_pos` and `_end_pos` are `uint64_t`, whereas `chunk_size` is
`uint32_t`. When executing the AND operation, the compiler converts the
right operand from `uint32_t` to `uint64_t`. Since the integer is
unsigned, the four most-significant bytes are filled with zeros, thus
erroneously truncating the corresponding bytes of the position.

Fix the bug by explicitly converting the chunk size to `uint64_t` before
any arithmetic operations. Also, replace the handwritten alignment
implementations with the `align_up()` and `align_down()` helpers.

Finally, restrict the file end position to not exceed the file length.
Since the last chunk can be smaller than the chunk size, it could happen
that the end position exceeds the file length after the round-up. This
is not a bug on its own since `make_file_input_stream()` can accept
lengths that go beyond end-of-file, but still it makes the code more
error prone and should be avoided.

Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>

Closes scylladb/scylladb#21665

2024-11-28 12:53:05 +02:00

..

sstables: Add integrity option to data_consume_single_partition()

2024-11-11 20:26:27 +02:00

cross-tree: change to_sstring_view() to to_string_view()

2024-11-18 14:57:49 +02:00

binary_search.hh

…

checksum_utils.hh

…

checksummed_data_source.cc

sstables: Fix range of input stream in checksummed file data source

2024-11-28 12:53:05 +02:00

checksummed_data_source.hh

sstables: Add digest check in checksummed data source

2024-10-03 18:08:56 +03:00

CMakeLists.txt

cmake/check_headers: correct typos

2024-10-08 09:38:16 +03:00

column_translation.hh

cross-tree: change to_sstring_view() to to_string_view()

2024-11-18 14:57:49 +02:00

component_type.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

compress.cc

treewide: replace boost::find_if with std::ranges::find_if

2024-11-19 10:50:01 +08:00

compress.hh

sstables: Add digest check in compressed data source

2024-10-03 18:09:01 +03:00

consumer.hh

treewide: de-static namespace scope functions in headers

2024-10-01 14:02:50 +03:00

data_source_types.hh

sstables: Allow data sources to disable digest check

2024-11-11 20:26:27 +02:00

disk_types.hh

sstables: scylla_metadata: add ext_timestamp_stats

2024-09-10 19:05:57 +03:00

downsampling.hh

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

exceptions.hh

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

file_writer.hh

sstable: extract file_writer out

2024-07-10 23:32:47 +03:00

filter.hh

…

generation_type.hh

sstables: generation_type: replace boost ranges with std ranges

2024-11-01 12:45:24 +03:00

hyperloglog.hh

treewide: de-static namespace scope functions in headers

2024-10-01 14:02:50 +03:00

index_entry.hh

sstables: bsearch_clustered_cursor: Unify skip_info logging

2024-10-03 14:16:05 +02:00

index_reader.hh

Merge 'sstables/index_reader: avoid unnecessary index page reads in single-partition reads' from Michał Chojnowski

2024-11-04 14:28:27 +02:00

integrity_checked_file_impl.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

integrity_checked_file_impl.hh

treewide: move log.hh into utils/log.hh

2024-10-22 06:54:46 +03:00

key.hh

sstables: fix a use-after-free in key_view::explode()

2024-03-07 09:07:07 +02:00

liveness_info.hh

sstables: do not include unused headers

2024-01-09 11:45:44 +02:00

m_format_read_helpers.cc

sstables: add fmt::formatter for sstables::bound_kind_m

2024-02-23 13:55:22 +08:00

m_format_read_helpers.hh

sstables: do not include unused headers

2024-01-09 11:45:44 +02:00

metadata_collector.cc

treewide: move log.hh into utils/log.hh

2024-10-22 06:54:46 +03:00

metadata_collector.hh

sstables: scylla_metadata: add ext_timestamp_stats

2024-09-10 19:05:57 +03:00

mutation_fragment_filter.hh

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

open_info.hh

sstables: fix a typo in comment: s/Mimicks/Mimics/

2024-05-21 12:14:10 +03:00

partition_index_cache_stats.hh

…

partition_index_cache.hh

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

prepended_input_stream.cc

…

prepended_input_stream.hh

…

processing_result_generator.hh

…

progress_monitor.hh

…

promoted_index_blocks_reader.hh

sstables: bsearch_clustered_cursor: Switch parsers to work with page_view

2024-09-27 01:25:15 +02:00

random_access_reader.cc

treewide: move log.hh into utils/log.hh

2024-10-22 06:54:46 +03:00

random_access_reader.hh

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

scanning_clustered_index_cursor.hh

Merge 'sstables: Reduce amount of I/O for clustering-key-bounded reads from large partitions' from Tomasz Grabiec

2024-10-28 21:13:23 +02:00

segmented_compress_params.hh

sstables: do not include unused headers

2024-01-09 11:45:44 +02:00

shareable_components.hh

sstables: Add digest in the SSTable components

2024-10-03 18:09:05 +03:00

shared_sstable.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

sstable_directory.cc

treewide: migrate from boost::adaptors::filtered to std::views::filter

2024-11-26 14:26:50 +02:00

sstable_directory.hh

treewide: Rename sstable registry location field to be owner

2024-10-11 14:11:28 +03:00

sstable_mutation_reader.cc

cross-tree: change to_sstring_view() to to_string_view()

2024-11-18 14:57:49 +02:00

sstable_mutation_reader.hh

sstables: Add integrity option to data_consume_single_partition()

2024-11-11 20:26:27 +02:00

sstable_set_impl.hh

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

sstable_set.cc

treewide: migrate from boost::adaptors::filtered to std::views::filter

2024-11-26 14:26:50 +02:00

sstable_set.hh

sstables: Add integrity option to factories for sstable_set readers

2024-11-11 20:42:46 +02:00

sstable_version_k_l.hh

…

sstable_version_m.hh

…

sstable_version.cc

…

sstable_version.hh

…

sstable_writer.hh

…

sstables_manager.cc

Merge 'Remove all_datadirs vector of strings from table::config' from Pavel Emelyanov

2024-10-22 17:21:31 +03:00

sstables_manager.hh

sstables: Generate table::all_datadirs from db::config and storage_options

2024-10-21 15:13:27 +03:00

sstables_registry.hh

system_keyspace: Change sstables registry partition key type

2024-10-11 13:48:09 +03:00

sstables.cc

change remaining sstring_view to std::string_view

2024-11-18 16:48:57 +02:00

sstables.hh

sstables: Add integrity option to sstable::make_reader()

2024-11-11 20:40:31 +02:00

stats.hh

…

storage.cc

sstables: Open-code format_table_directory_name() moved recently

2024-10-21 15:18:19 +03:00

storage.hh

sstables: Generate table::all_datadirs from db::config and storage_options

2024-10-21 15:13:27 +03:00

types_fwd.hh

sstables: Disengage integrity_check from sstable class

2024-11-11 20:26:27 +02:00

types.hh

Merge 'sstables: Reduce amount of I/O for clustering-key-bounded reads from large partitions' from Tomasz Grabiec

2024-10-28 21:13:23 +02:00

version.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

writer_impl.hh

sstables: mx: writer: Never include partition_end marker in promoted index block width

2024-10-03 14:09:57 +02:00

writer.cc

…

writer.hh

compound_compat: replace use of boost ranges with std ranges

2024-10-30 19:58:07 +02:00