Mirror of https://github.com/scylladb/scylladb.git, synced 2026-04-28 04:06:59 +00:00
Before this change, `checksummed_file_data_sink_impl` just inherited `data_sink_impl::flush()` from its parent class. For a wrapper around the underlying `_out` data_sink, this is not only an unusual design decision in a layered I/O system, it can also be problematic.

To be more specific, the typical user of `data_sink_impl` is a `data_sink`, whose `flush()` member function is called when the user of `data_sink` wants to ensure that the data sent to the sink is pushed to the underlying storage / channel. In general this works, as the typical user of `data_sink` is in turn `output_stream`, which calls `data_sink.flush()` before closing the `data_sink` with `data_sink.close()`, and the operating system eventually flushes the data after the application closes the corresponding fd. Moreover, almost none of the popular local filesystems implement the file_operations.op, so it is safe even if the `output_stream` does not flush the underlying data_sink after writing to it. This is the use case when we write sstables stored on a local filesystem.

But, as explained above, if the data_sink is backed by a network filesystem, a layered filesystem, or storage connected via a buffered network device, then it is crucial to flush in a timely manner; otherwise we risk data loss if the application / machine / network fails while the data is considered persisted but is _not_!

The `data_sink` returned by `client::make_upload_jumbo_sink` is a little different. Multipart upload is used under the hood, and we have to finalize the upload once all the parts are uploaded, by calling `close()`. But if the caller fails, or chooses to close the sink before flushing it, the upload is aborted and the partially uploaded parts are deleted. The default-implemented `checksummed_file_data_sink_impl::flush()` therefore breaks `upload_jumbo_sink`, which is the `_out` data_sink wrapped by `checksummed_file_data_sink_impl`.
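The flush-before-close contract described above can be sketched with a minimal, Seastar-free model. Everything here is an illustrative stand-in (the real `upload_jumbo_sink` is asynchronous and talks to S3), but it captures the key behavior: close() without a prior flush() aborts the upload.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a multipart-upload sink: close() only
// finalizes the upload if flush() was called first; otherwise it
// aborts and drops the partially uploaded parts.
struct upload_sink {
    std::vector<std::string> parts;
    bool finalized = false;
    bool aborted = false;

    void put(std::string buf) {
        parts.push_back(std::move(buf));
    }
    void flush() {
        // Models finalizing the multipart upload.
        finalized = true;
    }
    void close() {
        if (!finalized) {
            // The flush never reached us: abort and delete the parts.
            aborted = true;
            parts.clear();
        }
    }
};
```

Calling `put(); flush(); close()` leaves the sink finalized with its parts intact, while `put(); close()` aborts and empties it, which is exactly the failure mode the buggy wrapper triggered.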
Since the `flush()` calls are short-circuited by the wrapper, the `close()` call always aborts the upload. That is why the data and index components fail to upload with the S3 backend. In this change, we simply delegate the `flush()` call to the wrapped sink.

Fixes #15079

Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>

Closes #15134
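In the same simplified, Seastar-free model, the fix amounts to a one-line override that forwards flush() to the wrapped sink instead of inheriting the no-op default. Only the class names `checksummed_file_data_sink_impl` / `upload_jumbo_sink` come from the real code; these synchronous stand-ins are assumptions for illustration.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Minimal stand-in for the data_sink_impl interface.
struct data_sink_impl {
    virtual ~data_sink_impl() = default;
    virtual void put(std::string buf) = 0;
    virtual void flush() {}  // default no-op: the origin of the bug
    virtual void close() = 0;
};

// Stand-in for upload_jumbo_sink: must see flush() before close(),
// or the (modeled) multipart upload is aborted.
struct inner_upload_sink : data_sink_impl {
    bool finalized = false;
    bool aborted = false;
    void put(std::string) override {}
    void flush() override { finalized = true; }
    void close() override { if (!finalized) { aborted = true; } }
};

// Wrapper modeling checksummed_file_data_sink_impl.
struct checksumming_sink : data_sink_impl {
    std::unique_ptr<data_sink_impl> _out;
    explicit checksumming_sink(std::unique_ptr<data_sink_impl> out)
        : _out(std::move(out)) {}

    void put(std::string buf) override {
        // (checksum update elided) then forward the data downstream.
        _out->put(std::move(buf));
    }
    // The fix: delegate flush() to the wrapped sink rather than
    // inheriting the no-op data_sink_impl::flush().
    void flush() override { _out->flush(); }
    void close() override { _out->close(); }
};
```

With the override in place, `output_stream`'s flush-then-close sequence reaches the inner sink, so the upload is finalized instead of aborted; removing the `flush()` override reproduces the short-circuit described above.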