scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 11:30:36 +00:00

Author	SHA1	Message	Date
Botond Dénes	f1bbf705f9	Merge 'Cleanup sstables in resharding and other compaction types' from Benny Halevy This series extends sstable cleanup to resharding and other (offstrategy, major, and regular) compaction types so to: * cleanup uploaded sstables (#11933) * cleanup staging sstables after they are moved back to the main directory and become eligible for compaction (#9559) When perform_cleanup is called, all sstables are scanned, and those that require cleanup are marked as such, and are added for tracking to table_state::cleanup_sstable_set. They are removed from that set once released by compaction. Along with that sstables set, we keep the owned_ranges_ptr used by cleanup in the table_state to allow other compaction types (offstrategy, major, or regular) to cleanup those sstables that are marked as require_cleanup and that were skipped by cleanup compaction for either being in the maintenance set (requiring offstrategy compaction) or in staging. Resharding is using a more straightforward mechanism of passing the owned token ranges when resharding uploaded sstables and using it to detect sstable that require cleanup, now done as piggybacked on resharding compaction. Closes #12422 * github.com:scylladb/scylladb: table: discard_sstables: update_sstable_cleanup_state when deleting sstables compaction_manager: compact_sstables: retrieve owned ranges if required sstables: add a printer for shared_sstable compaction_manager: keep owned_ranges_ptr in compaction_state compaction_manager: perform_cleanup: keep sstables in compaction_state::sstables_requiring_cleanup compaction: refactor compaction_state out of compaction_manager compaction: refactor compaction_fwd.hh out of compaction_descriptor.hh compaction_manager: compacting_sstable_registration: keep a ref to the compaction_state compaction_manager: refactor get_candidates compaction_manager: get_candidates: mark as const table, compaction_manager: add requires_cleanup sstable_set: add for_each_sstable_until distributed_loader: reshard: update sstable cleanup state table, compaction_manager: add update_sstable_cleanup_state compaction_manager: needs_cleanup: delete unused schema param compaction_manager: perform_cleanup: disallow empty sorted_owened_ranges distributed_loader: reshard: consider sstables for cleanup distributed_loader: process_upload_dir: pass owned_ranges_ptr to reshard distributed_loader: reshard: add optional owned_ranges_ptr param distributed_loader: reshard: get a ref to table_state distributed_loader: reshard: capture creator by ref distributed_loader: reshard: reserve num_jobs buckets compaction: move owned ranges filtering to base class compaction: move owned_ranges into descriptor	2023-04-11 14:52:29 +03:00
Botond Dénes	38c98b370f	Update tools/jmx/ submodule * tools/jmx/ 48e16998...b7ae52bc (1): > install.sh: do not fail if jre-11 is not installed	2023-04-11 14:51:31 +03:00
Kefu Chai	dcce0c96a9	create-relocatable-package.py: error out if pigz fails before this change, we don't error out even if pigz fails. but there is chance that pigz fails to create the gzip'ed relocatable tarball either due to environmental issues or some other problems, and we are not aware of this until packaging scripts like `reloc/build_rpm.sh` tries to ungzip this corrupted gzip file. in this change, if pigz's status code is not 0, the status code is printed, and create-relocatable-package.py will return 1. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13459	2023-04-11 14:29:25 +03:00
Botond Dénes	a8e59d9fb2	Merge 'Metrics relabel from file' from Amnon Heiman This series adds an option to read the relabel config from file. Most of Scylla's metrics are reported per-shard, some times they are also reported per scheduling groups or per tables. With modern hardware, this can quickly grow to a large number of metrics that overload Scylla and the collecting server. One of the main issues around metrics reduction is that many of the metrics are only helpful in certain situations. For example, Scylla monitoring only looks at a subset of the metrics. So in large deployments it would be helpful to scrap only those. An option to do that, would be to mark all dashboards related metrics with a label value, and then Prometheus will request only metrics with that label value. There are two main limitations to scrap by label values: 1. some of the metrics we want to report are in seastar, so we'll need to label them somehow (we cannot just add random labels to seastar metrics) 2. things change, new metrics are introduce and we may want them, it's not practicall to re-compile and wait for a new release whenever we want to change a label just for monitoring. It will be best to have the option to add metrics freely and choose at runtime what to report. This series make use of Seastar API to perform metrics manipulation dynamically. It includes adding, removing, and changing labels and also enable and disable metrics, and enable and disable the skip_when_empty option. After this series the configuration could be used with: ```--relabel-config-file conf.yaml``` The general logic and format follows Prometheus metrics_relabel_config configuration. Where the configuration file looks like: ``` $ cat conf.yaml relabel_configs: - source_labels: [shard] action: drop target_label: shard regex: (2) - source_labels: [shard] action: replace target_label: level replacement: $1 regex: (.3) ``` Closes #12687 github.com:scylladb/scylladb: main: Load metrics relabel config from a file if it exists Add relabel from file support.	2023-04-11 12:47:09 +03:00
Botond Dénes	dba1d36aa6	Merge 'alternator: fix isolation of concurrent modifications to tags' from Nadav Har'El Alternator's implementation of TagResource, UntagResource and UpdateTimeToLive (the latter uses tags to store the TTL configuration) was unsafe for concurrent modifications - some of these modifications may be lost. This short series fixes the bug, and also adds (in the last patch) a test that reproduces the bug and verifies that it's fixed. The cause of the incorrect isolation was that we separately read the old tags and wrote the modified tags. In this series we introduce a new function, `modify_tags()` which can do both under one lock, so concurrent tag operations are serialized and therefore isolated as expected. Fixes #6389. Closes #13150 * github.com:scylladb/scylladb: test/alternator: test concurrent TagResource / UntagResource db/tags: drop unsafe update_tags() utility function alternator: isolate concurrent modification to tags db/tags: add safe modify_tags() utility functions migration_manager: expose access to storage_proxy	2023-04-11 11:17:23 +03:00
Anna Stuchlik	2921059ebb	doc: add a disclaimer about unsupported upgrade Fixes https://github.com/scylladb/scylla-enterprise/issues/2805 This commit adds the disclaimer that an upgrade by replacing the cluster nodes with nodes with a different release is not supported. Closes #13445	2023-04-11 10:47:39 +03:00
Kefu Chai	86b66a9875	build: cmake: drop test_table.CC this change mirrors the corresponding change in `configure.py` in `4b5b6a9010` . Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13461	2023-04-11 09:42:58 +03:00
Nadav Har'El	79114c5030	cql-pytest: translate Cassandra's tests for DELETE operations This is a translation of Cassandra's CQL unit test source file validation/operations/DeleteTest.java into our cql-pytest framework. There are 51 tests, and they did not reproduce any previously-unknown bug, but did provide additional reproducers for three known issues: Refs #4244 Add support for mixing token, multi- and single-column restrictions Refs #12474 DELETE prints misleading error message suggesting ALLOW FILTERING would work Refs #13250 one-element multi-column restriction should be handled like a single-column restriction Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13436	2023-04-11 09:10:11 +03:00
Botond Dénes	355583066e	Merge 'Reduce memory footprint of SSTable index summary' from Raphael "Raph" Carvalho SSTable summary is one of the components fully loaded into memory that may have a significant footprint. This series reduces the summary footprint by reducing the amount of token information that we need to keep in memory for each summary entry. Of course, the benefit of this size optimization is proportional to the amount of summary entries, which in turn is proportional to the number of partitions in a SSTable. Therefore we can say that this optimization will benefit the most tables which have tons of small-sized partitions, which will result in big summaries. Results: ``` BEFORE [1000000 pkeys] data size: 4035888890, summary -> memory footprint: 5843232, entries: 88158 [10000000 pkeys] data size: 40368888890, summary -> memory footprint: 55787128, entries: 844925 AFTER [1000000 pkeys] data size: 4035888890, summary -> memory footprint: 4351536, entries: 88158 [10000000 pkeys] data size: 40368888890, summary -> memory footprint: 42211984, entries: 844925 ``` That shows a 25% reduction in footprint, for both 1 and 10 million pkeys. Closes #13447 * github.com:scylladb/scylladb: sstables: Store raw token into summary entries sstables: Don't store token data into summary's memory pool	2023-04-11 08:29:11 +03:00
Botond Dénes	05b381bfa2	Merge 'Simple S3 storage for sstables' from Pavel Emelyanov The PR adds sstables storage backend that keeps all component files as S3 objects and system.sstables_registry ownership table that keeps track of what sstables objects belong to local node and their names. When a keyspace is configured with 'STORAGE = { 'type': 'S3' }' the respective class table object eventually gets the storage_options instance pointing to the target S3 endpoint and bucket. All the sstables created for that table attach the S3 storage implementation that maintains components' files as S3 objects. Writing to and reading from components is handled by the S3 client facilities from utils/. Changing the sstable state, which is -- moving between normal, staging and quarantine states -- is not yet implemented, but would eventually happen by updating entries in the sstables registry. To keep track of which node owns which objects, to provide bucket-wide uniqueness of object names and to maintain sstable state the storage driver keeps records in the system.sstables_registry ownership table. The table maps sstable location and generation to the object format, version, status-state () and (!) unique identifier (some time soon this identifier is supposed to be replaced with UUID sstables generations). The component object name is thus s3://bucket/uuid/component_basename. The registry is also used on boot. The distributed loader picks up sstables from all the tables found in schema and for S3-backed keyspaces it lists entries in the registry to a) identify those and b) get their unique S3-side identifiers to open by name. () About sstable's status and state. The state field is the part of today's sstable path on disk -- staging, quarantine, normal (root table data dir), etc. Since S3 doesn't have the renaming facility, moving sstable between those states is only possible by updating the entry in the registry. This is not yet implemented in this set (#13017) The status field tracks sstable' transition through its creation-deletion. It first starts with 'creating' status which corresponds to the today's TemporaryTOC file. After being created and written to the sstable moves into 'sealed' state which corresponds to the today's normal sstable being with the TOC file. To delete sstable atomically it first moves into 'removing' state which is equivalent to being in the deletion-log for the on-disk sstable. Once removed from the bucket, the entry is removed from the registry. To play with: 1. Start minio (installed by install-dependencies.sh) ``` export MINIO_ROOT_USER=${root_user} export MINIO_ROOT_PASSWORD=${root_pass} mkdir -p ${root_directory} minio server ${root_directory} ``` 2. Configure minio CLI, create anonymous bucket ``` mc config host rm local mc config host add local http://127.0.0.1:9000 ${root_user} ${root_pass} mc mb local/sstables mc anonymous set public local/sstables ``` 3. Start Scylla with object-storage feature enabled ``` scylla ... --experimental-features=keyspace-storage-options --workdir ${as_usual}``` 4. Create KS with S3 storage ``` create keyspace ... storage = { 'type': 'S3', 'endpoint': '127.0.0.1:9000', 'bucket': 'sstables' };``` The S3 client has a logger named "s3", it's useful to use on with `trace` verbosity. Closes #12523 * github.com:scylladb/scylladb: test: Add object-storage test distributed_loader: Print storage type when populating sstable_directory: Add ownership table components lister sstable_directory: Make components_lister and API sstable_directory: Create components lister based on storage options sstables: Add S3 storage implementation system_keyspace: Add ownership table system_keyspace: Plug to user sstables manager too sstable: Make storage instance based on storage options sstable_directory: Keep storage_options aboard sstable: Virtualize the helper that gets on-disk stats for sstable sstable, storage: Virtualize data sink making for small components sstable, storage: Virtualize data sink making for Data and Index sstable/writer: Shuffle writer::init_file_writers() sstable: Make storage an API utils: Add S3 readable file impl for random reads utils: Add S3 data sink for multipart upload utils: Add S3 client with basic ops cql-pytest: Add option to run scylla over stable directory test.py: Equip it with minio server sstables: Detach write_toc() helper	2023-04-11 08:17:25 +03:00
Benny Halevy	96660b2ef7	table: discard_sstables: update_sstable_cleanup_state when deleting sstables We need to remove the deleted sstables from update_sstable_cleanup_state otherwise their data and index files will remain opened and their storage space won't be reclaimed. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:37:56 +03:00
Benny Halevy	4db961ecac	compaction_manager: compact_sstables: retrieve owned ranges if required If any of the sstables to-be-compacted requires cleanup, retrive the owned_ranges_ptr from the table_state. With that, staging sstables will eventually be cleaned up via regular compaction. Refs #9559 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:36:10 +03:00
Benny Halevy	9105f9800c	sstables: add a printer for shared_sstable Refactor the printing logic in compaction::formatted_sstables_list out to sstables::to_string(const shared_sstable&, bool include_origin) and operator<<(const shared_sstable) on top of it. So that we can easily print std::vector<shared_sstable> from compaction_manager in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:31:35 +03:00
Benny Halevy	d87925d9fc	compaction_manager: keep owned_ranges_ptr in compaction_state When perform_cleanup adds sstables to sstables_requiring_cleanup, also save the owned_ranges_ptr in the compaction_state so it could be used by other compaction types like regular, reshape, or major compaction. When the exhausted sstables are released, check if sstables_requiring_cleanup is empty, and if it is, clear also the owned_ranges_ptr. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:30:53 +03:00
Benny Halevy	c2bf0e0b72	compaction_manager: perform_cleanup: keep sstables in compaction_state::sstables_requiring_cleanup As a first step towards parallel cleanup by (regular) compaction and cleanup compaction, filter all sstables in perform_cleanup and keep the set of sstables in the compaction_state. Erase from that set when the sstables are unregistered from compaction. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:30:39 +03:00
Benny Halevy	b3192b9f16	compaction: refactor compaction_state out of compaction_manager To use it both from compaction_manager and compaction_descriptor in a following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:28:16 +03:00
Benny Halevy	73280c0a15	compaction: refactor compaction_fwd.hh out of compaction_descriptor.hh So it can be used in the next patch that will refactor compaction_state out of class compaction_manager. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:19:04 +03:00
Benny Halevy	690697961c	compaction_manager: compacting_sstable_registration: keep a ref to the compaction_state To be used for managing sstables requiring cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:18:02 +03:00
Benny Halevy	cac60a09ac	compaction_manager: refactor get_candidates Allow getting candidates for compaction from an arbitrary range of sstable, not only the in_strategy_sstables. To be used by perform_cleanup to mark all sstables that require cleanup, even if they can't be compacted at this time. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:16:57 +03:00
Benny Halevy	bbfe839a73	compaction_manager: get_candidates: mark as const Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:16:12 +03:00
Benny Halevy	6ebafe74b9	table, compaction_manager: add requires_cleanup Returns true iff any of the sstables in the set requries cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:14:36 +03:00
Benny Halevy	d765686491	sstable_set: add for_each_sstable_until Calls a function on all sstables or until the function returns stop_iteration::yes. Change the sstable_set_impl interface to expose only for_each_sstable_until and let sstable_set::for_each_sstable use that, wrapping the void-returning function passed to it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:11:58 +03:00
Benny Halevy	db7fa9f3be	distributed_loader: reshard: update sstable cleanup state Since the sstables are loaded from foreign open info we should mark them for cleanup if needed (and owned_ranges_ptr is provided). This will allow a later patch to enable filtering for cleanup only for sstable sets containing sstables that require cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:11:00 +03:00
Benny Halevy	d0690b64c1	table, compaction_manager: add update_sstable_cleanup_state update_sstable_cleanup_state calls needs_cleanup and inserts (or erases) the sstable into the respective compaction_state.sstables_requiring_cleanup set. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:10:55 +03:00
Benny Halevy	1baca96de1	compaction_manager: needs_cleanup: delete unused schema param It isn't needed. The sstable already has a schema. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:03:53 +03:00
Benny Halevy	ac9f8486ba	compaction_manager: perform_cleanup: disallow empty sorted_owened_ranges I'm not sure why this was originally supported, maybe for upgrade sstables where we may want to rewrite the sstables without filtering any tokens, but perform_sstable_upgrade is now following a different code path and uses `rewrite_sstables` directly, without pigybacking on cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:03:03 +03:00
Benny Halevy	ecbd112979	distributed_loader: reshard: consider sstables for cleanup When called from `process_upload_dir` we pass a list of owned tokens to `reshard`. When they are available, run resharding, with implicit cleanup, also on unshared sstables that need cleanup. Fixes #11933 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:01:38 +03:00
Benny Halevy	3ccbb28f2a	distributed_loader: process_upload_dir: pass owned_ranges_ptr to reshard To facilitate implicit cleanup of sstables via resharding. Refs #11933 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:59:38 +03:00
Benny Halevy	aa4b18f8fb	distributed_loader: reshard: add optional owned_ranges_ptr param For passing owned_ranges_ptr from distributed_loader::process_upload_dir. Refs #11933 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:57:41 +03:00
Benny Halevy	f540af930b	distributed_loader: reshard: get a ref to table_state We don't reference the table itself, only as_table_state. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:57:11 +03:00
Benny Halevy	c6b7fcc26f	distributed_loader: reshard: capture creator by ref Now that reshard is a coroutine, creator is preserved in the coroutine frame until completion so we can simply capture it by reference now. Note that previously it was moved into the compaction descriptor, but the capture wasn't mutable so it was copied anyhow and this change doesn't introduced a regression. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:56:07 +03:00
Benny Halevy	7c9d16ff96	distributed_loader: reshard: reserve num_jobs buckets We know in advance how many buckets we need. We still need to emplace the first bucket upfront. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:55:35 +03:00
Benny Halevy	0c6ce5af74	compaction: move owned ranges filtering to base class Move the token filtering logic down from cleanup_compaction to regular_compaction and class compaction so it can be reused by other compaction types. Create a _owned_ranges_checker in class compaction when _owned_ranges is engaged, and use it in compaction::setup to filter partitions based on the owned ranges. Ref scylladb/scylladb#12998 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:55:09 +03:00
Benny Halevy	09df04c919	compaction: move owned_ranges into descriptor Move the owned_ranges_ptr, currently used only by cleanup and upgrade compactions, to the generic compaction descriptor so we apply cleanup in other compaction types. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 22:52:12 +03:00
Pavel Emelyanov	fd817e199c	Merge 'auth: replace operator<<(..) with fmt formatter' from Kefu Chai this is a part of a series to migrating from `operator<<(ostream&, ..)` based formatting to fmtlib based formatting. the goal here is to enable fmtlib to print `auth::auth_authentication_options` and `auth::resource_kind` without the help of fmt::ostream. and their `operator<<(ostream,..)` are dropped, as there are no users of them anymore. Refs #13245 Closes #13460 * github.com:scylladb/scylladb: auth: remove unused operator<<(.., resource_kind) auth: specialize fmt::formatter<resource_kind> auth: remove unused operator<<(.., authentication_option) auth: specialize fmt::formatter<authentication_option>	2023-04-10 17:05:09 +03:00
Pavel Emelyanov	21ef5bcc22	test: Add object-storage test The test does - starts scylla (over stable directory - creates S3-backed keyspace (minio is up and running by test.py already) - creates table in that keyspace and populates it with several rows - flushes the keyspace to make sstables hit the storage - checks that the ownership table is populated properly - restarts scylla - makes sure old entries exist Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:44:29 +03:00
Pavel Emelyanov	8b9e9671de	distributed_loader: Print storage type when populating On boot it's very useful to know which storage a table comes from, so add the respective info to existing log messages. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:44:29 +03:00
Pavel Emelyanov	f04c6cdf9a	sstable_directory: Add ownership table components lister When sstables are stored on object storage, they are "registered" in the system.sstables_registry ownership table. The sstable_directory is supposed to list sstables from this table, so here's the respective components lister. The lister is created by sstables_manager, by the time it's requested from the the system keyspace is already plugged. The lister only handles "sealed" sstables. Dangling ones are still ignored, this is to be fixed later. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:44:29 +03:00
Pavel Emelyanov	8bd9f7accf	sstable_directory: Make components_lister and API Now the lister is filesystem-specific. There will soon come another one for S3, so the sstable_directory should be prepared for that by making the lister an abstract class. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:44:29 +03:00
Pavel Emelyanov	5f7f0117e1	sstable_directory: Create components lister based on storage options The directory's lister is storage-specific and should be created differently for different storage options. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:44:29 +03:00
Pavel Emelyanov	950ee0efe8	sstables: Add S3 storage implementation The driver puts all componenets into s3://bucket/uuid/component_name objects where 'bucket' is the keyspace options configuration parameter, and the 'uuid' is the value obtained from the ownership table. E.g. s3://test_bucket/d0a743b0-ad38-11ed-85b5-39b6b0998182/Data.db The life-time is straightforward. Until sealed, the sstable has 'creating' status in the table, then it's updated to be 'sealed'. Prior to removing the objects the status is set to 'deleting' thus allowing the distributed loader to pick up the dangling objects un re-load (not yet implemented). Finally, the entry is deleted from the table. It needs the PR #12648 not to generate empty ks/cf directories on the local filesystem. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:44:29 +03:00
Pavel Emelyanov	08e9046d07	system_keyspace: Add ownership table The schema is CREATE TABLE system.sstables ( location text, generation bigint, format text, status text, uuid uuid, version text, PRIMARY KEY (location, generation) ) A sample entry looks like: location \| generation \| format \| status \| uuid \| version ---------------------------------------------------------------------+------------+--------+--------+--------------------------------------+--------- /data/object_storage_ks/test_table-d096a1e0ad3811ed85b539b6b0998182 \| 2 \| big \| sealed \| d0a743b0-ad38-11ed-85b5-39b6b0998182 \| me The uuid field points to the "folder" on the storage where the sstable components are. Like this: s3 `- test_bucket `- f7548f00-a64d-11ed-865a-0c1fbc116bb3 `- Data.db - Index.db - Filter.db - ... It's not very nice that the whole /var/lib/... path is in fact used as location, it needs the PR #12707 to fix this place. Also, the "status" part is not yet fully functional, it only supports three options: - creating -- the same as TemporaryTOC file exists on disk - sealed -- default state - deleting -- the analogy for the deletion log on disk The latter needs support from the distributed_loader, which's not yet there. In fact, distributes_loader also needs to be patched to actualy select entries from this table on load. Also it needs the mentioned PR #12707 to support staging and quarantine sstables. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:44:28 +03:00
Pavel Emelyanov	e34b86dd61	system_keyspace: Plug to user sstables manager too The sharded<sys_ks> instances are plugged to large data handler and compaction manager to maintain the circular dependency between these components via the interposing database instance. Do the same for user sstables manager, because S3 driver will need to update the local ownership table. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Pavel Emelyanov	4bb885b759	sstable: Make storage instance based on storage options This patch adds storage options lw-ptr to sstables_manager::make_sstable and makes the storage instance creation depend on the options. For local it just creates the filesystem storage instance, for S3 -- throws, but next patch will fix that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Pavel Emelyanov	df026e2cb5	sstable_directory: Keep storage_options aboard The class in question will need to know the table's storage it will need to list sstables from. For that -- construct it with the storage options taken from table. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Pavel Emelyanov	c060f3a52f	sstable: Virtualize the helper that gets on-disk stats for sstable When opening an existing (or just sealed) sstable its components are stat()-ed to get the on-disk sizes and a bit more. Stat-ing a file by name on S3 is not (yet) implemented and doing it file-by-file can be quite terrible. So add a method to return sstable stats in a storage-specific manner. For S3 this can be implemented by getting the info from the ownership table (in the future). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Pavel Emelyanov	0ddd27cb29	sstable, storage: Virtualize data sink making for small components This time sstable needs to create a data sink for a component without having the file at hand. That's pretty much the same as in previous patch, but the mathod declaration differs slightly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Pavel Emelyanov	ac1e56c9d9	sstable, storage: Virtualize data sink making for Data and Index Add the make_data_or_index_sink() virtual method and its implementation for filesystem_storage. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Pavel Emelyanov	1d4fcce5dd	sstable/writer: Shuffle writer::init_file_writers() The method needs to create two data sinks -- for Data and for Index files -- and then wrap it with more stuff (compression, checksums, streams, etc.). With S3 backend using file-output-stream won't work, becase S3 storage cannot provide writable file API (it has data_sink instead). This patch extracts file_data_sink creation so that it could be virtualized with storage API later. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Pavel Emelyanov	525a261a4e	sstable: Make storage an API Currently sstable carries a filesystem_storage instance on board. Next patches will make it possible to use some other storage with different data accessing methods. This patch makes sstable carry abstract storage interface and make the existing filesystem_storage implement it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00

1 2 3 4 5 ...

36122 Commits