scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-07 15:33:15 +00:00

Author	SHA1	Message	Date
Benny Halevy	db5c5ca59e	database: add get_non_local_strategy_keyspaces_erms To be used for getting a coheret set of all keyspaces with non-local replication strategy and their respective effective_replication_map. As an example, use it in this patch in storage_service::update_pending_ranges. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:01 +03:00
Benny Halevy	7ee6048255	database: add get_non_local_strategy_keyspaces For node operations, we currently call get_non_system_keyspaces but really want to work on all keyspace that have non-local replication strategy as they are replicated on other nodes. Reflect that in the replica::database function name. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:01 +03:00
Benny Halevy	d8484b3ee6	storage_service: coroutinize update_pending_ranges Before we make a change in getting the keyspaces and their effective_replication_map. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:01 +03:00
Benny Halevy	e541009f65	effective_replication_map: add get_replication_strategy And use it in storage_service::get_changed_ranges_for_leaving. A following patch will pass the e_r_m to storage_service::get_changed_ranges_for_leaving, rather than getting it there. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Benny Halevy	6794e15163	effective_replication_map: get_range_addresses: use the precalculated replication_map There is no need to call get_natural_endpoints for every token in sorted_tokens order, since we can just get the precalculated per-token endpoints already in the _replication_map member. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Benny Halevy	1d4aea4441	abstract_replication_strategy: get_pending_address_ranges: prevent extra vector copies Reduce large allocations and reactor stalls seen in #11005 by open coding `get_address_ranges` and using std::vector::insert to efficiently appending the ranges returned by `get_primary_ranges_for` onto the returned token_range_vector in contrast to building an unordered_multimap<inet_address, dht::token_range> first in `get_address_ranges` and traversing it and adding one token_range at a time. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Benny Halevy	7811b0d0aa	abstract_replication_strategy: reindent	2022-08-08 17:31:00 +03:00
Benny Halevy	ebe1edc091	utils: sequenced_set: expose set and `contains` method And use that in sights using the endpoint set returned by abstract_replication_strategy::calculate_natural_endpoints. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Benny Halevy	7017ad6822	abstract_replication_strategy: calculate_natural_endpoints: return endpoint_set So it could be used also for easily searching for an endpoint. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Benny Halevy	38934413d4	utils: sequenced_set: templatize VectorType Se we can use basic_sequenced_set<T, std::small_vector<T, N>> as locator::endpoint_set. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Benny Halevy	df380c30b9	utils: sanitize sequenced_set And templatize its Vector type so it can be used with a small_vector for inet_address_vector_replica_set. Mark the methods const/noexcept as needed. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Benny Halevy	57d9275d4a	utils: sequenced_set: delete mutable get_vector method It is dangerous to use since modifying the sequenced_set vector will make it go out of sync with the associated unordered_set member, making the object unusable. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 17:31:00 +03:00
Avi Kivity	ba42852b0e	Merge 'Overhaul truncate and snapshot' from Benny Halevy This series is aimed at fixing #11132. To get there, the series untangles the functions that currently depend on the the cross-shard coordination in table::snapshot, namely database::truncate and consequently database::drop_column_family. database::get_table_on_all_shards is added here as a helper to get a foreign shared ptr of the the table shard from all shards, and it is later used by multiple functions to truncate and then take a snapshot of the sharded table. database::truncate_table_on_all_shards is defined to orchestrate the truncate process end-to-end, flushing or clearing all table shards before taking a snapshot if needed, using the newly defined table::snapshot_on_all_shards, and by that leaving only the discard_sstables job to the per-shard database::truncate function. The latter, snapshot_on_all_shards, orchestrates the snapshot process on all shards - getting rid of the per-shard table::snapshot function (after refactoring take_snapshot and finalize_snapshot out of it), and the associated dreaded data structures: snapshot_manager and pending_snapshots. Fixes #11132. Closes #11133 * github.com:scylladb/scylladb: table: reindent write_schema_as_cql table: coroutinize write_schema_as_cql table: seal_snapshot: maybe_yield when iterating over the table names table: reindent seal_snapshot table: coroutinize seal_snapshot table: delete unused snapshot_manager and pending_snapshots table: delete unused snapshot function table: snapshot_on_all_shards: orchestrate snapshot process table: snapshot: move pending_snapshots.erase from seal_snapshot table: finalize_snapshot: take the file sets as a param table: make seal_snapshot a static member table: finalize_snapshot: reindent table: refactor finalize_snapshot out of snapshot table: snapshot: keep per-shard file sets in snapshot_manager table: take_snapshot: return foreign unique ptr table: take_snapshot: maybe yield in per-sstable loop table: take_snapshot: simplify tables construction code table: take_snapshot: reindent table: take_snapshot: simplify error handling table: refactor take_snapshot out of snapshot utils: get rid of joinpoint database: get rid of timestamp_func database: truncate: snapshot table in all-shards layer database: truncate: flush table and views in all-shards layer database: truncate: stop and disable compaction in all-shards layer database: truncate: move call to set_low_replay_position_mark to all-shards layer database: truncate: enter per-shard table async_gate in all-shards layer database: truncate: move check for schema_tables keyspace to all-shards layer. database: snapshot_table_on_all_shards: reindent table: add snapshot_on_all_shards database: add snapshot_table_on_all_shards database: rename {flush,snapshot}_on_all and make static database: drop_table_on_all_shards: truncate and stop table in upper layer database: drop_table_on_all_shards: get all table shards before drop_column_family on each database: drop_column_family: define table& cf database: drop_column_family: reuse uuid for evict_all_for_table database: drop_column_family: move log message up a layer database: truncate: get rid of the unused ks param database: add truncate_table_on_all_shards database: drop_table_on_all_shards: do not accept a truncated_at timestamp_func database: truncate: get optional snapshot_name from caller database: truncate: fix assert about replay_position low_mark database_test: apply_mutation on the correct db shard	2022-08-07 19:15:42 +03:00
Benny Halevy	45ce635527	table: reindent write_schema_as_cql Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	3b2cce068a	table: coroutinize write_schema_as_cql and make sure to always close the output stream. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	dbae7807d1	table: seal_snapshot: maybe_yield when iterating over the table names Add maybe_yield calls in tight loop, potentially over thousands of sstable names to prevent reactor stalls. Although the per-sstable cost is very small, we've experienced stalls realted to printing in O(#sstables) in compaction. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	3ba0c72b77	table: reindent seal_snapshot Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	41a2d09a5d	table: coroutinize seal_snapshot Handle exceptions, making sure the output stream is properly closed in all cases, and an intermediate error, if any, is returned as the final future. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	5316dbbe78	table: delete unused snapshot_manager and pending_snapshots Now that snapshot orchestration in snapshot_on_all_shards doesn't use snapshot_manager, get rid of the data structure. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	cca9068cfb	table: delete unused snapshot function Now that snapshot orchestration is done solely in snapshot_on_all_shards, the per-shard snapshot function can be deleted. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	351a3a313d	table: snapshot_on_all_shards: orchestrate snapshot process Call take_snapshot on each shard and collect the returns snapshot_file_set. When all are done, move the vector<snapshot_file_set> to finalize_snapshot. All that without resorting to using the snapshot_manager nor calling table::snapshot. Both will deleted in the following patches. Fixes #11132 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	84dfd2cabb	table: snapshot: move pending_snapshots.erase from seal_snapshot Now that seal_snapshot doesn't need to lookup the snapshot_manager in pending_snapshots to get to the file_sets, erasing the snapshot_manager object can be done in table::snapshot which also inserted it there. This will make it easier to get rid of it in a later patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	39276cacc3	table: finalize_snapshot: take the file sets as a param and pass it to seal_snapshot, so that the latter won't need to lookup and access the snapshot_manager object. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	4dd56bbd6d	table: make seal_snapshot a static member Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	7cb0a3f6f4	table: finalize_snapshot: reindent Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	12716866a9	table: refactor finalize_snapshot out of snapshot Write schema.cql and the files manifest in finalize_snapshot. Currently call it from table::snapshot, but it will be called in a later patch by snapshot_on_all_shards. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	240f83546d	table: snapshot: keep per-shard file sets in snapshot_manager To simplify processing of the per-shard file names for generating the manifest. We only need to print them to the manifest at the end of the process, so there's no point in copying them around in the process, just move the foreign unique unordered_set. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	5100c1ba68	table: take_snapshot: return foreign unique ptr Currently copying the sstable file names are created and destroyed on each shard and are copied by the "coordinator" shards using submit_to, while the coroutine holds the source on its stack frame. To prepare for the next patches that refactor this code so that the coordinator shard will submit_to each shard to perform `take_snapshot` and return the set of sstrings in the future result, we need to wrap the result in a foreign_ptr so it gets freed on the shard that created it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	b54626ad0e	table: take_snapshot: maybe yield in per-sstable loop There could be thousands of sstables so we better cosider yielding in the tight loop that copies the sstable names into the unordered_set we return. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	24a1a4069e	table: take_snapshot: simplify tables construction code Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	75e38ebccc	table: take_snapshot: reindent Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	67c1d00f44	table: take_snapshot: simplify error handling Don't catch exception but rather just return them in the return future, as the exception is handled by the caller. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	ff6508aa53	table: refactor take_snapshot out of snapshot Do the actual snapshot-taking code in a per-shard take_snapshot function, to be called from snapshot_on_all_shards in a following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	37b7a9cce2	utils: get rid of joinpoint Now that it is no longer used. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	56f336d1aa	database: get rid of timestamp_func Pass an optional truncated_at time_point to truncate_table_on_all_shards instead of the over-complicated timestamp_func that returns the same time_point on all shards anyhow, and was only used for coordination across shards. Since now we synchronize the internal execution phase in truncate_table_on_all_shards, there is no longer need for this timestamp_func. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	b640c4fd17	database: truncate: snapshot table in all-shards layer With that the database layer does no longer need to invoke the private table::snapshot function, so it can be defriended from class table. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	af0c71aa12	database: truncate: flush table and views in all-shards layer Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	6e07e6b7ac	database: truncate: stop and disable compaction in all-shards layer Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	e78dad1dfb	database: truncate: move call to set_low_replay_position_mark to all-shards layer Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	a8bd3d97b6	database: truncate: enter per-shard table async_gate in all-shards layer Start moving the per-shard state establishment logic to truncate_table_on_all_shards, so that we would evetually do only the truncate logic per-se in the per-shard truncate function. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	ff028316f2	database: truncate: move check for schema_tables keyspace to all-shards layer. Now that the per-shard truncate function is called only from truncate_table_on_all_shards, we can reject the schema_tables keyspace in the upper layer. There's no need to check that on each shard. While at it, reuse `is_system_keyspace`. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	fbe1fa1370	database: snapshot_table_on_all_shards: reindent Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	4d4ca40c38	table: add snapshot_on_all_shards Called from the respective database entry points. Will be called also from the database drop / truncate path and will be used for central coordination of per-shard table::snapshot so we don't have to depend on the snapshot_manager mechanism that is fragile and currently causes abort if we fail to allocate it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	be56a73e78	database: add snapshot_table_on_all_shards We need to snapshot a single table in several paths. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	d96b56fee2	database: rename {flush,snapshot}_on_all and make static Follow the convention of drop_table_on_all_shards. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	a1eed1a6e9	database: drop_table_on_all_shards: truncate and stop table in upper layer truncate the table on all shards then stop it on shards in the upper layer rather than in the per-shard drop_column_family() function, so we can further refactor truncate later, flushing and taking snapshot on all shards, before truncating. With that, rename drop_column_family to detach_columng_family as now it only deregisters the column family from containers that refer to it (even via its uuid) and then its caller is reponsible to take it from there. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	92cb7d448b	database: drop_table_on_all_shards: get all table shards before drop_column_family on each Se we the upper layer can flush, snapshot, and truncate the table on all shards, step by step. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	0aaaefbb5c	database: drop_column_family: define table& cf To reduce the churn in the following patch that will pass the table& as a parameter. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	bb1e5ffb8c	database: drop_column_family: reuse uuid for evict_all_for_table cf->schema()->id() is the same one returned by find_uuid(ks_name, cf_name); As a follow up, we should define a concrete table_id type and rename schema::id() to schema::table_id() to return it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00
Benny Halevy	e800e1e720	database: drop_column_family: move log message up a layer Print once on "coordinator" shard. And promote to info level as it's important to log when we're dropping a table (and if we're going to take a snapshot). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-07 12:53:05 +03:00

1 2 3 4 5 ...

32519 Commits