Check that the replica returns empty pages as expected, when a large
tombstone prefix/span is present. Large = larger than the configured
query_tombstone_limit (using a tiny value of 10 in the test to avoid
having to write many tombstones).
The query result writer now counts tombstones and cuts the page (marking
it as a short one) when the tombstone limit is reached. This is to avoid
timing out on large spans of tombstones, especially prefixes.
In the case of unpaged queries, we fail the read instead, similarly to
how we do with max result size.
If the limit is 0, the previous behaviour is used: tombstones are not
taken into consideration at all.
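The page-cutting rule described above can be sketched roughly like this (the names and types are illustrative only, not Scylla's actual result-writer API):

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical sketch: a page builder that counts tombstones and cuts the
// page short once the configured limit is reached. A limit of 0 disables
// the check entirely (the previous behaviour).
struct page_builder {
    uint64_t tombstone_limit; // 0 means unlimited
    uint64_t tombstones_seen = 0;
    bool short_page = false;
    std::vector<int> rows; // stand-in for real row data

    // Returns false once the page has been cut.
    bool accept_tombstone() {
        ++tombstones_seen;
        if (tombstone_limit && tombstones_seen >= tombstone_limit) {
            short_page = true; // marked short so the pager keeps going
        }
        return !short_page;
    }

    bool accept_row(int row) {
        if (short_page) {
            return false;
        }
        rows.push_back(row);
        return true;
    }
};
```

Note that a page cut this way can be entirely empty yet still marked short, which is exactly why the pager must not treat an empty page as end-of-query.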
The pager currently assumes that an empty page means the query is
exhausted. Lift this assumption, as we will soon have empty short
pages.
Paging with filtering also needs to use the replica-provided
last-position when the page is empty.
We expect each replica to stop at exactly the same position when the
digests match. Soon however, if replicas have a lot of tombstones, some
may stop earlier than the others. As long as all digests match, this is
fine, but we need to make sure we continue from the smallest such
position on the next page.
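The "continue from the smallest position" rule can be sketched as follows (positions are simplified to plain integers and the function name is made up for illustration):

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Illustrative sketch: when the digests match but replicas stopped at
// different positions (some hit the tombstone limit earlier), the
// coordinator resumes the next page from the smallest reported
// last-position, guaranteeing no row is skipped.
using position = int;

position next_page_start(const std::vector<position>& replica_last_positions) {
    // All digests matched, so each replica's data is valid up to its own
    // stop position; the minimum is the safe point to resume from.
    return *std::min_element(replica_last_positions.begin(),
                             replica_last_positions.end());
}
```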
We want to transmit the last position as determined by the replica on
both result and digest reads. Result reads already do that via the
query::result, but digest reads don't yet as they don't return the full
query::result structure, just the digest field from it. Add the last
position to the digest read's return value and collect these in the
digest resolver, along with the returned digests.
When merging multiple query-results, we use the last-position of the
last result in the combined one as the combined result's last position.
This only works however if said last result was included fully.
Otherwise we have to discard the last-position included with the result
and the pager will use the position of the last row in the combined
result as the last position.
The commit introducing the above logic mistakenly discarded the last
position when the result is a short read or a page is not full. This is
not necessary and even harmful as it can result in an empty combined
result being delivered to the pager, without a last-position.
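The corrected merge rule might be sketched like this (types are heavily simplified and the names are invented for the example):

```cpp
#include <cassert>
#include <optional>
#include <vector>

// Illustrative sketch of the merge rule described above: keep the last
// result's last-position only when that result was included in full;
// otherwise discard it and let the pager fall back to the position of
// the last row in the combined result.
struct partial_result {
    std::optional<int> last_position; // replica-provided
    bool included_fully;
};

std::optional<int> merged_last_position(const std::vector<partial_result>& parts) {
    if (parts.empty()) {
        return std::nullopt;
    }
    const auto& last = parts.back();
    // Note: a short read or a non-full page does NOT by itself invalidate
    // the last-position; only partial inclusion during merging does.
    if (last.included_fully) {
        return last.last_position;
    }
    return std::nullopt; // pager uses the last row of the combined result
}
```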
To be used by coordinator-side code to determine the correct tombstone
limit to pass to read-command (tombstone limit field added in the next
commit). When this limit is non-zero, the replica will start cutting
pages after the tombstone limit is surpassed.
This getter works similarly to `get_max_result_size()`: if the cluster
feature for empty replica pages is set, it will return the value
configured via db::config::query_tombstone_limit. System queries always
use a limit of 0 (unlimited tombstones).
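A rough sketch of the getter's decision logic, under the behaviour stated above (the function and parameter names are hypothetical, not the actual signature):

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical sketch: the configured limit only takes effect once the
// cluster feature for empty replica pages is enabled, and system queries
// always run with a limit of 0 (unlimited tombstones).
uint64_t get_query_tombstone_limit(bool empty_replica_pages_feature,
                                   bool is_system_query,
                                   uint64_t configured_limit) {
    if (is_system_query || !empty_replica_pages_feature) {
        return 0; // 0 = unlimited tombstones
    }
    return configured_limit;
}
```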
This will be the value used to break pages after processing the
specified number of tombstones. The page will be cut even if empty.
We could maybe use the already existing tombstone_{warn,fail}_threshold
instead, treating them as a soft/hard limit pair, like we did with page
sizes.
It is not type safe: it has multiple limits passed to it as raw ints, as
well as other types that ints implicitly convert to. Furthermore, the
row limit is passed in two separate fields (lower 32 bits and upper 32
bits). All this makes the constructor a minefield for humans to use. We
have had a safer constructor for some time, but some users of the old
one remain. Move them to the safe one.
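A minimal sketch of the strong-typing idea (the wrapper and struct names below are invented for illustration; they are not Scylla's actual types):

```cpp
#include <cassert>
#include <cstdint>

// Illustrative sketch of why raw-int parameters are a minefield and how
// strong types fix it: with explicit single-argument constructors, an
// argument-order mistake no longer compiles.
struct row_limit {
    uint64_t value;
    explicit row_limit(uint64_t v) : value(v) {}
};
struct partition_limit {
    uint64_t value;
    explicit partition_limit(uint64_t v) : value(v) {}
};

struct read_command_sketch {
    uint64_t rows;
    uint64_t partitions;
    // The safe constructor: each limit arrives as its own type, and the
    // 64-bit row limit is one field, not two 32-bit halves.
    read_command_sketch(row_limit r, partition_limit p)
        : rows(r.value), partitions(p.value) {}
};
```

Calling `read_command_sketch(partition_limit(10), row_limit(100))` with the arguments swapped is a compile error, which is the whole point of the safe constructor.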
When starting the `Build` job, the `x86` and `arm` builds may start on
different dates, causing the whole process to fail.
As suggested by @avikivity, add a date-stamp parameter and pass it
through to downstream jobs so we get one release for each job.
Ref: scylladb/scylla-pkg#3008
Closes #11234
We would like to define more distinct types
that are currently defined as aliases to utils::UUID to identify
resources in the system, like table id and schema
version id.
As with counter_id, the motivation is to restrict
the usage of the distinct types so they can be used
(assigned, compared, etc.) only with objects of
the same type. Using them with a generic UUID will
then require explicit conversion, which we want to
expose.
This series starts with cleaning up the idl header definition
by adding support for `import` and `include` statements in the idl-compiler.
These allow the idl header to become self-sufficient,
so we can then remove manually-added includes from source files.
The latter usually need only the top level idl header
and it, in turn, should include other headers if it depends on them.
Then, a UUID_class template was defined as shared boilerplate
for the various uuid-like classes. First, we convert counter_id to use
it, rather than mimicking utils::UUID on its own.
On top of utils::UUID_class<T>, we define table_id,
table_schema_version, and query_id.
Following up on this series, we should define more commonly used
types like: host_id, streaming_plan_id, paxos_ballot_id.
Fixes #11207
Closes #11220
* github.com:scylladb/scylladb:
query-request, everywhere: define and use query_id as a strong type
schema, everywhere: define and use table_schema_version as a strong type
schema, everywhere: define and use table_id as a strong type
schema: include schema_fwd.hh in schema.hh
system_keyspace: get_truncation_record: delete unused lambda capture
utils: uuid: define appending_hash<utils::tagged_uuid<Tag>>
utils: tagged_uuid: rename to_uuid() to uuid()
counters: counter_id: use base class create_random_id
counters: base counter_id on utils::tagged_uuid
utils: tagged_uuid: mark functions noexcept
utils: tagged_uuid: bool: reuse uuid::bool operator
raft: migrate tagged_id definition to utils::tagged_uuid
utils: uuid: mark functions noexcept
counters: counter_id delete requirement for triviality
utils: bit_cast: require TriviallyCopyable To
repair: delete unused include of utils/bit_cast.hh
bit_cast: use std::bit_cast
idl: make idl headers self-sufficient
db: hints: sync_point: do not include idl definition file
db/per_partition_rate_limit: tidy up headers self-sufficiency
idl-compiler: include serialization impl and visitors in generated dist.impl.hh files
idl-compiler: add include statements
idl_test: add a struct depending on UUID
Define table_schema_version as a distinct tagged_uuid class,
so it can be differentiated from other uuid-class types,
in particular table_id.
Added reversed(table_schema_version) for convenience
and uniformity, since the same logic is currently open-coded
in several places.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Define table_id as a distinct utils::tagged_uuid modeled after raft
tagged_id, so it can be differentiated from other uuid-class types,
in particular from table_schema_version.
Fixes#11207
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Rather than defining its own generate_random, use the base class's
create_random_id, and use it accordingly in unit tests.
(It was inherited from raft::internal::tagged_id.)
This allows us to shorten counter_id's definition
to just using utils::tagged_uuid<struct counter_id_tag>.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Use the common base class for uuid-based types.
tagged_uuid::to_uuid is defined here for backward
compatibility, but it will be renamed to uuid() in the
next patch.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
So it can be used for other types in the system outside
of raft, like counter_id, table_id, table_schema_version,
and more.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
This stemmed from an overly strict requirement in utils/bit_cast.
Now that it has been relaxed, there is no need for this static assert,
as counter_id is trivially copyable, and that is checked
by bit_cast {read,write}_unaligned.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Now that scylla requires C++20, there's no
need to define our own implementation in utils/bit_cast.hh.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Add include statements to satisfy dependencies.
Delete, now unneeded, include directives from the upper level
source files.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
idl definition files are not intended for direct
inclusion in .cc files.
The data types they represent are supposed to be defined
in a regular C++ header, so define them in db/hints/sync_point.hh
and include it rather than idl/hinted_handoff.idl.hh.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
They are generally required by the serialization implementation.
This will simplify using them, without having to hand-pick
which headers to include in the .cc files that include them.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
For generating #include directives in the generated files,
so we don't have to hand-craft includes for the dependencies
in the right order.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
This series is aimed at fixing #11132.
To get there, the series untangles the functions that currently depend on the cross-shard coordination in table::snapshot,
namely database::truncate and, consequently, database::drop_column_family.
database::get_table_on_all_shards is added here as a helper to get a foreign shared ptr to the table shard from all shards,
and it is later used by multiple functions to truncate and then take a snapshot of the sharded table.
database::truncate_table_on_all_shards is defined to orchestrate the truncate process end-to-end: flushing or clearing all table shards before taking a snapshot if needed, using the newly defined table::snapshot_on_all_shards, thereby leaving only the discard_sstables job to the per-shard database::truncate function.
The latter, snapshot_on_all_shards, orchestrates the snapshot process on all shards, getting rid of the per-shard table::snapshot function (after refactoring take_snapshot and finalize_snapshot out of it) and the associated dreaded data structures: snapshot_manager and pending_snapshots.
Fixes#11132.
Closes #11133
* github.com:scylladb/scylladb:
table: reindent write_schema_as_cql
table: coroutinize write_schema_as_cql
table: seal_snapshot: maybe_yield when iterating over the table names
table: reindent seal_snapshot
table: coroutinize seal_snapshot
table: delete unused snapshot_manager and pending_snapshots
table: delete unused snapshot function
table: snapshot_on_all_shards: orchestrate snapshot process
table: snapshot: move pending_snapshots.erase from seal_snapshot
table: finalize_snapshot: take the file sets as a param
table: make seal_snapshot a static member
table: finalize_snapshot: reindent
table: refactor finalize_snapshot out of snapshot
table: snapshot: keep per-shard file sets in snapshot_manager
table: take_snapshot: return foreign unique ptr
table: take_snapshot: maybe yield in per-sstable loop
table: take_snapshot: simplify tables construction code
table: take_snapshot: reindent
table: take_snapshot: simplify error handling
table: refactor take_snapshot out of snapshot
utils: get rid of joinpoint
database: get rid of timestamp_func
database: truncate: snapshot table in all-shards layer
database: truncate: flush table and views in all-shards layer
database: truncate: stop and disable compaction in all-shards layer
database: truncate: move call to set_low_replay_position_mark to all-shards layer
database: truncate: enter per-shard table async_gate in all-shards layer
database: truncate: move check for schema_tables keyspace to all-shards layer.
database: snapshot_table_on_all_shards: reindent
table: add snapshot_on_all_shards
database: add snapshot_table_on_all_shards
database: rename {flush,snapshot}_on_all and make static
database: drop_table_on_all_shards: truncate and stop table in upper layer
database: drop_table_on_all_shards: get all table shards before drop_column_family on each
database: drop_column_family: define table& cf
database: drop_column_family: reuse uuid for evict_all_for_table
database: drop_column_family: move log message up a layer
database: truncate: get rid of the unused ks param
database: add truncate_table_on_all_shards
database: drop_table_on_all_shards: do not accept a truncated_at timestamp_func
database: truncate: get optional snapshot_name from caller
database: truncate: fix assert about replay_position low_mark
database_test: apply_mutation on the correct db shard
Add maybe_yield calls in a tight loop, potentially
over thousands of sstable names, to prevent reactor stalls.
Although the per-sstable cost is very small, we've experienced
stalls related to printing in O(#sstables) in compaction.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
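The yield-every-so-often pattern can be sketched without Seastar like this (in the real code a maybe_yield helper returns control to the reactor when preemption is needed; this stand-in merely counts would-be yields, and all names are invented):

```cpp
#include <cassert>
#include <cstddef>

// Illustrative sketch: a counter that requests a yield roughly every N
// iterations of a tight loop, so no single loop run monopolizes the
// reactor for O(#sstables) work.
struct yield_counter {
    size_t every;            // yield roughly every N iterations
    size_t since_yield = 0;
    size_t yields = 0;

    void maybe_yield() {
        if (++since_yield >= every) {
            since_yield = 0;
            ++yields; // in Seastar this would suspend back to the reactor
        }
    }
};
```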
Handle exceptions, making sure the output
stream is properly closed in all cases,
and an intermediate error, if any, is returned as the
final future.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
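The close-on-all-paths shape can be sketched with plain exceptions instead of futures (all names here are invented; in the real code the intermediate error travels as an exceptional future):

```cpp
#include <cassert>
#include <exception>
#include <stdexcept>

// Stand-in for the output stream being written to.
struct fake_stream {
    bool closed = false;
    void close() { closed = true; }
};

// Illustrative sketch: the stream is closed on every path, and an
// intermediate error, if any, is rethrown only after the close.
bool write_all(fake_stream& out, bool fail) {
    std::exception_ptr err;
    try {
        if (fail) {
            throw std::runtime_error("write failed");
        }
    } catch (...) {
        err = std::current_exception();
    }
    out.close(); // runs whether or not writing failed
    if (err) {
        std::rethrow_exception(err);
    }
    return true;
}
```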
Now that snapshot orchestration in snapshot_on_all_shards
doesn't use snapshot_manager, get rid of the data structure.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Now that snapshot orchestration is done solely
in snapshot_on_all_shards, the per-shard
snapshot function can be deleted.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Call take_snapshot on each shard and collect the
returned snapshot_file_sets.
When all are done, move the vector<snapshot_file_set>
to finalize_snapshot.
All that without resorting to the snapshot_manager
or calling table::snapshot.
Both will be deleted in the following patches.
Fixes#11132
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Now that seal_snapshot doesn't need to look up
the snapshot_manager in pending_snapshots to
get to the file_sets, erasing the snapshot_manager
object can be done in table::snapshot, which
also inserted it there.
This will make it easier to get rid of it in a later patch.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
and pass it to seal_snapshot, so that the latter won't
need to look up and access the snapshot_manager object.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>