scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 17:40:34 +00:00

Author	SHA1	Message	Date
Avi Kivity	bceff1550c	tests: fix bad format string syntax Some sprint() calls use the fmt language instead of the printf syntax. Convert them all the way to format().	2018-11-01 13:16:17 +00:00
Botond Dénes	d467b518bc	multishard_mutation_query(): don't attempt to stop broken readers Currently, when stopping a reader fails, no attempt is made to save it, and it is left in the `_readers` array as-is. This can lead to an assertion failure, as the reader state will contain futures that were already waited upon and that the cleanup code will attempt to wait on again. To prevent this, when stopping a reader fails, reset it to the nonexistent state, so that the cleanup code doesn't attempt to do anything with it. Refs: #3830 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <a1afc1d3d74f196b772e6c218999c57c15ca05be.1539088164.git.bdenes@scylladb.com>	2018-10-09 15:59:50 +03:00
Botond Dénes	7d71b42651	multishard_mutation_query: add some tracing Add tracing for the following events: 1) Dismantling of the combined buffer. 2) Dismantling of the compaction state. 3) Cleaning up the readers. (1) and (2) can possibly have adverse effects on the performance of the query and hence it is important that details about the dismantled fragments are exposed in the tracing data. (3) is less critical but it is still good to know how many readers were created by the read (in case they aren't saved). Since normally (in stateful queries) this will always be 0, only trace this when it is non-zero (and is interesting).	2018-09-11 08:18:16 +03:00
Botond Dénes	b41be7c8e5	multishard_mutation_query: add comment to `read_context` Explain the purpose of the class and its intended usage and any gotchas the reader/modifier of the code has to keep in mind.	2018-09-11 08:18:16 +03:00
Botond Dénes	b6e1a8f32d	multishard_mutation_query: always cleanup readers properly Currently the reader cleanup code, which ensures the readers and their dependent objects are destroyed in the correct order and in a single smp::submit_to() message, is only run when the readers are attempted to be saved. However, proper cleanup is needed not only then, but also when the query is not stateful. Rename the current `cleanup()` method to `stop()`, make it public and call it from a `finally()` block after the page is finalized to ensure readers are properly cleaned up at all times. Also make sure that failures in `stop()` are never propagated so that a failure in the cleanup doesn't fail the read itself.	2018-09-11 08:18:16 +03:00
Botond Dénes	b3f1fe14e8	multishard_mutation_query: fix possible deadlock when creating a reader fails Failing to create a reader (`do_make_remote_reader()`) can lead to a deadlock if the reader is in any of the future_*_state states, as the `then()` block is not executed and hence the promise of the first future in the chain is not set. Avoid this by changing the `then()` to a `then_wrapped()` and using `set_exception()` and `set_value()` accordingly, such that the future is resolved on both the happy and error path.	2018-09-10 16:41:13 +03:00
Botond Dénes	253407bdc8	multishard_mutation_query: add badness counters Add badness counters that allow tracking problems. The following counters are added: 1) multishard_query_unpopped_fragments 2) multishard_query_unpopped_bytes 3) multishard_query_failed_reader_stops 4) multishard_query_failed_reader_saves The first pair of counters observe the amount of work range scan queries have to undo on each page. It is normal for these counters to be non-zero, however sudden spikes in their values can indicate problems. This undoing of work is needed for stateful range-scans to work. When stateful queries are enabled the `multishard_combining_reader` is dismantled and all unconsumed fragments in its and any of its intermediate reader's buffers are pushed back into the originating shard reader's buffer (via `unpop_mutation_fragment()`). This also includes the `partition_start`, the `static_row` (if there is one) and all extracted and active `range_tombstone` fragments. This together can amount to a substantial amount of fragments. (1) counts the amount of fragments moved back, while (2) counts the number of bytes. Monitoring size and quantity separately allows for detecting edge cases like moving many small fragments or just a few huge ones. The counters count the fragments/bytes moved back to readers located on the shard they belong to. The second pair of counters are added to detect any problems around saving readers. Since the failure to save a reader will not fail the read itself, it is necessary to add visibility to these failures by other means. (3) counts the number of times stopping a shard reader (waiting on pending read-aheads and next-partitions) failed while (4) counts the number of times inserting the reader into the `querier_cache` failed. Contrary to the first two counters, which will almost certainly never be zero, these latter two counters should always be zero. Any other value indicates problems in the respective shards/nodes.	2018-09-03 10:31:44 +03:00
Botond Dénes	97364c7ad9	database: add query_mutations_on_all_shards() This method allows for querying a range or ranges on all shards of the node. Under the hood it uses the multishard_combining_reader for executing the query. It supports paging and stateful queries (saving and reusing the readers between pages). All this is transparent to the client, who only needs to supply the same query::read_command::query_uuid through the pages of the query (and supply correct start positions on each page, that match the stop position of the last page).	2018-09-03 10:31:44 +03:00

8 Commits
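
The cleanup behaviour described in b6e1a8f32d, 253407bdc8 and d467b518bc (stop failures are counted rather than propagated, and a reader that failed to stop is reset to the nonexistent state) can be sketched in plain standard C++. This is only an illustration under stated assumptions, not the Scylla implementation: `reader`, `stop_readers` and the `std::optional` slot representation are hypothetical names chosen for the example, and the real code works with Seastar futures across shards.

```cpp
#include <cstddef>
#include <optional>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for a shard reader; stop() may fail.
struct reader {
    bool stop_fails = false;
    void stop() {
        if (stop_fails) {
            throw std::runtime_error("failed to stop reader");
        }
    }
};

// Stops every existing reader. A failure is counted, never propagated,
// and the failed slot is reset so that later cleanup skips it.
// Returns the number of failed stops (assumed analogue of a
// "failed reader stops" counter).
std::size_t stop_readers(std::vector<std::optional<reader>>& readers) {
    std::size_t failed_stops = 0;
    for (auto& slot : readers) {
        if (!slot) {
            continue; // no reader was created for this slot
        }
        try {
            slot->stop();
        } catch (...) {
            ++failed_stops;
            slot.reset(); // back to the "nonexistent" state
        }
    }
    return failed_stops;
}
```

Calling `stop_readers()` a second time on the same vector does not touch the reset slot again, which is the property the assertion-failure fix relies on.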