scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-23 01:50:35 +00:00

Author	SHA1	Message	Date
Nadav Har'El	f1aaa91e21	merge: add metrics Merged pull request https://github.com/scylladb/scylla/pull/6030 from Piotr Dulikowski: Adds CDC-related metrics. Following counters are added, both for total and failed operations: Total number of CDC operations that did/did not perform splitting, Total number of CDC operations that touched a particular mutation part. Total number of preimage selects. Fixes #6002. Tests: unit(dev, debug) * 'cdc-metrics' of github.com:piodul/scylla: storage_proxy: track CDC operations in LWT flow storage_proxy: track CDC operations in logged batches storage_proxy: track CDC operations in standard flow storage_proxy: add cdc tracker hooks to write response handlers storage_proxy: move "else if" remainder into "else" block cdc: create an operation_result_tracker object cdc: add an object for tracking progress of cdc mutations cdc: count touched mutation parts in transformer::transform cdc: track preimage selects in metrics cdc: register metric counters cdc: fix non-atomic updates in splitting	2020-03-23 21:55:58 +02:00
Piotr Dulikowski	5a5cc57878	cdc: create an operation_result_tracker object An `operation_result_tracker` object is now returned as a second return value from the `augment_mutation_call` function.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	1b92cbeabe	cdc: add an object for tracking progress of cdc mutations CDC metrics, apart from tracking "total" metrics for all performed CDC operations, also track metrics for "failed" operations. Because the result of the CDC operation depends on whether all CDC mutations were written successfully by storage_proxy, checking for failure and incrementing appropriate counters is deferred after all write response handlers finish. The `cdc::operation_result_tracker` object was created for that purpose. It contains all the details needed to accurately update the metrics based on what actually happened in the `augment_mutation_call` function, and holds a flag which tells if any of write response handlers failed. This object is supposed to be referenced by write response handlers for CDC mutations created after the same `augment_mutation_call`. After all write response handlers are destroyed, the destructor of `operation_result_tracker` will update appropriate metrics. Actual creating and attaching this object to write response handlers will be done in subsequent commits.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	98e5fdc7ac	cdc: count touched mutation parts in transformer::transform Modifies the transformer::transform so that it also returns a set of flags indicating what parts of the mutation (e.g. rows, tombstones, collections, etc.) were processed during transforming.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	53570d8657	cdc: track preimage selects in metrics This commit causes preimage select counter to be increased after performing this operation.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	e7062de02b	cdc: register metric counters This patch defines a CDC metrics object and registers all of its counters. storage_proxy is chosen as the owner of the metrics object. Because in subsequent commits it will become possible for CDC metrics to be updated after a write operation ends, and because the cdc_service has shorter lifetime than storage_proxy, we could risk a use-after-free if we placed this object inside cdc_service.	2020-03-23 14:05:25 +01:00
Piotr Dulikowski	338e473946	cdc: fix non-atomic updates in splitting This patch fixes a bug in mutation splitting logic of CDC. In the part that handles updates of non-atomic clustering columns, the column definition was fetched from a static column of the same id instead of the actual definition of the clustering column. It could cause the value to be written to a wrong column. Tests: unit(dev)	2020-03-23 13:47:23 +01:00
Piotr Dulikowski	a693e6ff6c	cdc: fix non-atomic updates in splitting This patch fixes a bug in mutation splitting logic of CDC. In the part that handles updates of non-atomic clustering columns, the schema for serializing that column was looked up incorrectly in the table schema - instead of a `regular_column`, a `static_column` was looked up. Due to how the `column_at` function works, a correct schema was always returned if the table had no static columns. Therefore, in order for this bug to manifest, a table with a static column and a regular column with non-atomic collection was needed.	2020-03-23 10:20:24 +01:00
Botond Dénes	e0284bb9ee	treewide: add missing headers and/or forward declarations	2020-03-23 09:29:45 +02:00
Piotr Dulikowski	3bfb044bf1	cdc: do not create cdc$deleted columns for pks and cks Primary key and clustering key column should not have a corresponding "cdc$deleted_<name>" column in cdc log table, because it does not make sense to delete such a column from a row. Fixes: #6049 Tests: unit(dev)	2020-03-21 07:33:23 +01:00
Piotr Dulikowski	59727fb34b	cdc: remove result_callback The `result_callback` was a callback returned by `augment_mutation_call` that was supposed to be used in the CDC postimage implementation. Because CDC postimage was implemented without using this callback, and currently a no-op function is always returned, this callback can safely be removed.	2020-03-19 14:55:07 +02:00
Nadav Har'El	35d95d6887	merge: Add postimage implementation Merged pull request https://github.com/scylladb/scylla/pull/5996 from Calle Wilund: Fixes #4992 Implements post-image support by synthesizing it from pre-image + delta. Post-image data differs from the delta data in two ways: 1.) It merges non-atomics into an actual result value 2.) It contains all columns of the row, not just those affected by the update. For a non-atomic field, the post-image value of a column is either the pre-image or the delta (maybe null) Tested by adding post-image checks to pre-image test and collection/udt tests	2020-03-16 13:42:07 +02:00
Calle Wilund	0a3383c090	cdc: Add postimage implementation Fixes #4992 Implements post-image support by synthesizing it from pre-image + delta. Post-image data differs from the delta data in two ways: 1.) It merges non-atomics into an actual result value 2.) It contains _all_ columns of the row, not just those affected by the update. For a non-atomic field, the post-image value of a column is either the pre-image or the delta (maybe null) Tested by adding post-image checks to pre-image test and collection/udt tests	2020-03-16 09:21:06 +00:00
Piotr Dulikowski	b1e8170bf9	cdc: add tracing Adds information about the stages of CDC mutation augmentation to tracing sessions.	2020-03-15 11:54:10 +01:00
Calle Wilund	5c743bfd53	cdc: rename inner "process_cells" to avoid confusion Two lambdas should not share name in same function.	2020-03-09 13:06:32 +00:00
Piotr Dulikowski	5f652e58c1	cdc: allow dropping manually created tables with cdc log suffix The is_log_for_some_table function incorrectly assumed that database::find_schema would return a null pointer in case the queried schema does not exist. This patch fixes that, and now this function checks for existence of the schema using database::has_schema. Tests: unit(dev)	2020-03-09 12:17:13 +01:00
Nadav Har'El	6febd4199e	merge: cdc: on row delete, show the whole row as preimage Merged pull request https://github.com/scylladb/scylla/pull/5980 by Piotr Jastrzębski, based on https://github.com/scylladb/scylla/pull/5976 by Juliusz Stasiewicz: "If base mutation has at least one row tombstone, its preimage log entry displays all the base columns." Fixes #5709 Tests: unit(dev)	2020-03-08 14:54:59 +02:00
Juliusz Stasiewicz	68071d35ce	cdc: on row delete display the entire row as preimage If base mutation has at least one row tombstone, its preimage log entry is constructed from all the base columns. Fixes #5709 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-08 12:11:07 +01:00
Piotr Dulikowski	0e413efb48	cdc: correct static row preimage for case with no clustering row In case a static and a clustering row is written at the same time, but a clustering row with given key was not present, the preimage query was incorrectly configured and no rows were returned. This resulted in an empty preimage, while a preimage for static row should be present. This patch fixes this and now the static row is correctly written to cdc log in the case above. Tests: unit(dev)	2020-03-08 09:25:45 +01:00
Juliusz Stasiewicz	e2b76fd559	cdc: move the extractor of `pirow` columns into separate method Because it will be used more than once.	2020-03-06 17:54:42 +01:00
Piotr Dulikowski	f317283578	cdc: disallow creating nested CDC logs This change disallows creating CDC log tables for already existing CDC log tables. CDC logs nested in that way are not really useful and do not work at the moment, therefore disallowing their creation prevents confusion.	2020-03-06 10:47:13 +01:00
Piotr Dulikowski	38b7f1ad45	unit tests: register cdc extension before tests In the following commits, using cdc in tests will require registering cdc extension explicitly in db config.	2020-03-05 16:11:20 +01:00
Piotr Dulikowski	0f4f48ef76	cdc: construct cdc_options directly inside cdc_extension Instead of storing a raw map of options inside `cdc_extension`, the extension now converts them into `cdc_options` directly on construction. This removes the need to construct `cdc_options` object multiple times.	2020-03-05 16:09:44 +01:00
Piotr Sarna	c35160457b	Merge 'Clean up stream_id representation' from Piotr With #5950 we changed the representation of stream_id in CDC Log from two int columns to a single blob column. This PR cleans up stream_id representation internally. Now stream_id is stored as blob both in-memory and in internal CDC tables. Tests: unit(dev) * hawk/stream_id_representation: cdc: store stream_ids as blobs in internal tables cdc: improve do_update_streams_description cdc: Fix generate_topology_description cdc: add stream_id::operator< cdc: change stream_id representation	2020-03-05 14:14:29 +01:00
Kamil Braun	3200d415da	cdc: use a single timeuuid value for a batch of changes If a batch update is performed with a sequence of changes with a single timestamp, they will now show up in CDC with a single timeuuid in the `time` column, distinguished by different `batch_seq_no` values. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 12:32:57 +01:00
Kamil Braun	292eba9da0	cdc: replace `split` with `for_each_change` `for_each_change` is like `split` but it doesn't return a vector of mutations representing each change; instead, it takes as a parameter a function which gets called on each mutation. This reduced the memory usage and allows to preserve common context when handling each change (will be useful in next commits). Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 12:05:08 +01:00
Piotr Jastrzebski	57cfe6d0e1	cdc: store stream_ids as blobs in internal tables In new CDC Log format stream_id is represented by a single blob column so it makes sense to store it in the same form everywhere - including internal CDC tables. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:22 +01:00
Piotr Jastrzebski	b2acdc9307	cdc: improve do_update_streams_description Use std::set::insert that takes range instead of looping through elements and adding them one by one. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:22 +01:00
Piotr Jastrzebski	446722d6ed	cdc: Fix generate_topology_description In new CDC Log format we store only a single stream_id column. This means generate_topology_description has to use appropriate schema for generating tokens for stream_ids. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:22 +01:00
Piotr Jastrzebski	9a212dcaef	cdc: add stream_id::operator< Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:21 +01:00
Piotr Jastrzebski	f317a659d9	cdc: change stream_id representation New CDC Log format stores stream ids as blobs. It makes sense to keep them internally in the same form. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:30:10 +01:00
Piotr Sarna	f21bd57058	Merge "cdc: log static rows correctly" from Piotr Currently, writes to a static row in a base table are not reflected at all in the corresponding cdc log. This patch causes such writes to be properly logged. Fixes: #5744 Tests: unit(dev) * piodul/5744-handle-static-row-correctly-in-cdc: cdc_test: add tests for handling static row cdc: fix indentation in transformer::transform cdc: handle static rows separately in transformer::transform cdc: move process_cells higher (and fix captured variables) cdc: reduce dependencies on captured variables in process_cells cdc: fix preimage query for static rows	2020-03-05 10:42:15 +01:00
Piotr Dulikowski	204e204586	cdc: do not attempt to log empty mutations It is possible to produce an empty mutation using CQL. For example, the following query: DELETE FROM ks.tbl WHERE pk = 0 AND ck < 1 AND ck > 2; will attempt to delete from an empty range of rows. This is translated to the following mutation: {ks.tbl {key: pk{000400000000}, token:-3485513579396041028} {mutation_partition: static: cont=1 {row: }, clustered: {}}} Such mutation does not contain any timestamp, therefore it is difficult to determine what timestamp was used while making the query. This is problematic for CDC, because an entry in CDC log should be written with the same timestamp as a part of the mutation. Because an empty mutation does not modify the table in any way, we can safely skip logging such mutations in CDC and still preserve the ability to reconstruct the current state of the base table from full CDC log. Tests: unit(dev)	2020-03-05 08:32:54 +01:00
Piotr Dulikowski	39519ce923	cdc: fix indentation in transformer::transform	2020-03-05 00:16:17 +01:00
Piotr Dulikowski	0d05b17881	cdc: handle static rows separately in transformer::transform Before this patch, `transform` did not generate any log rows about static row change. This commit fixes that - now, a log row is created if a static row is changed, and this row is separate from the rows that describe changes to the clustering rows.	2020-03-05 00:16:17 +01:00
Piotr Dulikowski	6a0b0b5786	cdc: move process_cells higher (and fix captured variables) The `process_cells` lambda is moved outside the loop, because it will be used by other code in subsequent commits.	2020-03-05 00:15:57 +01:00
Piotr Dulikowski	f136f6e02c	cdc: reduce dependencies on captured variables in process_cells This is a preparation for moving the lambda outside the for loop. - `log_ck`, `pikey`, `pirow` are now passed as arguments, - `value` is now a variable local to the lambda, - `ttl` is now a variable local to the lambda that is returned.	2020-03-05 00:14:05 +01:00
Piotr Dulikowski	a7f51449c3	cdc: fix preimage query for static rows For static rows, we need to fetch at least one row from its partition in order to compute its preimage.	2020-03-04 18:43:55 +01:00
Piotr Jastrzebski	354e3c34c8	cdc log: merge stream_id columns into a single column Previously we had stream_id_1 and stream_id_2 columns of type long each. They were forming a partition key. In a new format we want a single stream_id column that forms a partition key. To be able to still store two longs, the new column will have type blob and its value will be concatenated bytes of two longs that partition key is composed of. We still want partition key to logically be two longs because those two values will be used by a custom partitioner later once we implement it. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-04 13:27:48 +02:00
Kamil Braun	5c4a237c12	cdc: split the mutation before passing it into `transform` If the mutation contains separate logical changes (e.g. with different timestamps and/or ttls), it will be split into multiple mutations, each passed into transform.	2020-03-03 13:17:51 +01:00
Kamil Braun	9924e3aa34	cdc: reduce code duplication in augment_mutation_call Now there's only one call to `transform`.	2020-03-03 13:17:51 +01:00
Kamil Braun	24a32a13b5	cdc: retrieve preimage anytime there are static/clustered row updates Previously we wouldn't retrieve the preimage if the mutation contained something different than static/clustered row updates, e.g. if it contained a partition deletion. However, there are mutations created from batch statements which can contain both a partition deletion and a set of row updates with a later timestamp. We want to retrieve the preimage too in this case.	2020-03-03 13:17:51 +01:00
Kamil Braun	529d30ef66	cdc: add `split` function This function takes a mutation and returns a set of mutations, each representing a separate change with a single timestamp and ttl.	2020-03-03 13:17:51 +01:00
Kamil Braun	132ea89c32	cdc: add `extract_changes` function This commit introduces a bunch of new structs describing a change made to a table, and an `extract_changes` function which takes a mutation and returns the set of changes contained in this mutation, separated by timestamp and ttl.	2020-03-03 13:17:51 +01:00
Kamil Braun	b5c944370e	cdc: add `should_split` function The function checks if there are multiple timestamps and/or ttls inside a mutation, which means separate changes should be created for this mutation in CDC.	2020-03-03 13:17:50 +01:00
Calle Wilund	ed0d1c5fe2	cdc: Break up data column tuple According to "new" spec: Data column is now pure frozen original type. If column is deleted (set to null), a metadata column cdc$deleted_<name> is set to true, to distinguish null column == not involved in row operation For non-atomic collections, a cdc$deleted_elements_<name> column is added, and when removing elements from collection this is where they are shown. For non-atomic assign, the "cdc$deleted_<name>" is true, and <name> is set to new value. column_op removed.	2020-03-03 08:52:20 +00:00
Calle Wilund	1085860c62	cdc: Rename metadata and data columns according to new spec Also use transformation methods for names in all code + tests to make switching again easier	2020-03-02 09:34:51 +00:00
Juliusz Stasiewicz	cf24ae86f3	cdc: distinguishing update from insert When incoming mutation contains live row marker the `operation` is described as "insert", not as an "update". Also, I extended the test case "test_row_delete" with one insert, which is expected to log different value of `operation` than update or delete. Renamed the test case accordingly. Test cases that relied on "update" being the same as "insert" are updated accordingly (`test_pre_image_logging`, `test_cdc_across_shards`, `test_add_columns`). Fixes #5723	2020-03-01 17:50:08 +02:00
Juliusz Stasiewicz	836183b847	cdc: fix `operation` value for row deletes Column `operation` now contains `operation::row_delete` (== 2) after queries like `delete from tbl where pk=x AND ck=y;`. Before this patch row deletes were treated as updates, which was incorrect because updates do not contain row tombstones (and row deletes do). Refs #5709	2020-02-26 11:58:50 +01:00
Calle Wilund	a3a764fd10	cdc: Handle non-atomic columns Fixes #5669 This implements non-atomic collection and UDT handling for both cdc preimage + delta. To be able to express deltas in a meaningful way (and reconstruct using it), non-atomic values are represented somewhat differently from regular values: * maps - stored as is (frozen) * sets - stored as is (frozen) * lists - stored as map<timeuuid, value> (frozen) this allows reconstructing the list, as otherwise things like list[0] = value cannot be represented in a meaningful way * udt - stored as tuple<tuple<field0>, tuple<field1>...> (frozen) UDTs are normally just tuples + metadata, but we need to distinguish the case of outer tuple element == null, meaning "no info/does not partake in mutation" from tuple element being a tuple(null) (i.e. empty tuple), meaning "set field to null"	2020-02-25 19:34:54 +02:00

1 2 3

104 Commits